

# PROJECT WHIRLWIND

Contract N5ori60

R-1777

A METHOD OF TEST CHECKING
AN ELECTRONIC DIGITAL COMPUTER

MARCH 15, 1950

SERVOMECHANISMS LABORATORY
MASSACHUSETTS INSTITUTE OF TECHNOLOGY

**Copy** 37



Report R-177

A METHOD OF TEST CHECKING AN ELECTRONIC DIGITAL COMPUTER

Submitted to the OFFICE OF NAVAL RESEARCH Under Contract N50r160 Project NR-048-097

Report by Gerald Cooper

SERVOMECHANISMS LABORATORY
MASSACHUSETTS INSTITUTE OF TECHNOLOGY
Cambridge 39, Massachusetts
Project DIC 6345

March 15, 1950 (Thesis Date: January 14, 1950)

#### FOREWORD

Because it presents information of lasting value, this thesis report, which has had only very limited distribution, is being issued as a Project Whirlwind R-series report.

Reliability of operation is one of the most important goals of the research being done on the large-scale electronic digital computer. Part of this general problem is the development of methods of error detection and trouble location. A very useful technique adopted by Project Whirlwind is the provision of facilities for the variation of d-c supply voltages to various parts of the computer. By this means, dynamic and incipient failures can be converted into steady-state failures, and deteriorating components removed before they cause errors.

This so-called marginal checking makes use of specially designed check problems. A method of designing such problems for use with the Whirlwind I computer is developed in this thesis.

# TABLE OF CONTENTS

| FORE  | NORD  |        |           |       |        |       |     |      |     |     |     |    |   |   |   |   | 2   |
|-------|-------|--------|-----------|-------|--------|-------|-----|------|-----|-----|-----|----|---|---|---|---|-----|
| ABSTI | RACT  |        | G & 4 & 2 |       |        |       |     |      |     | u   |     |    |   |   |   | ٠ | 7   |
| ACKNO | OMIED | GMENT  |           |       |        |       | •   |      |     |     |     |    |   |   | v |   | 8   |
| ORGAI | NIZAT | ion .  |           |       |        |       |     |      |     |     |     | ·  |   | 6 |   | 0 | 9   |
| 1     | INTR  | ODUCTI | ON        |       |        |       | ٠   |      |     |     |     |    | , |   |   |   | 10  |
|       | 1.1   | The A  | utomatic  | Die   | gital  | Comp  | ite | r.   |     |     | 0   | 0  |   |   |   |   | 10  |
|       | 1.2   | Relia  | bility    | 5 4   |        |       |     | a a  |     | •   |     |    |   | • |   |   | 11  |
|       | 1.3   | Princ  | iples of  | Che   | ecking | : .   |     |      |     |     |     |    |   | A |   |   | 15  |
|       |       | 1.31   | Built-    | în Cl | neckin | g.    | •   |      |     |     |     | o  | 4 |   |   |   | 16  |
|       |       | 1.32   | Program   | nmed. | Check  | ing   | ٥   | 0 0  | o   | •   | ,   |    | • | 8 | , | , | 17  |
|       | 1.4   | Scope  | of this   | The   | esis . |       |     |      | .0  |     |     |    | 6 |   | 0 |   | 50  |
| 2     | DESCI | RIPTIO | n of whi  | (RLW) | IND I  | •     |     |      |     |     |     |    | 3 |   |   | • | 21  |
|       | 2.1   | Machin | ae Unite  |       |        |       |     |      | d   |     |     | D  | , | D |   |   | 21  |
|       |       | 2.11   | The Bas   | ic I  | eatur  | 88 .  | •   |      |     | ۰   |     |    | 3 |   |   |   | 22  |
|       |       |        | 2.111     | The   | Flip-  | Flop  |     |      |     |     |     |    |   |   |   | ٥ | 22  |
|       |       |        | 2.112     | The   | Gate ! | Tube  |     |      |     |     |     |    |   |   |   |   | 23  |
|       |       |        | 2.113     | Resi  | torati | on.   |     |      |     |     | 4   |    | 0 | ٠ |   |   | 24. |
|       |       |        | 2.114     | The   | Elect  | ronic | S   | wit  | ch  |     |     |    | ٥ |   |   | 0 | 25  |
|       |       | 2.12   | Simple    | Comi  | oinati | ons o | of  | Bas  | ic  | St  | ae  | 99 | 1 |   |   |   | 25  |
|       |       |        | 2.121     | The   | Binar  | y Fl  | lp- | Flo  | p I | leg | gis | te | r | 0 |   |   | 25  |
|       |       |        | 2.122     | The   | Shift  | ing 1 | Reg | ist  | er  |     |     |    | a |   |   |   | 26  |
|       |       |        | 2.123     | The   | Binar  | y Add | ler |      |     |     |     |    |   | 0 |   | 0 | 27  |
|       |       |        | 2.124     | The   | Binar  | y Cou | ınt | er   |     |     |     |    |   | D |   |   | 28  |
|       |       |        | 2.125     | The   | Time-  | Pulse | e D | ist: | rib | ut  | or  |    |   | 0 |   |   | 29  |

|   |      | 2.13   | The Sys  | ten . |        | 0 0   | 0    |     | 9 6   |    | •   |   |   | ٥ | 0  | ,  | ٠ | . 29 |
|---|------|--------|----------|-------|--------|-------|------|-----|-------|----|-----|---|---|---|----|----|---|------|
|   |      |        | 2.131    | Centr | al Co  | ntr   | 1    | 0   |       |    |     | a |   | 0 | •  |    | 0 | . 30 |
|   |      |        | 2.132    | Test  | Stor   | ::-0. | •    | •   | • •   |    |     | c |   |   | 0  | 13 | • | . 31 |
|   |      |        | 2.133    | Arith | me tic | E1    | am € | nt  | 0 (   |    | a   |   | • |   | ۰  |    |   | .32  |
|   |      |        | 2.134    | In-ut | -Cut-  | ut I  | C1.e | me  | nt    |    |     | 0 | • | • | 9  | 9  | • | .33  |
|   |      |        | 2.135    | Pu118 | -In (  | Check | cin  | g   |       |    | •   |   | 9 |   | 0  | 5  |   | . 34 |
|   |      |        | 2.136    | Test  | Conts  | rol   | 6    |     | 9 (   |    |     |   |   |   |    |    |   | . 34 |
|   |      |        | 2.137    | Plect | rosts  | atic  | 98   | or  | a-ere |    |     | 0 |   |   | ٥  |    | 0 | . 35 |
|   | 5.5  | Machi  | ne Cycle |       | 4 9.   |       |      | •   |       |    |     |   | 0 | 0 | •  | 6  |   | .35  |
|   |      | 2.21   | Frogram  | Timi  | ng .   | . :   | 9    |     |       |    |     |   | • | • |    | •  | 9 | .36  |
|   |      | 2.22   | Cpereti  | on Ti | ring   | 0 6   | 0    |     | 0 6   |    | 6   | 0 |   |   |    |    | ٥ | .37  |
| \ | 2.3  | Progr  | emring a | nd Co | ding   |       |      |     | 0 4   |    |     | ٥ |   | • |    |    |   | .38  |
|   |      | 2.31   | Nomencl  | ature |        |       |      |     |       |    | •   |   |   | 0 |    |    | ۰ | .38  |
|   |      | 2.32   | The Ope  | ratio | ns .   |       | •    |     | ٥ .   | ۰  |     | 0 |   |   | .0 | •  |   | . 39 |
| 3 | DISC | recton | OF FAIL  | HDFC. |        |       |      |     |       |    |     |   |   |   |    |    |   | lin  |
|   |      |        | ificatio |       |        |       |      |     |       |    |     |   |   |   |    |    |   |      |
|   | 202  |        | Steady-S |       |        |       |      |     |       |    |     |   |   |   |    |    |   |      |
|   |      |        | Dynamic  |       |        |       |      |     |       |    |     |   |   |   |    |    |   |      |
|   |      |        | Incipie  |       |        |       |      |     |       |    |     |   |   |   |    |    |   |      |
|   |      |        |          |       |        |       |      |     |       |    |     |   |   |   |    |    |   |      |
|   |      |        | Intermi  |       |        |       |      |     |       |    |     |   |   |   |    |    |   | 17   |
|   |      |        | Febrodu  |       |        |       |      |     |       |    |     |   |   |   |    |    |   |      |
|   |      |        | Random   |       |        |       |      |     |       |    |     |   |   |   |    |    |   |      |
|   | 3.2  |        | ing Cons |       |        |       |      |     |       |    |     |   |   |   |    |    |   |      |
|   |      |        | Influen  |       |        |       |      |     |       |    |     |   |   |   |    |    |   |      |
|   |      | 3.55   | Operat1  | onal  | rifed  | ta c  | 2    | Fa. | ilu   | TO | B . | 0 | 0 |   |    |    |   | .45  |

|   |      |        | 3.22  | 21   | Ampl        | 111      | ere          |       |     | 9     |              | et i |       | 4   | 9            |      | 0   | 0   | 60 |   |   | .46  |
|---|------|--------|-------|------|-------------|----------|--------------|-------|-----|-------|--------------|------|-------|-----|--------------|------|-----|-----|----|---|---|------|
|   |      |        | 3.22  | 303  | Gate        | Tul      | be .         |       |     |       |              |      |       |     | ò            |      | 6   | ý   |    | 3 |   | .47  |
|   |      |        | 3.20  | 23   | Fisp        | -F10     | age          |       | •   |       |              |      | p 18  |     | a            |      | 0   |     |    | 9 |   | . 48 |
|   |      |        | 3.22  | oli  | Port        | ore:     | r Pu         | 180   | 8   |       |              | а    |       |     | 3            |      | ۰   | ,   | 9  |   | • | . 49 |
|   | 3.3  | Margi  | nal ( | Chec | king        | n<br>1 0 |              | 0     |     |       |              | 6    |       |     | 9            | •    |     | à   |    |   | a | . 50 |
|   | 3.4  | Sura   | ry .  |      |             |          |              |       | •   |       |              |      |       |     |              |      |     | 0   | 9  | • |   | . 53 |
| ħ | Pulk | CTPL S | OF T  | resi | <b>9</b> 50 | UER      | מ אם         | esi   | GN  |       |              |      |       |     | 4            |      | 0   | 6   | ٠  |   |   | . 54 |
| * | 4.1  | Object |       |      |             |          |              |       |     |       |              |      |       |     |              |      |     |     |    |   |   |      |
|   | 4.2  | Histor |       |      |             |          |              |       |     |       |              |      |       |     |              |      |     |     |    |   |   |      |
|   | 4.3  | Propo  | sed I | roc  | edur        | · •      |              | ٠,    |     |       |              |      |       |     |              | ø    |     |     | •  |   |   | .56  |
|   |      | 4.31   | Subo  | 1171 | sion        | of       | the          | Co    | m i | uŧ    | er           | •    | . 0   | ۰   |              |      | •   | 9   | •  |   | ۰ | . 56 |
|   |      | 4.32   | The   | Che  | r.e1        | Con      | ocep         | 8     |     |       |              |      |       |     | •            |      | ,   | ٥   |    |   |   | . 57 |
|   |      | 4.33   | Use   | 02   | the         | Chai     | nnel         | Co    | nc  | ep    | *            | 0    |       |     |              | 0    | •   | 8   |    |   | • | .58  |
|   |      | 4.34   | Mult  | tipl | e Fa        | ult      |              |       |     |       |              |      |       |     |              | 0    |     | 9   |    |   |   | . 59 |
|   |      | 4.35   | Troi  | ible | Loc         | atio     | on .         |       |     |       | •            |      |       |     | •            |      | •   | •   |    | • |   | . 61 |
| 5 | EXAM | PLES O | F DE  | SION | FFO         | 0700     | JRE          |       |     |       |              |      |       |     |              |      |     | 6   |    |   |   | . 63 |
|   | 5.1  | Single | e Far | 11.6 | Sequ        | enc      | 88.          | ۵     |     |       |              |      | b 0   | ٥   | 0            |      | •   | ٥   |    |   |   | . 63 |
|   |      | 5.11   | Arit  | time | tic         | Conf     | trol         | Fl    | ip  | an E. | 1.0          | ps   |       |     | 0            | 9    | ę   | a   |    |   |   | . 63 |
|   |      |        | 5.11  | 11   | Chan        | nel      | Seg          | men   | 8   | 1     | -            | 31,  | m     | Co  | n <b>t</b> 1 | rol  |     | 0   |    |   |   | . 64 |
|   |      |        | 5.11  | rs   | Chan        | nel      | Seg          | men   | \$  | 2     | 400          | Mu   | 1 1 1 | rl; | y            | 0    | •   |     |    |   |   | . 65 |
|   |      |        | 5.11  | 13   | Chan        | mel      | 50g          | De n  | ts  | 3     | A            | an   | 1 3   | B.  |              | 51-4 | 121 | 2 . |    |   | ٥ | . 67 |
|   |      |        | 5.11  | 14   | Chan        | nel      | Seg          | men   | t   | 4     | -            | D4.  | rić   | le  |              | •    |     | •   |    |   |   | . 68 |
|   |      |        | 5.13  | 15   | Chan        | nel      | Seg          | man   | \$  | 5     | <b>(80</b> ) | Sp   | ec 1  | al  | -A           | h    |     |     |    |   |   | . 73 |
|   |      |        | 5.12  | 16   | Chan        | nel      | Seg          | m r m | ŧ   | 6     | 40           | Po   | int   | -0  | r            |      |     | 0   | 9  |   |   | .72  |
|   |      |        | 5.13  | 17   | The         |          | plet<br>trol |       |     |       |              |      |       |     |              |      |     |     |    |   | • | .72  |
|   |      | 5.12   | Inpu  | 15-( | ut u        | e E      | l em e       | nt    |     |       | •            |      |       |     |              |      |     | ٠   | 6  |   |   | . 72 |

| 5.121 In-Out Register                             | 74   |
|---------------------------------------------------|------|
| 5.122 In-Cut Control Flip-Flops                   | 77   |
| 5.13 Step Counter                                 | 79   |
| 5.131 Step Counter Flip-Flops                     | 81   |
| 5.132 Step Counter Cate-Tubes                     | 8H   |
| 5.2 Fultiple Faults - The Program Counter         | 85   |
| 5.3 Trouble Location - Control-Pulse Out-ut Units | 91   |
| 6 RESULTS AND CONCLUSIONS                         | 95   |
| 6.1 Experimental Results                          | 95   |
| 6.11 Discussion of Low Margin on Line 113         | 97   |
| 6.12 Discussion of Low Fargin on Line 79          | 98   |
| 6.2 Evaluation of the Design Procedure            | 99   |
| 6.3 Surgestions for Further Work                  | 101  |
| AFFENDIX I - A Short Guide to Coding              | 102  |
| AI SEDIX II - Details of Goded Programs           | 105  |
| ATTENDIX III - Glorsery                           | 110  |
| SIFLIOCRAPHY                                      | 111  |
| LIST OF ILLUSTRATIONS                             | .113 |
| ILLUSTRATIONS                                     | 114  |

#### ABSTRACT

This thesis study describes a method of designing check problems to be used in detecting errors and locating trouble in a large-scale digital computer. The computer is systematically subdivided into small sections. A study is made to determine the paths (channel segments) in these sections along which information is routed, as well as the sources, destinations, and methods of transmission of the information.

One of these methods is selected for the routing of each type of information from one of the sources to one of the destinations of each channel segment. The destination is then examined to see whether it received the proper information.

This process, used in conjunction with the Project Whirlwind method of marginal checking, permits a sequence of instructions
to the computer to be specified. Limited tests in the Whirlwind I
computer indicate that test sequences designed by this method will
be quite useful.

#### ACKNOWLEDGEMENT

The author wishes to express his sincere thanks to Mr. N. H.

Taylor for supervising this thesis and making many helpful suggestions for the conduct of the work and the preparation of this report. He is also indebted to Mr. J. W. Forrester for making available the facilities of Project Whirlwind for the treparation of this thesis. Thanks are also due to the personnel of Project Whirlwind with whom the author had several stimulating and helpful discussions, particularly to Mr.

R. R. Werett, Mr. C. W. Adams, Mr. J. M. Salzer, and Mr. R. P. Mayer.

#### ORGANIZATION

This thesis deals with a very specialized to ild which requires considerable background material for its complete treatment. The first three chapters provide most of this material while develoring some of the bases for the method presented. In detail, the material is arranged as follows:

Chapter 1 presents the problem of checking digital commute a and indicates some possible solutions.

Chapter 2 is a fairly detailed description of the operation of Wil.

Charter 3 discusses the nature of failures and their influence on checking procedures.

Chapter 4 presents a systematic approach to designing special problems to check the operation of the computer.

Chapter 5 gives a number of examiles of the applications of the approach to checking WVI.

Chapter 6 discusses and evaluates the results of the thesis.

Appendix I is a short guide to coding prepared by the Electronic Commuter Division of the Servomechanisms Laboratory, N.I.T.

Appendix II contains the coded programs which have been developed by the method described in this thesis.

Appendix III contains a glossory of special terms used in this thesis.

#### 1 - INTRODUCTION

### 1.1 The Automatic Digital Computer

numerical problems. It is capable of performing entirely automatically any desired sequence of arithmetic operations on given data. This is in distinction to analogue commutation where the data is handled as physical quantities in a system obeying rhysical laws having the same form as the operations involved (see Fig. 1). It also has the ability to choose among several alternate computing routines on the basis of its own calculations. A wide variety of problems can be solved through the use of the automatic di ital corruter because of the great flexibility implied in these properties. The high calculating speed which is characteristic of the automatic digital consuter, makes it varticularly valuable because of the great number of problems whose solution by more customery means, though possible in principle, is impossible in a practical sense because of the prohibitive amount of time and labor involved.

As one might suspect, a rather complicated device is required to accomplish these results. For example, Whirlwind I (WWI), the electronic digital computer under development at the F.I.T. Servomechanisms Laboratory, consists of approximately 4500 vacuum tubes plus associated components such as resistors, capacitors, inductors, and crystal rectifier diodes. Good computer design requires that each individual component be operated under conditions that ensure as long a life as possible. Nevertheless, each

Superscripts refer to the numbered bibliography.

component will eventually fail to operate satisfactorily. When such a failure occurs, it is quite likely, though by no means certain, that an error will appear in the computed results. Such errors, if they remain undetected, may cause considerable damage. There is no longer any doubt that computers can be made to work; the tables of functions calculated at the Harvard Computation Laboratory have amply demonstrated this fact. The ouestion is: how consistently will they provide correct results? Reliable results must be obtained if the latent value of the automatic digital conputer is to be realized in practice.

#### 1.2 Peliability

but they are also inevitable. This is not just a reculiarity arising from the use of the autoratic digital computer; other methods of problem solution are also subject to error. What level of error occurrence can be telerated? Pefore this question can be answered quantitatively, a considerable amount of statistical data must be accumulated. It will be difficult to interpret such data on a comparative basis because of the speed differences among various methods. The amount of computation which can be done during the periodetween errors must be taken into consideration in some suitable manner. Thus far, only qualitative reasoning has been applied to this problem. Rowever, one error per week has been suggested by a leading authority as a reasonable goal for a computer of the same nature as Whirlwind I.

This statement was rade by J. W. Forrester during a discussion of his talk -t the Harvard Symposium on Large-Scale Digital Calculating Machinery on September 13, 1949. The proceedings of this symposium have not yet been published.

The fact that errors will occur does not mean that we must be resigned to accept erroneous results. There are many methods for checking on the validity of calculated results (see section 1.3 and references 6 and 9). Then it is known that a particular result is in error, it is usually possible to remove the source of the error and then proceed to calculate the correct value. Thus, by using these methods, the effective reliability of the computer can be increased tremendously, although the speed is reduced somewhat. Of course, there are a number of applications in which time is an important factor so that there is no time available for a second calculation under error-free conditions. Such applications will require very special techniques to achieve satisfactory reliability, but error detection is essential even then.

errors, for it is certainly conceivable that two failures can occur simultaneously in such a way as to multify the method being used. This is not to say that there is no method for detecting errors caused by two compensating, simultaneous failures. It is always possible, at least in principle, to devise a scheme which will detect any specified combination of faults, but in the most general case, additional equipment would be required.

This additional equipment would give rise to new combinations of faults which could not be detected without further resort to extra equipment.

For a given computer, there will always be some combinations of faults which cannot be detected without changing the computer. It becomes necessary to decide upon an adequate degree of error detection. Here the economics of the situation is an important factor. Since any desired level

of checking can be provided at the cost of either soney or time, some balance must be achieved between conflicting interests. It seems to be quite generally agreed that the probability of an undetected error cocurring in a machine which is checked in all cases of single failures is sufficiently low. However, any method which provides a check in many cases of multiple faults at little extra cost should be carefully considered.

As mentioned previously the time required to detect an error by a given method is frequently of vital importance in applications where time. is a factor. These may be exemplified by the problem of directing the flow of air traffic in accordance with the specified schedules and the actual positions and velocities of the aircraft involved. 13 The computer is used. to correlate this data and surrly the appropriate instructions to the pilots of the sircraft regarding the flight pattern to be maintained. When these instructions are in error, the pilots and the operator of the control tower should be informed immediately so that arroppriate emergency measures may be taken. The importance of instantaneous error detection is quite evident in this cass. Indeed we may even look askance at the fact that it is possible for some errors to go undetected. However, we must remember that it is quite possible for the sircraft itself to fail. This is a very special case and the accuracy requirements imposed are far greater than would be encountered in rost ordinary arclications. Even where the danger to human life is not so immediate, speed in error detection is quite important. A considerable amount of erroneous commutation might be accumulated before an error is detected. This will be wasteful, but it is again difficult to pass other than a qualitative judgment on the value of speed in error detection. In general, speed is actieved at the cost of equipment

and additional equipment is another source of trouble, aside from the monetary aspects involved. It may be said, with little fear of contradiction, that it is preferable to save equipment at the cost of time, all other factors being equal.

Once a failure is known to exist, the next step is to repair it. Pefore any repairs can be carried out, the faulty component must be located. Conventional methods of trouble location in small pieces of equipment are completely inadequate for a job of this ramnitude. Special autoratic methods must be devised to isolate the faulty component. It is not accesary to locate the specific component by these automatic means, but the fault should be isolated to a unit in which conventional techniques may be employed in a reasonably short time. A computer which makes one arror a week cannot be said to be very reliable if it takes a week to locate its source. In that time, another error may have occurred and another week of troable location is in prospect. Under such conditions, all of the operating time of the computer would be taken up in trouble location. The useful outrut would be practically mil. It does not seem unreasonable to expect that the computer perform useful work during at least half the time it is operating. Indeed, this would be a rather poor corputer. On the other hand, it takes a finite arount of time to make the actual repairs. Experience with Whirlwind I indicates that this time is of the order of magnitude of 15 or 20 minutes, on the average. It is pointless to go to great lengths to develor an automatic trouble location scheme which isolates the fault within a few seconds. 20 A rerfectly satisfactory method would be one which located the faulty component in less than 10 minutes. This would mean that approximately an hour would be required

Less than 1% of the operating time of the computer (assuming one error per week) would be lost for this reason.

the detection of errors in the operation of the entire computer. When an error is detected by such a method, the faulty component is known to be somewhere in the computer. Thus a measure of trouble location has been achieved, even though it is of dubious value. When an error is indicated by a method devised to check a relatively small unit, the faulty component is known to exist in that unit. A large measure of trouble loc: tion has been achieved. There is no difference between checking and trouble loc: tion. The distinction is one of degree rather than kind.

## 1.3 Principles of Checking

methods is that of providing a suitable standard to compare with the results obtained from the unit being checked. The choice of the size of the unit to be checked depends on the degree of trouble location that is desired. It is clear that if no fault exists in any of the small units which comprise the computer, then no fault exists in the entire computer. Thus, if a scheme is devised to check every small unit of the computer, including all their interconnections, then this cheme checks the entire computer. The various methods which have been suggested thus far differ in the flexibility with which they may be applied units of varying size.

There are essentially two different amroaches to the roblem of generating a commercian standard: built-in checking and programmed

checking schemes. In programmed checking, the computer treats the calculation of this standard as another problem which it can solve. Built-in checking permits higher computing speeds but requires more equipment than program ed checking. More than one type of checking is frequently used in a computer. The amount of each type is dictated by economic considerations.

## 1.31 Built-In Checking

an identical one to duplicate its function. The results obtained by the duplicate unit are a standard with which to compare those of the original unit. A special unit is provided to make this comparison. If an error is detected, it may have been caused by a failure in either unit. It is also possible (though improbable) for the same failure to occur in both units simultaneously, giving a correct check in the presence of an error. This has been discussed in section 1.2 and is not a serious objection. It is, of course, possible to provide three or more identical units to guard against such double failures. However, none of the existing or projected computers has incorporated more than two sets of identical equipment because of the cost involved. Checks of this type are known as multiple checks. Sultiple checks are generally used for large units; the number of comparison units required when small units are involved is felt to be excessive.

A more efficient use of checking equipment is achieved by a comvarison of some identifying characteristic (or tag) of the actual and

expected results rather than the results themselves. Such checks are called tag checks; "casting out nines" is a tag check used in decimal multiplication. In some very special cases, certain characteristics of the result of an operation may be independent of the numbers involved. It is frequently advantageous to provide a check which takes cognizance of the situation. An example of such a predetermined check is given in section 5.12.

A more detailed discussion of built-in checking may be found in reference 6.

#### 1.32 Program ed Checking

The process by which a problem is stated in a form suitable for solution by a computer is called programing. Thus, those checking methods which treat the generation of a comparison standard as a problem to be solved by the computer are referred to as programmed checking. There are two approaches possible to programmed checking: mathematical checks and test checking.

results of a calculation. Checking procedures based on these identities are applicable to all corruters for they are essentially another group of problems which are within the problem-solving capabilities of digital computers. There are certain types of identities which are encountered so freque thy that names have been assigned to the checking procedures which are based upon their use. However, there is undoubtedly a group of miscellaneous relationships which hold in special cases that cannot be fitted into such a classification. These miscellaneous techniques:

depend largely on the alertness and in-equity of the programmer and cannot be discussed here.

the solution of a problem is repeated a specified number of times and both (or more) sets of results are corpared. In identity checking, two (or more) equivalent mathematical procedures are used to obtain two (or more) sets of results for comparison. With some problems, a rather good check may be obtained by reversing the process of problem solution and finding the given initial data from the alleged solution, yielding what is known as inverse checking. In sample checking, the solution to a general class of problems is obtained for a special case in which the answer is known; the correct solution indicates that the computer is probably capable of solving this general class of problems. It is possible to examine the solutions of problems which are known to lead to continuous functions for continuity, thus giving rise to smoothness checking.

on the whole, mathematical checks are applicable to large groups of units and hence are not particularly useful in trouble location. They wary in effectiveness with the tyres of failures that exist, but they are quite useful for an overall check on a specific problem. It is not necessary to know that the whole comuter is working to be reasonably sure that the results are correct. Mathematical checks are discussed more fully in reference 6.

In test checking, the results obtained from the solution of specially designed problems are corpored with the known answers. Techniques for designing such problems have not yet been fully developed.

Most of the work in this field has been of an exploratory nature. 2,3,4,5,5,2,21.

The several amproaches which have been suggested are applicable only under very restricted conditions. The most severe shortcoming of some of these is that they make use of single functions in sequences which cannot be performed by the commuter. Each operation that the commuter performs consists of a predetermined sequence of such functions; the programmer may choose the sequence of operations arbitrarily, but he cannot do so with the functions. Some of the amproaches which do not suffer from this chortcoming may be quite useful in specific cases, but they lack generality. This thesis represents an attempt to develor a systematic amproach (for use with WWI) which is generally applicable to the design of such problems. The resulting problems will be referred to as test sequences.

endingenuity are valuable assets. Systematic ar reaches to programming have been proposed which are quite good, but a completely satisfactory method has not yet been found. 14,16,22,23 In many cases, much more efficient programs may be obtained more easily by rethods other than a literal application of the general method. This would seem to imply that a systematic method of test checking is not necessarily the best method in a specific case. Indeed, this thesis contains examples in which departures from the general method have been made in the interests of simplicity and efficiency. In short, the proposed method is only to be used where simpler methods are not immediately obvious.

## 1.4 Organization of This Thesis

is required to thoroughly exclain many of the examples of this thesis.

The paper would become quite unwieldy if all this detail were included.

Much of the information found in some of the references has therefore been omitted. 12,16,19 However, in order to make this paper reasonably self-contained, a considerable amount of material has been included. The procedure followed in this respect has been to give fairly detailed explanations of the basic elements, followed by examples to illustrate the manner in which more complex details can be explained in terms of the basic elements. The same procedure has been followed in explaining the examples of that sequence design.

Froject Whirlwind are defined in reference 15. Every effort has been made to define such terms when they are first used. As a matter of convenience, additional special terms have been adopted for some of the concepts arising in checking. These are defined when they are introduced and have been collected for easy reference in Arpendix III. Some, but not all, of these terms have come into standard usage at Project Whirlwind and elsewhere.

#### 2.1 Machine Units

Whirlwind I (and, indeed, every computer) may be thought of as being divided into four main functional units: the memory (or storage), the control, the arithmetic element, and the input-output device (see Fig. 2). The memory stores the numerical data required for the solution of problems, as well as the instructions (coded into numerical form) which govern the actual steps of problem solution. The control interprets the instructions and provides the machine with the signals needed to perform them. The arithmetic element actually performs certain elementary arithmetic operations (addition, subtraction, shifting, multiplication, and division) whenever the control supplies the appropriate command signals. The input-output device is the link between the machine and the outside world; it provides the machine with raw data and receives the computed results. It is frequently used to supplement the remory by connecting a large-capacity external memory (having a slower operating speed) to the machine.

Two different devices are used for the internal memory of WWI.

These are the flip-flop and the electrostatic storage tube. The flipflop is discussed in section 2.111. The dectrostatic storage tube is
a cathode-ray type tube in which the numbers are stored as charged spots
on a dielectric surface. The final model of this tube is expected to
have a capacity equivalent to 1024 flip-flops, thus saving a considerable
amount of space. Its speed of operation is somewhat slower than that of
the flip-flop, but it is faster than magnetic tape or drum memories. The

control contrins a source of pulses which are routed to the rest of the computer in accordance with the settings of electronic switches. Flip-flops are used to manipulate the numbers in the arithmetic element. A great diversity of external equivment may be connected to the computer by means of the input-output element. Photographic film is expected to be the main external storage medium, but it will be possible to use provided so that analogue information may also be headled.

#### 2.11 The Pasic Features

With few exceptions, three basic stages are used throughout the con uter; the flip-flop, the gate tube, and the electronic switch. Of course, amplifiers are provided where needed to obtain appropriate signal or impedance levels. The flip-flop is essentially a memory device. The gate tube is a sensing device, usually employed to sense the contents of a flip-flop. The electronic switch is what its name implies a multiposition switch which operates at high speeds.

#### 2.111 The Flir-Flop

tubes so interconnected that, at any time, one is fully conducting and the other is completely cut off (see Fig. 3). The plate potential of the cut off tube is "high" (nearly equal to the plate supply voltage), while that of the conducting tube is "low" (considerably less than the plate supply voltage, the difference being the drop in the plate load resistor). If tube A (see Fig. 3) is cut off, the potential at the O output is high, that at the loutrut is low, and we say that the flip-flop contains O,

or is chared. If tube I is out off, the O outcut is low, the 1 output is high, and we say that the flip-flop contains 1, or is set. By applying a positive palse at an appropriate point in the circuit (the trigger in ut), the flip-flop may be nade to switch its state, i.e., if it formerly contained 0, it will now centain 1, or if it formerly contained 1, it will now contain 0. This action is referred to as triggering or complementing. There are two additional inputs which require negative rulses. On is the clear input, which puts the flip-flor in the clear condition regardless of its condition before the pulse was applied. The other is the set input, which outs the flip-flop in the set condition regardless of its previous centent. The flip-flop is eften represented symbolically by a rectangle having three input lines and two output lines, as is also shown in Fig. 3, and is abbreviated FF. Sach TF in WWI is designated by a system number.

## 2.112 The Gate Tube

The gate tube is a coincidence device: it provides an output
if, and only if, two signals occur simultaneously. Its basic element
is usually a pentode, with one signal being applied to the control grid,
the other, to the suppressor grid (see Fig. 4). These two grids are biased
sufficiently to maintain the tube cut off even when full signal is applied
to one (but only one) of them. As mentioned above, its principal use is
in sensing the content of a flir-flop. If it is desired to know if the
flir-flor contains 1, we marely connect the 1 output to the suppressor
of the gate tube through a suitable coupling circuit. This coupling circuit must change the d-c level of the output but must preserve the distinction between the two possible voltage levels (see section 2.113). If the

I side is high, the super soor potential will be brought above cut-off, and there will be an out-ut rulse when the control grid is pulsed. The cute issaid to be open. If the I side is low, the superessor will remain below cut-off, and no output is obtained when the control grid is pulsed. The gate is said to be closed. Thus, the presence of a pulse on the output line at the time of sensing indicates that the flip-flop contains 1, while the absence of a rulse at t is time indicates that it contains 0. The gate tube is represented symbolically by a square having two in ut lines and one output line, as shown in Fig. 4, and is abbreviated as GT. Each CT in Wall is designated by a system number.

#### 2.113 Restoration

its output takes on the characteristics of a d-c signal. Bather than use d-c courling circuits, the scheme of peridocially complementing and recommending the flip-flops was adopted to permit the use of n-c courling circuits. This scheme is called restoration and the pulses which are used to complement and recomplement the flip-flops are called restorate pulses. The restorer-pulses occur in pairs with an interval of 1 usec. between them. These restorer-pulse pairs are repeated at a fixed interval (except during the operation of electrostatic storage) of 16 usec. between the first pulses of successive pairs (the restorer interval). The surply of pulses to the computer is suppressed during the in erval occupied by the restorer-pulse pairs, so that operations are not carried out while the flip-flows contain the complements of the numbers they are supposed to hold.

## 2.114 The Electronic Switch

open, depending on the contents of a flip-flows (see Fig. 5, drawn for a = 2). Plate current for the flip-flop tubes sust pass through the diods matrix and all but one of the resistors (R1,R2,R3, and R4). If, for example, the right hand tubes of both flir-flops should be conducting, current will be drawn through R2, R3, and R4. No current will flow through F1 because it is con exted only to the left-hand tubes. Thus the upper line will be at a positive potential with respect to the others and the gate tule connected to it will be placed at a positive potential with respect to the others and the respect to the others and the case to the others for each different combination of flip-flo contents. An altern to form of electronic switch, using give tules rather than a diode matrix, is frequently used.

# 2.12 Simple Combinations of Paste Stages

There are many possible ways of combining the basic stages to obtain devices capable of performing the functions required in a computing machine. To illustrate these possibilities, some of the rore company encountered combinations will be discussed below. The symbols used in the various diagrams are explained in Fig. 6.

# 2.121 The Binary Flip-Flop Pegister

A collection of n flip-flors is essentially a memory device capable of storing an n-digit binary number. By permanently assigning each flip-flop to a particular digit column of the number, the so-called binary flip-flop register is obtained. Three digit columns of a typical

register, the A-register (AR), are shown in Fig. 7. Three sets of gats tubes are provided so that the number in the register may be examined, two sets on the 1 side, one set on the 0 side. A pulse applied to one of the cets on the 1 side will cause the number contained in the A-register to amear on the out ut lines of that set. One set is used to transfer the number in the A-register to the Accumulator (CTO5, when the add line is rulsed), while the other set transfers this number to the "bus" (CTO2, when the read-out line is pulsed). The bus may be connected to any of several units. The set on the 0 side is used to obtain the "nines-complements" of the number (obtained by replacing each 0 by 1 and each 1 by 0, see A rendix I) for use in subtraction (CTO4, when the subtract line is pulsed). The read-in gate tubes (CTO1) serve as switches through which the bus may be connected to the in ut of the A-register when a gating signal is applied. It might be remarked that the flip-flows associated with the electronic switch in reality form a binary register.

## 2.122 The Stifting Perister

A register in which provision has been rade to shift its contents either to the left or to the right is called a shifting register. Three digit columns of a typical shifting register, the B-register (BR), are shown in Fig. 8. The facility to shift the content of each digit column one digit column to the left is provided by GTO4 and GTO5. The facility for shifting right is provided by GTO6 and GTO7. The mechanism of shifting is quite simple. Consider shift left. If a particular digit column contains 1, a pulse will agreen on the output of GTO4 when the shift left line is pulsed. After a slight delay, the flip-flop of the

digit column to the left will be set by this pulse. If the digit column had contained 0, a pulse would have been obtained on the output of 6505 (instead of 6504) which would clear the flip-flow to the left. Hence, the content of the criginal digit column would agreer in the next digit column on the left. This process is carried on simultaneously for all digit columns, with some special conventions adopted to handle the columns at each end of the register.

## 2.123 The Binary Adder.

All arithmetic processes in WWI are performed as a combination of additions and shifts. The device which performs the additions in WWI is called the Accumulator (see Fig.9). The Accumulator (AC) is capable of a number of operations other than addition, but we will only discuss it as a b nary adder. Consider only FFOl in the column labeled .Cl5 (partial sum) and FFO2 in the column labeled ACI4 (carry). Only CTO6 of ACI5 (partial sum) and CT12 of ACI4 (carry) are of interest. First let us examine the binary addition table:

Assume that a number is already in AC and that the number contained in AR is to be added to it. Only one digit column will be discussed (AC15); the same process takes place simultaneously in each digit column. When the aid gate tubes of AR are pulsed, the number will are error the lines labeled "from AR". Assume AC15 (FFO1) contains O initially. If the number being added in is O, no pulse will appear at the input ("from AR") no change occurs and FFO1 continues to contain O, as it sould. If the number being added was 1, a pulse appearing at the input will be applied

to GTOS, but it will find GTOS closed. After a slight delay, the pulse will be applied to the trigger input of FFOI causing it to chance its content to 1. Thus the correct result is also obtained for this case.

Now assume that AGIS (FFOI) contained 1 initially. If the number being added in is 0, no chance occurs; FFOI containes to contain 1, as it should. If the number being added in is 1, the pulse will find GTOS open and will proceed to set FFO2 of AGI4. After a slight delay, the pulse will trigger FFOI of GI5 to 0. This, again is the correct result, i.e., the sum is 0, with 1 to carry. The carry is subsequently added in to the next digit column by applying a pulse to the carry line. This uses GT12 to sense the carry flip-flop (FFO2). This second addition ray also result in a carry, which would require another carry pulse, and so on. However, a storter method has been devised which requires only one carry pulse. The method tutilizes GTO5 and is called the high-speed carry. It will not be discussed further.

# 2.124 The B nary Counter

The binary counter is just what its name implies: it counts the number of pulses applied to its input, indicates this number in the binary system and supplies, if desired, an outsut pulse when its capacity is exceeded, the counter being returned to 0 when this occurs. A typical binary counter, the Program Counter (PC), having a maximum capacity of 2048, is shown in Fig. 10. Assuming each filt-flop is initially clear, two input pulses must be applied to any flip-flop in order to obtain an output pulse from the associated 6705. Thus, two outsut pulses are required from PC6 in order to obtain an output pulse from PC6. This, in turn, requires four pulses from PC7, which in turn, requires eight

pulses from PCE, etc. The capacity of the counter is increased by a factor of two for each flip-flop stage. The number of pulses the counter has received is indicated at any time by the contents of the flip-flops, FC15 giving the 2° digit, FC14, the 2¹ digit, etc., with PC5 giving the 2¹0 digit. It is possible to preset this counter so that the overflow pulse is obtained after any predetermined number of pulses (less than 2045, of course).

## 2.125 The Time-Pulse Distributor

tary steps in proper time sequence in order to build up a more complex operation. Thus, if we have a supply of timing pulses, we want the first one to go to one part of the computer, the second one to another part, the third one to still another, etc. The time-pulse distributor accomplishes this conversion from a time distribution to a time end space distribution. It is a combination of a binary counter and an electronic switch (see Fig. 11). The out ut lines are selected in accordance with the contents of the counter. The input rules, in addition to performing the counting, emerges on the selected line. There is no need for the overflow pulse in t is type of operation so that no provision has been made for it.

# 2.13 The System

The interconnections among the various units are shown in detail. A 16-digit binary number will be referred to as a word. A word may represent an instruction or an actual numerical value. Words are transferred from one part of the system to another via a set of 16 dables which are

connected to the unite via gate tubes. All digits are transferred simultaneously (parallel digit transmission) with a separate cable being recuired for each. A word is transmitted to the bus from a particular unit by pulsing its read-out gate tubes. A word which is on the bus will be transmitted to any unit whose read-in gate tubes are open. Another set of 16 cables (the "check bus") is provided as a multiple check on the bus. By a suitable arrangement of the timing, it is possible to check corei erably more than the condition of the bus. The other connections, which transmit pulses that control the performance of the various steps of the overations, are made directly. Such pulses are called commands.

## 2.131 Central Control

It also contains a source of pulses which eventually become commands.

It also contains a source of pulses which eventually become commands.

The master clock supplies these rulees in various forms. The basic rulee source is the pulse generator which supplies high-frequency clock rulees (HFCP) at a 2 mc. rate and low-frequency clock pulses (LFCF) at a 1 mc. rate. The frequency divider is used in conjunction with the restorer—pulse generator (RFC) to generate the restorer pulses. Clock-pulse control (CFC) determines which pulses will be supplied to the remainder of the computer during such times as restoration and the operation of an auxiliary control. The synchronizer provides single pulses at the control of a rush button. The bulk of the operations are carried out by the time-pulses which appear on successive lines at intervals of 1 usec.

(the interval is longer during restoration and the operation of an auxiliary control). There pulses are provided by the time-pulse distributor (TFD)

(see section 2.125). The program counter (FC) is the device which contains the location of the next instruction to be performed. The control switch (CS) determines which operation is to be performed by energizing the appropriate line of the control matrix (CK). Each line of the control matrix opens certain gate tubes (control pulse output units). Nach of there gate tubes is sensed on a certain time pulse (in some cases, on two time pulses). The pulses which emerge from these gate tubes are the commands. Thus by setting the control switch to a given position, the sequence of commands required to perform the operation determined by that position is obtained. A different sequence of commands is obtained by changing the centents of the control switch. The program register (F) is a fuffer resister which serves two different purposes, to be duscussed in sections 2.137 and 2.21. Storage selection control will be discussed in section 2.137.

## 2.132 Test Storage

Fart of the memory of WWI consist of 5 flip-flor registers and 27 togrie-switch registers (each capable of holding a word). These registers are known as test storage (TS). The 5 flir-flop registers may receive and transmit words, but the 27 toggle-switch registers can only transmit words. A switch must be manually operated in order to change the contents of a toggle-switch register. The registers of test storage are numbered from 0 to 31; the number assigned to a particular register is called the address of the register. The flip-flop registers may be assigned to any desired address by changing see cables. A 32-position switch called the test storage switch (TSS) determines the register to which the machine has access. If the commend storage read-out is given,

to the number contained in the storage register whose address corresponds to the number contained in the storage switch. The command storage read-in will read a word into the register whose address is in the storage switch (if it hap one to be a togele-switch register, the re-d-in process connot take place).

#### 2.133 Arithmetic Element

The arithmetic element (AE) is shown in Fig. 14. The numbers are manipulated in the A-register, the B-register, and the accumulator, which have been discussed in sections 2.121, 2.122, and 2.123, respectively. Fany of the arithmetic operations require a longer sequence of commands than c:n be provided with the eight time-pulses available from central control. The remainder of the arithmetic element (with the exception of erecial-add memory and overflow, sign control, and divide error) is used as an sarilinry control to provide these corrands. One of the commands provided by the control matrix on such operations is stop clock, which is sent to clock-pulse control and stops the supply of pulses to the time-pulse distributor. At the same time a command is sant to arithmetic control which posmits it to commonce operation. Wither LECF of HFCP (depending on the operation) are used to carry out these arithmetic operations. When the operation is complete, an end-carry bulse which permits the time-pulses to resume is sent to clock-pulse control by rithretic control.

Certain conventions have been adopted regarding number representation. These are explained in Arpendix I. The use of the "ninescomplement" to represent a negative number means that the negative -y
of a positive number y is actually represented by 2-y-2-15 in this machine.

This is perfectly all right for addition and subtraction as long as endround carry is provided. Shifting multiplication, and division are
quite complicated for negative numbers if the "nines-complement" is
used. The sign control has been provided so that these operations can
be carried out for the positive magnitudes of the numbers involved
and the proper sich affixed to the result by complementing each digit.

Special-add memory and overflow is necessary if the computer is to have the facility of using two words to represent a double-length number. The addition of the less significant halves of two numbers represented in this fashion may produce a carry into the least significant digit of the other half of the sum. The special-add memory and overflow is used to remember such a carry. An addition of two single-length numbers which results in such a carry means that the capacity of the corruter has been exceeded. When such a situation occurs, the special-add remory and overflow will surply an clarm signal which can be used to stor the corrutation at that point and provide some indication to the overator. A similar situation can occur in the divide operation if the dividend is greater than the divisor. The divide error is used to provide an indication of this situation.

# 2.134 In ut-Cutrut Element

A block diagram of the input-out ut element is shown in Fig. 15. Words are transferred between the in-out register and the external equipment on commands supplied by the external equipment. When such a transfer is to take pl ce, the command stop clock is sent to clock-pulse control. Then the transfer is completed, an in-out control end-carry rules is sent

to clock-pulse control to permit the time-pulses to resume. A rather complicated system is provided to permit the computer to operate at the same time as the external equipment. The comparison register is used as a built-in check on the operation of the in-out register. Its overation will be described in section 5.12.

## 2.135 Built-In Checking

It was mentioned in section 2.13 that the check bus has been rrowided to check on the transfer of words via the bus. The check register is used to compare the words transmitted by both buses. When a word is transferred from unit A to unit B via the bus, it is also transferred from unit a to the tripper invute of the check register via the check bus. If the check register was originally clear, it will receive the word that was transferred. Some time later, the word received by unit D is transferred to the trigger inputs of the check register via the bus. No provision is rade for carries. If the two words are the same, the check register will contain 0. The gate tubes on the 1 stdes of the flip-:lops are sensed. If any of the flip-flors contains la an out ut calse is obtained. This output palse is sent to the alarm indicator and is also used to stop the supply of pulses to the computer (with the exception of restorer-pulses and the pulses required for electrostatic storage). The alarm indicator provides visual and sural indication of the discrepancy. Alarm pulses may also be obtained from other parts of the corruter, such us the divide error, the in-out element, and the special-add nemory and overflow.

# 2.136 Test Control

The operation of the comuter is controlled by a number of push buttons located in test control. With these push buttons and some

may be required to operate the computer. A single step of the problem can be performed using another rush button. Test equipment is provided to set up some special operating conditions that have been found useful for testing purposes.

#### 2.137 Electrostatic Storige

The bulk of the memory of WWI will be provided by electrostatic storage tubes (SST). It is ultimately expected to have a capacity of 2048 words. It is convenient to think of electrostatic storage (ES) as being made up of registers having addresses ranging from 0 to 2047. Storing selection control determines whether ES or TS is to be used. If both are used simultaneously, the addresses from 0 to 31 must te assigned to TS while the addresses from 32 to 2047 must be assigned to ES. The program register is used as a buffer between ES and the rest of the computer. When a word is transferred to 35 it is actually transferred to the program register. A subsequent command IS write will stop the suprly of time-pulses and restorer-pulses and will start electrostatic stor ge control (ESC). The word in FR will be transferred to the position in 'S corresponding to the address contained in the ES decoder (ESD) (it is similar in function to TSS, but it is not a switch) under the control of ESC. After this transfer has been completed on ESC endcarry is sent to CFC which will permit the supply of time-rulees and restorer-rulses to be resumed.

## 2.2 Machine Cycle

The period of time required to fill the counter of the TPD

when starting from the clear condition will be referred to as the machine cycle. During this period, the computer performs one operation and prespares to perform the next operation.

### 2.21 Frogram Timing

The series of cormands necessary to enable the comuter to proceed from one operation to the next is known as program timing (see Fig. 16).

For convenience, a separate line of the control matrix is used to gate open the proper commands. Nearly all of the overation lines are connected to the program timing line. Those which are not connected to this line, are nevertheless connected to those gate tubes which provide the commands essential to sequencing. Program timing provides some additional commands which are cormon to nearly all operations (these additional commands do not interfere with any of the operations which do not require them). We will discuss only that portion of program timing which is essential to sequencing. Some of the cormands are concerned with checking the transfers by use of the check register.

on time-pulse? (TP?), the convends PC read out and SS read in are given. Both TSS and ESD will receive the address contained in FC, but SSC will determine whether TS or ES is used. In order for the read in to give the convect result, both TSS and ESD must be clear. This will have been accomplished by the convend SS clear on TP1. On TP4, the word contained at the address specified by PC is transferred to FR. This transfer is direct if ES is used, but requires the command PP read in if TS is used. TP4 also clears CS after a 1/2 usec. delay. This energizes the line with controls the ri operation. This operation does not have

program timing, but the commands which occur on TP5 of the ri operation are an essential part of program timing. The commands PR read out,

GS read in, and SS read in are given on TP5 of ri. This results in GS
being set up to perform the overation specified at the address which was
contained in PC at TP2. At the same time, SS is set up so that the correct
operand will be available from stora e. The commands necessary to perform
the particular operation are obtained starting with TP6. Inasmuch as GS
will not be changed until TP42 of the next machine cycle, it is possible
to use TP1 to TF4 for obtaining these corrects. On TP7 of program timing,
the address contained in PG is increased by 1 by the command add to FC.
Thus, on the next machine cycle, the instruction contained at this new
address will be performed.

Flexibility in the choice of addresses for successive orders is provided by a special operation, subprogram (sp). The sp operation chan es the contents of PC to the number specified in its address section. An additional operation, conditional program (cp) which makes such a change conditional upon the sign of the number contained in AC (if the number is negative, the change is made; if positive, it is not changed), permits the michine to choose its own computing routine on the basis of the results of some computation.

# 2.22 Operation Timing

The series of commands which are concerned with actually carrying out the overation specified by the particular setting of CS is called operation timing. As an example, operation timing for an will be discussed. The address contained in PC at TP7, indicating the register containing

the instruction which would ordinarily follow, is transferred to AR, where it is available for further use, if desired. Thus, AR is cleared on TP6. On TP8, the commands PC read out and AB read in transfer this address to AB. On TP8 1/2 (TPE delayed by 1/2 psec.), PC is cleared. The address specified in the sp instruction is still contained in FR (it was there at TP4 of program timing). Hence, on TP1, the commands PR read out and PC read in transfer the desired address to PC. This address will be read out of PC to SS on TP2, thus the next instruction will be obtained from the address specified in the sp instruction.

### 2.3 Programming and Coding

The term programming is used in a general sense to indicate the process by which a problem is prepared for solution by the computer. Coding is a much more specific term referring to the step in which the problem is stated in terms of operations which the computer performs. Then a problem has been put in this form, it is referred to as a coded program. We shall be mainly concerned with coding in this themis. A more detailed discussion of coding for WWI may be found in reference 16. A compact summary of the basic information has been included in this thesis as Appendix I.

# 2.31 Nomenclature

we have previously defined the term word as a 16-digit binary number. If the word is interpreted as an instruction, the first (left-hand) 5 digits are the coded representation of the operation to be performed. There are 32 operations possible with this convention. In

when TS is used, only the last 5 digits specify the address. When the operation code specifies a shift, the last 5 digit indicates its extent. For convenience in writing codes, two-letter abbreviations have been adopted for each operation (e.g., sp. cp. ri. ca. etc.). Another convenient notation is FC(x), which indicates the address of the resister containing x. The symbol CR (y) indicates the contents of the register whose address is y. Thus, the combination of symbols on FC(x) represents an instruction which will transfer the contents of the storage register containing x to the accumulator. Of course, it is necessary to assign a specific address to the register containing x before the instruction can be used in the computer. If x were put in register 31, this instruction would be written on 31. In the machine, this would a peer as 10000000000011111 (10000 is the code for ca).

The computer distinguishes between a word used as an instruction and a word used as a number on the basis of the time in the michine
cycle at which this number is transferred from stories. If it is transferred on TP4 and TP5 (via the program register), it will be treated as
an instruction. At any other time, it will be treated as a number. Thus,
it is possible to manipulate an instruction as though it were a number.
Indeed, it is a very common practice to do just that in programming (see,
for example, reference 3).

### 2.32 The Operations

The operations which have been accepted for permanent use in WWI are listed in Ampendix I. A brief description of each is included

there. In addition, there are a number of operations which have been temporarily wired into the control matrix. These operations are distinguished by a two-letter code in which the first letter is q. One of these, gc (check), will be used very frequently in this thesis. It corpores the contents of the storage register indicated in its address with the contents of the accumulator. The corporison is carried out in the check register in a manner very similar to checking of the bus transfers (see section 2.135). An alarm is obtained if the words do not agree. Though gc is only a temporary operation at present, it seems very likely that it will be made a permanent operation.

### 3 - DISCUSSION OF FAILUPES

#### 3.1 Classification of Failures

functions. Its ability to perform these functions devends on many factors. These factors generally vary with time in an irregular fashion.

The status of all these factors at a particular instant will determine whether the stage can perform one of its functions at that instant. A failure need not be precent at all times. However, if it is not present during a checking process it will not be detected. Thus, the nature of the factors causing failures and the degree to which they can be controlled exert an important influence on the process of error detection. It is helpful to classify failures according to a scheme which indicates this influence.

# 3.11 Steady-State Failures

an irreversible change in one or more factors which renders a state permanently incapable of performing a function is colled a steady-state failure. The stage will consistently yield the same incorrect response at every application of the signal. A typical example is an open cable; it can never transmit a pulse. However, it must not be concluded that a permanent failure of a component is necessarily a steady-state failure of the stage. If a resistor used to terminate a cable should become open, a distorted pulse waveform would result. Whether this will cause a failure depends on several other factors which need not always be unfavorable.

### 3.12 Dynamic Failures

Closely akin to the steady-state failure, though somewhat more subtle in its effect, is the dynamic failure. 21 The distinguishing factor is that it appears only on certain applications of the signal. It may be that a coupling condenser will charge slightly every time it passes a rulee. If there is not sufficient time for it to discharge between successive pulses, the charge will continue to build ap. The amplitude of the outrut pulses will decrease as the charge on the contensor increases. Sventually the outcut pulse will be too small to drive the next stage. This offect is referred to as FRF-sensitivity (pulse-repetition frequency) and is an example of a dynamic failure. The open terminating resistor referred to in section 7.11 might be a dynamic failure or it might not be a failure at all. Another interesting example of a dynamic failure found in the overetion of WWI (after considerable effort) was the result of "ringing" on a d-c power surely line. The various supply lines have been provided with IC-filters to prevent the coupling of pulses between circlits. The current surges which accompany the switching of vacuum tubes cause the filter circuits to oscillate or "ring" with a fairly small amplitude. It so happened that the timing of a problem was such that the fourth time a particular pulse occurred coincided with the peak of this oscillation. This was barely sufficient to interfere with correct operation; indeed an error did not always occur on this pulse. This failure was not due to a defective component; it was a design weakness which, in all fairness. the designer could not have been expected to anticipate.

#### 3.13 Incipient Pailures

Some factors change in a continuous fashion. For example, the cathods emission of a tube tends to decrease gradually as it ages. Eventually, the change will have progressed to a point where correct operation is no longer possible and a steady-state failure will evist. Pefore this point is reached, the operation of the stage becomes more sensitive to other factors. An incipient failure is said to exist. Slight we reations in some factors would not cause any failures in a stage without an incipient failure, but these variations would cause failures when an incipient failure is present. Aging effects are found in all components, but are more pronounced in tubes and crystal rectifiers.

### 3.14 Intermittent Failures

Frequently a stars will be abnormally sensitive to a factor which fluctuates in a random fashion. This sensitivity is not the result of a progressive deterioration as is the sensitivity exhibited by an incident failure, but is more or less constantly present. The high contact resistance present in a poorly soldered joint might be a cause of such sensitivity. The failure will appear at irregularly spaced intervals. Such a failure is called an intermittent failure. It is quite similar to the incipient failure in arrearance, but not in cause.

#### 3.15 Reproducible Failures

Another tyre of failure which is manifested at irregularly spaced intervals is the reproducible failure. Its im-ediate cause is an external disturbance of some sort: mechanical vibrations, presence

of r-f radiation, and the like. Of course, there must be some other factor with makes the stage sensitive to these disturbances, but it may operate very satisfactorily as long as the disturbance, is not present.

Py artificially causing such disturbances, reproducible failures may be made to occur whenever desired, and hence take on the characteristics of steady-state failures.

### 3.16 Random Failures

is the random failure. It does not require an abnormally sensitive stage for its occurrence. It generally results from an unusual combination of unfavorable conditions among the factors which are subject to veriation. It may be due to an exceptionally large variation of one such factor, e.g., a power line transient far exceeding the normal lim ts. Although there is always a cause for such a failure, it remains unknown to the overator.

### 3.2 Checking Considerations

### 3.21 Influence of Various Tyres of Failures

determining the ability of the commuter to function properly. If the ability should change during the course of the experiment, it becomes difficult, if not impossible, to interpret the results in terms of the condition of the commuter. It is quite possible to obtain the correct solution to a problem even though dynamic, incipient, intermittent, reproducible, and rendom failures are present. In short, test checking is applicable to only steady-state failures. In designing test sequences,

no consideration need be given to any failures which are not essentially steady-state failures. However, it has been noted that dynamic, incipient, and reproducible failures are very closely related to steady-state failures. If supplementary means can be found to convert these types of failures into steady-state failures during the performance of the test sequences, test checking may be used for their detection. Of course, if an intermittent or random failure should occur at an optortune time, it, too, would be discovered by test checking. However, mathematical checks are much more effective in dealing with irregularly spaced failures.

### 3.22 Operational Effects of Failures

ability to perform a certain function. The problem of designing a checking sequence may be stated in terms of these functions; the checking sequence must require each stage to perform all such functions. One of the first tasks, therefore, is to determine these functions for each of the elementary stages. It is not necessary to consider each component failure for this purpose, as will become arrarent when it is done in sections 3.221 to 3.224.

When it is desired to use a problem for trouble location, a more detailed knowledge of the operational effects of the failures is required. The actual response of the faulty state rust be known. This is an important practical distinction between checking and trouble location. It would perhaps be well to restate this in the form of the questions which the designer must ask in each case:

Checking: What must the stage do?

Trouble Location: What will the stage do?

Some of the answers in typical cases are given below.

### 3.221 Amplifiers

amplitude to drive the succeeding stages when the appropriate input signal is applied. The output amplitude may lie among a wide range of values when a faulty component is present. The particular amplitude level which is obtained will either be capable of driving a succeeding bi-stable stage or it will not be capable of driving it. Since the output will ultimately be applied to a bi-stable stage, it may be said that the amplifier will not supply any output pulse if a failure exists. This does not mean that the actual amplitude is zero, but that it might just as well be zero as far as its effect on the operation of the computer is concerned. If the output is capable of driving the succeeding stage, no failure exists.

It is, of course, possible to obtain an out ut when no input signal is applied. However, there is no way of predicting the relation of the times of occurrence of such spurious pulses to the timing of computer operations. Thus, failures of this type are essentially intermittent and need not be considered. As additional justification for disregarding them, it should be noted that such failures are extremely rare in well-designed amplifiers.

To summarize this discussion, the two questions of 3.22 will be enswered. An amplifier stage must provide an outrut whenever the appropriate signal is applied to the input. If a failure exists, no output

will be obtained when a signal is applied to the invut. The extreme simplicity of these conclusions tends to belittle their importance. Geretainly they are intuitively obvious, but they must be clearly realized in designing test sequences. Note that the failure of the amplifier may be due a large number of causes (open filament, open plate load resistor, grid-to-cathode short, etc.), all of which exhibit the same symptom.

### 3.222 Gate Tubes

The gate tube can fail in three ways:

- 1. An output pulse is obtained without coincident invuts.
- 2. No outrut rulee is obtained for coincident invuts.
- 3. An outrut pulce is coupled back to an in ut line.

Case 3 is extremely rare if care has been given to the layout. Furtherrore, it is correctly termless if the input line is driven by an amplifier, as is often true. This case will therefore not be considered in
this thesis. Case 2 is quite possible and fortunately, is amenable to
easy analysis. Two possibilities must be considered under case 1: the
output pulse is obtained without any input or it is obtained with only
one input. The former possibility represents an oscillatory condition
which is extremely unlikely. If it should exist, it would be extremely
difficult to determine the times when a spurious pulse would occur. It
would take on the characteristics of an intermittent failure, so that it
need not be considered. The latter possibility also may result in spurious
pulses at indeterminate times, which, again, need not be considered.
However, in the usual amplication of the gate tube, a signal is amplied to
one input to determine whether the other input is energized at that time.

ever this sensing signal is applied, even though the other input is not energized. This is a straightforward manifestation and will be considered. To surrarize, it will be assumed that to the failure of a gate tube can exhibit only two possible effects:

- 1. It will never provide an output when a sensing signal is applied.
- It will always provide an output when a sensing signal is applied.

### 3.223 Flip-Flops

of three inputs and to retain such information indefinitely. A faulty component may cause it to develop a preferred position, i.e., it will not remain in the other position indefinitely, though it will remain in the preferred position for any desired period of time. Once again an indeterminate time factor appears in the consideration of the effects of failures: the veriod of time a flip-flop can hold the non-preferred information before reverting to its preferred state. If this period is longer than the time it is required to hold the non-preferred information, this weakness will not have an adverse effect on its operation. The longest interval between restorer-pulse pairs is the maximum amount of time that a flip-flop would be required to hold one state. However, for reasons which are not fully understood, flir-flops have been observed to revert to the preferred position even though restorer pulses are used. Some

on computer operation when to trouble location is considered. Nevertheless, the checking requirements can be clearly stated:

- 1. It must receive and retain a C via the clear input
- 2. It must receive end retain a 1 via the set invut
- 3. It must receive and retain a O via the trigger input
- h. It must receive and retain a 1 via the trigger imut.

### 3.224 Restorer Fulses

The loss of restorer pulses in some part of the computer is another type of failure which does not lend itself to a simple analysis. If the contents of the flir-flore are changed at sufficiently frequent intervals, the charge on the coupling condensers will probably be maintained, so that correct operation will continue. However, this is not likely. When a flip-flop remains in the same position long enough, the suppressor of its sensing gate tube will assume the potential of the point to which it is returned. Those gote tubes in which the suppressor are returned to ground will be open, regardless of the condition of the flipflops. Most gate tutes are designed this way. Those gate tutes in which the surpressor are returned to -15 volts will be closed, regardless of the condition of the flip-flops. Read-in gate tubes are generally of this type. The same symptoms will, of course, arrear if a flin-floo is not capable of responding to a signal at the trigger input, for it is equivelant to the loss of the restorer-pulse supply to that flip-flop. Effectively, these failures may be interpreted as flip-flow failures in which:

- 1. The flip-flop appears to contain 0 and 1 simultaneously.
- 2. The flip-flop appears to contain neither 0 nor 1.

Because restorer pulses play such a vital role in the operation of the computer, it may be well to include some built-in device to indicate the loss of restorers at some critical point (or points). This subject cannot be discussed any further in this paper.

### 3.3 Marginal Checking

A very useful technique adopted by Project Whirlwind is the provision of facilities for the variation of d-c sup ly voltages to various parts of the computer. The variation of these voltages is a powerful adjunct to other checking procedures. Dynamic and incident failures can be converted into steady-state failures through its use. It helps simplify the problem of trouble location. The possibility of removing a deteriorating component before it causes a failure in computer overation is enhanced by this voltage-variation scheme.

A variable-voltage generator is provided which may be placed in series with a number of the d-c supply lines. Each of these lines supplies a rather limited number of stages. A telephone dial may be used to select the line on which the voltage is to be varied. Manual control of the variation is available within the limit imposed by the generator characteristics. In addition, there is provision for automatically switching the generator into each line in sequence and causing the voltage to be varied within predetermined limits before switching to the next line.

These limits may and generally will, be different for each line. It is possible to both increase and decrease the voltage. Since only a limited number of stages is affected by a single voltage-variation line, a considerable degree of trouble location may be affected through its use.

For example, if an incipient failure under normal voltage is converted to a steady-state failure by v rying the voltage on a particular line, the trouble is associated with one of the stage supplied by that line.

In general, there is a certain maximum telerance (both positive and negative) in the supply voltages of each stare, which is determined by its design. A telerance will be associated with each supply line.

This telerance is some function of the telerances of the individual stages supplied by it. These telerances are called margins. When a stage is operated very close to its margin, it becomes much more sensitive to fluctuations in the factors which cause failures. In particular, dynamic and incipient failures show up much more clearly. Decreasing the screen surply voltage on an aging pentode tends to accentuate the decrease in cath de emission. The plate current decrease in a normal tube may be sufficient to cause a steady-state failure with a margin of +30 volts; for an aging tule, the targin at which a steady-state failure accents may be only -15 volts.

These facilities may be used to measure the margin on each line by verying the voltage to the point at which a failure occurs. By measuring the actual margins at regular intervals, considerable information is obtained regarding the effects of aging on the operation of the contuter. When margins become unduly low, it is time to reclice the offending component. Skilled personnel contend that the trends of this data often indicate the faulty component quite accurately. They have proved their contention on a number of occasions. This procedure of checking the margins has given rise to the term marginal checking to refer to the use of the voltage-variation facilities for any purpose.

volting-variation line without specifying the conditions under which it was measured. No failure would be obtained at any voltage if the stages surplied by that line were not in use. One standard set of conditions is the complementing of all flip-flops at a fixed rate, determined by an external pulse source. This is used only for the marginal checking of the flip-flops. By observing the indicator lights, the margin at which each flip-flop fails to complement can be measured. This technique has proven quite effective. No equivalent technique has been developed for other types of stages. It is necessary to design problems which utilize the stage undergoing voltage-variation. Nuce study must be given to the coordination of marginal checking with test checking.

minent occurence of a failure encreets that it ray also be used to predict that no failures will occur for some definite period of time (the prediction interval) if the rargin is greater than some predetermined value. There is not yet sufficient evidence to justify this conclusion, but it does seem plausible. Of course, t is conclusion could only apply to steady-state, dynamic, and incivient failures. By providing some apparatus to produce suitable external disturbances, it might be possible to extend this reasoning to reproducible failures. If the existence of a reasonably long prediction interval can be demonstrated conclusively, test-checking procedures would be considerably more attractive. An overall check problem could be performed at regular intervals (somewhat shorter than the prediction interval) while each surply line was

varied through the range required for the prediction to be valid. If
the problem indicated that no failures were present, it may be concluded
that problems solved during the remainder of the prediction interval
are error-free, provided that some mathematical checks are included to
guard against interrittent and random failures. If the prediction
interval were long compared to the time required to perform the check
problem, the loss of time inherent in test-checking procedures would
become negligible.

### 3.4 Summary

Test-checking procedures are only effective against steady-state failures. However, by use of sup-lementary means, many failures may be converted into steady-state failures. Those which cannot be converted are sufficiently random to be detected by mathematical checks. For purposes of test sequence design, it may be assumed that only steadystate follures need be detected. To use a sequence for trouble location, it is necessary to determine the results which will be obtained in the presence of all possible faults. Each fault is assumed to be manifested. in a clearly defined manner, although this is by no reans accurate. Those foults which do not exhibit clearly defined effects are gnerally sufficiently rare to justify their being neglected. Furthermore, if such a peculiar case arises, analysis on the basis of these assumtions should lead to contradictions, thus indicating that a special case is involved. However, these assumptions do not eliminate the need for further study and classification of corouter failures. Ferhaps such study may lead to a more satisfactory treatment of tent checking.

#### 4 - PRINCIPLES OF TEST SEQUENCE DESIGN

### 1.1 Objectives

Will be in demand as long as there is a need for computers. At present, the demand for such methods is quite pronounced. The rain emphasis is towards developing techniques which give useful results as soon as possible, even though they leave something to be desired. More elegant and effective methods will be required in time, but the important thing is to keep the computer operating with a minimum of laborious raintenance. Trouble-location techniques are of more immediate interest than checks on the overall results. The ultimate goal is twofold:

- 1. to obtain a compact, overall test problem, coordinated with marginal checking, which will indicate that every stage of the computer is working properly (barring the inevitable exceptions) and will continue to work properly for the prediction interval.
- ?. to obtain a collection of test sequences which may be used (probably in conjunction with rarginal checking) to locate the source of any failure which is present with as little ambiguity as possible.

This theris is mainly concerned with the second of these goals.

However, it is felt that some insight into the design of an overall check broblem can be gained by studying the checking of small units. In addition, the results of such a study can also be applied to trouble location. Thus, the main objective of this thesis is the development of a systematic method of designing test sequences for checking relatively small units in coordination with marginal checking. A secondary objective is to discover

those limitations of test checking which might require the use of other checking procedures.

### 4.2 History

- E. I. Flumenthal and G. C. Hoberg were the first men to study the problem of test sequence design in detail. They were concerned with locating certain types of steady-state failures in the Five-Digit Multiplier, the prototype of the Arithmetic Element of Whirlwind I. They considered only single failures which are equivalent to the removal of some vacuum tube. The procedure they adopted is quite straightforward. It consists, essentially, of tabulating the functional abilities lost upon the removal of each tube. Generally, this tabulation will suggest a secuence which will detect the presence of the failure. For example, survose that a verticular failure would render a flip-flop register incapable of receiving a 1 in digit column 15. The sequence for checking for this failure is almost self-evident. Fead a word which contains 1 in digit column 15 into the register. Read the word received by this register to a comparison unit (such as the check register). Compare the word in the corparison unit with the word originally read into the register. If they agree, then the particular failure does not exist (every other part of the corpuler is working properly because of the single fault assumption). If they do not agree, then the failure exists. However, these men utilized sequences of corrands rather than operations, thus severely limiting the practical applicability of their results.
- G. C. Summer recognized that information is routed along certain clearly defined channels in many typical computers units. He concrived of a method of trouble location analogous to the conventional procedure

of signal tracing. The test sequence would set up the unit so that the information would be routed along a particular channel, supply information to the infut of the channel, and check that the appropriate information appeared at the output. He designed a few sequences (using commands) to illustrate the procedure, assuming only single failures.

### 4.3 Processed Procedure

A method will then be developed for designing a test sequence to check any one section, assuming that all other sections are working properly.

Modification of the procedure to include multiple faults will be considered. The use of the sequences for trouble location will be discussed.

### 11.31 Subdivision of the Computer

A section consists of all stages which are suplied by a single voltage-variation line. This subdivision of the computer automatically coordinates the test sequences with marginal checking. Of course, most stages require more than one d-c voltage supply, and it frequently happens that the same section corresponds to several voltage-variation lines. Thus, there are less sections than there are voltage-variation lines, and one test sequence can be used with several voltage-variation lines. This method of subdivision also helps create a firmer foundation for the single fault assumption. It is more likely that a failure will occur in a section undergoing voltage-variation than in one whose surply voltage is normal, particularly if the periodic checking scheme discussed in section 3.3 (in connection with the prediction interval) is used.

### 4.32 The Channel Concept

The picture of clearly defined channels of information flow conceived by Sumner (see section 4.2) may be readily generalized to include the entire computer. Information arriving at the input-output element is routed to storage. From storage it may be transmitted to the arithmetic element, where it may be combined with other information to form the desired information. It is then routed to the input-output element via storage and made available to the operator. Another possible channel takes the information from storage to control, where it may be used to manipulate other information in the computer. The paths traced out in this way can become very complex, especially since the information frequently undergoes transformation and may frequently be used to determine its own route. Fany channels may coincide for several stages. These corrlications make Sumner's idea of automatic signal tracing untenable when arrived to the whole computer. The practical difficulties involved are very nearly insurmountable.

Essentially, Summer considered those segments which were within each of the ty ical units that he treated. Each of those segments could conceivably be shared by rany different channels. Blumenthal and Hoberg also considered channel segments, although they ray not have been aware of it. Their segments consisted of single vacuum tutes. The channel segments to be considered in this thesis will be those which are within a given section. A channel segment will be defined as a vortion of a channel which is included within a given section and includes all states in that section which are wart of the channel. In special cases, this definition may reduce to that of Summer or that of Blumenthal and Hoberg.

describe e of channel segment. A channel segment may be described without any knowledge of the complete of number of which it is a part. It is only necessary to examine a diagram of a section, choose a stage which receives information from sources wholly outside that section, and proceed to trace the flow of information from this stage to all stages until a stage is found which is outside the section. There may be many alternate paths for the flow of information. Each of these is another channel segment. To find all channel segments belonging to a given section, all such paths must be traced out, starting with each stage which receives information from sources wholly outside the section. This process may seem very laborious, but in many cases it is really oute sirple. Every alternative mute encountered in tracing out these maths about the carefully noted down in order to avoid overlooking one of the paths.

# 4.33 Use of The Channel Concept

Once a tabulation of all the channel segments of a given section has been made, it is a comparatively simple matter to design a test sequence for each of them. When there are several channel segments sharing the same stars (or starses) it is sufficient to check one of the channel segments in order to check the stars. The choice may be made arbitarily. However, it is frequently desirable to check all channel segments, if only for the greater assurance it provides. The sources and destinations of the information transmitted by each channel segment should be tabulated too, as should to the command pulses which control

the transmission of this information. It is then only necessary to choose one of the sources (if there is only one source, no choice exists), one of the destinations, and one of the sequences of commands for transmitting the information from this source to this destination. Any convenient method of supplying the source with the information and examining the destination may be used. All choices of this nature may be made quite arbitrarily if there are alternate possibilities. If there is but one possibility, then that is the way it must be done. It is sometimes possible to make choices in such a way as to economize on the arount of storage required. These points should become clear when some examiles are considered.

There is one point that requires special attention. Frequently, the information supplied by a channel segment to its destination is used to manipulate some numbers in an arithmetic process. In some cases, it may happen that the correct result will be obtained even though the proper information is not supplied. Whenever such a channel segment is being checked, care must be taken to be sure that the numbers being manipulated do not lead to one of these special cases.

# 4.34 Fultible Faults

The test sequences which have been obtained on the assumption that all marts of the computer (other than the channel segment being considered) are working properly, may not apply under less restricted conditions. In many cases, it is possible to design other test sequences which see useful when multiple faults are present. However, it is still necessary to make some assumption regarding the combinations of faults

small unit which may consist of several sections. The concept of a channel segment must be broadened to include all stayes of the channel which are in any of these sections. This is very nearly the same type of channel segment that Summer used.

The procedure requires that careful attention be given to the order in which the channel segments are checked. First, all single-stage channel segments should be checked. Then, design test sequences for all two-stage channel segments which share stages already checked, and so on. Of course, it is quite possible that there are multi-stage channel segments consisting of several stages that cannot be checked independently. If such a situation exists, then the whole channel segment must be treated as though it were a single stage.

tions than were considered in its original design, it is usually necessary to start all over and design a completely new sequence. The additional stages which come into consideration generally change the composition of the channel segments so that the original sequence is not applicable. In particular, the one-stage channel segments of the smaller unit may become multi-stage channel segments. This feature of the amproach is somewhat of a disadvantage because it requires that much analysis must be carried out every time a larger unit is considered. However, in lieu of a better approach, this disadvantage must be tolerated.

### 4.35 Trouble Location

The test sequences designed for checking the various channel segments may be used in trouble location. It is only necessary to determine the point in the sequence at which an alarm is obtained in order to discover the faulty channel segment. The type of information which was not transmitted is determined by analyzing the contents of the various flip-flops. These two steps usually reduce the number of possible locations of the failure to a reasonable value. Manual trouble-location techniques may then be used.

A greater degree of trouble location can be obtained when a mimber of channel a gments share the same stages (see Fig. 17). It is necessary to use a test sequence to check each channel segment. A mumber of possibilities may be eliminated by considering the results obtained for each sequence. Referring to Fig. 17 and assuming only single failures, the following pattern of results will be obtained:

| Defective<br>Stage | Channel<br>Segment | Channel<br>Segment | Channel<br>Segment<br>#3 | Channel<br>Segment |
|--------------------|--------------------|--------------------|--------------------------|--------------------|
| A                  | Alama              | Correct            | Alam                     | Alara              |
| В                  | Alarm              | Alarn              | Alarm                    | Correct            |
| C                  | Alarm              | Alerm              | Correct                  | Alarm              |
| n i                | Alarm              | Alarm              | Al arm                   | Alara              |

Actually, it is not necessary to perform the test sequence for channel segment #1 because the alarm will always be obtained. The indicates

a real difference between shocking and trouble location. For checking, it would be sufficient to perform the test sequence designed for channel segment #1. If no slarm is obtained, then all four stages are operating correctly. If an alarm is obtained, the other three test sequences may be used to isolate the failure.

### 5. - EXAMPLES OF DESIGN PROCEDURE

### 5.1 Single Fault Sequences

Several test sequences have been designed to check typical sections of the corputer. It is assumed that every channel segment, other than the one being checked, is overeting correctly (i.e., only a single fault exists). Consideration of storage has been oritted because the channel segments consist of a prohibitively large number of storage. Special techniques will probably be more suitable for checking storage. A design for an entire section has also been carried out in the hore that it would point the way to the design of test sequences for larger sections.

### 5.11 Arithmetic Control Flip-Flows

Test sequence design is probably most simply illustrated in arithmetic control because it consists mainly of disjunct channel segments on thining few stages. Arithmetic control has been discussed in section 2.133 where a block diagram (Fig. 14) has been included. The section defined by voltage-variation line 79 will be considered. This section is identical with the one defined by voltage-variation line 80.

The following stages are in tile section:

| FF304.01<br>FF306.01 |          | *                  | (sign control) (multiply) |
|----------------------|----------|--------------------|---------------------------|
| FF307.01;            | FF307.02 |                    | (ahift)                   |
|                      |          | FF308.03: FF308.04 | (divide)                  |
| FF309.01;            |          |                    | (aproial-add)             |
| TT210.01:            |          |                    | (point off)               |

The channel segments, together with the sources and destinations of the information and the sources of the commands, will now be described in

detril. These descriptions will be followed by a discussion of the design of the test sequences.

### 5.111 Channel Segment 1 - Sign Control

Only one stere is involved, FF304.01. It may receive information via its trigger input from itself (assuming a slight delay in transmission), and the sign digits of AR and AC. It supplies information to AC and itself. The correct AR sign check (which occurs on mr, mh, and dy see Fig. 18) is required to obtain the information from AP. The correct AC sign check (which occurs on mr, mh, dy, sl, sr, and sf, see Fig. 18) transmitted and received on the correct product sign (which occurs on mr, mh, dy, sl, sr, and sf, see Fig. 18) which also transmitted information to AC.

information 1. Assume that it is been decided to supply this information from AR. The more easily accessible definetion is AC. Either mr, wh, or dv may be used to transmit the information. Suppose mr is chosen (dv is a somewhat more complicated overation). The operand must be a negative number, say - 1/2. Thus we must order mr °C(-1/2). Some number must have previously been in AC, and, in order for the information 1 to be transferred on the command product sign, it should have been positive, say +1/2. This can be put in AC by the instruction ca (PC(1/2). The resulting product is, of course, -1/4. Thus the test sequence has been develored:

cs RC(1/2) mr RC(-1/2) qc RC(-1/4) Care rust be taken to be sure that the correct answer is not obtained if the channel segment fails to supply the proper information. In this problem it is quite arrarent that the results of the sultirlication would be +1/4 if the proper information were not supplied, so that the check is indeed valid. Note that only the signs of the operands are instrumental in producing the desired information; their magnitudes may be chosen arbitrarily. The particular numbers given were chosen because of their simils representation in binary notation.

To check its ability to transmit the information O, the sequence of instructions:

ca \*C(1/2) mr HC(1/2) qc PC(1/4)

may be used. If the information 0 were not transmitted, then the information 1 would be transmitted. The result would be -1/4 so that the creck is valid.

There six instructions completely check the ability of this channel segrent to perform its function, assuming that any failure symptoms which may exist will exert its effect throughout the time required to carry out these sequences. Note that it was not necessary to use all of the information sources or destinations. Indeed it would have been quite difficult to base a check on the use of FF304.01 is its own source and destination.

# 5.11? Channel Segment ? - Multiply

Again, only one stage is involved, FF306.01. It receives the information 1 from central control via the set in ut on the command sultiply (which occurs on mr and mh, see Fig. 18). It receives the information 0

occur on mr, mh, dv, al, ar, and af. It may also receive the information on ACl to point off (on af only). The information is transmitted to PRI5 only whenever a righ-frequency clack pulse occurs.

To check the ability of the charmel segment to transmit the information 1, all that is necessary is to use either the mr or mh operation. The sequence:

ca FC(1/2) mr C(1/2) qc FC(1/h)

does just this. It is, of course, one of the requences ared for channel segment 1 and his been used again in the interest of economy. If the information 0 were transmitted, none of the stape of the multiplication would be performed. No step counter end-garry would be available to permit the flow of time pulses to resume. The commuter would remain interest indefinitely ("prolonged stop clock"). At present, no slarm is given when this situation exists, although the operator can easily detect it by observing the indicator lights. It has been provosed that an alarm indication for this condition be included in WVI. It is likely that this proposal will meet with approval.

If the channel segment cannot transmit the information O, spurious shifts and additions in the arithmetic element would occur at all times (though such actions are proper on TP? of mr or mh). Thus, the instruction ca EC(1/2) will place 1/2 in AC and the instruction qc EC(1/2) can be used to check that it remained there. Wille no definite prediction can be mide regarding the content of AC in the presence of the spurious

O, it is safe to assume that it would not be 1/2 (though there are times when it would be 1/2). A more definite check cannot be obtained.

fault assumption in the design of these sequences. The sequence required to check FF304.01 depends on the correct operation of FF306.01, while the sequence for checking FF306.01 depends on the current 'paration of FF304.01. Assuming only single faults, both flip-flors cannot fail simultaneously. If there is a failure in FF304.01, FF306.01 will operate properly to effect the check and vice versa. As it happens, even if both should fail simultaneously, some alarm indication would be obtained, but this is just a fortunate coincidence. No effort was made to obtain alarm indication in the presence of multiple failts.

### 5.113 Channel Segmente 3A and The Siffs

Secause channel segment 3A, consisting of FF307.01, is quite similar to channel segment 3B consisting of FF307.02, they have been grouped together. With minor changes (e.g., replacing mr and mh by sl for 3A and gr for 3B), the discussion in section 5.112 is applicable.

To test that segment 3A can transmit the information 1, the sequence:

ca RC(1/4) sl 1 qc PC(1/2)

may be used. Similarly, the sequence:

ca RC(1/2) sr 1 qc PC(1/4) may be used for segment 3B. To check that both segments can transmit it information 0 the sequences designed for channel segment 2:

ea PC(1/2) go PC(1/2)

is satisfactory.

use stor e economically dictates that same set of numbers be used whenever possible. This criterion has suided the selection of the operands in this sequence. Further reduction of storage requirements can be obtained by making use of the contents of aC after the gc operation.

This is a standard device used in coding. The sequences for channel segments 3A and 3B may thus be reduced to:

ca PC(1/2) qc RC(1/2) sr 1 qc FC(1/4) sl 1 qc RC(1/2)

This saves two orders.

# 5.114 Channel Segment 4 - Divide

The first multi-stage channel segment to be discussed comists of FF308.01, FF308.02, FF308.03, and FF308.04. Several apparent
difficulties may become evident from a study of the block diagram. Attempts to apply the channel concept to the two flip-flops of the dividepulse distributor may meet with failure because of the complicated manner
in which they are interconnected. However, if they are treated as a
generalized flip-flop stage having three stable states, the channel
segment becomes clearly defined. No real difficulty is intro used by

the need for information to pass through an external section (AC) in being transmitted from the divide-pulse distributor to FF308.03. It is only necessary to choose the operands so that the appropriate information is suplied to FF308.03. Lecause FF308.03 is intended to provide an clarm when an improver division is performed, to check this function it is necessary to actually order a division which results in an alarm. The analysis of tis channel segment becomes quite—straightforward once these few difficulties have been resolved. In fact, all information is transmitted along this channel by a sequence of corrands which automatically follows the command divide (in normal operation).

The division of 1/4 by 1/2 would, of course, cause the information to be transmitted to the F305.01. This would initiate the transmission of all types of information from the divide-pulse distributor. The particular operands require that both divide zero and divide one be transmitted to FF305.03. However, divide zero is not transmitted at a time than F705.03 contains 1 = only an improper division (more fully discussed below) can provide this condition. A sequence for checking this channel segment is:

ca RC(1/4) dw RC(1/2) qc RC(+0) sl 15 qc PC(1/2)

That this is not a special case in which the correct result is obtained though the proper information is not transmitted can only be verified by the tedious process of manually carrying out the steps of the division in the same manner as the computer, taking due note of the possible failures.

Such verification has been made for this sequence, but the details are not presented here. Some failures will result in a "prolonged stop clock" which has been discussed in section 5.112. These particular operands, chosen for their convenience, are therefore satisfactory. Had it proven otherwise, a trial and error procedure would have been necessary. The same sequence that is developed in section 5.112 for checking that FF306.01 can transmit the information 0 may be used to check that FF305.01 can transmit the information 0.

improper division is 1/2 divided by 1/4. If FF308.03 is working properly, an alarm should be obtained on the first ster of the process.

If no such alarm is obtained, the con uter will continue to operate on the numbers in the arithmetic element as though it were performing a valid division. When the step counter indicates that the process is complete, AC will contain 0 in all digit columns, while FF will contain 1 in all digit columns. If the operation is followed by all 15, an overflow will occur during the round-off, resulting in an arithmetic check alarm. An alarm will be obtained in both correct and incorrect operation. In correct operation, it will be a divide error alarm, while in incorrect operation, it will be an arithmetic check alarm. The sequence,

ca RC(1/2) dv RC(1/4) sl 15

must be restarted quite frequently. This situation always arises when it is necessary to check a stage designed to give an alarm.

### 5.115 Channel Segment 5 - Special-Add

Another multi-stage channel segment consists of FF309.01 and FF309.02 (see Fig. 19). While is is possible to analyze it using the channel concept, a sequence may be designed more directly. Although FF309.01 receives information on many operations, including sa, FF309.02 receives information only on sa. Furthermore, if the sum of the two operands involved in a special-add is greater than or equal to +1, FF309.01 receives the information 1 and FF309.02 receives the information 0, representing a carry of +2-15. If the sum is less than or equal to -1, FF309.01 receives the information 0 and FF309.02 receives the information 1, representing a carry of -2-15. These carries are transmitted to AC on the command special carry of a subsequent ca, cs, or cm. Thus the sequence:

ca PO(1/2) ea FO(1/2) cc RO(+0) ca RO(1/4) cc RO(1/4+2-15)

may be used to check that a carry of  $+2^{-15}$  can be transmitted. The sequence:

ca EC(-1/2) sa FC(-1/2) qc PC(-0) ca EC(1/4 + 2<sup>-15</sup>) qc RC(1/4)

may be used to check the carry of -2-15. As usual, the numbers were chosen for convenience.

If both flip-flors contain 0, a carry of +0 is transmitted, while if both flip-flors contain 1, a carry of -0 is transmitted. The correct result cannot be obtained unless the correct information is supplied by these flip-flops.

## 5.116 Channel Segment 6 - Point-Off

Two stages, FF310.01 and FF310.02, compose this channel segment. The design details may be carried out in a perfectly straightforward manner. Only the sequences will be given here:

> ca RC(1/2) qc RC(1/2) ca EC(1/4) ef x qc EC(1/2)

(x represents the address of any convenient register).

## 5.117 The Complete Sequence for Arithentic Control Flip-Flops

The complete sequence for checking voltage-variation lines 79 and fo is to be found in Appendix II as Test Sequence Fumber II. The short sequences have been combined in such a way as to minimize the arount of storage required. Addresses have been assigned to the various registers. The resulting coded program can be used with test storage. The requence for checking that a divide error alarm is obtained has not been included.

## 5.12 Invut-Outrut Element

The input-output element (IOE) (see Fig. 15) was designed with a considerable amount of built-in checking. The comparison register (COP) is used in a multiple checking scheme to check the in-out register (IOR). These two registers are operated in such a way as to give a fairly thorough check of the external film equipment, assuming single faults.

A special predtermined check is used to guard COR against single faults.

Consider Fig. 5. After the word has been transferred from the film to

IOE (the reading process), both IOR and COR should contain the word which has been read. After the recording process, both IOR and COP should contain the complement of the word which has been recorded. In any event, bot! registers should have the same contents when the film equipment provides a completion pulse. The rules will cormand add IOR to COR, which will supply the word in IOP to the trigger inputs of COR. In a manner entirely analogous to the checking of transfers discussed in section 2.135, this should leave COR containing O (with the exception of COR check which contains 1). The completion pulse is delayed 1/2 11896. and used to sense GTO4 of each digit column If one of these should resvide an output it will be applied to CTO7 which will be open (only single fault rossible), so that an in-out alarm will be given. Then, to make cert in that each digit of COR can contain 1 (and that COR check can contain (), this delayed completion rules is further delayed by 0.1 usec. and used to complement all flip-flors. It is delayed another 0.3 used and applied to GTO7 and to all GT's Oh in series. It is applied to CTO7 so that an in-out alarm is obtained if COR check contains 1. Since the remaining FF's should contain 1, the pulse sprided to GTO4 should eventually be applied to GTOE and emerge as a continue operation rules. The continue operation pulse performs a number of functions; the one which is directly concerned with the checking process is setting COR check to 1. If the continue operation pulse does not emerge, COR check will remain in O (if it were in 1, an alarm would have : lready been obtained). Thus, after an additional delay of 1.3 usec., the completion pulse is used to sense CTO6. If CCR cont: ins O, CTO6 will be open and an in-out

alarm is obtained. This came type of predetermined check is also used in checking the check register (check-register check).

Test sequences will be designed for the flip-flore of ICR (making use of the facilities provided by COP) and the flip-flore of in-out control (IOC). There are eight voltage-variation lines surelying the O sides of the flip-flore of IOR (each line supplies two digit columns: n and n+8). Fight rore supply the 1 sides of these flip-flope. These resultines also surely the flip-flore of all the repetitive units (e.g. AR, AC, FC, FR, etc.). One voltage-variation line (181) supplies the O sides of all the flip-flore of IOC (except FF410.06 and FF410.07, which are not standard flip-flope). The 1 sides of these flip-flope are surplied by voltage-variation line 182. The two special flip-flope are surplied by voltage-variation line 182. The two special flip-flope

## 5.121 In-Out Register

lines, it is convenient to treat the entire register as a single section.

Foreover, it may be treated as a channel regment. The test sequence design makes use of CCP to obtain an alarm when a failure exists. This technique requires that at least one of the digit columns of COR contain 1 after the completion pulse corrends add ICR to COR when there is a failure in ICR. An alarm pulse would then be obtained when the delayed completion rules senses CTO<sup>h</sup> of that digit column.

Then a failure occurs in one of the flip-flors, it is not possible to credict the behavior of those flip-flors to its left with any degree of certainty. This is largely because the shifting gate tubes (6706 and 3707) are a-c coupled to their flir-flows. Should a flir-flop remain in one position permanently, both gate tubes would be open. Thus, the next flir-flor to the left will be ruleed on both, the clear and set invats alreet simultinequally. In all probability, this will cause it to trigger, but there definitely is doubt as to its actual behavior. It is also important to note that the command read shift applies a pulse to 0710 on the 0 side of FF410.04. The cutrut of the GT is used to shift COR. If IOR 0 should transmit a pulse on the IOR one out line, FF410.04 would be set to 1 and the contents of COR would no longer be shifted. Thus, a failure of one of the flip-flops of IOR may also affect the shifting of COR. The maximum possible number (and, indeed, the correct number) of times that the contents of COR will be shifted is 16, the number of shift pulses provided by the film unit. It should also be noted that COR will be shifted at least once (a shift pulse is required to produce a rulee on the IOR one out line).

To check IOR, it is only necessary to specify words to be read from the film. These words must be chosen so as to avoid the possibility of coincidental agreement of the contents of IOR and CCR. We will discuss the choice of these words when COR is not shifted the correct number of times separately from the choice when COR is shifted correctly. If CCR is shifted correctly, it will contain the correct word. If one of the flip-flops of IOR always transmits a 1, the word should contain a 0 in that digit in order that COR be provided with 0 in that digit before the correct add IOR to COR. To check every digit column of IOR,

every digit of the word should be 0. Thus, one of the words to be read is +0. If, on the other hand, one of the flip-flops of IOF always transmits a 0, the word should contain a 1 in that digit column. Hence, the word -0 should also be read.

If COR is not shifted the correct mamber of times, COR 15 will be used to obtain the alarm. Note that COR 15 will contain one of the first fifteen digits of the word read, depending on the number of shifts. IOR 15, on the other hand, will always contain the sixteenth digit of the word read (unless it is the faulty digit). Thus, if we read a word inwhich the sixteenth digit is different from the first fifteen, we will get the desired alarm (assuming TOR 15 is not at fault). This gives us the choice of using either +2-15 or -2-15 for this purpose. Now, suprose that TOR 15 always transmits 1. If +2-15 is used, it will provide CCR 15 with 0 and the alarm is obtained. If, on the other hand, for 15 always trans-its 0, -2-15 will provide COR with 1, yielding the alarm. Thus, both words should be used. A little thrught will show that there two words would also be satisfactory when the correct number of shifts is rerformed, so that +0 and -0 are unnecessary. In fact, any two works which are the corriements of each other will provide the desired check when the correct number of shifts is performed.

Thus far, IOF auxiliary has not been considered. It is a rather special case, for, if it has failed, the content of every digit of IOF is indeterminate. However, COR would be shifted the requisite 16 times and would therefore cort in the word being read. There is some possibly lity that IOP would also contain this word for every read operation, but

its probability is quite negligible. For practical purposes, it may be assumed that IOR auxiliary is checked.

## 5.122 In-Out Control Flip-Flops

In-out central is also largely self-checking, regardless of
the words being transferred between storage and the remote equipment. It
is considered as a single section, although it is composed of two sections.
The rajor exception to this statement is provided by a failure in FFMO.06,
which controls the sur ly of restorer pulses to the in-out element. It
is necessary to sto the flow of restorer pulses to IOE during the operation of the remote equipment because it is not synchronized with the rest
of the computer. This is the function of FFMIO.06. If it fails to
stor this flow, it is not possible to predict the behavior of IOE.

In all probability, this situation will be detected, but the uncert in
time relations make it impossible to carry out any analysis. If FMIO.06
never permits the restorer ruless to flow to IOE, a similar difficulty
of analysis obtains. The loss of rectorer pulses has been discussed in
section 3.224.

The analysis of the remaining portion of IOC is quite straightforward. However, it involves a tremendous amount of detail which would
not add very ruch to the understanding of the method being used. It
requires a thorough knowledge of the interaction of the remote equipment
and the in-out element proper. These details have been omitted here.
It turns out that only a very simple sequence is required to check
ICC. This sequences has been combined with that required for checking
ICC and is presented below. One point worth noting is that sufficient

time must be alletted for the completition of an re operation in order to check the ability of FF410.04 to transmit 1. The instructions in A5 and A5 have been used to obtain a delay of approximately 1280 usec. (with Ts) for this purpose.

| Instructions | Constant Storege                     |
|--------------|--------------------------------------|
| Al rf Kl     | B1 + 0                               |
| A2 re B2     | BP - 0                               |
| A3 ca B5     | B3 + 2 <sup>-15</sup>                |
| A4 ts C1     | nu = 2-15                            |
| A5 ac C1     | H5 - 2 <sup>-6</sup>                 |
| A6 on A5     | Veriable Store Pe                    |
| A7 29 mm     | (FF) C1 (counts delay - see A5 and A |
| AS rf KP     | (FT) CP                              |
| A9 rd C?     | (FF) C3 store numbers read           |
| AlO rd 03    | (F <sup>-</sup> ) C5                 |
| All rd C4    | Remote Equipment                     |
| Ale rd c5    | Kl - recorder                        |
| A13 ca C?    |                                      |
| A14 qe B1    | K? - reader - film for use with      |
| A15 ca C3    | this reader should contain           |
| A16 qc R2    | blocks of four words separated       |
| Al7 ca C4    | by a distance which permits the      |
| A15 qc H3    | reader to stop before it reaches     |
| A19 ca C5    | the next block. Each block should    |
| A20 cc H4    | contains +0, -0, +2-15, and          |
| A21 78       | -2-15, in that order.                |
| A22 80 A1    |                                      |

## 5.13 Step Counter

The step counter (SC) is that part of arithmetic control which counts the number of shifts which are performed in the arithmetic processes which require the use of arithmetic control (mr, mh, dv, sl, sr, and sf). In all these operations (with sf nearly always an exception), SC is the device which provides the end-carry pulse which returns control to central control. SC must be reset (or preset) in all operations (except sf) so that the end-carry is obtained after the projet musber of shifts. A block diagram of SC is slown in Fig. 20. Mock schematics, which show the individual stages, are given in Fig. 21 and Fig. 22.

We will design test sequences which may be used for the entire step counter.

There are 11 different voltage-variation lines which supply SC.

They are (stage numbers refer to the block schematics):

| Line No. | Application      | Stares Affected                                                     |
|----------|------------------|---------------------------------------------------------------------|
| 77       | FF screens (O)   | PF 10, 11, 12, 13, 14, 15                                           |
| 78       | FF screens (1)   | FF 10, 11, 12, 13, 14, 15                                           |
| 113      | OT screens       | orol (5 digita), 02 (6 digita)<br>04 (5 digita), 05, 307.06, 307.07 |
| 199      | OT control grids | OTO1 (5 digits), 02 (6 digits) O4 (5 digits), 05, 307.06, 307.07    |
| 721      | GT suppressors   | oros (6 digita), 04 (5 digita) 05, 307.06, 307.07                   |
| 200      | CT suppressors   | GTON (5 digits)                                                     |
|          | FA control       | RA VI (step counter output, Fig. 22)                                |
| 97       | TT Flates        | TT V205, 305, 405, 505, 605, 705                                    |
|          | NA acreens       | RA VIO1, 201, 301, 401, 501, 601<br>(Fig. 22)                       |
|          | ED screens       | HD V102, 202, 302, 402, 502, 602                                    |

| 96  | CC plates and screens | GG V102 (V102 cathode)                        |
|-----|-----------------------|-----------------------------------------------|
| 17  | BA plates and sordens | RA V103, 104, 105, 106, 704, 801<br>804, 805  |
|     | GC plates and screens | GG V101                                       |
|     | GT plates             | GT 04 (6 digita)                              |
|     | BD plates             | PD V102, 202, 302, 102, 502, 602<br>(Fig. 22) |
|     |                       | PA V1 (Fig. 22)                               |
| 201 | MA control            | HA V104, 105, 106, 704, 801, 801, 805         |
|     |                       | RA V101, 201, 301, 401, 501, 601 (Fig. 22)    |
| 141 | RA control            | PA V103                                       |
|     | BD control            | PD V102, 202, 302, 402, 502, 602<br>(F1E. 22) |

It is clear that lines 77 and 78 define the same section. Line 113 defines a different section. It may be shown that the sections defined by all the other lines (with some few exceptions) may be considered as mart of the section defined by line 113. This is clear for lines 199 and 221. In addition to CTO1 (which is surfied by line 113), line 200 supplies buffer amplifier BA VI. However, a failure of BA VI will affect CTO2 (6 digits), so that a test sequence desimed to check GTO2 will discover the presence of this failure. Similarly, for line 97; failure of the trigger tubes (TT) will affect the read-in process and is checked with GTO1, while failures of the buff ramplifiers and bus drivers (BD) are checked with GTO2. The failure of the gate generator (Gr) supplied by line 96 will be checked with GTO1 (see section 6.11). With the exception of BA VIO3, 10h, 105, 106, 70h, 801, 80h, and 805, the stages found on lines 17, 201, and 141 have been discussed. The failures of

BA VIO1, 105, 106, and 704 will be checked with GTO5. The failure of BA VEO1 will affect the restorer-pulse supply and will not be considered (see section 3.224). Thus every stage except BA VEO1 and SO5 will be checked by the sequence designed for line 113. The failure of these buffer amplifiers will affect the information supplied to FF306.01 and FF305.01, and will therefore be checked by Test Sequence Number II. Nevertheless, they will also be considered in connection with lines 77 and 78.

## 5.131 Step Counter Flir-Flore

The rost convenient channel segment uses all of the flip-flors, with FF15 transmitting information to FF14. FF14 transmitting it to FF13, etc. Information is surplied to this channel segment by the command add to SC. The way to check that all flir-flo s can transmit both O and I is to start it from the clear position and have it count until its maximum capacity is exceeded. This can only be accomplished by ordering either al 31 or ar 31. Wither of these overations would leave O in AC, the sign depending on the sign of the number that was in AC before the shift over tion. Since this is a case in witch the check depends on the manipulation of numbers, we must observe the effect on a failure on these overations. If one of the flip-flops cannot transmit 1, the SC end-carry pulse would never be obtained, resulting in a "prolonged stor clock". Tris condition is therefore checked. If one of the flip-flore cannot transmit O, it is as tough it were by-passed. The capacity of the counter is reduced by a factor of ?, so that the endcarry would be obtained after 15 shifts. If BB were clear before the

start of the shift operation and if al 31 were ordered, then AC would contain 0 (with appropriate sign) after the 15 shifts regardless of its original contents. Thus, al 31 does not provide the desired check. However, if AC originally contained 1/2 and ar 31 were used, BRO would contain 1 after 15 shifts. The subsequent round-off would put 2-15 in AC so that this would provide the desired check. The sequence:

ca FC(1/2) sr 31 qc RC(+0)

is sufficient to check all the flig-flops of SC.

sideration. The amplifiers which drive the two reset lines (BA VEOU and EO5) are checked with arithmetic control while the preset line is driven directly by a control-pulse out ut unit and should be considered with that section. Nevertheless, the crystal rectifier diodes which consect these lines to the flir-flors might fail, so that a test sequence should be designed to check the ability of SC to be reset and preset. It was felt that it would be suitable to include this sequence with the other sequence designed for lines 77 and 78, partially because of the shortness of the other sequence and partially because the ability of the flip-flo s to receive information from these sources is affected by voltage variation on this line. It would be well to use this sequence with the other voltage-variation lines affecting the SC resets and preset.

The test sequence is quite easily designed. First the preset was considered. The failure of any digit to preset properly would result in the wrong number of shifts, regardless of the number specified in the

instruction (the single fault assumption is still used.) Since SC is present to Ollill under normal conditions, it would be desirable to have it contain 100000 before the present pulse in order to check all digits at once. However, it is not possible to perform an operation that would leave this quantity in SC. The best we can do is to use two sequences, one testing the ability of VF's 11, 12, 13, 14, and 15 present properly, the other testing this ability of FF10. For the first case, SC is made to contain 000000 before presenting by performing the after operation with 1/2 originally in AC. For the second case, SC is made to contain 100001 before presenting by performing the after operation with +0 originally in AC. The present takes place on the caloperation, but this does not affect validity of the check on FF10. The particular numbers were chosen for economical use of storage. The sequence follows:

ca RO(1/2) ef x er 6 qo RO(2<sup>-7</sup>) ca PO(+0) es x ca PC(1/2) er 14 qc PO(2<sup>-15</sup>)

(x is the address of any convenient register)

To check the multiply and divide resets, it is only necessary to prform a multiplication and a division. The operands were chosen for economy of the required storage, and the tedicus process of determining the results when an incorrect number of shifts is obtained was carried out. It was found that these operands are satisfactory. The sequences are:

### Multiply Reset

ca RC(2-7) mr RC(2-7) qc RC(2-1)

### Divide Reset

ca RC(2<sup>-15</sup>) dv RC(2<sup>-1</sup>) qc RC(+0) el 15 qc RC(2<sup>-14</sup>)

The complete sequence, written in a form suitable for use with test storage, is given in Appendix II as Test Sequence Rusber I. An additional check on the performance of the ef operations is included, thought it is not needed.

## 5.13a Step Counter Gate Tubes

The same general procedure may also be applied to the cate tibes. As discussed in section 3.222, the only checks that are included determine whether the gate tube is carable of passing or rejecting a pilee at the times when it is sensed. This procedure will also check the various amplifiers discussed in section 5.13. There are no particular difficulties involved, but one symple will be given because this is the first time that gate tubes have been discussed.

Consider GTO1, the read-in gate tube. It is sated on TP5 of every operation, because the control switch is always clear at that time. However, the information is only used for the el and ar operations. If we want to check all of these gite tubes at once, we could try to order al 31 or ar 31. It is would always leave 0 (with appropriate sign) in AC. If GTO1 of FF15 failed to wass a rules, the

effect would be to order <u>sl</u> 30 or <u>sr</u> 30, which would also leave 0 (with appropriate sign) in AC. Two steps must therefore be used, <u>sr</u> 16 to check GTOl of FF1 and <u>sr</u> 15 to check all other GT's Ol (<u>sr</u> was chosen in order that the initial contents of AC be significant). Thus, the sequence:

ca RC(1/2) sr 16 qc RC(+0) ca RC(1/4) sr 15 qc RC(+0)

It should be clear that the alarm will be obtained from one or the other of the two halves of the sequence in case a failure exists.

To check that all of these gate tubes do not always transmit a pulse, the obvious expedients of ordering either ar 0 or al 0 will do. Hence:

ca  $RC(2^{-15})$ sl 0 qc  $RC(2^{-15})$ 

Note that the use of <u>sr</u> O with AC containing 2-15 is not satisfactory.

This same technique may be applied to the other gate tubes.

The complete sequence is given in Appendix II as Test Sequence Number III.

A considerable amount of tailoring was required in coding this sequence to fit into test storage. The sequences developed here appear in somewhat altered form in Test Sequence Number III.

### 5.2 Multiple Faults - The Program Counter

A test sequence to check the program counter has been designed in order to illustrate the application of the procedure for multiple faults. A failure in the program counter affects the order

in which the instructions are performed, so that a good deal of attention must be given to this phase of the problem. Actually, there exists an excellent rethod for checking the program counter (as well as TS and TSS) based on a special order which has been temperarily wired into the control matrix. However, this should not adversely affect the value of this test sequence in illustrating the procedure.

A block diagram of PC appears in Fig. 10 and its overation is briefly discussed in section 2.124. The voltage-variation lines involved will not be listed. They are the same lines that supply all repetitive units (e.g., AC, IOR, etc). Only 5 digits of FC (PCL1 to PCL5) will be discussed since operation with TS requires only there 5 digits. We will assume that the only GT failure ressible is the inability to pass a rule when it is sensed. Failures in the read-in gate tubes or the resets will not be considered. With this limitation, there is but one way of supplying information to PC: the command and to FC on TP7 of program timing. Let us assume that FC has been cleared before the commuter is started. It is also assumed that AC has been cleared.

There conditions can be checked by observing the indicator lights before precaing the restart push button.

The computer will necessarily perform the instruction that is stored in register 0. The reason for the assumption that the cate tubes do not pass a sourious rules when sensed should now be clear. If this ressibility were admitted, the first instruction could conceivably be obtained from any of the 32 registers. The design would become unduly complicated. We will now discuss the design in detail.

Consider the first add to PC pulse. It will require PC15 to trigger from 0 to 1, thus providing a means of checking both the flip-flop itself and the trigger tube (not shown in the diagram) which emplifies the pulse. There are two possibilities:

- 1. This rules will succeed in triggering the FF to 1.
- 2. The FF will continue to contain O.

It is not ressible to determine which event has taken place until the sursequent command, PC read out. If the read-out gate tube (GTOP) has failed, the second instruction would be obtained from register O regardless of the behavior of the flir-flor. However, a multiple check on CTO? is provided by CTO3, so that an alarm would be obtained on the transfer check. If both CTO2 and CTO3 should fail similtaneously, the transfer check would indicate agreement, and the instruction contained in register 0 was d indeed be performed twice in succession. The same sequence of instructions would result from case 2. In order to get an alarr in there two cases, it is necessary to store an appropriate instruction in register O. This instruction should yield an alarm when performed twice in succession. Such an instruction is ad RC(1/2). The addition of 1/2 to 1/2 will result in an overflow and give an arithmotic chec: alarm. If no alarm is given, the second instruction will be taken from register 1 and it may be concluded that the fliv-flosis capable of triggering and that both gate tubes can transmit a sensing rulee.

The second add to PC pulse (which will not occur unless the previous functions were performed correctly) will require that three

functions be performed:

- 1. PC15 trigger to 0
- 2. GTO5 of PC15 trans it a sensing rules
- 3. POLU trigger to 1.

There is a slight ressibility that a flip-flep which can trigger to 1 cannot trigger to 0, but consideration of this possibility greatly complicates the design of the test sequence. The consideration of this possibility will be postponed until the end of this section. If CTO5 does not transmit the rules, PC 14 cannot possibly be triggered. Hence, the contents of PC after this second add to PC pulse will a sither 0 or 2. A aim, it is not possible to determine which event occurred until the subsequent command, PC read out. The failure of the read-out gate tube. (and also CTO3) in this digit column will cause the instruction in register 0 to follow the instruction in register 1, regardless of the behavior of the fli-flop. If the flir-flow failed to trigger, this same situation would be obtained. Since register 0 contains ad (RC1/2), and arithmetic check alarm will be obtained if register 1 contains can RC(1/2). If no alarm is given, the next instruction is taken from register 2.

The next add to FC pulse will chance the contents of FC to 3, for it has elreidy been demonstrated that FC15 is capable of triggering, Furthermore, the command FC read out will yield the proper read-out.

Thus, the instructions will be obtained from register 0, 1, 2, and 3 in succession. The instruction contained in register 2 may therefore be fairly arbitrary, subject to the condition that it will not cause an alarm when performed in the proper order.

The fourth add to FC pulse will change the contents of FCl4 and PCl5 to 0 and attempt to change the content of PCl3 to 1. If it fails, the next instruction is taken from register 0, as would be true if the read-out gate tube of PCl3 fails. To obtain an alarm when this failure occurs, it is necessary to use an instruction in register 3 which leaves leaves 1/2 in AC. A suitable sequence thus far is:

0 ad RC(1/2) 1 ca RC(1/2) 2 qc PC(1/2) 3 oc RC(1/2)

If this failure does of exist, the next instruction is correctly taken from register 4.

Continuing to reason in the same way, it is found that the instructions in registers 4, 5, 6, 7 will necessarily be performed in the proper order. Register 7 should cont in an instruction which will leave 1/2 in AC so that an alarm will be obtained if the next instruction is taken from register 0 instead of register 5. Similarly, register 15 should contain an instruction which leave 1/2 in AC. The contents of other registers may be chosen fore or less arbitrarily, subject to the condition that they do not cause an alarm when performed in the normal order.

Now let us discuss the modifications which are necessary if the possibility of a FF failing to trigger from 1 to 0 is considered. We return to the second add to PC rulse. If this failure exists, PC will contain either 1 or 3 after this rulse. The subsequent FC read out corrand will cause the next instruction to be taken from register 1 in the first case and from either register 1 or 3 in the second case, depending

on whether GTO2 and GTO3 have failed or not. If it is taken from register 1, the instructions in registem 0, 1, and 1 will be performed in that order. Register 1 should contain an instruction which yields an alarm when performed twice after the instruction in register 0, but not when it is performed once after the instruction in register 0. Furthermore, it should be such that an alarm is obtained when the instructions are referred in the order 0, 1, 0. The instruction and RC(1/4) satisfies these requirements. The other possibility is that the next instruction is taken from register 3; the instructions are the reformed in the order 0, 1, 3. Suprose that register 3 also contains and RC(1/4). With this order of instructions, an alarm is obtained. If none of these faults is present, then the instructions will be performed in the correct order, i.e., 0, 1, 2, 3. In order to avoid an alarm in correct operation, register 2 should contain su RC(2), where z is greater than 0.

Dy continuing this reasoning to the other digits of PC, it is possible to obtain a correlate sequence on the basis of the assurptions. The details become highly involved, though, and will not be given here. The process of determining appropriate operands (e.g. the quantity z mentioned above) to be used with the various instructions eventually requires the solution of simultaneous inequalities. A set of solutions, using numbers which are expressed simply in binary notation, was obtained by trial and error. The sequence which is based on this set of solutions is given in Appendix-II as Test Sequence Number IV. Again, only the contents of certain registers need be specified to obtain this check.

The others may be used for any purposes desired. The choice of the registers to hold the operands was made quite arbitrarily. The two sp instructions were used to provide continuity in the coded program. Their use requires that the PC read-in gate tubes be working properly.

## 5.3 Trouble Location - Control-Pulce Cutrut Units

The test sequences designed in this thesis may be used in trouble-location procedures. Then an alarm is obtained during the performance of a test sequence, a failure is known to exist. The type of slarm indication, together with the contents of the various registers, should provide information which simplifies the task of locating the trouble. In addition, if this alarm indication is not obtained with the normal supply voltages, but only under marginal checking conditions, it is reasonable to assume that the defective component is supplied by the particular voltage-variation line involved. If the margin thus obtained is abnormally low, the defective component should be found and regained (or replaced).

There are two closely related arroaches to the use of information provided by test sequences in trouble-location procedures. One
is to corpore this data with a list of the symptoms manifested by the
various possible failures. Such a tabulation may be obtained by determining the concuted results in the presence of each possible failure.
If the information obtained from the performance of a test sequence
matches one of the tabulated sets of symptoms, the fault is fairly well
located. However, it is quite possible that the alarm was caused by a

combination of failures not considered in the compilation. This leads us to the second approach to trouble location, deduction based on the evailable data. Deduction may be used to supriseent a list of symptoms, or it may be used as a completely independent method of trouble location. It requires highly skilled personnel and, on the average, is more time consuming than the use of a list. This the tabulation of symptoms is a tedious and unattractive task, it makes it possible for relatively unskilled personnel to handle routine problems in trouble location. Highly skilled personnel may be called in when the list proves inadequate.

The use of both of these approaches will be illustrated in connection with Test Sequence Fumber V (see Appendix II), which has been designed to check the control-pulse output units (see Fig. 18) under the single fault assumption. The design details have been omitted, but they are quite straightforward. Test Sequence Number V does not apply to all these units because the 32 registers of test storage are in-adequate for all the short sequences which are required.

will not pass a pulse when sensed. The pulse which it normally provides is used to read out the contents of the step counter on TP7 of the sf operation (see Fig. 16). This failure would thus result in 0 being stored as the supposed scale factor. The only of operation is stored in register 12. The scale factor which is stored as a result of this operation is checked by the oc instruction in register 14. The check is not terformed until  $TE\frac{1}{2}$ , so that PC will contain 15 at that time.

Digit column 15, 14, and 12 of the check register will have received 1 from register 3 but not from AC. The check register will be complemented as part of check register check, but will not be recomplemented because the series path of gate tubes will not transmit a pulse. Thus, CR will contain 0 in digit columns 15, 14 and 12 and 1 in all others. An alarm pulse will be obtained which will go to the alarm indicator and CPC, but it may be that TPI will have occurred before the alarm rules arrived at CPC. Thus TPD may contain either 2 or 1. AC will contain +0 as will register 8. The contents of other registers are not immediately relevant.

during voltage variation. Using the second approach, we can deduce that the failure is caused by a control-pulse output unit. Further, it is apparent that AC and register & contain +0 when they should contain 11 x 2. This means that register & did not receive the scale factor on the instruction of &. This could be due to the failure of SC to read out or of register & to read in (or both). Either unit number 1 failed to provide the command SC read out or unit number 67 failed to provide the command scale out or unit number 67 is also used on the ta and td operations, and, as it happens, there is an instruction using ta stored in register 11. If 67 were the defective unit, register 9 would contain +0. If 1 were the defective unit, register 9 would contain +0. If 1 were the defective unit, register 9 would contain +0. This reasoning assumes only single faults.

ably "ore complicated. If the multiple fault results in symptoms which can be explained by a single fault (e.g., the failure of units 1 and 10 to provide a pulse would have the same effect as the failure of unit 67 to provide a pulse), the single fault should be investigated, either by reflecement of the supposedly defective unit or be observation of appropriate waveforms. If the unit appears to be operating estimated which explain the symptoms. This is also the procedure to be followed when a multiple fault yields symptoms which cannot be explained by a single fault.

Further exemples of the use of test sequences in trouble location are given in sections 6.11 and 6.12.

## 6 - BESULTS AND CONCLUSIONS

## 6.1 Experimental Results

Now that the proposed method of designing test sequences has been discussed in some detail, we may well question their usefulness. It is quite conceivable that the assumptions involved in their design severely limit their applicability in all practical cituations. The final answer can only be obtained from the actual use of the test sequences. In order to get some indication of their value, some time was scent in performing them on WMI. Extensive tests could not be carried out because only a limited time was available. The results are therefore only suggestive, not conclusive.

The most logical way of determing the effectiveness of a test sequence is to cause a failure in one of the stages it is designed to check. The voltage-variation facilities provide a convenient means for causing such a failure. Measurements were rade of some of the margina evailable with some sample problems. Several problems used for testing and demonstration purposes were included to provide a basis for a comparison. Test Program Humber I causes each flip-flop (except some in control) to receive both 0 and 1. Test Program Bumber II performs each of the operations which have been accepted for permanent use in WWI. Display Program Number I consists of two separate problems: one displays the curves y = x,  $y = x^2$ , and  $y = x^3$  on an oscilloscope, the other displays a family of parabolas. The measurements have been tabulated in Fig. 23.

The rargins obtained with Test Program Numbers I and II did not differ materially from those obtained with the test sequences.

This indicates that the test programs are about as effective as the test sequences for the types of failures that occurred. On the other hand, the margins obtained with Distlay Frogram Number I differed significantly from the rargine obtained with the test sequences. In general, Marlay Program Rumber I had larger rarging than the test sequences, indicating that the test sequences are more effective. However, the negotive cargin on line 79 with Display Program Fumber I was about half the margin with Test Sequence Number II. The low margin was traced to a defective flip-flop in the divide-pulse distributor (see section 6.12), but Display Program Number I does not use the divide operation at all! This points up the need for further study of the factors which cause failures, warticularly with regard to the influence of voltage variation. The most plausible errlanation for the difference in the values of the margin is that a dynamic failure existed which was accentuated by the frequency with which the multiply operation is performed in Display Program Fumber I. Perhaps the fact that the rangin was reasured by noting the voltage at which the rattern on the display oscilloscope became distorted (no alarm was obtained) had some influence on this difference. In other words, mirtakes in corrutation might appear before an alarm is obtained. Further investigation to obtain a more satisfactory extlan tion could not be carried out.

A clear-cut example of the inadequacy of Display Program

Number I for checking purposes (of course, it was not intended as a check

rroblem) is provided by the negative rargin on line 113. The margin

obtained with Display Program Number I was about twice that obtained

read-in gate generator in the oter counter (see section 6.11). Display Program Number I does not use the shift overation and hence is not all fected by this failure.

Of some interest is the difference between negative and positive rargins on line 113. It indicates that the failure of a gate tube to pass a pulse is far more likely than the generation of a spurious pulse. Also of interest are the differences between positive and negative margins on lines 79 and 60. It has been more or less standard practice to use the positive margins as an indication of the condition of a flip-flop when the complementing technique (see section 3.3) is employed. The nositive margins on these lines were nore than adequate, but some of the negative a gins were quite low. This suggests that negative margins right well be taken into account in the complementing procedures.

The most significant result of the use of the test sequences is their architection to the location of the causes of the low margins. This is described in detail below.

# 6.11 Discussion of Low Fargin on Line 113

In measuring the mergin on line 113, with Test Sequence Number III, it was noted that the alarm was consitently obtained on the gg instruction contained in register 10. The check register showed that digit column 1 was at fault. AC contained +1/2 instead of +0, indicating that no shift was renformed. Since other shift operations were renformed successfully, the fault was probably due to the failure of the read in

gate tube of digit column 11 to tass a pules. An escilloscope probe placed on the suppressor of this gate tube showed the gating signal had an unusually low amplitude. Furthermore, the amplitude of this gate varied with the voit; re applied to line 113. This was a rather surprising result because the gate generator was not suprlied by line 113. However, further rearch showed that a wiring mistake had been made. The screen of GC VIO2 (see Fig. 21), with should have been supplied by line 96, was actually supplied by line 113. This substantiates the reasoning of section 5.13 in which it is shown that a failure on line 96 will be detected by this requence. It is interecting to note that this mistake did not affect over tion at normal voltage, even though the screen was supplied with +00 volts instead of +120. The entire investigation, from the time the results were analyzed to the time the correction in wiring was made, took approximately 15 minutes. This is less than the arbitrary standard of 1 hour discussed in section 1.? and is considered fairly noteworthy.

# 6.12 Discussion of Low Fargin on Line 79

In measuring the margin on line 79 with Test Sequence Furber II, it was noted that the alarm was consistently obtained on the ge instruction contained in register 29. The check register indicated that the discrepancy was in digit column 1. This was confirmed by AC which contained 0 instead of 41/2. This could most easily be accounted for by a failure to shift left. However, since only the flip-flops of arithmetic control were undergoing voltage variation, the failure to shift would mean that FF307.01 cannot transmit 1. This would result in

a "prolonged stop clock", not a check register alarm. To make this hypothesis even more untenable, there was another al instruction in register 15 which arearently gave no trouble. The only other conclusion was that there was something wrong with the divide operation. It could not be the failure of FF306.01 to transmit 1, for that would result in a "prolonged stop clock". The only other possible source of error was the divide-pulse distributor, but the results obtained were not any of those which had been calculated for steady-state failures of the divide-pulse distributor. The complementing check was the performed on these flip-flops and FF308.04 was found to have a low margin. The tubes were replaced and the rargin (with Test Sequence Panter II) was increased from -17 wolts to -31 volts. Furthermore, the alaxa was obtained at a different point in the secuence. The only possible exclanation of the imponsistent results is that the fault was not steady-state, even at the reduced voltage. The time required to correct this situation was about 75 minutes. Some time could undoubtedly have been saved if the complementing procedure had been used immediately.

## 6.2 Evaluation of the Design Procedure

The previous sections of this chapter have discussed the results obtained from the use of some of the test sequences. Even though these investigations were very limited in scope, some very useful information was obtained, particularly regarding trouble location. The test sequences are a rently do accomplish their avowed aims, and indeed, even exceed expectations. The location of a non-steady-state failure in the divide-pulse distributor, despite the fact that an alternate method

was used for confirmation, was a notable achievement. A complete set of test sequences a ould be a powerful tool in achieving computer reliability. Can the design procedure cutline in this thesis be used to obtain such a set! In theory, at least, the answer is accurately yea. The limitations seem to be of a practical nature and concern the amount of work required for some of the more complicated channel segments. The effort required in some of these cases may not justify the results. It would probably be more efficient to devise alternate schemes. A certain amount of study should precede the design of a test sequence in order to determine whether the effort might better be expended in other directions.

The efficiency problem is most likely to be serious for those channel regments which control the manipulation of information in other verts of the concuter. The time-pulse distributor is verhans the greatest offender in this repact, since it controls most of the operation of the computer. A good deal of analysis is required to determine whather an alarm is actually obtained in the presence of a failure. An indication of the rarnitude of this task may be had from the fact that about 25 hours were required to determine whether an alarm would be obtained with program timing only (assuming all registers clear and operation 0 giving program timing) for an open circuit in each of the crystals of the ratrix. Incidentally, it is interesting to note that an alarm would be obtained for only 6 of the 24 crystals. In cases of this nature, 1t is probably most efficient to circumvent the detailed analysis by the installation of a built-in checking device (such a device has been proposed for the time-pulse distributor). However, it seems likely that the procedure is not unduly corrlicated in most cases. Of course, the

complexity increases rapidly when multiple faults are considered.

Test sequences designed for complicated channel segments are of dubious value in trouble-location procedures. In many cases the test sequence will only specify the channel assment in which the failure exists. The knowledge that a failure exists somewhere in a large number of stages is not particularly helpful. Such information could probably be obtained much more simply. The effort of designing a test sequence is hardly justifiable.

## 6.3 Suggestions for Further Work

The possibilities of test checking have not been fully explored in this thesis by any stretch of the imagnination. A much simpler approach to the problem might very well exist. If such an approach could be found, it would certainly be a great boon to the people who are conceremed with designing test problems.

Teg rdless of whether a new approach can be found, much work remains to be done. Quantitative investigations of the factors influencing failures are extremely important. The design of test sequences should take into account as many of these factors as possible. Of course, there are the tasks of actually designing test sequences for the entire computer and analysing the results for trouble location purposes.

In conclusion, it should be emphasized that the method presented in this thesis can undoubtedly be used for designing test sequences. The results obtained are encouraging and serve as a starting point for future study which should be directed towards simpler and more efficient methods.

Signed Mudd Cooper Approved W. W. Forrester

AFPENDIX I

A Short Guide to Coding.

## A SHORT GUIDE TO CODING

Using the Whirlwind I Code of October 1949

Electronic Computer Division Servomechanisms Laboratory Massachusetts Institute of Technology Cambridge, Massachusetts

#### A SHORT GUIDE TO CODING

(using the Whirlwind I code of October 1949)

#### COMPUTER PROGRAMS

<u>Program</u>. A program is a sequence of actions by which a computer handles a problem. The process of determining the sequence of actions is known as programming.

<u>Flow diagrams</u>. A flow diagram is a series of statements of what the computer has to do at various stages in a program. Lines of flow indicate how the computer passes from one stage of the program to another.

Coded program. Programs and flow diagrams are largely independent of computer characteristics, but instructions for a computer must be expressed in terms of a code. A set of instructions that will enable a computer to execute a program is called a coded program, and the process of preparing a coded program is known as coding.

Orders and operations. Individual coded instructions are known as orders and call for specific operations such as multiply, add, shift, etc.

The computer code. The computer code described here is that of Whirlwind I, an experimental computer using binary digits, single-address order code, parallel operation, and electrostatic storage. It is expected that computers of this type will ultimately achieve an average speed of 50,000 operations per second.

#### COMPUTER COMPONENTS

Registers and words. A register has 16 digit positions each able to store a one or a zero. A word is a set of 16 digits that may be stored in a register. A word can represent an order or a number.

Arithmetic element. Arithmetic operations take place in the arithmetic element, whose main components are three flip-flop registers, the A-register, the accumulator, and the B-register (AR, AC, BR). The 16 digit positions of AR starting from the left are denoted by AR 0, AR 1, . . ., AR 15. Similarly for AC, BR. Words enter AC through AR; BR is an extension of AC.

Storage. The term "register" by itself refers to the main electrostatic storage, which consists of  $2^{11}$  or 2048 registers, each of which is identified by an address. These addresses are 11-digit binary numbers from 0 to 2047. The computer identifies a register by its address.

Input-output. All information entering or leaving the computer is temporarily stored in the input-output register (IOR). The computer regulates the flow of information between the internal storage and IOR, and also calls for any necessary manipulation of external units. The descriptive names of the input-output orders were chosen for photographic film reader-recorder units, but the orders are applicable to other types of external equipment.

Control element. The control element controls the sequence of computer operations and their execution. Instructions are obtained from storage in the form of individual orders, each of which is represented by a single word.

Inter-connections. The four main elements (storage, control, arithmetic, and input-output) are connected by a parallel communications system. known as the bus.

### REPRESENTATION OF ORDERS

Operation section. When a word is used to represent an order the first (left-hand) 5 digits, or operation section, specify a particular operation in accordance with the order code.

Address section. The remaining 11 digits, or address section, are interpreted as a number with the binary point at the right-hand end. In the majority of orders this number is the address of the register whose contents will be used in the operation. In orders sl, sr, the number specifies the extent of a shift; in rf, rb, the number specifies an external unit; in ri, rs, the address section is not used.

<u>Example</u>. The order cax has the effect of clearing AC (making all the digits zero) and then putting into AC the word that is in the register whose address is x. If q is a quantity in some register, the order needed to put q in AC is not ca q but cax, where x is the address of the register that contains q.

#### REPRESENTATION OF NUMBERS

Single-word representations. When a word is used to represent a number the first digit indicates the sign and the remaining 15 are numerical digits. For a positive number the sign digit is zero, and the 15 numerical digits with a binary point at their left specify the magnitude of the number. The negative -y of a positive number y is represented by complementing all the digits, including the sign digit, that would represent y. (The complement is formed by replacing every zero by a one and every one by a zero.) In this way a word can represent any multiple of  $2^{-15}$  from  $2^{-15}$  - 1 to  $1-2^{-15}$ . Neither +1 nor -1 can be represented by a single word. Zero has two representations, either 16 zeros or 16 ones, which are called +0 and -0 respectively.

Overflow — increase of range and accuracy. With single-word representation the range is limited to numbers between  $2^{-15}$  – 1 and  $1 - 2^{-15}$ . Programs must be so planned that arithmetic operations will not cause an overflow beyond this range. The range may be extended by using a scale factor, which must be separately stored. Accuracy can be increased by using two words to represent a 30-digit number.

#### COMPUTER PROCEDURE

Sequence of operations. After the execution of an order the program counter in the control element holds the address of the register from which the next order is to be taken. Control calls for this order and carries out the specified operation. If the order is not sp or cp(-) the address in the program counter then increases by one so that the next order is taken from the next consecutive register. The sp and cp(-) orders permit a change in this sequential procedure.

Transfers. A transfer of a digit from one digit position to another affects only the latter digit position, whose previous content is lost.

Negative zero. The subtraction of equal numbers produces a negative zero in AC, except when AC contains +0, and -0 is subtracted from it.

Procedure in the arithmetic element. The execution of an addition includes the process of adding in carries; this process treats all 16 digits as if they were numerical digits, a carry from AC 0 being added into AC 15. A subtraction is executed by adding the complement. Multiplication, division, shifting and round-off are all executed with positive numbers, complementing being performed before and after the process when necessary. For round-off the digit in BR 0 is added into AC 15.

### NOTATION FOR CODING

Addresses. A coded program requires certain registers to be used for specified purposes. The addresses of these registers must be chosen before the program can be put into a computer, but for study purposes this final choice is unnecessary, and the addresses can be indicated by a system of symbols or index numbers.

Writing a coded program. Registers from which control obtains orders may be called action registers, and should be listed separately from registers containing other information, which may be called data registers. A coded program is written out in two columns; the first contains the index number of each action or data register, and the second column indicates the word that is initially stored in that register. In many cases part or all of a word may be immaterial because the contents of the register in question will be changed during the course of the program. This state of affairs is indicated by two dashes, for example, ca---

The abbreviations RC, CR. Abbreviations used in referring to the register that contains a certain word or to the word in a certain register

RC . . . = (Address of) Register Containing . . . CR . . . = Contents of Register (whose address is) . . .

The symbol ri x. When an address forms part of an order it is represented by the last  $\overline{11}$  digits of a word whose first 5 digits specify an operation. An address x that is not part of an order is represented by the last 11 digits of a word whose first 5 digits are zero, which is equivalent to specifying the operation ri. Thus the word for an unattached address x may be written ri x. It could also be written  $x \times 2^{-15}$ .

#### THE ORDER CODE

AC = Accumulator

AR = A-Register

BR = B-Register

x is the address of a storage register; n is a positive integer; k designates an external unit

|        | Opera                   | tion    |        |                                                                                                                                                                                                                                                                                                                         |
|--------|-------------------------|---------|--------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Order  | Name                    | Co      |        | Function                                                                                                                                                                                                                                                                                                                |
|        | Name                    | Decimal | Binary |                                                                                                                                                                                                                                                                                                                         |
| ri     | read initially          | 0       | 00000  | Take words from external unit until internal storage is full.                                                                                                                                                                                                                                                           |
| rs     | remote unit stop        | 1       | 00001  | Stop external unit.                                                                                                                                                                                                                                                                                                     |
| rf k   | run forward             | 2       | 00010  | Prepare to use external unit k in forward direction.                                                                                                                                                                                                                                                                    |
| rb k   | run backward            | 3       | 00011  | Prepare to use external unit k in backward direction.                                                                                                                                                                                                                                                                   |
| rd x   | read                    | 4       | 00100  | Transfer to register x a word supplied by external unit.                                                                                                                                                                                                                                                                |
| rc x   | record                  | 5       | 00101  | Arrange for transfer of contents of register x to external unit.                                                                                                                                                                                                                                                        |
| ts x   | transfer to storage     | 8       | 01000  | Transfer contents of AC to register x.                                                                                                                                                                                                                                                                                  |
| td x   | transfer digits         | 9       | 01001  | Transfer last 11 digits from AC to last 11 digit positions of register x.                                                                                                                                                                                                                                               |
| ta x   | transfer address        | 10      | 01010  | Transfer last 11 digits from AR to last 11 digit positions of register x.                                                                                                                                                                                                                                               |
| cp(-)x | conditional program     | 14      | 01110  | If number in AC is negative, proceed as in sp; if number is positive disregard the cp(-order, but clear the AR.                                                                                                                                                                                                         |
| sp x   | subprogram              | 15      | 01111  | Take next order from register x. If the sp order was at address y, store $y+1$ in last 1 digit positions of AR.                                                                                                                                                                                                         |
| ca x   | clear and add           | 16      | 10000  | Clear AC and BR, then put contents of register ${\bf x}$ into AC. If necessary, add in carry from previous sa addition.                                                                                                                                                                                                 |
| cs x   | clear and subtract      | 17      | 10001  | Clear AC and BR, then put complement of contents of register ${\bf x}$ into AC. If necessary add in carry from previous sa addition.                                                                                                                                                                                    |
| ad x   | add                     | 18      | 10010  | Add contents of register x to contents of AC, storing result in AC.                                                                                                                                                                                                                                                     |
| su x   | subtract                | 19      | 10011  | Subtract contents of register x from contents of AC, storing result in AC.                                                                                                                                                                                                                                              |
| cm x   | clear and add magnitude | 20      | 10100  | Clear AC and BR, then put positive magnitude of contents of register ${\bf x}$ into AC. If necessary add in carry from previous sa addition.                                                                                                                                                                            |
| sa x   | special add             | 21      | 10101  | Add contents of register x to contents of AC, storing result in AC and retaining any over<br>flow for next ca, cs, or cm order. Only orders 1 through 15 may be used between the s<br>order and ca, cs, or cm orders for which the sa is a preparation.                                                                 |
| ao x   | add one                 | 22      | 10110  | Add the number $1 \times 2^{-15}$ to the contents of register x. Store result in AC and in register x                                                                                                                                                                                                                   |
| mr x   | multiply and round off  | 24      | 11000  | Multiply contents of register x by contents of AC; round off result to 15 numerical digit and store in AC. Clear BR.                                                                                                                                                                                                    |
| mh x   | multiply and hold       | 25      | 11001  | Multiply contents of register x by contents of AC and retain the full product in AC and the first 15 digit positions of BR, the last digit position of BR being cleared.                                                                                                                                                |
| dv x   | divide                  | 26      | 11010  | Divide contents of AC by contents of register x, leaving 16 numerical digits of the quotien in BR and $\pm 0$ in AC according to sign of the quotient. (The order sl 15 following the dv order will round off the quotient to 15 numerical digits and store it in AC.)                                                  |
| sl n   | shift left              | 27      | 11011  | Multiply the number represented by the contents of AC and BR by 2 <sup>n</sup> . Round off the result to 15 numerical digits and store it in AC. Disregard overflow caused by the multiplication, but not that caused by round-off. Clear BR.                                                                           |
| sr n   | shift right             | 28      | 11100  | Multiply the number represented by the contents of AC and BR by $2^{-n}$ . Round off the resulto 15 numerical digits and store it in AC. Clear BR.                                                                                                                                                                      |
| sf x   | scale factor            | 29      | 11101  | Multiply the number represented by the contents of AC and BR by 2 sufficiently often to make the positive magnitude of the product equal to or greater than $1/2$ . Leave the final product in AC and BR. Store the number of multiplications as last 11 digits of register $x$ , the first 5 digits being undisturbed. |

### NOTES ON THE ORDER CODE

Effect of operations. The functions of the various orders are described above. It is to be assumed that AR, AC, BR, and the register whose address is x are undisturbed unless the contrary is stated.

AR. AR is primarily a buffer register for passing words into AC. After orders cax, csx, adx, sux, sax, and aox it contains the number originally contained in register x. After orders cmx, mrx, mhx, and dvx it contains the magnitude of the contents of x. The effect of spx and cp(-)x is stated above. No other order changes the contents of AR.

BR. A number stored in BR always appears as a positive magnitude, the sign of the number being assumed to be that indicated by the sign digit in AC. This convention has no effect on the logical result of the operations involving BR except that when BR contains a number that will be used later it is necessary to retain the appropriate sign digit.

Alarms. If the result of an arithmetic operation exceeds the register

capacity (i.e., if overflow occurs), a suitable alarm is given except as mentioned in connection with orders sax and sln.

Shift orders. A multiplication overflow in sl is lost without giving an alarm, but an overflow from round-off gives an alarm. Orders sr 0 and sl 0 only cause round-off, an alarm being given if an overflow occurs. The integer n is treated modulo 32, i.e., sl 32 = sl 0, sl 33 = sl 1, etc.

Scale factors. If all the digits in BR are zero and AC contains  $\pm 0$ , the order sfx leaves AC and BR undisturbed and stores the number 33 in the last 11 digit positions of register x.

<u>Division</u>. Let u and v be the numbers in AC and register x when the order  $dv \times is$  used. If |u| < |v| the correct quotient is obtained and no overflow can arise. If |u| > |v| overflow occurs and gives an alarm. If  $u = v \neq 0$  the dv order leaves 16 ones in BR and round-off in a subsequent sl 15 would cause overflow and give an alarm. If u = v = 0 a zero quotient is obtained.

# AFFENDIX II - Datails of Coded Programs

# THET SEQUENCE NUMPER I

(To be used with voltage-variation lines 77 and 78)

| REP1ster    | Contents                | Order & |
|-------------|-------------------------|---------|
| 0           | 10                      |         |
| 1           | 2-7                     |         |
| ?           | 2-14                    |         |
| 3           | 33 x p-15               |         |
| (FF)4       | (receives scale factor) |         |
| 5           | 1/2                     |         |
| 6           | 2-15                    |         |
| 7           | ca 5                    | 1 24    |
| g           | er 31                   | 2 25    |
| 9           | q <b>c</b> 0            | 3 etc.  |
| 10          | 32 4                    | Ţt.     |
| 11          | ca 5                    | 5       |
| 12          | er 13                   | 6       |
| 13          | de S                    | 7       |
| <b>3</b> /t | ce 4                    | 8       |
| 15          | qe 3                    | 9       |
| 16          | ca 5                    | 10      |
| 17          | eg 4                    | 11      |
| 18          | er 6                    | 12      |
| 19          | qc 1                    | 13      |
| 20          | mr 1                    | 14      |
| 21          | qc 2                    | 15      |
| 25          | ca 4                    | 16      |
| 23          | qe O                    | 17      |
| 5/1         | ca 6                    | 18      |
| 26          | dv 5                    | 19      |
| 26          | qe O                    | 20      |
| 27          | el 15                   | 21      |
| 28          | de 5                    | 25      |
| 29          | ep 7                    | 23      |

PC reset to 7: PC end-carry may, but need not, reset PC.

THET SEQUENCE NUMBER IT

(To be used with voltage-variation lines 79 and 80)

| Register + | Con   | tents            | Order  |
|------------|-------|------------------|--------|
| 0          | 1     | 1/2              |        |
| 1          |       | 1/4              |        |
| 2          |       | -1/2             |        |
| 3          |       | -1/4             |        |
| (FF)4      | (rece | eives scale fact | or)    |
| 5          |       | 1/4 + 2-15       |        |
| 6          |       | +0               |        |
| 7          |       | -0               |        |
| 8          |       | ca O             | 1 26   |
| 9          | 1     | mr O             | 2 27   |
| 10         |       | qe 1             | 3 etc. |
| 11         |       | of 4             | 14     |
| 12         |       | g <b>c</b> 0     | 5      |
| 13         |       | sr 1             | 6      |
| J)t        |       | qc 1             | 7      |
| 15         |       | al 1             | g      |
| 16         | 4     | η <b>c</b> 0     | 10     |
| 17         |       | ea O             | 11     |
| 18         |       | qc 6             | 15     |
| 19         |       | ca 1             | 13     |
| 50         |       | ne 5             | 14     |
| 21         |       | ca ?             | 15     |
| 55         |       | 5 as             | 16     |
| 23         |       | ce 7             | 17     |
| 5/1        |       | ca 5             | 18     |
| 25         | (     | ee 1             | 19     |
| 56         |       | dv O             | 20     |
| 27         |       | ne 6             | 21     |
| 28         |       | el 15            | 55     |
| 29         |       | qc O             | 23     |
| 30         | ī     | mr ?             | 5/1    |
| 31         | (     | no 3             | 25     |

(For use with voltage-variation lines 113, 199, 221, 200, 97, 96 17, 201, and 141)

| Part Car I | Contents                | der  | S. Advances |
|------------|-------------------------|------|-------------|
| 0          | +0                      |      |             |
| 1          | 14 x 2-15               |      |             |
| (FT)2      | ts 0 (=1/2) (chenged )  | by o | rder #8)    |
| 3          | 33 x 2 <sup>-15</sup>   |      |             |
| (F=)4      | (receives scale factor) |      |             |
| 5          | 1/2                     |      |             |
| 6          | 2-9                     |      |             |
| 7          | .2-15                   |      |             |
| 8          | ce 5                    | 1    | 25          |
| 9          | er 16                   | 2    | 26          |
| 10         | ne O                    | 3    | etc.        |
| 11         | sf h                    | 11   |             |
| 12         | ad 4                    | 5    |             |
| 13         | ne 3                    | 6    |             |
| 14         | er 1                    | 7    |             |
| 15         | td ?                    | 8    |             |
| 16         | cs 6                    | 9    |             |
| 17         | rr, 6                   | 10   |             |
| 18         | e 2 lt                  | 11   |             |
| 19         | e.d 4                   | 15   |             |
| ന          | de S                    | 13   |             |
| 27         | ca 7                    | 24   |             |
| 22         | el 1 <sup>1</sup> 4     | 15   |             |
| 23         | qc 5                    | 16   |             |
| əft        | er 15                   | 17   |             |
| 25         | qe 7                    | 18   |             |
| 26         | 67 0                    | 19   |             |
| 27         | ae 7                    | 50   |             |
| 26         | af 4                    | 21   |             |
| 29         | qc 5                    | 55   |             |
| 30         | ce li                   | 23   |             |
| 31         | qc 1                    | 24   |             |

PC reset to 8; PC end-carry must reset FC and Register #2.

# TEST SEQUENCE NUMBER IV

# (For checking the program country)

| ROFISTON F | Contents                    | Order # (in correct operation |
|------------|-----------------------------|-------------------------------|
| 0          | ad 20                       | 1: 15                         |
| 1          | ad 21                       | 2 16                          |
| 2          | m2 27                       | 3 etc.                        |
| 3          | ad 21                       | 4                             |
| 4          | ad 28                       | 5                             |
| 5          | eu 20                       | 6                             |
| 6          | ad 21                       | 7                             |
| 7          | ad 21                       | 8                             |
| 8          | no 29                       | 9                             |
| 9          | sp 15                       | 10                            |
| 10         | ФФФФ                        |                               |
| 11         | clinery gro-db              |                               |
| 12         | nd 21                       |                               |
| 13         | बार क्षित्र वात्र क्ष्म्य   |                               |
| 14         | 60-40 40 th                 |                               |
| 15         | ca 20                       | 11                            |
| 16         | na 20                       | 12                            |
| 17         | sp 31:                      | 13                            |
| 18         | 00-00                       |                               |
| 19         | +0                          |                               |
| 20         | +1/2                        |                               |
| 21         | +1/4                        |                               |
| 25         | COS ADD ADD ADD             |                               |
| 23         | co es es                    |                               |
| 54         | ad 20                       |                               |
| 25         | spinor do do                |                               |
| 56         | काल स्थाप स्थाप स्थाप स्थाप |                               |
| 27         | +1/8                        |                               |
| 58         | +1/16                       |                               |
| 29         | +15/16                      |                               |
| 30         | 60 63 63 69                 |                               |
| 31         | ca 19                       | I,11                          |

## TEST SE VENCE NUMBER V

(To be used with voltage-variation lines 1h, 110, 190, 191, and 231.)

| Register # | Content                  | Order # (in normal | opera- |
|------------|--------------------------|--------------------|--------|
| 0          | dv 1                     |                    | orous) |
| 1          | +0                       |                    |        |
| 2          | _2-15                    |                    |        |
| 3          | +11x2 <sup>-15</sup>     |                    |        |
| 4          | +31x2 <sup>-15</sup>     |                    |        |
| 5          | -1/4                     |                    |        |
| 6          | +1/2                     |                    |        |
| 7          | -1/2                     |                    |        |
| (FT) E.    | (reserves scale factor)  | (reset to +0)      |        |
| (FF)9      | (receives address on ta) | (reset to +0)      |        |
| (F )10     | (is added to on ac)      | (reset impaterial) |        |
| 11         | ta o                     | 3 22               |        |
| 15         | nf 6                     | 4 23               |        |
| 13         | ca f                     | 5 etc.             |        |
| 14         | q <b>e</b> 3             | 6                  |        |
| 15         | sl 1 <sup>14</sup>       | 7                  |        |
| 16         | ac 6                     | 8                  |        |
| 17         | mr 7                     | 9                  |        |
| 18         | ac 5                     | 10                 |        |
| 19         | cp 21                    | 11                 |        |
| 50         | ċ▼ 1                     |                    |        |
| 21         | er 14                    | 12                 |        |
| 55         | qe 2                     | 13                 |        |
| 23         | ts 10                    | 14                 |        |
| 5/1        | ca 9                     | 15                 |        |
| 25         | oc 4                     | 16                 |        |
| 26         | ao 10                    | 17                 |        |
| 27         | cm 10                    | 18                 |        |
| 28         | qc 1                     | 19                 |        |
| 29         | ed 3                     | 1 20               |        |
| 30         | ep 11                    | 5 51               |        |
| 31         | dv 1                     |                    |        |
|            |                          |                    |        |

FC reset to 29; FC end-carry may, but need not, reset FC.
AC should be clear at start.

### APP NDIX III - Glossary

Channel a Any

Any of the routes which may be taken by information

suprlied to the imput of the computer.

Channel Segment:

The portion of a channel which is contained in a section.

Component:

The most elementary parts of which the computer is

constructed, e.g., resistors, condensers, vacuum tubes, etc.

Sections

All stages which are supplied by a given voltage-variation

line.

Stage

An elementary circuit containing at least one vacuum tube (or its equivalent) which performs a single function (e.g., a flip-flop, a gate tube, or an electronic switch, but not a register, a counter, nor an adder).

Test Sequence:

A groun of instructions to the computer which cause information to be routed along certain channel segments. As used in this thesis, test sequence refers to a group which may be performed with the 32 registers of test storage.

Unita

Any portion of the computer which consists of at

least one stage.

#### BIBLIOGRAPHY

- 1. Adams, C. V., "Temporary Operations for Whirlwind I", Project Whirlwind, Servonechanisms Lab., M.I.T., Engineering Note E-250, 1949.
- P. Adems, C. V., "Test Program Number I. Computer Complementing", Project Whirlwind, Servomechanisms Lab., V.I.T., Engineering Note E-295, 1949.
- 3. Adams, C. V., "Test Program Number II, Operation Matrix Check", Froject Whirlwind, Servomechanisms Lab., M.I.T. Engineering Note E-296, 1949.
- 4. Adams, C. W., "Temporary Operation qs Switch Check: and Test Program Number VI - Switch Check", Project Whirlwind, Servomechanisms Lab., N.I.T., Engineering Note E-313, 1949.
- 5. Blumenthal, E. I. and G. G. Hoberg, "A Trouble-Location Scheme for a Digital Electronic Computer", S.F. Theris, F.I.T. 1945.
- 6. Cooper, G., "Checking Methods for Digital Computers", E.E. Seminar, M.I.T., 1949.
- 7. Cooper, G., "Display Program Number I: Family of Farabolas and Powers of X", Project Whirlwind, Servomechanisms Lab., M.I.T., Engineering Fote E-300, 1949.
- E. Cooper, G., "Proposed Method for Automatic Test Checking of Arithmetic Control Flip-Flops", Project Whirlwind, Servomechanisms Lab., N.I.T., Vemorandum N-877, 1949.
- 9. Eckert, J. P., J. W. Mauchly, E. E. Coldetine, and J. G. Prainerd, "Description of EFIAC and Comments on Electronic Pigital Computing Machines", APT Report 171.2P, University of Tennsylvania, 1945.
- 10. Eckert, J. P., "Reliability and Checking", Theory and Techniques for Design of Electronic Digital Computers, University of Pennsylvania, 1947.
- 11. Everett, R. P., "Checking by Check Problems", Project Whirlwind, Servemechanisms Lab., M.I.T., Vemorandum M-161, 1947.
- 12. Everett, P. P., "Whirlwind I Computer Flock Diagrams", Project
  Whirlwind, Serverechenisms Lab., M.I.T., Pepert F-127,
  1947. This report is to be superseded by Ferort
  R-180, (reference 19) when it is printed.

- 13. Forrester, J. W., "Digital Corvuters as Information-Processing Systems", Project Whirlwind, Servorechanisms Lab., M.I.T., Feport R-166, 1949.
- 14. Goldstine, H. H. and J. von Neumann, "Planning and Coding of Problems for an Electronic Computing Instrument", Institute for Advanced Study, 1947.
- 15. Roberg, G. G. and J. N. Ulman, Jr., "Gloseary of Commuter Terms",

  Project Whirlwind, Servomechanisms Lab., F.I.T.,

  Peport R-138, 1948.
- 16. Israel, D. R., "Introduction to Coding", Air Traffic Control
  Project, Servemechanisms Lab., F.I.T., Engineering
  Fote 3-2000, 1949.
- 17. Salzer, J. F., "Study of Input-Output Checking", Project
  Whirlwind, Servomechanisms Lab., F.I.T., Engineering
  Note E-225, 1949.
- 18. Salzer, J. M., C. W. Adams, and R. F. Fayer. "Description of Whirlwind I Codes", Project Whirlwind, Servenechanisms Lab., V.I.T., Engineering Note R-235, 1949.
- 19. Salzer, J. V., A still untitled report to be published shortly, Project Thirlwind, Servemechanisms Lab., V.I.T., Perort R-150.
- 20. Stibitz, C. P., "The Organization of Large-Colle Calculating Fachinery", Proceedings of a Symposium on Large-Scale Digital Calculating Fachinery, Harvard University, 1948.
- 21. Summer, G. C., "Trouble Location in a Large-Scale Electronic Pigital Computer", S.F. Thesis, M.T.T., 1948.
- 22. Velchman, W. G., "Notations for Coding", Project Whirlwind, Servomechanisms Lab., N.I.T., Conference Note C-93, 1949.
- 23. Welchman, W. C., "Example of Coding Procedure", Project Thirlwind, Servomechanisms Lab., N.I.T., Conference Fote C-04, 1949.

# LIST OF ILIUSTRATIONS

| FIG. No. | Drawing No. |                                                                   |
|----------|-------------|-------------------------------------------------------------------|
| 1        | A-35132     | Comparison between Digital and Analogue<br>Mathods of Computation |
| 2        | A-34353-1   | Digital Computer-Nain Units                                       |
| 3        | A-75129     | Flip-Flop                                                         |
| Į‡       | A=35130     | Gate Circuit                                                      |
| 5        | A-35131     | The Electronic Switch                                             |
| 6        | B-37001-5   | Flock Diagram Symbols, WWI                                        |
| 7        | C-37056     | A-Register, WII                                                   |
| g        | C-37097-6   | H-Pegister, WI                                                    |
| 9        | n-37173-2   | Accumulator, WVI                                                  |
| 10       | B-37062-6   | Program Counter, WWI                                              |
| 11       | C-37068-6   | Time-pulse Distributor, WWI                                       |
| 12       | D-37071-6   | System Block Diagram, Wil                                         |
| 13       | B-37098-6   | Central Control, WWI                                              |
| 14       | 0-37072-10  | Arithmetic Element, WWI                                           |
| 15       | 0-37178-2   | Invut-Cut ut Element, WI                                          |
| 16       | R-37195-1   | Program Timing, MI                                                |
| 17       | 4-35133     | Fulti-Channel Trouble Location                                    |
| 18       | n-35146     | Control Fatrix Cut-ut Connections, WWI                            |
| 19       | B-371711-2  | Special Add Pemory and Overflow, WVI                              |
| 20       | 7-37071-8   | Step Counter, WWI (Plock Diagrem)                                 |
| 21       | 1-31626-2   | Step Counter, WWI (Block Schematic)                               |
| 55       | A-32723-1   | Step-Counter Output, WWI (Mock Schematic                          |
| 23       | A-3513h     | Vargine of Correct Operation for Sample Problems.                 |

# DIGITAL APPROACH

ADDITION:

MULTIPLICATION:

# ANALOGUE APPROACH

ADDITION:



MULTIPLICATION:



IF AE // DC, THEN  $\Delta$  ABE  $\sim$   $\Delta$  DBC HENCE,  $\frac{BE}{AB} = \frac{BC}{DB} = \frac{BC}{I}$ 

BE = AB x BC = 5 x 3 = 15

COMPARISON BETWEEN DIGITAL AND ANALOGUE METHODS OF COMPUTATION



DIGITAL COMPUTER-MAIN UNITS





FLIP-FLOP CIRCUIT DIAGRAM



(b)

BLOCK REPRESENTATION OF FLIP-FLOP
FIG. 13

FLIP-FLOP CIRCUIT DIAGRAM



GATE CIRCUIT
(INPUT 2 FROM FLIP-FLOP)



BLOCK REPRESENTATION OF GATE CIRCUIT



# (a) SCHEMATIC REPRESENTATION



(b) BLOCK DIAGRAM REPRESENTATION

THE ELECTRONIC SWITCH

"ALBANENE" NO. 106L EAE CO. M. Y. X







"ALDANENE" NO. 100L EAE CO. N. Y. X











| CONTROL DIN CEC                                       | T |     |     |   |    |                |     |     |   |      |     |   | CONTRO  |
|-------------------------------------------------------|---|-----|-----|---|----|----------------|-----|-----|---|------|-----|---|---------|
| CONTROL PULSES                                        | 1 |     |     | T | IM | E P            | UL: | SES | 3 |      |     |   | MATRI   |
| (COMMANDS)                                            | T | 2   | 3   | 4 | 5  | 6              | 7   | 8   | 1 | 2    | 3   | 4 | OUTPU   |
| 102 PROGRAM COUNTER READ OUT                          | T | 2.  |     |   |    |                |     |     |   | 2    |     |   | 12      |
| PROGRAM COUNTER READ TO CHECK BUS                     |   | 2 1 |     |   |    |                |     |     |   | 21/2 |     |   | 9       |
| ADD TO PROGRAM COUNTER                                |   |     |     |   |    |                | 7   |     |   |      |     |   | 14      |
| 103 PROGRAM REGISTER CLEAR                            |   | 2   |     |   |    |                |     |     |   | 2    |     |   | 38      |
| SEE PROGRAM REGISTER READ IN                          |   |     |     | 4 |    |                |     |     |   |      |     | 4 | 39      |
| PROGRAM REGISTER READ OUT                             |   |     |     |   | #  |                |     |     |   |      |     |   | 28‡     |
| 104 CONTROL SWITCH CLEAR                              |   |     |     | 4 |    |                |     |     |   |      |     | 4 | 68      |
| CONTROL SWITCH READ IN                                |   |     |     |   | 1  |                |     |     |   |      |     |   | 66‡     |
| CONTROL SWITCH READ OUT                               |   |     |     |   |    | 6              |     |     |   |      |     |   | 77      |
| 201 STORAGE SWITCH CLEAR TSS 1/2 US LATER             | * |     |     | 4 |    |                |     |     | * |      |     | 4 | *,20    |
| 820 STORAGE SWITCH READ IN SELECTS TS OR ES AND LEAVE |   | 2   |     |   | #  |                |     |     |   | 2    |     |   | 17      |
| STORAGE SWITCH READ OUT                               |   |     | 3   |   |    | 6              |     |     |   |      | 3   |   | 37,18,3 |
| 200 STORAGE READ OUT PR ALWAYS, TS                    |   |     |     | 4 |    |                |     |     |   |      |     | 4 | 76      |
| 103 STORAGE READ TO CHECK BUS ONLY IF SELECTED        |   | 1.  |     | 4 |    | _              |     |     |   |      |     | 4 | 74      |
| 305 STEP COUNTER PRESET                               |   |     |     | 4 |    |                |     |     |   |      |     | 4 | 3       |
| STEP COUNTER READ IN                                  | 1 | -   |     |   | #  |                |     |     |   |      |     |   | 2‡      |
| 601 CHECK REGISTER READ IN                            |   |     | 3   |   |    | 6              |     |     |   |      | 3   |   | 49      |
| TRANSFER CHECK                                        |   | •   | 3 2 |   |    | $6\frac{1}{2}$ |     |     |   | 1/2  | 3 2 | L | 56      |
| CHECK REGISTER CHECK                                  | 1 |     | 1   |   |    |                |     | 82  |   |      |     |   | 47      |
| 810 ES READ                                           | 1 | 1   | 3   |   |    | -              |     |     |   |      | 3   |   | 81      |
|                                                       |   |     |     |   |    |                |     |     |   |      |     |   |         |
|                                                       |   |     |     |   |    |                |     |     |   |      |     |   |         |
|                                                       |   |     |     |   |    |                |     |     |   |      |     |   |         |
|                                                       |   |     |     |   |    |                |     |     |   |      |     |   |         |
| STORAGE SWITCH CLEAR ALWAYS OCCURS ON                 |   | 1   | 1   |   |    |                |     |     |   |      |     |   |         |
| TIME PULSE I, BUT OUTPUT NUMBER 82 OR                 |   |     |     |   |    |                |     |     |   |      |     |   | Latin ! |
| 19 IS USED, DEPENDING ON WHETHER OR NOT               | L |     |     |   |    |                |     |     |   |      |     |   |         |
| ES WRITE OCCURS. SEE EACH INDIVIDUAL                  |   |     |     |   |    |                |     |     |   |      |     | • |         |
| OPERATION .                                           |   |     |     |   |    |                | ,   |     |   |      |     |   |         |
|                                                       |   |     |     |   |    | -              |     | -   |   |      |     |   |         |
|                                                       |   |     |     |   |    |                |     |     |   |      |     |   |         |
|                                                       |   |     |     |   |    |                |     |     |   |      |     | - |         |
|                                                       |   |     |     |   |    |                |     |     |   |      |     |   |         |
|                                                       |   |     |     |   |    | 1              |     |     |   |      |     |   |         |

| 145/14<br>146 | SCALE         | D I V                 |  |  |   |  |
|---------------|---------------|-----------------------|--|--|---|--|
| R.P. W        |               | MASSAC<br>DIVISION OF |  |  |   |  |
| CK            |               | - T -                 |  |  |   |  |
|               | TIMING<br>PRO | HUSETTS<br>NOUSTRIAL  |  |  |   |  |
| •             | GR GR         | 0 41                  |  |  |   |  |
| APP.          | AG<br>M       | INSTIT                |  |  |   |  |
|               | RAM           | UTE<br>UTE            |  |  | L |  |

TECHNOLOGY

GRADED BY: DATE: THIS IS A GRADED DRAWING OF HIGHEST GRADE APPROVED BELOW:

GRADE I FOR REFERENCE ONLY DEE 11-21-49 GRADE II PRELIMINARY DESIGN

GRADE III FINAL DESIGN

\* THESE OUTPUT UNITS, WHICH ARE PULSED ON TIME PULSE 5, ARE GATED BY THE TI OPERATION LINE AND NOT BY THE PROGRAM TIMING LINE (OUTPUT NUMBER IT IS GATED BY BOTH). THIS IS BECAUSE "CONTROL SWITCH CLEAR" ON TIME PULSE 4 SELECTS TI, DE-ENERGIZING THE PROGRAM TIMING LINE.



MULTI-CHANNEL TROUBLE LOCATION

OPERATION LINES (AND PROGRAM TIMING LINE) TIME PULSES CONTROL PULSES -- PROGRAM TIMING LINE ON -reffreder telata ndvisi sr of 109 STOP CLOCK 50 OZ PROGRAM COUNTER CLEAR READ IN 8 12 READ OUT 10 READ TO CHECK BUS 9 ADD TO PROGRAM COUNTER 14 29 OUTPUT UNIT NUMBER 103 PROGRAM REGISTER CLEAR 38 (365) 85 READ IN 39 READ OUT 28 00 10 20 30 40 50 60 70 80 90 100 110 120 0 74 8 4 1 6 7 6 2 4 7 40 104 CONTROL SWITCH CLEAR 68 1 7 7 2 6 3 8 1 3 3 3 2 5 2 1 4 8 3 6 7 1 4 7 66 77 3 4 74 6 2 7 6 8 8 8 3 READ OUT 5 7½ 3 6 2 0 7 7 6 6 8 0 0½ 8 7 2 3 6½ 5 4 7 ½ 6 7 6 5 1 3 8½ 7 7 6 1 8 1 6 5 2 2 8 4 2 7 STORAGE SWITCH CLEAR (TSS & US LATER) ri retf th 3 24 1 6 4 2 7 8 7 READ IN ( SE LEAVE OF LEAR 17 READ OUT 18 37 103 STORAGE CLEAR PR CLEAR 61 ac 62 rd SEE 103 FOR PR - (rt.II) 75 - READ IN PRIF ES SELECTED 71 72 (rt. 11) 67 OUTPUT UNIT NUMBER ONLY P SELECTED READ OUT 63 64 13 76 - READ TO CHECK BUS 65 300 301 ADD 3 305 200 304 307 308 305 200 301 810 408 410
4 305 102 306 300 309 502 200 200 810 408 410
5 306 502 310 306 500 302 200 200 810 408 410
5 306 502 310 306 501 302 200 200 103 403
6 102 206 300 308 300 601 104 200
403 401
7 661 201 304 201 601 302 200 104
8 102 201 103 103 103 601 306 300
9 102 201 103 103 103 601 307 301 301 . SUBTRACT 80 302 CARRY 46 Cacsa cacsadsucm ao 600 ARITHMETIC CHECK 32 309 SPECIAL ADD SPECIAL CARRY 26 Cacs 20 302 END AROUND CARRY 16 303 ROUNDOFF 52 304 COMPARE 21 301 A-REGISTER CLEAR 70 79 73 READ OUT 69 60 302 ACCUMULATOR CLEAR 41 42 51 READ OUT 43 THE OUTPUT UNITS WHICH ARE PULSED ON TIME PULSE 5 ARE GATED BY THE [] OPERATION LINE AND NOT BY THE PROBRAM THINING LINE. (OUTPUT NUMBER!? IS GATED BY BOTH). THIS IS BECAUSE CONTROL SWITCH CLEAR ON TIME PULSE 4 SELECTS 1, DC-ENERGIZING THE PROGRAM TIMING LINE. READ TO CHECK BUS -(rt.ii) 55 READ TO B-REGISTER 59 CARRY CLEAR 303 B-REGISTER CLEAR 304 CHECK MAGNITUDE 27 AR SIGN CHECK 22 AC SIGN CHECK 23 STORAGE SWITCH CLEAR ALWAYS OCCURS ON TIME PULSE I, BUT OUTPUT NUMBER 82 OR 19 IS USED, DEPENDING ON WHETHER OR NOT <u>ES WRITE</u> OCCURS, AS DETERMINED BY EACH INDIVIDUAL OPERATION. PRODUCT SIGN COUNTER PRESET READ IN 5 READ OUT ADD TO STEP COUNTER 305/306 MULTIPLY 305/308 DIVIDE 36 305/307 SHIFT LEFT 33 305/307 SHIFT RIGHT 31 305/310 POINT OFF 400 RUN FORWARD 90 RUN BACKWARD 92 REMOTE UNIT STOP 91 403 IN OUT REGISTER CLEAR 93 READ OUT 95 98 READ TO CHECK BUS 100 ADD IN OUT REGISTER TO COMPARISON RESISTER COMPARISON REGISTER SET 105 IN OUT READ 101 IN OUT RECORD 103 TC SENSE 7 IN SENSE GOI CHECK REGISTER CLEAR CONTROL MATRIX OUTPUT CONNECTIONS, WWI 3 30 49 READ IN 58 - (rt. 11) TRANSFER CHECK 56 CHECK REGISTER CHECK 47 A.M.G. 1.5.50 BIO ES READ 81 TES CLEAR ING LATER 458 WRITE 83 D-35146

FIG.

1

2









| PROBLEM                        |            | VOLTAGE-VARIATION LINE NUMBER |            |            |            |            |            |            |            |     |  |  |  |
|--------------------------------|------------|-------------------------------|------------|------------|------------|------------|------------|------------|------------|-----|--|--|--|
| BEING<br>PERFORMED             | 77         |                               | 78         |            | 7          | 9          | 8          | 0          | 11         | 3   |  |  |  |
| DISPLAY PROGRAM #1             | (1)<br>-32 | (1)<br>+34                    | (1)<br>-30 | (1)<br>+31 | -10        | (1)<br>+49 | (I)<br>-13 | (1)<br>+30 | (I)<br>-17 | +48 |  |  |  |
| DISPLAY PROGRAM *I (PARABOLAS) | (I)<br>-32 | (1)<br>+34                    | (1)<br>-30 | (I)<br>+31 | -10        | (1)<br>+49 | -16        | +30        | -18        | +48 |  |  |  |
| TEST PROGRAM *I                | -28        | +39                           | -29        | +27        | (2)<br>-20 | +41        | -24        | (2)<br>+33 | -9.5       | +46 |  |  |  |
| TEST PROGRAM #2                | -25        | +30                           | -26        | +25        | -18        | +40        | -22        | +23        | -10        | +38 |  |  |  |
| TEST SEQUENCE #1               | -26        | +33                           | -25        | +26        |            |            |            |            |            |     |  |  |  |
| TEST SEQUENCE #2               |            |                               |            |            | -17        | +42        | -22        | +31        |            |     |  |  |  |
| TEST SEQUENCE #3               |            |                               |            |            |            |            |            |            | -11        | +39 |  |  |  |

NOTE I: NO ACTUAL ALARM OBTAINED. MARGIN MEASURED BY NOTING VOLTAGE AT WHICH THE PATTERN ON THE DISPLAY OSCILLOSCOPE BECOMES DISTORTED.

NOTE 2: NO ACTUAL ALARM OBTAINED. THE "PROLONGED STOP-CLOCK" CONDITION OCCURS AT THE INDICATED VOLTAGES.

# MARGINS OF CORRECT OPERATION FOR SAMPLE PROBLEMS (DATA TAKEN ON DECEMBER 9, '49)

