## Simple Processor Design Single Cycle Implementation Chapter 5.1-5.4 EEC170 FQ 2005 "How to eat an elephant? One byte at a time"











-





































































































## • By nature of design CPI = 1

• CCT is determined by the longest instruction, Load, which uses all executions units

| Instruction<br>Class | Functional Units used by instruction class |          |     |           |           |  |  |  |  |  |
|----------------------|--------------------------------------------|----------|-----|-----------|-----------|--|--|--|--|--|
| R-type               | I Fecth                                    | Reg Read | ALU | Reg Write |           |  |  |  |  |  |
| Load                 | I Feeth                                    | Reg Read | ALU | Mem Read  | Reg Write |  |  |  |  |  |
| Store                | I Fecth                                    | Reg Read | ALU | Mem Write |           |  |  |  |  |  |
| Branch               | I Fecth                                    | Reg Read | ALU |           |           |  |  |  |  |  |
| Jump                 | I Fecth                                    |          |     |           |           |  |  |  |  |  |

• Example timing for functional units

• Register file: 50ps

ALU and adders: 100ps

• Memory: 200ps

 Clock cycle time is determined by Load instruction

| Instruction<br>Class | IFetch | Reg<br>Read | ALU   | Memory<br>Access | Reg<br>Write | Total |
|----------------------|--------|-------------|-------|------------------|--------------|-------|
| R-type               | 200ps  | 50ps        | 100ps |                  | 50ps         | 400ps |
| Load                 | 200ps  | 50ps        | 100ps | 200ps            | 50ps         | 600ps |
| Store                | 200ps  | 50ps        | 100ps | 200ps            |              | 550ps |
| Branch               | 200ps  | 50ps        | 100ps |                  |              | 350ps |
| Jump                 | 200ps  |             |       |                  |              | 200ps |

Summary
• Good news: CPI = 1 is excellent

Bad news:

• CCT = 600ps is poor

• Redundant hardware

Multiple adders Two memory units

Figure of merit: average instruction execution time (IET) = CPI x CCT = 1 x 600ps = 600ps

Can we do better? Next attempt Multicycle design