|
|
|
|
|
|
|
13. If a set of shadow registers, RS1, RS2, RS3, RS4 were available, how could the code in problem 9 be restructured to minimize dependencies? Show the new M1. |
|
|
|
|
|
|
|
|
14. Using the assumptions of study 7.1, time out the code in problem 9 for |
|
|
|
 |
|
|
|
|
(a) Control flow (improved). |
|
|
|
 |
|
|
|
|
(b) Dataflow (with value-holding reservation stations). |
|
|
|
 |
|
|
|
|
(c) Control flow with shadow registers. |
|
|
|
|
|
|
|
|
15. How would the dataflow timing change in study 7.1 if all three instructions were issued in the same cycle? |
|
|
|
|
|
|
|
|
16. Repeat problem 15 for the improved scoreboard, again using the functional unit timing of study 7.1. |
|
|
|
|
|
|
|
|
17. A certain vector processor has a memory buffer that does not allow reconfiguration (each TBF/m is a separate buffer). How does this affect the overall overflow probability? |
|
|
|
|
|
|
|
|
18. Using the baseline area assumptions of Chapters 2,4, and this chapter, estimate the overall area (f = 0.75) of a dual-issue superscalar processor with 8KB I-cache and 8KB D-cache. Assume double-sized ALU to accommodate execution. State all assumptions and show functional area allocation. |
|
|
|
|
|