< previous page page_510 next page >

Page 510
13. If a set of shadow registers, RS1, RS2, RS3, RS4 were available, how could the code in problem 9 be restructured to minimize dependencies? Show the new M1.
14. Using the assumptions of study 7.1, time out the code in problem 9 for
d87111c01013bcda00bb8640fdff6754.gif
(a) Control flow (improved).
d87111c01013bcda00bb8640fdff6754.gif
(b) Dataflow (with value-holding reservation stations).
d87111c01013bcda00bb8640fdff6754.gif
(c) Control flow with shadow registers.
15. How would the dataflow timing change in study 7.1 if all three instructions were issued in the same cycle?
16. Repeat problem 15 for the improved scoreboard, again using the functional unit timing of study 7.1.
17. A certain vector processor has a memory buffer that does not allow reconfiguration (each TBF/m is a separate buffer). How does this affect the overall overflow probability?
18. Using the baseline area assumptions of Chapters 2,4, and this chapter, estimate the overall area (f = 0.75) of a dual-issue superscalar processor with 8KB I-cache and 8KB D-cache. Assume double-sized ALU to accommodate execution. State all assumptions and show functional area allocation.

 
< previous page page_510 next page >