page_75




		2. Partition the actions into approximately S_opt segments.



		We begin by partitioning the actions into possible segments (Figure 2.12(a)), using as a rough rule (not a maximum) the expected time per segment,



		T_seg = time per segment = 10.0 ns.



		We label each stage with the principal action that occurs within it.



		Note that some stages (e.g., 2 and 3) will not "fit" into the target T_seg = 10 ns, but they do not greatly exceed T_seg.



		3. We now compute the total instruction execution time based on several trial cycle times.



		In the preceding partition, the worst stage delay was 13 ns (stage 2), excluding clocking effects. So now let us use T_seg = 13 ns:



		Dt = cycle time = max stage time + overheads
		   = (1 + k)T_seg + C
		   = (13 ns)(1.05) + 4 ns = 17.65 ns

		T_inst = (stages)(Dt) = (9)(17.65 ns) ≈ 159 ns.
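		The cycle-time arithmetic for the 13 ns trial can be checked with a minimal Python sketch. The function and parameter names are illustrative and do not come from the text. The defaults k = 0.05 and C = 4 ns are read off the factors 1.05 and + 4 in the worked numbers, so treat them as assumptions.

```python
def cycle_time(t_seg_ns, k=0.05, c_ns=4.0):
    """Cycle time Dt = (1 + k) * T_seg + C, with all times in ns.

    k and c_ns are assumed from the worked example (1.05 and + 4 ns).
    """
    return (1.0 + k) * t_seg_ns + c_ns


def instruction_time(stages, dt_ns):
    """Total instruction time when every stage takes exactly one cycle."""
    return stages * dt_ns


dt_13 = cycle_time(13.0)                 # worst stage delay used as T_seg
t_inst_13 = instruction_time(9, dt_13)   # nine stages in this partition
print(f"Dt = {dt_13:.2f} ns, T_inst = {t_inst_13:.2f} ns")
```

		The unrounded product is slightly below the 159 ns quoted in the text, which rounds to the nearest nanosecond.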



		We could also try a minimum partition of (say) T_seg = 9 ns:



		Dt = a stage time + overheads
		   = (9 ns)(1.05) + 4 ns = 13.45 ns ≈ 13.5 ns

		T_inst = (stages that fit in 9 ns)(Dt) + (stages that do not fit)(2Dt)
		       = (3)(13.5 ns) + (6)(2)(13.5 ns)
		       = 202.5 ns.
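		The 9 ns trial differs from the 13 ns one in that stages too long for the cycle are charged two cycles each. A small sketch of that accounting follows. The helper name is hypothetical, and the stage counts (3 that fit, 6 that do not) and the rounded Dt = 13.5 ns are taken directly from the worked example.

```python
def multicycle_instruction_time(n_fit, n_overflow, dt_ns, overflow_cycles=2):
    """Instruction time when some stages need more than one cycle.

    n_fit stages take one cycle each; n_overflow stages take
    overflow_cycles cycles each (2 in the worked example).
    """
    total_cycles = n_fit + n_overflow * overflow_cycles
    return total_cycles * dt_ns


DT_9_NS = 13.5  # cycle time for T_seg = 9 ns, rounded as in the text

t_inst_9 = multicycle_instruction_time(n_fit=3, n_overflow=6, dt_ns=DT_9_NS)
print(f"T_inst = {t_inst_9} ns over {3 + 6 * 2} cycles")
```

		The shorter cycle is outweighed by the 15 cycles an instruction now needs, which is why this trial gives the longest instruction time of the three.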



		At the other extreme, if we tried a coarser partition such as T_seg = 20 ns, we would have Figure 2.12(b), which creates a cycle of:



		Dt = (20 ns)(1.05) + 4 = 25 ns



		and a total instruction execution time of:



		T_inst = (5 stages)(25 ns/stage) = 125 ns.
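		To see the three trial partitions side by side, the figures above can be tabulated in a few lines of Python. This is only an illustrative sketch: the cycle counts and cycle times are copied from the worked example, and the variable names are not from the text.

```python
# (label, cycles per instruction, cycle time Dt in ns), as given in the text
trials = [
    ("T_seg = 9 ns", 3 + 6 * 2, 13.5),   # 3 one-cycle and 6 two-cycle stages
    ("T_seg = 13 ns", 9, 17.65),         # 9 one-cycle stages
    ("T_seg = 20 ns", 5, 25.0),          # 5 one-cycle stages
]

# Total execution time of one instruction for each trial, in ns
t_inst = {label: cycles * dt for label, cycles, dt in trials}

for label, total in t_inst.items():
    print(f"{label}: T_inst = {total:.2f} ns")

shortest = min(t_inst, key=t_inst.get)
print("Shortest single-instruction time:", shortest)
```

		Note that T_inst measures the time for a single instruction to pass through all stages. It is not by itself the performance figure, which is the subject of the next step.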



		4. We now compute the expected performance G in million instructions per second for each trial cycle used previously.



		The performance for each partition can be computed:
