Question 4: Pipeline Performance Calculation (1 Point) Consi…

Question 4: Pipeline Performance Calculation (1 Point) Consider a pipeline with the following stages and their respective delays: Stage 1 (Fetch): 200 ps (can be parallelized over 2 units, each unit costs 150 ps) Stage 2 (Decode): 150 ps (cannot be parallelized) Stage 3 (Execute): 300 ps (can be parallelized over 3 units, each unit costs 120 ps) Stage 4 (Memory): 250 ps (cannot be parallelized) Stage 5 (Write Back): 100 ps (can be parallelized over 2 units, one unit costs 70ps, and the other costs 90ps) a) Calculate the proper clock frequency for this pipeline to achieve the highest performance without any timing violations. b) Calculate the total delay to process one instruction without any parallelization, assuming we would like to achieve the highest performance without any timing violations. c) Calculate the new stage delays when parallelization is applied as specified for each of the stages, and determine the new total delay to process one instruction, assuming we would like to achieve the highest performance without any timing violations. d) Compute the speedup achieved through parallelization, assuming the pipeline can be fully loaded. e) Based on the parallelized processor in (c) Assuming we need to run 10 instructions and there is no hazard, compute the total delay to finish all the 10 instructions.

Question 1: Verilog Timing Diagram Interpretation (1 Point)…

Question 1: Verilog Timing Diagram Interpretation (1 Point) Consider the following Verilog code snippet: module sequence_detector(     input clk,     input reset,     input in_bit,     output reg out_bit );     reg state;     always @(posedge clk or posedge reset) begin         if (reset)             state

Question 5: Pipeline Arrangement Optimization (1 Point) Give…

Question 5: Pipeline Arrangement Optimization (1 Point) Given that the pipeline performance is limited by the longest stage, propose how you would split the stages from Question 4 to achieve better balance. Assuming that you can split Stage 4 into two stages while keeping the total delay of 250ps. Note that you can not split other stages.

Question 3: Pipeline Execution Diagram (1 Point) Let’s assum…

Question 3: Pipeline Execution Diagram (1 Point) Let’s assume a simple processor has a 5-stage instruction pipeline: Instruction Fetch (IF) Instruction Decode (ID) – Register operations happen here Execute (EX) Memory Access (MEM) Write Back (WB) Given the following sequence of instructions: LOAD R1, 0(R2); load memory address to register R1 ADD  R3, R1, R4; add register R1 and R4, write to R3 SUB  R5, R3, R6; subtract register R3 with R6, write to R5 STORE R5, 0(R7); store register R5 to memory address The pipeline execution diagram for these instructions is shown below when assuming there are no pipeline hazards or stalls.   Identify any data hazards present and list them. If hazards exist, redraw the pipeline diagram using the provided template to include stalls to handle the hazards, assuming the register does not support write-through reads.   Insert a table in the text box to answer this question like the following LOAD IF xx xx xx ADD SUB STORE