Answer from cs61c-cb (minh uyen nguyen 16765774) for Question 2

The second instruction needs $t0 in its second stage, where registers are read, but $t0 is not ready until the fifth stage of the first instruction, where the result is written back. Because of this dependency, the second instruction has to wait, so the pipeline cannot run at full speed. I'm not sure which hardware schemes are used to reduce the performance penalty.
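
To make the timing concrete, here is a rough sketch that counts how many cycles the dependent instruction would have to wait. It is not from the course materials: the function name `stall_cycles`, its parameters, and the `same_cycle_ok` flag are all made up for illustration, and it assumes one instruction is issued per cycle and that the only fix available is stalling. The flag covers the two common conventions for whether a register written in a cycle can be read in that same cycle.

```python
def stall_cycles(distance, read_stage=2, write_stage=5, same_cycle_ok=True):
    """Count the bubbles a dependent instruction needs before reading its operand.

    distance      -- how many instructions after the producer the consumer
                     issues (1 means the very next instruction)
    read_stage    -- stage in which the consumer reads the register (assumed 2)
    write_stage   -- stage in which the producer writes the register (assumed 5)
    same_cycle_ok -- assumption: a register written in a cycle can also be
                     read in that same cycle
    """
    if distance < 1:
        raise ValueError("distance must be at least 1")
    # With the producer issued in cycle 1, it writes in cycle `write_stage`.
    # Without stalls, the consumer would read in cycle `distance + read_stage`.
    gap = write_stage - (distance + read_stage)
    if not same_cycle_ok:
        gap += 1  # the read must wait until the cycle after the write
    return max(0, gap)


# Stalls for a consumer 1, 2, 3, or 4 instructions behind the producer,
# under both register-file assumptions.
for d in (1, 2, 3, 4):
    print(d, stall_cycles(d), stall_cycles(d, same_cycle_ok=False))
```

Under these assumptions, a back-to-back dependency (distance 1) costs 2 stall cycles if the register can be written and read in the same cycle, and 3 if it cannot.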