This document contains the answers to four questions regarding assembly language programs, pipelining, and processor stage latencies. For the first question, it lists all the dependencies in a sample assembly program. For the second question, it calculates speedup for single-cycle and pipelined processors with and without stalls. For the third question, it analyzes the timing and cycles required to execute a loop in a pipelined processor with and without forwarding. For the fourth question, it calculates clock cycle times and speedup for single-cycle and pipelined processors given stage latencies, and determines optimal stage groupings for 3-stage and 6-stage pipelines.

CSN-221-Assignment-4

GURPREET SINGH-18116029
22 October 2019

1 Question
Consider the following assembly language program.
I1: MOV R3, R7
I2: LD R8, [R3]
I3: ADD R3, R3, 4
I4: LOAD R9, [R3]
I5: BNE R8, R9, I3
List all the dependencies in this code.

Answer

True dependencies (RAW):

I1 -> I2 (R3)
I1 -> I3 (R3)
I2 -> I5 (R8)
I3 -> I4 (R3)
I4 -> I5 (R9)

Output dependency (WAW):

I1 -> I3 (R3)

Anti-dependency (WAR):

I2 -> I3 (R3)
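
The lists above can be cross-checked mechanically. The following is a minimal sketch, not part of the original assignment: the tuple encoding of each instruction (label, destination register, source registers) and the helper name `find_dependencies` are assumptions made for illustration. It treats the code as a straight-line sequence, so it ignores any loop-carried dependencies created by the backward branch at I5, and it only reports a dependency when no instruction in between overwrites the register.

```python
# Hypothetical encoding: (label, destination register or None, source registers).
program = [
    ("I1", "R3", ["R7"]),        # MOV  R3, R7
    ("I2", "R8", ["R3"]),        # LD   R8, [R3]
    ("I3", "R3", ["R3"]),        # ADD  R3, R3, 4
    ("I4", "R9", ["R3"]),        # LOAD R9, [R3]
    ("I5", None, ["R8", "R9"]),  # BNE  R8, R9, I3
]


def find_dependencies(prog):
    """Return (raw, waw, war) lists of (earlier, later) label pairs."""
    raw, waw, war = [], [], []
    for j, (later, dst_j, srcs_j) in enumerate(prog):
        for i in range(j):
            earlier, dst_i, srcs_i = prog[i]
            # Registers overwritten strictly between the two instructions.
            killed = {d for _, d, _ in prog[i + 1:j] if d is not None}
            # RAW: the earlier instruction writes what the later one reads.
            if dst_i is not None and dst_i in srcs_j and dst_i not in killed:
                raw.append((earlier, later))
            # WAW: both instructions write the same register.
            if dst_i is not None and dst_i == dst_j and dst_i not in killed:
                waw.append((earlier, later))
            # WAR: the later instruction overwrites what the earlier one reads.
            if dst_j is not None and dst_j in srcs_i and dst_j not in killed:
                war.append((earlier, later))
    return raw, waw, war


raw, waw, war = find_dependencies(program)
print("RAW:", raw)
print("WAW:", waw)
print("WAR:", war)
```

Under these assumptions the script reports the same five RAW pairs, one WAW pair, and one WAR pair as the answer.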

2 Question
We have a single-stage, non-pipelined machine and a pipelined machine with 5
stages. The cycle time for the former is 5 ns and for the latter is 1 ns.
a. Assuming no stalls, what is the speedup of the pipelined machine over the
single-stage machine?
b. Given that the pipeline stalls 1 cycle for 40% of the instructions, what is the
speedup now?

Answer

a)
Let the number of instructions be n.
Single-stage time = n x 5 ns; pipelined time = (5 + n - 1) x 1 ns.
Speedup = 5n / (5 + n - 1) = 5n / (n + 4)
When the number of instructions is very large (n -> infinity),
speedup = 5
b) Average CPI = 1 + 0.4 x 1 = 1.4
Speedup = 5n / 1.4n = 3.57 (for large n)
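
The arithmetic can be checked numerically. This is a small sketch under stated assumptions: the function name `pipeline_speedup` and its parameters are illustrative, and the model counts the (stages - 1) pipeline fill cycles explicitly, which the closed-form answer for part b drops because they vanish for large n.

```python
def pipeline_speedup(n, stages=5, t_single_ns=5.0, t_pipe_ns=1.0,
                     stall_fraction=0.0, stall_cycles=1):
    """Speedup of the pipelined machine over the single-stage machine."""
    # Single-stage machine: one 5 ns cycle per instruction.
    single_time = n * t_single_ns
    # Pipelined machine: fill cycles plus one cycle per instruction,
    # plus the average stall cycles per instruction.
    cycles = (stages - 1) + n * (1 + stall_fraction * stall_cycles)
    return single_time / (cycles * t_pipe_ns)


for n in (1, 10, 1000, 10**9):
    print(n, pipeline_speedup(n), pipeline_speedup(n, stall_fraction=0.4))
```

As n grows, the first column approaches 5 and the second approaches 5 / 1.4.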

3 Question
Use the following code fragment.
I1: Loop: LD R1, 0[R2]
I2: DADDI R1, R1, 1
I3: SD 0[R2], R1
I4: DADDI R2, R2, 4
I5: DSUB R4, R3, R2
I6: BNEZ R4, Loop

a. List all the True RAW data dependencies.

b. Show the timing of this instruction sequence for a 5-stage pipeline along
with the number of cycles required to execute one iteration of the loop with no
forwarding.

c. Show the timing of this instruction sequence for a 5-stage pipeline along
with the number of cycles required to execute one iteration of the loop with
forwarding.
Assume registers can be written and read in the same cycle, during write back.
(The number of cycles for the execution of one iteration of the loop ends after
the A (ALU) stage of BNEZ instruction.)

Answer :

a) RAW dependencies (4 in total):

I1 -> I2 (R1)
I2 -> I3 (R1)
I4 -> I5 (R2)
I5 -> I6 (R4)

b) 16 cycles (S = stall)

Cycle     1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16
LD        F  D  E  M  W
DADDI        F  S  S  D  E  M  W
SD                    F  S  S  D  E  M  W
DADDI                          F  D  E  M  W
DSUB                              F  S  S  D  E  M  W
BNEZ                                       F  S  S  D  E
outside                                             F  S

c) 9 cycles (S = stall)

Cycle     1  2  3  4  5  6  7  8  9
LD        F  D  E  M  W
DADDI        F  S  D  E  M  W
SD                 F  D  E  M  W
DADDI                 F  D  E  M  W
DSUB                     F  D  E  M
BNEZ                        F  D  E
outside                        F  S
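
Both cycle counts can be reproduced with a very small timing model. This is a sketch under assumptions, not a full pipeline simulator: the instruction encoding and the function name `simulate` are hypothetical, each instruction is drawn in F during the cycle its predecessor is in D, a result is readable in D in the write-back cycle without forwarding, and with forwarding a load result is usable one cycle after M and an ALU result one cycle after E. The count ends at the E stage of the last instruction, as the question specifies.

```python
# Hypothetical encoding: (opcode, destination register or None, source registers).
loop = [
    ("LD",    "R1", ["R2"]),
    ("DADDI", "R1", ["R1"]),
    ("SD",    None, ["R1", "R2"]),
    ("DADDI", "R2", ["R2"]),
    ("DSUB",  "R4", ["R3", "R2"]),
    ("BNEZ",  None, ["R4"]),
]


def simulate(program, forwarding):
    """Return the cycle in which the last instruction is in its E stage."""
    ready = {}   # register -> earliest cycle a consumer may be in D
    prev_d = 0   # D cycle of the previous instruction
    e = 0
    for op, dst, srcs in program:
        f = max(prev_d, 1)                  # F drawn while predecessor decodes
        d = max([f + 1] + [ready.get(r, 0) for r in srcs])
        e, m, w = d + 1, d + 2, d + 3       # no stalls after decode
        if dst is not None:
            if not forwarding:
                ready[dst] = w              # read in D during write-back
            elif op == "LD":
                ready[dst] = m              # forwarded after the M stage
            else:
                ready[dst] = e              # forwarded after the E stage
        prev_d = d
    return e


print("no forwarding:", simulate(loop, forwarding=False))
print("forwarding:   ", simulate(loop, forwarding=True))
```

Under these assumptions the model gives 16 cycles without forwarding and 9 cycles with forwarding, matching the timing diagrams.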

4 Question
Individual stages of a processor have the following latencies.
Stage         F    D    A    M    W
Latency (ps)  210  90   110  240  50

If the processor is pipelined, each pipeline latch adds a latency of 20 ps to
the stage that precedes it. This is the so-called "setup latency": the signals
need to be stable at the input of the latch for some amount of time before they
can be latched correctly at the end of the cycle. In the single-cycle approach, no
pipeline is used, and in each cycle one instruction is executed from start (F) to
finish (W).

a. What is the clock cycle time if we implement this processor using single-
cycle approach (in ps)?

b. What is the clock cycle time if we implement this processor using a 5-stage
pipeline (in ps)?

c. What is the speedup of the pipelined processor over a single-cycle processor
if the single-cycle processor has a CPI of 1 and the pipelined processor achieves
a CPI of 1.2?

d. If the processor must be implemented with a 3-stage pipeline, some of the
existing 5 stages must be combined (assume that the existing 5 stages cannot
be split). Which of the existing five stages (F, D, A, M, W) should be placed
into which stage of the 3-stage pipeline to minimize the resulting clock cycle
time?

e. If the processor is to be implemented with a 6-stage pipeline, but the design
effort and time to market are such that there is only enough time to split one of
the five existing (F, D, A, M, W) stages into two new stages, which stage would
you choose to split?

Answer :

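a) Cycle time: 210 + 90 + 110 + 240 + 50 = 700 ps

b) Cycle time: 240 + 20 = 260 ps (the slowest stage, M, plus the latch latency)

c) CPU time = CPI x cycle time x number of instructions

CPU_A = 1 x 700 x N (single-cycle)

CPU_B = 1.2 x 260 x N (pipelined)

Speedup = CPU_A / CPU_B = 700 / 312 = 2.24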
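
Parts a to c reduce to a few lines of arithmetic, sketched here for checking. The variable names are illustrative; the latencies, the 20 ps latch overhead, and the CPI values come from the question.

```python
# Stage latencies in picoseconds, as given in the question.
latency_ps = {"F": 210, "D": 90, "A": 110, "M": 240, "W": 50}
LATCH_PS = 20

# a) Single-cycle: one instruction passes through every stage in one cycle.
single_cycle_ps = sum(latency_ps.values())

# b) 5-stage pipeline: the slowest stage plus the latch latency sets the clock.
pipelined_cycle_ps = max(latency_ps.values()) + LATCH_PS

# c) Speedup = (CPI_A x cycle_A) / (CPI_B x cycle_B); the instruction count cancels.
speedup = (1.0 * single_cycle_ps) / (1.2 * pipelined_cycle_ps)

print(single_cycle_ps, pipelined_cycle_ps, round(speedup, 2))
```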

d) 3-stage pipeline:

Stage 1: F - 210 ps

Stage 2: D, A - 200 ps

Stage 3: M, W - 290 ps

Cycle time = 290 + 20 = 310 ps
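
With only five stages, every way of merging adjacent stages into three groups can be enumerated. The sketch below does that; the helper name `best_grouping` is hypothetical, and it assumes that only adjacent stages may be merged and that each merged stage still pays a single 20 ps latch latency.

```python
from itertools import combinations

# Stage latencies in picoseconds, in pipeline order.
latency_ps = {"F": 210, "D": 90, "A": 110, "M": 240, "W": 50}


def best_grouping(stages, groups, latch_ps=20):
    """Try every split of the ordered stages into contiguous groups."""
    names = list(stages)
    best = None
    # Choose the positions where one group ends and the next begins.
    for cuts in combinations(range(1, len(names)), groups - 1):
        bounds = (0, *cuts, len(names))
        parts = [names[a:b] for a, b in zip(bounds, bounds[1:])]
        # The slowest merged stage, plus the latch, sets the clock.
        cycle = max(sum(stages[s] for s in part) for part in parts) + latch_ps
        if best is None or cycle < best[0]:
            best = (cycle, parts)
    return best


cycle, parts = best_grouping(latency_ps, 3)
print(cycle, parts)
```

Of the six possible groupings, F | D, A | M, W is the only one that reaches 310 ps; the next best, F, D | A | M, W, gives 320 ps.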

e) Split the stage with the longest latency.

Hence, we split stage M into two equal halves, each with a stage time of 120 ps.
F (210 ps) then becomes the slowest stage, so the new cycle time = 210 + 20 = 230 ps
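
The choice of stage can be confirmed by trying each candidate in turn. This is a sketch that assumes the chosen stage splits into two equal halves, which real logic may not allow; the helper name `cycle_after_split` is hypothetical.

```python
# Stage latencies in picoseconds, as given in the question.
latency_ps = {"F": 210, "D": 90, "A": 110, "M": 240, "W": 50}


def cycle_after_split(stages, victim, latch_ps=20):
    """Clock cycle time if `victim` is split into two equal halves."""
    others = [t for name, t in stages.items() if name != victim]
    # The slowest remaining stage (or half-stage) plus the latch sets the clock.
    return max(others + [stages[victim] / 2]) + latch_ps


for name in latency_ps:
    print(name, cycle_after_split(latency_ps, name))

best = min(latency_ps, key=lambda name: cycle_after_split(latency_ps, name))
print("split:", best)
```

Splitting any stage other than M leaves M as the bottleneck at 260 ps, so only splitting M shortens the clock.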
