Programming Assignments: A1 - SystemC and OpenMP

The document outlines the programming assignments for CS701 High Performance Computing. It includes two assignments - one focused on SystemC and OpenMP (A1) and another focused on CUDA and OpenCL (A2). For A1, students are asked to implement designs including a full adder, single register, and basic interconnection network using SystemC. They also must complete OpenMP programs including printing thread information, summing arrays in parallel, and matrix multiplication. A2 focuses on parallel programming with CUDA and OpenCL. Students must implement matrix multiplication and SAXPY operations on both platforms, and the OpenCL program should print host and device environment details.

Uploaded by

Himanshu Patel

CS701 High Performance Computing

Programming Assignments
A1 - SystemC and OpenMP
SystemC Programming
Points to Note: (a) Soft Deadline: 00:00 AM, August 10. Hard Deadline: 00:00 AM, August 12. Submissions are to be done
through email only. Pack your report, code, screenshots, and other files in an archive and mail it to [email protected].
(b) Bonus marks for creative problem solving. (c) All are team assignments. No more than two students in a
team. One submission per team.
Submission guidelines: (a) Assignment report: the answer to each question will typically contain block diagrams/microarchitecture, a brief explanation, and other relevant info. (b) Auxiliary files to submit: per question,
include one or more of the following files along with the report wherever valid: SystemC code of the design
and the testbench, execution screenshots, VCD dumps, gtkwave screenshots.
1. Full Adder. Implement a combinational full adder (FA).
2. Single Register. Implement an 8-bit register inside a Register Block. The register block takes three inputs:
(a) a read bit, (b) a write bit, and (c) 8-bit write data. It has one output: 8-bit read data. The register block
works as follows. At the positive edge of the clock:
If the read input is ON, output the value from the register.
If the write input is ON, write the value from write data into the register.
If both read and write are ON, the read precedes the write.
3. Basic interconnection network. Implement a two-node point-to-point interconnection network as shown in
the following figure.

Implement a version exhibiting the following behaviour: after random intervals, A sends one message to B, and B
responds with 4 replies. A prints the sent and received messages at the output.

OpenMP Programming
1. Hello World program. Fork multiple threads from the main process. All threads are assigned
individual identifiers by the main process. Each thread should identify itself and print a hello world
message. The master thread should print the environment information. Environment information includes the
total number of CPUs/cores available to OpenMP (use omp_get_num_procs()), the current thread ID in the parallel
region, the total number of threads available in this parallel region, and the total number of threads requested.
2. Sum of Two Arrays. Compute the element-wise sum of two large arrays A and B and populate array C
(C[i] = A[i] + B[i] in a loop). Portions of the arrays are computed in parallel across the team of threads.
3. Matrix Multiply. Implement a parallel multiplication of large matrices (100 x 100 or larger).
Threads can share row iterations evenly.

A2 - CUDA and OpenCL


Points to Note: (a) Soft Deadline: 00:00 AM, August 22. Hard Deadline: 00:00 AM, August 24. Submissions
are to be done through email only. Pack your report, code, screenshots, and other files in an archive and mail it to
[email protected]. (b) Bonus marks for creative problem solving. (c) All are team assignments. No more
than two students in a team. One submission per team.
CUDA and OpenCL can be used to distribute computational tasks between the CPU (the host) and the graphics
accelerator/GPU (the device). Program the following two problems on CUDA and OpenCL platforms. OpenCL
SDK from AMD is here: http://developer.amd.com/tools-and-sdks/.
1. Matrix Multiply. Write a parallel implementation of the multiplication of large matrices (100 x 100 or larger). Threads
can share row iterations evenly.
2. SAXPY program. SAXPY: S stands for Single precision, A is a scalar value, X and Y are one-dimensional
vectors, and P stands for Plus. The operation is Y[i] = a*X[i] + Y[i]. Write an OpenCL program to perform SAXPY on two
large vectors X and Y. The main program should print the environment details of the host and the device
before beginning computation. You may use functions such as clGetDeviceIDs() and clGetDeviceInfo() to get
the following info: number of platforms, number of devices, device type, number of compute units in the device, clock frequency,
address bits, memory size, and other parameters of your interest.
