Conc Ass 1

This document outlines three questions for a concurrent programming assignment. Students will work in groups of four. Q1 asks students to implement a matrix multiplication algorithm in parallel using OpenMP, Intel TBB, and Cilk++. They must analyze performance with different matrix sizes and block sizes. Optional extension to GPU programming is suggested. Q2 requires studying and analyzing the bitonic sorting algorithm using divide-and-conquer and implementing it using Intel TBB's task-based programming model. Q3 involves designing programs to sort lines of numbers from a file sequentially, using a bitonic sorter implemented as in Q2, and using Intel TBB's pipeline pattern. Performance of each implementation should be analyzed on

Uploaded by

CSEBaba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

45 views1 page

Conc Ass 1

Uploaded by

CSEBaba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

Concurrent Programming*

*- This assignment will be will have about 4 questions and they will be added during the course. Students
groups (4 students) can work on this assignment. The final submission date will be announced later.

Q1.) Write a well optimized parallel algorithm with support for cache (blocking) for the following matrix
operation using OpenMP, Intel Thread Building Blocks and Cilk++.
A = B*C + B*D’

All matrices are of same dimension (n x n). You must provide a performance analysis with different
matrix dimensions and block sizes.

Optional (not be evaluated but highly encouraged)

Try the above implementation in GPU with CUDA/OpenCL programming. Learn different memory
structures in GPU (texture) and warp methods and test them with your implementation.
(http://developer.nvidia.com/object/cuda_3_2_downloads.html)

Q2) Bitonic search is a network sorting algorithm efficient with multiprocessors. Study the Bitonic search
algorihms (http://en.wikipedia.org/wiki/Bitonic_sorter) and analyze it using the divide and conquer
pattern. Use the fork-join pattern to implement the algorithms. Use Intel thread building blocks and its
Task-Based Programming model.

Q3) Assume you are given a file which contains lines of numbers separated by spaces. Each line consists
of 1000-2000 or more numbers and there should about 10000 or more such lines. (you have to create a
such file). You have to design a program that read each line from the file and sort it and write the sorted
lines to another file. For the sorting you have to use a Bitonic sorter. You are required to,

A.) Implement program for sequential processing.

B.) Implement using the algorithm you developed in Q2) above.
C.) Implement the program using the pipeline pattern using Intel Thread Building Blocks. Design
appropriate pipeline stages for best performance gains. Apply cache optimization if possible.
Experiment with the pipeline stages with the Bitonic sort itself.
D.) Show performances of each of the implementations. Test you program on different multicore
machines.

Assign 1-Statistical Summaries Using Pthreads
No ratings yet
Assign 1-Statistical Summaries Using Pthreads
4 pages
Operating Systems Lab Manual JNTU
100% (1)
Operating Systems Lab Manual JNTU
9 pages
A1422296549 23789 15 2020 Task3
No ratings yet
A1422296549 23789 15 2020 Task3
11 pages
2022 ST2 Main
No ratings yet
2022 ST2 Main
4 pages
Memory Management Problem Set
No ratings yet
Memory Management Problem Set
1 page
Os Models Questions
No ratings yet
Os Models Questions
2 pages
Assignment Questions
No ratings yet
Assignment Questions
3 pages
Mid 19
No ratings yet
Mid 19
3 pages
SIT315 M2 - S2P TaskSheet
No ratings yet
SIT315 M2 - S2P TaskSheet
1 page
Programming Assignments: A1 - Systemc and Openmp
No ratings yet
Programming Assignments: A1 - Systemc and Openmp
2 pages
OS-Lab Sessional-II (Fall-2023)
No ratings yet
OS-Lab Sessional-II (Fall-2023)
3 pages
Parallel Distributed Computing Assignment 2
No ratings yet
Parallel Distributed Computing Assignment 2
2 pages
HPC
No ratings yet
HPC
7 pages
OS Lab Manual 2014
No ratings yet
OS Lab Manual 2014
21 pages
Assignment 3
No ratings yet
Assignment 3
2 pages
PP Manual
No ratings yet
PP Manual
22 pages
206 DSL-PR
No ratings yet
206 DSL-PR
7 pages
Multithreading Lab Worksheet
No ratings yet
Multithreading Lab Worksheet
4 pages
Parallel Project Section 3
No ratings yet
Parallel Project Section 3
2 pages
SPOS Assignment List17-18
No ratings yet
SPOS Assignment List17-18
3 pages
Cs3461 Os Lab Manual Master
No ratings yet
Cs3461 Os Lab Manual Master
75 pages
Untitled
No ratings yet
Untitled
25 pages
Assignment No. 2 PDC 21L-1786
No ratings yet
Assignment No. 2 PDC 21L-1786
6 pages
PDC Lab Manual
No ratings yet
PDC Lab Manual
5 pages
Cs3461 Os Lab Manual Master
100% (1)
Cs3461 Os Lab Manual Master
75 pages
II Usecase Project OS COA DSA An Batch
No ratings yet
II Usecase Project OS COA DSA An Batch
13 pages
CLAT3 - Set B
No ratings yet
CLAT3 - Set B
3 pages
OS Scheduling & Memory Assignment
No ratings yet
OS Scheduling & Memory Assignment
4 pages
Data Structures Lab Cycle Final-1
No ratings yet
Data Structures Lab Cycle Final-1
2 pages
HPC Codes
No ratings yet
HPC Codes
14 pages
APT06 2024S2 New
No ratings yet
APT06 2024S2 New
21 pages
Interview Qns
No ratings yet
Interview Qns
5 pages
Os Lab 11
No ratings yet
Os Lab 11
3 pages
OS Lab Final Fall'23
No ratings yet
OS Lab Final Fall'23
2 pages
OS Lab Manual for Engineering Students
No ratings yet
OS Lab Manual for Engineering Students
74 pages
Instructions:: Q1. Answer The Following Questions: (Marks 10)
No ratings yet
Instructions:: Q1. Answer The Following Questions: (Marks 10)
3 pages
Attachment 1 27
No ratings yet
Attachment 1 27
6 pages
O.S. Lab Assignment
No ratings yet
O.S. Lab Assignment
1 page
OS Design & Implementation Course
No ratings yet
OS Design & Implementation Course
9 pages
Lab Syllabus
No ratings yet
Lab Syllabus
21 pages
18CS73
No ratings yet
18CS73
2 pages
Mock 41243 1712322751002
No ratings yet
Mock 41243 1712322751002
36 pages
Computing Lab Lab Test 2
No ratings yet
Computing Lab Lab Test 2
2 pages
OS Lab Guide for CS Students
No ratings yet
OS Lab Guide for CS Students
8 pages
DSL
No ratings yet
DSL
5 pages
High Performance Computing Labs & Concepts
No ratings yet
High Performance Computing Labs & Concepts
5 pages
Operating System Labsheet
No ratings yet
Operating System Labsheet
18 pages
Algorithm Design Assignment
No ratings yet
Algorithm Design Assignment
3 pages
CSL201 - KQB KtuQbank
No ratings yet
CSL201 - KQB KtuQbank
8 pages
CS69201 Week9
No ratings yet
CS69201 Week9
5 pages
Assignment 6 - P1
No ratings yet
Assignment 6 - P1
7 pages
Parallel Processing Previous Year Question
No ratings yet
Parallel Processing Previous Year Question
11 pages
Model Os QP 2024
No ratings yet
Model Os QP 2024
3 pages
Os Record
No ratings yet
Os Record
28 pages
HPC Codes-2
No ratings yet
HPC Codes-2
15 pages
CSC2002S PCP1 Assignment 2025
No ratings yet
CSC2002S PCP1 Assignment 2025
3 pages
6th Sem Syllabus PDF
No ratings yet
6th Sem Syllabus PDF
6 pages

Conc Ass 1

Uploaded by

Conc Ass 1

Uploaded by

Concurrent Programming*

Optional (not be evaluated but highly encouraged)

A.) Implement program for sequential processing.

You might also like