Name: CS6023: GPU Programming Assignment 2 solution
SKU: 43388
Price: 30.00 USD
Availability: InStock

Description

5/5 - (1 vote)

1. Problem Statement

Given four input matrices 𝐴, 𝐵, 𝐶, and 𝐷. Compute the output matrix, 𝑋 = (𝐴 + 𝐵 𝑇 ) 𝐶 𝐷 𝑇 Write an efficient code to compute the output matrix. While writing the code, consider aspects like memory coalescing, shared memory, degree of divergence, etc.

2. Input and Output

2.1. Input ● 4 integers: 𝑝, 𝑞, 𝑟 and 𝑠 ● Matrix 𝐴 of size 𝑝 × 𝑞 ● Matrix 𝐵 of size 𝑞 × 𝑝 ● Matrix 𝐶 of size 𝑞 × 𝑟 ● Matrix 𝐷 of size 𝑠 × 𝑟 2.2. Output ● Matrix 𝑋 of size 𝑝 × 𝑠 2.3. Constraints ● 2 ≤ 𝑝, 𝑞, 𝑟, 𝑠 ≤ 2 10 ● All the elements in the input matrices will be in the range [-10, 10]

3. Sample Testcase

● Input matrices 𝐴, 𝐵, 𝐶 and 𝐷: Input will be given as: 2 3 3 2 2 5 0 3 -2 1 6 1 -4 2 1 3 1 9 6 -6 7 2 2 4 -3 10 0 5 1 3 -3 First line represents the values 𝑝, 𝑞, 𝑟 and 𝑠 Next 𝑝 lines represents the rows of matrix 𝐴 Next 𝑞 lines represents the rows of matrix 𝐵 Next 𝑞 lines represents the rows of matrix 𝐶 Next 𝑠 lines represents the rows of matrix 𝐷 ● (𝐴 + 𝐵 𝑇 ) ● Output matrix, 𝑋 = (𝐴 + 𝐵 𝑇 ) 𝐶 𝐷 𝑇

4. Points to be noted

● The file ‘main.cu’ provided by us contains the code, which takes care of taking the input, printing the result and printing the execution time. ● Don’t write any code in the main() function. ● You need to implement the compute() function provided in the ‘main.cu’. ● You are free to use any number of functions/kernels. ● You can launch the kernels as you wish. ● It is compulsory to optimize for coalesced accesses. Also, make use of shared memory. ● Do not write any print statements. ● Test your code on large input matrices.

5. Submission Guidelines

● Use the file ‘main.cu’ provided by us. ● Don’t change anything in the main() function. ● Rename the file ‘main.cu’, which contains the implementation of the above-described functionality, to .cu ● For example, if your roll number is CS20M039, then the name of the file you submit on the Moodle should be CS20M039.cu (submit only the .cu file). ● After submission, download the file and make sure it was the one you intended to submit.

6. Learning Suggestions

● Write a CPU-version of code achieving the same functionality. Time the CPU code and GPU code separately for large matrices and compare the performances. ● Exploit shared memory as much as possible to gain performance benefits. ● Try reducing thread divergence as much as possible.

CS6023: GPU Programming Assignment 2 solution

Description

1. Problem Statement

2. Input and Output

3. Sample Testcase

4. Points to be noted

5. Submission Guidelines

6. Learning Suggestions

Related products

CS6023: GPU Programming Assignment 1 solution

CS6023 : GPU Programming Assignment 3 solution

CS6023: GPU Programming Assignment 4 solution