Kürzlich gesucht

Keine Ergebnisse gefunden

Tags

Keine Ergebnisse gefunden

Dokument

Keine Ergebnisse gefunden

Startseite Schulen Themen

Anmelden

Exercise2(Inter-BlockSynchronization, BonusCredits ) Exercise1(SortingNetworks, 4Credits ) DueDate23.07.2014 AssignmentonMassivelyParallelAlgorithms-Sheet11

Aktie "Exercise2(Inter-BlockSynchronization, BonusCredits ) Exercise1(SortingNetworks, 4Credits ) DueDate23.07.2014 AssignmentonMassivelyParallelAlgorithms-Sheet11"

N/A

N/A

Protected

Studienjahr: 2021

Info

Protected

Academic year: 2021

Aktie "Exercise2(Inter-BlockSynchronization, BonusCredits ) Exercise1(SortingNetworks, 4Credits ) DueDate23.07.2014 AssignmentonMassivelyParallelAlgorithms-Sheet11"

Copied!

1

0

0

1

0

0

Wird geladen.... (Jetzt Volltext ansehen)

Jetzt herunterladen ( 1 Seite )

Volltext

(1)

Prof. G. Zachmann A. Srinivas

University of Bremen School of Computer Science

CGVR Group July 16, 2014

Summer Semester 2014

Assignment on Massively Parallel Algorithms - Sheet 11

Due Date 23. 07. 2014

Exercise 1 (Sorting Networks, 4 Credits )

a) Modify the bubble sort cuda implementation (single block) in the previous assignment (assignment 10) so that it can handle array lengths greater than 2 times the maximum number of threads per block for device (GPU) used (using multiple blocks).

b) Compare the runtimes of parallel version of bubble sort (implemented above) with the sequential version. Plot a graph of speed up ( where speed up = runtime of sequential version / runtime of parallel version) along y axis vs size of input array along x axis. Interpret the plot and provide your arguments.

Hint: consider logarithm of size of input array along the x axis while plotting the above graph.

Exercise 2 (Inter-Block Synchronization , Bonus Credits)

a) Is it possible to achieve global synchronization of all threads in all blocks within a CUDA kernel method? Support your answer with appropriate arguments.

1

Referenzen

Jetzt herunterladen ( PDF - 1 Seite - 85.61 KB )

ÄHNLICHE DOKUMENTE

Exercise2(MatrixVectorMultiplication, Credits ) Exercise1(Histogram, Credits ) DueDate AssignmentonMassivelyParallelAlgorithms-Sheet3

b) Implement a method to store the above Matrix in column major order and then modify the above Matrix vector multiplication kernel to handle matrix stored in column major order ..

Exercise2(Lineofsightusingmaxscanoperation, 5Credits ) Exercise1(SegmentedScan, 2Credits ) DueDate AssignmentonMassivelyParallelAlgorithms-Sheet4

i) Note that the Blelloch Algorithm performs exclusive scan operation. Please perform appropriate modifications to generate the inclusive max scan result.. ii) Use the

Exercise2(Labdemos, 4Credits ) Exercise1(VirtualReality, 5Credits ) DueDateOctober29.2017 AssignmentonVirtualRealityandPhysically-BasedSimulation-Sheet1

b) Imagine the following scenario: You are standing on a glass floor, from beneath that glass floor a virtual skyscraper is being projected, so that you can see your own body

Exercise2(Presence, 4Credits ) Exercise1(VirtualReality, 4Credits ) DueDate04.11.2014 AssignmentonVirtualRealityandPhysically-BasedSimulation-Sheet1

a) Form groups of four people and either try out the demo ”Titan of Space” and ”Lava” on Oculus Rift 2 device or watch a movie in the cinema theatre and answer the following

Exercise2(Amdahl’slaw, 2Credits ) Exercise1(Moore’sLawandPowerconsumption, 3Credits ) DueDate30.04.2014 AssignmentonMassivelyParallelAlgorithms-Sheet1

a) Consider two approaches of doubling the number of transistors: halving the size of a single transistor while maintaining constant die area (Moore’s Law) versus maintaining the

Exercise2(CUDAbasics:Launchingkernels, 3Credits ) Exercise1(CUDAbasics:Memory, 3Credits ) DueDate07.05.2014 AssignmentonMassivelyParallelAlgorithms-Sheet2

Hint: You can use one of the examples on the lecture homepage or from the Cuda SDK ( included in the Cuda installation package ) to test if Cuda works at all on your computer.

Exercise2(FindtheSynchronizationBug, 2Credits ) Exercise1(Reduceoperations, 8Credits ) DueDate21.05.2014 AssignmentonMassivelyParallelAlgorithms-Sheet4

b) Implement another version of the kernel using global memory only for all intermediate results.. Note: CUDA does not support synchronization across different blocks of a

Exercise2(MatrixMultiplicationforAPSP, 5Credits ) Exercise1(MatrixVectorMultiplication, 5Credits ) DueDate11.06.2014 AssignmentonMassivelyParallelAlgorithms-Sheet6

Hint: Please note that the tiled version of Matrix Multiplication is used in the above given framework and use the similarities between algorithm EXTEND-PATH and Matrix

ÄHNLICHE DOKUMENTE

Homework Assignment 10

Homework Assignment 10

2

0

0

Task 1 Sort the array

Task 1 Sort the array

5

0

0

Distributed Systems 2014 – Assignment 2

Distributed Systems 2014 – Assignment 2

23

0

0

Homework assignment 10

Homework assignment 10

1

0

0