Starting from:

$24.99

ECE408 Parallel Scan Solution

Objective
Prerequisites
Before starting this lab, make sure that:
Instruction
Edit the code in the code tab to perform the following:
allocate device memory copy host memory to device
initialize thread block and kernel grid dimensions
invoke CUDA kernel copy results from device to host deallocate device memory
implement the work efficient scan routine
use shared memory to reduce the number of global memory accesses, handle the boundary conditions when loading input list elements into the shared memory
Instructions about where to place each part of the code is demarcated by the //@@ comment lines.

More products