CUDA Programming 1

Write a cuda kernel to initialize an integer array, copy the result back to the CPU, and then print all the elements on the CPU. In this assignment, we assume there are 16 integers in the array and thread block size is 4.