CUDA, (the Compute Unified Device Architecture), is a parallel computing platform and programming model created by NVIDIA and implemented by the graphics processing units (GPUs) that they produce. CUDA gives program developers direct access to the virtual instruction set and memory of the parallel computational elements in CUDA GPUs.
Python is a high level programming language that aims to combine remarkable power with very clear syntax. Anaconda is a free cross-platform Python distribution from Continuum Analytics. It comes built in with various scientific Python packages such as NumPy, SciPy, Pandas, Matplotlib, Numba, etc.
The Hoard memory allocator is a fast, scalable, and memory-efficient memory allocator for Linux, Solaris, Mac OS X, and Windows. Hoard is a drop-in replacement for malloc that can dramatically improve application performance, especially for multithreaded programs running on multiprocessors and multicore CPUs.
Python is a high level programming language that aims to combine remarkable power with very clear syntax. The Enthought Python Distribution is a cross-platform environment for scientific computing in Python, and includes the Canopy IDE and package manager. MSI has installed an academic-licensed version that includes hundreds of modules, including tools that enable efficient parallel computations.
DDT is a debugger for serial and parallel applications written in C, C++,i FORTRAN 77, FORTRAN 90. It supports multiple parallel programming paradigms including MPI, and OpenMP.
The Intel(R) VTune(TM) Amplifier XE provides information on the code performance for users developing serial and multithreaded applications On Linux systems, VTune Amplifier XE works as a standalone GUI client. You can benefit from using the command-line interface for collecting data remotely or for performing regression testing. VTune Amplifier XE helps you analyze the algorithm choices and identify where and how your application can benefit from available hardware resources.
GNU DDD is a graphical front-end for command-line debuggers such as GDB, DBX, WDB, Ladebug, JDB, XDB, the Perl debugger, the bash debugger, or the Python debugger. Besides ``usual'' front-end features such as viewing source texts, DDD has become famous through its interactive graphical data display, where data structures are displayed as graphs.
Lahey/Fujitsu Fortran 95 Express is a Fortran 77, 90, and 95 compiler and debugger.
TotalView is a multiprocess debugger for applications written in C, C++, FORTRAN 77, Fortran 90, and PGI HPF. It supports multiple parallel programming paradigms including MPI, PVM and OpenMP.
gdb - The GNU Debugger. It allows you to debug programs written in C, C++, FORTRAN, and Modula-2.