- (abs, pdf) Haschke et al., Metallicity distribution functions of the old populations of the Magellanic Clouds from RR Lyrae stars
- (abs, pdf) Hopkins et al., Realistic Stellar Feedback & Bulge Formation in Clumpy Disks
- (abs, pdf) Salvadori & Ferrara, First stars in Damped Lyman Alpha systems
- (abs, pdf) Hassan et …
Larne Pekowsky (CRA)
- CUDA FFT is a drop-in replacement for FFTW. Because of memory bandwidth, CUDA is faster than FFTW for 1D FFTs above 2^14 elements. Tested on a GTX 580.
- Used in LIGO data analysis.
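One way a caller might act on the reported crossover is to pick the FFT backend by transform length. The sketch below is illustrative only: the function name `pick_fft_backend` and the `"cpu"`/`"gpu"` labels are hypothetical, and the 2^14 threshold is the figure quoted above for a GTX 580, so it would likely need re-measuring on other hardware.

```python
# Crossover size quoted in the talk for 1D FFTs on a GTX 580 (assumption:
# other cards and host CPUs will have a different crossover).
GPU_CROSSOVER = 2 ** 14


def pick_fft_backend(n_elements, crossover=GPU_CROSSOVER):
    """Return "gpu" for 1D transforms longer than the crossover, else "cpu".

    Hypothetical helper: it only encodes the size rule, it does not run an FFT.
    """
    if n_elements < 1:
        raise ValueError("n_elements must be a positive integer")
    return "gpu" if n_elements > crossover else "cpu"


if __name__ == "__main__":
    for n in (2 ** 10, 2 ** 14, 2 ** 20):
        print(n, pick_fft_backend(n))
```

Keeping the threshold as a parameter makes it easy to replace the quoted value with one measured on the target machine.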
Lionel London (CRA) – GPU Computing in Matlab
- Matlab Parallel Computing Toolbox (PCT) vs. Jacket
- PCT allows for multi-CPU and GPU computing. It is limited to 12 cores on the local machine and is very high level.
- GPU computing with PCT requires an NVIDIA card with compute capability 1.3. GPGPU with Jacket has relaxed requirements but is very expensive ($4k for 5 …
- (abs, pdf) Pritchard & Loeb, 21-cm cosmology
- (abs, pdf) Lin et al., Large-Scale High-Lundquist Number Reduced MHD Simulations of the Solar Corona Using GPU Accelerated Machines
- (abs, pdf) Maeder & Meynet, Rotating massive stars through the ages, with applications to WR stars, Pop III stars and Gamma Ray Bursts
Matt Kinsey: Porting the 2D Wave Equation to the GPU
- Optimal number of threads per block is 32*n-1, where n is an integer. The best performance in the example shown was 63 threads per block.
- Minimum number of blocks per grid is 32, according to the user's guide.
- Every time a kernel is called, the memory needs to …
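The launch-size rules above can be checked with a little arithmetic before writing any kernel. The following is a minimal sketch, assuming a 1D launch that covers every grid point with one thread; the helper name `launch_config` is hypothetical, and the defaults (63 threads per block, at least 32 blocks) are simply the values reported in the talk.

```python
def launch_config(n_points, threads_per_block=63, min_blocks=32):
    """Return (blocks, idle_threads, meets_min_blocks) for a 1D launch.

    blocks           -- blocks needed so that blocks * threads_per_block >= n_points
    idle_threads     -- threads in the last block that have no point to update
    meets_min_blocks -- whether the launch reaches the quoted minimum block count
    """
    if n_points < 1 or threads_per_block < 1:
        raise ValueError("n_points and threads_per_block must be positive")
    # Ceiling division: round up so the final, partially filled block is counted.
    blocks = (n_points + threads_per_block - 1) // threads_per_block
    idle_threads = blocks * threads_per_block - n_points
    return blocks, idle_threads, blocks >= min_blocks


if __name__ == "__main__":
    # A 512 x 512 grid flattened to one thread per point.
    print(launch_config(512 * 512))
    # A small problem that falls short of the 32-block minimum.
    print(launch_config(1000))
```

Because the block count is rounded up, a kernel launched this way would need a bounds check so the idle threads in the last block do not write past the end of the array.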
- (abs, pdf) Fluke et al., Astrophysical Supercomputing with GPUs: Critical Decisions for Early Adopters
- (abs, pdf) Schive et al., GAMER: a GPU-Accelerated Adaptive Mesh Refinement Code for Astrophysics
- (abs, pdf) Wang et al., Adaptive Mesh Fluid Simulations on GPU
- (abs, pdf) Jonsson & Primack, Accelerating Dust Temperature Calculations with Graphics …