View article

[PDF] from academia.edu

Benchmarking of communication techniques for GPUs

Authors

M Bernaschi, M Bisson, D Rossetti

Publication date

2013/2

Journal

Journal of Parallel and Distributed Computing

Volume

Issue

Pages

250-255

Publisher

Elsevier

Description

We report about the performances obtained, at the application level, by two MPI implementations for Infiniband that allow direct exchange of data stored in the global memory of Graphic Processing Units (GPU) based on the Nvidia CUDA. For the same purpose, we tested also the Application Programming Interface of APEnet, which is a custom, high performance interconnect technology. As a benchmark we consider the time required to update a single spin of the 3D Heisenberg spin glass model by using the over-relaxation algorithm. The results show that CUDA streams are instrumental in achieving the best possible performances.

Total citations

Cited by 24

2013201420152016201720182019202020216 7 3 1 4 1 2

Scholar articles

Benchmarking of communication techniques for GPUs

M Bernaschi, M Bisson, D Rossetti - Journal of Parallel and Distributed Computing, 2013