Authors
Gabor Csordas, Mikhail Asiatici, Paolo Ienne
Publication date
2019/12/9
Conference
2019 International Conference on Field-Programmable Technology (ICFPT)
Pages
188-196
Publisher
IEEE
Description
For efficient acceleration on FPGA, it is essential for external memory to match the throughput of the processing pipelines. However, the usable DRAM bandwidth decreases significantly if the access pattern causes frequent row conflicts. Memory controllers reorder DRAM commands to minimize row conflicts; however, general-purpose controllers must also minimize latency, which limits the depth of the internal queues over which reordering can occur. For latency-insensitive applications with irregular access pattern, nonblocking caches that support thousands of in-flight misses (miss-optimized memory systems) improve bandwidth utilization by reusing the same memory response to serve as many incoming requests as possible. However, they do not improve the irregularity of the access pattern sent to the memory, meaning that row conflicts will still be an issue. Sending out bursts instead of single memory requests …
Total citations
20212022202320243212
Scholar articles
G Csordas, M Asiatici, P Ienne - 2019 International Conference on Field-Programmable …, 2019