Authors
Eddy Caron, Gil Utard
Publication date
2004/3/1
Journal
Parallel Computing
Volume
30
Issue
3
Pages
357-375
Publisher
North-Holland
Description
In this paper, we present an analytical performance model of the parallel left–right looking out-of-core LU factorization algorithm for cluster-like architectures. We show the accuracy of the performance prediction model for the ScaLAPACK library. We analyze the overhead introduced by the out-of-core part of the algorithm and we outline a limitation which was never seen before: for large problems the algorithm has a poor efficiency. This overhead is divided into an IO part and a communication part. We derive an overlapping scheme and minimum memory requirement to avoid the IO overhead. The new scheme is validated by a prototype implementation in ScaLAPACK. We show the impact of the communication overhead on two-dimensional distributions. Then we show that with similar memory requirements a second overlapping scheme may be implemented to avoid the communication overhead. If the size of the …
Total citations
200320042005200620072008200920102011201220131442131211