Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Multicore Architectures
Emmanuel Agullo (Dpt of Electrical Engineering and Computer Science, University of Tennessee)
Henricus Bouwmeester (Dpt of Mathematical and Statistical Sciences, University of Colorado Denver)
Jack Dongarra (Dpt of Electrical Engineering and Computer Science, University of Tennessee)
Jakub Kurzak (Dpt of Mathematical and Statistical Sciences, University of Colorado Denver)
Julien Langou (University of Colorado Denver)
Lee Rosenberg (Dpt of Mathematical and Statistical Sciences, University of Colorado Denver)
Abstract:
The algorithms in the current sequential numerical linear algebra libraries (e.g. LAPACK) do not parallelize well on multicore architectures. A new family of algorithms, the tile algorithms, has recently been introduced. Previous research has shown that it is possible to write efficient and scalable tile algorithms for performing a Cholesky factorization, a (pseudo) LU factorization, a QR factorization, and computing the inverse of a symmetric positive definite matrix. In this extended abstract, we revisit the computation of the inverse of a symmetric positive definite matrix. We observe that, using a dynamic task scheduler, it is relatively painless to translate existing LAPACK code to obtain a ready-to-be-executed tile algorithm. However we demonstrate that, for some variants, non trivial compiler techniques (array renaming, loop reversal and pipelining) need then to be applied to further increase the parallelism of the application. We present preliminary experimental results.
Keywords:
Numerical Algorithms for CS&E, Numerical Algorithms for Multicore Architectures