A parallel radix-4 block cyclic reduction algorithm
Myllykoski, M., & Rossi, T. (2014). A parallel radix-4 block cyclic reduction algorithm. Numerical Linear Algebra with Applications, 21(4), 540-556. https://doi.org/10.1002/nla.1909
Published in
Numerical Linear Algebra with ApplicationsDate
2014Copyright
© 2013 John Wiley & Sons, Ltd. This is a final draft version of an article whose final and definitive form has been published by Wiley. Published in this repository with the kind permission of the publisher.
A conventional block cyclic reduction algorithm operates by halving the size of the linear system at each reduction step, that is, the algorithm is a radix-2 method. An algorithm analogous to the block cyclic reduction known as the radix-q partial solution variant of the cyclic reduction (PSCR) method allows the use of higher radix numbers and is thus more suitable for parallel architectures as it requires fever reduction steps. This paper presents an alternative and more intuitive way of deriving a radix-4 block cyclic reduction method for systems with a coefficient matrix of the form tridiag{ − I,D, − I}. This is performed by modifying an existing radix-2 block cyclic reduction method. The resulting algorithm is then parallelized by using the partial fraction technique. The parallel variant is demonstrated to be less computationally expensive when compared to the radix-2 block cyclic reduction method in the sense that the total number of emerging subproblems is reduced. The method is also shown to be numerically stable and equivalent to the radix-4 PSCR method. The numerical results archived correspond to the theoretical expectations.
...
Publisher
John Wiley & Sons Ltd.ISSN Search the Publication Forum
1070-5325Keywords
Publication in research information system
https://converis.jyu.fi/converis/portal/detail/Publication/22534584
Metadata
Show full item recordCollections
Related items
Showing items with similar title or keywords.
-
Fast Poisson solvers for graphics processing units
Myllykoski, Mirko; Rossi, Tuomo; Toivanen, Jari (Springer, 2013)Two block cyclic reduction linear system solvers are considered and implemented using the OpenCL framework. The topics of interest include a simplified scalar cyclic reduction tridiagonal system solver and the impact ... -
On GPU-accelerated fast direct solvers and their applications in image denoising
Myllykoski, Mirko (University of Jyväskylä, 2015) -
Poissonin yhtälön nopeat ratkaisijat
Jauhiainen, Susanne (2016)Tutkielmassa esitellään Poissonin yhtälö sekä sen diskretointi. Lisäksi käydään läpi kaksi nopeaa numeerista menetelmää yhtälön ratkaisemiseksi. Yksinkertaisuuden vuoksi rajoitutaan kaksiulotteisiin tehtäviin, joissa on ... -
Parallel global optimization : structuring populations in differential evolution
Weber, Matthieu (University of Jyväskylä, 2010) -
On solving separable block tridiagonal linear systems using a GPU implementation of radix-4 PSCR method
Myllykoski, Mirko; Rossi, Tuomo; Toivanen, Jari (Academic Press, 2018)Partial solution variant of the cyclic reduction (PSCR) method is a direct solver that can be applied to certain types of separable block tridiagonal linear systems. Such linear systems arise, e.g., from the Poisson and ...