Parallelizing a high-order WENO scheme for complicated flow structures on GPU and MIC

Liang Deng; Fang Wang; Han-Li Bai; Qing-Xin Xu

doi:10.2991/iccse-15.2015.86

<Previous Article In Volume

Next Article In Volume>

Parallelizing a high-order WENO scheme for complicated flow structures on GPU and MIC

Authors

Liang Deng, Fang Wang, Han-Li Bai, Qing-Xin Xu

Corresponding Author

Liang Deng

Available Online July 2015.

DOI: 10.2991/iccse-15.2015.86 How to use a DOI?
Keywords: High-order, WENO, Fermi GPU, Kepler GPU, Intel MIC coprocessor, Optimization techniques
Abstract: As a conservative, high-order accurate, shock-capturing method, weighted essentially non-oscillatory (WENO) scheme have been widely used to effectively resolve complicated flow structures in computational uid dynamics (CFD) simulations. However, using a high-order WENO scheme can be highly time-consuming, which greatly limits the CFD application’s performance efficiency. In this paper, we present various parallel strategies base on the latest many-core platform such as NVIDIA Fermi GPU, NVIDIA Kepler GPU and Intel MIC coprocessor to accelerate a high-order WENO scheme. Comparison analysis of the two generations GPUs between Fermi and Kepler, and cross-platform performance analysis (focusing on Kepler GPU and MIC) are also detailed discussed. The experiments show that the Kepler GPU offers a clear advantage in contrast to the previous Fermi GPU maintaining exactly the same source code. Furthermore, while Kepler GPU can be several times faster than MIC without utilizing the increasingly available SIMD computing power on Vector Processing Unit (VPU), MIC can provide the computing capability equivalent to Kepler GPU when VPU is utilized. Our implementations and optimization techniques can serve as case studies for paralleling high-order schemes on many-core architectures.
Copyright: © 2015, the Authors. Published by Atlantis Press.
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

<Previous Article In Volume

Next Article In Volume>

Volume Title: Proceedings of the 2015 International Conference on Computational Science and Engineering
Series: Advances in Computer Science Research
Publication Date: July 2015
ISBN: 978-94-62520-89-9
ISSN: 2352-538X
DOI: 10.2991/iccse-15.2015.86 How to use a DOI?
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

ris enw bib

TY  - CONF
AU  - Liang Deng
AU  - Fang Wang
AU  - Han-Li Bai
AU  - Qing-Xin Xu
PY  - 2015/07
DA  - 2015/07
TI  - Parallelizing a high-order WENO scheme for complicated flow structures on GPU and MIC
BT  - Proceedings of the 2015 International Conference on Computational Science and Engineering
PB  - Atlantis Press
SP  - 459
EP  - 470
SN  - 2352-538X
UR  - https://doi.org/10.2991/iccse-15.2015.86
DO  - 10.2991/iccse-15.2015.86
ID  - Deng2015/07
ER  -

download .riscopy to clipboard