Proceedings of the 2015 International Conference on Computational Science and Engineering

Parallelizing a high-order WENO scheme for complicated flow structures on GPU and MIC

Authors
Liang Deng, Fang Wang, Han-Li Bai, Qing-Xin Xu
Corresponding Author
Liang Deng
Available Online July 2015.
DOI
10.2991/iccse-15.2015.86How to use a DOI?
Keywords
High-order, WENO, Fermi GPU, Kepler GPU, Intel MIC coprocessor, Optimization techniques
Abstract

As a conservative, high-order accurate, shock-capturing method, weighted essentially non-oscillatory (WENO) scheme have been widely used to effectively resolve complicated flow structures in computational uid dynamics (CFD) simulations. However, using a high-order WENO scheme can be highly time-consuming, which greatly limits the CFD application’s performance efficiency. In this paper, we present various parallel strategies base on the latest many-core platform such as NVIDIA Fermi GPU, NVIDIA Kepler GPU and Intel MIC coprocessor to accelerate a high-order WENO scheme. Comparison analysis of the two generations GPUs between Fermi and Kepler, and cross-platform performance analysis (focusing on Kepler GPU and MIC) are also detailed discussed. The experiments show that the Kepler GPU offers a clear advantage in contrast to the previous Fermi GPU maintaining exactly the same source code. Furthermore, while Kepler GPU can be several times faster than MIC without utilizing the increasingly available SIMD computing power on Vector Processing Unit (VPU), MIC can provide the computing capability equivalent to Kepler GPU when VPU is utilized. Our implementations and optimization techniques can serve as case studies for paralleling high-order schemes on many-core architectures.

Copyright
© 2015, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 2015 International Conference on Computational Science and Engineering
Series
Advances in Computer Science Research
Publication Date
July 2015
ISBN
10.2991/iccse-15.2015.86
ISSN
2352-538X
DOI
10.2991/iccse-15.2015.86How to use a DOI?
Copyright
© 2015, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Liang Deng
AU  - Fang Wang
AU  - Han-Li Bai
AU  - Qing-Xin Xu
PY  - 2015/07
DA  - 2015/07
TI  - Parallelizing a high-order WENO scheme for complicated flow structures on GPU and MIC
BT  - Proceedings of the 2015 International Conference on Computational Science and Engineering
PB  - Atlantis Press
SP  - 459
EP  - 470
SN  - 2352-538X
UR  - https://doi.org/10.2991/iccse-15.2015.86
DO  - 10.2991/iccse-15.2015.86
ID  - Deng2015/07
ER  -