Kirchhoff prestack time migration on large heterogeneous computing systems
Zhao Changhai1, Luo Guoan2, Zhang Xudong2, Wang Shihu2, Zhang Jianlei2, Wang Chengxiang2
1. Beijing Branch, Research & Development Center, BGP Inc. CNPC, Beijing 100088, China;
2. Research & Development Center, BGP Inc. CNPC, Zhuozhou, Hebei 072751, China
Abstract:The size of seismic data from a single survey for the moment has reach to 100TB, and may exceed 1PB in the near future. To support increasingly huge survey data sizes and processing complexity, we propose a practical approach to large-scale parallel processing of 3D prestack Kirchhoff time migration (PSTM) with multi-dimension imaging space decomposition on heterogeneous computing systems. The parallel algorithm is based on three-level decomposition of the imaging space. Firstly, the imaging space is partitioned by offsets. Each node runs just one process, and all processes are divided into several distinct groups. The imaging work of common-offset space is assigned to a group, and the common-offset input traces are dynamically distributed to the processes of the group. Once all input traces are migrated, the local imaging sections of all the processes in a group are added to form the final common-offset image. In a node, the common-offset imaging section is further partitioned equally by CMP. If the size of a common-offset imaging section exceeds the total physical memory on the compute node, the whole imaging space should be firstly partitioned along in-line direction so that each common-offset imaging space can fit in memory. The algorithm greatly reduces the dependencies among tasks. The task partitions can be easily mapped to multiple heterogeneous processors and execute asynchronously. Compared to the production CPU version of PSTM, its GPU version achieves up to 4.8 speed times and MIC version achieves up to 2 speed times. Comparative analysis of GPU and MIC is also given on power consumption, performance, and programmablity. The PSTM implementation can obtain close to linear speedup when it processes real data on the Tianhe-1 supercomputer.
Huang Yi, Shi Xueming, Fan Jianke et al. Review on parallel computing and its application in exploration geophysics.Progress in Geophysics,2010,25(2):642-649.
[3]
Panetta J, de Souza Filho R P R, da Cunha Filho C A et al. Computational characteristics of production seismic migration and its performance on novel processor architectures. Prococeedings of 19th International Symp on Computer Architecture and High Performance Computing, Piscataway,New Jersey:IEEE, 2007, 11-18.
Liu Guofeng, Liu Hong, Wang Xiumin et al. Two kinds of traveling time computation and parallel com-puting methods of Kirchhoff migration. Progress inGeophysics, 2009, 24(1):131-136.
[5]
Thakur R, Rabenseifner R, Gropp W. Optimization of collective communication operations in MPICH. International Journal of High Performance Computing Applications, 2005, 19(1):49-66.
[6]
Shi X, Li C, Wang X et al. A practical approach of curved ray prestack Kirchhoff time migration on GPGPU. Proceedings of the 8th Int Symp on Advanced Parallel Processing Technologies, Springer, Berlin, 2009, 165-176.
[7]
Dai Hengchang.Parallel processing of prestack Kirchhoff time migration on a PC cluster. Computers & Geosciences, 2005, 31(7):891-899.
[8]
Li J, Hei D, Yan L. Partitioning algorithm of 3-Dprestack parallel Kirchhoff depth migration for imaging spaces. Proceedings of 8th Int Conf on Grid and Cooperative Computing, Piscataway,NJ:IEEE, 2009, 276-280.
[9]
Panetta J, Teixeira T, de Souza Filho P R P et al. Accelerating time and depth seismic migration by CPU and GPU cooperation. International Journal of Parallel Programming, 2012, 40(3):290-312.
Wang Huazhong, Cai Jiexiong, Kong Xiangning et al. An implementation of Kirchhoff integral migration for large-scale data. Chinese Journal of Geophysics, 2010, 53(7):1699-1709.
Liu Guofeng, Liu Hong, Li Bo et al. Method of prestack time migration of seismic data of mountai-nous regions and its GPU implementation. Chinese Journal of Geophysics, 2009, 52(12):3101-3108.
Chen Guoliang, Sun Guangzhong, Xu Yun et al. Methodology of research on parallel algorithms. Chinese Journal of Computers, 2008, 31(9):1493-1502.
[13]
Schroeder B,Gibson G A. A Large-scale study of failures in high-performance computing systems. IEEE Trans on Dependable and Secure Computing, 2010, 7(4):337-351.