Hui Liu University of Calgary

Slides:



Advertisements
Similar presentations
A parallel scientific software for heterogeneous hydrogeoloy
Advertisements

Numerical simulation of solute transport in heterogeneous porous media A. Beaudoin, J.-R. de Dreuzy, J. Erhel Workshop High Performance Computing at LAMSIN.
Sparse linear solvers applied to parallel simulations of underground flow in porous and fractured media A. Beaudoin 1, J.R. De Dreuzy 2, J. Erhel 1 and.
CSE 245: Computer Aided Circuit Simulation and Verification Matrix Computations: Iterative Methods (II) Chung-Kuan Cheng.
Parallel Jacobi Algorithm Steven Dong Applied Mathematics.
How to solve a large sparse linear system arising in groundwater and CFD problems J. Erhel, team Sage, INRIA, Rennes, France Joint work with A. Beaudoin.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under contract.
A Discrete Adjoint-Based Approach for Optimization Problems on 3D Unstructured Meshes Dimitri J. Mavriplis Department of Mechanical Engineering University.
1 Numerical Solvers for BVPs By Dong Xu State Key Lab of CAD&CG, ZJU.
Dual Mesh Method in Upscaling Pascal Audigane and Martin Blunt Imperial College London SPE Reservoir Simulation Symposium, Houston, 3-5 February 2003.
Numerical Parallel Algorithms for Large-Scale Nanoelectronics Simulations using NESSIE Eric Polizzi, Ahmed Sameh Department of Computer Sciences, Purdue.
Extending the capability of TOUGHREACT simulator using parallel computing Application to environmental problems.
Stochastic Optimization of Complex Energy Systems on High-Performance Computers Cosmin G. Petra Mathematics and Computer Science Division Argonne National.
Multilevel Incomplete Factorizations for Non-Linear FE problems in Geomechanics DMMMSA – University of Padova Department of Mathematical Methods and Models.
Development of a Reservoir Simulator with Unique Grid-Block System
M ODELING OF D ISCRETE F RACTURE N ETWORK U SING V ORONOI G RID S YSTEM.
Multi-Scale Finite-Volume (MSFV) method for elliptic problems Subsurface flow simulation Mark van Kraaij, CASA Seminar Wednesday 13 April 2005.
An Algebraic Multigrid Solver for Analytical Placement With Layout Based Clustering Hongyu Chen, Chung-Kuan Cheng, Andrew B. Kahng, Bo Yao, Zhengyong Zhu.
Monica Garika Chandana Guduru. METHODS TO SOLVE LINEAR SYSTEMS Direct methods Gaussian elimination method LU method for factorization Simplex method of.
Direct and iterative sparse linear solvers applied to groundwater flow simulations Matrix Analysis and Applications October 2007.
1 Parallel Simulations of Underground Flow in Porous and Fractured Media H. Mustapha 1,2, A. Beaudoin 1, J. Erhel 1 and J.R. De Dreuzy IRISA – INRIA.
An approach for solving the Helmholtz Equation on heterogeneous platforms An approach for solving the Helmholtz Equation on heterogeneous platforms G.
Fast Thermal Analysis on GPU for 3D-ICs with Integrated Microchannel Cooling Zhuo Fen and Peng Li Department of Electrical and Computer Engineering, {Michigan.
ANS 1998 Winter Meeting DOE 2000 Numerics Capabilities 1 Barry Smith Argonne National Laboratory DOE 2000 Numerics Capability
Hans De Sterck Department of Applied Mathematics University of Colorado at Boulder Ulrike Meier Yang Center for Applied Scientific Computing Lawrence Livermore.
ParCFD Parallel computation of pollutant dispersion in industrial sites Julien Montagnier Marc Buffat David Guibert.
CFD Lab - Department of Engineering - University of Liverpool Ken Badcock & Mark Woodgate Department of Engineering University of Liverpool Liverpool L69.
Simulating complex surface flow by Smoothed Particle Hydrodynamics & Moving Particle Semi-implicit methods Benlong Wang Kai Gong Hua Liu
Linear System function [A,b,c] = assem_boundary(A,b,p,t,e); np = size(p,2); % total number of nodes bound = unique([e(1,:) e(2,:)]); % boundary nodes inter.
Distributed Computation: Circuit Simulation CK Cheng UC San Diego
Case Study in Computational Science & Engineering - Lecture 5 1 Iterative Solution of Linear Systems Jacobi Method while not converged do { }
Domain Decomposition in High-Level Parallelizaton of PDE codes Xing Cai University of Oslo.
October 2008 Integrated Predictive Simulation System for Earthquake and Tsunami Disaster CREST/Japan Science and Technology Agency (JST)
A Dirichlet-to-Neumann (DtN)Multigrid Algorithm for Locally Conservative Methods Sandia National Laboratories is a multi program laboratory managed and.
High performance computing for Darcy compositional single phase fluid flow simulations L.Agélas, I.Faille, S.Wolf, S.Réquena Institut Français du Pétrole.
1 Mark F. Adams 22 October 2004 Applications of Algebraic Multigrid to Large Scale Mechanics Problems.
23/5/20051 ICCS congres, Atlanta, USA May 23, 2005 The Deflation Accelerated Schwarz Method for CFD C. Vuik Delft University of Technology
CO 2 maîtrisé | Carburants diversifiés | Véhicules économes | Raffinage propre | Réserves prolongées © IFP Écrire ici dans le masque le nom de votre Direction.
Circuit Simulation using Matrix Exponential Method Shih-Hung Weng, Quan Chen and Chung-Kuan Cheng CSE Department, UC San Diego, CA Contact:
Performance of a Semi-Implicit, Semi-Lagrangian Dynamical Core for High Resolution NWP over Complex Terrain L.Bonaventura D.Cesari.
A Parallel Hierarchical Solver for the Poisson Equation Seung Lee Deparment of Mechanical Engineering
Adaptive grid refinement. Adaptivity in Diffpack Error estimatorError estimator Adaptive refinementAdaptive refinement A hierarchy of unstructured gridsA.
Parallel Computing Activities at the Group of Scientific Software Xing Cai Department of Informatics University of Oslo.
ANSYS, Inc. Proprietary © 2004 ANSYS, Inc. Chapter 5 Distributed Memory Parallel Computing v9.0.
Hybrid Parallel Implementation of The DG Method Advanced Computing Department/ CAAM 03/03/2016 N. Chaabane, B. Riviere, H. Calandra, M. Sekachev, S. Hamlaoui.
High Performance Computing Seminar II Parallel mesh partitioning with ParMETIS Parallel iterative solvers with Hypre M.Sc. Caroline Mendonça Costa.
LabVIEW Real Time for High Performance Control Applications
José S. A. Cavalcante Filho, Thomas D.S. Oliveira, Silvio R.R. Costa,
Algorithm of the explicit type for porous medium flow simulation
Xing Cai University of Oslo
Hasan Nourdeen Martin Blunt 10 Jan 2017
Analysis of the Solver Performance for Stokes Flow Problems in Glass Forming Process Simulation Models Speaker: Hans Groot Supervisors: Dr. Hegen (TNO.
Dual Mesh Method in Dynamic Upscaling
A computational loop k k Integration Newton Iteration
Accelerating MapReduce on a Coupled CPU-GPU Architecture
Meros: Software for Block Preconditioning the Navier-Stokes Equations
Introduction to Scientific Computing II
Innovative Multigrid Methods
Shengxin Zhu The University of Oxford
A robust preconditioner for the conjugate gradient method
Matrix Methods Summary
ArcEOR A posteriori error estimate tools to enhance the performance of
CS 252 Project Presentation
Hybrid Programming with OpenMP and MPI
Introduction to Scientific Computing II
A Software Framework for Easy Parallelization of PDE Solvers
CINECA HIGH PERFORMANCE COMPUTING SYSTEM
Parallelizing Unstructured FEM Computation
Ph.D. Thesis Numerical Solution of PDEs and Their Object-oriented Parallel Implementations Xing Cai October 26, 1998.
A computational loop k k Integration Newton Iteration
Presentation transcript:

Scalable Multi-stage Preconditioners for Numerical Simulations of Multi-Phase Flow in Porous Media Hui Liu University of Calgary Email: hui.j.liu@ucalgary.ca Joint work with Kun Wang and Zhangxin Chen

Outline Introduction Multi-Stage Preconditioners Results and Discussions Conclusions

Introduction

Black Oil Model Conservation laws: Constraints:

Oil-Water Model Conservation laws: Constraints:

Traditional Simulators Designed for desktop/workstation Multi-cores, OpenMP Efficient for small/medium cases Long simulation time for large-scale cases with tens of millions of grid cells Most applications are using the shared-memory programming models – based on OpenMP. The simulation speed is very slow.

New Generration Reservoir Simulators Clusters/supercomputers OpenMP for shared memory - multi-cores, easy to use MPI among computation nodes - communication - MPI-IO, read input and write output Large-scale simulations General Purpose Cluster (GPC) IBM Blue Gene/Q

Challenges for Linear Solvers Reservoir models are highly coupled nonlinear systems Heterogeneous permeability and porosity Linear systems are ill-conditioned and hard to solve Preconditioners: -- ILU(k), ILUT, ILUC, and block ILU(k) -- CPR (Constrained Pressure Residual) CPR is the most effective method, and it is serial Parallel preconditioners: -- Effective -- Scalable

Multi-Stage Preconditioners Highly nonlinear system, F(x) = 0 Linearization by Newton methods: A x = b Pressure block 𝐴 𝑝𝑝 , positive definite Saturation (bubble point) block 𝐴 𝑠𝑠 Well block 𝐴 𝑤𝑤 Coupled matrices

Constrained Pressure Residual Method Preconditioning system: M x = f

Constrained Pressure Residual Method AMG methods: scalable for parallel computing. ILU: LU x = f (Ly = f, Ux = y); triangular solvers introduce global data dependence; not scalable. Restricted Additive Schwarz method (RAS): scalable; less communication and better convergence than other domain decomposition preconditioners.

CPR-like Preconditioners CPR-PF method CPR-FPF method

Matrix Decoupling Quasi-IMPES Alternative Block Factorization (ABF)

Results and Discussions PRSI Blue Gene/Q, IBM -- parallel reservoir simulation platform -- 2 racks (4 midplanes) -- distributed systems, MPI, OpenMP -- one midplance:16 node -- arbitrary grid size and MPI procs -- one node: 32 cards -- grids, DOF -- one card: PowerPC A2 -- distributed matrix and vector -- 16+1 cores, 16 GB mem, per card -- linear solvers (Krylov and AMG) -- A2: low performance (1/10) -- AMG is BoomerAMG -- system: strong network, scalable -- preconditioners -- parallel IO, key words parsing -- well modelling -- visualization -- in-house parallel simulators -- highly scalable, 8192 CPU cores

Case 1: Matrices from SPE10 Project Oil-water model Grid: 60 x 110 x 85, 1.12 million Highly heterogeneous permeability 2000 days One injection well, four producers RAS, ILU(0) AMG: 6 levels -- Falgout coarsening -- One V-cycle -- Symmetric Gauss-Seidel smoother -- 2 smoothing iteration -- CMI interpolation

Case 1: convergence

Case 2: refined SPE10, oil-water Grid: 180 x 660 x 255, 30 million CPR-FPF, Quasi-IMPES 10 days, max step: 5 days RAS, overlap 1, ILU(0) Inexact Newton: 1e-2 AMG, one V-cycle, 6 levels BiCGSTAB, max: 100 IBM Blue Gene/Q

Case 2: scalability

Case 3: refined SPE10, black oil Grid: 180 x 660 x 255, 30 million CPR-FPF, ABF decoupling 10 days, max: 5 days RAS, overlap 1, ILU(0) Inexact Newton: 1e-2 AMG, one V-cycle, 6 levels BiCGSTAB, max: 100 IBM Blue Gene/Q

Case 3: scalability

Case 4: a giant oil-water model Homogeneous model Grid: 140 million BiCGSTAB solver CPR-FPF preconditioner 10 days Quasi-IMPES decoupling Inexact Newton method, tol: 1e-2 IBM Blue Gene/Q supercomputer MPI Steps Newton Solver Time (s) 256 27 102 119 21056.59 512 26 99 114 9321.31 1024 25 90 104 4235.68 2048 2110.16 4096 98 124 1265.42

Case 4: scalability

Case 5: refined SPE1 Black oil model Grid: 100 million Inexact Newton method tol: 1e-3 Solver: BiCGSTAB Preconditioner: CPR-FPF ABF decoupling 10 days

Case 5: scalability

Conclusions Study CPR-like preconditioners for parallel systems Apply Restricted Additive Schwarz method instead of ILU methods Propose multi-stage CPR-like preconditioners These methods are effective to heterogeneous models Good convergence Highly scalable, thousands of CPUs They work for black oil model and derived models, such as polymer flooding and extended model. They also work for compositional model.

Future Work CPR-like methods do not work with thermal models, which consider multiple components and temperature change. To study and develop multi-level methods for thermal models

Thank you.