UT-BATTELLE Scaling of First Principles Electronic Structure Methods on Future Architectures W.A. Shelton Oak Ridge National Laboratory.

Slides:



Advertisements
Similar presentations
A Large-Grained Parallel Algorithm for Nonlinear Eigenvalue Problems Using Complex Contour Integration Takeshi Amako, Yusaku Yamamoto and Shao-Liang Zhang.
Advertisements

05/11/2005 Carnegie Mellon School of Computer Science Aladdin Lamps 05 Combinatorial and algebraic tools for multigrid Yiannis Koutis Computer Science.
Andrew Canning and Lin-Wang Wang Computational Research Division LBNL
Algebraic, transcendental (i.e., involving trigonometric and exponential functions), ordinary differential equations, or partial differential equations...
Point-wise Discretization Errors in Boundary Element Method for Elasticity Problem Bart F. Zalewski Case Western Reserve University Robert L. Mullen Case.
Molecular Quantum Mechanics
MATH 685/ CSI 700/ OR 682 Lecture Notes
2D and time dependent DMRG
Limitation of Pulse Basis/Delta Testing Discretization: TE-Wave EFIE difficulties lie in the behavior of fields produced by the pulse expansion functions.
Designing Tensile Structures Using Generic CAD Applications. Structural membranes 2007, Barcelona, September 2007 Javier Sánchez, Tecnun, University.
Numerical Parallel Algorithms for Large-Scale Nanoelectronics Simulations using NESSIE Eric Polizzi, Ahmed Sameh Department of Computer Sciences, Purdue.
1 Jelly model Look at homogeneous crystal; We inserted a point charge in this model Inhomogeneous crystal Answer 1: = + Prob1: point charge field at position.
Nonlinear Evolution of Whistler Turbulence W.A. Scales, J.J. Wang, and O. Chang Center of Space Science and Engineering Research Virginia Tech L. Rudakov,
Reconfigurable Application Specific Computers RASCs Advanced Architectures with Multiple Processors and Field Programmable Gate Arrays FPGAs Computational.
CHE/ME 109 Heat Transfer in Electronics LECTURE 11 – ONE DIMENSIONAL NUMERICAL MODELS.
Completeness of the Coulomb eigenfunctions Myles Akin Cyclotron Institute, Texas A&M University, College Station, Texas University of Georgia, Athens,
Large-Scale Density Functional Calculations James E. Raynolds, College of Nanoscale Science and Engineering Lenore R. Mullin, College of Computing and.
MATH 685/ CSI 700/ OR 682 Lecture Notes Lecture 6. Eigenvalue problems.
Physics “Advanced Electronic Structure” Pseudopotentials Contents: 1. Plane Wave Representation 2. Solution for Weak Periodic Potential 3. Solution.
Monte Carlo Simulation of Interacting Electron Models by a New Determinant Approach Mucheng Zhang (Under the direction of Robert W. Robinson and Heinz-Bernd.
Eigenvalue Problems Solving linear systems Ax = b is one part of numerical linear algebra, and involves manipulating the rows of a matrix. The second main.
© Fujitsu Laboratories of Europe 2009 HPC and Chaste: Towards Real-Time Simulation 24 March
Algorithms and Software for Large-Scale Simulation of Reactive Systems _______________________________ Ananth Grama Coordinated Systems Lab Purdue University.
Massively Parallel Magnetohydrodynamics on the Cray XT3 Joshua Breslau and Jin Chen Princeton Plasma Physics Laboratory Cray XT3 Technical Workshop Nashville,
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Nanoscale Electronics / Single-Electron Transport in Quantum Dot Arrays Dene Farrell SUNY.
Simulating Quarks and Gluons with Quantum Chromodynamics February 10, CS635 Parallel Computer Architecture. Mahantesh Halappanavar.
The Nuts and Bolts of First-Principles Simulation Durham, 6th-13th December : DFT Plane Wave Pseudopotential versus Other Approaches CASTEP Developers’
Implementation of Density Functional Theory based Electronic Structure Codes on Advanced Computing Architectures W.A. Shelton, E. Aprá, G.I. Fann and R.J.
Physics “Advanced Electronic Structure” LMTO family: ASA, Tight-Binding and Full Potential Methods Contents: 1. ASA-LMTO equations 2. Tight-Binding.
Christopher Devulder, Slava V. Rotkin 1 Christopher Devulder, Slava V. Rotkin 1 1 Department of Physics, Lehigh University, Bethlehem, PA INTRODUCTION.
Fast Low-Frequency Impedance Extraction using a Volumetric 3D Integral Formulation A.MAFFUCCI, A. TAMBURRINO, S. VENTRE, F. VILLONE EURATOM/ENEA/CREATE.
Simulating Electron Dynamics in 1D
R. Martin - Pseudopotentials1 African School on Electronic Structure Methods and Applications Lecture by Richard M. Martin Department of Physics and Materials.
Modeling Plasmonic Effects in the Nanoscale Brendan McNamara, Andrei Nemilentsau and Slava V. Rotkin Department of Physics, Lehigh University Methodology.
R. Ryne, NUG mtg: Page 1 High Energy Physics Greenbook Presentation Robert D. Ryne Lawrence Berkeley National Laboratory NERSC User Group Meeting.
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY 1 Parallel Solution of the 3-D Laplace Equation Using a Symmetric-Galerkin Boundary Integral.
O AK R IDGE N ATIONAL L ABORATORY U. S. D EPARTMENT OF E NERGY 1 On-line Automated Performance Diagnosis on Thousands of Processors Philip C. Roth Future.
1 Introduction to Computer Graphics with WebGL Ed Angel Professor Emeritus of Computer Science Founding Director, Arts, Research, Technology and Science.
EEE 3394 Electronic Materials Chris Ferekides Fall 2014 Week 6.
Lanczos approach using out-of-core memory for eigenvalues and diagonal of inverse problems Pierre Carrier, Yousef Saad, James Freericks, Ehsan Khatami,
1 Complex Images k’k’ k”k” k0k0 -k0-k0 branch cut   k 0 pole C1C1 C0C0 from the Sommerfeld identity, the complex exponentials must be a function.
Ted Fox Interim Associate Laboratory Director Energy and Engineering Sciences Oak Ridge, Tennessee March 21, 2006 Oak Ridge National Laboratory.
Computational Aspects of Multi-scale Modeling Ahmed Sameh, Ananth Grama Computing Research Institute Purdue University.
O AK R IDGE N ATIONAL L ABORATORY U. S. D EPARTMENT OF E NERGY Modeling Electron and Spin Transport Through Quantum Well States Xiaoguang Zhang Oak Ridge.
Advanced Drift Diffusion Device Simulator for 6H and 4H-SiC MOSFETs
Comp. Mat. Science School 2001 Lecture 21 Density Functional Theory for Electrons in Materials Richard M. Martin Bands in GaAs Prediction of Phase Diagram.
Density Functional Theory A long way in 80 years L. de Broglie – Nature 112, 540 (1923). E. Schrodinger – 1925, …. Pauli exclusion Principle.
Physics “Advanced Electronic Structure” Lecture 1. Theoretical Background Contents: 1. Historical Overview. 2. Basic Equations for Interacting Electrons.
October 2008 Integrated Predictive Simulation System for Earthquake and Tsunami Disaster CREST/Japan Science and Technology Agency (JST)
1 Spallation Neutron Source Data Analysis Jessica Travierso Research Alliance in Math and Science Program Austin Peay State University Mentor: Vickie E.
F.F. Assaad. MPI-Stuttgart. Universität-Stuttgart Numerical approaches to the correlated electron problem: Quantum Monte Carlo.  The Monte.
Dr. Jacob Barhen Computer Science and Mathematics Division.
Physics “Advanced Electronic Structure”
Parallel coupling: problems arising in the context of magnetic fusion John R. Cary Professor, University of Colorado CEO, Tech-X Corporation.
Physics “Advanced Electronic Structure” Lecture 2. Density Functional Theory Contents: 1. Thomas-Fermi Theory. 2. Density Functional Theory. 3.
Lanczos Representation Methods in Application to Rovibrational Spectroscopy Calculations Hong Zhang and Sean Smith Quantum & Molecular Dynamics Group Center.
A. Ambrosetti, F. Pederiva and E. Lipparini
Electromagnetically biased Self-assembly
C OMPUTATIONAL R ESEARCH D IVISION 1 Defining Software Requirements for Scientific Computing Phillip Colella Applied Numerical Algorithms Group Lawrence.
F. Fairag, H Tawfiq and M. Al-Shahrani Department of Math & Stat Department of Mathematics and Statistics, KFUPM. Nov 6, 2013 Preconditioning Technique.
A Parallel Hierarchical Solver for the Poisson Equation Seung Lee Deparment of Mechanical Engineering
Comp. Mat. Science School Electrons in Materials Density Functional Theory Richard M. Martin Electron density in La 2 CuO 4 - difference from sum.
TEMPLATE DESIGN © H. Che 2, E. D’Azevedo 1, M. Sekachev 3, K. Wong 3 1 Oak Ridge National Laboratory, 2 Chinese University.
Condensed Matter Physics and Materials Science: David M
DEVELOPMENT OF SEMI-EMPIRICAL ATOMISTIC POTENTIALS MS-MEAM
High Performance Computing in materials science from the semiempirical approaches to the many-body calculations Fabio Trani Laboratoire de Physique de.
The Materials Computation Center. Duane D. Johnson and Richard M
Elements of Quantum Mechanics
Prof. Sanjay. V. Khare Department of Physics and Astronomy,
Presentation transcript:

UT-BATTELLE Scaling of First Principles Electronic Structure Methods on Future Architectures W.A. Shelton Oak Ridge National Laboratory

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Collaborators Locally Self-consistent Multiple Scattering Method (Real Space) Oak Ridge National Laboratory N.Y. Moghadam, D.M.C. Nicholson, G.M. Stocks, X.-G. Zhang, and B. Ujfalussy Pittsburgh Supercomputer Center Y. Wang National Energy Research Supercomputer Center A. Canning Screened Methods (Tight-binding like methods) Oak Ridge National Laboratory A. Smirnov University of Illinois (Urbana-Champaign) D.D. Johnson

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Acknowledgement of Sponsors Department of Energy/Office of Science Office of Advanced Scientific Computing Research Mathematics, Information and Computer Science Applied Mathematical Sciences Program Oak Ridge National Laboratory Laboratory Directors Research and Development Program Computing Resources at the Center of Computational Sciences located at Oak Ridge National Laboratory Pittsburgh Supercomputing Center National Energy Research Supercomputing Center located at Lawrence Berkeley National Laboratory

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Motivation Rethink the mathematical model Design new algorithms that renders a numerical solution Open new possibilities –Improved scaling –For solving problems that previously were untenable –Software technologies being used by researchers with access to less advanced hardware technologies The introduction of new architectures

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Motivation N anoscale S cience and E ngineering T echnology Initiative SOFT MATERIALS –Synthetic Polymers and Bio-Inspired Materials –Systems Dominated by Organic-Inorganic Interconnections –Interfacing Nanostructures to Biological Systems HYBRID SOFT-HARD MATERIALS –Carbon-Based Nanostructures –Characterization of Active Sites in Catalytic Materials –Nanoporous Membranes and Nanomaterials for Ultra-Selective Catalysis COMPLEX HARD MATERIALS –Magnetism in Nanostructured Materials –Nanoscale Manipulation of Collective Behavior –Nanoscale Interface Science (Nanoparticles and Nanograins) –Electromagnetic Fields in Confined Structures THEORY / MODELING / SIMULATION –Virtual Synthesis and Nanomaterials Design –Theoretical Nano-Interface Science

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Multiscale Simulations Connect Microscopic-level Processes to Macroscopic Response of Material

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Scalable and Accurate First Principles Method Atomistic Methods

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Quantum Simulation Goals: Accuracy and Predictive Capabilities

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Advances In Hardware Alone Are Not Sufficient

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Linear Scaling Algorithms Will Enable Solutions to New Problems The combination of new advanced computing platforms and new scaling algorithms will open new areas in quantum-level materials simulations

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Algorithm Design for future generation architectures More accurate Spectral or pseudo-spectral accuracy Wider range of applicability Sparse representation Memory requirements grow linearly Each processor can treat thousands of atoms Make use of large number of processors Message-Passing Each atom/node local message-passing is independent of the size of the system Time consuming step of model Sparse linear solver Direct or preconditioned iterative approach

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Density Functional Theory (DFT) DFT in principle is an exact method for treating the many body quantum mechanical effects of electron exchange and correlation At the heart of this formulation is the ascertain that the ground-state total energy of an electron system in the field of the atomic nuclei is a unique functional of the electronic charge density The total energy functional attains it minimum value when evaluated with the true ground-state electronic charge density. The quantum mechanical effects of electron exchange and correlation are contained in the non-local exchange-correlation potential V ex-corr (r,r ’). –Hence, the electronic interactions are explicitly accounted for by the fundamental quantity, the electronic charge density –Unfortunately, there are no analytical forms for calculating V ex-corr (r,r ’ )

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY LSDA &Multiple Scattering Theory (MST ) No Yes Mix in & out Initial guess n in (r), m in (r) Calculate V eff [n,m] in Solve Schrodinger Equation Recalculate n out (r), m out (r) n in (r) = n out (r) ? m in (r) =m out (r) ? Calculate Total Energy  Multiple Scattering Theory (MST) J. Korringa, Physica 13, 392, (1947) W. Kohn, N. Rostoker, PR, 94, 1111,(1954)  MST Green function methods B. Gyorffy, and M. J. Stott, “Band Structure Spectroscopy of Metals and Alloys”, Ed. D.J. Fabian and L. M. Watson (Academic 1972) S.J. Faulkner and G.M. Stocks, PR B 21, 3222, (1980)

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Complex Energy Plane The scattering properties at complex energy can be used to develop highly efficient real-space and k-space methods Real  Im  ff Semi-conductors and insulators could work well since they have no states at  f Scattering is local since there are no states near the bottom of the energy contour Scattering is local since a large Im  is equivalent to rising temperature which smears out the states Near  f scattering is non-local (metal) Real  Im  ff  f is the highest occupied electronic state in energy

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY  Multiple scattering theory Green function Scattering path matrix Multiple Scattering Theory decay slowly with increasing distance contain free-electron singularities ][ M  Generalization of t-matrix. Converts incoming wave at siteinto outgoing wave at site in the presence of all the other sites

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Real Space Algorithm Design Linear scaling –Each node performs a fixed size local calculation Thus each node performs the same number of flops Message-Passing –Each atom/node local message-passing is independent of the size of the system Time consuming step of model –Reduce to Linear Algebra step BLAS level 3

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Real Space Parallel Implementation Green’s function Scattering path matrix: real space t : scattering from single site G: structure constant matrix M=[t -1 (  )-G(R ij,  ]  =M -1 Once M is fixed increasing N does not affect the local calculation of M -1 The LSMS naturally scales linearly with increasing N

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Matrix Inversion A B CD Take A and continue to partition until the desired matrix size (l max + 1) 2 of the central site is reached Partition the m(l max + 1) 2 matrix,  =MxM into M=M 1 + M 2 into four blocks two of size M 1 and two of size M 2 A -1 =A-B D -1 C Note that the LxL diagonal block of A -1 is the same LxL block that is desired.

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY (N) Scaling of Real Space Method 1998 Gordon Bell Prize 1.02 TFLOPS on a 1500 node Cray T3E

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY (N) Scaling of Real Space Method

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Real Space Accuracy fcc Cu bcc Cu bcc Mo hcp Co

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Tight-Binding MST Representation Tight Binding Multiple Scattering Theory –Embed a constant repulsive potential Shifts the energy zero allowing for calculations at negative energy –Rapidly decaying interactions –Free electron singularities are not a problem –Sparse representation 0 VsVs Constant inside a sphere > } 2 Ryd. eiReiR R

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Screened Structure Constants Screened Structure Constants G s on the left unscreened on the right –Screened structure constants rapidly go to zero, whereas the free space structure constants have hardly changed Linear solve using m atom cluster that is less than the n atom system Easy to perform Fourier transform –K-space method

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Screened MST Methods Formulation produces a sparse matrix representation –2-D case has tridiagonal structure with a few distant elements due to periodicity –3-D case has scattered elements Mainly due to mapping 3-D structure to a matrix (2-D) A few elements due to periodic boundary conditions Require block diagonals of the inverse of  matrix –Block diagonals represent the site  matrix and are needed to calculate the Green’s function for each atomic site Sparse direct and preconditioned iterative methods are used to calculate  ii (  ) –SuperLU –Transpose free Quasi-Minimal Residual Method (TFQMR)

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Screened KKR Accuracy fcc Cu bcc Cu bcc Mo hcp Co

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Timing and Scaling of Scr-K KKR-CPA

Computer Science and Mathematics UT-BATTELLE U.S. D EPARTMENT OF E NERGY O AK R IDGE N ATIONAL L ABORATORY Conclusion Initial benchmarking of the Screened KKR method –SuperLU N 1.8 for finding the inverse of the upper left block of  –TFQMR with block Jacobi preconditioner N 1.06 for finding the inverse of the upper left block of  Extremely high sparsity (97%-99% zeros increases with increasing system size) Large number of atoms on a single processor Real-space/Scr-KKR hybrid may provide the most efficient parallel approach for new generation architectres Single code contains –LSMS, KKR-CPA, Scr-LSMS and Scr-KKR-CPA