Andrew Canning and Lin-Wang Wang Computational Research Division LBNL

Slides:



Advertisements
Similar presentations
Lecture 1 Periodicity and Spatial Confinement Crystal structure Translational symmetry Energy bands k·p theory and effective mass Theory of.
Advertisements

The H 2 molecule without convergence studies Objectives: - the (pseudo)total energy - the bond length r HH Acknowledgment: exercise inspired on the first.
Molecular dynamics modeling of thermal and mechanical properties Alejandro Strachan School of Materials Engineering Purdue University
Modelling of Defects DFT and complementary methods
Quantum Theory of Solids
PARATEC and the Generation of the Empty States (Starting point for GW/BSE) Andrew Canning Computational Research Division, LBNL and Chemical Engineering.
A New Design Tool for Nanoplasmonic Solar Cells using 3D Full Wave Optical Simulation with 1D Device Transport Models Liming Ji* and Vasundara V. Varadan.
First Principle Electronic Structure Calculation Prof. Kim Jai Sam ( ) Lab. 공학 ( ) Students : Lee Geun Sik,
. Chapter 1. Introduction, perspectives, and aims. On the science of simulation and modelling. Modelling at bulk, meso, and nano scale. (2 hours). Chapter.
Quantum Dots PA2003: Nanoscale Frontiers Artificial atoms Schr ö dinger equation Square well potential Harmonic oscillator 2D Harmonic oscillator Real.
Acknowledgments: Thanks to Professor Nicholas Brummell from UC Santa Cruz for his help on FFTs after class, and also thanks to Professor James Demmel from.
Ab Initio Total-Energy Calculations for Extremely Large Systems: Application to the Takayanagi Reconstruction of Si(111) Phys. Rev. Lett., Vol. 68, Number.
Nanomaterials & Nanotechnology
Parallel Methods for Nano/Materials Science Applications Andrew Canning Computational Research Division LBNL & UC Davis, Applied Science Dept. (Electronic.
Numerical study of transport properties of carbon nanotubes Dhanashree Godbole Brian Thomas Summer Materials Research Training Oakland University 2006.
Large-Scale Density Functional Calculations James E. Raynolds, College of Nanoscale Science and Engineering Lenore R. Mullin, College of Computing and.
Parallel Methods for Nano/Materials Science Applications Andrew Canning Computational Research Division LBNL & UC Davis (Electronic Structure Calculations)
Towards Petascale Computing for Science Horst Simon Lenny Oliker, David Skinner, and Erich Strohmaier Lawrence Berkeley National Laboratory The Salishan.
Physics “Advanced Electronic Structure” Pseudopotentials Contents: 1. Plane Wave Representation 2. Solution for Weak Periodic Potential 3. Solution.
Lectures Introduction to computational modelling and statistics1 Potential models2 Density Functional.
Deformation of Nanotubes Yang Xu and Kenny Higa MatSE 385
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Nanoscale Electronics / Single-Electron Transport in Quantum Dot Arrays Dene Farrell SUNY.
Computational Solid State Physics 計算物性学特論 第4回 4. Electronic structure of crystals.
J.J.Rehr1, John Vinson1, E.L.Shirley2 J.J. Kas1 and F. Vila1
The Nuts and Bolts of First-Principles Simulation Durham, 6th-13th December : DFT Plane Wave Pseudopotential versus Other Approaches CASTEP Developers’
Javier Junquera Code structure: calculation of matrix elements of H and S. Direct diagonalization José M. Soler = N  N N  1.
R. Martin - Pseudopotentials1 African School on Electronic Structure Methods and Applications Lecture by Richard M. Martin Department of Physics and Materials.
Problem is to compute: f(latitude, longitude, elevation, time)  temperature, pressure, humidity, wind velocity Approach: –Discretize the.
Feb 23, 2010, Tsukuba-Edinburgh Computational Science Workshop, Edinburgh Large-Scale Density-Functional calculations for nano-meter size Si materials.
1 Investigative Tools--Theory, Modeling, and Simulation Rational You ITRI-IEK-NEMS 2001/08/06 Source: IWGN (1999/09)
Nanoscale Science and Engineering. Nanoscale Science and Engineering embodies fundamental research and technology development of materials, structures,
The Nuts and Bolts of First-Principles Simulation Durham, 6th-13th December : Computational Materials Science: an Overview CASTEP Developers’ Group.
Chapter 2 Properties on the Nanoscale
Comp. Mat. Science School 2001 Lecture 21 Density Functional Theory for Electrons in Materials Richard M. Martin Bands in GaAs Prediction of Phase Diagram.
Fundamentals of DFT R. Wentzcovitch U of Minnesota VLab Tutorial Hohemberg-Kohn and Kohn-Sham theorems Self-consistency cycle Extensions of DFT.
Density Functional Theory A long way in 80 years L. de Broglie – Nature 112, 540 (1923). E. Schrodinger – 1925, …. Pauli exclusion Principle.
Computing Environment The computing environment rapidly evolving ‑ you need to know not only the methods, but also How and when to apply them, Which computers.
1 THE EARTH SIMULATOR SYSTEM By: Shinichi HABATA, Mitsuo YOKOKAWA, Shigemune KITAWAKI Presented by: Anisha Thonour.
Physics “Advanced Electronic Structure” Lecture 1. Theoretical Background Contents: 1. Historical Overview. 2. Basic Equations for Interacting Electrons.
3/6/ APS March Meeting1 Structure and interface properties of the electrolyte material Li 4 P 2 S 6 * Zachary D. Hood, a Cameron M. Kates, b,c.
F. Sacconi, M. Povolotskyi, A. Di Carlo, P. Lugli University of Rome “Tor Vergata”, Rome, Italy M. Städele Infineon Technologies AG, Munich, Germany Full-band.
Preliminary CPMD Benchmarks On Ranger, Pople, and Abe TG AUS Materials Science Project Matt McKenzie LONI.
Dr. Jacob Barhen Computer Science and Mathematics Division.
Linear Algebra Libraries: BLAS, LAPACK, ScaLAPACK, PLASMA, MAGMA
Electromagnetically biased Self-assembly
Start. Technische Universität Dresden Physikalische Chemie Gotthard Seifert Tight-binding Density Functional Theory DFTB an approximate Kohn-Sham DFT.
Determination of surface structure of Si-rich SiC(001)(3x2) Results for the two-adlayer asymmetric dimer model (TAADM) are the only ones that agree with.
Advanced methods of molecular dynamics 1.Monte Carlo methods 2.Free energy calculations 3.Ab initio molecular dynamics 4.Quantum molecular dynamics 5.Trajectory.
Linear Scaling Quantum Chemistry Richard P. Muller 1, Bob Ward 2, and William A. Goddard, III 1 1 Materials and Process Simulation Center California Institute.
Linear Algebra Libraries: BLAS, LAPACK, ScaLAPACK, PLASMA, MAGMA Shirley Moore CPS5401 Fall 2013 svmoore.pbworks.com November 12, 2012.
© copyright 2011 William A. Goddard III, all rights reservedCh121a-Goddard-L14 Periodic Boundary Methods and Applications: Ab-initio Quantum Mechanics.
Comp. Mat. Science School Electrons in Materials Density Functional Theory Richard M. Martin Electron density in La 2 CuO 4 - difference from sum.
Multicore Applications in Physics and Biochemical Research Hristo Iliev Faculty of Physics Sofia University “St. Kliment Ohridski” 3 rd Balkan Conference.
2/23/2015PHY 752 Spring Lecture 171 PHY 752 Solid State Physics 11-11:50 AM MWF Olin 107 Plan for Lecture 17: Reading: Chapter 10 in MPM Ingredients.
Isolated Si atoms.
BY SELLAVEL E (CA15M006) Guided By Prof.B.Viswanathan
Introduction to Tight-Binding
PHY 752 Solid State Physics
Production of an S(α,β) Covariance Matrix with a Monte Carlo-Generated
Goals for Today: Syllabus Review
Maintaining Adiabaticity in Car-Parrinello Molecular Dynamics
Quantum corral of 48 iron atoms on copper surface
The Nuts and Bolts of First-Principles Simulation
Centar ( Global Signal Processing Expo
Prof. Sanjay. V. Khare Department of Physics and Astronomy,
First principles calculation on field emission of boron/nitrogen doped carbon nanotube I’m going to talk about the first principles calculation on field.
Welcome to The Workshop
The Nuts and Bolts of First-Principles Simulation
Molecular Transport Group
Nano Technology Dr. Raouf Mahmood. Nano Technology Dr. Raouf Mahmood.
Presentation transcript:

Andrew Canning and Lin-Wang Wang Computational Research Division LBNL Scaling First Principles Materials and Nanoscience Codes to Thousands of Processors (and Thousands of Atoms) (Plane-wave, DFT codes) Andrew Canning and Lin-Wang Wang Computational Research Division LBNL

Nanostructures as a new material Definition: Nanostructure is an assembly of nanometer scale “building blocks”. Why nanometer scale: This is the scale when the properties of these “building blocks” become different from bulk. size Electron Wavefunction Nanostructure Both are in nanometers

Example: Quantum Dots (QD) CdSe Band gap increase CdSe quantum dot (size) Single electron effects on transport (Coulomb blockade). Mechanical properties, surface effects and no dislocations

Computational challenges (larger nanostructures) atoms molecules nanostructures bulk Infinite (1-10 atoms in a unit cell) 1-100 atoms 1000-10^6 atoms size Ab initio method PARATEC Ab initio Method PARATEC Challenge for computational nanoscience. method Effective mass method Ab initio elements and reliability New methodology and algorithm (ESCAN) Even larger Supercomputer

1. Plane-wave expansion for Plane-wave Pseudopotential Method in DFT Solve Kohn-Sham Equations self-consistently for electron wavefunctions 1. Plane-wave expansion for 2. Replace “frozen” core by a pseudopotential Different parts of the Hamiltonian calculated in different spaces (fourier and real) 3d FFT used

Parallel Algorithm Details Divide sphere of plane-waves in columns between processors (planes of grid in real space) Parallel 3d FFT used to move between real space and fourier space FFT requires global communications data packet size ~ 1/(# processors)^2

Parallel 3D FFT 3D FFT done via 3 sets of 1D FFTs and 2 transposes (b) 3D FFT done via 3 sets of 1D FFTs and 2 transposes Most communication in global transpose (b) to (c) little communication (d) to (e) Flops/Comms ~ logN Many FFTs done at the same time to avoid latency issues Only non-zero elements communicated/calculated Much faster than vendor supplied 3D-FFT (c) (d) (e) (f)

How to scale up the parallel 3d FFTs Minimize the global communications part Latency problem: Use all-band method in conjunction with many FFTs at the same time to make data packets larger (new all band CG method for metals ) FFT part scales as N logN while other parts scale as N (larger systems scale better) 2 3

Materials/Nanoscience codes Parallel Fourier techniques (many-band approach) used in the following codes: PARATEC PARAllel Total Energy Code (plane-wave pseudopotential code) PEtot (plane-wave ultrasoft pseudopotential code) ESCAN (Energy SCAN) Uses folded spectrum method for non-selfconsistent nanoscale calculations with plane-waves for larger systems.

PARATEC (PARAllel Total Energy Code) PARATEC performs first-principles quantum mechanical total energy calculation using pseudopotentials & plane wave basis set Designed to run on large parallel machines IBM SP etc. but also runs on PCs PARATEC uses all-band CG approach to obtain wavefunctions of electrons Generally obtains high percentage of peak on different platforms Developed with Louie and Cohen’s groups (UCB, LBNL), Raczkowski (Multiple 3d FFTs Peter Haynes and Michel Cote) DFT has become standard technique in Mat Sci to calculate the structural and electronic properties of new materials with a full QUANTUM MECHANICAL treatment of the electrons. Picture shows induced current and charge density in crystalized glcine.The simulations used to better undersand nuclear magnetic resonance.

432 Si-atom system with 5CG steps PARATEC: Performance P IBM SP Power 3 IBM SP Power4 SGI Altix NEC ES CRAY X1 Gflops/P %peak 32 0.950 63% 2.02 39% 3.71 62% 4.76 60% 3.04 24% 64 0.848 57% 1.73 33% 3.24 54% 4.67 59% 2.59 20% 128 0.739 49% 1.50 29% 4.74 1.91 15% 256 0.572 38% 1.08 21% 4.17 52% 512 0.413 28% 3.39 42% 1024 2.08 26% 432 Si-atom system with 5CG steps 432 Atom Si Bulk System 5 CG step First time code achieved over 1 Tflop Aggregate 2.6 TFlops for 686 Si atom Previous best was .7 TFlop on Power3 using 1500 procs Work carried out with L. Oliker (CRD) J.T. Carter (NERSC)

PARATEC: Performance on ES SX6 NEC SX6 ES CRAY X1 Gflops/P Gflops %peak 64 5.25 335 66% 3.73 239 29% 128 4.95 633 62% 3.01 385 24% 256 4.59 1176 57% 512 3.76 1924 47% 1024 2.53 2591 32% Scaling Issues 686 atom Si % time in communications increasing (3D FFT) Fewer number of multiple 1D FFTs for each proc. (vector length drops) Matrices for BLAS3 becoming smaller (vector length drops)

PARATEC: Scaling ES vs. Power3 ES can run the same system about 10 times faster than the IBM SP (on any number of processors) Main advantage of ES for these types of codes is the fast communication network Fast processors require less fine-grain parallelism in code to get same performance as RISC machines QD is 309 atom CdSe Quantum Dot

Applications: Free standing quantum dots (CdSe) TEM image Chemically synthesised (Alivisatos, UCB, LBNL) Interior atoms are in bulk crystal structure Surface atoms are passivated Diameter ~ 20-100 A A few thousand atoms, beyond ab initio method

CdSe quantum dots as biological tags Optically more stable than dye molecules Can have multiple colors

Computational challenges (larger nanostructures) atoms molecules nanostructures bulk Infinite (1-10 atoms in a unit cell) 1-100 atoms 1000-10^6 atoms size Ab initio method PARATEC Ab initio Method PARATEC Challenge for computational nanoscience. method Effective mass method Ab initio elements and reliability New methodology and algorithm (ESCAN) Even larger Supercomputer

Charge patching method for larger systems (L-W. Wang) Non-selfconsistent LDA quality potential for nanotube Selfconsistent LDA calculation of a single graphite sheet Get information from small system ab initio calc., then generate the charge densities for large systems

1. Plane-wave expansion for Plane-wave Pseudopotential Method in DFT Solve Kohn-Sham Equations non-selfconsistently for electron wavefunctions in desired energy range using patched charge density (can study larger nanosystems 10,000 atoms) 1. Plane-wave expansion for 2. Replace “frozen” core by a pseudopotential Different parts of the Hamiltonian calculated in different spaces (fourier and real) 3d FFT used

Motif based charge patching method Error: 1%, ~20 meV eigen energy error.

+ Folded Spectrum Method (ESCAN)

Charge patching: free standing quantum dots In675P652 LDA quality calculations (eigen energy error ~ 20 meV) 64 processors (IBM SP3) for ~ 1 hour Total charge density motifs CBM VBM

Polarization of CdSe quantum rods The electron wavefunctions of a quantum rods

CdSe tetrapod electronic states

GaN (111) and (112) quantum wires (WZ) (111) GaN wire (112) GaN wire CB1 CB2

Conclusion First principles calculation New algorithm methodology Large scale supercomputer + + Million atom nanostructures

Acknowledgements This research was in part funded by the DOE as part of the Modeling and Simulation in Nanoscience Initiative (MICS and BES) This research used resources of NERSC at LBNL ad CCS at ORNL supported under contract No DE-AC03-76SF00098 and DE-AC05-00OR22725. The authors were in part supported by the Office of Advanced Scientific Computing Research in the DOE Office of Science under contract number DE-AC03-76SF00098 A. Canning would like to thank the staff of the Earth Simulator Center, especially Dr. T. Sato, S. Kitawaki Y. Tsuda and D. Parks, J. Snyder (NEC, USA) for their assistance during his visit.