Feb 23, 2010, Tsukuba-Edinburgh Computational Science Workshop, Edinburgh Large-Scale Density-Functional calculations for nano-meter size Si materials.

Slides:



Advertisements
Similar presentations
Electronic transport properties of nano-scale Si films: an ab initio study Jesse Maassen, Youqi Ke, Ferdows Zahid and Hong Guo Department of Physics, McGill.
Advertisements

Andrew Canning and Lin-Wang Wang Computational Research Division LBNL
PHYS466 Project Kyoungmin Min, Namjung Kim and Ravi Bhadauria.
Modelling of Defects DFT and complementary methods
Peter De á k Challenges for ab initio defect modeling. EMRS Symposium I, Challenges for ab initio defect modeling Peter.
Quantum Theory of Solids
PARATEC and the Generation of the Empty States (Starting point for GW/BSE) Andrew Canning Computational Research Division, LBNL and Chemical Engineering.
PA4311 Quantum Theory of Solids Quantum Theory of Solids Mervyn Roy (S6) www2.le.ac.uk/departments/physics/people/mervynroy.
First Principle Electronic Structure Calculation Prof. Kim Jai Sam ( ) Lab. 공학 ( ) Students : Lee Geun Sik,
. Chapter 1. Introduction, perspectives, and aims. On the science of simulation and modelling. Modelling at bulk, meso, and nano scale. (2 hours). Chapter.
Sohrab Ismail-Beigi Applied Physics, Physics, Materials Science Yale University Describing exited electrons: what, why, how, and what it has to do with.
DFT – Practice Simple Molecules & Solids [based on Chapters 5 & 2, Sholl & Steckel] Input files Supercells Molecules Solids.
Computational Chemistry
Finite Size Effects in Conducting Nanoparticles: Classical and Quantum Theories George Y. Panasyuk Wright-Patterson Air Force Base, OH, February 8, 2011.
DFT Calculations Shaun Swanson.
Electronic structure of La2-xSrxCuO4 calculated by the
Stanford 11/10/11 Modeling the electronic structure of semiconductor devices M. Stopa Harvard University Thanks to Blanka Magyari-Kope, Zhiyong Zhang and.
Density Functional Theory: a first look Patrick Briddon Theory of Condensed Matter Department of Physics, University of Newcastle, UK.
Lattice regularized diffusion Monte Carlo
Ab Initio Total-Energy Calculations for Extremely Large Systems: Application to the Takayanagi Reconstruction of Si(111) Phys. Rev. Lett., Vol. 68, Number.
Advanced Semiconductor Physics ~ Dr. Jena University of Notre Dame Department of Electrical Engineering SIZE DEPENDENT TRANSPORT IN DOPED NANOWIRES Qin.
Computing lattice constant, bulk modulus and equilibrium energies of solids Bulk Si Diamond structure.
Large-Scale Density Functional Calculations James E. Raynolds, College of Nanoscale Science and Engineering Lenore R. Mullin, College of Computing and.
FUNDAMENTALS The quantum-mechanical many-electron problem and Density Functional Theory Emilio Artacho Department of Earth Sciences University of Cambridge.
First Principle Simulations in Nano-science Tianshu Li University of California, Berkeley University of California, Davis.
Computational Solid State Physics 計算物性学特論 第7回
Why? Three options for studying the Earth’s interior 1.Direct observations e.g. seismics, electrical conductivity 2.High pressure experiments, e.g. multi-anvil.
Excitons in Single Wall Dr. Fazeli and Dr. Mozaffari
An approach for solving the Helmholtz Equation on heterogeneous platforms An approach for solving the Helmholtz Equation on heterogeneous platforms G.
Lectures Introduction to computational modelling and statistics1 Potential models2 Density Functional.
Мэдээллийн Технологийн Сургууль Монгол Улсын Их Сургууль Some features of creating GRID structure for simulation of nanotransistors Bolormaa Dalanbayar,
ATK 方法的扩展及其应用 王雪峰 苏州大学物理系 ZJNU-JinHua.
First principles modeling with Octopus: massive parallelization towards petaflop computing and more A. Castro, J. Alberdi and A. Rubio.
Algorithms for Total Energy and Forces in Condensed-Matter DFT codes
Norm-conserving pseudopotentials and basis sets in electronic structure calculations Javier Junquera Universidad de Cantabria.
The Nuts and Bolts of First-Principles Simulation Durham, 6th-13th December : DFT Plane Wave Pseudopotential versus Other Approaches CASTEP Developers’
Algorithms for a large sparse nonlinear eigenvalue problem Yusaku Yamamoto Dept. of Computational Science & Engineering Nagoya University.
The g DFTB method applied to transport in Si nanowires and carbon nanotubes 1 Dip. di Ingegneria Elettronica, Universita` di Roma Tor Vergata 2 Computational.
One-sided Communication Implementation in FMO Method J. Maki, Y. Inadomi, T. Takami, R. Susukita †, H. Honda, J. Ooba, T. Kobayashi, R. Nogita, K. Inoue.
Comp. Mat. Science School 2001 Lecture 21 Density Functional Theory for Electrons in Materials Richard M. Martin Bands in GaAs Prediction of Phase Diagram.
Linear scaling fundamentals and algorithms José M. Soler Universidad Autónoma de Madrid.
Fundamentals of DFT R. Wentzcovitch U of Minnesota VLab Tutorial Hohemberg-Kohn and Kohn-Sham theorems Self-consistency cycle Extensions of DFT.
Density Functional Theory A long way in 80 years L. de Broglie – Nature 112, 540 (1923). E. Schrodinger – 1925, …. Pauli exclusion Principle.
Electronic transport properties of nano-scale Si films: an ab initio study Jesse Maassen, Youqi Ke, Ferdows Zahid and Hong Guo Department of Physics, McGill.
Density Functional Theory The Basis of Most Modern Calculations
Physics “Advanced Electronic Structure” Lecture 1. Theoretical Background Contents: 1. Historical Overview. 2. Basic Equations for Interacting Electrons.
Lobachevsky State University of Nizhniy Novgorod
Development of density functional theory for unconventional superconductors Ryotaro Arita Univ. Tokyo/JST-PRESTO.
Ferroelectricity induced by collinear magnetic order in Ising spin chain Yoshida lab Ryota Omichi.
F. Sacconi, M. Povolotskyi, A. Di Carlo, P. Lugli University of Rome “Tor Vergata”, Rome, Italy M. Städele Infineon Technologies AG, Munich, Germany Full-band.
Preliminary CPMD Benchmarks On Ranger, Pople, and Abe TG AUS Materials Science Project Matt McKenzie LONI.
Introduction to the SIESTA method SUMMER SCHOOL ON COMPUTATIONAL MATERIALS SCIENCE University of Illinois at Urbana-Champaign, June 13-23, 2005 Some technicalities.
Theoretical Solid State Physics Marvin L. Cohen and Steven G. Louie, University of California at Berkeley, DMR Carbon nanotubes possess novel properties.
Institute for the Theory of Advanced Materials in Information Technology James R. Chelikowsky, University of Texas at Austin, DMR Workshop on Materials.
Complex magnetism of small clusters on surfaces An approach from first principles Phivos Mavropoulos IFF, Forschungszentrum Jülich Collaboration: S. Lounis,
First-Principles calculations of the structural and electronic properties of the high-K dielectric HfO 2 Kazuhito Nishitani 1,2, Patrick Rinke 2, Abdallah.
Determination of surface structure of Si-rich SiC(001)(3x2) Results for the two-adlayer asymmetric dimer model (TAADM) are the only ones that agree with.
Linear Scaling Quantum Chemistry Richard P. Muller 1, Bob Ward 2, and William A. Goddard, III 1 1 Materials and Process Simulation Center California Institute.
Comp. Mat. Science School Electrons in Materials Density Functional Theory Richard M. Martin Electron density in La 2 CuO 4 - difference from sum.
Computing lattice constant, bulk modulus and equilibrium energies of bulk cubic SrTiO 3 Bulk SrTiO 3 Cubic structure.
GPAW Setup Optimization Center for Atomic-scale Materials Design Technical University of Denmark Ask Hjorth Larsen.
OMEN: a Quantum Transport Modeling Tool for Nanoelectronic Devices
Yosuke Harashima, Keith Slevin
The Materials Computation Center. Duane D. Johnson and Richard M
Prof. Sanjay. V. Khare Department of Physics and Astronomy,
Welcome to The Workshop
Masoud Aryanpour & Varun Rai
Metastability of the boron-vacancy complex (C center) in silicon: A hybrid functional study Cecil Ouma and Walter Meyer Department of Physics, University.
TDDFT Prof. Dr. E.K.U. Gross Prof. Dr. Mark Casida.
Presentation transcript:

Feb 23, 2010, Tsukuba-Edinburgh Computational Science Workshop, Edinburgh Large-Scale Density-Functional calculations for nano-meter size Si materials Jun-Ichi Iwata Center for Computational Sciences University of Tsukuba

Outline Quantum Mechanical (First-Principles) Simulation in Solid-State Physics Density-Functional Theory W. Kohn (Nobel Prize in 1998) Density-Functional simulations for large systems Real-Space DFT program code for Parallel Computation -RSDFT- Applications of RSDFT for Si nano materials >10,000-atom system

First-Principles Calculation in Material Physics We describe material properties from the behavior of electrons and ions. ions → classical, electrons → quantum We solve the Schrodinger equation for electronic ground state Density-functional theory is a powerful tool for this purpose.

Density-Functional Theory electron density Energy Functional We get stable atomic & electronic structures. ( minimize ) P. Hohenberg and W. Kohn, Phys. Rev. 136 (1964) B864. W. Kohn and L. J. Sham, Phys. Rev. 140 (1965) A1133. minimize with respect to Potential Kohn-Sham equation → We have to solve this equation self-consistently ( Nonlinear eigenvalue problem )

M. T. Yin and M. L. Cohen Phys. Rev. B26, 5668 (1982). Exchange functional in Local-Density Approx. DFT calc. Expt. Lattice Constant ( Å ) Bulk Modulus (Mb) Si ( in diamond structure ) Performance of DFT with simple approximation quantitatively good results Correctly describe various properties

Proteins ( cytochrome c oxidase ) ~ 30,000 atoms Nano structures (Si pyramid) ~ 100,000 atoms A. Ichimiya et al., Surf. Sci. 493, 555 (2001). Everybody wants to apply the DFT for Large systems Usually, we treat 10- to 1000-atom systems by DFT. However, we need to treat larger systems. to study large objects (nano structures, proteins) to make the atomic model more realistic

Real-Space DFT program code (RSDFT) Solve Kohn-Sham equation (eigenvalue problem) → Computational costs ~ O(N 3 ) Developed for parallel computers

discretize function Column vector Laplacian → Higher-Order Finite-Difference Higher-order finite difference pseudopotential method J. R. Chelikowsky et al., Phys. Rev. B, (1994) Real-Space Method continuous spacediscrete space Typical number of grid points : 10,000 ~ 1,000,000 ( ⇔ Reciprocal-Space (Plane-Wave) Method )

Real-Space Finite-Difference Sparse Matrix FFT free (FFT is inevitable in the conventional plane-wave code) Kohn-Sham eq. (finite-difference) 3D grid is divided by several regions for parallel computation. Higher-order finite difference Integration MPI_ISEND, MPI_IRECV MPI_ALLREDUCE RSDFT – suitable for parallel first-principles calculation - MPI ( Message Passing Interface ) library CPU0 CPU8CPU7 CPU6 CPU5 CPU4CPU3 CPU2CPU1

Convergence behavior for Si H 1996 The largest system in the present study → Si H 1996 Massively Parallel Computing Computational Time (with 1024 nodes of PACS-CS) 6781 sec. × 60 iteration step = 113 hour Based on the finite-difference pseudopotential method (J. R. Chelikowsky et al., PRB1994) Highly tuned for massively parallel computers Computations are done on a massively-parallel cluster PACS-CS at University of Tsukuba. (Theoretical Peak Performance = 5.6GFLOPS/node) with our recently developed code “RSDFT” Iwata et al, J. Comp. Phys. (2010) Real-Space Density-Functional Theory code (RSDFT) Grid points = 3,402,059 Bands = 22,432

Conjugate-Gradient Method Gram-Schmidt orthonormalization Density, Potentials update Subspace Diagonalization Total Computational Cost ~ O(N 3 ) O(N 3 ) O(N) O(N 2 ) Flow chart Calc. Ionic Potentials Input initial configuration of Ions Hellman-Feynman Force Move ions Convergence Check Electronic structure optimization must be performed in each atomic optimization step Atomic structure optimization Electronic structure optimization Algorithm → subspace iteration method (Rayleigh-Ritz method) yes

Algorithm 1 → Subspace Iteration Method ( Rayleigh-Ritz Method ) M-dimensional eigenvalue problem We need smallest N( ≪ M) eigen-pairs Minimize Reyleigh quotients by Conjugate-Gradient Method wave function update Initial guess Problem

Algorithm 2 O(MN 2 ) O(N 3 ) Subspace Diagonalization O(MN 2 ) ( Ritz vectors ) Gram-Schmidt Orthogonalization → as a basis set ← initial guess for the next iteration O(MN 2 ) Calc. Matrix Elements

Gram-Schmidt orthogonalization Time (sec) GFLOPS/node Old algorithm 661 (710) 0.70 (0.65) New algorithm 111 (140) 4.30 (3.50) Time & Performance for Gram-Schmidt O(N 3 ) part can be computed at 80% of the theoretical peak performance! ~ Active use of Level 3 BLAS in O(N 3 ) computation ~ → Collaboration with computer scientists much improve the performance of the RSDFT! Theoretical peak performance = 5.6 GFLOPS/node Part of the calculations can be performed as Matrix × Matrix operation! Algorithm of GS

PACS-CS(5.6GFLOPS/node) 256nodes → time for O(N 2 )-part and O(N 3 )-part become comparable Elapsed time for 1 step of iteration O(N 2 ) O(N 3 )

Application 1 Nano-meter size Si quantum dots

Si quantum dot is a promising material for several device applications  Memory  Single-electron transistor  Optical Device Clarifying the relation between the “Dot size” and “Band gap” is important for controlling the device properties.  System size is very large! A model of the Si quantum dot of 6.6 nm diameter ( Si 7055 H 1596 ) First-principles calculations are useful for such studies? → Yes, but …

(eV) Experimental fit curve From STS measurement B.Zanknoon et al., Nano letters 8, 1689 (2008). The ΔSCF gap seems to be closer to the ΔKS gap … Band Gaps 300 atoms>10,000 atoms

Application 2 Si nanowires

IEDM2005IEDM2006 Diameter of NW10 nm8 nm Gate length30 nm15 nm Vdd1.0 V I_on (n) 2.64 mA/  m1.4 mA/  m I_on (p) 1.11 mA/  m1.94 mA/  m I_off (n) 3.1 nA/  m2.0 nA/  m I_off (p) mA/  m1.0 nA/  m Samsung Si nanowire devices

4 nm diameter ( 425 atoms) 10 nm diameter ( 2341 atoms ) 20 nm diameter ( 8941 atoms ) There may be an optimum diameter in the region of 10 nm ~ 20 nm. Several size of Si nanowires

d=1nm Si21H20 ( 41 atoms ) Eg=2.60eV (LDA Bulk : 0.53eV)  X Band Structure and DOS of SiNW (d=1nm)

d=4nm Si341H84 ( 425 atoms ) Eg=0.81eV (LDA Bulk=0.53eV)  X Band Structure and DOS of SiNW (d=4nm)

 X Si1361H164(1525 atoms), Eg=0.61eV Band Structure and DOS of SiNW (d=8nm)  X Bulk Si Eg=0.53eV

Si12822H1544 ( 14,366 atoms ) ・ 10nm diameter 、 3.3nm height 、 (100) ・ Grid spacing : 0.45Å (~14Ry) ・ # of grid points : 4,718,592 ・ # of bands : 29,024 ・ Memory : 1,022GB ~ 2,044GB Si12822H1544 Top View Side View Si nano wire with surface roughness

PACS-CS1024 nodes ( peak performance : 5.6 GFLOPS/node ) Subspace diagonalization : 4600 sec. Gram-Schmidt : 2300 sec. Conjugate-Gradient Method : 3700 sec. Total Energy calc. : 1200 sec. Total(1 step) : 12,000 sec. DOS of SiNW with roughness DOS of Bulk Si d=10nm ( with roughness ) Si12822H1544(14,366 atoms) Eg=0.57eV

Application3 Si divacancy

Structure of Si divacancy : Small-yellow balls : vacancies (no atoms) Green balls : Si atoms with dangling bonds. There are two possibilities for the structure of Si divacancy. Resonant-Bond type Large-paring type ・ Both “Large-paring” and “Resonant-Bond” structure were found. ・ Large-Paring type is the most stable (RB type is a local minimum) More recent LDA calculation (Oguet et al., 1999) EPR experiment (Watkins & Corbett, 1965) LDA calculation (Saito & Oshiyama, 1994) Large-Paring type Resonant-Bond type is stable (Large-Paring type was not found) What is the stable structure ? Model size ~ 60 atoms Model size ~ 300 atoms →Model Size dependence ?

Si divacancy d ac, d ab (Å) Model size (# of atoms) Large-paring Resonant-Bond Small-Paring Structures converge at 998-atom model. LP structure appears at 510 or larger models. RB structure is most stable, but the energy difference is very small (<10 meV) J.-I. Iwata, et al., Phys. Rev. B 77 (2008) Structure of Si divacancy : Small-yellow balls : vacancies (no atoms) Green balls : Si atoms with dangling bonds. There are two possibilities for the structure of Si divacancy. Resonant-Bond type Large-paring type

We have developed Real-Space DFT program code for large systems by utilizing the massively parallel computers Collaboration with computer scientist much improve the performance of RSDFT (Especially, O(N 3 )-part calculation with BLAS 3) By using a few hundred ~ 1000CPUs, we have achieved the first-principles calculation for ・ Si 1000-atom system with atomic structure optimization ・ Self-Consistent electronic structures of Si 10,000-atom systems By using large atomic models → eliminate the model-size dependence We have applied the RSDFT for nano-meter scale Si materials (SiNW, SiQD) I think the RSDFT becomes an useful tool for future device development Summary