Development of a 3-D tight-binding-based electronic structure simulator for multi-million atom systems NEMO 3-D Gerhard Klimeck, Fabiano Oyafuso, Paul.

Slides:



Advertisements
Similar presentations
Outline Memory characteristics SRAM Content-addressable memory details DRAM © Derek Chiou & Mattan Erez 1.
Advertisements

GWDAW 16/12/2004 Inspiral analysis of the Virgo commissioning run 4 Leone B. Bosi VIRGO coalescing binaries group on behalf of the VIRGO collaboration.
Anodic Aluminum Oxide.
Beowulf Supercomputer System Lee, Jung won CS843.
Nathan S. Lewis George L. Argyros Professor of Chemistry California Institute of Technology with George Crabtree, Argonne NL Arthur Nozik, NREL Mike Wasielewski,
Network for Computational Nanotechnology (NCN) UC Berkeley, Univ.of Illinois, Norfolk State, Northwestern, Purdue, UTEP First-time User Guide for Piecewise.
Nanofabrication Breakout Session Results. Vision Elements Ability to fabricate, by directed or self assembly methods, functional structures or devices.
Coulomb Interaction in quantum structures - Effects of
Electrical Techniques MSN506 notes. Electrical characterization Electronic properties of materials are closely related to the structure of the material.
Future of Computation with Electronic Nanotechnogy Presented By Shubhra Karmakar.
Gerhard Klimeck Applied Cluster Computing Technologies Group Quantum and semi-classical transport in RTDs using NEMO 1-D Gerhard Klimeck Jet Propulsion.
Studies of Minority Carrier Recombination Mechanisms in Beryllium Doped GaAs for Optimal High Speed LED Performance An Phuoc Doan Department of Electrical.
Introduction Lecture 1 CSCI 1405, CSCI 1301 Introduction to Computer Science Fall 2009.
PARALLEL PROCESSING The NAS Parallel Benchmarks Daniel Gross Chen Haiout.
Future Computers CSCI 107, Spring When Moore’s law runs out of room When transistors become only tens of atoms thick –Quantum mechanics applies.
INTRODUCTION TO NANOTECHNOLOGY
Modeling of Energy States of Carriers in Quantum Dots
ELE 523E COMPUTATIONAL NANOELECTRONICS W1: Introduction, 8/9/2014 FALL 2014 Mustafa Altun Electronics & Communication Engineering Istanbul Technical University.
IWCE, Purdue, Oct , 2004 Seungwon Lee Exchange Coupling in Si-Quantum-Dot-Based Quantum Computer Seungwon Lee 1, Paul von Allmen 1, Susan N. Coppersmith.
Lecture on Electronic Memories. What Is Electronic Memory? Electronic device that stores digital information Types –Volatile v. non-volatile –Static v.
TPB Models Development Status Report Presentation to the Travel Forecasting Subcommittee Ron Milone National Capital Region Transportation Planning Board.
“High-performance computational GPU-stand for teaching undergraduate and graduate students the basics of quantum-mechanical calculations“ “Komsomolsk-on-Amur.
Efficient solution algorithm of non-equilibrium Green’s functions in atomistic tight binding representation Yu He, Lang Zeng, Tillmann Kubis, Michael Povolotskyi,
NanotechnologyNanoscience Modeling and Simulation Develop models of nanomaterials processing and predict bulk properties of materials that contain nanomaterials.
InAs on GaAs self assembled Quantum Dots By KH. Zakeri sharif University of technology, Spring 2003.
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Nanoscale Electronics / Single-Electron Transport in Quantum Dot Arrays Dene Farrell SUNY.
 Design model for a computer  Named after John von Neuman  Instructions that tell the computer what to do are stored in memory  Stored program Memory.
Applications of Quantum Physics
TECHNICAL SEMINAR ON TECHNOLOGIES AND DESIGNS FOR ELECTRONIC NANOCOMPUTERS PRESENTED BY : BIJAY KUMAR XESS ADMN NO : 4 I&E/2K.
Presented By : LAHSAINI Achraf & MAKARA Felipe.  Introduction  Difficult Challenges : > Difficult Challenges between 2013 – 2020 > Difficult Challenges.
2005 Materials Computation Center External Board Meeting The Materials Computation Center Duane D. Johnson and Richard M. Martin (PIs) Funded by NSF DMR.
Network for Computational Nanotechnology (NCN) Purdue, Norfolk State, Northwestern, MIT, Molecular Foundry, UC Berkeley, Univ. of Illinois, UTEP Multi.
Page 1 Reconfigurable Communications Processor Principal Investigator: Chris Papachristou Task Number: NAG Electrical Engineering & Computer Science.
Nano-electronics Vision: Instrumentation and methods for analysis of atomic scale physical properties, and methods to correlate these properties with nano-electronic.
ENE 311 Lecture 9.
Network for Computational Nanotechnology (NCN) UC Berkeley, Univ.of Illinois, Norfolk State, Northwestern, Purdue, UTEP Generation of Empirical Tight Binding.
Parallel and Distributed Simulation Introduction and Motivation.
Recombination Dynamics in Nitride Heterostructures: role of the piezoelectric field vs carrier localization A.Vinattieri, M.Colocci, M.Zamfirescu Dip.Fisica-
1 Modeling and Simulation International Technology Roadmap for Semiconductors, 2004 Update Ashwini Ujjinamatada Course: CMPE 640 Date: December 05, 2005.
Network for Computational Nanotechnology (NCN) SungGeun Kim Purdue, Norfolk State, Northwestern, MIT, Molecular Foundry, UC Berkeley, Univ. of Illinois,
Computer Organization & Assembly Language © by DR. M. Amer.
Computing Environment The computing environment rapidly evolving ‑ you need to know not only the methods, but also How and when to apply them, Which computers.
Nanoscale Electrochemical Switches Dr. James G. Kushmerick OFFON.
Organization Introduction Simulation Approach Results and Discussion
Gerhard Klimeck High Performance Computing Group Software Development at Academia, Government Labs, Industry John Barker Agenda/Questions: How can the.
The Distributed Data Interface in GAMESS Brett M. Bode, Michael W. Schmidt, Graham D. Fletcher, and Mark S. Gordon Ames Laboratory-USDOE, Iowa State University.
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
Deepak Srivastava Computational Nanotechnology at CSC/NAS NASA Ames Research Center Moffett Field, CA Collaborators: M. Menon – University of Kentucky.
“NanoElectronics Modeling tool – NEMO5” Jean Michel D. Sellier Purdue University.
Introduction to HCI Lecture #1.
Succeeding with Technology Chapter 2 Hardware Designed to Meet the Need The Digital Revolution Integrated Circuits and Processing Storage Input, Output,
Ohio State University Department of Computer Science and Engineering Servicing Range Queries on Multidimensional Datasets with Partial Replicas Li Weng,
Advanced Computing and Information Systems laboratory Nanocomputing technologies José A. B. Fortes Dpt. of Electrical and Computer Eng. and Dpt. of Computer.
Electromagnetically biased Self-assembly
Network for Computational Nanotechnology (NCN) Gerhard Klimeck Berkeley, Univ. of Florida, Univ.of Illinois, Norfolk State, Northwestern, Purdue, Stanford,
3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-1.
Multicore Applications in Physics and Biochemical Research Hristo Iliev Faculty of Physics Sofia University “St. Kliment Ohridski” 3 rd Balkan Conference.
MOLETRONICS An Invisible technology Amit Dwivedi Ec 3rd Year
SPRING 2012 Assembly Language. Definition 2 A microprocessor is a silicon chip which forms the core of a microcomputer the concept of what goes into a.
Contact Resistance Modeling and Analysis of HEMT Devices S. H. Park, H
Single-molecule transistors: many-body physics and possible applications Douglas Natelson, Rice University, DMR (a) Transistors are semiconductor.
A modified hyperplane clustering algorithm allows for efficient and accurate clustering of extremely large datasets Ashok Sharma, Robert Podolsky, Jieping.
Contact Resistance Modeling in HEMT Devices
Band Structure Lab with NEMO5 Yi Shen, Nicolás Esquivel Camacho, Michael Povolotskyi ,and Gerhard Klimeck Approach: The communication between Rappture.
OMEN: a Quantum Transport Modeling Tool for Nanoelectronic Devices
Goals for Today: Syllabus Review
COMPUTERS IN CRISIS Computers have not changed their basic approach since the beginning (von Neumann binary) Conventional computers are commodity items.
Development of the Nanoconfinement Science Gateway
Multiscale Modeling and Simulation of Nanoengineering:
Husky Energy Chair in Oil and Gas Research
Presentation transcript:

Development of a 3-D tight-binding-based electronic structure simulator for multi-million atom systems NEMO 3-D Gerhard Klimeck, Fabiano Oyafuso, Paul von Allmen Jet Propulsion Laboratory, Caltech Tim Boykin, U Alabama in Huntsville gekco@jpl.nasa.gov, 818-354-2182 http://hpc.jpl.nasa.gov/PEP/gekco This research was carried out by at the Jet Propulsion Laboratory, California Institute of Technology under a contract with the National Aeronautics and Space Administration.

Presentation Outline Introduction / Motivation NASA Motivation, Critical look at the SIA Roadmap. Nano-scale device examples. What is a quantum dot? Modeling agenda. Software Issues Problem size. Parallel computing. Graphical user interfaces. Physics modeling results Strain. Alloy Disorder. Interface Interdiffusion. Conclusion / Outlook

Limit of Military Interest Limit of Commercial Interest NASA radiation and temperature requirements are outside commercial and military interest

Technology Push Toward Fundamental Limitations Commercial market pushes computing (FLOPS/weight/power): Enabled by device miniaturization chip size increase Limited by: Costs of fabrication Discrete atoms/electrons Moore’s Law for Lithography 2D Feature 1-D feature 5-100 Å 2-D Lithography Growth

Technology Push Toward Fundamental Limitations Commercial market pushes computing (FLOPS/weight/power): Enabled by device miniaturization chip size increase Limited by: Costs of fabrication Discrete atoms/electrons Additional NASA Requirements: High radiation tolerance Extreme temperature operation-hot/cold Moore’s Law for Lithography 2D Feature 1-D feature 5-100 Å 2-D Lithography Growth

Technology Push Toward Fundamental Limitations Commercial market pushes computing (FLOPS/weight/power): Enabled by device miniaturization chip size increase Limited by: Costs of fabrication Discrete atoms/electrons Additional NASA Requirements: High radiation tolerance Extreme temperature operation-hot/cold 1-D feature 5-100 Å 2-D Lithography Growth

Technology Push Toward Fundamental Limitations Commercial market pushes computing (FLOPS/weight/power): Enabled by device miniaturization chip size increase Limited by: Costs of fabrication Discrete atoms/electrons Additional NASA Requirements: High radiation tolerance Extreme temperature operation-hot/cold Quantum Dots Detectors / lasers Memory and logic 1-D feature 5-100 Å 2-D Lithography Growth Quantum dots go beyond the SIA roadmap and enable near and long term NASA applications

What is a Quantum Dot ? Basic Application Mechanisms Physical Structure: Well conducting domain surrounded in all 3 dim. by low conducting region(s) Domain size on the nanometer scale Electronic structure: Contains a countable number of electrons Electron energy may be quantized -> artificial atoms (coupled QD->molecule) The objective for future work is to model the devices which are the building blocks for high-speed computation, high-density memories and wide-band communication systems. Our approach is to expand NEMO to include 2-dimensional and 3-dimensional modeling, high-frequency modeling, and optical interactions. The impact will be accelerated technology development. The figures illustrate the three major areas of future development 2-dimensional modeling for ultra-scaled MOS and HFETs 3-dimensional modeling for quantum dots, single electron transistors (SETs), and molecular devices Optical and high frequency modeling for high-speed, high-bandwidth communications

What is a Quantum Dot ? Basic Application Mechanisms Physical Structure: Well conducting domain surrounded in all 3 dim. by low conducting region(s) Domain size on the nanometer scale Electronic structure: Contains a countable number of electrons Electron energy may be quantized -> artificial atoms (coupled QD->molecule) The objective for future work is to model the devices which are the building blocks for high-speed computation, high-density memories and wide-band communication systems. Our approach is to expand NEMO to include 2-dimensional and 3-dimensional modeling, high-frequency modeling, and optical interactions. The impact will be accelerated technology development. The figures illustrate the three major areas of future development 2-dimensional modeling for ultra-scaled MOS and HFETs 3-dimensional modeling for quantum dots, single electron transistors (SETs), and molecular devices Optical and high frequency modeling for high-speed, high-bandwidth communications

What is a Quantum Dot ? Basic Application Mechanisms Physical Structure: Well conducting domain surrounded in all 3 dim. by low conducting region(s) Domain size on the nanometer scale Electronic structure: Contains a countable number of electrons Electron energy may be quantized -> artificial atoms (coupled QD->molecule) Photon Emission Tunneling/Transport Occupancy of states Photon Absorption The objective for future work is to model the devices which are the building blocks for high-speed computation, high-density memories and wide-band communication systems. Our approach is to expand NEMO to include 2-dimensional and 3-dimensional modeling, high-frequency modeling, and optical interactions. The impact will be accelerated technology development. The figures illustrate the three major areas of future development 2-dimensional modeling for ultra-scaled MOS and HFETs 3-dimensional modeling for quantum dots, single electron transistors (SETs), and molecular devices Optical and high frequency modeling for high-speed, high-bandwidth communications Detectors/ Input Lasers/ Output Logic / Memory

What is a Quantum Dot ? Basic Application Mechanisms Physical Structure: Well conducting domain surrounded in all 3 dim. by low conducting region(s) Domain size on the nanometer scale Electronic structure: Contains a countable number of electrons Electron energy may be quantized -> artificial atoms (coupled QD->molecule) Photon Emission Tunneling/Transport Occupancy of states Photon Absorption The objective for future work is to model the devices which are the building blocks for high-speed computation, high-density memories and wide-band communication systems. Our approach is to expand NEMO to include 2-dimensional and 3-dimensional modeling, high-frequency modeling, and optical interactions. The impact will be accelerated technology development. The figures illustrate the three major areas of future development 2-dimensional modeling for ultra-scaled MOS and HFETs 3-dimensional modeling for quantum dots, single electron transistors (SETs), and molecular devices Optical and high frequency modeling for high-speed, high-bandwidth communications Detectors/ Input Lasers/ Output Logic / Memory Quantum dots are artificial atoms that can be custom designed for a variety of applications

Nanotechnology / Nanoelectronic Example Implementations Self-assembled , InGaAs on GaAs. Pyramidal or dome shaped R.Leon,JPL(1998) JPL Application: IR Sensors Nanotube Arrays, Jimmy Xu, Brown Univ. (1999) JPL Applications: Transducers, filters Nanocrystals: Si implanted in SiO2 Atwater, Caltech (1996) JPL Applications: Non-volatile Memory Molecular Dots Ruthenium-based molecule Ru4(NH3)16(C4H4N2)410+ proposed by Marya Lieberman, Notre Dame (1999) Computing Appl. The objective for future work is to model the devices which are the building blocks for high-speed computation, high-density memories and wide-band communication systems. Our approach is to expand NEMO to include 2-dimensional and 3-dimensional modeling, high-frequency modeling, and optical interactions. The impact will be accelerated technology development. The figures illustrate the three major areas of future development 2-dimensional modeling for ultra-scaled MOS and HFETs 3-dimensional modeling for quantum dots, single electron transistors (SETs), and molecular devices Optical and high frequency modeling for high-speed, high-bandwidth communications Low Dimensional quantum confinement can be achieved in a variety of material systems

Need for Nanoelectronic Simulation Problems: Design space is huge Choice of materials, shapes, orientations, dopings, heat anneals Characterizations are incomplete and invasive / destructive Simulation Impact: Aide Design Fast, cost effective. -> Device performance successful for 1-D quantum devices Aide Characterization Non-invasive More accurate -> Structure and doping analysis Simulation Characterization Fabrication Modeling, Characterization and Fabrication are inseparable for nano-scale devices

Nano-scale Device Analysis / Synthesis Development of a Bottom-Up Nanoelectronic Modeling Tool Atomic Orbitals size: 0.2nm Nanoscale Structures (~20nm) Assertions / Problems: Nanoscale structures are built today! The design space is huge: choice of materials, compositions, doping, size, shape Radiation on today’s sub-micron devices modifies the electronics on a nanoscale. Approach: Deliver a 3-D atomistic simulation tool Enable analysis of arbitrary crystal structures, particles, atom compositions and bond/structure at arbitrary temperatures and ambient electric and magnetic fields. Collaborators: U. of Alabama, Ames, Purdue, Ohio State, NIST

Nano-scale Device Analysis / Synthesis Development of a Bottom-Up Nanoelectronic Modeling Tool New Devices for Sensing and Computing Analyze Devices: Environment and Failures Atomic Orbitals size: 0.2nm Nanoscale Structures (~20nm) Assertions / Problems: Nanoscale structures are built today! The design space is huge: choice of materials, compositions, doping, size, shape Radiation on today’s sub-micron devices modifies the electronics on a nanoscale. Approach: Deliver a 3-D atomistic simulation tool Enable analysis of arbitrary crystal structures, particles, atom compositions and bond/structure at arbitrary temperatures and ambient electric and magnetic fields. Collaborators: U. of Alabama, Ames, Purdue, Ohio State, NIST NASA Relevance: Enable new devices needed for NASA missions beyond existing industry roadmap: Water detection -> 2-5mm Lasers and detectors. Avionics -> High density, low power computing. Analyze state-of-the-art devices for non-commercial environments: Europa -> Radiation and low temperature effects. Aging and failure modes. Jovian system -> Magnetic field effects Venus -> high temperature materials: SiGe Impact: Low cost development of revolutionary techn. Modeling will narrow the empirical search space!

Presentation Outline Introduction / Motivation NASA motivation, Critical look at the SIA Roadmap. Nano-scale device examples. What is a quantum dot? Modeling agenda. Software Issues Problem size. Parallel computing. Graphical user interfaces. Physics modeling results Strain. Alloy Disorder. Interface Interdiffusion. Conclusion / Outlook

How big is a realistic problem? How many atoms? Rule of thumb: 43 atoms/nm3 1 quantum dot: 40x40x15nm3 -> 1 million atoms 2x2 array of dots: 90x90x15nm3 -> 5.2 million atoms 70x70x70nm3 cube of Si -> 15 million atoms What is the Basis? 5 orbitals (sp3s*), 2 spins -> basis=10 10 orbitals (sp3d5s*), 2 spins -> basis 20 How big are the matrices? Atoms x basis ~ 107x107, sparse 4 neighbors/atom Storage=106x5x(20x20)x16bytes/2=16GB Basis States Impurity in quantum dot GaAs Non-ideal interface AlAs A deca-nano device contains >1 Million Atoms

Four Generations of Cluster Experience NewYork (2002) 66 Xserve G4 1GHz 1GB RAM per node 33 GB total 60 GB Disc per node 2 TB total 100 Mb/s ethernet crossbar MAC OS X, MPI 495GFlops Pluto (2001) 64 Pentium IIIs 800MHz dual CPUs 2 GB RAM per node 64 GB total 10 GB Disc per node 320 GB total 2 Gb/s Myricom crossbar Linux, MPI 51.2 GFlops Nimrod (1999) 32 Pentium IIIs 450MHz 512 MB RAM per node 16 GB total 8GB Disc per node 256 GB total 100 Mb/s ethernet crossbar Linux, MPI 14.4 GFlops Hyglac (1997) 16 Pentium Pros 200MHz 128 MB RAM per node 2 GB total 5GB Disc per node 80 GB total 100 Mb/s ethernet crossbar Linux, MPI 3.2GFlops Gordon Bell Prize 1997

Parallelization of NEMO 3-D Divide Simulation domain into slices. Communication only from one slice to the next (nearest neighbor) Communication overhead across the surfaces of the slices. Limiting operation: sparse matrix-vector multiplication Enable Hamiltonian storage or re-computation on the fly.

Algorithm scales very nicely on commodity cluster (Beowulf) Code Parallelization Problem: Need to calculate eigenvalues of a complex matrix of the order of 40 million. => must parallelize code Approach: Evaluate 2 parallel programming paradigms Shared memory (OpenMP) - CPUs can access the same memory. Distributed memory - CPUs exchange data through messages (MPI) - data synchronization performed explicitly by program. Vision: Utilize a designated Beowulf cluster of PC’s as a workhorse for these simulations. Each node might have 1-4 shared memory CPUs on one motherboard. Envision a “mixed” code with outer level MPI parallelism and inner level OpenMP parallelism. This will run on a commercial supercomputer like an SGI Origin 2000 as well as a beowulf. Results: Inner level OpenMP parallelism does not speed up code significantly. Dynamic creation and destruction of threads is too expensive. Decided to abandon the OpenMP implementation and concentrate on the optimization and scaling of the MPI version. Impact: Enabled simulation of 2 million atom systems with 20 orbitals on each atom => matrix of order 40million 30 Lanczos Iterations (sparse matrix-vector multiply) Algorithm scales very nicely on commodity cluster (Beowulf)

Parallel Eigenvalue Solver on a Beowulf (32 node, dual CPU Pentium III, 800MHz, 64GB RAM, Linux)

Parallel Eigenvalue Solver on a Beowulf (32 node, dual CPU Pentium III, 800MHz, 64GB RAM, Linux) Computation scales almost linear with system size Eigenvalue computation ranging from 1/4 to 16 million atoms Hamiltonian storage provides speed-up of >3x If the Hamiltonian is stored the maximum system size is reduced to about 8 million atoms Dual CPU Intel Pentium III has serious memory latency problems.

Comparison of 2 Beowulf Generations 2.5 Years old: 32 CPU, 450 MHz Pentium III, 500MB RAM each CPU 1.5 Years old: 64 CPU (32 nodes), 933 MHz Pentium III, 1GB RAM each node Brand new: 128 CPU (64 nodes), 2.2GHz P4, 1GB RAM each node (RESULTS not in plot) 450MHz and 933MHz results: Plot total CPU cycles in (time x MHz rating) Our problem remains CPU limited, small communication overhead No Problem feeding the memory to the CPUs. Doubling the CPU Frequency, doubled the speed on slower machines only, 2.2GHz does not scale as nice

Comparison of 2 Computing Platforms SGI Origin 2000, 128 CPUs, 300MHz R12000, 2GB RAM per 4 CPUs (4 years old) Beowulf, 64 CPUs, 933MHz Pentium PIII, 1GB per 2 CPUs (1.5 years old) If matrix is recomputed on the fly: Beowulf ~8x faster If matrix is stored: Beowulf is ~ 2.5x faster SGI very fast memory access Cluster of commodity PC’s can beat a supercomputer for our problem

Parallelization of Strain Calculation Problem: Serial strain computation: ~43 min. Serial electronic structure calculation: ~ 9 hours Parallel electronic structure computation on 20 CPUs: ~30 min. Solution: Parallelize strain calculation as well Result: Reduce time to 2-5 minutes on a parallel machine. See difference between a fast 2Gbps and a 100Mbps network. Do not see that difference in the electronic structure calculation. Parallel strain computation is more communication dependent than the electronic structure calculation.

Hierarchical Ordering of User Input Fact of Life: Typical simulators have a huge wash list of parameters. Problem: What parameters are really needed? What is the dependence between these parameters? Approach: Parent/child related hierarchical ordering Dynamic window generation. Semi-classical Potential Specify desired outputs Quantum region: “Where are wave-functions?” Non-equilibrium region: “Where are the reservoirs?” Quantum Potential exchange & correlation? how to go from bias to bias? Specify desired outputs Quantum region: “Where are wave-functions?” Non-equilibrium region: “Where are the reservoirs?” Quantum Charge region: “Where is the charge quantum mechanically calculated?”

Generic Data Structure I/O Dynamic GUI Design. data structure member descriptor -> I/O for GUI or files Create Read Data Structure PotType potential real hbarovertau Boolean Ec RangeStruct NonEq Graphical User Interface Translator File/Batch User Interface Read potential=Hartree hbarovertau=0.0066 Ec=FALSE < start=45, end=69 > Create Theorist Software Engineer Flexible software design enables use in various different simulators

XML Based Generic Data Structure I/O C++ data structures Read Create Create Read XML Structure PotType potential real hbarovertau Boolean Ec RangeStruct NonEq Graphical User Interface Translator File/Batch User Interface Read potential=Hartree hbarovertau=0.0066 Ec=FALSE < start=45, end=69 > Create Theorist

Presentation Outline Introduction / Motivation NASA Motivation, Critical look at the SIA Roadmap. Nano-scale device examples. What is a quantum dot? Modeling agenda. Software Issues Problem size. Parallel computing. Graphical user interfaces. Physics modeling results Strain. Alloy Disorder. Interface Interdiffusion. Conclusion / Outlook

Small strain has dramatic effects on the electronic structure. Mechanical Strain Calculations Problem: Self-assembly dot formation due to strain Small mechanical strain (5% bond length) -> dramatic effects on electronic structures Approach: Nanomechanical strain calculation Nanoelectronic strain calculation. Pyramidal InAs Dot Simulation Orbital overlap changes bandgaps and masses Base: 7nmx7nm Height: 3nm Embedded in GaAs 1.02eV Eeh 1.33eV Unstrained Dot Strained Dot Results: Implemented a mechanical strain model. Implemented atomistic bandstructure model that comprehends strain. Impact: Can simulate realistic quantum dots. Can estimate optical transition energies. Mechanics Problem: Minimize elastic strain (Keating) Rst i j Equilibrium Strained Electronics Problem: Effect of overlap changes Ga In As Small strain has dramatic effects on the electronic structure.

Alloy Disorder in Quantum Dots Problem: Cations are randomly distributed in alloy dots. Does alloy disorder limit electronic structure uniformity for dot ensembles? Approach: Simulate a statistical ensemble of alloyed dots. Requires atomistic simulation tool. Results: Simulated 50 dots with random cation distributions. Inhomogeneous broadening factor of 9.4meV due to alloy disorder. Impact: Fundamental uniformity limit for ensemble of alloy-based quantum dots. Eeh=1.05eV G=0.1-5meV Simulation of Alloy Dot Ensemble Measured G=34.6 meV (R. Leon, PRB, 58, R4262) 5meV Theoretical Lower Limit In0.6Ga0.4As Lens Shaped Dot (Diameter=30nm, Height=5nm, GaAs embedded) In and Ga atoms are randomly distributed Inhomogeneous Broadening? Alloy disorder presents a theoretical lower limit on optical linewidths

We are looking for motivated people!!! Conclusions / Future Vision Parallelization (16 million atoms), Graded junctions, alloy disorder, strain Made significant progress towards a general atomistic simulation tool Envision this tool to have impact on quantum dots, end of SIA roadmap issues, and molectronics. Quantum Dots Grading Atomistic Simulation Graded Abrupt We are looking for motivated people!!! CS, EE, Phys At all degree levels!!!! Modeling is an Integral Part of Nano-Science Transport in Molecules Carbon Nanotubes DNA End of SIA Roadmap (Ba,Sr)TiO3 TiO2 Dopant Fluctuations in Ultra-scaled CMOS Electron Transport in Exotic Dielectrics

Interdiffusion widens the bandgap => blueshift Atomistic Grading Simulation Problem: Quantum dot interfaces may not be sharp. There may be cation redistribution around the interface => grading of the concentration. How does the interfacial grading affect the electronic structure? Approach: Simulate quantum dot atomistically with graded interfaces as a function of interdiffusion length. Results: More Ga in the quantum dot raises the energy of the transition energies. Less Ga in the barriers softens the barriers, reduces the binding of the excited states to the quantum dot and reduces DE=E2-E1. Impact: Verify experimentally suggested interdiffusion process may be responsible for blue shift and reduction in DE. Pyramidal InAs in GaAs, Diameter=10nm, Height=4.2nm 5 samples per data point Cartoon Visualization of Interdiffusion Slice through 2 Qdots with thickness of 3 atoms - with and without interdiffusion Ga In As Interdiffusion widens the bandgap => blueshift

Progressive Spacecraft Miniaturization Cassini 1000 kg Mars Pathfinder Lewis Clark Solar Probe NEAR Pluto/Kuiper Express 100 kg Mars 98 Lander/Orbiter Spacecraft Mass Europa Orbiter Stardust 10 kg “Microspacecraft” Past Present Future Low weight, low power and high efficiency Have a special meaning to NASA

Motivation Penetrators Atmospheric Probes Hydrobot Minaturized-Spacecraft Our enthusiasm for Nano-technology stems from its potential value in addressing Deep Space technology needs: Autonomous navigation and maneuvering, Miniature in-situ sensors, Radiation and temperature tolerant electronics. Miniaturized-Rovers Integrated Inflatable Sailcraft Distributed Sensors Landing on Small Bodies Nano-technology will provide essential computing and sensing capabilities.

Parallel Eigenvalue Solver on a Beowulf (32 node, dual CPU Pentium III, 800MHz, Linux) Eigenvalue computation ranging from 1/4 to 16 million atoms Problems are too big for a single CPU (memory requirements) sp3s* basis set Matrix sizes up to 1.6 108x1.6 108 Recompute Hamiltonian matrix on the fly. Measure time for 30 Lanczos iterations, Full problem 1000-5000 iterations Solver behaves almost linear in system size!!!