1 CE 530 Molecular Simulation Lecture 21 Histogram Reweighting Methods David A. Kofke Department of Chemical Engineering SUNY Buffalo

Slides:



Advertisements
Similar presentations
Grand Canonical Ensemble and Criteria for Equilibrium
Advertisements

The microcanonical ensemble Finding the probability distribution We consider an isolated system in the sense that the energy is a constant of motion. We.
Review Of Statistical Mechanics
Introduction to Statistical Thermodynamics (Recall)
N.D.GagunashviliUniversity of Akureyri, Iceland Pearson´s χ 2 Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted.
Modern Monte Carlo Methods: (2) Histogram Reweighting (3) Transition Matrix Monte Carlo Jian-Sheng Wang National University of Singapore.
1 CE 530 Molecular Simulation Lecture 8 Markov Processes David A. Kofke Department of Chemical Engineering SUNY Buffalo
Visual Recognition Tutorial
Bose-Einstein distribution function Armed with We calculate Probability to find the state in the grandcanonical ensemble With  we can calculate thermal.
Importance Sampling. What is Importance Sampling ? A simulation technique Used when we are interested in rare events Examples: Bit Error Rate on a channel,
BCOR 1020 Business Statistics Lecture 17 – March 18, 2008.
Error Propagation. Uncertainty Uncertainty reflects the knowledge that a measured value is related to the mean. Probable error is the range from the mean.
Chapter 3 Cumulative Test Sample 30 people spent on books ($)
The Question The Answer P = 94 %. Practical Uses of   To infer  from S x To compare a sample to an assumed population To establish a rejection criterion.
Review of normal distribution. Exercise Solution.
Physics 114: Lecture 15 Probability Tests & Linear Fitting Dale E. Gary NJIT Physics Department.
1 CE 530 Molecular Simulation Lecture 7 David A. Kofke Department of Chemical Engineering SUNY Buffalo
1 Statistical Mechanics and Multi- Scale Simulation Methods ChBE Prof. C. Heath Turner Lecture 11 Some materials adapted from Prof. Keith E. Gubbins:
Prediction concerning Y variable. Three different research questions What is the mean response, E(Y h ), for a given level, X h, of the predictor variable?
Boltzmann Distribution and Helmholtz Free Energy
Introduction to (Statistical) Thermodynamics
880.P20 Winter 2006 Richard Kass Propagation of Errors Suppose we measure the branching fraction BR(Higgs  +  - ) using the number of produced Higgs.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
Simulating the value of Asian Options Vladimir Kozak.
Geo479/579: Geostatistics Ch12. Ordinary Kriging (1)
1 CE 530 Molecular Simulation Lecture 17 Beyond Atoms: Simulating Molecules David A. Kofke Department of Chemical Engineering SUNY Buffalo
1 CE 530 Molecular Simulation Lecture 18 Free-energy calculations David A. Kofke Department of Chemical Engineering SUNY Buffalo
Lecture 19: Free Energies in Modern Computational Statistical Thermodynamics: WHAM and Related Methods Dr. Ronald M. Levy Statistical.
1 CE 530 Molecular Simulation Lecture 6 David A. Kofke Department of Chemical Engineering SUNY Buffalo
1 Lesson 8: Basic Monte Carlo integration We begin the 2 nd phase of our course: Study of general mathematics of MC We begin the 2 nd phase of our course:
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
The Ideal Monatomic Gas. Canonical ensemble: N, V, T 2.
Lecture 2 Forestry 3218 Lecture 2 Statistical Methods Avery and Burkhart, Chapter 2 Forest Mensuration II Avery and Burkhart, Chapter 2.
1 CE 530 Molecular Simulation Lecture 22 Chain-Molecule Sampling Techniques David A. Kofke Department of Chemical Engineering SUNY Buffalo
1 Estimation of Standard Deviation & Percentiles Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY.
1 FEBRUARY 2005 Javier Pérez PHASE DIAGRAM PREDICTION OF ACID GAS MIXTURES BY MOLECULAR SIMULATION Javier Pérez Pellitero †, P. Ungerer ‡, Allan D. Mackie.
Computational Intelligence: Methods and Applications Lecture 23 Logistic discrimination and support vectors Włodzisław Duch Dept. of Informatics, UMK Google:
THEORY The argumentation was wrong. Halting theorem!
© Copyright McGraw-Hill 2000
Physics 114: Lecture 14 Mean of Means Dale E. Gary NJIT Physics Department.
LECTURE 3: ANALYSIS OF EXPERIMENTAL DATA
5. Maximum Likelihood –II Prof. Yuille. Stat 231. Fall 2004.
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Chapter 8. Process and Measurement System Capability Analysis
Statistics for Political Science Levin and Fox Chapter Seven
ES 07 These slides can be found at optimized for Windows)
Quick review of some key ideas CEE 11 Spring 2002 Dr. Amelia Regan These notes draw liberally from the class text, Probability and Statistics for Engineering.
Other Partition Functions
Statistical Mechanics and Multi-Scale Simulation Methods ChBE
1 MONALISA Compact Straightness Monitor Simulation and Calibration Week 2 Report By Patrick Gloster.
R. Kass/W03 P416 Lecture 5 l Suppose we are trying to measure the true value of some quantity (x T ). u We make repeated measurements of this quantity.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Tests of Significance -how to put them in writing.
Pairwise comparisons: Confidence intervals Multiple comparisons Marina Bogomolov and Gili Baumer.
Lesson 8: Basic Monte Carlo integration
Physics 114: Lecture 13 Probability Tests & Linear Fitting
Assistant Professor of Public Policy
Introduction to estimation: 2 cases
Maximum Likelihood Estimation
12. Reweighting Methods.
Inference for Geostatistical Data: Kriging for Spatial Interpolation
Statistical Methods For Engineers
CONCEPTS OF ESTIMATION
Statistical Mechanics and Canonical Ensemble
The Second Order Adjoint Analysis
Grand Canonical Ensemble and Criteria for Equilibrium
Presentation transcript:

1 CE 530 Molecular Simulation Lecture 21 Histogram Reweighting Methods David A. Kofke Department of Chemical Engineering SUNY Buffalo

2 Histogram Reweighting  Method to combine results taken at different state conditions  Microcanonical ensemble  Canonical ensemble  The big idea: Combine simulation data at different temperatures to improve quality of all data via their mutual relation to  (E) Probability of a microstate Probability of an energy Number of microstates having this energy Probability of each microstate

3 In-class Problem 1.  Consider three energy levels  What are Q, distribution of states and at  = 1?

4 In-class Problem 1A.  Consider three energy levels  What are Q, distribution of states and at  = 1?

5 In-class Problem 1A.  Consider three energy levels  What are Q, distribution of states and at  = 1?  And at  = 3?

6 Histogram Reweighting Approach  Knowledge of  (E) can be used to obtain averages at any temperature  Simulations at different temperatures probe different parts of  (E)  But simulations at each temperature provides information over a range of values of  (E)  Combine simulation data taken at different temperatures to obtain better information for each temperature E  (E) 11 22 33

7 In-class Problem 2.  Consider simulation data from a system having three energy levels M = 100 samples taken at  = 0.5 m i times observed in level i  What is  (E)?

8 In-class Problem 2.  Consider simulation data from a system having three energy levels M = 100 samples taken at  = 0.5 m i times observed in level i  What is  (E)? Reminder

9 In-class Problem 2.  Consider simulation data from a system having three energy levels M = 100 samples taken at  = 0.5 m i times observed in level i  What is  (E)? Hint Can get only relative values!

10 In-class Problem 2A.  Consider simulation data from a system having three energy levels M = 100 samples taken at  = 0.5 m i times observed in level i  What is  (E)?

11 In-class Problem 3.  Consider simulation data from a system having three energy levels M = 100 samples taken at  = 0.5 m i times observed in level i  What is  (E)?  Here’s some more data, taken at  = 1 what is  (E)?

12 In-class Problem 3.  Consider simulation data from a system having three energy levels M = 100 samples taken at  = 0.5 m i times observed in level i  What is  (E)?  Here’s some more data, taken at  = 1 what is  (E)?

13 Reconciling the Data  We have two data sets  Questions of interest what is the ratio Q A /Q B ? (which then gives us  A) what is the best value of  1 /  0,  2 /  0 ? what is the average energy at  = 2?  In-class Problem 4 make an attempt to answer these questions

14 In-class Problem 4A.  We have two data sets  What is the ratio Q A /Q B ? (which then gives us  A) Consider values from each energy level  What is the best value of  1 /  0,  2 /  0 ? Consider values from each temperature  What to do?

15 Accounting for Data Quality  Remember the number of samples that went into each value We expect the A-state data to be good for levels 1 and 2 …while the B-state data are good for levels 0 and 1  Write each  as an average of all values, weighted by quality of result

16 Histogram Variance  Estimate confidence in each simulation result  Assume each histogram follows a Poisson distribution probability P to observe any given instance of distribution the variance for each bin is

17 Variance in Estimate of   Formula for estimate of   Variance

18 Optimizing Weights  Variance  Minimize with respect to weight, subject to normalization In-class Problem 5 Do it!

19 Optimizing Weights  Variance  Minimize with respect to weight, subject to normalization Lagrange multiplier  Equation for each weight is  Rearrange  Normalize

20 Optimal Estimate  Collect results  Combine

21 Calculating   Formula for   In-class Problem 6 explain why this formula cannot yet be used

22 Calculating   Formula for   We do not know the Q partition functions  One equation for each   Each equation depends on all   Requires iterative solution

23 In-class Problem 7  Write the equations for each  using the example values

24 In-class Problem 7  Write the equations for each  using the example values  Solution

25 In-class Problem 7A.  Solution  Compare  “Exact” solution  Free energy difference “Design value” = 10 (?)

26 Extensions of Technique  Method is usually used in multidimensional form  Useful to apply to grand-canonical ensemble  Can then be used to relate simulation data at different temperature and chemical potential  Many other variations are possible