Max Baak (CERN) 1 Summary of experiences with HistFactory and RooStats Max Baak (CERN) (on behalf of list of people) RooFit / RooStats meeting January.

Slides:



Advertisements
Similar presentations
Model Evaluation Tools MET. What is MET Model Evaluation Tools ( MET )- a powerful and highly configurable verification package developed by DTC offering:
Advertisements

Lecture 14 User-defined functions Function: concept, syntax, and examples © 2007 Daniel Valentine. All rights reserved. Published by Elsevier.
Part II – TIME SERIES ANALYSIS C3 Exponential Smoothing Methods © Angel A. Juan & Carles Serrat - UPC 2007/2008.
ONYX RIP Version Technical Training General. Overview General Messaging and What’s New in X10 High Level Print and Cut & Profiling Overviews In Depth.
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
I’ll not discuss about Bayesian or Frequentist foundation (discussed by previous speakers) I’ll try to discuss some facts and my point of view. I think.
API Design CPSC 315 – Programming Studio Fall 2008 Follows Kernighan and Pike, The Practice of Programming and Joshua Bloch’s Library-Centric Software.
MINOS Feb Antineutrino running Pedro Ochoa Caltech.
Software Quality Control Methods. Introduction Quality control methods have received a world wide surge of interest within the past couple of decades.
G. Cowan RHUL Physics Comment on use of LR for limits page 1 Comment on definition of likelihood ratio for limits ATLAS Statistics Forum CERN, 2 September,
By: Jamie McPeek. 1. Background Information 1. Metasearch 2. Sets 3. Surface Web/Deep Web 4. The Problem 5. Application Goals.
Automated Tests in NICOS Nightly Control System Alexander Undrus Brookhaven National Laboratory, Upton, NY Software testing is a difficult, time-consuming.
Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)
Unit Testing & Defensive Programming. F-22 Raptor Fighter.
Discovery Experience: CMS Giovanni Petrucciani (UCSD)
From BlueJ to NetBeans SWC 2.semester.
Unit tests, Integration tests Physics tests Andrea Dotti, Gunter Folger, Pere Mato CERN – PH/SFT Geant4 workshop 2012.
What is Sure BDCs? BDC stands for Batch Data Communication and is also known as Batch Input. It is a technique for mass input of data into SAP by simulating.
CSC 213 – Large Scale Programming. Why Do We Test?
Introduction to Hall-D Software February 27, 2009 David Lawrence - JLab.
880.P20 Winter 2006 Richard Kass 1 Confidence Intervals and Upper Limits Confidence intervals (CI) are related to confidence limits (CL). To calculate.
Imperial College Tracker Slow Control & Monitoring.
Business Unit or Product Name © 2007 IBM Corporation Introduction of Autotest Qing Lin.
OHTO -99 SOFTWARE ENGINEERING “SOFTWARE PRODUCT QUALITY” Today: - Software quality - Quality Components - ”Good” software properties.
Software Development Software Testing. Testing Definitions There are many tests going under various names. The following is a general list to get a feel.
Software Project Planning Defining the Project Writing the Software Specification Planning the Development Stages Testing the Software.
Results of combination Higgs toy combination, within and across experiments, with RooStats Grégory Schott Institute for Experimental Nuclear Physics of.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #23.
Introduction Advantages/ disadvantages Code examples Speed Summary Running on the AOD Analysis Platforms 1/11/2007 Andrew Mehta.
15/06/2006 The Future of Visual DCT EPICS Collaboration 2006 Argonne National Laboratory
More About Objects and Methods Chapter 5. Outline Programming with Methods Static Methods and Static Variables Designing Methods Overloading Constructors.
1 Proposal to change limits for exclusion N.Andari, L. Fayard, M.Kado, F.Polci LAL- Orsay Y.Fang, Y.Pan, H.Wang, Sau Lan Wu University of Wisconsin-Madison.
Statistical Methods for Data Analysis Introduction to the course Luca Lista INFN Napoli.
ROOT and statistics tutorial Exercise: Discover the Higgs, part 2 Attilio Andreazza Università di Milano and INFN Caterina Doglioni Université de Genève.
Karsten Köneke October 22 nd 2007 Ganga User Experience 1/9 Outline: Introduction What are we trying to do? Problems What are the problems? Conclusions.
Operator Overloading Week 5.
SEG 4110 – Advanced Software Design and Reengineering Topic T Introduction to Refactoring.
Easy Limit Statistics Andreas Hoecker CAT Physics, Mar 25, 2011.
1 What I did last year Nick Barlow Manchester Christmas meeting January 2005.
G. Cowan, RHUL Physics Statistics for early physics page 1 Statistics jump-start for early physics ATLAS Statistics Forum EVO/Phone, 4 May, 2010 Glen Cowan.
LECTURE 02: EVALUATING MODELS January 27, 2016 SDS 293 Machine Learning.
Project Planning Defining the project Software specification Development stages Software testing.
BlueFin Best Linear Unbiased Estimate Fisher Information aNalysis Andrea Valassi (IT-SDC) based on the work done with Roberto Chierici TOPLHCWG meeting.
Software testing techniques Software testing techniques REGRESSION TESTING Presentation on the seminar Kaunas University of Technology.
1 Studies on Exclusion Limits N.Andari, L. Fayard, M.Kado, F.Polci LAL- Orsay LAL meeting 23/07/2009.
1 Comparing Unbinned likelihood methods IceCube/Antares Common Point source analysis from IC22 & Antares 2007/2008 data sets J. Brunner.
This was written with the assumption that workbooks would be added. Even if these are not introduced until later, the same basic ideas apply Hopefully.
Reliability of Disk Systems. Reliability So far, we looked at ways to improve the performance of disk systems. Next, we will look at ways to improve the.
S. Ferrag, G. Steele University of Glasgow. RooStats and MClimit comparison Exercise to use RooStats by an MClimit-formatted person: – Use two programs.
REGRESSION TESTING Audrius Čėsna IFM-0/2. Regression testing is any type of software testing that seeks to uncover new errors, or regressions, in existing.
AliRoot survey: Calibration P.Hristov 11/06/2013.
Analysis Tools interface - configuration Wouter Verkerke Wouter Verkerke, NIKHEF 1.
Maria Alandes Pradillo, CERN Training on GLUE 2 information validation EGI Technical Forum September 2013.
DESY BT analysis - updates - S. Uozumi Dec-12 th 2011 ScECAL meeting.
ANALYSIS TRAIN ON THE GRID Mihaela Gheata. AOD production train ◦ AOD production will be organized in a ‘train’ of tasks ◦ To maximize efficiency of full.
23 Jan 2012 Background shape estimates using sidebands Paul Dauncey G. Davies, D. Futyan, J. Hays, M. Jarvis, M. Kenzie, C. Seez, J. Virdee, N. Wardle.
Hands-on Session RooStats and TMVA Exercises
The asymmetric uncertainties on data points in Roofit
LOW-COMPLEXITY ARBITRARY SAMPLE-RATE CONVERTER
How To Format Your Note Cards
A PID based approach for antineutrino selection
CMS RooStats Higgs Combination Package
Grégory Schott Institute for Experimental Nuclear Physics
Strategy 6th December 2010 Bill Murray LHC-HCG
TRANSLATORS AND IDEs Key Revision Points.
Phil Tayco Slide version 1.0 Created Oct 2, 2017
Attributes of Information
Environmental Monitoring: Coupling Function Calculator
Peter Seibel Practical Common Lisp Peter Seibel
Presentation transcript:

Max Baak (CERN) 1 Summary of experiences with HistFactory and RooStats Max Baak (CERN) (on behalf of list of people) RooFit / RooStats meeting January 19 th 2012

Max Baak (CERN) 2 Overview of this talk  In past months, have actively started using HistFactory and RooStats code for model/workspace creation and limit setting. In context of Atlas SUSY group Previously, had own RooFit-based code for doing this.  This talk: some feedback on experiences with using this code.  Suggestions in several areas for possible additions and improvements to the code. Additions indicated in Red.  (Some of these suggestions I may already have told or mailed in private.)

Max Baak (CERN) 3 Our ‘HistFitter’ package  Big fan of HistFactory idea: very transparent way of creating one (complex) fit model. Easy for non-RooFit-experts to get started.  Our physics situation: need to deal with a (2D) grid of (O(100)) signal models to test.  ‘HistFitter’ package: built around HistFactory code, with purpose of creating/managing/testing many workspaces in flexible way. Used by group of people, not necessarily RooFit (= model-building) or (= limit-setting) RooStats experts. Possibly useful code for adding to RooStats:  Several classes to program xml input files for HistFactory, through python.  Machinery for bookkeeping many workspaces (one workspace for each signal model) and HypoTest results. Examples follow.

Max Baak (CERN) 4 Example signal model grid  Signal model parameters on x and y axes.  Every point is a grid point with certain (mass) parameter values.

Max Baak (CERN) 5 HypoTestTool  Though many examples available, setting up RooStats tools for limit setting not entirely straight-forward for newcomer.  Suggest that one-line wrapper functions are added with reasonable default settings for doing. This is how my non-RooStats-expert colleagues tend to use the code.  Eg. Hypothesis test at mu=0 or 1 (CL S ) 95% CL upper limit Nice upper limit plot

Max Baak (CERN) 6 HypoTestTool  HypoTestTool: wrapper class to setup several RooStats HypoTestInverters  Class originally written by Gena. Taken from: $ROOTSYS/tutorials/roostats/StandardHypoTestInvDemo.C  Several modifications/additions to HypoTestTool ensure the class: Works stand-alone, Can do both hypothesis test and hypothesis-test inversion, … both for exclusion and discovery  Had to patch ProfileLikelihoodTestStat.cxx to be able to evaluate discovery test statistic (!). (I.e. do fit at both mu=0 and mu=free.) Consistent generation of both background-only and signal+background toys Reasonable default settings (e.g. for upper limit plots)  Usual set of configurable options still available …

Max Baak (CERN) 7 Wrapper functions calling HypoTestTool  Stand-alone one-line functions that should exist in RooStats Perhaps they do, but I’m simply not aware of them Ie. I wrote these, calling HypoTestTool …  Double get_Freq_CLs( RooWorkspace* w )  Double get_Asymp_CLs( RooWorkspace* w )  Double get_Freq_pValue( RooWorkspace* w )  HypoTestResult* DoHypoTest( RooWorkspace* w )  HypoTestInverterResult* DoHypoTestInversion( RooWorkspace* w ) I imagine this is how most non-experts will like to run RooStats...

Max Baak (CERN) 8 ToyMCSampler  Slightly black-box feeling to it. (For me at least.)  Would be great of ToyMCSampler class can return set of fit results for set of toys that are generated and fit.  … ala existing RooMCStudy functionality.  Vital for every analysis to understand toy MC behaviour. Especially if unconditional ensemble of toys is non-trivial to generate.  (E.g. what is happening with failed fits in case of limit setting?)  Wrote some simple bit of code to do this externally. (Slow.)  CERN has technical student in next months who can help implement this in ToyMCSampler …

Max Baak (CERN) 9 HypoTestResult  Some more info that could (imo should) be added to this class:  RooFitResult(s!) of evaluated data test statistic  CLs expected, CLs ±1 and ±2 sigma uncertainty values Right now I had to get those from HypoTestInverterResult, evaluated at just one mu value.  Fractions of failed toy fits.  Is it possible to add up 2 HypoTestResults, resulting in more precise summed HypoTestResult ? Have my own LimitResult class that contains / can do all this. Useful to add?

Max Baak (CERN) 10 Conversion HypoTestInverterResult  Eventually, need to turn grid of hypothesis tests in contour (exclusion) plot on right-hand side. Observed, expected, ±1 sigma uncertainty band contours …  Tool for converting set of HypoTestInverterResults into TTree, which can then be used for plotting purposes. Input file contains many hypotest results with specific name format.  TString outputfile = CollectAndWriteHypoTestResults( inputfile, “hypo_SU_%f_%f_0_10”, “m0:m12”, “m12>600” ) ;

Max Baak (CERN) 11 Minor fixes and annoyances  HistFactory: Gauss sigma settable (not a constant)  HistFactory: stat error pdf only included when rel. error > X  HistFactory: needs to have POI set – crashes otherwise.  HistFactory: always add luminosity pdf, even  To make PLL test statistic work in discovery mode: initial_mu > 0