Analysis Tools interface - configuration Wouter Verkerke Wouter Verkerke, NIKHEF 1.

Slides:

Advertisements

Similar presentations

Bayesian tools for analysing and reducing uncertainty Tony OHagan University of Sheffield.

Advertisements

Automating Software Module Testing for FAA Certification Usha Santhanam The Boeing Company.

10 Software Engineering Foundations of Computer Science ã Cengage Learning.

Current limits (95% C.L.): LEP direct searches m H > GeV Global fit to precision EW data (excludes direct search results) m H < 157 GeV Latest Tevatron.

1 B-tagging meeting overview Li bo Shandong University.

14 Sept 2004 D.Dedovich Tau041 Measurement of Tau hadronic branching ratios in DELPHI experiment at LEP Dima Dedovich (Dubna) DELPHI Collaboration E.Phys.J.

The Comparison of the Software Cost Estimating Methods

Jörgen Sjölin Stockholm University LHC experimental sensitivity to CP violating gtt couplings November, 2002 Page 1 Why CP in gtt? Standard model contribution.

Summary of Results and Projected Precision Rediscovering the Top Quark Marc-André Pleier, Universität Bonn Top Quark Pair Production and Decay According.

Top Turns Ten March 2 nd, Measurement of the Top Quark Mass The Low Bias Template Method using Lepton + jets events Kevin Black, Meenakshi Narain.

Kevin Black Meenakshi Narain Boston University

Introduction to Software Design Chapter 1. Chapter 1: Introduction to Software Design2 Chapter Objectives To become familiar with the software challenge.

Event Reweighting Tools Contents: ● Goals ● Reweighting Packages ● Usage & Reweight Friendly Packages ● Validation ● Caveats.

Search for resonances The fingerprints of the Top Quark Jessica Levêque, University of Arizona Top Quark Mass Measurement Top Turns Ten Symposium, Fermilab,

Top mass in t-tbar  6jets Status Report Physics and Astronomy University of Victoria British Columbia, Canada Top meeting CERN, 21 Feb 2007 Michel Lefebvre.

Iterative development and The Unified process

Chapter 1 Program Design

Role and Place of Statistical Data Analysis and very simple applications Simplified diagram of a scientific research When you know the system: Estimation.

1 Seventh Lecture Error Analysis Instrumentation and Product Testing.

Role and Place of Statistical Data Analysis and very simple applications Simplified diagram of a scientific research When you know the system: Estimation.

Chapter 9 Flashcards. measurement method that uses uniform procedures to collect, score, interpret, and report numerical results; usually has norms and.

Robin McDougall, Ed Waller and Scott Nokleby Faculties of Engineering & Applied Science and Energy Systems & Nuclear Science 1.

The Project AH Computing. Functional Requirements  What the product must do!  Examples attractive welcome screen all options available as clickable.

Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)

RUP Implementation and Testing

E. Devetak - LCWS t-tbar analysis at SiD Erik Devetak Oxford University LCWS /11/2008 Flavour tagging for ttbar Hadronic ttbar events ID.

©Ian Sommerville 2000, Mejia-Alvarez 2009 Slide 1 Software Processes l Coherent sets of activities for specifying, designing, implementing and testing.

The Adoption of METIS GSBPM in Statistics Denmark.

Irakli Chakaberia Final Examination April 28, 2014.

Event View G. Watts (UW) O. Harris (UW). Philosophy EventView Goals Object Identification & Interpretation Is that a jet or an electron? Is that jet a.

Generic Approaches to Model Validation Presented at Growth Model User’s Group August 10, 2005 David K. Walters.

1 A Preliminary Model Independent Study of the Reaction pp  qqWW  qq ℓ qq at CMS  Gianluca CERMINARA (SUMMER STUDENT)  MUON group.

Page 1 Charles Plager LJ+MET, March 23, 2009 Charles Plager UCLA LJ+MET Meeting March 23, 2008 “Throwing PEs” and More.

Current and Future Applications of the Generic Statistical Business Process Model at Statistics Canada Laurie Reedman and Claude Julien May 5, 2010.

Progress on H/Abb -> 4b’s channel for the FTK physics case ~ 4jets Trigger w/ and w/o FTK ~ Kohei Yorita Young-Kee Kim University of the FTK.

1 Methods of Experimental Particle Physics Alexei Safonov Lecture #23.

BPA M&V Protocols Overview of BPA M&V Protocols and Relationship to RTF Guidelines for Savings and Standard Savings Estimation Protocols.

G.Corti, P.Robbe LHCb Software Week - 19 June 2009 FSR in Gauss: Generator’s statistics - What type of object is going in the FSR ? - How are the objects.

Visual Basic Programming

August 30, 2006 CAT physics meeting Calibration of b-tagging at Tevatron 1. A Secondary Vertex Tagger 2. Primary and secondary vertex reconstruction 3.

Object Oriented Software Development

Sensitivity Prospects for Light Charged Higgs at 7 TeV J.L. Lane, P.S. Miyagawa, U.K. Yang (Manchester) M. Klemetti, C.T. Potter (McGill) P. Mal (Arizona)

Possibility of tan  measurement with in CMS Majid Hashemi CERN, CMS IPM,Tehran,Iran QCD and Hadronic Interactions, March 2005, La Thuile, Italy.

The Software Development Process

October 19, 2000ACAT 2000, Fermilab, Suman B. Beri Top Quark Mass Measurements Using Neural Networks Suman B. Beri, Rajwant Kaur Panjab University, India.

BUILDING THE REGRESSION MODEL Data preparation Variable reduction Model Selection Model validation Procedures for variable reduction 1 Building the Regression.

P4Helpers P4Helpers.h provides static helper functions for kinematic calculation on objects deriving from I4Momentum. Anthony Denton-Farnworth, The University.

Measurements of Top Quark Properties at Run II of the Tevatron Erich W.Varnes University of Arizona for the CDF and DØ Collaborations International Workshop.

Liu Minghui Nanjing MC study of W polarization and ttbar spin correlation Liu Minghui, Zhu Chengguang April 27, 2008.

Why A Software Review? Now have experience of real data and first major analysis results –What have we learned? –How should that change what we do next.

OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.

1 The Software Development Process ► Systems analysis ► Systems design ► Implementation ► Testing ► Documentation ► Evaluation ► Maintenance.

Search for High-Mass Resonances in e + e - Jia Liu Madelyne Greene, Lana Muniz, Jane Nachtman Goal for the summer Searching for new particle Z’ --- a massive.

Fully Hadronic Top Anti-Top Decay (Using TopView) Fully Hadronic Top Anti-Top Decay (Using TopView) Ido Mussche NIPHAD meeting, Februari 9 th :

Learning Curves Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill.

1 Measurement of the Mass of the Top Quark in Dilepton Channels at DØ Jeff Temple University of Arizona for the DØ collaboration DPF 2006.

4/12/05 -Xiaojian Zhang, 1 UIUC paper review Introduction to Bc Event selection The blind analysis The final result The systematic error.

06/2006I.Larin PrimEx Collaboration meeting  0 analysis.

Stano Tokar, slide 1 Top into Dileptons Stano Tokar Comenius University, Bratislava With a kind permissison of the CDF top group Dec 2004 RTN Workshop.

Program Design. Simple Program Design, Fourth Edition Chapter 1 2 Objectives In this chapter you will be able to: Describe the steps in the program development.

Extreme Software Engineering A Hands-On Approach From Extreme Software Engineering: A Hands-On Approach Daniel H. Steinberg Daniel W. Palmer.

SEARCH FOR DIRECT PRODUCTION OF SUPERSYMMETRIC PAIRS OF TOP QUARKS AT √ S = 8 TEV, WITH ONE LEPTON IN THE FINAL STATE. Juan Pablo Gómez Cardona PhD Candidate.

Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator

23 Jan 2012 Background shape estimates using sidebands Paul Dauncey G. Davies, D. Futyan, J. Hays, M. Jarvis, M. Kenzie, C. Seez, J. Virdee, N. Wardle.

TNSmooth: Root Multi-dimensional PDFs

Higgs → t+t- in Vector Boson Fusion

Constructive Cost Model

Introduction Software maintenance:

Presentation transcript:

Analysis Tools interface - configuration Wouter Verkerke Wouter Verkerke, NIKHEF 1

Typical workflow of a physics analysis 1) ‘Data reduction’ 2) 'Final processing’ : Data processing loop [ data, various MC samples ] Apply all appropriate final correction factors to objects according to latest prescriptions Calculate derived physics quantities needed for final selection criteria (this range from simple (variants of transverse mass) to very complex (kinematic likelihood fits of objects to e.g. a ttbar decay topology) Apply final analysis cuts Optional calculation of further derived physics quantities that are only valid for object that survive final selection (e.g. concepts that require a minimum number of final state objects that were not necessarily there in every event before the final selection Dumping of histograms and counters 3) 'Statistical analysis’ Aggregate estimated signal and background yields from histogram/counting output of simulation samples in to (template) likelihood models that describes the predicted distributions for a given observable (also need to import all kinds of meta-data here to complete model: cross-section, k-factors, luminosities etc...) Fit / limit setting procedure based Likelihood constructed from (template) likelihood model and observed data With systematic uncertainties – procedure is more elaborate - For each systematic uncertainty do - Data processing loop [ various MC samples ] with one systematic param 'up' - Data processing loop [ various MC samples ] with one systematic param ‘down' Wouter Verkerke, NIKHEF 2

Who writes what tool in analysis workflow Apply all appropriate final correction factors to objects according to latest prescriptions –Tools always originate from CP groups, code will be in analysis [base] release Calculate derived physics quantities needed for final selection criteria(…) –Tools originate either from physics group (e.g. standard calculation of mT2), code will be analysis release [specific to a physics group ] Apply final analysis cuts –3) Tool origin will vary. It can either be - Specific code by analysis author - Generic cut-flow tool [ in base release ] configured by author Optional calculation of further derived physics quantities that are only valid for object that survive final selection –Tools originate either from physics group (e.g. top kinematic fitter) [ code in analysis release] or be specific code to this analysis Dumping of histograms and counters –Tool origin will vary. It can either be - Specific code by analysis author - Generic cut-flow tool [ in base release ] configured by author Wouter Verkerke, NIKHEF 3

Use cases ‘classes of tools’ A) Performance group tools for physics use –Examples: Application of JES/electron/muon calibration, b-tagging scale factors, jet/lepton resolution smearing and efficiency correction tools –Author characteristics: which we can presume that the authors have an 'expert-level' understanding of code, which will deliver code to analyzers through analysis releases. –Release cycle: The release cycle of code is O(weeks), but smaller updates related to updated calibration constants can be expected to be delivered on a shorter time scale. –User configurability: The bulk of the configurable options will be preconfigured by experts and shipped with analysis release, but there is an important subset of parameters with frequent user interaction: those that relate to systematic uncertainty variations: these will need to be varied frequently (almost on a per-job basis) B) Standardized physics analysis tools. –Examples: configuration-driven cut flow implementations, histogram dumpers, –Author characteristics: Expert-level. –Release cycle: Long cycles - once functionality is established –User configurability: Depends on tool functionality, but generally extensive Wouter Verkerke, NIKHEF 4

Use cases ‘classes of tools’ C) Physics group common physics code. –Examples: calculation of mT2, kinematic fit of final selection objects to ttbar topology, identification of specific physics objects in MC decay trees (e.g. truth Higgs boson) –Author characteristics: Experience will vary a bit more here, but I think we can generally assume that the same level of expertise and effort is put in code as is done by performance group –Release cycle: Very long cycles, after initial development. Typical tasks such as mT2 calculation or kinematic fits tend to undergo very few changes over time –User configurability: Typically input collections of physics objects must be configured by analysis users for calculations that take 'final selection' sets of objects as input, for which no default naming convention may exist D) Analysis specific code –Examples: all specific code needed to perform a particular physics analysis –Author characteristics: Highly variable, but a substantial contingent of 'novice' users with limited programming skills. Authors are invariably busy and time-constrained, so a fair degree of 'whatever works quickly' code –Release cycle: Very short. Barring systematic variation runs, code tends to be developed in between every iteration –User configurability: Depends on tool functionality and authors programming style [ you'll find anything from people working with extensive configuration tools to 'everything hardcoded' (which also works as the tool is single-purpose) Wouter Verkerke, NIKHEF 5

Some points for discussion (my personal view) Configuration language –‘Interactive’ configuration by user should be possible in both C++ and python so that analyzer can work only in his language of choice (important for ‘novice programmers’) –Options that are usually not touched as part of configuring job (which are essential ‘prescription’ configurations) can always be python (for simplicity) as the’re usually not touched by user. Configuration ‘documentation’ –The full set of available options should be trivially accessible by the user (e.g. through a ‘tool->dumpProperties() command). This should list all available options, not just those (re)configured. Configuration of multiple tool instances –This is a necessary use case, need some scheme to deal with this. –How strongly intertwined is this with sequencing? Wouter Verkerke, NIKHEF 6