OPTIMAL JET FINDER Ernest Jankowski University of Alberta in collaboration with D. Grigoriev F. Tkachov.

Slides:



Advertisements
Similar presentations
Maria Jose Costa, CERN DIS 2004 April 14 th -18 th 2004, Slovakia b -mass effects in 3 and 4 jets events with the DELPHI detector at LEP b u,d,s.
Advertisements

FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Charged Particle Jet measurements with the ALICE Experiment in pp collisions at the LHC Sidharth Kumar Prasad Wayne State University, USA for the ALICE.
Jet and Jet Shapes in CMS
Visual Recognition Tutorial
Michel Lefebvre, 2007/03/02Optimal Jet Finder1 Optimal Jet Finder work in progress Physics and Astronomy University of Victoria British Columbia, Canada.
Dijet Transverse Thrust cross sections at DØ Veronica Sorin University of Buenos Aires.
2007/03/20Optimal Jet Finder1 Physics and Astronomy University of Victoria British Columbia, Canada Damir Lelas Rolf Seuster Michel Lefebvre, Rob McPherson,
© University of Minnesota Data Mining for the Discovery of Ocean Climate Indices 1 CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance.
DØ Run II jet algorithms E. Busato (LPNHE, Paris) TeV4LHC Workshop 12/1/2004 Outline:  Introduction  The Ideal Jet Algorithm  DØ Run II Cone Jet Algorithm.
Jet Reconstruction Algorithms Eric Vazquez PHENIX- Journal Club.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
1 Andrea Bangert, ATLAS SCT Meeting, Monte Carlo Studies Of Top Quark Pair Production Andrea Bangert, Max Planck Institute of Physics, CSC T6.
Peter Loch University of Arizona Tucson, Arizona USA
Jet finding Algorithms at Tevatron B.Andrieu (LPNHE, Paris) On behalf of the collaboration Outline: Introduction The Ideal Jet Algorithm Cone Jet Algorithms:
Introduction to Hadronic Final State Reconstruction in Collider Experiments Introduction to Hadronic Final State Reconstruction in Collider Experiments.
A Comparison of Three-jet Events in p Collisions to Predictions from a NLO QCD Calculation Sally Seidel QCD’04 July 2004.
Introduction to Hadronic Final State Reconstruction in Collider Experiments Introduction to Hadronic Final State Reconstruction in Collider Experiments.
Maximum likelihood (ML)
Radial Basis Function Networks
Measurement of Inclusive Jet cross section Miroslav Kop á l University of Oklahoma on behalf of the D Ø collaboration DIS 2004, Štrbské pleso, Slovakia.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #14.
 s determination at LEP Thorsten Wengler University of Manchester DIS’06, Tsukuba, Japan.
Glauber shadowing at particle production in nucleus-nucleus collisions within the framework of pQCD. Alexey Svyatkovskiy scientific advisor: M.A.Braun.
Irakli Chakaberia Final Examination April 28, 2014.
Jet Studies at CMS and ATLAS 1 Konstantinos Kousouris Fermilab Moriond QCD and High Energy Interactions Wednesday, 18 March 2009 (on behalf of the CMS.
Top-partner mass reconstruction by using jets Michihisa Takeuchi ( KEK,YITP ) In collaboration with Mihoko M. Nojiri (KEK), Naotoshi Okamura (KEK)
S.D. Ellis: Les Houches Jet Algorithms and Jet Reconstruction: Lessons from the Tevatron (A Continuing Saga) (Thanks especially to Joey Huston &
Studies of the jet fragmentation in p+p collisions in STAR Elena Bruna Yale University STAR Collaboration meeting, June
Stephen D. Ellis University of Washington Maria Laach September 2008 Lecture 3 : Calculating with QCD at Colliders - Jets in the Final State Quantum Chromodynamics,
STAR Sti, main features V. Perevoztchikov Brookhaven National Laboratory,USA.
Particle Physics Chris Parkes Experimental QCD Kinematics Deep Inelastic Scattering Structure Functions Observation of Partons Scaling Violations Jets.
Optimization of parameters for jet finding algorithm for p+p collisions at E cm =200 GeV T. G. Dedovich & M.V. Tokarev JINR, Dubna  Motivations.
The D  Run II Cone jet algorithm: problems and suggested changes B.Andrieu (LPNHE, Paris) Outline:  Introduction  Algorithm description  Present and.
STAR Kalman Track Fit V. Perevoztchikov Brookhaven National Laboratory,USA.
1 P. Loch U of Arizona July 22, 2011 Experimental aspects of jet reconstruction and jet physics at the LHC (Part II) What Are Jets? The experimentalist’s.
Jet Physics at CDF Sally Seidel University of New Mexico APS’99 24 March 1999.
A Comparison Between Different Jet Algorithms for top mass Reconstruction Chris Tevlin University of Manchester (Supervisor - Mike Seymour) Atlas UK top.
DIS Conference, Madison WI, 28 th April 2005Jeff Standage, York University Theoretical Motivations DIS Cross Sections and pQCD The Breit Frame Physics.
From Quark to Jet: A Beautiful Journey Lecture 2 1 iCSC2014, Tyler Dorland, DESY From Quark to Jet: A Beautiful Journey Lecture 2 Jet Clustering, Classification,
Jets and α S in DIS Maxime GOUZEVITCH Laboratoire Leprince-Ringuet Ecole Polytechnique – CNRS/IN2P3, France On behalf of the collaboration On behalf of.
2005 Unbinned Point Source Analysis Update Jim Braun IceCube Fall 2006 Collaboration Meeting.
Measurement of b-quark mass effects for multi-jet events at LEP Juan A. Fuster Verdú EPS-2003 Aachen, July 2003.
October 2011 David Toback, Texas A&M University Research Topics Seminar1 David Toback Texas A&M University For the CDF Collaboration CIPANP, June 2012.
A. Bertolin on behalf of the H1 and ZEUS collaborations Charm (and beauty) production in DIS at HERA (Sezione di Padova) Outline: HERA, H1 and ZEUS heavy.
Jet finding Algorithms at Tevatron B.Andrieu (LPNHE, Paris) On behalf of the collaboration Outline: Introduction The Ideal Jet Algorithm Cone Jet Algorithms:
Monday, Jan. 31, 2005PHYS 3446, Spring 2005 Jae Yu 1 PHYS 3446 – Lecture #4 Monday, Jan. 31, 2005 Dr. Jae Yu 1.Lab Frame and Center of Mass Frame 2.Relativistic.
Giansalvo EXIN Cirrincione unit #4 Single-layer networks They directly compute linear discriminant functions using the TS without need of determining.
Mean Charged Multiplicity in DIS, Michele Rosin U. WisconsinZEUS Collaboration Meeting, Oct. 21st Analysis Update: Mean Charged Multiplicity in.
Color Reconnection in W Pair Events Guillaume Leibenguth Université Catholique de Louvain Belgium DIS 2003, St. Petersburg On behalf of the LEP collaborations.
View From a(n Old!) Theorist Stephen D. Ellis University of Washington MC4LHC CERN July 2006 A lot of Philosophy From Long Experience – See the Next talk.
F Don Lincoln f La Thuile 2002 Don Lincoln Fermilab Tevatron Run I QCD Results Don Lincoln f.
IFIC. 1/15 Why are we interested in the top quark? ● Heaviest known quark (plays an important role in EWSB in many models) ● Important for quantum effects.
High p T hadron production and its quantitative constraint to model parameters Takao Sakaguchi Brookhaven National Laboratory For the PHENIX Collaboration.
Moriond 2001Jets at the TeVatron1 QCD: Approaching True Precision or, Latest Jet Results from the TeVatron Experimental Details SubJets and Event Quantities.
Performance of jets algorithms in ATLAS
T2 Jet Reconstruction Studies
Decomposing p+p Events at √s = 200 GeV with STAR
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Peter Loch University of Arizona Tucson, Arizona USA
kT Asymmetry in Longitudinally Polarized pp Collisions
Optimal Jet Finder (OJF) in ATLAS
Text Categorization Berlin Chen 2003 Reference:
Jet fragmentation results at CDF
Inclusive Jet Production at the Tevatron
Unfolding with system identification
Peter Loch University of Arizona Tucson, Arizona USA
Measurement of b-jet Shapes at CDF
Presentation transcript:

OPTIMAL JET FINDER Ernest Jankowski University of Alberta in collaboration with D. Grigoriev F. Tkachov

Acknowledgements Alberta Ingenuity Fund J Gordin Kaplan Graduate Student Award

Plan of the talk introduction Optimal Jet Definition and its implementation comparison with cone and k T algorithms benchmark physics application test based on motivations behind OJD

Introduction final states of particle collisions in HEP experiments often consist of sprays of hadrons simple interpretation: in the collision quarks with high kinetic energy are produced or released from the colliding particles but quarks interact very strongly at large distances and never appear as free particles (confinement) any attempt to separate free quarks results in production of extra pairs of quarks and antiquarks which recombine into colorless states: hadrons if the collision energy is high enough the hadrons appear in sprays, called jets

Hadronic jets: an example

Jet algorithms roughly jets correspond to quarks and gluons produced in hard scattering process jet algorithm takes the final state hadrons and assigns them to jets a jet’s momentum is then computed from the momenta of the particles that belong to that jet (so called recombination scheme) the jet’s momentum corresponds approximately to the momentum of quarks and gluons in hard scattering process (partons in perturbative calculations) jet algorithms used: cone algorithm and successive recombination algorithms, such as Durham (k T )

Optimal Jet Definition I will present so called Optimal Jet Definition (OJD) proposed by Fyodor Tkachov a short introduction to the subject is Phys. Rev. Lett. 91, (2003) FORTRAN 77 implementation of OJD called Optimal Jet Finder (OJF) is described in hep-ph/ (Comp. Phys. Commun., in print)

Recombination matrix z aj HEP event: list of particles (partons hadrons calorimeter cells towers preclusters) result: list of jets recombination matrix the 4-momentum q j of the j-th jet expressed by 4-momenta p a of the particles

Recombination matrix z aj z aj describes the fraction of the a-th particle that belongs to the j-th jet (i.e. we split up particles between jets) conventional jet algorithms have z aj equal to 0 or 1, i.e. a particle either entirely belongs to some jet or does not belong to that jet at all fragmentation and hadronization is always effect of interaction of (at least) two hard partons evolving into two jets, so some hadrons that emerge in this process can belong partially to both jets but this is also very convenient algorithmically, because a jet configuration is described by a set of continuous numbers z aj

Recombination matrix z aj i.e. no more than 100% of each particle is assigned to jets the fraction of the energy of the a-th particle that does not go into any jet the 4-momentum q j of the j-th jet expressed by 4-momenta p a of the particles (a=1,2,...,n parts ) the fraction of the energy of the a-th particle can be positive only

Optimal Jet Definition any allowed value of the recombination matrix {z aj } describes some jet configuration the desired optimal jet configuration is the one that minimizes some function  ({z aj }) details of  are different for CM lepton-lepton collisions (spherical kinematics) and collisions involving hadrons (cylindrical kinematics), where boost invariance along the beam axis should be maintained

Optimal Jet Definition: spherical kinematics  aj is the angle between the a-th particle and j-th jet E a is the energy of the a-th particle R>0 is a parameter with a similar meaning as the cone radius width of the j-th jetenergy outside jets

Optimal Jet Finder: fixed number of jets desired optimal jet configuration corresponds to minimum of  ({z aj }) the program finds a minimum iteratively using a simple gradient-based method facilitated by analytical formulas for gradient start with some candidate minimum (the initial jet configuration) and descend into a local minimum in subsequent iterations the initial jet configuration may be completely random

Minimization: 2 jets a point inside the simplex a point on the boundary of the simplex

iteration = 0

iteration = 1

iteration = 2

iteration = 3

iteration = 4

iteration = 5

iteration = 6

Local minima of  each time the programs starts with a random configuration, it finds a local minimum which does not need to be the global minimum of  ({z aj }) this is similar to how the result of cone algorithm depends on the initial position for the cone iterations but here in contrast with the cone algorithm, we know which local minimum we should choose: the one that gives the smallest value of 

Optimal Jet Finder: number of tries in order to find the global minimum or increase the probability of finding it, the program tries a couple different random initial configurations OJF parameter: number of tries n tries it was sufficient to take n tries  10 (even n tries  3) in the cases we studied compromise between quality of jets found and computing time used

OJF: number of jets to be determined 1.assume some small positive parameter  cut, which is analogous to the jet resolution parameter y cut in binary recombination algorithms 2.start with (for example) n jets =1 3.find the best configuration with the number of jets equal to n jets as described previously (starting from n tries different initial configurations and choosing the best configuration) 4.check if 5.if so, this is the final jet configuration and the final number of jets is n jets 6.if not, increase n jets by 1 and go to point 3

Cone algorithm cone algorithm defines a jet as all particles within a cone of certain radius in  space  j,  j are coordinates of the geometrical center of the cone;  a,  a are coordinates of the particles  j,  j are found from the requirement that E  -weighted centroid computed from all particles in the cone coincides with the geometrical center of the cone

Cone algorithm  j,  j are found in an iterative procedure: start with some trial position  j,  j for a cone  for all particles within the cone compute E  -weighted centroid  ’ j,  ’ j (usually different from  j,  j ) use the centroid as a new position of the center of the cone the content of the cone will change accordingly, update it and go to  procedure ends when the cone center and E  _ weighted centroid for all particles in the cone coincide: a stable cone

Cone algorithm: seeds the iterative procedure described above needs some initial cone position look for stable cones starting everywhere (i.e. at every tower or cell) – very expensive computationally start only at energetive towers, so called seeds

Problems with seeds problems arise when we compare theory and experimental data when we apply the cone algorithm involving seeds to partons beyond the LO, soft radiation or collinear splitting of partons may change the jet configuration significantly whereas the experimental jet configuration remains unchanged

Soft radiation at NLO R < (angle between two partons) < 2R the event is reconstructed to 2 jets with a soft gluon as a seed the event is reconstructed into 1 jet

Collinear splitting at NNLO R < (angle between two partons) < 2R the event is reconstructed to 1 jet using the most energetic parton (red) as the seed the “red” parton splits into 2 collinear pieces; now the “blue” parton is the most energetic and the event is reconstructed into 2 jets

“Collinear splitting” in the experiment the particle in the center hits a single calorimeter cell; the cell has sufficient energy to be a seed the particle in the center hits two separate cells; none of the cells has enough energy to be a seed

Cone algorithm: overlapping cones after all stable cones are found, deal with overlapping cones if the fraction f of overlapping energy (with respect to the smaller energy jet) exceeds some threshold (e.g. f>50%) merge the two jets otherwise split two jets, assigning particles to the closest jet if there are more than two jets overlapping, the final result may depend on the ordering of this procedure (so some standard order has to be specified) it is difficult to take account of the merging \ splitting procedure in theoretical calculations

Cone algorithm: overlapping cones experiment theory

Solution to seed problems cone algorithm involving seeds becomes unstable at higher order perturbative calculations seedless algorithm: look for jets everywhere Improved Legacy Cone Algorithm (ILCA): midpoints p ab =p a +p b, p abc =p a +p b +p c,... are used as seeds still problem with overlapping cones successive recombination algorithms (binary algorithms), such as JADE, Durham (k T )

 is an infrared and collinear safe

Successive recombination algorithms for each pair of particles compute the “distance” d ab between the particles in the pair, for example choose the pair with the smallest distance and if min(d ab ) < y cut (y cut is the jet resolution parameter) combine the two particles into one particle using p new =p a +p b and go back to the beginning (now having the number of particles decreased by one) if min(d ab )  y cut then stop (we achieved the final jet configuration)

OJF and k T k T merges only 2 particles at a time (binary recombination 2  1) other recombination schemes also considered 3  2, m  n OJF takes into account the global structure of the energy flow in the event, i.e. jet configuration is found from the momenta of all particles in the event (corresponds to m  n) OJF finds more regular jets that k T (still a jet is not a cone)

OJF and k T OJF is much faster that k T if a large number of particles has to be analyzed average time per event  n parts for OJF  n 3 parts for k T this does not matter when we apply the algorithm to theoretical calculations, but in experiments we have to analyze large number of calorimeter cells D0  cells, ATLAS  cells

OJF and k T the cubic dependence determines the way k T has to be used in experiments it cannot be applied directly at the level of cells a preclustering step is needed to reduce the input data to  200 preclusters (D0 and CDF practice) how does the preclustering affect measurements? it is difficult to account for the preclustering in theoretical calculations the preclustering is a completely independent procedure from the k T algorithm itself

Benchmark test: W-boson mass extraction benchmark test based on the W-boson mass extraction from the process modeled on the OPAL analysis (CERN-EP ) we compared OJF with JADE and Durham (k T ) algorithm (the best algorithm used by the OPAL collaboration) we obtained the same accuracy as Durham (still we did not explore all possibilities) we studied the speed of OJF and we found that it is much faster then Durham when a large number of calorimeter cells need to be analyzed

Quality of jets ALGORITHMstatistical error of W-boson mass (corresponding to 1000 experimental events) based on Fisher’s information [MeV] (±3) Durham (k T )105 JADE118 OJF106

time [seconds] number of particles \ cells average time per event kTkT OJF n tries =10 OJF n tries =5

time [seconds] number of particles \ cells average time per event kTkT OJF n tries =10, n tries =5

Method of moments parameter estimation, a fundamental problem of mathematical statistic for an event P theory gives the probability density  M (P) that depends on some parameter M, i.e. M W or  s given an experimental sample of events, we want to estimate the best value of the parameter M with possibly small statistical error method of maximal likelihood reinterpreted as method of moments

Method of moments depends on M computed from the experimental sample

Method of moments informativeness of the observable f, based on the statistical error that f gives Fisher’s information Rao-Cramer inequality

Event representation: basic shape observables event: particles event as energy flow collinearly invariant basic shape observables f function of a direction only

Factorial estimate event  values of all basic shape observables f(P) the optimal observable for the measurement of some parameter can be expressed as a combination of basic shape observables applying a jet algorithm we reduce the available information about the event P to make the analysis computationally manageable we use values of all basic shape observables f(Q), taken on the jet configuration

Hadronic event approximated by jets good approximations for f(P) could exist among functions that depend only on Q, which is a parameterization of P in terms of a few jets, found from the condition modeled after (q is partonic structure of the event) this expresses the fact that most of the information about the event is inherited from its gluon-and-quark structure

Factorial estimate for any f; C f,R depends only on f, it does not depend on the event or jet configuration configuration  depends only on the event P and the jet configuration Q we choose the jet configuration so that the loss of information about the event is minimal generically for all f (we minimize  ) this is essentially the Optimal Jet Definition

Summary I presented the Optimal Jet Finder based on the global energy flow in the event infra-red and collinear safe: no seed-related problems no overlapping jets returns additional numerical characteristics of the jet configuration found (so called dynamical width and soft energy) which may be helpful in construction of (quasi-) optimal observables in statistical problems much faster than k T for a large number of input cells

Optimal Jet Definition: cylindrical kinematics