ATLAS-CONF-2013-072 20/07/2013
ATLAS NOTE
ATLAS-CONF-2013-072
July 18, 2013
Minor revision: July 20, 2013
Di ff erential cross sections of the Higgs boson measured in the diphoton decay channel with the ATLAS detector
using 8 TeV proton-proton collision data
The ATLAS Collaboration
Abstract
This note presents differential cross section measurements of the Higgs boson, per- formed in the diphoton decay channel. The dataset used corresponds to 20.3 fb-1of proton- proton collisions at √
s=8 TeV, produced by the LHC and collected by the ATLAS detector in 2012. With its high signal selection efficiency the diphoton decay channel is well suited to probe the underlying kinematic properties of the signal production and decay. Measure- ments for several diphoton and jet distributions are made for isolated photons within the geometric acceptance of the detector and they are corrected for experimental acceptance and resolution. Results are compared to theoretical predictions at the particle level.
Revised legend on Figure 9 in the Appendix with respect to the version on July 18, 2013
c
Copyright 2013 CERN for the benefit of the ATLAS Collaboration.
Reproduction of this article or parts of it is allowed as specified in the CC-BY-3.0 license.
1 Introduction
In 2012, the ATLAS and CMS collaborations observed a new particle [1, 2] in the search for the Stan- dard Model (SM) Higgs boson [3–5]. With additional data and an improved analysis strategy, ATLAS measured a combined total production rate of 1.33
±0.21 times the SM Higgs boson expectation [6], and excluded a specific spin-2 hypothesis with a confidence level above 99.9% [7]. Spin and relative rates in the different production modes conform to the SM expectation, and in the diphoton channel ATLAS observed a signal with a local significance of 7.4
σ, and a total rate of 1.6±0.3 times the SM expectation [6].
Direct di
fferential cross section measurements of the Higgs boson further elucidate its production and decay properties, and thus complement earlier analyses of its spin and couplings [6–8]. This note presents the measurement of eight Higgs boson observables in the diphoton decay channel. It is based on the full 2012 dataset recorded by the ATLAS detector, corresponding to 20.3 fb
-1of proton-proton collisions produced by the LHC at
√s
=8 TeV. The cross sections are measured within a fiducial region defined by two isolated photons within the geometrical acceptance of the detector. For each di
fferential measurement, the dataset is divided in bins of the observable, and the resonant signal yield is separated from the dominant, continuum diphoton and photon-jet backgrounds using signal plus background fits to the diphoton invariant mass spectrum for each of these bins. These yields are corrected for detector acceptance as well as resolution in the measured observable to determine the fiducial di
fferential cross sections.
The eight observables studied are: the transverse momentum p
γγTand rapidity
|yγγ|of the Higgs boson, the helicity angle
|cos
θ∗|in the Collins-Soper frame [9], the jet multiplicity N
jetsin H
→ γγevents and from that the jet veto fractions
σNjets=i/σNjets≥ifor jet multiplicities i, the transverse momentum distribution of the leading jet p
Tj1, the azimuthal angle between the leading and the subleading jet
∆φj j, and the transverse component of the vector sum of the momenta of the Higgs boson and dijet system
p
γγj jT.
Directly measuring the transverse momentum, rapidity, and mass of the diphoton system determines the kinematic properties of the Higgs boson. Inclusive Higgs boson production is dominated by gluon- gluon fusion (ggH), for which the transverse momentum of the Higgs boson is largely balanced by the emission of soft gluons. Measuring p
γγTthus directly probes the understanding of perturbative QCD calculations for this process. The rapidity distribution of the Higgs boson is sensitive to QCD radiative corrections and the parton distribution functions of the colliding protons. The helicity angle is defined as the angle between the beam axis and the photons in the Collins-Soper frame of the Higgs boson, and has been used to study the spin of the Higgs boson. The jet multiplicity is sensitive to the relative rates of the Higgs boson production mechanisms. Inclusive and 0-jet events are dominated by
ggHprocesses, with vector boson fusion (VBF) and associated production with a weak vector boson (W H/ZH) becoming more important in 1- and 2-jet events. The small rate of Higgs bosons produced in association with a t¯ t pair (t¯ tH) becomes relevant for N
jets >2. With the measured jet multiplicities, the jet veto fractions can be calculated, defined as the ratio of exclusive to inclusive cross sections
σNjets=i/σNjets≥i. The exclusive and inclusive jet cross sections are defined as the cross section of Higgs boson production with exactly i jets and i or more jets, respectively. These ratios are sensitive to the strong coupling
αs, the theoret- ical description of quark and gluon radiation, and the relative rates of the production modes. In
ggH,the transverse momentum of the leading jet corresponds to the hardest QCD radiation in Higgs boson production. The spectrum can be compared to higher order predictions.
The remaining observables are defined only for the subset of events which have at least two jets.
The azimuthal angle between the leading and sub-leading jets measures angular correlations between the
radiated objects. In the
ggHand VBF production modes, it is sensitive to the spin and CP eigenvalue
of the Higgs boson [10]. The transverse momentum of the Higgs boson and dijet system is often used
to discriminate VBF against the much more abundant
ggHproduction. In VBF production, the com- bined diphoton and dijet system tends to be well-balanced in p
T, since extra radiation is considerably suppressed. This causes a smaller p
γγj jTthan in the
ggHprocess, where such a suppression is absent. In VBF searches, the discriminating power of p
γγj jTis limited by the large theoretical uncertainties on the
ggH+2 jets contamination. In this early measurement, no additional cuts were applied to suppress the
ggHproduction mode.
The note is organized as follows. The ATLAS detector is briefly described in Section 2. Simulated data samples are presented in Section 3, and event and object selection are defined in Section 4. Section 5 describes parameterizations of the distributions of the diphoton invariant mass for signal and background events. Uncertainties on the signal yields, and on migrations between bins of the observables, are item- ized in Section 6. These are followed by a description of the extraction of the binned signal yields, and of their unfolding, in Sections 7 and 8. The differential cross sections are presented and compared to theoretical predictions in Section 9. Conclusions are drawn in Section 10.
2 The ATLAS detector
The ATLAS detector [11] is a multipurpose particle physics experiment with forward-backward sym- metric cylindrical geometry. The inner tracking detector (ID) consists of a silicon pixel detector and a silicon microstrip detector, both covering the pseudorapidity
1region
|η|<2.5; and a straw-tube transition radiation tracker, covering the pseudorapidity region
|η|<2.0.
The ID is surrounded by a thin superconducting solenoid which provides a 2 T magnetic field, and by the high granularity liquid-argon (LAr) sampling electromagnetic calorimeter. The electromagnetic calorimeter is divided into a central barrel (|η|
<1.475) and endcap regions on either end of the detector (1.375
<|η|<3.2). In the region matched to the ID (|η|
<2.5), it is radially segmented into three layers.
The first layer has a fine segmentation in
η, in the region |η| <1.4 and 1.5
≤ |η| <2.4, to facilitate the separation of es and
γs fromπ0s and to improve the resolution of the shower position and direction measurements. The second layer collects most of the total reconstructed energy, and the third layer is used to correct for leakage beyond the calorimeter. In the region
|η|<1.8, the electromagnetic calorime- ter is preceded by a presampler detector to correct for upstream energy losses. An steel-scintillator tile calorimeter gives hadronic coverage in the central rapidity range (|η|
<1.7), while two copper
/LAr hadronic calorimeters provide coverage over 1.5
< |η| <3.2. The forward regions (3.2
< |η| <4.9) are instrumented with LAr calorimeters for both electromagnetic and hadronic measurements. The muon spectrometer that surrounds the calorimeters is not used in the present analysis. The combination of all these systems provides charged particle measurements together with e
fficient and precise photon mea- surements in the pseudorapidity range
|η| <2.5. Jets and missing transverse energy are reconstructed using energy deposits over the full coverage of the calorimeters,
|η|<4.9.
3 Monte Carlo simulations
The acceptance inside the fiducial volume of the detector, and the detector response, are simulated using Monte Carlo (MC) techniques. Simulated processes play three distinct roles in this analysis. (i) They provide shape templates for describing the signal in the diphoton mass spectrum (Section 5.1). (ii) They are used in evaluating the appropriateness of analytic shapes for the background (Section 5.2), which
1ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the detector, and the z-axis along the beam line. The x-axis points from the IP to the centre of the LHC ring, and the y-axis points upwards. Cylindrical coordinates (r, φ) are used in the transverse plane,φbeing the azimuthal angle around the beam line.
Observables labelled transverse are projected into thexyplane. The pseudorapidity is defined in terms of the polar angleθas η=−ln[tan(θ/2)].
2
are critical for the extraction of the signal yield (Section 7). (iii) The relationship between the particle level signal decays and the simulated detector response provides information necessary to unfold the measured yields to obtain differential cross sections, as discussed in Section 8. The samples used are described below.
3.1 Signal samples
The Higgs boson production cross sections are computed at next-to-next-to-leading order (NNLO) in QCD for the
ggHprocess [12–17]. POWHEG [18–20] is tuned to match calculations with finite mass effects and soft-gluon resummations up to next-to-next-to-leading logarithmic order (NNLL) [21–23].
Next-to-leading order (NLO) electroweak (EW) corrections from Refs. [24,25] are applied. These results are compiled in Refs. [26–28] assuming factorization between QCD and EW corrections. The cross section for the VBF process has been calculated with full NLO QCD and EW corrections [29–31], and approximate NNLO QCD corrections are applied [32]. The W H and ZH processes have been calculated at NLO [33] and at NNLO [34], and NLO EW radiative corrections from Ref. [35] are applied. The full NLO QCD corrections for t¯ tH are used [36–39]. The Higgs boson cross sections, branching ratios [40–42] and their uncertainties are compiled in Refs [23, 43].
The Higgs boson production and decay are simulated separately for 11 values of the Higgs boson mass m
Hin 5 GeV steps from 100 to 150 GeV, in order to create a parameterization of the reconstructed signal line shape as a function of m
H(Section 5.1). The five dominant production modes are simu- lated for each mass value. The normalization and factorization scales are set to the Higgs mass. The common ATLAS simulation tunes and parton distribution function (PDF) sets are used [44, 45]. Parton level
ggHand VBF samples are generated using POWHEG with the CT10 PDF tune and interfaced to P
ythia8 [46] to simulate the decay of the Higgs boson, showering and hadronization. Higgs bosons produced in association with a W or Z boson, or t¯ t, are generated with Pythia8 with the CTEQ6L1 PDF tune. The interaction of particles with the detector is simulated [47] using GEANT4 [48]. Effects from multiple interactions in a bunch crossing (pileup) are simulated by overlaying each signal event with a variable number of simulated inelastic proton-proton collisions, such that the distribution of this number of interactions agrees data.
The Higgs boson to diphoton decays simulated with P
ythia8 contain H
→γγ∗decays with
γ∗→f f ¯ . These
γγ∗events pass the full selection at a very low rate, and can enter the signal region. Due to an artificial upper limit on the
γ∗mass in P
ythia8, the simulated
BR(H→γγ∗) is lower than the branching fraction calculated in Ref. [49]. The rate of H
→ γγ∗decays is therefore reweighted to match this prediction. An uncertainty of 100% is applied to this reweighting, due to the limited knowledge of this branching fraction. The efficiency for simulated H
→ γγ∗events to pass the full event selection is 18 times lower than for H
→γγ, resulting in an small overall uncertainty ranging from 0.5% to 0.7%.Resonant H
→γγproduction interferes destructively with the
gg→γγcontinuum. This reduces the overall size of the expected contribution from
ggH, and the measured yield of Higgs bosons, by 2.2%for m
H =126.5 GeV [50]. The interference varies with
|cos
θ∗|, and the effect reaches its maximum of 11% for
|cos
θ∗|near 1 (see Table 6, in the Appendix). The calculation was made in the low p
γγTlimit, so it is most valid in that realm. Interference effects in the presence of additional radiation, i.e.
extra jets, are not considered. The simulated signal decays are weighted to account for this e
ffect for the yield extraction and unfolding. The impact of this interference is thus not removed in the measurement.
In principle, theoretical predictions should include this small correction when comparing to the cross sections presented.
Simulated signal events are reweighted to match the longitudinal spread of the beam spot observed
in data. The reconstructed photon energies are modified to correct the photon energy resolution, based
on the comparison of Z
→ee events in data and simulation. To improve the description of the shapes of
the photon showers in simulation, the variables that describe these shapes are corrected to improve the agreement of the simulated and observed distributions [51].
3.1.1 Higher-orderggHpredictions, for comparisons
For the later comparison with the unfolded cross sections, two higher-order predictions for
ggHwere produced. The multi-scale improved NLO, MINLO (rev. 2290) for H
+1 jet [52] with the CT10 PDF set [44,45], and the NNLO+NNLL prediction of HRes (rev 1.0) [53], which uses the MSTW 2008 NNLO PDFs [54] and uses the infinite top quark mass approximation. Because HR
esis a parton level prediction with an inclusive treatment of the QCD radiative correction, it is not possible to apply any isolation requirement on this sample (Section 4.2). MINLO was interfaced with Pythia8 for the simulation of underlying event, showering, and hadronization. Both samples were generated with m
H =126.8 GeV, the central value of the measured mass in the diphoton channel [6].
3.2 Background samples
Generated events are used to validate the analytical parameterization of the background shape in the diphoton invariant mass, and to estimate the systematic uncertainties from the parameterization itself.
The backgrounds from prompt diphoton and photon-jet processes (about 77% and 20% of the total back- ground) are simulated using S
herpa[55], with up to three quark or gluon emissions from the matrix element using a leading order multijet merging algorithm. The jet-jet background, about 3% of the total background, is simulated with P
ythia8. The Drell-Yan component, about 0.8% of the total background, does not significantly a
ffect the shape of the background and is not considered. Due to the large number of simulated background events needed to validate the background shape in the diphoton invariant mass spectrum with negligible statistical uncertainty (Section 5), simulating the full detector acceptance and interaction is not feasible. The detector response to photons is parameterized as a smearing which de- pends on the transverse momentum and pseudorapidity. This procedure takes into account the varying resolutions and reconstruction efficiencies of different detector regions.
For each variable defined using jets, the particle level definition of the variable is smeared using a detector response matrix. This response matrix is derived from a sample of fully simulated diphoton plus jets events.
The method is validated by comparing the diphoton invariant mass shapes produced to that from fully simulated background events, and to the data events passing the event selection outside the signal region. A good agreement is observed within the statistical uncertainties.
4 Event selection
4.1 Event and photon selection in data
The event and object selection used in this analysis follows closely the approach of earlier ATLAS H
→γγanalyses, e.g. Ref. [56] where more details can be found. Events are recorded using a diphoton trigger with an e
fficiency above 99% with respect to the o
ffline selection, requiring two energy clusters that match criteria according to expectations for photon-induced electromagnetic showers. One of the clusters must have transverse energy, E
T, greater than 35 GeV, and the other one must be greater than 25 GeV. They must lie within the fiducial acceptance of
|η| <2.37, excluding the transition region between the barrel and endcap calorimeters, 1.37
≤ |η| <1.56. Basic data quality requirements ensure that all necessary components of the detector are operational.
The photon energies are calibrated based on a detailed simulation of the detector geometry and response. The calibration is refined by applying
η-dependent correction factors determinedin situ, from
4
studies of Z
→ee decays. These factors range from
±0.5% to±1.5% depending on the pseudorapidityof the photon [57].
A primary vertex is selected for the event using a neural network combining the ‘pointing’ infor- mation of the photons in the calorimeters and the tracks of converted photons, with the
Pp
Tand
Pp
2Tof tracks associated to a vertex, and the
∆φbetween the diphoton system and the sum of the tracks’
momenta. The photon pointing determines the vertex position along the beam axis by combining infor- mation from the longitudinal segmentation of the calorimeter with a constraint from the average beam spot position. The z-position of the selected vertex is then used to correct the
ηand hence E
Tof the two photons.
Each photon of the pair must further satisfy isolation requirements from the inner detector and calorimeter. The total energy in a cone around the photon of
∆R
= p(
∆η)2+(
∆φ)2 <0.4 must be less than 6 GeV. A core region around the photon shower is excluded from that total energy and there are further corrections for the energy leakage from this core region into the outer cone. Energy from the underlying event or from minimum bias interactions (pileup) occurring in the same or neighboring bunch crossings is removed on an event-by-event basis [58]. The scalar sum of the p
Tof tracks originating from the diphoton vertex with p
T>1 GeV and in a cone of
∆R
<0.2 must also be less than 2.6 GeV.
In order to simplify the shapes of the diphoton invariant mass distributions of backgrounds in the yield extractions, cuts are imposed not on the absolute E
Ts of the photons but on the ratios of each photon’s E
Tto their combined mass: E
T/mγγ>0.35 (0.25) for the higher-E
T(lower-E
T) photon of the pair. Finally, the mass of the selected pair must satisfy 105 GeV
<m
γγ <160 GeV. With 20.3 fb
-1of data in 2012, ATLAS collected 94135 events satisfying these requirements. Expected signal yields from MC are listed in Table 1.
4.2 Fiducial and particle level definitions
The particle level fiducial definition is chosen to mirror the event selection in data. This minimizes the extrapolation from detector level to particle level quantities. The selection criteria are as follows: the two highest-E
T, isolated final state photons, within
|η| <2.37 and with 105 GeV
<m
γγ <160 GeV are selected. Note that the transition region between the barrel and endcap calorimeters is not removed in this definition. After the pair is selected, the same cut on E
T/mγγis applied as in the event selection in data: E
T/mγγ>0.35 (0.25) for the two photons.
The isolation criterion is defined as follows: the sum of the p
Tof all stable particles
2excluding muons and neutrinos is required to be less than 14 GeV within
∆R
<0.4 of the photon. This requirement was found to correspond approximately to the calorimetric isolation cut of 6 GeV at reconstruction level.
The track isolation cut at detector level is much looser, and does not impact the derivation of this value.
This isolation requirement dramatically reduces the rate at which non-signal photons are selected. It also reduces the dependence of the measured cross sections on the model used to generate the unfolding corrections, by allowing for a more reliable relationship between photons produced in the fiducial region and those selected in the analysis (see Section 8).
4.3 Jet selection
Jets are selected both at particle and reconstruction level, using the anti-k
talgorithm [59] with a distance parameter of R
=0.4. At reconstruction level, the inputs are clusters of energy in the electromagnetic and hadronic calorimeters. At particle level all stable particles excluding muons and neutrinos serve as input.
The jet must have transverse momentum exceeding 30 GeV, and rapidity
|y| <4.4. Jets are calibrated using event-by-event corrections based on the energy density and simulated instrumental effects [58]. A
2A particle is considered stable if it has a lifetime of more than 10 ps.
Fiducial Fully Simulated and Selected Signal
Signal (MC) Yield
ggH[%] VBF [%] W H [%] ZH [%] t¯ tH [%]
Inclusive 612 407 87.9 7.3 2.8 1.6 0.5
≥
1 Jet 252 180 76.3 15.1 4.8 2.7 1.1
≥
2 Jets 85 64 59.4 25.6 7.6 4.4 2.9
Table 1: Expected selected yields for the Standard Model at
√s
=8 TeV with m
H =125 GeV. The relative fractions selected in the five production modes,
ggH, VBF,W H, ZH, and t¯ tH are detailed. The fiducial signal is calculated at generator level.
residual correction from in situ measurements is applied to refine the jet calibration. Jets within
∆R
<0.4 of either of the selected photons, or
∆R
<0.2 of any electron, are vetoed. Electrons are not otherwise used in this analysis. Their selection criteria are identical to Ref. [6].
To reduce the pileup dependence at reconstruction level, jets with
|η| <2.4 and p
T <50 GeV must be consistent with originating from the selected primary vertex. This is ensured by considering the ratio of the (scalar) p
Tsum of tracks within the jet coming from the primary vertex, over the (scalar) p
Tsum over all tracks associated to that jet. This ‘jet vertex fraction’ (JVF) is required to exceed 0.25. If no tracks can be found to perform the test, the jet is kept.
5 Signal and background modelling
5.1 Signal modelling
The SM Higgs boson signal produces a narrow peak in m
γγ, with a width completely dominated by the detector resolution. This shape is modelled analytically using a Crystal Ball function [60] plus a wide Gaussian component for modelling tails in the resolution function. Most parameters of this model are constant with respect to the hypothesized Higgs boson mass m
H, while the width increases with m
Hand is well-described as a linear function of it. A simultaneous fit to signal MC samples at di
fferent masses is performed to determine the constant parameters and the evolution of the width. This provides a parameterized signal shape for values of m
Hwhere MC samples are not available. This procedure is performed separately for each bin in each observable that is considered. The fitted width at the central value of the measured Higgs boson mass m
H =126.8 GeV is provided in the Appendix for each variable and bin.
5.2 Background modelling
The analytical function used to describe the background in the diphoton mass spectrum is selected in each bin individually using the procedure below, which is intended to minimize potential bias.
The background samples described in Section 3.2 provide a reference shape with minimal statistical fluctuations. Using these shapes, potential background models are tested by performing signal plus background fits on this background-only sample. This test is made in 1 GeV steps, shifting the signal model in a window of 119
−135 GeV, corresponding to a 16 GeV range centered around the Higgs boson mass. At each step, the measured (spurious) signal must be less than 20% of the statistical uncertainty on the background in the data or less than 10% of the yield of the expected SM signal. Two classes of models are evaluated: Bernstein polynomials and exponentials of polynomials of the form e
ax+···+zxN. The model with the lowest number of degrees of freedom that satisfies the requirements described above is selected. The largest bias found in the modeling of the MC background shape is taken as an uncertainty
6
on the observed signal yield for each observable. The resulting systematic is typically much smaller than the upper limits imposed here. The models and the assessed uncertainties are presented in the variables’
summary tables in the Appendix. The background shape in all bins is described using exponentials of polynomials of first and second order.
6 Experimental uncertainties
The measured di
fferential cross sections are subject to uncertainties on the overall yield and on migrations between bins of the observables. Further, the signal resolution affects the fitted shape, so resolution uncertainties have an important impact on the measured yield. Uncertainties on the energy scale and signal peak position do not appreciably impact the measured yields. The uncertainties described here are evaluated according to the methods used in Ref. [56].
6.1 Uncertainties on the yield
Uncertainties on the luminosity, trigger, and identification and isolation e
fficiencies impact the expected number of events. A bias from a poor choice of background function would change the extracted yield.
•
The uncertainty on the integrated luminosity is obtained from beam separation scans taken in April 2012. It is found to be 2.8%, following the method presented in Ref [61].
•
The uncertainty on the trigger efficiency is measured using a ‘boot-strapping’ method from mini- mum bias events as described in Ref. [62], and found to be 0.5%.
•
The single-photon identification efficiency and its uncertainties are evaluated in data. These un- certainties are propagated into the diphoton topologies, according to the method described in Ref. [56]. The event level uncertainty of 2.4% is found to be constant across bins of the di
fferential variables.
•
The e
fficiencies of the combined calorimetric and tracking isolation cuts are measured with Z
→ee MC and data, and the difference is assigned as an uncertainty. For the non-jet inclusive variables and p
Tj1this amounts to a 1% uncertainty. A 2% uncertainty applies to the two-jet variables. For the N
jetsspectrum, the uncertainty increases by bin: 1% for the first two bins, 2% for the 3rd, and 4% for the final bin.
•
The uncertainty on the signal yield from the background shape is described in Section 5.2 and presented in the Appendix.
6.2 Uncertainties on photon energy resolution
The calorimeter energy resolution is measured in data using electrons from Z decays. Its uncertainty, together with the uncertainty on the extrapolation from electrons to photons, accounts for a 22-31%
uncertainty on the resolution of the signal in the diphoton invariant mass, depending on the observable and bin. The resolution uncertainties are included in the fit and are transformed into yield uncertainties, as described in Section 7.2.
6.3 Migration uncertainties
Three sources of uncertainties are considered for the variables that are based on jets: jet energy resolution
(JER) and scale (JES), jet vertex fraction (JVF), and jets from pileup (that do not originate from the
primary vertex).
The jet energy scale and resolution uncertainties are estimated by applying shifts and smearings to the jet energy within their expected uncertainties. These shifts are derived in situ, exploiting the transverse momentum balance in
γ+jet,Z
+jet, dijet and multijet events. Discrepancies between data and MC for thejet energy scale for these measurements lead to a set of baseline uncertainties. The most important JES uncertainties for this analysis are from the
η-intercalibration that particularly impacts the calibration offorward jets, and due to the unknown composition and modelling of the associated calorimeter response of quark and gluon initiated jets.
Uncertainties from JVF modelling are quantified by varying the JVF cut up and down according to its uncertainty around the nominal cut value of 0.25. The JVF uncertainty is estimated by comparing simulation with data in Z
+jet events and is parametrized as a function of jet p
Tand
η.The uncertainty associated with the modelling of jets originating from pileup interactions is evaluated by randomly subtracting a fraction of the simulated pileup jets. The fraction of pileup jets removed is estimated by comparing the data to MC ratio of jets in pile-up enriched control regions of Z
+jets events.
The combined impact of these uncertainties on the signal yields ranges from 3-15% according to the observable and bin, and is tabulated in the Appendix.
Migration effects due to the photon energy scale, between bins in the cross section observables, were estimated to be negligible.
The contamination of the jet related observables, due to the occurrence of double parton interaction (DPI) in the context of simultaneous dijet and Higgs boson production, were found to be negligible.
The e
ffect was evaluated using the expected dijet cross section for the jet requirements presented in Section 4.3, with the measured e
ffective area parameter for hard DPI [63].
7 Signal extraction
7.1 Binning of the observables
The dataset is divided into bins of the observables and the signal yield in each bin is extracted using an extended unbinned likelihood fit to the diphoton invariant mass distribution. The binning was chosen to allow differential measurements, with statistics sufficient for a significant measurement in each bin, using the expected signal yields from simulation. The nominal target significances were 2
σ(1.5
σ) perbin for the inclusive (2-jet) variables. To obtain a reliable unfolding, the migrations between bins in the reconstructed distributions must be small. The binnings chosen based on these considerations are displayed in the variable summary tables in the Appendix.
7.2 Fit procedure and likelihood
An unbinned fit of the diphoton invariant mass is performed simultaneously in all bins, for each ob- servable. The Higgs boson mass m
Hand the nuisance parameters on the signal shape and position are common among all bins for each observable. The likelihood function maximized has the form
L(mγγ
;
νsig,νbkg,m
H)
=Yi
e
−νin
i!
ni
Y
j
hνsigi Si
(m
γγj; m
H)
+νbkgi Bi(m
γγj)
i
× Y
k
Gk
(1)
with
νsigiand
νbkgibeing the number of signal and background events estimated in data in the i
thbin of the observable,
νi = νsigi +νbkgithe mean value of the underlying Poisson distribution of the n
ievents, and m
γγjis the diphoton mass for event j. The signal probability density functions (PDF)
Sidescribed in Section 5.1 depend on the Higgs boson mass m
Hand on nuisance parameters from the energy resolution and scale.
8
The background PDFs
Biare chosen as described in Section 5.2. The term
Gkis a function of the k
thnuisance parameters and implements constraints from the photon energy resolution and scale into the fit. Uncertainties that do not affect the shape of the fit, for instance the background model uncertainty or trigger yield, are not included at this stage. Rather, they are applied during the unfolding procedure (Section 8).
For observables where a set of events is not included in the measured spectrum, the un-categorized events are placed into an additional bin, which is included in the fit. These include, for example, events with p
γγTlarger than the upper edge of the highest bin, or events with 0 or 1 jet for
∆φj j. The events in this additional bin help to constrain the Higgs boson mass and other nuisance parameters.
The fitted yields and errors are validated using an ensemble of pseudo-experiments and the statistical component of the error on the total yield is separated using a second fit with the nuisance parameters fixed to their profiled values.
Results for the diphoton invariant mass fit are shown for the jet multiplicity binning, as an example, in Figure 1. The determined yields for the inclusive and dijet observables are shown in Figures 2 and 3. The yields of all observables are in good agreement with the simulated SM expectation at m
H=125 GeV.
8 Unfolding procedure and associated uncertainties
8.1 Procedure
The data yields extracted in the previous section are corrected for detector e
ffects using bin-by-bin fac- tors. These are derived as the ratio of the yields from particle level to reconstruction level from simulated Higgs boson events, according to the SM expectation listed in Table 1. In each bin,
c
i=n
Particle leveli /nReconstructed
i
(2)
is used to correct the extracted signal yield in data. This unfolding procedure corrects for all efficien- cies, acceptances, and resolution effects. The correction factors range from 1.2 to 1.8, and include the extrapolation (about 20%, across all bins and observables) over the small region in
|η|excluded from reconstructed photon candidates (Section 4.1).
The method is formally unbiased provided that c
MCi =c
Datai. In practice, the requirement to use this method is that the ‘purity’ of events reconstructed into the same bin in which they were generated should not be too low. Among the measured observables, p
γγT,
|cos
θ∗|, and|yγγ|have very high purity (> 87%);
for the jet variables, it can be as low as 50%. The lower purity for the jet variables may be understood as a ‘double migration’: first for passing or failing the jet definition, and second for the migrations of the observable. The purities and derived correction factors for each bin of each observable are presented in the summary tables in the Appendix.
8.2 Uncertainties on the correction factors
To the extent that the physics processes and simulation model do not perfectly reproduce the data, the correction factors defined in Eq. 2 will be biased and model dependent. This dependence must be fully evaluated and considered as an uncertainty on the method. The potential bias can be categorized in two aspects: sample composition and shape.
The sample composition and description a
ffect correction factors, because di
fferent production pro-
cesses or generators may have different reconstruction level efficiencies. For instance the larger jet activ-
ity in t¯ tH events results in lower isolation efficiencies and larger correction factors than
ggH. The shapeof the input distributions matter, because the net migrations in a distribution will be from a more popu-
lated bin towards its less populated neighbour. The uncertainties from these considerations are evaluated
by manipulating the Monte Carlo from which the correction factors are determined, in four ways:
[GeV]
γ mγ
110 120 130 140 150 160
Events / GeV
500 1000 1500 2000 2500
Data Sig+Bkg Fit Bkg
Preliminary ATLAS
= 8 TeV s γ, γ
→
→H pp
dt = 20.3 fb-1
∫L
jets = 0 N
[GeV]
γ
mγ
110 120 130 140 150 160
Data-Bkg
-100 0 100 200
(a) Njets=0
[GeV]
γ mγ
110 120 130 140 150 160
Events / GeV
100 200 300 400 500 600 700 800 900
Data Sig+Bkg Fit Bkg
Preliminary ATLAS
= 8 TeV s γ, γ
→
→H pp
dt = 20.3 fb-1
∫L
jets = 1 N
[GeV]
γ
mγ
110 120 130 140 150 160
Data-Bkg
-50 0 50 100
(b)Njets=1
[GeV]
γ mγ
110 120 130 140 150 160
Events / GeV
50 100 150 200 250 300
Data Sig+Bkg Fit Bkg
Preliminary ATLAS
= 8 TeV s γ,
→γ
→H pp
dt = 20.3 fb-1
∫L
jets = 2 N
[GeV]
γ
mγ
110 120 130 140 150 160
Data-Bkg
-50 0 50
(c) Njets=2
[GeV]
γ mγ
110 120 130 140 150 160
Events / GeV
20 40 60 80 100 120
Data Sig+Bkg Fit Bkg
Preliminary ATLAS
= 8 TeV s γ,
→γ
→H pp
dt = 20.3 fb-1
∫L
≥ 3 Njets
[GeV]
γ
mγ
110 120 130 140 150 160
Data-Bkg -20
0 20
(d)Njets≥3
Figure 1: Diphoton invariant mass distributions are presented for the 4 bins of the N
jetsextraction.
The curves show the results of the single simultaneous fit to data for all N
jetsbins. The red line is the combined signal and background PDF, and the dashed line shows the background PDF. The difference of the two curves is the extracted signal yield. The bottom inset displays the residuals of the data with respect to the fitted background component, and the dotted red line corresponds to the signal PDF.
10
[GeV]
γ γ
pT
Reconstructed ]-1 [GeV Tp / dNd
0 5 10 15 20
25 ATLAS Preliminary
= 8 TeV s γ, γ
→ H
∫L dt = 20.3 fb-1 data syst. unc.
H X ) + 8 PY
+ OWHEG NLO+PS (P
→H gg
H t t + VH = VBF + H X
[GeV]
γ γ
pT
Reconstructed
0 20 40 60 80 100 120 140 160 180 200
OWHEGRatio to P
0 2
(a) pγγT
γ| yγ
| Reconstructed
|y / d|Nd
0 100 200 300 400 500 600 700 800
Preliminary ATLAS
= 8 TeV s γ, γ
→ H
∫L dt = 20.3 fb-1
data syst. unc.
H X ) + 8 PY
+ OWHEG NLO+PS (P
→H gg
H t t + VH = VBF + H X
γ| yγ
| Reconstructed 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 2.2 2.4
OWHEGRatio to P
0 2 4
(b)|yγγ|
θ*)|
|cos(
Reconstructed
*)|θ / d|cos(Nd
0 200 400 600 800 1000 1200 1400 1600 1800 2000 2200 2400
Preliminary ATLAS
= 8 TeV s γ, γ
→ H
∫L dt = 20.3 fb-1
data syst. unc.
H X ) + 8 PY
+ OWHEG NLO+PS (P
→H gg
H t t + VH = VBF + H X
θ*)|
|cos(
Reconstructed 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
OWHEGRatio to P
0 2 4 6
(c) |cosθ∗|
[GeV]
T,jet1
p Reconstructed ]-1 [GeV Tp / dNd
0 2 4 6 8 10 12 14 16
Preliminary ATLAS
= 8 TeV s γ, γ
→ H
∫L dt = 20.3 fb-1 data syst. unc.
H X ) + 8 PY
+ OWHEG NLO+PS (P
→H gg
H t t + VH = VBF + H X
[GeV]
T,jet1
p Reconstructed
0 20 40 60 80 100 120 140
OWHEGRatio to P
0 1 2 3
(d) pTj1
Figure 2: The fitted yield per bin for the Higgs p
T, rapidity,
|cos
θ∗|, and the leading jetp
Tare shown.
The bin below 30 GeV in the leading jet p
Tspectrum is populated by events with a jet multiplicity of
zero. The grey bands for each data point show the systematic uncertainty on the yields, while the black
bars represent the total systematic and statistical error. The hatched histogram shows the SM expectation
with m
H =125 GeV simulated with the full detector response and e
fficiencies. Its spread corresponds to
the quadratic sum of the migration uncertainties outlined in Section 6, and theoretical uncertainties. The
theory uncertainties are due to missing higher order corrections, the PDF set adopted, the simulation of
the underlying event, and the H
→ γγbranching fraction. The green line shows the contribution from
VBF, W H, ZH, and t¯ tH production.
Njets
Reconstructed
eventsN
0 50 100 150 200 250 300 350 400
450 ATLAS Preliminary
= 8 TeV s γ, γ
→ H
∫L dt = 20.3 fb-1 data syst. unc.
H X ) + 8 PY
+ OWHEG NLO+PS (P
→H gg
H t t + VH = VBF + H X
Njets
Reconstructed
0 1 2 ≥3
OWHEGRatio to P
0 1 2 3
(a)Njets
φjj
∆ Reconstructed
φ∆ / dNd
0 20 40 60 80 100 120 140 160
180 ATLAS Preliminary
= 8 TeV s γ,
→γ H
∫L dt = 20.3 fb-1 data syst. unc.
H X ) + 8 PY
+ OWHEG NLO+PS (P
→H gg
H t t + VH = VBF + H X
φjj
∆ Reconstructed
0 0.5 1 1.5 2 2.5 3
OWHEGRatio to P
0 5
(b)∆φj j
[GeV]
Hjj
pT,
Reconstructed ]-1 [GeV Tp / dNd
0 1 2 3 4
5 ATLAS Preliminary
= 8 TeV s γ, γ
→ H
∫L dt = 20.3 fb-1 data syst. unc.
H X ) + 8 PY
+ OWHEG NLO+PS (P
→H gg
H t t + VH = VBF + H X
[GeV]
Hjj
pT,
Reconstructed
0 20 40 60 80 100 120 140
OWHEGRatio to P
0 2 4 6
(c) pγγj jT