Measurement of the di ff erential cross-section of highly boosted top quarks as a function of their transverse momentum using the ATLAS detector in

(1)

ATLAS-CONF-2014-057 30September2014

ATLAS NOTE

ATLAS-CONF-2014-057

September 26, 2014

Measurement of the di ff erential cross-section of highly boosted top quarks as a function of their transverse momentum using the ATLAS detector in

√ s = 8 TeV proton-proton collisions

The ATLAS Collaboration

Abstract

The di

ff

erential cross-section for boosted top quark pair production is measured in 20.3 fb

⁻¹

of proton-proton collisions at a center-of-mass energy of 8 TeV. The measurement is per- formed for t¯ t events in the lepton+jets channel, where the hadronically decaying top quark has a transverse momentum above 300 GeV. Jet substructure techniques are employed to identify top quarks, which are reconstructed with an anti-k

t

jet with radius parameter R

=

1.0. The cross-section is reported as a function of the hadronically decaying top quark transverse momentum. The observed yield is corrected for detector e

ff

ects to obtain a cross- section at particle-level in a fiducial region close to the event selection. A parton-level cross- section extrapolated to the full phase-space is also reported for top quarks with transverse momentum above 300 GeV. The predictions of next-to-leading-order and leading-order ma- trix element plus parton shower Monte Carlo generators are found to generally overestimate the measured cross-sections.

c

Reproduction of this article or parts of it is allowed as specified in the CC-BY-3.0 license.

(2)

1 Introduction

A wide range of theories beyond the Standard Model (BSM) predict new phenomena observable at the Large Hadron Collider (LHC). Several of the predicted new particles, such as vector-like quarks and the supersymmetric partners of quarks and gluons, are expected to produce final states with several jets, often including b-jets, lepton(s) and missing transverse energy (E

^miss_T

). Top-antitop quark (t¯ t) production presents a very similar experimental signature and, since t¯ t events are produced with a high cross-section at the LHC, this process often constitutes a dominant background to these searches for new physics. An improved understanding of t¯ t production, and a more precise determination of the parton distribution functions, would therefore be of great benefit to the LHC program. Furthermore, BSM t¯ t production mechanisms can distort the top quark transverse momentum (p

T

) spectrum, in particular at high p

_T

(see e.g. Ref. [1, 2]). A precise measurement of the boosted top quark production rate might reveal a hint of physics beyond the Standard Model. The large number of t¯ t events produced so far at the LHC provide a unique opportunity to improve our understanding of t¯ t production and test the Standard Model (SM) at the TeV scale.

The first measurements of t¯ t production at the LHC have focused on a determination of the inclusive t¯ t cross-section. Now that the experimental uncertainties on these measurements (see e.g. Ref. [3–5]) are comparable to or lower than the theory uncertainties [6–11], the interest in di

ff

erential top cross-section measurements has gained traction. Measurements of the di

ff

erential cross-section as a function of the kinematics of the top quark, or the top-antitop quark pair, have been performed by the ATLAS [12, 13]

and CMS collaborations [14–16], where the highest top quark p

_T

bin in a fully corrected measurement corresponds to the interval 350-800 GeV [13]. In this note the first such measurement is performed using techniques specifically designed to deal with the collimated decay topology of boosted top quarks.

In particular, the hadronic top quark decay is reconstructed as a single large-radius (large-R) jet. The selection and reconstruction are based on an algorithm developed [17] and used in t¯ t resonance searches [18–20]. It increases the t¯ t selection efficiency at high top quark p

_T

and extends the kinematic reach into the TeV range. This analysis utilizes the lepton+jets channel where one W boson decays hadronically and the other leptonically, assuming each top quark decays to a W boson and a b-quark. The cross-section is measured as a function of the hadronically decaying top quark p

_T

. A particle-level cross-section is measured in a fiducial region resembling the detector-level selections for large-R jet p

_T

above 300 GeV. A parton-level di

ff

erential cross-section is also reported as a function of the top quark p

_T

. It is extrapolated to the full kinematic phase space except for a lower limit on top quark p

_T

of 300 GeV.

This note is structured as follows. The analysis strategy is introduced in Sec. 2. This is followed by the descriptions of the ATLAS detector (Sec. 3), the data and Monte Carlo (MC) samples used in the analysis (Sec. 4) and the object definition and event selection (Sec. 5). The estimation of the background is presented in Sec. 6. The systematic uncertainties affecting the measurement are described in Sec. 7 followed by a comparison of the observed yield to the sum of the signal and background expectations in Sec. 8. The fiducial region at the particle-level and the technique used to unfold the reconstructed top p

_T

spectrum to the particle and parton-levels are discussed in Sec 9, as well as the propagation of the statistical and systematic uncertainties through the unfolding procedure. The results and conclusions are presented in Sec. 10 and 11, respectively.

2 Analysis Strategy

The analysis starts by selecting a sample enriched in semileptonic t¯ t events (≈ 85%) with the following

event requirements. The hadronically decaying top quark is reconstructed using an anti-k

t

[21] jet of

radius R

=

1.0. The jet is trimmed [22] to mitigate the impact of initial state radiation, underlying event

activity, and additional pp interactions in the same bunch crossing (pile-up). Large-R jets containing

(3)

hadronically-decaying top quarks are selected by applying jet substructure cuts, which exploits the fact that they contain several high-p

_T

sub-jets and a high mass, unlike jets originating from QCD radiation.

The leading large-R jet meeting these requirements is referred as the top-jet candidate. Since the two top quarks are predominantly produced back-to-back, a spatial separation between the leptonically decaying and the hadronically decaying top quarks is required. The lepton (e or

µ) is required to be isolated using a

procedure that retains a high efficiency when the decay products of the leptonic top quark are collimated.

A small-R anti-k

t

jet (R

=0.4) is required to be close to the lepton and a significant amount of missing

transverse energy (E

^miss_T

) is demanded. At least one small-R jet has to be identified as originating from a bottom quark jet (b-tagged). More details on the identification and performance of boosted top quarks in ATLAS can be found in Ref. [23, 24].

After the event selection the background is composed primarily, in order of importance, of W

+

jets, t¯ t dilepton, single-top, and QCD multijets events. They are estimated by MC simulation (either normalized to a theoretical cross-section or, for the case of t¯ t dilepton, as to determine its fraction of the full t¯ t sample), except for W

+

jets, whose normalization and heavy-flavor content are adjusted in data control regions, and QCD multijets that is estimated with a fully-data driven method.

To minimize the dependence of the reported result on theoretical input, a particle-level fiducial region is designed to closely follow the event selection at the detector-level. Unfolding is carried out to correct for detector ine

ffi

ciencies and smearing e

ff

ects to produce a cross-section measurement as a function of the p

_T

of the leading selected particle-level large-R jets. A parton-level top quark p

_T

measurement is also reported after correcting the particle-level spectrum for hadronization and fragmentation e

ff

ects, and extrapolating to the full phase space of t¯ t events with top quark p

_T >

300 GeV. The unfolded particle-level and parton-level spectra are then compared to the prediction of several MC generators.

3 The ATLAS detector

ATLAS is a multi-purpose detector [25] that provides nearly full solid angle coverage around the inter- action point. This analysis exploits all major components of the detector. Charged particle trajectories are reconstructed by the inner detector, comprised of a silicon pixel detector, a silicon microstrip detector and a transition radiation tracker (TRT), which covers pseudorapidity

¹ |η| <

2.5. The inner detector is embedded in a 2 T solenoidal magnetic field. Sampling calorimeters with several different designs span the pseudorapidity range up to

|η| =

4.9. High granularity liquid argon (LAr) electromagnetic (EM) calorimeter are available up to

|η| =

3.2. Hadronic calorimetry based on scintillating tile active material cover

|η|<

1.7 while LAr technology is utilized for hadronic calorimetry from

|η|=

1.5 to

|η|=

4.9. The calorimeters are surrounded by a muon spectrometer. A magnetic field in the spectrometer is provided by air-core toroid magnets. Three layers of precision gas chambers track muons up to

|η|=

2.7 and muon trigger chambers cover

|η|<

2.4. 4 Data and Monte Carlo samples

The cross-section is measured using a dataset collected by the ATLAS detector during the 2012 LHC pp run at

√

s

=8 TeV, which corresponds to an integrated luminosity of 20.3±

0.6 fb

⁻¹

. The luminosity is measured using techniques similar to those described in Ref. [26] with a preliminary calibration of the luminosity scale derived from beam-overlap scans performed in November 2012. The average number of interactions per bunch crossing in 2012 was around 21. Data are considered only if they are acquired

1ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the detector and thez-axis along the beam pipe. Thex-axis points from the IP to the centre of the LHC ring, and they-axis points upward. Cylindrical coordinates (r,φ) are used in the transverse plane,φbeing the azimuthal angle around the beam pipe. The pseudorapidity is defined in terms of the polar angleθasη=−ln tan(θ/2).

(4)

under stable beam conditions and with all sub-detectors fully operational. The sample is collected using the logical OR of two single electron triggers with transverse momentum thresholds of 60 GeV, low- ered to 24 GeV in case of isolated electrons, and two single muon triggers with transverse momentum thresholds of 36 GeV, lowered to 24 GeV in case of isolated muons.

Samples of MC simulated events are used to characterize the detector response and e

ffi

ciency to reconstruct t¯ t events, estimate systematic uncertainties and predict the background contributions from various physics processes. The simulated events are weighted such that the distribution of the average number of pp interactions per bunch crossing agrees with data. The response of the detector is simulated using a detailed model implemented in GEANT4 [27]. For the evaluation of some systematic uncertain- ties, generated samples are passed to a fast simulation using a parameterisation of the performance of the ATLAS electromagnetic and hadronic calorimeters [28]. Simulated events are reconstructed using the same algorithms that are applied to collision data.

The nominal signal t¯ t sample is generated using the P



(P



-hvq patch4) [29] generator that is based on next-to-leading order QCD matrix elements. The CT10 [30] parton distribution functions (PDF) are employed and the top quark mass set to m

top =

172.5 GeV. Parton shower and hadronization are simulated with P v6.425 [31] using the Perugia 2011 tune [32]. Electroweak corrections are applied to this sample, which are extracted with H



2.1-



[33] implementing the theoretical cal- culations of Ref. [34–36]. The prediction of P



is compared to that obtained with other generators such as MC@NLO v4.01 [37] interfaced to H v6.520 [38] for parton showering and hadronization, J



v4.31 [39] for the modeling of multiple parton scattering and CT10 for the PDF set. In addition, the leading order (LO) multi-leg generator A



v2.13 [40] interfaced to H



is used where up to four additional partons in the matrix-element (ME) are produced. The MLM [40] matching scheme is employed to avoid double counting of configurations generated by both the parton shower and the ME calculation. The CTEQ6L1 [41] PDF set is employed. Heavy-flavor quarks are included in the ME calculations to produce the t¯ t

+

b b ¯ and t¯ t

+

c c ¯ processes. The overlap between the heavy-flavor quarks produced from the ME calculations and from the parton shower is removed. For the evaluation of sys- tematic uncertainties due to the parton showering model, a P

+

H



sample is compared to a P+P sample. The uncertainties due to QCD initial and final state radiation (ISR and FSR) modeling are estimated with samples generated with A



M



v3.8 [42] interfaced to P



for which the parton shower parameters are varied according to a measurement of the additional QCD activity in t¯ t events [43]. The electroweak corrections that are applied to the nominal P+P sample are not applied to the other samples. The t¯ t samples are normalized to the NNLO+NNLL cross-section

²

[6–11]:

σtt¯ =

253

⁺₋₁₅¹³

pb. Each t¯ t sample is produced requiring at least one leptonic decay in the top anti-top quark pair.

Leptonic decays of vector bosons produced in association with several high-p

T

jets, referred to as W

+

jets and Z

+

jets, constitute the largest background in this analysis. Samples of simulated W

/Z+

jets events with up to five additional partons in the LO matrix-elements are produced with the A gen- erator interfaced to P



for parton showering using the MLM matching scheme. Heavy-flavor quarks are included in the ME calculations to produce the Wb b, ¯ Wc¯ c, Wc, Zb b ¯ and Zc¯ c processes. The over- lap between the heavy-flavor quarks produced by the matrix element and by parton shower is removed.

W

+jets samples are normalized to the inclusive

W boson NNLO cross-section [45, 46] and corrected applying additional scale factors derived from data, as described in Sec. 6.

Single top quark production is simulated using multiple generators. Production in the t-channel is re- alized using the A



M



generator while production in the s-channel and production in association with a W boson are modeled with P



. Both generators are interfaced with P



using the CTEQ6L1 PDF set and the Perugia 2011 tune for parton showering modeling. All samples are generated requir-

2The top++2.0 [44] calculation includes the next-to-next-to leading order QCD corrections and resums next-to-leading logarithmic soft gluon terms. The quoted cross-section corresponds to a top quark mass of 172.5 GeV.

(5)

ing the presence of a leptonically decaying W boson. The cross-sections multiplied by the branching ratios for the leptonic W decay employed for these processes are 28.4 pb (t-channel) [47], 22.4 pb (Wt production) [48], and 1.8 pb (s-channel) [49], as obtained from NLO+NNLL calculations.

Diboson production is modeled using S



[50] with the CT10 PDF set and the yields are normal- ized to the NLO cross-sections [51].

5 Object definition and event selection

Electron candidates are reconstructed as charged particle tracks in the inner detector associated with energy deposits in the EM calorimeter. They must pass identification criteria based on the shower shape in the EM calorimeter, on track quality, and on the transition radiation observed in the TRT detector. The EM clusters are required to be in the pseudorapidity region

|η| <

2.47, excluding the transition region between the barrel and the encdap calorimeters (1.37

<|η|<

1.52). They must have a transverse energy E

T >

25 GeV. The associated track must have a longitudinal impact parameter

|z₀|<

2 mm with respect to the primary vertex.

Muon candidates are defined by matching track segments in the muon spectrometer with tracks in the inner detector. The track p

_T

is determined through a global fit of the track that takes into account the energy loss in the calorimeters. The track is required to have

|z₀|<

2 mm and a transverse impact param- eter significance indicating the track is consistent with coming from the hard scatter,

|d₀/σ(d₀

)|

<

3. Muons are required to have p

_T >

25 GeV and be in the fiducial region

|η|<

2.5. Lepton candidates are required to be isolated to suppress background leptons originating from jets.

The variable “mini-isolation” [52] is introduced and is defined as I

_mini = P

tracks

p

^track_T /

p

^l_T

, where p

^l_T

is the lepton transverse momentum and the sum is over all tracks (excluding the lepton track) that have p

_T >

0.4 GeV, pass quality cuts and have

∆

R(l, track)

<

K

_T/

p

^l_T

. The isolation cone radius decreasing with the increasing p

^l_T

improves the selection e

ffi

ciency for high-p

_T

decays. The parameter K

_T

is set to 10 GeV and the isolation requirement I

mini <

0.05 is applied for both electrons and muons.

Jets are reconstructed using the anti-k

t

algorithm [21] implemented in the F



J



package [53] with radius parameter R

=

0.4 or R

=

1.0, using as input calibrated topological clusters [54, 55]. A pile- up correction is applied for small-R jets by estimating the transverse momentum induced by pile-up interactions on an event-by-event basis using the jet area [56] and the median of the transverse momentum distribution density [57] of all jets found by the k

_t

algorithm [58,59]. The p

_T

of small and large-radius jets and the large-R jet mass, obtained from the four-momentum sum of all jet constituents, are calibrated using energy- and

η-dependent correction factors. After this calibration, the

p

_T

and mass of the jets correspond on average to the ones of the corresponding truth-level jets, which are built from the stable particles produced by the MC event generator [23, 60]. Differences between the small-R jet response in data and MC simulation are evaluated from control samples and corresponding corrections are applied to data. Small-R jets are required to be in the fiducial region

|η| <

2.5 and must have p

_T >

25 GeV.

The jet vertex fraction (JVF) exploits the fraction of tracks associated to the jet coming from the primary vertex and is computed as the summed p

_T

of all tracks matched to the jet from the primary vertex divided by the summed p

_T

of all tracks matched to the jet. Small-R jets with p

_T <

50 GeV are rejected when

|JV F| <

0.5, to reduce the contribution of jets generated by pile-up interactions. The trimming algorithm [22] with parameters R

sub=

0.3 and f

cut=

0.05 is applied to large-R jets to suppress the effect of the pile-up and improve background rejection. Trimmed large-R jets are considered for the analysis if

|η|<

2.0 and p

_T >

300 GeV.

Small-R jets originating from b-quarks are tagged exploiting the long decay time and the large mass

of the b-quark hadrons. Information from the track impact parameters, secondary vertex location and

decay topology are combined in a neural network based algorithm (MV1) [61]. The operating point

corresponds to an overall 70% b-tagging efficiency in t¯ t events, and to a probability to mistag light-flavor

(6)

jets of approximately 1%.

Since leptons deposit energy in the calorimeters, an overlap removal procedure is applied in order to avoid double counting of leptons and small-R jets. In order to improve the reconstruction efficiency in the boosted topology, a specific overlap removal procedure has been adopted. First, jets close to electrons, with

∆

R(e, jet

_R₌_0.4

)

<

0.4 are corrected by subtracting the electron four-vector from the jet four-vector and the JVF is recalculated removing the electron track. The new e-subtracted jet is retained if it passes the jet selection criteria listed before, otherwise it is rejected. After this procedure, electrons that are still within

∆

R(e, jet

_R₌_0.4

)

<

0.2 from a small-R jet are removed. The muon-jet overlap removal procedure removes muons that fall inside a cone of

∆

R(µ, jet

_R_=0.4

)

<

0.04

+

10 GeV/p

_T,µ

around a small-R jet axis.

The missing transverse energy E

^miss_T

is computed as the vector sum of the energy of all calorimeter cells. Their energy is corrected on the basis of the associated physics object. The contribution of muons is added using their momentum obtained from the tracking system and the muon spectrometer.

The event selection proceeds as follows. Each event must have a reconstructed primary vertex with five or more associated tracks. The events are required to contain exactly one reconstructed lepton can- didate geometrically matched to the trigger object (where the trigger is defined in Sec. 4). The transverse mass of the lepton and E

^miss_T

is defined as m

^W_T = q

2p

^lepton_T

E

^miss_T

(1

−

cos

∆φ), where∆φ

is the azimuthal angle between the lepton and E

_T^miss

. Events are retained if E

^miss_T >

20 GeV and E

^miss_T +

m

^W_T >

60 GeV to suppress QCD multijets events.

The selection exploits the fact that the boosted top quark decay products tend to be collimated.

Therefore events are selected by requiring the presence of at least one small-R jet close to the lepton (

∆

R(l, jet

_R₌_0.4

)

<

1.5) and the existence of a reconstructed large-R trimmed jet with mass m

_jet>

100 GeV.

To improve the rejection of jets originating from QCD radiation from light quark or gluon jets a cut on the k

t

splitting scale [58, 59] of the large-R jets is made. The k

t

splitting scale is calculated by reclustering the large-R jet with the k

_t

-clustering algorithm, and taking the k

_t

distance between the two proto-jets of the final clustering step

√

d

₁₂ =

min(p

_T1,

p

_T2

)

∆

R

₁₂

. It is expected to have large values for jets containing two hard subjets, as expected in the decay of massive objects. Events are selected if the large-R jet has

√

d

₁₂ >

40 GeV [24]. The large-R jet must be well separated from the lepton (

∆φ(l,

jet

_R₌_1.0

)

>

2.3) and from the small-R jet associated to the lepton (

∆

R(jet

_R₌_1.0,

jet

_R₌_0.4

)

>

1.5).

If more than one large-R jets satisfy these criteria, the one with highest p

_T

is considered as the top-jet candidate. Finally, at least one of the two top candidates must be b-tagged. This implies that either the highest p

_T

small-R jet close to the lepton (

∆

R(l, jet

_R₌_0.4

)

<

1.5) or at least one small-R jet close to the large-R jet (∆ R(jet

_R_=1.0,

jet

_R_=0.4

)

<

1.0) are b-tagged

³

.

The event selection is summarized in Tab. 1. After these requirements the data sample contains 4148 and 3604 events in the electron and muon channels, respectively. The observed yield is compared to the Standard Model prediction in Sec. 8.

6 Background estimation

W

+jets production represent the largest non-t¯

t contribution in the final sample. The strategy to pre- dict this background is to use as a starting point the A

+

P



W

+jets samples normalized to the

inclusive W boson NNLO cross-section. The normalization and heavy flavor fraction of the W

+

jets background have large theoretical uncertainties, and are thus determined from data.

The heavy flavor fraction scale factors correct for the mis-modeling in the generator of the fractions of W production associated di

ff

erent flavour components (W

+

b b, ¯ W

+

c¯ c, W

+

c). They are estimated in a sample with the same lepton and E

^miss_T

selections as the signal selection, but with only two small-R

3Note that the reconstruction of a large-Rjet does not prevent the reconstruction of small-Rjets overlapping with it.

(7)

Cut Detector-level Particle-level

e+jets µ+jets

Leptons

|z0|<2 mm Imini<0.05

|η|<1.37 OR 1.52<|η|<2.47

pT >25 GeV

|z0|<2 mm &|d0/σ(d0)|<3.;

Imini<0.05;

|η|<2.5 pT >25 GeV

|η|<2.5 p_T >25 GeV

Anti-ktR=0.4 jets

pT>25 GeV

|η|<2.5

JVF>0.5 (ifpT<50 GeV)

|η|<2.5 p_T >25 GeV

Overlap removal

if∆R(e,jet_R=0.4)<0.4:

jet⁰_R₌_0.4=jet_R=0.4−e if∆R(e,jet⁰_R₌_0.4)<0.2:

eremoved

if∆R(µ,jet⁰_R=0.4)<0.04+10.GeV/pT(µ):

µremoved None

E^miss_T ,m^W_T E^miss_T >20 GeV,E_T^miss+m^W_T >60 GeV

Leptonic top At least one anti-ktR=0.4 jet satisfying∆R(l,jet_R=_0.4)<1.5 Hadronic top

The leading trimmed anti-k_tR=1.0 jet pT>300 GeV,m>100 GeV,√

d12>40 GeV

∆R(jetR=1.0,jet_R₌_0.4)>1.5,∆φ(l,jet_R₌_1.0)>2.3 B-tagging

At least one of:

1) the leading anti-ktR=0.4 jet satisfying∆R(l,jet_R=0.4)<1.5 isb-tagged;

2) at least one anti-ktR=0.4 jet satisfying∆R(jetR=1.0,jet_R₌_0.4)<1.0 isb-tagged

Table 1: Summary of event selections for detector-level and truth particle-level events.

jets and no b-tagging requirements. The b-jet multiplicity, in conjunction with the knowledge of the b- tagging and mistag e

ffi

ciency, is used to extract the heavy flavor fraction in this sample. This information is extrapolated to the signal region using the MC simulation, assuming constant relative rates for the signal and control regions.

The overall W

+

jets normalization is obtained by exploiting the expected charge asymmetry in the production of W

⁺

and W

⁻

at a pp collider. This asymmetry is predicted precisely by theory, and other processes in the t¯ t sample are symmetric in charge except for a small contamination from single-top and WZ events which is corrected by MC. The total number of W

+

jets events in the sample can thus be estimated with the following equation:

N

_W++

N

_W⁻ =

r

_MC+

1 r

MC−

1

!

(D

corr+−

D

_corr−

), (1)

where r

MC

is the ratio of the number of events with positive to negative leptons in the MC, and D

corr+

and D

_corr−

are the number of events with positive and negative leptons in the data, respectively. The signal sample has too few events and is too depleted in W

+jets events to apply Eqn. 1 directly. Instead

a sample enhanced in W

+jets events is created by dropping the

b-tagging,

∆φ(jet_R₌_1.0,

l), jet mass and

√

d

₁₂

requirements. The obtained scale factors are applied to the normalized W

+

jets MC samples. After correction by this scale factor W

+jets events are expected to make up approximately 6% of the total event

yield in the signal region.

QCD multijet events can, in rare instances, mimic the lepton

+

jets signature. This background is estimated directly from data by using the matrix-method technique (see e.g. Ref. [62]). A sample enhanced in fake leptons, characterized by non-prompt leptons or jets misidentified as prompt leptons, is obtained by loosening the lepton identification requirements. The number of events with fake leptons in the signal region can be predicted as:

N

_QCD=

(

−

1) f

−

f N

_T+

f

−

f N

_A,

(2)

(8)

where and f are the efficiencies for leptons that passed the loose selections to also pass the tight (signal) selections, for real and fake leptons respectively, N

_T

is the number of events with a tight lepton, and N

_A

is the number of events with a loose lepton which failed the tight cuts. The efficiency f is measured using data in fake-enhanced control regions and the rate is extracted from MC and validated in data. QCD multijets events contribute to the total event yield at approximately the percent level.

Single-top, Z

+jets and diboson events, all sub-dominant backgrounds, are estimated from MC gen-

erators normalized to the best known theory calculation of their cross-sections. Their corresponding MC samples are described in Sec. 4.

Top quark pair events with both top and anti-top quarks decaying leptonically (including decays to

τ) can sometimes pass the event selection, contributing to approximately 5% of the total event yield. The

fraction of dileptonic t¯ t events in each p

_T

bin is estimated using the same MC sample used for the signal modeling (see Sec. 4). Leptonic top quark decays to

τ-leptons in lepton+jets

t¯ t events are considered as signal only if the

τ

lepton decays leptonically.

7 Systematic uncertainties

Systematic uncertainties, which arise from object reconstruction and calibration, MC generator modeling and background estimation, are described below. The propagation of systematic uncertainties through the unfolding procedure is described in Sec. 9.4.

The uncertainty on the large-R jet energy scale (JES), jet mass scale (JMS) and k

t

splitting scale is obtained using two di

ff

erent data-driven methods. For p

_T >

800 GeV for JES, and for all p

_T

for the JMS and k

_t

splitting scale, the ratio of the large-R jets kinematic variables reconstructed from the calorimeter clusters to inner detector tracks is compared between data and MC. For p

_T<

800 GeV for JES, the p

_T

of large-R jets are compared to the well calibrated p

_T

of photons in a large statistics photon

+

jets sample. An additional MC-based uncertainty, referred to as large-R JES topology uncertainty, is included to reflect the fact that the jets in these calibration samples have a different response (gluon or light quark jets) than those in t¯ t events (top-jets). The full di

ff

erence between the response of these two types of jets is conservatively assumed as the corresponding systematic uncertainty. The uncertainty on the large-R jet energy resolution (JER) is determined by smearing the jet energy such that the resolution is increased by 20% [63] and evaluating the effect on the final result. The same smearing procedure is applied to determine the uncertainty due to the large-R jet mass resolution (JMR). The uncertainties on the large-R jets JES represent the dominant contribution to the total uncertainty of this measurement, in particular the photon+jet calibration and the topology uncertainties.

The small-R jet energy scale uncertainty is derived using a combination of simulations, test beam data and in situ measurements [55, 64, 65]. Additional contributions from the jet flavour composition, calorimeter response to different jet flavours and pile-up are taken into account. Uncertainties in the jet energy resolution are obtained with an in situ measurement of the jet response asymmetry in di-jet events [66].

The efficiency to tag real and fake b-jets is corrected in Monte Carlo events by applying b-tagging scale factors, extracted in t¯ t and dijet samples, that compensate for the residual di

ff

erence between data and simulation. The associated systematic uncertainty is computed by varying the scale factors within their uncertainty [61, 67, 68]. The b-jet calibration is performed up to p

_T=300 GeV; for larger transverse

momenta an additional MC-based extrapolation uncertainty, which ranges from 10% to 30% increasing with b-jet p

_T

from 300 GeV to 1200 GeV, is applied

⁴

.

The lepton reconstruction efficiency in the simulation is corrected by scale factors derived from mea- surements of these e

ffi

ciencies in data using a Z

→

l

⁺

l

⁻

enriched control regions. The lepton trigger

4Theb-jetpTdistribution extends to approximately 600 GeV in this analysis.

(9)

and reconstruction efficiency scale factors, energy scale and resolution are varied within their uncertain- ties [69, 70].

The uncertainty associated to E

^miss_T

is calculated by propagating the energy scale and resolution systematics on all physics objects to the E

^miss_T

calculation. Additional E

^miss_T

uncertainties arising from energy deposits not associated with any reconstructed objects are also included [71].

The signal modeling uncertainty, including the parton shower and hadronization models, is evaluated by comparing the result obtained with P

+

P



with the ones obtained with MC@NLO

+

H



, A

+

H



and P

+

H



. The largest deviation between all of these alternate t¯ t samples and the nominal P+P sample is symmetrized and used as the signal modeling uncertainty. The modeling of QCD initial and final state radiation (ISR and FSR) is evaluated separately using dedicated A



M

 +

P



samples with variation of the P



parameters for QCD radiation. In addition, a relative uncertainty of 10% is included on the t¯ t production electroweak correction [34–36], which has an impact of less than 1% on the reported cross-section.

The W

+

jets systematic uncertainties due to the overall normalization and the heavy flavor fraction are obtained by varying the data-driven scale factors within their uncertainty. A W

+

jets shape uncertainty is also extracted by varying the renormalization and matching scales in A. The W

+jets MC statistical

uncertainty is also taken into account, its contribution to the cross-section uncertainty increases with the top-jet candidate p

_T

from approximately 1% to 6%. The uncertainty on the fake lepton background is included by varying the definition of loose leptons, changing the selection used to form the control region and propagating the statistical uncertainty of parameterisations of the e

ffi

ciency and the fake rate.

A 50% uncertainty is applied to the normalization of the Z

+

jets and diboson backgrounds. The single- top background is assigned an uncertainty associated to the theory calculations used for its normalization [47–49], as well as a shape uncertainty extracted by comparing two MC generators.

8 Data and MC comparison at detector-level

Having evaluated the background and systematic uncertainties, the agreement between the predicted signal and background yields and data is assessed. Tab. 2 gives the number of observed and expected events for each process, where the systematic uncertainties on the background estimates, objects energy scale and reconstruction efficiencies, and MC statistics are taken into account. The prediction is generally found to overestimate the data by approximately one standard deviation (when signal modeling and t¯ t cross-section uncertainties are taken into account, which are not included in the table).

e

+jets µ+jets

t¯ t

`+jets

4020

±

460 3500

±

400 t¯ t dilepton 227

±

36 210

±

26 W

+

jets 263

±

50 252

±

48 single top 136

±

27 134

±

25 Multijet 91

±

17 3

±

1 Z

+

jets 34

±

18 14

±

8 Dibosons 22

±

11 18

±

9 Prediction 4790

±

540 4130

±

470 Data 4148 3604

Table 2: Observed and expected number of events in the signal e

+

jets and

µ+

jets samples. The systematic

uncertainties include the background estimates, objects energy scale and reconstruction e

ffi

ciencies, and

MC statistics

(10)

The agreement of the data with the prediction is further tested by studying the distributions of several variables of interest in Fig. 1-2. Each figure shows on the left and right the electron and muon channels, respectively. The systematic uncertainties on the objects energy scale and reconstruction efficiencies, on the background estimates, and from MC statistics are shown. Figure 1 shows the p

_T

of the lepton candidate, the p

_T

of the selected small-R jet and E

_T^miss

. Figure 2 shows kinematic distributions for the leading large-R jet in the event, i.e.

η, mass and

p

_T

. While the prediction generally overestimates the data, as observed already from Tab. 2, the shape of each variable is generally well modelled by the simulation. Exceptions are the p

_T

of the candidate lepton and the large-R jet. For these distributions the disagreement increases with p

_T

, with the simulation overestimating the data by up to approximately 50% in the highest bin. The distribution of the p

_T

of the leading large-R jets constitutes the input to the unfolding procedure described below.

9 Di ff erential cross-section determination

The di

ff

erential cross-section is measured as a function of the p

_T

of the particle-level top-jet candidate, defined as the leading trimmed large-R jet in the event and in a specific fiducial region of phase space close to the event selection. Both the large-R jet and the fiducial region are defined at the particle-level.

The di

ff

erential cross-section is also measured at the parton-level as a function of the p

_T

of the top quark.

The combination of the results of e

+

jets and

µ+

jets channels is described in Sec. 9.1. To mimimize the corrections of the observed data to the particle level, particles and particle jets are defined in a kinematic phase-space that is close to the experimental selection for observable objects (fiducial region). A second correction extrapolates the result to the full parton-level phase space above top quark p

_T >

300 GeV.

The definitions are presented in Sec. 9.2. The procedures to correct for the detector inefficiencies and finite resolution from the reconstructed to the particle- and parton-level are presented in Sec. 9.3. The propagation of the systematic and statistical uncertainties through the unfolding procedure, and their results, are discussed in Sec. 9.4.

9.1 Combination of decay channels

A combined

`+

jets data sample is built by a logical “OR” of the e

+

jets and

µ+

jets selections at the detector-level. The MC samples used to derive simulated predictions for signal and backgrounds are also subjected to the combined selection. The resulting combined simulated samples are built according to the proportions dictated by their respective efficiency. The data-driven backgrounds are derived in each channel independently as described in Sec. 6 and added to the combined simulated predictions.

A combined simulated t¯ t sample is used to derive the particle and parton fiducial phase spaces and the associated unfolding corrections described in Sec. 9.3. The uncertainties are evaluated based on the sum of the events. This approach properly accounts for all correlations since e

ff

ects that are independent between channels (e.g. electron scale factors) are accounted for only in the specific subsample (e.g.

e

+

jets events), while common effects (e.g. jet energy scale) affect the full sample, with each channel contributing with the appropriate weight.

This combination method is cross-checked by performing the unfolding in each channel individually

to the

`+

jets phase space described in Sec. 9.2 and comparing these alternate cross-section estimates

with the one based on the combined data.

(11)

0 100 200 300 400 500 600

Events / GeV

10 20 30 40

50 ^Datatt Single lepton

Dilepton t t Single top W+jets Multijet Z+jets Diboson ATLASPreliminary

= 8 TeV, e+jets s

-1, 20.3 fb

[GeV]

lepton pT

0 100 200 300 400 500 600

Data / MC 0.5

1 1.5

(a) e+jets

0 100 200 300 400 500 600

Events / GeV

10 20 30 40

50 Data

Single lepton t

t Dilepton t t Single top W+jets Multijet Z+jets Diboson ATLASPreliminary

+jets = 8 TeV, µ s

-1, 20.3 fb

[GeV]

lepton pT

0 100 200 300 400 500 600

Data / MC 0.5

1 1.5

(b) µ+jets

0 100 200 300 400 500 600

Jets / GeV

20 40 60 80 100 120 140 160 180

200 _Data

Single lepton t

= 8 TeV, e+jets s

-1, 20.3 fb

[GeV]

small-R jets pT

0 100 200 300 400 500 600

Data / MC 0.5

1 1.5

(c) e+jets

0 100 200 300 400 500 600

Jets / GeV

20 40 60 80 100 120 140 160

180 Data

Single lepton t

+jets = 8 TeV, µ s

-1, 20.3 fb

[GeV]

small-R jets pT

0 100 200 300 400 500 600

Data / MC 0.5

1 1.5

(d) µ+jets

Events / GeV

10 20 30 40

50 Data

Single lepton t

= 8 TeV, e+jets s

-1, 20.3 fb

[GeV]

miss

ET

0 50 100 150 200 250 300 350 400 450 500

Data / MC 0.5

1 1.5

(e) e+jets

Events / GeV

5 10 15 20 25 30 35 40

45 Data

Single lepton t

+jets = 8 TeV, µ s

-1, 20.3 fb

[GeV]

miss

ET

0 50 100 150 200 250 300 350 400 450 500

Data / MC 0.5

1 1.5

(f) µ+jets

Figure 1: Distributions of p

_T

of the lepton candidates (top), p

_T

of selected small-R jets (middle) and

missing E

_T

(bottom). The distributions for the e

+jets and µ+jets channels are shown on the left and

right figures, respectively. The t¯ t prediction is obtained using the nominal P

+

P



sample. The

ratio of data and MC prediction is shown in the insets below the plots. The hashed area includes all

the object-related uncertainties (on the jet, lepton and E

_T^miss

) and the uncertainties from the background

estimation and MC statistics. The vertical lines indicate the data statistical uncertainty.

(12)

Events / 1.0

500 1000 1500 2000 2500 3000 ^Data

Single lepton t

t Dilepton t t Single top W+jets

Multijet Z+jets Diboson

ATLASPreliminary = 8 TeV s

-1, 20.3 fb

e+jets

top-jet candidate η -2 -1.5 -1 -0.5 0 0.5 1 1.5 2

Data / MC 0.5

1 1.5

(a) e+jets

Events / 1.0

500 1000 1500 2000 2500 3000

Data Single lepton t

t Dilepton t t Single top W+jets

Multijet Z+jets Diboson

ATLASPreliminary = 8 TeV s

-1, 20.3 fb

+jets µ

top-jet candidate η -2 -1.5 -1 -0.5 0 0.5 1 1.5 2

Data / MC 0.5

1 1.5

(b) µ+jets

100 150 200 250 300 350 400

Events / GeV

20 40 60 80

100 _Data

Single lepton t

= 8 TeV, e+jets s

-1, 20.3 fb

top-jet candidate mass [GeV]

100 150 200 250 300 350 400

Data / MC 0.5

1 1.5

(c) e+jets

100 150 200 250 300 350 400

Events / GeV

10 20 30 40 50 60 70 80

90 Data

Single lepton t

+jets = 8 TeV, µ s

-1, 20.3 fb

top-jet candidate mass [GeV]

100 150 200 250 300 350 400

Data / MC 0.5

1 1.5

(d) µ+jets

300 400 500 600 700 800 900 1000 1100 1200

Events / GeV

10-2

10-1

1 10 102

103

104 _Data

Single lepton t

= 8 TeV, e+jets s

-1, 20.3 fb

[GeV]

top-jet candidate pT

300 400 500 600 700 800 900 1000 1100 1200

Data / MC 0.5

1 1.5

(e) e+jets

300 400 500 600 700 800 900 1000 1100 1200

Events / GeV

10-2

10-1

1 10 102

103

104 _Data

Single lepton t

+jets = 8 TeV, µ s

-1, 20.3 fb

[GeV]

top-jet candidate pT

300 400 500 600 700 800 900 1000 1100 1200

Data / MC 0.5

1 1.5

(f) µ+jets

Figure 2: Distributions of

η

(top), mass (middle) and p

_T

(bottom) of the leading selected anti-k

t

R

=1.0

jets. The distributions for the e

+jets andµ+jets channels are shown on the left and right figures, respec-

tively. The t¯ t prediction is obtained using the nominal P

+

P



sample. The ratio of data and

MC prediction is shown in the insets below the plots. The hashed area includes all the object-related

uncertainties (on the jet, lepton and E

_T^miss

) and the uncertainties from the background estimation and MC

statistics. The vertical lines indicate the data statistical uncertainty.

(13)

9.2 Particle-level, parton-level and fiducial region definitions

Particle-level corrections to the data are derived using leptons and jets from simulated t¯ t events that are constructed using particles, with a mean lifetime greater

⁵

than 0.3

×

10

⁻¹⁰

s, that are directly resulting from pp interactions or from subsequent decays of particles with a shorter mean lifetime.

To define the lepton of the particle-level selection, all leptons (e,

µ,νe

,ν

µ

,

ντ

) from hadron decays are discarded. The (e,

µ,) leptons fromτ-lepton decays are accepted only if the parentτ

lepton is not a hadron decay product itself. The four-momenta of photons within a cone of

∆

R

=

0.1 around the electron and muon direction are added to those of the leptons.

Both small-R and large-R jets are reconstructed using all particles except for leptons as selected above (i.e. muons and neutrinos from hadron decays are included in jets) and for any photons associated with the selected electrons or muons. The trimming procedure applied to detector-level jets described in Sec. 5 is also applied to particle-level jets, considering all of their particle-level constituents.

An R

=0.4 jet with

p

_T >

25 GeV and

|η| <

2.5 is considered to be “b-tagged” (i.e. initiated by a b-quark) if there exists at least one B-hadron with p

_T >

5 GeV which is clustered in the jet

⁶

.

The missing transverse momentum magnitude E

^miss_T

is calculated from the four-momentum sum of neutrinos not resulting from hadron decays.

Aiming at minimizing the theoretical input to the measurement, the fiducial region is chosen to follow the detector-level event selections closely, including the kinematic requirements on the objects and the requirements on the event topology. In contrast to the detector-level selection, no overlap removal procedure is applied for leptons and jets, as described in Sec. 5 and no isolation requirement is imposed on leptons. Using the particle-level objects defined above, the fiducial region is defined as:

•

exactly one lepton (electron or muon) with p

_T >

25 GeV,

|η|<2.5.

•

E

^miss_T >

20 GeV and E

^miss_T +

m

^W_T >

60 GeV.

•

at least one R

=0.4 anti-kt

jet with p

_T >

25 GeV and

|η|<

2.5 and within

∆

R

<1.5 from the lepton.

If there is more than one such jet, the one with the largest p

_T

is considered to be the leptonic b-jet candidate.

•

at least one R

=

1.0 trimmed anti-k

t

jet with p

_T>

300 GeV, mass

>

100 GeV,

√

d

₁₂ >

40 GeV and

|η|<

2, well separated from both the lepton (∆

φ >

2.3) and the leptonic b-jet candidate (∆ R

>

1.5).

The jet mass is reconstructed from the four-vector sum of the particles constituting the jet. If more than one large-R jet satisfy these criteria, the one with largest p

_T

is chosen. The jet passing this selection is referred to as the particle-level top-jet candidate.

•

at least one b-tagged small-R jet such that

∆

R(jet

_R₌_1.0,

jet

_R₌_0.4

)

<

1 and

/

or the leptonic b-jet can- didate is b-tagged.

The particle-level event selection is summarized in Tab. 1. Fiducial particle-level corrections are deter- mined by using only simulated t¯ t events for which exactly one of the W bosons, resulting from the decay of the t¯ t pair, decays to an electron or a muon either directly or through a

τ

lepton decay. All other t¯ t events, classified as background in Sec. 6, are not used for this purpose. The cross-section is then mea- sured as a function of the particle-level top-jet candidate as defined above and its transverse momentum is called p

_T,ptcl

.

For the parton-level, the top quark that decays to a hadronically decaying W boson is considered just before the decay and after QCD radiation, selecting events for which the momentum of such a top

5With this definition charged pions are considered stable and used to build jets.

6TheB-hadrons are not stable and will not contribute to the total four-vector of the jet, only their decay products will.

However they are clustered with their energy set to a negligible value to check that they match the jet geometrically [56].

(14)

quark, p

_T,parton

, is larger than 300 GeV. Parton-level corrections are determined by using only simulated t¯ t events for which exactly one of the W boson decays to an electron or a muon or a

τ

lepton (including hadronic

τ

decays). The correction to the full phase partonic phase space defined above is obtained by accounting for the branching ratio of t¯ t to the

`+jets channel.

9.3 Unfolding to particle- and parton-level

The procedure to unfold the distribution of p

_T,reco

, the p

_T

of the detector-level leading trimmed large-R jet, to obtain the di

ff

erential cross-section as a function of p

_T,ptcl

is comprised of several steps, outlined in:

dσ

tt¯

d p

ⁱ_T,ptcl =

N

_ptclⁱ

∆

X

ⁱL =

1

∆

X

ⁱL

f

_ptcl!recoⁱ X

j

(M

⁻¹

)

^ptcl,i_reco,j

f

_reco!ptcl^j

(N

_reco^j −

N

_reco,bgnd^j

) (3) where N

_reco^j

is the number of observed events in bin j of p

_T,reco

after all selection requirements detailed in Sec. 5 are applied, N

_ptclⁱ

is the total number of events in bin i of p

_T,ptcl

that meet the selection requirement in the fiducial region described in Sec. 9.2,

∆

X

ⁱ

is the size of bin i of p

_T,ptcl

(and p

_T,reco

) and

L

is the total integrated luminosity of the data sample. The corrections that are applied to p

_T,reco

correspond to the following steps.

First, the post-selection non-t¯ t background contamination, N

_reco,bgnd^j

, which is described in Sec. 6, is subtracted from the observed number of events in each p

_T,reco

bin. The contribution from non-`

+

jets t¯ t events defined as background in Sec. 6 is subtracted by a multiplicative correction, quantifying the expected fraction of these events in p

_T,reco

bin, to reduce possible biases deriving from predictions of their absolute value.

The corrections described in the remaining steps are all extracted from the nominal P+P

t¯ t sample. In the second step, the acceptance factors f

_reco!ptcl^j

correct the p

_T,reco

spectrum, on a bin-by-bin basis, for the t¯ t events that fulfill the detector-level selection requirements, but fail to satisfy those for the fiducial region at particle-level. For each p

_T,reco

bin j, f

_reco!ptcl^j

is defined as the ratio of the number of events that meet both detector- and particle-level selection requirements to the number of events that satisfy the detector-level selection in that bin. The distribution of f

_reco!ptcl^j

is shown in Fig. 3(a) for various MC generators.

The third step corrects for detector resolution e

ff

ects. A migration matrix is constructed to correlate the p

_T,reco

binned distribution to the p

_T,ptcl

distribution, the particle-level representation. It is calculated by using t¯ t events that meet the reconstruction selection requirements and are within the particle-level fiducial region. The migration matrix M

_reco,j^ptcl,i

represents the probability for an event with p

_T,ptcl

in bin i to have a p

_T,reco

in bin j. This matrix is shown in Fig. 4(a) for the combined

`+

jets sample. It shows that approximately 50% to 85% of events have values of p

_T,ptcl

and of p

_T,reco

that fall in the same bin.

The inversion of the migration matrix to correct p

_T,reco

to the particle-level is carried out by an

unfolding scheme based on Tikhonov regularization that is implemented through the singular value de-

composition (SVD) of the matrix [72]. This scheme is chosen to reduce sizeable statistical fluctuations

that are introduced by instabilities in the inversion procedure. The unfolding regularization parameter,

which characterizes the size of the expansion of the solution to the inversion problem, has been optimized

according to the procedure described in Ref. [72]. In parallel the bin size for the p

_T,ptcl

(and p

_T,reco

) dis-

tribution has been optimized such that systematic uncertainties are larger than statistical uncertainties in

each bin, and that the width of each bin corresponds to at least one and a half time the root mean square

(RMS) of the p

_T,reco