Search for the Standard Model Higgs boson in the H → WW

(1)

ATLAS-CONF-2012-060 06June2012

ATLAS NOTE

ATLAS-CONF-2012-060

June 6, 2012

Search for the Standard Model Higgs boson in the H → WW

⁽^∗⁾

→ ℓνℓν decay mode using Multivariate Techniques with 4.7 fb

⁻¹

of ATLAS data

at √

s = 7 TeV

The ATLAS Collaboration

Abstract

A search for the Standard Model Higgs boson in the

H→WW⁽^∗⁾→ℓνℓν

(ℓ

= e, µ)

decay mode using multivariate techniques is presented. The search is performed using proton-proton collision data corresponding to an integrated luminosity of 4.7 fb

⁻¹

at a centre-of-mass energy of 7 TeV collected during 2011 with the ATLAS detector at the Large Hadron Collider. No significant excess of events over the expected background is observed. An upper bound is placed on the Higgs boson production cross section as a function of its mass. A Standard Model Higgs boson with a mass in the range be- tween 130 GeV and 281 GeV is excluded at the 95% confidence level, while the expected exclusion range is 127 GeV

≤m_H≤

255 GeV.

Reproduction of this article or parts of it is allowed as specified in the CC-BY-3.0 license.

(2)

1 Introduction

The Higgs boson is the only elementary particle in the Standard Model (SM) of particle physics that has not yet been observed. It is intimately related to the Higgs mechanism [1–3] which in the SM gives mass to all other massive elementary particles. The search for this particle is a centrepiece of the Large Hadron Collider (LHC) physics programme.

Indirect limits on the Higgs boson mass of m

_H <

158 GeV at 95% confidence level (CL) have been set using global fits to precision electroweak results [4]. Direct searches at LEP and the Tevatron have excluded at 95% CL a SM Higgs boson with a mass below 114.4 GeV [5] and in the regions 100 GeV

<

m

_H <

106 GeV and 147 GeV

<

m

_H<

179 GeV [6], respectively.

The results of searches in various channels using proton-proton (pp) collision data corresponding to an integrated luminosity of approximately 5 fb

⁻¹

have been reported recently by both the ATLAS and the CMS Collaborations [7,8], excluding the mass ranges between 110.0 and 117.5 GeV, between 118.5 and 122.5 GeV, and between 127.5 and 600 GeV.

ATLAS has reported a number of results in the H

→

WW

⁽^∗⁾→ℓνℓν

channel (with

ℓ =

e, µ). The most recent result [9] is based on all of the data recorded in 2011, corresponding to an integrated luminosity of 4.7 fb

⁻¹

, and excludes a SM Higgs boson in the range of 133 GeV

≤

m

_H ≤

261 GeV.

These results are obtained by using a transverse mass variable, m

T

, as a discriminant.

In this Note, also based on the 4.7 fb

⁻¹

dataset, the results of Ref. [9] are extended by using two multivariate analyses: a boosted decision tree (BDT) technique [10] and a matrix element (ME) method. The multivariate analyses build upon the m

T

fit approach, using the same simulated samples, object selection criteria and statistical treatment, and utilize the correlations among discriminant vari- ables. Of the two methods, BDT yields the better expected sensitivity and is used to obtain the results of this Note. The ME method is sensitive to different aspects of the event kinematics and is used to support the BDT results. A similar search has been performed by the CMS Collaboration [11] using the BDT technique. ATLAS plans to use both the BDT and the ME techniques with 2012 data to search for the Higgs boson in the H

→

WW

channel.

The data and simulated samples are summarised in Section 2, while Section 3 details the object and event selection criteria. Section 4 describes the two multivariate techniques. Section 5 discusses the use of data control samples to estimate the dominant backgrounds, and Section 6 summarizes the sources of systematic uncertainty. Section 7 presents the results of the analyses and compares them with the results of the m

T

fit analysis.

2 Data and Simulated Samples

The data used for these analyses were collected in 2011 using the ATLAS detector, a multi-purpose particle physics experiment with a forward-backward symmetric cylindrical geometry and near 4

π

coverage in solid angle [12]. It consists of an inner tracking detector surrounded by a thin super- conducting solenoid, electromagnetic and hadronic calorimeters, and an external muon spectrometer incorporating large superconducting air-core toroid magnets. The combination of these systems pro- vides charged particle measurements together with highly efficient and precise lepton measurements over the pseudorapidity

¹

range

|η|<

2.5. Jets are reconstructed over the full coverage of the calorime-

1ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the detector and thez-axis along the beam line. Thex-axis points from the IP to the centre of the LHC ring, and they-axis points upwards. Cylindrical coordinates (r, φ) are used in the transverse plane,φbeing the azimuthal angle around the beam pipe. The pseudorapidity is defined in terms of the polar angleθasη=−ln tan(θ/2).

(3)

ters,

|η|<

4.9, and this calorimeter coverage provides a precise measurement of the missing transverse momentum.

The data were collected using inclusive single-muon and single-electron triggers. The single- muon trigger required the transverse momentum of the muon with respect to the beam line, p

T

, to exceed 18 GeV; for the single-electron trigger the threshold varied from 20 to 22 GeV. The trigger ob- ject quality requirements were tightened throughout the data-taking period to cope with the increasing instantaneous luminosity.

In these analyses, the signal contributions that are considered include the dominant gluon fusion production process (gg

→

H, denoted as ggF), the vector-boson fusion production process (qq

^′ →

qq

^′

H, denoted as VBF) and the Higgs-strahlung process (qq

^′ →

W H, ZH, denoted as W H/ZH).

For the decay of the Higgs boson, only the H

→

WW

mode is considered, with final states featuring two charged leptons (ℓ

=

e, µ, including small contributions from leptonic

τ

decays). The branching fraction for this decay, as a function of m

_H

, is taken from the HDECAY [13] program.

The signal cross section is computed to next-to-next-to-leading order (NNLO) [14–19] in QCD for the ggF process. Next-to-leading order (NLO) electroweak (EW) corrections are also applied [20,21], as well as QCD soft-gluon resummations up to next-to-next-to-leading log (NNLL) [22]. These calcu- lations are detailed in Refs. [23–25], and assume factorisation between QCD and EW corrections. Full NLO QCD and EW corrections [26–28] and approximate NNLO QCD corrections [29] are used to calculate the cross sections for VBF signal production. The cross sections of the associated W H/ZH production processes are calculated up to NNLO QCD corrections [30, 31], with NLO EW radiative corrections [32] applied.

The ggF and VBF processes are modelled using the POWHEG [33,34] Monte Carlo (MC) genera- tor, interfaced to PYTHIA [35] for showering and hadronisation; the ggF Higgs boson p

T

spectrum is reweighted to agree with the prediction from HqT [36]. The associated W H/ZH production processes are modelled using PYTHIA.

ALPGEN [37], interfaced to HERWIG [38] with the MLM matching scheme [39] is used to model the production of W and Z/γ

^∗

bosons decaying to charged leptons in association with jets (the latter is also referred to below as the Drell-Yan process). MC@NLO [40] is used to model t¯ t and WW production, using HERWIG for the parton hadronisation; an additional contribution to the continuum WW background from gluon-initiated diagrams is modelled using gg2WW [41] interfaced to HER- WIG. Wherever parton showering is performed by HERWIG, JIMMY [42] is used for the simulation of the underlying event. SHERPA [43] is used for the generation of ZZ final states while MC@NLO is chosen for WZ production. Wγ production is modelled with ALPGEN while MadGraph [44, 45] is employed for Wγ

^∗

[46]. AcerMC [47], using PYTHIA for showering and hadronisation, is used for the generation of single top events in all three production channels (s-channel, t-channel, and Wt).

The CT10 parton distribution function (PDF) set [48] is used for the MC@NLO samples, CTEQ6L1

[49] for the ALPGEN, SHERPA, and MadGraph samples, and MRSTMCal [50] for the PYTHIA and

AcerMC samples. Acceptances and efficiencies are based on a full simulation [51] of the ATLAS

detector using GEANT4 [52]. This includes a realistic treatment of the event pile-up conditions (the

data are affected by the detector response to multiple pp collisions occurring in the same or in differ-

ent bunch crossings) present in the 2011 data; from the first 2.3 fb

⁻¹

to the last 2.4 fb

⁻¹

of data taken,

the average number of pp interactions per bunch crossing increased from 6.3 to 11.6.

(4)

3 Event Selection

The event selection criteria are the similar to those used in Ref. [9], with a few differences that are specified below.

Events are required to have a primary vertex that is consistent with the beam spot position, with at least three associated tracks with p

T >

400 MeV. Overall quality criteria are applied in order to suppress non-collision backgrounds such as cosmic-ray muons, beam-related backgrounds, or noise in the calorimeters.

H

→

WW

(ℓ

=

e, µ) candidates are pre-selected by requiring exactly two oppositely charged leptons with p

T

thresholds of 25 GeV and 15 GeV for the leading and sub-leading lepton, respectively. For muons, the range

|η| <

2.40 is used; for electrons, the range

|η| <

2.47 is used, with the region 1.37

<|η|<

1.52 (corresponding to the boundary between calorimeter cryostats) excluded.

The selected electron candidates are reconstructed using a combination of tracking and calorimetric information [53], while the muon candidates are identified by matching tracks reconstructed in the inner detector to those reconstructed in the muon spectrometer [54]. At least one of the selected lep- tons is required to match a triggering object. Leptons from heavy-flavour decays and jets satisfying the lepton identification criteria are suppressed by requiring the leptons to be isolated: the scalar sum of the p

T

of charged particles and of the calorimeter energy deposits within

∆R= p

∆φ²+ ∆η² =

0.3 of the lepton direction are each required to be less than approximately 0.15 times the lepton p

T

, with slight differences between track- and calorimeter-based criteria and between electrons and muons.

The Drell-Yan production of Z/γ

^∗

or

Υ

resonances leads to two same-flavour, opposite-sign high- p

T

leptons. In the ee and

µµ

channels this background is suppressed by requiring the dilepton invariant mass to be greater than 12 GeV and to di

ff

er from the Z-boson mass m

_Z

by at least 15 GeV. In the eµ channel, the dilepton invariant mass is required to be greater than 10 GeV.

Multijet and Drell-Yan backgrounds are suppressed by requiring large E

^miss_T

. The E

^miss_T

is the magnitude of

p^miss_T

, the opposite of the vector sum of the reconstructed objects’ transverse momenta, including identified muons, electrons, photons, jets, and clusters in the calorimeter not associated with these objects. The quantity E

_T,rel^miss

used in this analysis is defined as: E

^miss_T,rel=

E

^miss_T

sin

∆φ_min

, with

∆φ_min ≡

min(∆φ,

^π₂

). Here,

∆φ

is the absolute azimuthal angular difference between the E

^miss_T

vector and the nearest lepton or jet with p

T>

25 GeV. For the ee and

µµ

channels, the multijet and Drell-Yan events are suppressed by requiring E

^miss_T,rel >

45 GeV. For the eµ channel, Drell-Yan events originate predominantly from the

ττ

production channel, where the small leptonic

τ

decay branching fractions lead to a much smaller background. In this channel. the requirement is relaxed to E

^miss_T,rel >

25 GeV.

After the isolation and E

^miss_T,rel

cuts, the multijet background is found to be negligible. As can be observed in Figure 1, the distribution of the E

_T,rel^miss

variable is described well by the sum of the predicted backgrounds. For this Figure, all the backgrounds are estimated using the MC simulation except the W+jets background which is estimated using a data-driven approach as described in Section 5.4.

Figure 2 shows the multiplicity distribution of jets reconstructed using the anti-k

_t

algorithm [55], with radius parameter R

=

0.4, for all events satisfying the pre-selection criteria described above.

Only jets with p

T >

25 GeV and

|η| <

4.5 are selected. This threshold is increased to 30 GeV in

the region 2.75

< |η| <

3.25, which corresponds to a boundary between two calorimeter systems

and is more sensitive to reconstruction issues arising from event pile-up. The background rate and

composition depend significantly on jet multiplicity, as does the signal topology: in events with no

jets, the signal originates almost entirely from the ggF process and the background is dominated by

WW and Drell-Yan events. In contrast, when produced in association with two or more jets, the signal

(5)

Events / 4 GeV

10-1

1 10 102

103

104

105

106

107

108

109

Data SM (sys ⊕ stat)

WW WZ/ZZ/Wγ

t

t Single Top

Z+jets W+jets H [125 GeV]

ATLASPreliminary

L dt = 4.7 fb-1

∫

= 7 TeV, s

ν νe

→e WW(*)

→ H

[GeV]

T,rel

Emiss

0 20 40 60 80 100 120 140 160 180 200

Data / MC

0.7 0.8 0.91.11 1.2 1.3

Events / 4 GeV

10-1

1 10 102

103

104

105

106

107

108

109 Data SM (sys ⊕ stat)

WW WZ/ZZ/Wγ

t

t Single Top

L dt = 4.7 fb-1

∫

= 7 TeV, s

ν µ ν µ (*)→

→WW H

[GeV]

T,rel

Emiss

0 20 40 60 80 100 120 140 160 180 200

Data / MC

0.7 0.8 0.91.11 1.2 1.3

Events / 4 GeV

10-1

1 10 102

103

104

105

106

WW WZ/ZZ/Wγ

t

t Single Top

L dt = 4.7 fb-1

∫

= 7 TeV, s

ν µ ν

→e WW(*)

→ H

[GeV]

T,rel

Emiss

0 20 40 60 80 100 120 140 160 180 200

Data / MC

0.7 0.8 0.91.11 1.2 1.3

Figure 1: The E

^miss_T,rel

distributions for the ee (top left),

µµ

(top right), and eµ (bottom) channels with

the lepton p

T

and minimum m

_ℓℓ

requirements applied. The expected signal for a SM Higgs boson

is shown for m

_H =

125 GeV. The final bin includes the overflow. The lower pane in each plot

shows the bin-by-bin data/MC ratio; the shaded error band includes both MC statistical and systematic

uncertainties.

(6)

contains a much larger contribution from the VBF process

²

and the background is dominated by t t ¯ production. To enhance the sensitivity, further selection criteria that depend on the jet multiplicity are therefore applied to the pre-selected sample. The data are subdivided into 0-jet, 1-jet and 2-jet channels according to the jet counting defined above, with the 2-jet channel also including events with higher jet multiplicities. The different selections for these channels are described in more detail below.

Njets

0 2 4 6 8 10

Events

1000 2000 3000 4000 5000

WW WZ/ZZ/Wγ

t

t Single Top

Z+jets W+jets H [125 GeV] x 10

ATLAS Preliminary

L dt = 4.7 fb-1

∫

= 7 TeV, s

ν νl

→l WW(*)

→ H

Figure 2: Multiplicity of jets within the acceptance described in the text, after the cut on E

^miss_T,rel

. The shaded region indicates the total uncertainty on the background prediction. The expected signal for a SM Higgs boson with m

H =

125 GeV is superimposed (multiplied by a factor of 10 for better visibility).

In the 0-jet channel, the magnitude p

^ℓℓ_T

of the transverse momentum of the dilepton system,

p^ℓℓ_T = p^ℓ1_T +p^ℓ2_T

, is required to be greater than 30 GeV for the eµ channel and greater than 45 GeV for the ee and

µµ

channels. This improves the rejection of the Drell-Yan background.

In the 1-jet channel, backgrounds from top quark decays are suppressed by rejecting any event containing a b-tagged jet, as determined using a b-tagging algorithm which uses a combination of impact parameter significance and secondary vertexing information and exploits the topology of weak decays of b- and c-hadrons [56]. The algorithm is tuned to achieve an 80% b-jet identification ef- ficiency for t¯ t events while yielding a light-jet tagging rate of approximately 6% [57, 58]. The total transverse momentum, defined as the magnitude of the vector sum

p^tot_T =p^ℓ1_T +p^ℓ2_T +p^j_T+p^miss_T

, is re- quired to be smaller than 30 GeV in order to suppress Drell-Yan, t¯ t and single top background events with jets with p

T

below threshold. To reduce Z

→ ττ

background, the

ττ

invariant mass, m

_ττ

, is computed under the assumption that the reconstructed leptons are

τ

decay products, that the neutrinos produced in the

τ

decays are collinear with the leptons [59], and that they are the only source of E

^miss_T

. Events with

|

m

_ττ −

m

_Z| <

25 GeV are rejected if the energy fractions carried by the putative visible decay products are positive (the collinear approximation does not always yield physical solutions).

The 2-jet selection follows the 1-jet selection described above (with the p

^tot_T

definition modified to include all selected jets). In addition, the following jet-related cuts are applied: the two highest-p

T

jets

2For example,∼87% of selected signal events in the 2-jet channel originate via the VBF process.

(7)

in the event, the “tag” jets, are required to lie in opposite pseudorapidity hemispheres (η

j1×η_j2 <

0), with no additional jet within

|η| <

3.2; the two tag jets must be separated in pseudorapidity by a distance

|∆η_jj|

of at least 3.8 units; finally, the invariant mass of the two tag jets, m

jj

, must be at least 500 GeV.

Due to spin correlations in the WW

⁽^∗⁾

system arising from the spin-0 nature of the Higgs boson, the charged leptons tend to emerge from the interaction point in the same direction. This feature results in the dilepton invariant mass, m

ℓℓ

, being small. Consequently, in all jet channels, m

ℓℓ

is required to be less than 80 GeV (the

|

m

ℓℓ−

m

Z|>

15 GeV cut is always applied for the same-flavour channels). In the 2-jet channel, an additional cut is applied on the azimuthal angular difference between the leptons,

∆φ_ℓℓ

, which is required to be less than 1.8 radians

³

. For m

H ≥

200 GeV, the leptons tend to have higher p

T

and larger angular separation. Therefore, the m

ℓℓ

and

∆φ_ℓℓ

criteria are omitted in this Higgs mass region. In the following, the selections for m

_H <

200 GeV and m

_H >=

200 GeV are referred to as low m

_H

and high m

_H

selections, respectively.

The predicted numbers of events from signal (m

H =

125 GeV) and background sources at each stage of the low m

_H

selection are presented in Table 1.

4 Multivariate Techniques

As mentioned in Section 1, two multivariate analyses (MVAs) are used in this search, namely, a BDT and a ME technique. The MVAs take advantage of correlations among the discriminant variables, thereby enhancing the search sensitivity with respect to a cut-based analysis. In this case, the MVAs are employed in the 0-jet and 1-jet channels only. In the 2-jet channel, a cut-based analysis [9] is used.

The two multivariate methods are described below.

4.1 The Boosted Decision Tree (BDT) technique

A decision tree is a collection of cuts used to classify events as signal or background. The input events are successively split using a set of discriminant variables. At each split, the algorithm finds the variable and the optimal cut on this variable that give the best separation between signal and background. The tree is complete after a specified number of splits, at which point it contains event subsamples or leaves, with each leaf having predominantly signal or background events. A signal event is correctly classified if it is in a signal-like leaf, and similarly for background events. A second tree is then grown to correctly classify the signal or background events that were misclassified by the first tree. Such events are given an increased weight relative to correctly classified events, so that the second tree focuses on previously misclassifed events. This process is repeated until a specified number of trees have been grown. A weighted average is taken from all trees to form a BDT output discriminant, which ranges from -1 to

+1.

In this analysis, four kinematic variables are used to distinguish between expected Higgs signal and backgrounds in the 0-jet and 1-jet channels: m

_ℓℓ

, p

^ℓℓ_T

,

∆φ_ℓℓ

and a transverse mass variable m

T

[60], defined as:

m

T = q

(E

_T^ℓℓ+

E

^miss_T

)

²− |p^ℓℓ_T +p^miss_T |²,

3ThemT fit analysis reported in Ref. [9] requiresmℓℓto be less than 50 GeV and uses the∆φℓℓcriterion in all three jet categories. The multivariate analyses use the loosermℓℓselection, and avoids the∆φℓℓcriterion in the 0-jet and 1-jet categories, to minimize event elimination at this stage.

(8)

where E

^ℓℓ_T = q

|p^ℓℓ_T|²+

m

²_ℓℓ

. These variables have all been used in the transverse mass fit analysis [9]

and are well understood.

Two independent BDTs are trained in each of the 0-jet and 1-jet channels: one with the ee and

µµ

final states together and one with the eµ final state. BDTs are trained separately for each Higgs mass hypothesis. The SM background processes trained against are WW, WZ, ZZ, t¯ t, single top, and Z/γ

^∗+jets⁴

. Both the signal and the backgrounds input to training are normalized to SM expectations.

One thousand trees are grown for each training, each tree having two splits.

Figures 3-6 show distributions of m

T

,

∆φ_ℓℓ

m

_ℓℓ

and p

^ℓℓ_T

in the 0-jet and 1-jet channels after all event selection criteria except the m

ℓℓ

cut. The backgrounds are estimated with MC simulation except for W+jets which is estimated using a data-driven approach, and the WW, top, and Drell-Yan back- grounds which are normalised to data in control regions. BDT output distributions in the 0-jet and 1-jet channels for events passing all low-m

H

selection criteria are shown in Figure 7.

4.2 The Matrix Element (ME) technique

In the Matrix Element technique, each event is tested against signal and background hypotheses based on fully differential cross section calculations for each process using MadGraph. The differential cross section is integrated over the allowable phase space of the undetected neutrinos. The resulting probability density function is evaluated for candidate events to obtain parton level event probability densities P

_s

and P

_b

for the signal and background hypotheses, respectively. The background processes used are continuum WW production in both the 0-jet and 1-jet channels and, in addition, t t ¯ in the 1-jet channel. In the 0-jet channel, the momenta of the two leptons are used to compute the probability densities, while the leading jet momentum is additionally used in the 1-jet channel. The probability densities are combined into an event probability discriminant EPD

=

P

s/(Ps +

P

b

), which ranges from 0 to

+1.

The salient feature of the ME method is that the discriminant is calculated from first principles.

In practice, the calculation of the probability densities is rendered tractable with some simplifying assumptions. P

s

and P

b

can be expressed as a product of the squared matrix element

|

M

|²

and the phase-space factor dΦ convolved with the parton distribution functions f (q

1

) and f (q

2

), as shown in Eq. 1. The evolution of the parton-level configuration

y

into a reconstructed event x in the detector is modeled by a transfer function T( x, y).

P(x)

=

1

σ

Z

dΦ (y)dq

1

dq

2

f (q

1

) f (q

2

)

|

M(y)

|²

T (x, y)dy (1) Transfer functions improve the sensitivity of the method when the detector resolution for energy or angle measurements is not negligible. Since lepton momenta and jet angles are well measured by ATLAS, their transfer functions are taken to be

δ-functions. The transfer function for jet energy

with respect to partonic energy is described by a double Gaussian function that parameterizes the jet response in fully simulated MC events.

EPD distributions in the 0-jet and 1-jet channels for events passing all event selection criteria are shown in Figure 8.

4Low-mass Drell-Yan events, having a lepton pair with 10 GeV<mℓℓ<40 GeV, are not used in BDT training.

(9)

[GeV]

mT

50 100 150 200 250 300 350 400

Events/10 GeV

0 50 100 150 200 250 300

Data SM (sys ⊕ stat) WW WZ/ZZ/Wγ

t

t Single Top

Z+jets W+jets H [125 GeV]

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 0 jet ν νl

→ l WW(*)

→ H

[GeV]

mT

50 100 150 200 250 300 350 400

Events/10 GeV

0 20 40 60 80

100 ^Data ^stat)

SM (sys ⊕ WW WZ/ZZ/Wγ

t

t Single Top

Z+jets W+jets H [125 GeV]

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 1 jet ν νl

→ l WW(*)

→ H

Figure 3: m

T

distribution in the 0-jet (top) and 1-jet (bottom) channels for events satisfying all selec-

tion criteria except the cut on m

_ℓℓ

. The lepton flavours are combined. The expected signal for a SM

Higgs boson with m

H =

125 GeV is superimposed. The final bin includes the overflow. The hashed

area indicates the total uncertainty on the background prediction.

(10)

(ll)[rad]

φ

0 0.5 1 1.5 2 2.5 ∆3 3.5

Events/0.25

0 50 100 150 200 250 300 350

t

t Single Top

Z+jets W+jets H [125 GeV]

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 0 jet ν νl

→ l WW(*)

→ H

(ll)[rad]

φ

0 0.5 1 1.5 2 2.5 ∆3 3.5

Events/0.25

0 20 40 60 80 100 120 140

t

t Single Top

Z+jets W+jets H [125 GeV]

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 1 jet ν νl

→ l WW(*)

→ H

Figure 4:

∆φ_ℓℓ

distribution in the 0-jet (top) and 1-jet (bottom) channels, for events satisfying all

selection criteria except the cut on m

_ℓℓ

. The lepton flavours are combined. The expected signal for a

SM Higgs boson with m

H =

125 GeV is superimposed. The hashed area indicates the total uncertainty

on the background prediction.

(11)

[GeV]

mll

0 50 100 150 200 250 300

Events/20 GeV

0 100 200 300 400

500 ^Data ^{SM (sys}^⊕^stat)

WW WZ/ZZ/Wγ t

t Single Top

Z+jets W+jets H [125 GeV]

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 0 jet ν νl

→ l WW(*)

→ H

[GeV]

mll

0 50 100 150 200 250 300

Events/20 GeV

0 50 100 150 200

250 ^Data ^{SM (sys}^⊕^stat)

WW WZ/ZZ/Wγ t

t Single Top

Z+jets W+jets H [125 GeV]

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 1 jet ν νl

→ l WW(*)

→ H

Figure 5: m

ℓℓ

distribution in the 0-jet (top) and 1-jet (bottom) channels, for events satisfying all

selection criteria except the cut on m

_ℓℓ

. The lepton flavours are combined. The expected signal for

a SM Higgs boson with m

H =

125 GeV is superimposed. The final bin includes the overflow. The

hashed area indicates the total uncertainty on the background prediction.

(12)

(ll) [GeV]

pT

0 20 40 60 80 100 120 140 160 180 200

Events/10 GeV

0 100 200 300 400 500

600 ^Data WW ^{SM (sys} WZ/ZZ/W^⊕γ^stat)

t

t Single Top

Z+jets W+jets H [125 GeV]

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 0 jet ν νl

→ l WW(*)

→ H

(ll) [GeV]

pT

0 20 40 60 80 100 120 140 160 180 200

Events/10 GeV

0 20 40 60 80 100 120 140 160 180

t

t Single Top

Z+jets W+jets H [125 GeV]

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 1 jet ν νl

→ l WW(*)

→ H

Figure 6: p

^ℓℓ_T

distribution in the 0-jet (top) and 1-jet (bottom) channels, for events satisfying all selec-

tion criteria except the cut on m

_ℓℓ

. The lepton flavours are combined. The expected signal for a SM

Higgs boson with m

H =

125 GeV is superimposed. The final bin includes the overflow. The hashed

area indicates the total uncertainty on the background prediction.

(13)

BDT Output -1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

Events/0.2

0 100 200 300 400 500

t

t Single Top

Z+jets W+jets H [125]x10

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 0 jet ν νl

→ l WW(*)

→ H

BDT Output -1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

Events/0.4

0 50 100 150 200 250

t

t Single Top

Z+jets W+jets H [125]x10

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 1 jet ν νl

→ l WW(*)

→ H

Figure 7: BDT output weight distributions for Higgs signal and background events satisfying all selection criteria in the 0-jet (top) and 1-jet (bottom) channels corresponding to m

H=

125 GeV. The lepton flavours are combined. The signal has been multiplied by a factor of 10 for better visibility.

The number of bins in each distribution corresponds to that used in the statistical treatment.

(14)

EPD 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Events / 0.05

10 20 30 40

50 Data SM (sys ⊕ stat)

WW Wγ

WZ/ZZ tt

single top Z+jets W+jets H[125 GeV] x 10

dt = 4.7 fb-1

L

∫

= 7 TeV, s

+ 1 jet ν νl

→ l WW(*)

→ H

Figure 8: EPD distributions for Higgs signal and background events satisfying all selection criteria in

the 0-jet (top) and 1-jet (bottom) channels corresponding to m

_H =

125 GeV. The lepton flavours are

combined. The signal has been multiplied by a factor of 10 for better visibility.

(15)

5 Background Normalisation and Control Samples

The dominant backgrounds are normalised using control samples obtained from the data with selec- tions similar to those used in the signal region but with some criteria reversed or modified to create signal-depleted, background-enriched regions. This helps to reduce the sensitivity of the background predictions to the systematic uncertainties that are detailed in Section 6. In the following, such control samples are defined for the WW, Z/γ

^∗+jets, top, and

W+jets backgrounds.

5.1 WW control sample

The WW background MC prediction is normalised using a control region defined with the same se- lection as for the signal region except that the upper bound on m

ℓℓ

is replaced with a lower bound m

ℓℓ >

80 GeV (m

ℓℓ >

m

Z+

15 GeV) for the eµ (ee and

µµ) final states. This control region is used

only for the low m

_H

selection in the 0-jet and 1-jet analyses. In the high (m

_H ≥

200) m

_H

selection and in the 2-jet analysis, a high-statistics signal-depleted region cannot be isolated in this fashion; in these cases, the MC prediction is used.

Figure 9 shows the ME EPD distributions in this control region in the 0-jet and 1-jet channels

⁵

. The number of events in the WW control region in data agrees well with the predictions, as can be seen in Table 1. The total uncertainty on the predicted WW background in the signal region is 10%

for the 0-jet and 28% for the 1-jet analyses.

To further check the accuracy of WW event modeling in the MC, the BDT method is trained in the 0-jet channel with WW events as signal and t¯ t, single top, WZ, ZZ and Z/γ

^∗+jets events as

backgrounds. The m

_ℓℓ

and p

^ℓℓ_T

requirements are not applied in this check. As shown in Figure 10, in all three leptonic channels the method classifies WW events as being signal-like, well separated from the dominant Z/γ

^∗+jets background. Good data-MC agrement is seen in the BDT output distributions,

suggesting that WW events are well modelled by the simulation.

5.2 Z/γ

^∗

+ jets control sample

Z/γ

^∗+

jets contamination in the signal region arises from events containing two real leptons and mis- measured E

^miss_T

. Since there is a discrepancy between data and MC in the E

_T^miss

tail, the predicted Z/γ

^∗+jets background in the

ee and

µµ

channels is corrected with a E

^miss_T

mismodelling factor. This factor is determined using control regions in both data and MC; three regions are defined in the two- dimensional plane of E

^miss_T,rel

and m

ℓℓ

:

•

A: E

^miss_T,rel>

45 GeV, m

ℓℓ<

m

Z−

15 GeV (signal region)

•

B: E

^miss_T,rel>

45 GeV,

|

m

ℓℓ−

m

Z|<

15 GeV

•

C: E

^miss_T,rel<

45 GeV,

|

m

_ℓℓ−

m

_Z|<

15 GeV The mismodelling factor F

met

is then defined as:

F

met=

B

data

B

data+

C

data

B

MC+

C

MC

B

MC

(2)

5The BDT output distributions in theWWcontrol region are not instructive, since the events are overwhelmingly in the most background-like bin.

(16)

EPD 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Events / 0.1

50 100 150 200 250 300

Data SM (sys ⊕ stat)

WW Wγ

WZ/ZZ tt

dt = 4.7 fb-1

L

∫

= 7 TeV, s

+ 0 jet ν νl

→ l WW(*)

→ H

EPD 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Events / 0.083

10 20 30 40 50 60 70

WW Wγ

WZ/ZZ tt

dt = 4.7 fb-1

L

∫

= 7 TeV, s

+ 1 jet ν νl

→ l WW(*)

→ H

Figure 9: Distributions of the ME EPD in the WW control region in the 0-jet (top) and 1-jet (bottom)

analyses. The lepton flavours are combined. The signal shown is for m

H =

125 GeV, multiplied by

a factor of 10 for better visibility. The hashed area indicates the total uncertainty on the background

prediction.

(17)

BDT Output -1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

Events/0.2

0 50 100 150 200 250 300

350 ^Data ^{SM (stat)}

WW WZ/ZZ/Wγ

t

t Single Top

Z+jets W+jets H [125 GeV]

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 0 jet ν νe

→ e WW(*)

→ H

BDT Output -1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

Events/0.2

0 100 200 300 400 500 600 700

Data SM (stat)

WW WZ/ZZ/Wγ

t

t Single Top

Z+jets W+jets H [125 GeV]

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 0 jet ν µ ν µ

(*)→

→ WW H

BDT Output -1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

Events/0.2

0 200 400 600 800 1000 1200 1400 1600 1800 2000

Data SM (stat)

WW WZ/ZZ/Wγ

t

t Single Top

Z+jets W+jets H [125 GeV]

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 0 jet ν µ ν

→ e WW(*)

→ H

Figure 10: BDT output distributions in the 0-jet ee (top),

µµ

(middle) and eµ (bottom) channels, the

method having been trained with WW events as signal, and t¯ t, single top, WZ, ZZ and Z/γ

^∗+

jets

events as backgrounds. The Higgs signal corresponding to m

H =

125 GeV is shown for comparison

only and was not used in training. The shaded error band shows only the statistical uncertainty on the

predictions.

(18)

after subtracting contamination from other backgrounds in regions B and C in data. The corrected estimate of the Z/γ

^∗+jets background in the signal region is then

A

^corr_MC =

A

MC×

F

met

. The values of F

met

thus obtained are:

•

ee

+

0-jet: 0.99

±

0.15

•

ee

+

1-jet: 0.88

±

0.17

• µµ+

0-jet: 1.13

±

0.08

• µµ+

1-jet: 0.89

±

0.10 where only statistical uncertainties are shown.

The Z/γ

^∗+jets background in the

eµ channel is small, and is estimated using the MC prediction without any scaling applied.

The total uncertainty on the Z/γ

^∗+jets background amounts to 19% and 35% in the 0-jet and 1-jet

categories respectively, with all three lepton channels combined. The uncertainties include the statis- tical uncertainty on F

met

as well as theoretical and experimental systematic uncertainties described in Section 6.

5.3 Top control sample

In the 0-jet signal region, the estimated number of top quark background events is extrapolated from the number of events satisfying the pre-selection criteria described in Section 3. This sample is domi- nated by top quark backgrounds, as shown in Figure 2. The scale factor used to propagate the t¯ t yield from this sample to the signal region is estimated as the square of the efficiency for one top quark decay to survive the jet veto, which is determined using another control sample defined by the pres- ence of an additional b-jet. To account for the presence of single top events, a correction is computed using simulation [61]. The overall e

ffi

ciency for the requirements on p

^ℓℓ_T

and m

_ℓℓ

is also taken from simulation. The total uncertainty on the top quark background estimate in events with no jets is 26%.

In the 1-jet and 2-jet analyses, the top quark background MC prediction is normalised to the data using a control sample defined by reversing the b-jet veto. In the 1-jet analysis, the requirement on m

ℓℓ

is removed, while in the 2-jet analysis the requirements on m

ℓℓ

and

∆φ_ℓℓ

are both removed.

The resulting samples are dominated by top quark backgrounds (both t t ¯ and single top), with little contribution from other sources. The number of events in the 1-jet and 2-jet control regions are given in Table 1. The total uncertainties on the estimated top quark background in the 1-jet and 2-jet analyses amount to 27% and 44%, respectively.

Figure 11 shows the BDT output distribution and Figure 12 the ME EPD distribution in the 1-jet top control region. Good data-MC agreement is seen in both distributions.

5.4 W + jets control sample

The W+jets background contribution is estimated using a data control sample of events where one

of the two leptons satisfies the identification and isolation criteria described in Section 3, and the

other lepton (denoted “anti-identified”) fails these criteria while satisfying a loosened selection. The

dominant contribution to this background comes from fake electrons. The W+jets contamination

in the signal region is then obtained by scaling the number of events in the data control sample by a

normalisation “misidentification factor”. This factor is estimated as a function of anti-identified lepton

p

T

using an inclusive dijet data sample, after subtracting the residual contributions from real leptons

(19)

BDT Output -1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

Events/0.4

1 10 102

103

104

t

t Single Top

Z+jets W+jets H [125]x10

Preliminary ATLAS

L dt = 4.7 fb-1

∫

= 7 TeV, s

+ 1 jet ν νl

→ l WW(*)

→ H

Figure 11: BDT output distribution in the 1-jet top control region, defined by requiring a b-tagged jet.

The lepton flavours are combined. The signal shown is for m

_H =

125 GeV, multiplied by a factor of 10 for better visibility. The hashed area indicates the total uncertainty on the background prediction.

arising from leptonic W and Z decays. The W candidates are identified by requiring the transverse mass m

^W_T =

q

2p

^ℓ_T

E

_T^miss·

(1

−

cos

∆φ) to satisfy

m

^W_T >

30 GeV. In this expression, p

^ℓ_T

is the lepton transverse momentum and

∆φ

is the difference in azimuth between the lepton and missing transverse momentum directions. The Z candidates are identified requiring two opposite-sign leptons of the same flavour and

|

m

ℓℓ−

m

Z|<

15 GeV. The small remaining lepton contamination, which includes Wγ and Wγ

^∗

events, is subtracted using MC simulation.

The uncertainty on the misidentification factor is the main uncertainty on the W+jets background contribution. This uncertainty is dominated by differences in jet properties between dijet and W+jets samples evaluated with simulated events, with smaller contributions originating from trigger effects and the subtraction of the contribution from real leptons from leptonic W and Z decays. The total uncertainty on this background is estimated to be approximately 50%.

6 Systematic Uncertainties

Theoretical uncertainties on the signal production cross sections are determined following Refs. [62, 63]. QCD renormalisation and factorisation scales are varied up and down independently by a factor of two. Independent uncertainties on the ggF signal production are assumed for the inclusive cross section, the cross section for production with at least one jet and with at least two jets. The resulting uncertainties on the cross sections in exclusive jet multiplicity analyses are taken into account, as well as anti-correlations caused by transitions among jet categories. The relative 0-jet (1-jet) cross section uncertainties depend on m

_H

, rising from

±

21% (

±

31%) at m

_H =

125 GeV and m

_H =

240 GeV to

±

42% (

±

31%) at m

H=

600 GeV [63–65]. The scale uncertainty on both the VBF signal cross section

and the acceptance associated with the jet veto requirement, to which the 2-jet analysis is particularly

sensitive, is 4%. In the 2-jet analysis, around 25% of the signal events are produced via ggF, where

the uncertainty on the ggF signal cross section is around 25%. For m

H

larger than 300 GeV, an

(20)

EPD 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Events / 0.083

20 40 60 80 100 120 140 160

WW Wγ

WZ/ZZ tt

dt = 4.7 fb-1

L

∫

= 7 TeV, s

+ 1 jet ν νl

→ l WW(*)

→ H

Figure 12: EPD distribution in the 1-jet top control region, defined by requiring a b-tagged jet. The lepton flavours are combined. The signal shown is for m

_H =

125 GeV, multiplied by a factor of 10 for better visibility. The hashed area indicates the total uncertainty on the background prediction.

additional uncertainty due to the Higgs lineshape description in the POWHEG MC generator is added in quadrature for both the ggF and the VBF channel and amounts to 150%

×

(m

H/1 TeV)³

[63,66–68].

PDF uncertainties are estimated, following Refs. [48, 69–71], using the envelopes of error sets as well as di

ff

erent PDF sets applied separately to quark-quark, quark-gluon, and gluon-gluon initiated processes. The relative PDF uncertainty on the dominant ggF signal process is about 8%; the associ- ated VBF uncertainty varies from

±

2% at m

_H =

125 GeV to

±

4 % at m

_H =

600 GeV. Uncertainties on the modelling of signal and background processes are estimated by using alternative generators, such as MC@NLO for the ggF process, ALPGEN for WW production, POWHEG for t t ¯ production, and PYTHIA for the Z/γ

^∗+

2 jets process. The uncertainties associated with the underlying event and parton showering are taken into account in the acceptance uncertainty, but they are negligible compared to the scale uncertainties on the cross sections in excluve jet bins.

The main experimental uncertainties are related to the jet energy scale, which is determined from a combination of test beam, simulation, and in situ measurements. The uncertainty on the jet energy scale varies from 14% to 2% as a function of jet p

T

and

η

for jets with p

T >

25 GeV and

|η|<

4.5 [72];

for central jets it is at most 4%. An additional contribution from event pile-up is estimated to vary

between 5% and 0.5%, depending on jet p

_T

and

η

. The uncertainty on the jet energy resolution is

estimated from in situ measurements. The resolution varies from 25% to 5%, and its uncertainty from

5% to 2%, as a function of jet p

T

and

η. The reconstruction, identification and trigger efficiencies for

electrons and muons, as well as their momentum scales and resolutions, are estimated using Z

→ℓℓ

,

J/ψ

→ ℓℓ, and

W

→ ℓν

decays (ℓ

=

e,

µ). With the exception of the uncertainty on the electron

efficiency, which varies between 2% and 5% as a function of p

T

and

η, the resulting uncertainties are

all smaller than 1%. Jet energy scale and lepton momentum scale uncertainties are propagated to the

E

_T^miss

computation. Additional contributions arise from jets with p

T <

20 GeV as well as from soft

energy deposits in the calorimeter not associated with reconstructed physics objects [73]; their effect

on the total background event yield ranges from 1% to 8%. Finally, E

^miss_T