All posts by Dan

Cut Set Using Track/Pi0 Likelihood

April 27, 2009 Dan

I implemented a cut set (#6) similar to cut set 1 except using a cut on the likelihood of each cluster. The conditions are

met>20 GeV
ht>100 GeV
at least 1 track/pi0 cluster with likelihood > 0.4

The numbers are:

N(signal)	N(W+jets)	N(ttbarr)	N(t-channel single top)	N(background)
42.6	124427	1567	47	126167

Compared to cut set 1, i.e. pT(cluster)>10 GeV w/ no likelihood cut, we have reduced the signal by 8.6% and the background by 40.6%.

–Dan

Track/pi0 Cluster Likelihood Ratios

April 20, 2009 Dan

Following Carla’s work deriving a likelihood for jet based identification, I have implemented a similar scheme for track/pi0 clusters. I make a series of parameterized likelihood for y(i)=B(i)/S(i), where i is a particular variable, S is the signal Higgs sample, and B is the combined W+jets sample. Then I multiply all the y’s together as uncorrelated likelihood ratios and create the final likelihood as L=1/(1+y). The plots of each y are given below. Continue reading Track/pi0 Cluster Likelihood Ratios →

Correlation Matrices for Track/pi0 Cluster Variables

April 10, 2009 Dan

Below are 3 correlation matrices for track/pi0 cluster variables.Â The matrices are formed from

The signal sample.
The W+jets sample.
The difference between the signal and W+jets correlation matrices.

Notice that the signal sample matrix is more anti-correlated across all variables than the W+jets sample.Â All variables exhibit |rho|>10% for all combinations, i.e. there are no highly uncorrelated pairs of variables.

–Dan

Cut Set & N(live) Table Using Soft Electrons

April 8, 2009 Dan

I have implemented a cut set based on this one except with the additional requirements that the event contains a soft electron and that the charge correlation for clusters containing an electron be negative. The requirements are as follows:

met>20 GeV
ht>100 GeV
at least 1 soft electron
at least 1 track/pi0 cluster with pT>10 GeV and N(live)<7.
if the soft electron is inside a cluster, qCorr<0.

The expected signal and background for these cuts are

N(signal)	N(W+jets)	N(ttbar)	N(t-channel single top)	Significance
9	2406	421	47	0.17

I have also made tables of N(live), N(tracks), N(pi0s) for the following cases:

Event contains a soft electron but it is not inside the cluster. Table is here.
Event contains a soft electron and it is inside the cluster; no cut on the charge correlation. Table is here.
Event contains a soft electron and it is inside the cluster and qCorr<0. Table is here.

The 5 most significant N(live), N(tracks), N(pi0s) bins for the 3 cases are listed below.

Soft Electron Outside Cluster

N(live)	N(tracks)	N(pi0s)	N(signal)	N(bg)	Significance
2	2	0	0.65	230	0.043
3	2	2	0.30	66	0.037
2	3	0	0.37	105	0.036
1	1	3	0.03	0.66	0.035
2	2	1	0.38	121	0.034

Soft Electron Inside Cluster; no cut on qCorr

N(live)	N(tracks)	N(pi0s)	N(signal)	N(bg)	Significance
2	2	0	1.12	66	0.14
2	2	1	1.12	126	0.10
1	1	1	0.77	66	0.09
3	2	2	0.40	47	0.059
3	2	3	0.24	22	0.051

Soft Electron Inside Cluster; qCorr<0

N(live)	N(tracks)	N(pi0s)	N(signal)	N(bg)	Significance
2	2	0	1.02	47	0.15
2	2	1	0.92	71	0.11
3	2	3	0.23	12	0.065
2	2	2	0.34	29	0.063
3	2	2	0.36	33	0.062

Notice that in all of the cases, the most significant bins are with N(live)<4. Also notice that the cut on charge correlation cuts 29% and 44% of the background in the (2,2,0) and (2,2,1) bins, respectively, while barely taking any signal away at all. This is intuitive in the 2 track bin, since one of the tracks must be the soft electron and therefore the other one is a 1-prong tau decay. Since the taus are coming from a neutral a0, the 2 track bin is the most pure for this cut. --Dan

N(live) Table

April 6, 2009 Dan

Here is a table that breaks down the number of signal and background events w.r.t. N(live), N(tracks), and N(pi0s). N(live) is as defined before. N(tracks) and N(pi0s) are the tracks and pi0s in a given cluster with the given N(live). The most signal-like bins are:

N(live)	N(tracks)	N(pi0s)	N(signal)	N(bg)	Significance
2	4	9	0.03	0	0.054
2	2	1	2.63	7077	0.031
2	2	0	6.32	44781	0.03
2	2	2	1.55	2959	0.029
3	2	2	2.28	6436	0.028
3	2	3	1.26	2416	0.026
2	1	2	2.43	10771	0.023

Notice that the most “signal-like” entries are in N(live)=2 and N(live)=3.

I want to do 2 more things with this table:

Change the counting of N(tracks) to count the soft tracks in the cluster cone that do not pass threshold. I did a similar thing earlier when calculating N(live).
Require the presence of a soft electron inside the cluster and recalculate the table. I expect clusters containing a soft electron to be more signal-like. We might be able to get further separation for those SE-tagged clusters by looking at N(live), N(track), N(pi0).

–Dan

Cut Set Using N(live)

March 30, 2009 Dan

I implemented a cut set similar to Cut Set 1, shown here .

The cuts are as follows:

tight central lepton pT>20 GeV
Raw Met>20 GeV
Raw Ht>100 GeV
at least 1 track/pi0 cluster with

N(live)<7
pT>10 GeV

The results of this cut set compared to the cut set 1 (the same except for the N(live) cut) are

Source	Number in Cut Set 1	Number in Cut Set 1 + N(live) Cut
Signal	46.6	44.8
W+jets	208713	169236
ttbar	3018	2502
Single Top (t-channel only)	518	373
Total Background	212250	172112

We have reduced the signal by 3.8% and the background by 19%.Â Clearly, this cut is a keeper.

Effect of track pT thresholds on N(live) distribution

March 24, 2009 Dan

In a previous post, I defined the variable N(live) for track/pi0 clusters.Â I was initially using tracks that were owned by the cluster, i.e. tracks that passed the seed and shoulder pT thresholds.Â I wanted to see what effect these thresholds have on the N(live) distribution.

The plot below compares the N(live) distribution using owned tracks (in red) and all tracks within the dR threshold of the cluster, in this case dR=0.5, w.r.t. the seed track.Â This plot was made on the signal MC sample and required that a generator level tau must be present inside the cluster.

As expected, when counting all tracks the mean of the distribution increases.Â The increase is relatively small for these tau-tagged signal MC clusters; the mean goes from 2.52 to 2.89.Â We didn’t expect this to be a huge effect for clusters coming from taus, as they should have relatively few tracks.Â The effect is more dramatic when comparing to the W+jet background, as seen in the plot below.

We achieve much better separation in the N(live)=1,2,3 bins when using all tracks in the event to calculate it.
–Dan

Cluster Tower Distribution Study

March 23, 2009 Dan

Define a new variable, N(live), which we expect to discriminate between jets and di-taus. The definition is as follows:

Extrapolate tracks in the track/pi0 cluster to the calorimeter
Count the number of towers that have a track from the cluster extrapolating to the OR a pi0 identified in them AND have total energy > 100 MeV. This number is N(live).

Below is the plot of N(live):

The signal sample has more probability to have 1-3 towers live compared to W+jets. ttbar tends to have more “live” towers in the cluster than the other samples.

Next we look in each bin where the Higgs signal is dominant, N(live)=1,2,3

daDefine a new variable, N(live), which we expect to discriminate between jets and di-taus. The definition is as follows:

Extrapolate tracks in the track/pi0 cluster to the calorimeter
Count the number of towers that have a track from the cluster extrapolating to the OR a pi0 identified in them AND have total energy > 100 MeV. This number is N(live).

Below is the plot of N(live):

The signal sample has more probability to have 1-3 towers live compared to W+jets. ttbar tends to have more “live” towers in the cluster than the other samples.

Next we look in each bin where the Higgs signal is dominant, N(live)=1,2,3