cmsb2g / B2GAnaFW

Analysis framework for Beyond Two Generations (B2G) Physics Analysis Group (PAG) of the Compact Muon Solenoid (CMS) Experiment

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

New Tag for 03Feb2017 samples

alefisico opened this issue · comments

Hi all
not only because of this: https://hypernews.cern.ch/HyperNews/CMS/get/b2g-selections/332.html
but I think is time for the next tag. Is there something extra that we want to include in this tag?
If I remember correctly, we didn't launch a big campaign for the last tag v8.0.x_v2.5.

Dear Alejandro,
Can we add my PR (#70)?
#70

This fixes a problem with the stochastic part of jet smearing.

Hi all
@jkarancs is that the only fix that you want to include? Unfortunately I dont have writing permissions, but maybe @dmajumder can help with that? (Actually maybe it will be useful for someone else to have the permissions? in case Devdatta is busy)
@skyriacoCMS and @jkarancs, let me check if I can solve this compilation problem incorporating both changes, and I'll go back to you.
Also, is there a strict timeline for this new tag? i.e., is it someone that needs this right now? (just to know)

@alefisico
Yes, that's the only change. It's just one line, so someone with write permission should do it. I made a PR because I don't have such permission, but it seems noone accepted it yet.

I just commented on the PR, sorry for some reason I missed this notification in my mailbox.

As for the tag for 03Feb, I am not sure if it is urgent. Do one of you want to use it for your analyses right away? I am slightly concerned about the space, we may need to move things around or delete old stuff.

I think not from our side, maybe we can ask in HN?

I am waiting for some clarification from Janos before merging #70
I think this PR can go into CMSSW_8_0_X_V3 along with Savvas's new additions.

Another thing
@alefisico For AK8CHS jets, I think the matched Puppi subjet indexes are not being stored properly, r being stored at all. Can you check this part?

@dmajumder
I don't see your comment, but let me clarify the change in PR #70.
The JER twiki has a recipe for stochastic smearing, which is the 3rd formula here:
https://twiki.cern.ch/twiki/bin/viewauth/CMS/JetResolution#Smearing_procedures

sigma_JER is written as the "relative pt resolution", so the ratio of the absolute jet resolution (which is what we have currently) divided by the pt of the jet. I made this correction in the PR, it's just one line:
2258c2d#diff-f49af754dc51e5242da678a17c234588L253

So to summarize, this third formula is going to be implemented like this after the fix:
https://github.com/jkarancs/B2GAnaFW/blob/2258c2da84265056613ee755d4cc8cba93799f66/src/JetUserData.cc#L253-L254

Summary: #70 is closed and we do not need the change.
Once I hear from @alefisico about the storage of the AK8PUPPI subjet indices matched to the AK8CHS jets we can move forward

@dmajumder Hello, If you create a new branch on the official framework I can make a pull request with my changes - which are a lot - and then on top of that add the indices issue.

@skyriacoCMS I have created a new branch https://github.com/cmsb2g/B2GAnaFW/tree/CMSSW_8_0_X_V3
You can make the PR there

btw, I will check the subjet indeces once #72 is merged.

#72 Merged. With these developments, any production needs to re-produce both the data and the MC. Also, did anybody change the GT for 03Feb re-miniaod?

Also, did anybody change the GT for 03Feb re-miniaod?

I dont think so, but I can do it once I commit the changes in the indexes.

let's have a look once we have the subjet indices checked.

@dmajumder the subjet indices are not stored because the subjets are not stored. Remember that we are talking about the Puppi jets link to the CHS jets directly in miniAOD. But do we want those subjets? I think we have an entire collection of Puppi jets with subjets and all, so maybe it is not needed.
On the other side I updte the JECs and all the commands needed. I can push my branch with those changes if you want.

@alefisico You're right. However, one reason is to apply the PUPPI SD mass correction for W/Z tagging. This requires the uncorrected PUPPI SD mass. So I'd suggest storing this as a userFloat.

And yes, please push when you're done.

@dmajumder to store the four momentum of the subjets as userFloat? (just to be sure)

@alefisico Only the PUPPI SD uncorrected mass should suffice. You usually calculate it using the sum of the subjet uncorrected four-momenta.

Unfortunately we we have no way of knowing how these recipes will evolve in the future, so this is just an interim solution.

sorry for the delay but I had some stupid merging conflicts that I had to solve.
I just create #73, after that I think we are ready.

PR#73 merged. Do some tests before a new tag?

I have merged #74
time to do tests. BTW, given that we are adding several novel stuff, I think we need a light validation framework to test some variables before making production tags. Any ideas?

Dear all,

In the mean time let me elaborate.
I already privately calculated the uncorrected puppi softdrop mass (needed for W tagging) the same way as it is implemented currently, with some exceptions:

  • I was minimizing the matching distance and only accept the closest jet
  • I was using "slimmedJetsAK8" collection (which is available in MINIAOD) and their "SoftDropPuppi" subjets, exactly like in the recipe [1].
    We currently use: "packedPatJetsAK8PFPuppiSoftDrop" which is not in MINIAOD and its subjet collection: "SoftDrop".

What do you think? this is the relevant code.

[1] - https://github.com/cms-jet/PuppiSoftdropMassCorr/#get-uncorrected-puppi-soft-drop-mass-from-miniaod
[2] - https://github.com/cmsb2g/B2GAnaFW/blob/CMSSW_8_0_X_V3/src/BoostedJetToolboxUserData.cc#L86-L99

Dear all,

Additional to the things I mentioned previously, the subjet 4 momentum also seemed wrong. We need to use the correctedP4(0) according to the POG recipe.
I decided to make a PR (#76) and also add the MVA electron values/categories for general purpose and HZZ IDs, I think others might find them useful.

Using the L1L2L3 corrected subjets when computing softdrop mass is also useful (for top tagging) so I added back that variable too. Now the two new variables will be suitable for both the latest recommendations for W and top tagging. For the former one also need to apply the GEN/RECO corrections, but this can change so it's better to be applied on-the-fly in the analyses.

@jkarancs we already store the SD mass so what extra variables are needed?

We are not storing any corrections since as you said, they change. We just want to store the minimal set of variables needed to compute these corrections

Hi all (sorry for the late reply, I was very sick the last week)
@jkarancs about

I was using "slimmedJetsAK8" collection (which is available in MINIAOD) and their "SoftDropPuppi" subjets, exactly like in the recipe [1].
We currently use: "packedPatJetsAK8PFPuppiSoftDrop" which is not in MINIAOD and its subjet collection: "SoftDrop".

The reason of using the packedPatJetsAK8PFPuppiSoftDrop instead of the slimmedJetsAK8 is because we are NOT using the slimmedJetsAK8 jets at all. We are still reclustering everything througth the jetToolbox for AK8 jets and mimic the puppi jets in the CHS collection as in miniAOD. Maybe that is the reason your calculations are different, although I will keep your way to make sure the AK8 jet is the closest to the jet with subjets.

@dmajumder
Indeed we have the Puppi softdrop mass already, but we don't exactly use the same corrections as the POG. They checked different corrections, and they currently recommend on their twiki [1] to use "groomed top jets (SoftDrop) and AK4 L1L2L3 JEC corrections applied to the subjets", the same as on second bullet on slide 4 here [2]. Our Puppi SoftDrop mass is somewhat different, so to get the exact same results I suggest to use this new corrSDMassPuppi variable.

[1] - https://twiki.cern.ch/twiki/bin/view/CMS/JetTopTagging#13_TeV_working_points_CMSSW_8_0
[2] - https://indico.cern.ch/event/616638/contributions/2491478/attachments/1420668/2176938/TopTaggingSFMeasurement.pdf

@alefisico
I wish you to get well soon.

This new calculation of "uncorrSDMassPuppi" using the MINIAOD collection is only there in order to exactly get the same uncorrected mass that the POG uses for W tagging, otherwise we introduce some kind of difference which is in most cases small, but in some others it is not negligible. Of course we are free to define and use our own reclustered collections, but in my opinion, for mass tagging (both W and top), the matching should be done using the MINIAOD collection in order to be able to reproduce the POG results and use their corrections/scale factors.

@jkarancs

Hello All,

So regarding the GP MVA IDs ok, fair enough, the idea was to just have a single variable - the score of the mva and then the user decides what to cut on. The WP are on the EG twikis but mostly we have seen (IN THE b2g group -Dylan Rankin) that this new training WPs was not doing as good as the old one and one had to shift the WP to get similar or better s/b and stability in higher pt electrons. So I thought to just add the mva value and each one decides what suites them. But what you have is ok in anycase.

Good that you added the HZZ MVA. I did not know that people wanted this so I did not add it.

For the PhotonJets - Sorry for this. Its a typo - single line fix. I will fix it and make a PR once you guys settle on the rest of the issues.

Cheers

@skyriacoCMS
The HZZ MVA ID is indeed useful for people who want to a) have a very high efficiency selection, or b) veto electrons or c) want to extend below 10GeV pt, so I think this is good ID to keep also.

@alefisico @skyriacoCMS @jkarancs are we ready to create a new tag?
I've got queries from some people, so this is now a bit high priority.

I dont have any other thing to include from my side.

@alefisico thanks. once the other two confirm i think we're good to go.

@dmajumder @jkarancs @alefisico
Hello guys,

From my side I have nothing more to add either

@dmajumder
Also ready from my side. Time to make new ntuples!

I've created a new release, please check the notes and the changelog w.r.t. the previous production release:
https://github.com/cmsb2g/B2GAnaFW/releases/tag/v8.0.x_v3.1

I'll announce it tomorrow

@decosa @oiorio

Sorry but could you summarize the email thread here? It's easier to keep track.

I am still not sure if MET recipe needs to be updated. It is mentioned here
https://twiki.cern.ch/twiki/bin/view/CMS/MissingETRecipes
but not here apparently.
https://twiki.cern.ch/twiki/bin/view/CMSPublic/ReMiniAOD03Feb2017Notes#MET_Recipes

BTW, I am not sure if we all need the muon and EG cleaned MET. We should still keep the Type 1 corrected MET since that should satisfy most needs.

Please comment and we'll follow up with a new tag.

ok, I will publicise the tag then, and in the next round we'll see if other METs are needed.
We really have to get the object contacts to sign off from now on.

v8.0.x_v3.2 made. I am going to announce it

Sorry to add this as a late comment (and coming from outside the group), but does the latest tag include the EGM regressions?