-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CMS: updates for the MC provenance query 2016 #182
Comments
Start with example datasets: /ADDmonoPhoton_MD-1_d-3_TuneCP5_13TeV-pythia8/RunIISummer20UL16NanoAODv9-106X_mcRun2_asymptotic_v17-v2/NANOAODSIM Expected changes in the scripts: Called from
lhe_generators.py is called separately (see e.g. 2015 readme):
|
|
For the record, DIGIPremix step has a 22 Mb config file containing the list of files in the pile-up Premix datasets.
with
This is a 22M file and if taken for 40k MC datasets, it will result in 880 G disk space, so we can do it differently... |
To do:
|
Updates to LHE generator search
Check which inputs are passed to the job in
Reminder:
Note:
|
The current script gets the provenance information as follows
As the processing scheme has changed from UL processing (no input datasets before AODSIM as they were transient) this won't work anymore.
The query flow should be changed to go directly to the chain:
For an example dataset /ADDmonoPhoton_MD-1_d-3_TuneCP5_13TeV-pythia8/RunIISummer20UL16NanoAODv9-106X_mcRun2_asymptotic_v17-v2/NANOAODSIM:
On the web GUI:
Query by the output file name:
https://cms-pdmv.cern.ch/mcm/requests?produce=%2FADDmonoPhoton_MD-1_d-3_TuneCP5_13TeV-pythia8%2FRunIISummer20UL16NanoAODv9-106X_mcRun2_asymptotic_v17-v2%2FNANOAODSIM&page=0&shown=140737488355327
https://cms-pdmv.cern.ch/mcm/chained_requests?contains=EXO-RunIISummer20UL16NanoAODv9-00205&page=0
then for each request of the query and get the dicts in the respective pages.
On the command line
Using
pred_id
from dasget the chain from the dictionary:
curl -s -k https://cms-pdmv.cern.ch/mcm/public/restapi/requests/get/EXO-RunIISummer20UL16NanoAODv9-00205
get the id of the chained request
auth-get-sso-cookie -u https://cms-pdmv.cern.ch/mcm -o cookies.txt
(see also docs)then, for each step in the chain, get the full dict or what is needed, e.g.
An example with the LHE step:
the gridpack location is in the fragment:
The text was updated successfully, but these errors were encountered: