Introduction

The C ++-based RO OT W orkflow for N -tuples (CROWN) is a fast new way to convert NanoAOD samples into flat TTrees to be used in further analysis. The main focus of the framework is to provide a fast and clean way of selecting events and calculating quantities and weights. The framework has minimal dependencies and only uses ROOT and it’s Dataframe as a backend.

Design Idea

The framework consists of two main parts, a python configuration and a set of C++ functions. The python configuration is used to automatically generate a C++ script, which is then compiled to an executable using cmake and all available compiler optimizations. This has the main advantage that the compiled executable is very fast and efficient in calculating the output TTree. In the following sketch, the overall workflow of CROWN is illustrated.

Getting started

Warning

The Framework depends on the scale factors provided by CMS. These are directly included in the repository via a git submodule. Since the scale factors are added from the CERN gitlab, access to the CERN gitlab repository (https://gitlab.cern.ch/cms-nanoAOD/jsonpog-integration), is needed. Since the repository is added via SSH, your SSH key must be added to the CERN gitlab instance ( A tutorial on how to do this can be found here: https://docs.gitlab.com/ee/user/ssh.html#add-an-ssh-key-to-your-gitlab-account). For the instructions to work, you also have to add the SSH key to your GitHub.com account. The instructions to do this can be found here: https://help.github.com/articles/adding-a-new-ssh-key-to-your-github-account/

After making sure, that the access rights are given, setting up the framework is straightforward.

First, clone the Repository

git clone --recurse-submodules git@github.com:KIT-CMS/CROWN.git

and source the current LCG stack (at the moment we use a nightly build)

source init.sh

after this, the framework should be installed, but without any analysis, other than the example analysis. If you want to set up a specific analysis, you can do so by adding the name of the analysis to your init.sh command. Currently, supported analyses are:

Available Analyses
Analysis name	Repository
`tau`	https://github.com/KIT-CMS/TauAnalysis-CROWN
`earlyrun3`	https://github.com/khaosmos93/CROWN-config-earlyRun3

So to set the tau Analysis, you can do so by running

source init.sh tau

Running the framework

To create a new executable, first create a build directory

mkdir build && cd build

and then run cmake to set up the Makefiles. A python configuration is needed to specify the code, that should be generated. Configurations are located in the analysis_configuations directory. Within this folder, a subfolder for each type of analysis is created. Within the analysis folder, multiple Configurations belonging to the same analysis can be located. For example in the tau analysis, a main configuration config.py as well as several smaller Configurations exist.

Note

You have to provide both 1. the analysis that you want to run e.g. -DANALYSIS=template_analysis 2. the configuration that should be used -DCONFIG=min_config.

For the cmake command, a minimal set of options has to be provided, in this case, we use the template analysis with the minimal example

cmake .. -DANALYSIS=template_analysis -DCONFIG=min_config -DSAMPLES=data -DERAS=2018 -DSCOPES=mm

The options that are currently available are:

-DANALYSIS=template_analysis: The analysis to be used. This is the name of the folder in the analysis_configurations directory.

-DCONFIG=min_config: The configuration to be used. This is the name of the python configuration file. The file has to be located in the directory of the analysis and the path is provided in the Python import syntax e.g. subfolder.myspecialconfig

-DSAMPLES=emb: The samples to be used. This is a single sample or a comma-separated list of sample names.

-DERAS=2018: The era to be used. This is a single era or a comma-separated list of era names.

-DSCOPES=et: The scopes to be run. This is a single scope or a comma-separated list of scopes. The global scope is always run.

-DTHREADS=20: The number of threads to be used. Defaults to single threading.

-DSHIFTS=all: The shifts to be used. Defaults to all shifts. If set to all, all shifts are used, if set to none, no shifts are used, so only nominal is produced. If set to a comma-separated list of shifts, only those shifts are used. If set to only a substring matching multiple shifts, all shifts matching that string will be produced e.g. -DSHIFTS=tauES will produce all shifts containing tauES in the name.

-DDEBUG=true: If set to true, the code generation will run with debug information and the executable will be compiled with debug flags

-DOPTIMIZED=true: If set to true, the compiler will run with -O3, resulting in slower build times but faster runtimes. Should be used for developments, but not in production.

Compile the executable using

make install -j 20

The recommended build system is using regular UNIX build files, however, as an additional option, the ninja build system (https://ninja-build.org/) can be used for CROWN. To use ninja, set export CMAKE_GENERATOR="Ninja" in the init.sh as env variable, and then use the ninja install -j 20 command to compile the executable. Since CROWN profits from the parallelization of the build process, the number of threads can and should be set using the -j option.

After the compilation, the CROWN executable can be found in the build/bin folder. The executable can be used via a single output file followed by an arbitrary number of input files.

./executable_name outputfile.root inputfile_1.root inputfile_2.root

Creating Documentation

The Web documentation at readthedocs is updated automatically. However, if you want to create the documentation locally you have to first create a new build directory like build_docs

mkdir build_docs && cd build_docs

then run cmake to set the documentation building process

cmake ../docs

and build the documentation using

make

The resulting documentation can then be found in

build_docs/docs/index.html