Repository
https://github.com/BFuks/mad5-utopian-exercises
What Will I Learn?
MadAnalysis5 is a suite of software used to generate c/c++ code to analyze particle collider data. It was developed for Linux systems, but with some tweaks can be run using the Windows Subsystem for Linux (WSL, a.k.a. Bash on Ubuntu on Windows) available in Windows 10.
- You will learn how to install MadAnalysis5 and its core dependencies under the WSL
- You will learn how to generate and run the equivalent of 'hello world' to test the installation.
Requirements
- An current installation (build 16215 or later) of Windows 10
- Approximately 1 to 2 gigabytes of drive space, mainly for the WSL installation
- An activated WSL with Ubuntu installed
- Installing WSL itself is beyond the scope of this tutorial, as there are already detailed installation instructions from Microsoft and from third parties
- This tutorial was specifically tested using
Ubuntu 16.04.4 LTS
, the user may check their version by typinglsb_release -a
in the console.
Difficulty
- Intermediate. The user does not need to be an expert using the command line, but must not be intimidated by it.
Tutorial Contents
There are three components to successfully run MadAnalysis5 using a fresh WSL:
- Installing relevant development tools
- Installing the software and its dependencies
- Testing the installation using the equivalent of 'Hello, world!' to analyze some Large Hadron Collider (LHC) data.
All of these steps will be performed within the Bash shell, which is available from the start menu or can be found by pressing the Windows key and searching for 'ubuntu'.
Preparing the WSL environment for development
A base install of Ubuntu under WSL will not include the necessary development tools for c/c++ (e.g. gcc
and make
). The simplest way to install these is to use the build-essential
metapackage. The following three commands will update your package manager and install build-essential
:
sudo apt-get update
sudo apt-get upgrade
sudo apt-get install build-essential
Installing MadAnalysis5 and its dependencies
Although MadAnalysis5 has the ability to install some of its dependencies, that route does not always work under WSL. It is advisable to follow the order of installation listed here.
Python 2.7
As installed under the WSL, Ubuntu does not have Python, which MadAnalysis5 depends on. You must first install Python itself and then set up the environment so that MadAnalysis5 can find it.
sudo apt install python2.7
sudo ln -s /usr/bin/python2.7 /usr/bin/python
Note: This method assumes that a custom WSL instance is being set up for MadAnalysis5. Although creating a symbolic link to python2.7
is the simplest solution, it can cause issues in environments where other projects requiring Python3 are being run. Setting up a Python virtualenv
is probably a workable solution in that case, but is beyond the scope of this tutorial.
Installing the ROOT
dependency
In this section, ROOT
does not refer to the Unix term, but rather the CERN particle physics data analysis package.
Although other versions of ROOT are likely to work, this tutorial was developed using release 6.10.08. The binary release for the appropriate Ubuntu version (here, 16.xx) can be installed using the following commands:
cd ~
curl https://root.cern.ch/download/root_v6.10.08.Linux-ubuntu16-x86_64-gcc5.4.tar.gz | tar xvz
cd root/bin/
source thisroot.sh
cd ~
Note: To avoid having to source thisroot.sh
for every new WSL instance, the user is advised to update their .bashrc
to do so.
Installing madanalysis5
MadAnalysis5 can be installed similarly to ROOT
:
wget https://launchpad.net/madanalysis5/trunk/v1.6/+download/ma5_v1.6.tgz
tar -xvzf ma5_v1.6.tgz
After unarchiving, run ./bin/ma5
.
The screen displayed should be similar to above and list mostly disabled dependencies, but indicate that ROOT
has been found. There should be a prompt asking the number of cores to be configured for. Choose the default number of cores and let the program run. It should list the component number and total number of components it is preparing, as below:
Installing delphes
This is one of the major installation steps which differs from installing MadAnalysis5 under a dedicated Linux environment. Instead of using the MadAnalysis5 prompt to 'install delphes', the user should clone the delphes git repository into the appropraite location and build from source:
From the madanalysis5 directory
cd tools
git clone https://github.com/delphes/delphes.git
cd delphes
make
Compilation should take approximately 30 to 60 minutes on a contemporary computer (e.g. an i7-3xxx with 8 gb RAM was sufficient for this tutorial).
MadAnalysis5 should be run again (~/madanalysis/bin/ma5
) to check that delphes
was found and to reconfigure the components.
Installing PAD
PAD can be installed within the ma5
prompt as per the the original instructions :
install pad
Generating the equivalent of 'Hello, world!' to test the installation.
MadAnalysis5 serves as a code generator to create programs for specific analyses of data. The code resulting from this section of tutorial serves as both a test of the installation and as a completed Task 1a for the associated Utopian project. The user is advised to work through the example themselves, but a full working version is available on github as well as at the primary github repository for the associated task.
Code generation
As in the associated task, the first step is to tell MadAnalysis5 where to create the skeleton code:
./bin/ma5 -E test_folder test_analysis
For the generated code to compile under the WSL an additional step is needed. This is because the environment variables referenced in the generated Build/setup.sh
script, when expanded, contain spaces. This is a result of Windows naming conventions and the conditionals used in the scrip, as generated, cannot handle them.
To fix the script, edit Build/setup.sh
in in vim
or your text editor of choice, and surround all environment variables in conditional statements with double quotes .
For example:
if [[ $MA5_BASE && $PATH && $LD_LIBRARY_PATH ]]; then
should become
if [[ "$MA5_BASE" && "$PATH" && "$LD_LIBRARY_PATH" ]]; then
(User wishing to use a GUI editor such as gedit
will need to setup their system to use X ).
Downloading example data
It is generally best to treat the WSL file system and the host windows file system as two separate entities. The best way to download the example data file is then to use wget, such as we did with the ROOT
binaries. This is also a good time to create the input file read by the generated program.
cd test_folder\test_analysis\Input
wget http://madanalysis.irmp.ucl.ac.be/raw-attachment/wiki/MA5PublicSandBox/tth_aa.root
realpath tth_aa.root > tth_aa.list
Running the software
The generated code, after the modifications above, will run, but will not produce the output required by Task 1a. The user is encouraged to figure out how to modify test_folder/Build/SampleAnalyzer/User/Analyzer/test_analysis.cpp
to produce the desired output. However, one approach is published at the github repository associated with this tutorial, should they become stuck.
For completeness, the rest of the build process is the same as under native Ubuntu and should not take more than a few minutes.
cd test_folder/Build
source setup.sh
make
./MadAnalysis5job ../Input/tth_aa.list
The expected output will differ slightly, based on how the user chose to display the output, but should appear similar to below:
Proof of Work Done
The modified generated code used in this tutorial will be available after pulling at the primary github repository for the associated task https://github.com/BFuks/mad5-utopian-exercises and is currently available at the author's forked repository at https://github.com/effofex/mad5-utopian-exercises/blob/master/ex1a_effofex.cpp