GitHub repositories for CMS Big Data projects
-
Victor
-
Dominick
- Notebooks in scala to process data and write out parquet and in python to generate plots from information in parquet files https://github.com/olivito/scala-spark-root/tree/master/notebooks
- Applications in scala https://github.com/olivito/spark-root-applications
- Forked from https://github.com/vkhristenko/spark-root-applications
- Dimuon reduction code: https://github.com/olivito/spark-root-applications/blob/master/src/main/scala/org/dianahep/sparkrootapplications/examples/DimuonReductionAODMultiDataset.scala
- Running on multiple samples in one spark job: https://github.com/olivito/spark-root-applications#running-on-multiple-samples-in-one-spark-job
-
SiewYan
- Minimalistic Root file Operation Example: Reading NanoAod and save a histograms in png format: https://gist.github.com/SiewYan/20ed7942497631749c469866395dcf06