R is the most popular data analysis language, but there is little concrete experimental data analyzing the time breakdown when running R programs. In this project, we addresses this limitation by systematically cataloging where time is spent when running R programs.
The results from this project can be used by
- Database researchers to better integrate R with database engines
- Architects to design micro-architectural features to improve the performance of R programs
- Programming languages researchers to consider techniques to improve the performance of R programs.
Datasets and Software
The software and datasets required for reproducing the experiments in the paper can be found below.Dependencies and R source code changes
The dependecies and the R source code changes for running the experiments can be downloaded here.Please begin with the README file for instructions.