Measuring performance:

Profiling & benchmarking

Author

Marie-Hélène Burle

Before we talk about ways to improve performance, let’s see how to measure it.

When should you care?

“There is no doubt that the grail of efficiency leads to abuse. Programmers waste enormous amounts of time thinking about, or worrying about, the speed of noncritical parts of their programs, and these attempts at efficiency actually have a strong negative impact when debugging and maintenance are considered. We should forget about small efficiencies, say about 97% of the time: premature optimization is the root of all evil. Yet we should not pass up our opportunities in that critical 3%.”

— Donald Knuth

Optimizing code takes time, can lead to mistakes, and may make code harder to read. Consequently, not all code is worth optimizing and before jumping into optimizations, you need a strategy.

You should consider optimizations when:

you have debugged your code (optimization comes last, don’t optimize a code that doesn’t run),
you will run a section of code (e.g. a function) many times (your optimization efforts will really pay off),
a section of code is particularly slow.

How do you know which sections of your code are slow? Don’t rely on intuition. You need to profile your code to identify bottlenecks.

Profiling

“It is often a mistake to make a priori judgments about what parts of a program are really critical, since the universal experience of programmers who have been using measurement tools has been that their intuitive guesses fail.”

— Donald Knuth

Base R profiler

R comes with a profiler: Rprof.

The data gets collected with:

## Start profiler
Rprof()

<Your code to profile>

## Stop profiler
Rprof(NULL)

This creates a Rprof.out file in your working directory (you can give it another name by passing a name into the initial call to Rprof (e.g. Rprof("test.out")).

The raw data is dense and is better read by running summaryRprof() (or summaryRprof("test.out") if you have created the file test.out rather than the default).

Alternatively, you can run R CMD Rprof (or R CMD Rprof test.out if you named your file) from the command line.

You can find an example here.

Packages

A number of packages run Rprof under the hood and create flame graphs or provide other utilities to visualize the profiling data:

profr,
proftools,
profvis built by posit (formerly RStudio Inc) is the newest tool. See here for an example.

Benchmarking

Once you have identified expressions that are particularly slow, you can use benchmarking tools to compare variations of the code.

In the most basic fashion, you can use system.time(), but this is limited and imprecise.

The microbenchmark package is a much better option. It gives the minimum time, lower quartile, mean, median, upper quartile, and maximum time of R expressions.

The newer bench package is very similar, but it has less overhead, is more accurate, and—for sequential code—gives information on memory usage and garbage collections. This is the package that we will use for this course.

The main function from this package is mark(). You can pass as argument(s) one or multiple expressions that you want to benchmark. By default, it ensures that all expressions output the same result. If you want to remove this test, add the argument check = FALSE.

While mark() gives memory usage and garbage collection information for sequential code, this functionality is not yet implemented for parallel code. When benchmarking parallel expressions, we will have to use the argument memory = FALSE.

You will see many examples of this throughout this course.

When benchmarking code, it’s generally best to use the median rather than the minimum or mean, especially if your data might contain outliers, as the median is less affected by extreme values and provides a better representation of the “typical” performance.