Pilot Benchmark Framework

The Pilot Benchmark Framework provides a tool (bench) and a library (libpilot) to automate computer performance measurement. It answers questions like:

How long should I run this benchmark to get a precise result?
Is this new scheduler really 3% faster, or is that measurement noise?
Which is faster for my database: 20 worker threads or 25?

Design goals:

Be as intelligent as possible — users should not need a statistics background.
Results must be statistically valid: accurate, precise, and repeatable.
Reach a valid result in the shortest possible time.

Pilot is written in C++ for fast in-place analysis and is released under a dual BSD 3-clause and GPLv2+ license.

This repository is a modernized fork of the original ASCAR Pilot by Yan Li et al., with CMake 4.x / Boost 1.74+ compatibility, an optional headless build mode, and a --confidence-level flag in run_program. See What is Pilot? for the full change list.

Documentation

All documentation is in the doc/ directory as reStructuredText, readable directly on GitHub or buildable into HTML with Sphinx.

Getting Started

Document	Description
What is Pilot?	Overview, motivation, and changes in this fork
Tutorial: Benchmarking C++ Functions	Use `libpilot` to measure function duration with full statistical rigor
Tutorial: Command-Line Benchmarking	Use `bench` to drive any CLI program, with a full `dd` example

Installation and Building

Document	Description
Build from Source	CMake options, build modes, Python binding
Install on Linux	Linux build instructions
Install on macOS	macOS build instructions

Concepts and Statistical Background

Document	Description
Statistics 101	Accuracy, precision, repeatability, and confidence intervals explained
Terms and Definitions	PI, session, round, work amount, unit reading, WPS
Autocorrelation Detection and Mitigation	Why autocorrelation matters and how subsession analysis fixes it
Deciding Optimal Session Length	How Pilot decides when to stop collecting data
Warm-up and Cool-Down Phase Detection	EDM change-point detection and the WPS linear regression model
Comparing Results	Welch's t-test and the algorithm for ranking benchmark results
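
As a rough illustration of the idea behind the "Comparing Results" document, the sketch below computes Welch's t statistic and the Welch-Satterthwaite degrees of freedom for two sets of readings. It is standalone Python using only the standard library and does not call Pilot's API. The function name `welch_t` and the sample readings are made up for illustration, and Pilot's own implementation may differ.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Return (t, df) for Welch's unequal-variance t-test on two samples."""
    na, nb = len(a), len(b)
    va, vb = variance(a), variance(b)  # sample variances (n - 1 denominator)
    se2 = va / na + vb / nb            # squared standard error of the difference
    t = (mean(a) - mean(b)) / math.sqrt(se2)
    # Welch-Satterthwaite approximation of the degrees of freedom
    df = se2 ** 2 / ((va / na) ** 2 / (na - 1) + (vb / nb) ** 2 / (nb - 1))
    return t, df


# Made-up throughput readings (MB/s) for two hypothetical scheduler builds
old = [100.0, 102.0, 98.0, 101.0, 99.0]
new = [103.0, 105.0, 101.0, 104.0, 102.0]

t, df = welch_t(new, old)
print(f"t = {t:.3f}, df = {df:.1f}")
```

With these made-up readings the statistic is t = 3.0 with 8 degrees of freedom. The two-sided 95% critical value for 8 degrees of freedom is about 2.306, so a gap this large would be unlikely to come from measurement noise alone.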

Command-Line Reference

Document	Description
Command-Line Reference	All flags and options for `bench run_program`, `bench analyze`, and `bench detect_changepoint_edm`; preset table; confidence level vs. CI width explained
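
To make the "confidence level vs. CI width" relationship concrete, here is a minimal sketch that computes the half-width of a confidence interval for the mean at several confidence levels. It assumes a normal approximation and independent readings; Pilot itself may use a different method. The helper `ci_half_width` and the readings are hypothetical.

```python
import math
from statistics import NormalDist, stdev


def ci_half_width(sample, confidence):
    """Half-width of a normal-approximation CI for the mean of `sample`."""
    # Two-sided critical value, e.g. about 1.96 for confidence = 0.95
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    return z * stdev(sample) / math.sqrt(len(sample))


# Made-up data: 20 readings
readings = [100.0, 102.0, 98.0, 101.0, 99.0] * 4

for level in (0.90, 0.95, 0.99):
    print(f"{level:.0%} CI: mean +/- {ci_half_width(readings, level):.3f}")
```

A higher confidence level widens the interval for the same data, while collecting more readings narrows it. That trade-off is why asking for a higher confidence level usually means a longer benchmark run.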

Build from Source

Pilot supports two build modes:

WITH_TUI=OFF (default): headless CLI, no curses/CDK dependency.
WITH_TUI=ON: adds the optional curses/CDK text UI.

Headless build (recommended for CI, containers, scripted use):

cmake -S . -B build -DCMAKE_BUILD_TYPE=Release -DWITH_TUI=OFF
cmake --build build -j

The resulting binary is build/cli/bench.

With text UI:

cmake -S . -B build -DCMAKE_BUILD_TYPE=Release -DWITH_TUI=ON
cmake --build build -j

This build requires the curses/CDK development headers and libraries.

Quick Start

# Benchmark a C++ function — link libpilot and call simple_runner()
g++ -std=c++14 -O2 -o my_bench my_bench.cc -lpilot
./my_bench

# Run a command-line benchmark (dd example, Option 1 — recommended)
bench run_program -d 0 --wps -w 1,5000 \
    -- ./run_dd.sh /tmp/io_test %WORK_AMOUNT%

# Analyze an existing CSV of measurements
bench analyze --preset normal data.csv

See the tutorials for full worked examples.

Development

We follow Google's C++ Style Guide, except that we indent with four spaces rather than two.

Acknowledgments

This is a research project from the Storage Systems Research Center at UC Santa Cruz. The work was supported in part by the National Science Foundation under awards IIP-1266400, CCF-1219163, CNS-1018928, and CNS-1528179; by the Department of Energy under award DE-FC02-10ER26017/DESC0005417; by a Symantec Graduate Fellowship; by a grant from Intel Corporation; and by industrial members of the Center for Research in Storage Systems. Any opinions, findings, and conclusions expressed in this material are those of the author(s) and do not necessarily reflect the views of the sponsors.

Citation

If you use Pilot in your research, please cite the original paper:

@inproceedings{li:mascots16,
  author    = {Yan Li and Yash Gupta and Ethan L. Miller and Darrell D. E. Long},
  title     = {Pilot: A Framework that Understands How to Do Performance
               Benchmarks the Right Way},
  booktitle = {Proceedings of the IEEE 24th International Symposium on
               Modeling, Analysis, and Simulation of Computer and
               Telecommunication Systems (MASCOTS'16)},
  year      = {2016},
  publisher = {IEEE},
}
