For full usage documentation, check out the Pbench User Guide
Stay up to date with the latest changes to Pbench via the release notes
Distributed system analysis made easy with Pbench
Using Pbench to debug Performance Problems
Distributed System Analysis using Pbench Tool
This option is recognized by pbench-register-tool and pbench-unregister-tool: it specifies the name of the tool that is to be (un)registered. pbench-list-tools with the --name option lists all the groups that contain the named tool.
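For example, a hypothetical session using the vmstat tool and the default group:

pbench-register-tool --name=vmstat --group=default   # add vmstat to the "default" group
pbench-list-tools --name=vmstat                      # list every group containing vmstat
pbench-unregister-tool --name=vmstat --group=default # remove it again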
This option is recognized by the benchmark scripts (see Available benchmark scripts above) which use it as a tag for the directory where the benchmark is going to run. The default value is empty. The run directory for the benchmark is constructed this way:
$pbench_run/${benchmark}_${config}_$date
where $pbench_run and $date are set by the /opt/pbench-agent/base script and $benchmark is set to the obvious value by the benchmark script; e.g. a fio run with config=foo would run in the directory /var/lib/pbench-agent/fio_foo_2014-11-10_15:47:04.
This option is recognized by pbench-start-tools, pbench-stop-tools, pbench-tool-trigger and pbench-postprocess-tools. It specifies the directory where the tools are going to stash their data. The default value is /tmp. Each group then uses it as a prefix for its own stash, which has the form $dir/tools-$group. Part of the stash is the set of commands used to start and stop the tools; they are stored in $dir/tools-$group/cmds. The output of each tool is in $dir/tools-$group/$tool.txt.
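For example, with the default /tmp and a group named default containing the (hypothetical) vmstat tool, the layout described above would be:

/tmp/tools-default/
    cmds/          commands used to start and stop each registered tool
    vmstat.txt     output of the vmstat tool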
This option has to be specified identically each time one of these commands is invoked (in fact, each of these commands should be invoked with an identical set of all options, not just --dir).
If you use these tools explicitly (i.e. you don't use one of the benchmark scripts), it is highly recommended that you specify this option explicitly rather than relying on the /tmp default. In particular, make sure that different iterations of your benchmark use different values for this option; otherwise, later results will overwrite earlier ones.
N.B. If you want to run pbench-move-results or pbench-copy-results after the end of the run, your results should be under /var/lib/pbench-agent: pbench-move/copy-results does not know anything about your choice for this option; it only looks in /var/lib/pbench-agent for results to upload. So if you are planning to use pbench-move/copy-results, make sure that the specified directory is a subdirectory of /var/lib/pbench-agent.
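For example, a hypothetical set of invocations that keeps the data under /var/lib/pbench-agent so pbench-move-results can find it (note that all three commands get the identical options):

pbench-start-tools --group=default --iteration=1 --dir=/var/lib/pbench-agent/myrun
# ... run iteration 1 of your benchmark here ...
pbench-stop-tools --group=default --iteration=1 --dir=/var/lib/pbench-agent/myrun
pbench-postprocess-tools --group=default --iteration=1 --dir=/var/lib/pbench-agent/myrun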
Pbench can register tools on remote hosts, start them and stop them remotely and gather up the results from the remote hosts for post-processing. The model is that one has a controller or orchestrator and a bunch of remote hosts that participate in the benchmark run.
The Pbench setup is as follows: pbench-register-tool-set or pbench-register-tool is called on the controller with the --remote option, once for each remote host:
for remote in $remotes; do
    pbench-register-tool-set --remote=$remote --label=foo --group=$group
done
That has two effects: it adds a stanza for the tool to the appropriate tools-$group directory on the remote host, and it adds a stanza like this to the remote host's file in the controller's tools-$group directory:
remote@<host>:<label>
The label is optionally specified with --label and is empty by default.
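For example, registering with --remote=host1.example.com --label=foo (the host name is hypothetical) would, per the description above, add the stanza:

remote@host1.example.com:foo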
When pbench-start-tools is called on the controller, it starts the local collection (if any), but it also interprets the above stanzas and starts the appropriate tools on the remote hosts. Similarly for pbench-stop-tools and pbench-postprocess-tools.
TBD
Tool scripts are mostly boilerplate: they need to take a standard set of commands (--install, --start, --stop, --postprocess) and a standard set of options (--iteration, --group, --dir, --interval, --options). Consequently, the easiest thing to do is to take an existing script and modify it slightly to call the tool of your choice. I describe here the case of turbostat.
Some tools can timestamp each output stanza; others cannot. If the tool can, make sure to use whatever option it requires to include such timestamps (e.g. vmstat -t on RHEL6 or RHEL7 - but strangely not on Fedora 20 - will produce them).
There are some tools that are included in the default installation - others need to be installed separately. Turbostat is not always installed by default, so the tool script installs the package (which is named differently on RHEL6 and RHEL7) if necessary. In some cases (e.g. the sysstat tools), we provide an RPM in the pbench repo and the tool script makes sure to install that if necessary.
The only other knowledge required is where the tool executable resides (usually /usr/bin/$tool).
So here are the non-boilerplate portions of the turbostat tool script. The first interesting part is to set tool_bin to point to the binary:
# Defaults
tool=$script_name
tool_bin=/usr/bin/$tool
This only works if the script is named the same as the tool, which is encouraged. If the installed location of your tool is not /usr/bin, then adjust accordingly.
Since turbostat does not provide a timestamp option, we define a datalog script to add timestamps (vmstat, for example, needs no such script) and use that as the tool command:
case "$script_name" in turbostat)
tool_cmd="$script_path/datalog/$tool-datalog $interval $tool_output_file"
;;
esac
The datalog script uses the pbench-log-timestamp pbench utility to timestamp the output. It will then be up to the postprocessing script to tease out the data appropriately.
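A minimal sketch of such a datalog script, assuming turbostat's -i <interval> option and that pbench-log-timestamp reads stdin and prepends a timestamp to each line:

#!/bin/bash
# Hypothetical sketch of $script_path/datalog/turbostat-datalog; the
# real script's argument handling may differ.
interval=$1
tool_output_file=$2
# Run turbostat at the given sample interval and timestamp every line.
turbostat -i "$interval" | pbench-log-timestamp >>"$tool_output_file"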
The last interesting part dispatches on the command - the install is turbostat-specific, but the rest is boilerplate: --start just executes the tool_cmd as defined above and stashes away the pid, so that --stop can kill the command later; --postprocess calls the separate post-processing script (see below):
# Determine the major RHEL release so we know which package provides turbostat.
release=$(awk '{x=$7; split(x, a, "."); print a[1];}' /etc/redhat-release)
case $release in
    6)
        pkg=cpupowerutils
        ;;
    7)
        pkg=kernel-tools
        ;;
    *)
        # better be installed already
        ;;
esac

case "$mode" in
    install)
        if [ ! -e $tool_bin ]; then
            # -y keeps the install non-interactive
            yum install -y $pkg
        fi
        echo $script_name is installed
        ;;
    start)
        mkdir -p $tool_output_dir
        echo "$tool_cmd" >$tool_cmd_file
        debug_log "$script_name: running $tool_cmd"
        # Run the tool in the background, recording its pid so that --stop
        # can kill it later; wait keeps this script alive until then.
        $tool_cmd >>"$tool_output_file" & echo $! >$tool_pid_file
        wait
        ;;
    stop)
        pid=$(cat "$tool_pid_file")
        debug_log "stopping $script_name"
        kill $pid && /bin/rm "$tool_pid_file"
        ;;
    postprocess)
        debug_log "postprocessing $script_name"
        $script_path/postprocess/$script_name-postprocess $tool_output_dir
        ;;
esac
Finally, there is the post-processing tool: the simplest thing to do is nothing. That's currently the case for the turbostat post-processing tool, but ideally it should produce a JSON file with the data points and an HTML file that uses the nvd3 library to plot the data graphically in a browser. See the postprocess directory for examples, e.g. the iostat post-processing tool.
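A do-nothing post-processing script, sketched hypothetically, would just accept the output directory and exit cleanly:

#!/bin/bash
# Hypothetical do-nothing post-processing script: accept the tool
# output directory and leave the raw turbostat output untouched.
tool_output_dir=$1
exit 0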
TBD
Running
pbench-user-benchmark -- sleep 60
will start whatever data collections are specified in the default tool group, then sleep for 60 seconds. At the end of that period, it will stop the running collection tools and post-process the collected data. Running pbench-move-results afterwards will move the results to the results server as usual.
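A complete session might then look like this (the sleep-test config tag is hypothetical):

pbench-register-tool-set                 # register the default tool set
pbench-user-benchmark --config=sleep-test -- sleep 60
pbench-move-results                      # upload the results to the results server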
Pbench is a set of building blocks, so it can be used in many different ways, but it also makes certain assumptions which, if not satisfied, lead to problems.
Let's assume that you want to run a number of iozone experiments, each with different parameters. Your script probably contains a loop, running one experiment each time around. If you can change your script so that it executes one experiment specified by an argument, then the best way is to use the pbench-user-benchmark script:
pbench-register-tool-set
for exp in experiment1 experiment2 experiment3; do
    pbench-user-benchmark --config $exp -- my-script.sh $exp
done
pbench-move-results
The results end up in directories named /var/lib/pbench-agent/pbench-user-benchmark_${exp}_${timestamp}, one per experiment (unfortunately, the timestamp is recalculated at the beginning of each pbench-user-benchmark invocation), before being uploaded to the results server.
Alternatively, you can modify your script so that each experiment is wrapped with start/stop/postprocess-tools and then call pbench-move-results at the end:
pbench-register-tool-set
dir=/var/lib/pbench-agent
tool_group=default
typeset -i iter=1
for exp in experiment1 experiment2 experiment3; do
    pbench-start-tools --group=$tool_group --dir=$dir --iteration=$iter
    my-script.sh $exp
    pbench-stop-tools --group=$tool_group --dir=$dir --iteration=$iter
    pbench-postprocess-tools --group=$tool_group --dir=$dir --iteration=$iter
    iter+=1
done
pbench-move-results
N.B. You need to invoke the pbench-{start,stop,postprocess}-tools scripts with the same arguments.
See the Installation Instructions.