2. HPCBench Standard Benchmark¶

The “standard” benchmark is a general purpose benchmark. Its configuration is passed thru the attributes field in the YAML campaign file.

It is made of 3 top attributes, each of one being a dictionary:

metrics: specifies how to extract information from command output
executables: describe the commands to execute
shells (optional): provide more flexibility to build the commands to execute

2.1. A trivial example¶

Before getting into all the details, here’s a basic standard benchmark configuration to get started with:

benchmarks:
  '*':
    simple:
      type: standard
      attributes:
        executables:
            - command: ["echo", "42"]
        metrics:
            the_answer_to_everything:
                match: "(.*)"
                type: Cardinal

2.2. Metrics configuration reference¶

This section specifies the metrics the benchmark has to extract from command outputs, as dictionary “name” -> “configuration”

A metric configuration is a dictionary made of the following keys:

2.2.1. match¶

The regular expression used to extract the value from the program execution output. The expression must specify one and only one group used to extract the proper value.

The string to match has trailing whitespace removed.

2.2.2. type¶

The metric type, as specified in the hpcbench.API.Metric

2.2.3. multiply_by (optional)¶

If any, the extracted value will be multiplied by the specified value. It is useful to convert a unit, for instance from flop to Gflop.

2.2.4. category (optional)¶

The benchmark category the metric applies to. Default is standard.

2.2.5. from (optional)¶

Specifies the output file to look for. Default is stdout.

2.2.6. when (optional)¶

Provides a way to override fields above according to the metas of the executed command. Conditions may be declared as a list, the first condition evaluated to true providing a given property is used.

A condition is a dictionary composed of the following attributes:

conditions: a dictionary of “meta_name” -> “value” where value is either a value of a list of values.
match, multiply_by, from (optional): provide value that supersedes default one if conditions above are met.

For instance:

benchmarks:
  '*':
    my-test:
      type: standard
      attributes:
        metrics:
          simulation_time:
            match: "\\s+total compute time:\\s(\\d*\\.?\\d+) \\[ms\\]"
            type: Second
            multiply_by: 0.001
            when:
            -
              conditions:
                compiler: [gcc, icc]
                branch: feat/bump-performance
              match: "\\s+total pool clone time:\\s(\\d*\\.?\\d+) \\[ms\\]"
              multiply_by: 1.0

This example describes how the simulation_time metric has to be extracted and computed. In the general case:

the regular expression used to extract the metric is the “… total compute time” expression
The type of the metrics is Metric.Second
The extracted value will by multiplied by 0.001

But when the compiler metas is either “gcc” or “icc” and when the branch meta is “feat/bump_performance”:

the regular expression is different
the multiplication factor is 1

2.3. Executables configuration reference¶

This section specifies the commands the benchmark has to execute. It is made of a list of dictionaries. Each dictionary describes a set of commands to run. They are composed of the following keys:

2.3.1. command¶

Describes the command to launch. It must be a list of elements. Elements support the Python Format Specification Mini-Language where the possible attributes are the metas describe below.

2.3.2. metas¶

A list of dictionary or the dictionary itself if this is the only one. Each dictionary describes a set of metas values.

executables:
-
  command: [echo, {foo}, {bar}]
  metas:
  -
    foo: 1
    bar: [2, 3]
  - foo: [4, 5]
    bar: [6, 7]

Using a list of values allows you to describe a combination of commands. In the example above, it means launching 6 commands:

echo 1 2
echo 1 3
echo 4 6
echo 4 7
echo 5 6
echo 5 7

It is possible to specify several metas at once:

executables:
-
  command: [echo, {foo}, {bar}]
  metas:
  -
    foo: 1
    bar: [2, 3]
  - "foo, bar": [[4, 6], [5, 7]

This sample is equivalent to the previous.

Some functions can also be called to specify the list of values a meta can take, among:

range, same as Python range builtin
linspace, geomspace, linspace, arange, same as NumPy corresponding functions.
correlate, to specify multi metas at once.

In this case, the meta description is a dictionary providing the following attributes:

function: name of the function to call
args: optional list of arguments given to the function
kwargs: optional dictionary of keywords arguments given to the function

For instance:

executables:
-
  command: [echo, {foo}, {bar}]
  metas:
  -
    foo: 1
    bar:
      function: range
      args: [2, 4]

Will launch the 2 commands:

echo 1 2
echo 1 3

The correlate signature is as follow: * a mandatory list of series given in the args section * 2 optional arguments: explore and with_overflow

A serie is made of a list of arguments givento a NumPY function returning the values the meta has to take, for instance:

[geomspace, 32, 1, num=6]

allowed functions are: geomspace, logspace, linspace, arange an additional _cast=<type> allows you to cast the result of the NumPy function, for instance: [geomspace, 32, 1, num=6, _cast=int]

For example:

executables:
-
  command: [mycommand, -p, {processes}, -t, {threads}]
  metas:
  -
    "[processes, threads]":
      function: correlate
      args:
      - [geomspace, 8, 1, num=4, _cast=int]
      - [geomspace, 1, 8, num=4, _cast=int]

Will launch the following 4 commands:

mycommand -p 8 -t 1
mycommand -p 4 -t 2
mycommand -p 2 -t 4
mycommand -p 1 -t 8

The explore optional argument allows you to test additional combinations by modifying every combinations by given matrices

For example:

executables:
-
  command: [mycommand, -p, {processes}, -t, {threads}]
  metas:
  -
    "[processes, threads]":
      function: correlate
      args:
      - [geomspace, 4, 1, num=3, _cast=int]
      - [geomspace, 1, 4, num=3, _cast=int]
      kwargs:
        explore:
        - [0, 1]

Will launch the following 8 commands:

mycommand -p 4 -t 1
mycommand -p 2 -t 2
mycommand -p 1 -t 4
mycommand -p 4 -t 2
mycommand -p 2 -t 3

If the optional boolean with_overflow keyword argument was set to True, then an additional (1, 1) command would have been triggered, corresponding to the initiual (1, 4) combination plus (1, 1) matrix. Instead of having (1, 5), the first value of the threads serie would had been used, resulting in the (1, 1) value. Because such combination is usually pointless, the with_overflow default value is False.

2.3.3. category (optional)¶

A category to ease classification. Default value is “standard”.

2.4. Shells configuration reference¶

This sections describes a list of commands that may prefix the commands specified in the executables section. It is composed of a list of dictionary. Each dictionary is made of the following keys:

2.4.1. commands¶

A list of shell commands, for instance

shells:
- commands:
  - . /usr/share/lmod/lmod/init/bash
  - . $SPACK_ROOT/share/spack/setup-env.sh
  - spack install myapp@{branch} %{compiler}
  - spack load myapp@{branch} %{compiler}

Specified commands also support the Python Format Specification Mini-Language to use the metas of execution context. Those metas can be either those define in the shells or executables section.

2.4.2. metas¶

Provides either a list of a dictionary providing additional metas values or the dictionary itself if this is the sole dictionary. Like in the executables section, it describes a combination of metas.

shells:
- commands:
  - . /usr/share/lmod/lmod/init/bash
  - . $SPACK_ROOT/share/spack/setup-env.sh
  - spack install myapp@{branch} %{compiler}
  - spack load myapp@{branch} %{compiler}
  metas:
    compiler: [gcc, icc]