lab.experiment — Create experiments

Experiment

class lab.experiment.Experiment(path=None, environment=None)[source]

Base class for Lab experiments.

See Concepts for a description of how Lab experiments are structured.

The experiment will be built at path. It defaults to <scriptdir>/data/<scriptname>/. E.g., for the script experiments/myexp.py, the default path will be experiments/data/myexp/.

environment must be an Environment instance. You can use LocalEnvironment to run your experiment on a single computer (default). If you have access to the computer grid in Basel you can use the predefined grid environment BaselSlurmEnvironment. Alternatively, you can derive your own class from Environment.

add_command(name, command, time_limit=None, memory_limit=None, soft_stdout_limit=1024, hard_stdout_limit=10240, soft_stderr_limit=64, hard_stderr_limit=10240, **kwargs)

Call an executable.

If invoked on a run, this method adds the command to the specific run. If invoked on the experiment, the command is appended to the list of commands of all runs.

name is a string describing the command. It must start with a letter and consist exclusively of letters, numbers, underscores and hyphens.

command has to be a list of strings where the first item is the executable.

After time_limit seconds the signal SIGXCPU is sent to the command. The process can catch this signal and exit gracefully. If it doesn’t catch the SIGXCPU signal, the command is aborted with SIGKILL after five additional seconds. The time spent by a command is the sum of time spent across all threads of the process.

The command is aborted with SIGKILL when it uses more than memory_limit MiB.

You can limit the log size (in KiB) with a soft and hard limit for both stdout and stderr. When the soft limit is hit, an unexplained error is registered for this run, but the command is allowed to continue running. When the hard limit is hit, the command is killed with SIGTERM. This signal can be caught and handled by the process.

By default, there are limits for the log and error output, but time and memory are not restricted.

All kwargs (except stdin) are passed to subprocess.Popen. Instead of file handles you can also pass filenames for the stdout and stderr keyword arguments. Specifying the stdin kwarg is not supported.

>>> exp = Experiment()
>>> run = exp.add_run()
>>> # Add commands to a *specific* run.
>>> run.add_command("solver", ["mysolver", "input-file"], time_limit=60)
>>> # Add a command to *all* runs.
>>> exp.add_command("cleanup", ["rm", "my-temp-file"])

Make sure to call all Python programs from the currently active Python interpreter, i.e., sys.executable. Otherwise, the system Python version might be used instead of the Python version from the virtual environment.

>>> run.add_command("myplanner", [sys.executable, "planner.py", "input-file"])
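A command that wants to exit gracefully when the time limit is reached can install a handler for SIGXCPU before doing its work. A minimal sketch (the handler name is illustrative; SIGXCPU is only available on Unix platforms):

```python
import signal
import sys


def handle_sigxcpu(signum, frame):
    # Flush partial results here, then exit before the follow-up
    # SIGKILL arrives five seconds later.
    print("Time limit reached, exiting gracefully.")
    sys.exit(1)


signal.signal(signal.SIGXCPU, handle_sigxcpu)
```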
add_fetcher(src=None, dest=None, merge=None, name=None, filter=None, **kwargs)[source]

Add a step that fetches results from experiment or evaluation directories into a new or existing evaluation directory.

You can use this method to combine results from multiple experiments.

src can be an experiment or evaluation directory. It defaults to exp.path.

dest must be a new or existing evaluation directory. It defaults to exp.eval_dir. If dest already contains data and merge is set to None, the user will be prompted whether to override the existing data or to merge the old and new data. Setting merge to True or to False has the effect that the old data is merged or replaced (and the user will not be prompted).

If no name is given, the step is named “fetch-” followed by the basename of src.

You can fetch only a subset of runs (e.g., runs for specific domains or algorithms) by passing filters with the filter argument.

Example setup:

>>> exp = Experiment("/tmp/exp")

Fetch all results and write a single combined properties file to the default evaluation directory (this step is added by default):

>>> exp.add_fetcher(name="fetch")

Merge the results from “other-exp” into this experiment’s results:

>>> exp.add_fetcher(src="/path/to/other-exp-eval")

Fetch only the runs for certain algorithms:

>>> exp.add_fetcher(filter_algorithm=["algo_1", "algo_5"])
add_new_file(name, dest, content, permissions=0o644)

Write content to /path/to/exp-or-run/dest and make the new file available to the commands as name.

name is an alias for the resource in commands. It must start with a letter and consist exclusively of letters, numbers and underscores.

>>> exp = Experiment()
>>> run = exp.add_run()
>>> run.add_new_file("learn", "learn.txt", "a = 5; b = 2; c = 5")
>>> run.add_command("print-trainingset", ["cat", "{learn}"])
add_parse_again_step()[source]

Add a step that copies the parsers from their originally specified locations to the experiment directory and runs all of them again. This step overwrites the existing properties file in each run dir.

Do not forget to run the default fetch step again to overwrite existing data in the -eval dir of the experiment.

add_parser(path_to_parser)[source]

Add a parser to each run of the experiment.

Add the parser as a resource to the experiment and add a command that executes the parser to each run. Since commands are executed in the order they are added, parsers should be added after all other commands. If you need to change your parsers and execute them again you can use the add_parse_again_step() method.

path_to_parser must be the path to a Python script. The script is executed in the run directory and manipulates the run’s “properties” file. The last part of the filename (without the extension) is used as a resource name. Therefore, it must be unique among all parsers and other resources. Also, it must start with a letter and contain only letters, numbers, underscores and dashes (which are converted to underscores automatically).

For information about how to write parsers see Parser.
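The naming rule above can be sketched as follows. This is a simplified illustration of the described behavior (strip the extension, convert dashes to underscores), not Lab's actual code:

```python
import os


def parser_resource_name(path_to_parser):
    """Derive a resource name from a parser path: take the filename
    without its extension and turn dashes into underscores."""
    stem = os.path.splitext(os.path.basename(path_to_parser))[0]
    return stem.replace("-", "_")


print(parser_resource_name("parsers/translator-parser.py"))  # translator_parser
```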

add_report(report, name='', eval_dir='', outfile='')[source]

Add report to the list of experiment steps.

This method is a shortcut for add_step(name, report, eval_dir, outfile) and uses sensible defaults for omitted arguments.

If no name is given, use outfile or the report’s class name.

By default, use the experiment’s standard eval_dir.

If outfile is omitted, compose a filename from name and the report’s format. If outfile is a relative path, put it under eval_dir.

>>> from downward.reports.absolute import AbsoluteReport
>>> exp = Experiment("/tmp/exp")
>>> exp.add_report(AbsoluteReport(attributes=["coverage"]))
add_resource(name, source, dest='', symlink=False)

Include the file or directory source in the experiment or run.

name is an alias for the resource in commands. It must start with a letter and consist exclusively of letters, numbers and underscores. If you don’t need an alias for the resource, set name=’’.

source is copied to /path/to/exp-or-run/dest. If dest is omitted, the last part of the path to source will be taken as the destination filename. If you only want an alias for your resource, but don’t want to copy or link it, set dest to None.

Example:

>>> exp = Experiment()
>>> exp.add_resource("planner", "path/to/planner")

includes the file “planner” in the experiment directory. You can use {planner} to reference it in a run’s commands:

>>> run = exp.add_run()
>>> run.add_resource("domain", "path-to/gripper/domain.pddl")
>>> run.add_resource("task", "path-to/gripper/prob01.pddl")
>>> run.add_command("plan", ["{planner}", "{domain}", "{task}"])
add_run(run=None)[source]

Schedule run to be part of the experiment.

If run is None, create a new run, add it to the experiment and return it.

add_step(name, function, *args, **kwargs)[source]

Add a step to the list of experiment steps.

Use this method to add experiment steps like writing the experiment file to disk, removing directories and publishing results. To add fetch and report steps, use the convenience methods add_fetcher() and add_report().

name is a descriptive name for the step. When selecting steps on the command line, you may either use step names or their indices.

function must be a callable Python object, e.g., a function or a class implementing __call__.

args and kwargs will be passed to function when the step is executed.

>>> import shutil
>>> import subprocess
>>> from lab.experiment import Experiment
>>> exp = Experiment("/tmp/myexp")
>>> exp.add_step("build", exp.build)
>>> exp.add_step("start", exp.start_runs)
>>> exp.add_step("rm-eval-dir", shutil.rmtree, exp.eval_dir)
>>> exp.add_step("greet", subprocess.call, ["echo", "Hello"])
build(write_to_disk=True)[source]

Finalize the internal data structures, then write all files needed for the experiment to disk.

If write_to_disk is False, only compute the internal data structures. This is only needed on grids for FastDownwardExperiment.build(), which turns the added algorithms and benchmarks into Runs.

eval_dir

Return the name of the default evaluation directory.

This is the directory where the fetched and parsed results will land by default.

name

Return the directory name of the experiment’s path.

run_steps()[source]

Parse the commandline and run selected steps.

set_property(name, value)

Add a key-value property.

These can be used later, for example, in reports.

>>> exp = Experiment()
>>> exp.set_property("suite", ["gripper", "grid"])
>>> run = exp.add_run()
>>> run.set_property("domain", "gripper")
>>> run.set_property("problem", "prob01.pddl")

Each run must have the property id, a list of strings that is unique among all runs. It determines where the results for this run land in the combined properties file.

>>> run.set_property("id", ["algo1", "task1"])
>>> run.set_property("id", ["algo2", "domain1", "problem1"])
start_runs()[source]

Execute all runs that were added to the experiment.

Depending on the selected environment this method will start the runs locally or on a computer grid.

Custom command line arguments

lab.experiment.ARGPARSER

ArgumentParser instance that can be used to add custom command line arguments. You can import it, add your arguments and call its parse_args() method to retrieve the argument values. To avoid confusion with step names you shouldn’t use positional arguments.

Note

Custom command line arguments are only passed to locally executed steps.

from lab.experiment import ARGPARSER

ARGPARSER.add_argument(
    "--test",
    choices=["yes", "no"],
    required=True,
    dest="test_run",
    help="run experiment on small suite locally")

args = ARGPARSER.parse_args()
if args.test_run == "yes":
    print("perform test run")
else:
    print("run real experiment")

Run

class lab.experiment.Run(experiment)[source]

An experiment consists of multiple runs. There should be one run for each (algorithm, benchmark) pair.

A run consists of one or more commands.

experiment must be an Experiment instance.

add_command(name, command, time_limit=None, memory_limit=None, soft_stdout_limit=1024, hard_stdout_limit=10240, soft_stderr_limit=64, hard_stderr_limit=10240, **kwargs)

Call an executable.

If invoked on a run, this method adds the command to the specific run. If invoked on the experiment, the command is appended to the list of commands of all runs.

name is a string describing the command. It must start with a letter and consist exclusively of letters, numbers, underscores and hyphens.

command has to be a list of strings where the first item is the executable.

After time_limit seconds the signal SIGXCPU is sent to the command. The process can catch this signal and exit gracefully. If it doesn’t catch the SIGXCPU signal, the command is aborted with SIGKILL after five additional seconds. The time spent by a command is the sum of time spent across all threads of the process.

The command is aborted with SIGKILL when it uses more than memory_limit MiB.

You can limit the log size (in KiB) with a soft and hard limit for both stdout and stderr. When the soft limit is hit, an unexplained error is registered for this run, but the command is allowed to continue running. When the hard limit is hit, the command is killed with SIGTERM. This signal can be caught and handled by the process.

By default, there are limits for the log and error output, but time and memory are not restricted.

All kwargs (except stdin) are passed to subprocess.Popen. Instead of file handles you can also pass filenames for the stdout and stderr keyword arguments. Specifying the stdin kwarg is not supported.

>>> exp = Experiment()
>>> run = exp.add_run()
>>> # Add commands to a *specific* run.
>>> run.add_command("solver", ["mysolver", "input-file"], time_limit=60)
>>> # Add a command to *all* runs.
>>> exp.add_command("cleanup", ["rm", "my-temp-file"])

Make sure to call all Python programs from the currently active Python interpreter, i.e., sys.executable. Otherwise, the system Python version might be used instead of the Python version from the virtual environment.

>>> run.add_command("myplanner", [sys.executable, "planner.py", "input-file"])
add_new_file(name, dest, content, permissions=0o644)

Write content to /path/to/exp-or-run/dest and make the new file available to the commands as name.

name is an alias for the resource in commands. It must start with a letter and consist exclusively of letters, numbers and underscores.

>>> exp = Experiment()
>>> run = exp.add_run()
>>> run.add_new_file("learn", "learn.txt", "a = 5; b = 2; c = 5")
>>> run.add_command("print-trainingset", ["cat", "{learn}"])
add_resource(name, source, dest='', symlink=False)

Include the file or directory source in the experiment or run.

name is an alias for the resource in commands. It must start with a letter and consist exclusively of letters, numbers and underscores. If you don’t need an alias for the resource, set name=’’.

source is copied to /path/to/exp-or-run/dest. If dest is omitted, the last part of the path to source will be taken as the destination filename. If you only want an alias for your resource, but don’t want to copy or link it, set dest to None.

Example:

>>> exp = Experiment()
>>> exp.add_resource("planner", "path/to/planner")

includes the file “planner” in the experiment directory. You can use {planner} to reference it in a run’s commands:

>>> run = exp.add_run()
>>> run.add_resource("domain", "path-to/gripper/domain.pddl")
>>> run.add_resource("task", "path-to/gripper/prob01.pddl")
>>> run.add_command("plan", ["{planner}", "{domain}", "{task}"])
set_property(name, value)

Add a key-value property.

These can be used later, for example, in reports.

>>> exp = Experiment()
>>> exp.set_property("suite", ["gripper", "grid"])
>>> run = exp.add_run()
>>> run.set_property("domain", "gripper")
>>> run.set_property("problem", "prob01.pddl")

Each run must have the property id, a list of strings that is unique among all runs. It determines where the results for this run land in the combined properties file.

>>> run.set_property("id", ["algo1", "task1"])
>>> run.set_property("id", ["algo2", "domain1", "problem1"])

Parser

class lab.parser.Parser[source]

Parse files in the current directory and write results into the run’s properties file.

add_function(function, file='run.log')[source]

Call function(open(file).read(), properties) during parsing.

Functions are applied after all patterns have been evaluated.

The function is passed the file contents and the properties dictionary. It must manipulate the passed properties dictionary. The return value is ignored.

Example:

>>> import re
>>> from lab.parser import Parser
>>> def parse_states_over_time(content, props):
...     matches = re.findall(r"(.+)s: (\d+) states\n", content)
...     props["states_over_time"] = [(float(t), int(s)) for t, s in matches]
...
>>> parser = Parser()
>>> parser.add_function(parse_states_over_time)

You can use props.add_unexplained_error("message") when your parsing function detects that something went wrong during the run.

add_pattern(attribute, regex, file='run.log', type=int, flags='', required=False)[source]

Look for regex in file, cast what is found in brackets to type and store it in the properties dictionary under attribute. During parsing roughly the following code will be executed:

contents = open(file).read()
match = re.compile(regex).search(contents)
properties[attribute] = type(match.group(1))

flags must be a string of Python regular expression flags (see https://docs.python.org/3/library/re.html). E.g., flags="M" lets “^” and “$” match at the beginning and end of each line, respectively.

If required is True and the pattern is not found in file, an error message is printed to stderr.

>>> parser = Parser()
>>> parser.add_pattern("facts", r"Facts: (\d+)", type=int)
parse()[source]

Search all patterns and apply all functions.

The found values are written to the run’s properties file.
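The patterns-then-functions pipeline described above can be sketched without Lab. This is a simplified illustration of the documented order of operations, not the real Parser (which also handles multiple files, regex flags and required patterns):

```python
import re


def run_parser(content, patterns, functions):
    """Apply regex patterns first, then post-processing functions,
    mirroring the order described for Parser.parse()."""
    props = {}
    for attribute, regex, cast in patterns:
        match = re.search(regex, content)
        if match:
            props[attribute] = cast(match.group(1))
    for function in functions:
        function(content, props)  # functions mutate props in place
    return props


log = "Facts: 42\nexpanded 17 states\n"
props = run_parser(
    log,
    patterns=[("facts", r"Facts: (\d+)", int)],
    functions=[lambda content, props: props.update(expanded=17)],
)
print(props)  # {'facts': 42, 'expanded': 17}
```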

Environment

class lab.environments.Environment(randomize_task_order=True)[source]

Abstract base class for all environments.

If randomize_task_order is True (default), tasks for runs are started in a random order. This is useful to avoid systematic noise due to, e.g., one of the algorithms being run on a machine with heavy load. Note that due to the randomization, run directories may be pristine while the experiment is running even though the logs say the runs are finished.

class lab.environments.LocalEnvironment(processes=None, **kwargs)[source]

Environment for running experiments locally on a single machine.

If given, processes must be between 1 and #CPUs. If omitted, it will be set to #CPUs.

See Environment for inherited parameters.
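Since the default for processes corresponds to the number of CPUs, an explicit value is mainly useful for leaving headroom. A sketch of one common choice (os.cpu_count is the stdlib way to query #CPUs; the resulting value would be passed as LocalEnvironment(processes=processes)):

```python
import os

# Use all but one core so the machine stays responsive while the
# experiment runs. LocalEnvironment itself defaults to all cores.
processes = max(1, (os.cpu_count() or 1) - 1)
```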

class lab.environments.SlurmEnvironment(email=None, extra_options=None, partition=None, qos=None, time_limit_per_task=None, memory_per_cpu=None, cpus_per_task=1, export=None, setup=None, **kwargs)[source]

Abstract base class for Slurm environments.

If the main experiment step is part of the selected steps, the selected steps are submitted to Slurm. Otherwise, the selected steps are run locally.

Note

If the steps are run by Slurm, this class writes job files to the directory <exppath>-grid-steps and makes them depend on one another. Please inspect the *.log and *.err files in this directory if something goes wrong. Since the job files call the experiment script during execution, it mustn’t be changed during the experiment.

If email is provided and the steps run on the grid, a message will be sent when the last experiment step finishes.

Use extra_options to pass additional options. The extra_options string may contain newlines. The first example below uses only a given set of nodes (additional nodes will be used if the given ones don’t satisfy the resource constraints). The second example shows how to specify a project account (needed on NSC if you’re part of multiple projects).

extra_options="#SBATCH --nodelist=ase[1-5,7,10]"
extra_options="#SBATCH --account=snic2021-5-330"

partition must be a valid Slurm partition name. In Basel you can choose from

  • “infai_1”: 24 nodes with 16 cores, 64 GB memory, 500 GB SATA (default)
  • “infai_2”: 24 nodes with 20 cores, 128 GB memory, 240 GB SSD

qos must be a valid Slurm QOS name. In Basel this must be “normal”.

time_limit_per_task sets the wall-clock time limit for each Slurm task. The BaselSlurmEnvironment subclass uses a default of “0”, i.e., no limit. (Note that there may still be an external limit set in slurm.conf.) The TetralithEnvironment class uses a default of “24:00:00”, i.e., 24 hours. This is because in certain situations, the scheduler prefers to schedule tasks shorter than 24 hours.

memory_per_cpu must be a string specifying the memory allocated for each core. The string must end with one of the letters K, M or G. The default is “3872M”. The value for memory_per_cpu should not surpass the amount of memory that is available per core, which is “3872M” for infai_1 and “6354M” for infai_2. Processes that surpass the memory_per_cpu limit are terminated with SIGKILL. To impose a soft limit that can be caught from within your programs, you can use the memory_limit kwarg of add_command(). Fast Downward users should set memory limits via the driver_options.

Slurm limits the memory with cgroups. Unfortunately, this often fails on our nodes, so we set our own soft memory limit for all Slurm jobs. We derive the soft memory limit by multiplying the value denoted by the memory_per_cpu parameter with 0.98 (the Slurm config file contains “AllowedRAMSpace=99” and we add some slack). We use a soft instead of a hard limit so that child processes can raise the limit.
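The derivation of the soft limit can be reproduced with a short calculation. This is an illustration of the stated rule (memory_per_cpu times 0.98), not Lab's actual code:

```python
def soft_memory_limit_bytes(memory_per_cpu, factor=0.98):
    """Scale a Slurm memory string like '3872M' by the safety factor
    described above and return the result in bytes."""
    units = {"K": 1024, "M": 1024 ** 2, "G": 1024 ** 3}
    value, unit = int(memory_per_cpu[:-1]), memory_per_cpu[-1]
    return int(value * units[unit] * factor)


print(soft_memory_limit_bytes("3872M"))
```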

cpus_per_task sets the number of cores to be allocated per Slurm task (default: 1).

Examples that reserve the maximum amount of memory available per core:

>>> env1 = BaselSlurmEnvironment(partition="infai_1", memory_per_cpu="3872M")
>>> env2 = BaselSlurmEnvironment(partition="infai_2", memory_per_cpu="6354M")

Example that reserves 12 GiB of memory on infai_1:

>>> # 12 * 1024 / 3872 = 3.17 -> round to next int -> 4 cores per task
>>> # 12G / 4 = 3G per core
>>> env = BaselSlurmEnvironment(
...     partition="infai_1",
...     memory_per_cpu="3G",
...     cpus_per_task=4,
... )

Example that reserves 12 GiB of memory on infai_2:

>>> # 12 * 1024 / 6354 = 1.93 -> round to next int -> 2 cores per task
>>> # 12G / 2 = 6G per core
>>> env = BaselSlurmEnvironment(
...     partition="infai_2",
...     memory_per_cpu="6G",
...     cpus_per_task=2,
... )

Use export to specify a list of environment variables that should be exported from the login node to the compute nodes (default: [“PATH”]).

You can alter the environment in which the experiment runs with the setup argument. If given, it must be a string of Bash commands. Example:

# Load Singularity module.
setup="module load Singularity/2.6.1 2> /dev/null"

Slurm limits the number of job array tasks. You must set the appropriate value for your cluster in the MAX_TASKS class variable. Lab groups ceil(runs/MAX_TASKS) runs in one array task.
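The grouping rule above amounts to a ceiling division (illustrative arithmetic only):

```python
import math


def runs_per_array_task(num_runs, max_tasks):
    """Lab packs ceil(num_runs / max_tasks) runs into each of the
    (at most max_tasks) Slurm array tasks."""
    return math.ceil(num_runs / max_tasks)


# E.g., 5000 runs with MAX_TASKS=1000 -> 5 runs per array task.
print(runs_per_array_task(5000, 1000))  # 5
```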

See Environment for inherited parameters.

class lab.environments.BaselSlurmEnvironment(email=None, extra_options=None, partition=None, qos=None, time_limit_per_task=None, memory_per_cpu=None, cpus_per_task=1, export=None, setup=None, **kwargs)[source]

Environment for Basel’s AI group.

class lab.environments.TetralithEnvironment(email=None, extra_options=None, partition=None, qos=None, time_limit_per_task=None, memory_per_cpu=None, cpus_per_task=1, export=None, setup=None, **kwargs)[source]

Environment for the NSC Tetralith cluster in Linköping.

Various

lab.__version__

Lab version number. A “+” is appended to all non-tagged revisions.