SmartDashboard#

SmartDashboard is an add-on to SmartSim that provides a dashboard to help users understand and monitor their SmartSim experiments in a visual way. Configuration, status, and logs are available for all launched entities within an experiment for easy inspection.

A Telemetry Monitor is a background process that is launched along with the experiment that produces the data displayed by SmartDashboard. The Telemetry Monitor can be disabled by adding export SMARTSIM_TELEMETRY_ENABLE=0 as an environment variable. When disabled, SmartDashboard will not display any data. To re-enable, set the SMARTSIM_TELEMETRY_ENABLE environment variable to 1 with export SMARTSIM_TELEMETRY_ENABLE=1.

Experiment metadata is also stored in the .smartsim directory, a hidden folder for internal api use and used by the dashboard. Deletion of the experiment folder will remove all experiment metadata.

Installation#

It’s important to note that SmartDashboard only works while using SmartSim, so SmartSim will need to be installed as well. SmartSim installation docs can be found here.

User Install:

Run pip install smartdashboard to install SmartDashboard without cloning the repository.

Developer Install:

Clone the SmartDashboard repository at CrayLabs/SmartDashboard.git. Once cloned, cd into the repository and run:

pip install -e .

Running SmartDashboard#

After launching a SmartSim experiment, the dashboard can be launched using SmartSim’s CLI.

smart dashboard --port <port number> --directory <experiment directory path>

The port can optionally be specified, otherwise the dashboard port will default to 8501. The directory must be specified and should be a relative or absolute path to the created experiment directory.

--port, -p

port number

--directory, -d

experiment directory

Example workflow:

# directory before running experiment
├── hello_world.py
# hello_world.py
from smartsim import Experiment

exp = Experiment("hello_world_exp", launcher="auto")
run = exp.create_run_settings(exe="echo", exe_args="Hello World!")
run.set_tasks(60)
run.set_tasks_per_node(20)

model = exp.create_model("hello_world", run)
exp.start(model, block=True, summary=True)
# in interactive terminal
python hello_world.py
# directory after running experiment
├── hello_world.py
└── hello_world_exp

By default, hello_world_exp is created in the directory of the driver script.

# in a different interactive terminal
smart dashboard --port 8888 --directory hello_world_exp

The dashboard will automatically open in a browser at port 8888 when smart dashboard is invoked locally.

Note

If the dashboard is executed remotely, establishing port-forwarding to the remote machine will be necessary. This may be accomplished with ssh as follows:

# using ssh to establish port forwarding
ssh -L [local-addr]:<local-port>:<remote-addr>:<remote-port> <user-id>@<remote-addr>

# example forwarding the remote port 8888 to localhost:8000
ssh -L localhost:8000:super1.my.domain.net:8888 smartdash@super1.my.domain.net

After establishing the port-forwarding, a local browser can be pointed at the appropriate URL, such as http://localhost:8000 for the example above.

The dashboard is also persistent, meaning that a user can launch and use the dashboard even after the experiment has completed.

Using SmartDashboard#

Once displayed in the browser, SmartDashboard currently has two tabs on the left hand side.

Experiment Overview: This tab is where configuration information, statuses, and logs are located for each launched entity of the experiment. The Experiment section displays configuration information for the overall experiment and its logs. In the Applications section, also known as SmartSim Models, select a launched application to see its status, what it was configured with, and its logs. The Orchestrators section also provides configuration and status information, as well as logs per shard for a selected orchestrator. Finally, in the Ensembles section, select an ensemble to see its status and configuration. Then select any of its members to see its status, configuration, and logs.

Help: This tab links to SmartSim documentation and provides a SmartSim contact for support.