.. _uses:

Use Cases
+++++++++

Examples By Category
====================

.. raw:: html

Multiprocessing

Distributed Python and interactive Jupyter

Data

Easy-to-use and HPC-optimized data exchange between applications

Telemetry

Observability for applications and workflows

Workflow

AI/HPC workflows across systems and sites

AI

Data loading and resilient training and inference

.. toctree::
   :maxdepth: 2
   :hidden:

   cbook/multiprocessing.rst
   cbook/data.rst
   cbook/telemetry.rst
   cbook/workflows.rst
   cbook/ai.rst
   cbook/core.rst

|

Tutorials
=========

.. raw:: html

Data processing

Process a large dataset in parallel

Orchestrating processes

Orchestrate many runs of a serial executable, each with different arguments

Orchestrating MPI applications

Orchestrate a parameter sweep for an MPI application

Distributed PyTorch

Train with PyTorch across many GPUs

Workflows

Develop a workflow that puts it all together

Running across nodes

Run on a collection of servers or a supercomputer

Jupyter

Use a Jupyter notebook with Dragon

Controlling GPU Affinity

Run functions and processes on specific GPUs

Telemetry with Grafana

Visualize and analyze system and custom metrics

Debugging

Best practices for debugging

.. toctree::
   :maxdepth: 2
   :hidden:

   uses/data_processing.rst
   uses/orchestrate_procs.rst
   uses/orchestrate_mpi.rst
   uses/distributed_training.rst
   uses/workflow.rst
   uses/multinode.rst
   uses/jupyter.rst
   uses/gpus.rst
   uses/grafana.rst
   uses/debugging.rst