Actively Running Jobs:

CCR now offers a detailed view into what is happening on the node(s) where your job is running.  You can view the graphs from OnDemand but you do NOT need to submit the jobs from within OnDemand.  Even jobs submitted from the command line using 'sbatch' are available in your list of Active Jobs.

Click on the arrow next to one of your current jobs and at the bottom you will see two graphs for each node your job is running on.  One graph shows CPU metrics and the other memory (RAM) usage.

When you click on the 'Detailed Metrics' link you will be redirected to the Grafana dashboard.  Here you will see very detailed metrics regarding the node(s) your job is running on including CPU, RAM, network, Infiniband/OmniPath, Disk usage and I/O, and GPU info:

We provide how-to guides on job monitoring as well as other options for monitoring running jobs here

Completed Job Information:

For metrics about job performance and node usage after your job completes, please use the UBMoD portal.  Job data is usually available 24 hours after a job completes.  More details about UBMod

Slurm account information is also available and useful depending on what information you're looking for regarding your jobs.  We provide some suggested commands here.