...
Panel | ||||||
---|---|---|---|---|---|---|
| ||||||
TL;DR: This is a guide for using the University at Albany's DGX On-Prem cluster's free tier. Key points:
The main message is: Always implement checkpointing in your jobs, or you risk losing hours/days of work when preemption occurs. |
...
The free tier provides substantial computing resources with specific limitations such as:
Maximum number of4 concurrent jobs per user
Access to up to 4 a specific number of GPUs and 256 CPUs
Maximum job duration of 7 daysper user
Automatic job requeuing upon preemption
For current free tier limits, please refer to the Service Level Agreement (SLA).
Understanding Job Preemption
...