This document details how to use the ECEN Olympus cluster to remotely access software used in academic Linux labs and for research.
What is the Olympus Cluster
The Olympus cluster consists of the login node (olympus.ece.tamu.edu), eight non-GPU compute nodes and five GPU compute nodes. The cluster has software that ensures users receive the resources needed for their labs and research by distributing users' jobs across the compute nodes based on the user’s requirements. There is limited software installed on the Olympus head node.
Five Nodes - Poweredge 730XD - dual Xeon(R) CPU E5-2650 v3 - 20 cores (40 with HT) per node, 256GB RAM
100 core total
Three Nodes - Poweredge R6525- Dual AMD EPYC 7443 - 48 cores (96 with HT) per node, 256GB RAM
144 core total
Three Nodes - Poweredge C4140 - Dual Xeon(R) Gold 6130 - 32 cores (64 with HT) per node, 196GB RAM, 4 Tesla V100’s per node
96 core and 12 Nvidia V100 total
Two Nodes - PowerEdge R750xa - Dual Xeon(R) Gold 6326 - 32 cores (64 with HT) per node, 256GB RAM, 4 Ampere A100’s per node
64 core and 8 Nvidia A100 total
Cluster Configuration and Usage Limitations
To assure resources are available to all students, the following limitations are enforced.
Nodes are grouped into partitions. The following partitions are configured.
CPU: Eight nodes - Five nodes have academic priority (academic jobs will run on these nodes first)
CPU-RESEARCH: Three nodes - research jobs will run on these nodes - requires PI approval for access
GPU: Five nodes for projects and research - requires PI/Faculty approval for access
Resource allocation is set using Quality of Service groups (qos) in slurm.
QOS name | Hardware Limits | Default Time Limits | Hard Time Limit | Partition |
ugrad (academic) | 4 cpu cores | 12 hours | 12 hours | CPU |
grad (academic) | 6 cpu cores | 12 hours | 12 hours | CPU |
research | 144 cpu cores | 48 hours | 7 days | CPU-Research |
ecen-ugrad-gpu | 8 cpu, 1gpu | 36 hours | 36 hours | GPU |
olympus-research-gpu | 32 cpu, 4gpu | 4 days | 4 days | GPU |
olympus-research-gpu2 | 160 cpu/20 gpu | 7 days | 21 days | GPU |