This document details how to use the ECEN Olympus cluster and how to use it to remotely access Linux software used in academic Linux labs and for research.
...
What is the Olympus Cluster
The Olympus cluster consists of the login node (olympus.ece.tamu.edu), eight non-GPU compute nodes and five GPU compute nodes. The cluster has software that ensures users receive the resources needed for their labs and research by distributing users' jobs across the compute nodes based on their course the user’s requirements. There is limited software installed on the Olympus head node.
Nodes 1Five Nodes - 5 Poweredge 730XD 730XD - dual Xeon (R) CPU E5-2650 v3 - 20 cores (40 with HT) per node, 256GB with 256GB RAM
100 core totaltotal
Three Nodes 6 -8 Poweredge R6525- Dual AMD EPYC 7443 - 48 cores (96 with HT) per node, with 256GB RAM
144 core total
Three Nodes 9 - 11 Poweredge C4140 C4140 - Dual Xeon (R) Gold 6130 - 32 cores (64 with HT) per node, with 196GB RAM, 4 Tesla V100’s per node
96 core and 12 Nvidia V100 total
Two Nodes 12 - 13 PowerEdge R750xa - Dual Xeon (R) Gold 6326 - 32 cores (64 with HT) per node, with 256GB RAM, 4 Tesla Ampere A100’s per node
64 core and 8 Nvidia A100 total
Cluster Configuration and Usage Limitations
To assure resources are available to all students, the following limitations are enforced. Nodes are grouped into partitions. The following partitions are configured.
CPU Nodes: Eight nodes 1 -8. Nodes 1-5 Five nodes have academic priority (academic jobs will run on these nodes first)
CPU-RESEARCH: Nodes 6Three nodes - 8 research jobs will run on these nodes - requires PI approval for access
GPU: Five nodes 9-13 for coursework projects and research - requires PI/Faculty approval for access
Resource allocation is set using Quality of Server Service groups (qos) in slurm.
QOS name | Hardware Limits | Default Time Limits | Hard Time Limit | Partition |
olympus-academic |
6 cpu cores | 12 hours | 12 hours |
academic |
olympus-cpu |
12 hours
12 hours
CPU
Research
-research | 144 cpu cores | 48 hours |
7 days |
cpu- |
research |
olympus-ugrad-gpu | 8 cpu, 1gpu | 36 hours | 36 |
GPU
hours | gpu-research | |||
olympus-research-gpu-sh | 16 cpu 2gpu | 12 hours | 12 hours | gpu-research |
olympus-research-gpu | 32 cpu, 4gpu | 4 days |
4days
GPU
4 days | gpu-research |
olympus-research-gpu2 | 160 cpu |
20 gpu | 7 days |
21 days
14 days | gpu-research |
QOS Uses –
olympus-academic – access to acadmic partition for courses with Linux requirements.
olympus-cpu-research – access to cpu-research partition
olympus-ugrad-gpu – undergraduate access to gpu-research partition
olympus-research-gpu – access to the gpu-research partition
olympus-research-gpu-sh – interactive job access to gpu-research partition
olympus-research2 -unlimited access to gpu-research partition, special case use