Olympus Cluster Information

 

This document details how to use the ECEN Olympus cluster to remotely access software used in academic Linux labs and for research.

What is the Olympus Cluster

The Olympus cluster consists of the login node (olympus.ece.tamu.edu), eight non-GPU compute nodes and five GPU compute nodes.    The cluster has software that ensures users receive the resources needed for their labs and research by distributing users' jobs across the compute nodes based on the user’s requirements. There is limited software installed on the Olympus head node.

 

Five Nodes - Poweredge 730XD - dual Xeon(R) CPU E5-2650 v3 - 20 cores (40 with HT) per node, 256GB RAM

100 core total

 

Three Nodes -  Poweredge R6525- Dual AMD EPYC 7443 - 48 cores (96 with HT) per node, 256GB RAM

144 core total

 

Three Nodes - Poweredge C4140 - Dual Xeon(R) Gold 6130 - 32 cores (64 with HT) per node, 196GB RAM, 4 Tesla V100’s per node

96 core and 12 Nvidia V100 total

 

Two Nodes - PowerEdge R750xa - Dual Xeon(R) Gold 6326 - 32 cores (64 with HT) per node, 256GB RAM, 4 Ampere A100’s per node

64 core and 8 Nvidia A100 total

 

Cluster Configuration and Usage Limitations

To assure resources are available to all students, the following limitations are enforced.

Nodes are grouped into partitions.  The following partitions are configured.

CPU: Eight nodes -  Five nodes have academic priority (academic jobs will run on these nodes first)

CPU-RESEARCH:  Three nodes - research jobs will run on these nodes - requires PI approval for access

GPU:  Five nodes for projects and research - requires PI/Faculty approval for access

 

Resource allocation is set using Quality of Service groups (qos) in slurm. 

QOS name

Hardware Limits

Default Time Limits

Hard Time Limit

Partition

ugrad (academic)

4 cpu cores

12 hours

12 hours

CPU

grad (academic)

6 cpu cores

12 hours

12 hours

CPU

research

144 cpu cores

48 hours

7 days

CPU-Research

ecen-ugrad-gpu

8 cpu, 1gpu

36 hours

36 hours

GPU

olympus-research-gpu

32 cpu, 4gpu

4 days

4 days

GPU

olympus-research-gpu2

160 cpu/20 gpu

7 days

21 days

GPU