This document describes the ECEN Olympus cluster and how to use it to remotely access the Linux software used in academic Linux labs and for research.

...

What is the Olympus Cluster

The Olympus cluster consists of the login node (olympus.ece.tamu.edu), eight non-GPU compute nodes, and five GPU compute nodes. The cluster runs software that ensures users receive the resources needed for their labs and research by distributing users' jobs across the compute nodes based on each user's requirements. Limited software is installed on the Olympus head node.

Five Nodes - PowerEdge 730XD - Dual Xeon (R) CPU E5-2650 v3 - 20 cores (40 with HT) per node, with 256GB RAM

100 cores total

Three Nodes - PowerEdge R6525 - Dual AMD EPYC 7443 - 48 cores (96 with HT) per node, with 256GB RAM

144 cores total

Three Nodes - PowerEdge C4140 - Dual Xeon (R) Gold 6130 - 32 cores (64 with HT) per node, with 196GB RAM, 4 Tesla V100s per node

96 cores and 12 NVIDIA V100s total

Two Nodes - PowerEdge R750xa - Dual Xeon (R) Gold 6326 - 32 cores (64 with HT) per node, with 256GB RAM, 4 Ampere A100s per node
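The per-group totals follow directly from node count times cores (or GPUs) per node. The short Python sketch below reproduces that arithmetic from the figures listed above; the variable and function names are illustrative only, and the R750xa totals are derived from the listed per-node figures rather than quoted from this page.

```python
# Node groups as listed above:
# (description, node count, physical cores per node, GPUs per node)
NODE_GROUPS = [
    ("PowerEdge 730XD, Xeon E5-2650 v3", 5, 20, 0),
    ("PowerEdge R6525, EPYC 7443", 3, 48, 0),
    ("PowerEdge C4140, Xeon Gold 6130 + V100", 3, 32, 4),
    ("PowerEdge R750xa, Xeon Gold 6326 + A100", 2, 32, 4),
]


def group_totals(groups):
    """Return (description, total cores, total GPUs) for each node group."""
    return [(name, nodes * cores, nodes * gpus) for name, nodes, cores, gpus in groups]


totals = group_totals(NODE_GROUPS)
for name, cores, gpus in totals:
    print(f"{name}: {cores} cores, {gpus} GPUs")

# Combined capacity of the five GPU nodes.
gpu_node_cores = sum(cores for _, cores, gpus in totals if gpus > 0)
gpu_count = sum(gpus for _, _, gpus in totals)
print(f"GPU nodes combined: {gpu_node_cores} cores, {gpu_count} GPUs")
```

The combined GPU-node figures (160 cores, 20 GPUs) are simply the sum over the C4140 and R750xa groups.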

...

To ensure resources are available to all students, the following limits are enforced. Nodes are grouped into partitions, configured as follows.

CPU: Eight nodes, 1-8. The five nodes 1-5 have academic priority (academic jobs run on these nodes first).

CPU-RESEARCH: Three nodes, 6-8. Research jobs run on these nodes; access requires PI approval.

GPU: Five nodes, 9-13, for coursework projects and research; access requires PI/faculty approval.

Resource allocation is set using Quality of Service (QOS) groups in Slurm.

| QOS name | Hardware Limits | Default Time Limit | Hard Time Limit | Partition |
| --- | --- | --- | --- | --- |
| olympus-academic | 6 cpu cores | 12 hours | 12 hours | academic |
| olympus-cpu-research | 144 cpu cores | 48 hours | 7 days | cpu-research |
| olympus-ugrad-gpu | 8 cpu, 1 gpu | 36 hours | 36 hours | gpu-research |
| olympus-research-gpu-sh | 16 cpu, 2 gpu | 12 hours | 12 hours | gpu-research |
| olympus-research-gpu | 32 cpu, 4 gpu | 4 days | 4 days | gpu-research |
| olympus-research-gpu2 | 160 cpu, 20 gpu | 7 days | 14 days | gpu-research |
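As a rough illustration of how the hardware limits in the table constrain a request, the Python sketch below encodes the CPU and GPU limits per QOS and checks a request against them. The dictionary and the `fits_qos` helper are hypothetical names introduced here, not part of Slurm or of the Olympus tooling, and the sketch does not say whether Slurm applies these limits per job or per user; the Slurm configuration on the cluster is authoritative.

```python
# Hardware limits transcribed from the QOS table: (max CPU cores, max GPUs).
# Hypothetical helper data for illustration; Slurm itself enforces the real limits.
QOS_HARDWARE_LIMITS = {
    "olympus-academic": (6, 0),
    "olympus-cpu-research": (144, 0),
    "olympus-ugrad-gpu": (8, 1),
    "olympus-research-gpu-sh": (16, 2),
    "olympus-research-gpu": (32, 4),
    "olympus-research-gpu2": (160, 20),
}


def fits_qos(qos, cpus, gpus=0):
    """Return True if a request for `cpus` cores and `gpus` GPUs is within the QOS limits."""
    if qos not in QOS_HARDWARE_LIMITS:
        raise KeyError(f"unknown QOS: {qos}")
    max_cpus, max_gpus = QOS_HARDWARE_LIMITS[qos]
    return 0 < cpus <= max_cpus and 0 <= gpus <= max_gpus


print(fits_qos("olympus-academic", 4))               # within the 6-core limit
print(fits_qos("olympus-ugrad-gpu", 8, gpus=2))      # exceeds the 1-GPU limit
```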

QOS Uses –

olympus-academic – access to the academic partition for courses with Linux requirements.

olympus-cpu-research – access to cpu-research partition

olympus-ugrad-gpu – undergraduate access to gpu-research partition

olympus-research-gpu – access to the gpu-research partition

olympus-research-gpu-sh – interactive job access to gpu-research partition

olympus-research2 – unlimited access to gpu-research partition, for special cases only
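A QOS and partition are normally selected at submission time with Slurm's standard `--qos` and `--partition` options. The Python sketch below assembles an `sbatch` argument list from such a choice. It assumes that Olympus expects these options to be passed explicitly and that GPUs are requested through a generic resource named `gpu`; neither is confirmed on this page. The helper names and the script name `job.sh` are hypothetical.

```python
def slurm_time(hours):
    """Format a whole number of hours as Slurm's D-HH:MM:SS time string."""
    days, rem = divmod(int(hours), 24)
    return f"{days}-{rem:02d}:00:00"


def sbatch_args(script, qos, partition, cpus, hours, gpus=0):
    """Build an sbatch command line as a list of arguments (nothing is executed)."""
    args = [
        "sbatch",
        f"--qos={qos}",
        f"--partition={partition}",
        f"--cpus-per-task={cpus}",
        f"--time={slurm_time(hours)}",
    ]
    if gpus:
        # Assumes the cluster exposes GPUs as the generic resource "gpu".
        args.append(f"--gres=gpu:{gpus}")
    args.append(script)
    return args


# Example: a request sized to the olympus-research-gpu-sh limits.
cmd = sbatch_args("job.sh", "olympus-research-gpu-sh", "gpu-research", cpus=16, hours=12, gpus=2)
print(" ".join(cmd))
```

Building the command as a list keeps each option separate, so it can be passed to `subprocess.run` without shell quoting issues if submission from Python is desired.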

Academic users - instructions for using Olympus

Research - Olympus CPU User Information

Research - Olympus GPU User Information