Current System Load - CPU

The plot below shows the status of nodes on the current ARCHER2 Full System service. A description of each of the status types is provided below the plot.

image

Note: the long running reservation visible in the plot corresponds to the short QoS which is used to support small, short jobs with fast turnaround time.

Current System Load - GPU

image

Service Alerts

The ARCHER2 documentation also covers some Known Issues which users may encounter when using the system.

Status Type Start End Scope User Impact Reason
Ongoing At-Risk 2024-05-20 10:00 2024-05-25 00:00 ARCHER2 Nodes A rolling-reboot to update the compute nodes on ARCHER2 which includes the newer CPE (Cray Programming Environment) 23.09. This will not impact running work but once jobs finish, compute nodes will be rebooted and then be returned to service with the new updated software. Serial work is unaffected.

Maintenance Sessions

This section lists recent and upcoming maintenance sessions. A full list of past maintenance sessions is available.

Status Type Start End Scope User Impact Reason
Completed Full 2024-05-08 09:00 2024-05-08 21:00 Full ARCHER2 System Users will not be able to connect to the login nodes, jobs will not run and users will be unable to access data during this maintenance Replacement of operating system certificates
Completed Slurm 2024-05-01 09:00 2024-05-01 10:35 Slurm maintenance Running jobs will continue to run, but Slurm commands will be unavailable for a few minutes when the controller restarts. Required maintenance
Completed Partial TBC ARCHER2 Running jobs will continue but users will not be able to submit new jobs. Users will be notified when job submission is available again. Integrating the GPU nodes into ARCHER2

Previous Service Alerts

This section lists the five most recent resolved service alerts from the past 30 days. A full list of historical resolved service alerts is available.

Status Type Start End Scope User Impact Reason
Resolved Service Alert 2024-05-22 11:30 2024-05-22 13:30 Access to License server The License server is inaccessible - our team are working to restore
Resolved Service Alert 2024-05-08 14:00 2024-05-08 14:08 Connectivity to ARCHER2 may have a short outage but no impact is expected We do not expect any user impact but if there is an issue it will be a short connectivity outage Changing power supply for the JANET CIENA unit
Resolved Service Alert 2024-05-09 10:00 2024-05-09 12:00 ARCHER2 rundeck ticketing server May be a delay in processing new user requests via SAFE Physical moving of the server hosting the rundeck ticketing system
Resolved Issue 2024-04-26 08:25 2024-04-26 10:00 Serial nodes Serial node dvn01 is currently unavailable. Serial jobs are queued and running but performance may be slower than usual until the issue is resolved.
Resolved Service Alert 2024-04-25 09:30 2024-04-25 10:40 Serial Nodes, DVN01 and DVN02 Users will not be able to use the serial nodes. This means members of n02 will not be able to run jobs as their workflow depends on the serial nodes. We appreciate this is both critical and urgent for this project and HPE are investigating. The heavy load on the metadata server may have impacted the slurm controller and caused the slurm deamon to fail on these nodes. Investigation is ongoing.

System Status mailings

If you would like to receive email notifications about system issues and outages, please subscribe to the System Status Notifications mailing list via SAFE

FAQ

Usage statistics

This section contains data on ARCHER2 usage for Apr 2024. Access to historical usage data is available at the end of the section.

Usage by job size and length

Heatmap of usage job size versus job length

Queue length data

The colour indicates scheduling coefficient which is computed as [run time] divided by [run time + queue time]. A scheduling coefficient of 1 indicates that there was zero time queuing, a scheduling coefficient of 0.5 means that the job spent as long queuing as it did running.

Heatmap of scheduling coefficient job size versus job length

Software usage data

Plot and table of % use and job step size statistics for different software on ARCHER2 for Apr 2024. This data is also available as a CSV file.

Plot of usage by different software

This table shows job step size statistics in cores weighted by usage, total number of job steps and percent usage broken down by different software for Apr 2024.

Software Min Q1 Median Q3 Max Jobs Nodeh PercentUse Users Projects
Overall 1 512.0 1280.0 4608.0 375040 6430590 3714900.3 100.0 934 134
VASP 1 512.0 768.0 1280.0 28672 332288 711860.3 19.2 137 15
Unknown 1 608.0 2048.0 8192.0 375040 5289560 611212.9 16.5 443 94
Met Office UM 1 1024.0 1152.0 12544.0 12544 53700 467363.2 12.6 51 6
GROMACS 1 1024.0 2560.0 2560.0 32768 41986 253036.8 6.8 51 13
LAMMPS 1 512.0 1280.0 2560.0 131072 10325 195019.7 5.2 60 22
CP2K 1 128.0 256.0 512.0 4096 35220 174904.3 4.7 75 9
CASTEP 1 320.0 768.0 1280.0 55296 178861 129017.6 3.5 51 8
Nektar++ 10 5120.0 6144.0 7680.0 12800 699 121538.5 3.3 10 3
Python 1 1024.0 2304.0 2304.0 16384 198553 85797.9 2.3 63 25
OpenFOAM 1 512.0 1280.0 2048.0 12800 3749 84718.6 2.3 48 15
ChemShell 2 256.0 1024.0 7296.0 38400 1907 77780.9 2.1 15 4
SENGA 1 4096.0 6400.0 8192.0 24575 233 75192.6 2.0 5 3
BOUT++ 768 1344.0 1344.0 1344.0 1344 282 59294.9 1.6 1 1
Xcompact3d 2 4096.0 16384.0 16384.0 16384 706 52770.1 1.4 21 7
Nek5000 256 6400.0 25600.0 25600.0 25600 85 51792.7 1.4 5 4
Hydro3D 250 2100.0 2500.0 5280.0 43750 211 49439.2 1.3 4 2
ONETEP 1 128.0 256.0 512.0 2048 3772 44138.6 1.2 8 2
NEMO 1 5504.0 5504.0 5504.0 65536 12612 39761.9 1.1 23 4
Quantum Espresso 1 256.0 896.0 896.0 2432 120494 38870.1 1.0 15 4
FHI aims 1 256.0 512.0 1536.0 4096 78035 35867.2 1.0 17 3
GENE 1 2304.0 4096.0 10240.0 10240 475 35739.7 1.0 4 2
OSIRIS 4096 12288.0 12288.0 36864.0 36864 334 34360.0 0.9 3 3
MITgcm 1 112.0 363.0 624.0 1920 27119 33042.1 0.9 17 3
Code_Saturne 128 4096.0 8192.0 65536.0 131072 156 30138.0 0.8 7 4
CRYSTAL 1 128.0 131072.0 131072.0 131072 5152 27423.6 0.7 8 4
Smilei 1 256.0 256.0 512.0 4096 580 22338.7 0.6 6 1
EPOCH 400 5120.0 5120.0 5120.0 5120 205 21782.8 0.6 3 2
3DNS 8800 8800.0 8800.0 17680.0 17680 19 20130.8 0.5 1 1
WRF 384 384.0 384.0 384.0 384 325 19560.6 0.5 1 1
HYDRA 1 3200.0 12800.0 12800.0 12800 351 14661.7 0.4 8 4
TPLS 16 2048.0 4096.0 4096.0 4096 120 13888.5 0.4 3 2
NWChem 1 256.0 256.0 384.0 13056 25297 11533.4 0.3 11 4
iIMB 256 1862.0 3200.0 6400.0 6400 101 9739.6 0.3 2 2
VAMPIRE 6 512.0 1024.0 2048.0 2048 601 8736.6 0.2 9 3
a.out 1 1000.0 2048.0 4096.0 8192 923 7453.7 0.2 34 10
SU2 128 512.0 1280.0 1280.0 19200 532 6413.9 0.2 6 2
CASINO 1 2048.0 2048.0 3584.0 4096 71 6292.4 0.2 2 2
SBLI 2 4096.0 4096.0 4096.0 16384 104 6248.0 0.2 2 1
CESM 64 768.0 1280.0 4096.0 4096 2399 5875.1 0.2 8 1
NAMD 8 64.0 512.0 512.0 512 1128 5394.0 0.1 5 3
ptau3d 1 160.0 280.0 280.0 1024 106 3313.2 0.1 3 3
DL_POLY 256 2560.0 2560.0 2560.0 2560 8 2037.6 0.1 2 2
RMT 24 640.0 640.0 640.0 2432 198 1975.3 0.1 3 1
EDAMAME 1331 1331.0 1331.0 1331.0 3375 11 1458.2 0.0 2 1
FEniCS 1 131072.0 131072.0 131072.0 131072 21 1301.7 0.0 1 1
PRECISE 4 2048.0 2048.0 2048.0 6144 77 1066.3 0.0 1 1
OpenSBLI 1 131072.0 131072.0 131072.0 131072 9 950.9 0.0 2 2
SIESTA 12 2048.0 2048.0 2560.0 2560 43 503.7 0.0 3 2
HemeLB 1 200.0 200.0 200.0 1024 102 477.9 0.0 5 3
PDNS3D 1024 1024.0 1024.0 1024.0 1024 8 365.9 0.0 1 1
GPAW 1 128.0 128.0 128.0 256 84 342.8 0.0 1 1
Elk 32 32.0 32.0 32.0 32 176 292.0 0.0 1 1
GS2 1 2048.0 2048.0 4096.0 4096 198 286.9 0.0 3 3
FVCOM 640 640.0 640.0 640.0 640 2 205.0 0.0 1 1
ludwig 16 512.0 512.0 1024.0 1024 21 136.9 0.0 1 1
Arm Forge 1 512.0 1280.0 1280.0 2048 217 27.9 0.0 15 9
AxiSEM3D 96 128.0 192.0 192.0 288 30 27.0 0.0 1 1
Amber 128 512.0 512.0 512.0 512 3 0.1 0.0 1 1
CloverLeaf 8 8.0 8.0 8.0 8 3 0.0 0.0 1 1
DL_MESO 1 1.0 1.0 1.0 1 3 0.0 0.0 1 1

Historical usage data

Period Usage Heatmap Queue Heatmap Software usage plot Software usage data
Apr 2024 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Mar 2024 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Feb 2024 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Jan 2024 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Dec 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Nov 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Oct 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Sep 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Aug 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Jul 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Jun 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
May 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Apr 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Mar 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Feb 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Jan 2023 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Dec 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Nov 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Oct 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Sep 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Aug 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Jul 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Jun 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
May 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Apr 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Mar 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Feb 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Jan 2022 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)
Dec 2021 Usage heatmap (PNG) Queue heatmap (PNG) Software usage plot (PNG) Software usage and size data (CSV)