- Current System Load - Full System
- Service Alerts
- Maintenance Sessions
- System Status Mailings
- FAQ
- Usage statistics
Current System Load - Full System
The plot below shows the status of nodes on the current ARCHER2 Full System service. A description of each of the status types is provided below the plot.
- alloc: Nodes running user jobs
- idle: Nodes available for user jobs
- resv: Nodes in reservation and not available for standard user jobs
- plnd: Nodes planned for use by a future job. Pending jobs that fit in the space before the future job is due to start can run on these nodes (often referred to as backfilling).
- down, drain, maint, drng, comp, boot: Nodes unavailable for user jobs
- mix: Nodes in multiple states
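
The backfilling rule described for `plnd` nodes can be illustrated with a minimal sketch. This is not the scheduler's actual implementation: the `PendingJob` class, the `can_backfill` function and all parameter names are hypothetical, and the check assumes that "fitting in the space" means fitting both in node count and in requested wall time.

```python
from dataclasses import dataclass


@dataclass
class PendingJob:
    """Hypothetical description of a job waiting in the queue."""
    name: str
    nodes: int             # number of nodes requested
    walltime_minutes: int  # requested wall-clock limit


def can_backfill(job: PendingJob, free_planned_nodes: int,
                 minutes_until_planned_start: int) -> bool:
    """Return True if the job fits in the gap before the planned job starts.

    Assumption: a job can be backfilled only if it needs no more nodes than
    are currently held as planned, and its requested wall time ends before
    the future job is due to start.
    """
    fits_in_space = job.nodes <= free_planned_nodes
    fits_in_time = job.walltime_minutes <= minutes_until_planned_start
    return fits_in_space and fits_in_time


small = PendingJob("small", nodes=4, walltime_minutes=20)
large = PendingJob("large", nodes=4, walltime_minutes=120)

print(can_backfill(small, free_planned_nodes=8, minutes_until_planned_start=60))
print(can_backfill(large, free_planned_nodes=8, minutes_until_planned_start=60))
```

In this sketch the 20-minute job qualifies and the 120-minute job does not, which is why requesting an accurate, short wall time can help a job start sooner.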
Note: the long-running reservation visible in the plot corresponds to the short QoS, which is used to support small, short jobs with fast turnaround times.
Service Alerts
The ARCHER2 documentation also covers some Known Issues which users may encounter when using the system.
Status | Type | Start | End | Scope | User Impact | Reason |
---|---|---|---|---|---|---|
Ongoing | Service Alert | 2024-03-27 09:00 | 2024-03-28 09:00 | All parallel jobs launched using srun | All parallel jobs launched using `srun` will have their IO profile captured by the Darshan IO profiling tool. In rare cases this may cause jobs to fail or impact performance. Users can disable Darshan by adding the line `module remove darshan` before they use `srun` in their job submission scripts. | Capturing data on the IO use on ARCHER2 to improve the service. |
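
As an illustration of the workaround in the alert above, the following sketch inserts `module remove darshan` before the first `srun` line of a job script held as text. The helper name `disable_darshan`, the example script and the `./my_app` executable are all hypothetical; in practice the line can simply be added to the script by hand.

```python
def disable_darshan(script_text: str) -> str:
    """Insert `module remove darshan` before the first line that calls srun.

    Hypothetical helper: it only recognises lines that begin with `srun`
    (after indentation), and returns the script unchanged if the module
    removal line is already present.
    """
    lines = script_text.splitlines()
    if any(line.strip() == "module remove darshan" for line in lines):
        return script_text  # already disabled, nothing to do
    for i, line in enumerate(lines):
        stripped = line.strip()
        if stripped == "srun" or stripped.startswith("srun "):
            # Reuse the indentation of the srun line for the new line.
            indent = line[: len(line) - len(line.lstrip())]
            lines.insert(i, indent + "module remove darshan")
            break
    return "\n".join(lines) + "\n"


# Hypothetical job submission script, used only for illustration.
example_script = """#!/bin/bash
#SBATCH --nodes=1
#SBATCH --time=00:10:00

srun ./my_app
"""

patched_script = disable_darshan(example_script)
print(patched_script)
```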
Previous Service Alerts
This section lists resolved service alerts from the past 30 days. A full list of historical resolved service alerts is available.
Status | Type | Start | End | Scope | User Impact | Reason |
---|---|---|---|---|---|---|
Resolved | Service Alert | 2024-03-12 09:30 | 2024-03-13 15:30 | ARCHER2 GPU nodes | The ARCHER2 GPU nodes are reserved on Tuesday 12-03-2024 from 09:30 to 17:00 and on Wednesday 13-03-2024 from 09:30 to 15:30. | The GPU nodes are being used for a training course. Normal access will be restored at 15:30 on Wednesday when the course ends. |
Resolved | Service Alert | 2024-03-08 09:00 | 2024-03-12 18:30 | Compute nodes | We are currently using a rolling-reboot to update the compute nodes on ARCHER2. This will not impact running work but once jobs finish, compute nodes will be rebooted and then be returned to service with the new updated software. Serial work is unaffected. | Updates to ARCHER2 compute nodes |
Resolved | Service Alert | 2024-02-23 12:30 | 2024-02-23 16:40 | Some ARCHER2 compute nodes after a power outage | New jobs were stopped and some nodes were down. New jobs are now running and almost all nodes are back in service. | Some compute nodes temporarily lost power |
Maintenance Sessions
This section lists recent and upcoming maintenance sessions. A full list of past maintenance sessions is available.
Status | Type | Start | End | Scope | User Impact | Reason |
---|---|---|---|---|---|---|
Completed | Partial | 2024-03-07 09:00 | 2024-03-07 12:30 | RDFaaS /epsrc and /general file systems | Users will not be able to access data on /epsrc and /general during this maintenance | Replacement of Power Supply Unit (PSU) on the RDFaaS (E1000) |
Completed | Partial | TBC | TBC | ARCHER2 | Running jobs will continue but users will not be able to submit new jobs. Users will be notified when job submission is available again. | Integrating the GPU nodes into ARCHER2 |
System Status mailings
If you would like to receive email notifications about system issues and outages, please subscribe to the System Status Notifications mailing list via SAFE.
FAQ
Usage statistics
This section contains data on ARCHER2 usage for Feb 2024. Access to historical usage data is available at the end of the section.
Usage by job size and length
Queue length data
The colour indicates the scheduling coefficient, which is computed as [run time] divided by [run time + queue time]. A scheduling coefficient of 1 indicates that the job spent no time queuing; a scheduling coefficient of 0.5 means that the job spent as long queuing as it did running.
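
The scheduling coefficient formula can be written as a short function. This is a minimal sketch: the function and parameter names are hypothetical, and both times are assumed to be in the same unit (seconds here).

```python
def scheduling_coefficient(run_time: float, queue_time: float) -> float:
    """Return run_time / (run_time + queue_time).

    Both arguments must use the same time unit and be non-negative.
    """
    if run_time < 0 or queue_time < 0:
        raise ValueError("run_time and queue_time must be non-negative")
    total = run_time + queue_time
    if total == 0:
        raise ValueError("run_time + queue_time must be greater than zero")
    return run_time / total


# A job that ran for one hour without queuing.
print(scheduling_coefficient(run_time=3600, queue_time=0))
# A job that queued for as long as it ran.
print(scheduling_coefficient(run_time=3600, queue_time=3600))
# A job that queued for three times as long as it ran.
print(scheduling_coefficient(run_time=3600, queue_time=10800))
```

Values close to 1 therefore indicate jobs that started almost immediately, while values close to 0 indicate jobs that spent most of their turnaround time in the queue.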
Software usage data
Plot and table of % use and job step size statistics for different software on ARCHER2 for Feb 2024. This data is also available as a CSV file.
This table shows job step size statistics in cores weighted by usage, total number of job steps and percent usage broken down by different software for Feb 2024.
Software | Min | Q1 | Median | Q3 | Max | Jobs | Nodeh | PercentUse | Users | Projects |
---|---|---|---|---|---|---|---|---|---|---|
Overall | 1 | 512.0 | 1024.0 | 3200.0 | 524288 | 3645902 | 3405862.2 | 100.0 | 889 | 132 |
VASP | 1 | 512.0 | 1024.0 | 1280.0 | 8192 | 457284 | 841757.0 | 24.7 | 148 | 17 |
Unknown | 1 | 512.0 | 2048.0 | 8192.0 | 262144 | 1187931 | 522632.1 | 15.3 | 444 | 91 |
Met Office UM | 1 | 1024.0 | 1152.0 | 2304.0 | 12544 | 69371 | 328121.3 | 9.6 | 51 | 3 |
LAMMPS | 1 | 512.0 | 1024.0 | 2560.0 | 131072 | 12385 | 237879.5 | 7.0 | 53 | 22 |
CP2K | 1 | 416.0 | 512.0 | 1024.0 | 8192 | 47145 | 199056.1 | 5.8 | 59 | 12 |
GROMACS | 1 | 512.0 | 1024.0 | 2560.0 | 6144 | 9691 | 187119.6 | 5.5 | 44 | 8 |
ChemShell | 1 | 1536.0 | 6656.0 | 10880.0 | 11136 | 1221 | 104103.1 | 3.1 | 20 | 4 |
OpenFOAM | 1 | 1024.0 | 1536.0 | 4096.0 | 12800 | 2471 | 103781.1 | 3.0 | 39 | 15 |
CASTEP | 1 | 192.0 | 512.0 | 4096.0 | 12800 | 242628 | 79384.1 | 2.3 | 54 | 9 |
MITgcm | 1 | 200.0 | 615.0 | 624.0 | 1920 | 25169 | 78114.5 | 2.3 | 17 | 2 |
Code_Saturne | 128 | 4096.0 | 4096.0 | 131072.0 | 262144 | 236 | 72443.2 | 2.1 | 7 | 4 |
FHI aims | 1 | 512.0 | 1536.0 | 1536.0 | 4096 | 27947 | 65787.8 | 1.9 | 20 | 3 |
Xcompact3d | 4 | 9216.0 | 32768.0 | 32768.0 | 524288 | 441 | 56577.6 | 1.7 | 12 | 7 |
Quantum Espresso | 1 | 256.0 | 896.0 | 1024.0 | 2048 | 104209 | 55549.7 | 1.6 | 18 | 6 |
Python | 1 | 216.0 | 1024.0 | 2816.0 | 9216 | 1373683 | 52485.8 | 1.5 | 67 | 29 |
3DNS | 4 | 17680.0 | 26928.0 | 30512.0 | 50217 | 57 | 47363.7 | 1.4 | 3 | 1 |
NWChem | 1 | 384.0 | 640.0 | 1024.0 | 3840 | 45168 | 40929.3 | 1.2 | 14 | 5 |
Nektar++ | 16 | 3840.0 | 5120.0 | 6400.0 | 12800 | 868 | 38849.5 | 1.1 | 8 | 2 |
NEMO | 1 | 1024.0 | 1568.0 | 6528.0 | 7232 | 23324 | 36368.6 | 1.1 | 51 | 4 |
SENGA | 1 | 13560.0 | 13560.0 | 13560.0 | 37500 | 122 | 34241.9 | 1.0 | 6 | 3 |
ONETEP | 1 | 128.0 | 512.0 | 1024.0 | 1024 | 2572 | 26459.6 | 0.8 | 9 | 2 |
BOUT++ | 768 | 768.0 | 1344.0 | 1344.0 | 1344 | 141 | 26268.4 | 0.8 | 1 | 1 |
SPARTA | 1024 | 65536.0 | 65536.0 | 65536.0 | 65536 | 9 | 25916.2 | 0.8 | 1 | 1 |
EDAMAME | 64 | 1331.0 | 1331.0 | 1331.0 | 8000 | 342 | 22545.6 | 0.7 | 2 | 1 |
GENE | 36 | 4096.0 | 4096.0 | 4096.0 | 10240 | 141 | 15487.3 | 0.5 | 3 | 2 |
Hydro3D | 200 | 2640.0 | 2640.0 | 2640.0 | 16000 | 215 | 11089.9 | 0.3 | 3 | 3 |
CESM | 64 | 1024.0 | 2048.0 | 4096.0 | 4096 | 2273 | 9556.2 | 0.3 | 7 | 1 |
SU2 | 1 | 1024.0 | 3840.0 | 7680.0 | 7680 | 1190 | 9421.2 | 0.3 | 6 | 2 |
OSIRIS | 12288 | 12288.0 | 12288.0 | 12288.0 | 12288 | 17 | 9140.6 | 0.3 | 2 | 2 |
iIMB | 768 | 4608.0 | 4608.0 | 10752.0 | 10752 | 44 | 8952.9 | 0.3 | 2 | 2 |
CRYSTAL | 1 | 128.0 | 256.0 | 32768.0 | 32768 | 816 | 8358.0 | 0.2 | 10 | 5 |
a.out | 1 | 256.0 | 256.0 | 512.0 | 2560 | 320 | 7427.1 | 0.2 | 14 | 9 |
OpenSBLI | 128 | 8192.0 | 8192.0 | 8192.0 | 131072 | 55 | 6631.4 | 0.2 | 2 | 2 |
WRF | 64 | 320.0 | 320.0 | 384.0 | 384 | 219 | 6198.0 | 0.2 | 5 | 3 |
RMT | 512 | 2432.0 | 2432.0 | 2432.0 | 2432 | 184 | 5672.6 | 0.2 | 3 | 1 |
NAMD | 16 | 512.0 | 512.0 | 512.0 | 512 | 474 | 4222.0 | 0.1 | 5 | 4 |
HYDRA | 1 | 6400.0 | 10240.0 | 12800.0 | 12800 | 469 | 3333.4 | 0.1 | 6 | 4 |
CASINO | 1 | 1024.0 | 1280.0 | 1920.0 | 2560 | 113 | 3173.3 | 0.1 | 1 | 1 |
EPOCH | 1 | 16384.0 | 16384.0 | 16384.0 | 16384 | 195 | 2215.3 | 0.1 | 4 | 1 |
TPLS | 64 | 1024.0 | 2048.0 | 2048.0 | 2048 | 37 | 1933.2 | 0.1 | 2 | 2 |
Nek5000 | 256 | 512.0 | 512.0 | 768.0 | 768 | 38 | 1811.4 | 0.1 | 1 | 1 |
SIESTA | 1 | 2048.0 | 2048.0 | 2560.0 | 6656 | 1111 | 1488.2 | 0.0 | 5 | 3 |
GS2 | 32 | 408.0 | 1536.0 | 1664.0 | 1792 | 1272 | 1403.1 | 0.0 | 2 | 1 |
DL_MESO | 8 | 64.0 | 64.0 | 64.0 | 128 | 370 | 1295.4 | 0.0 | 2 | 1 |
SBLI | 1 | 8192.0 | 8192.0 | 8192.0 | 8192 | 283 | 914.6 | 0.0 | 3 | 2 |
PDNS3D | 512 | 1024.0 | 1024.0 | 1024.0 | 1024 | 99 | 790.9 | 0.0 | 2 | 1 |
FVCOM | 1 | 640.0 | 640.0 | 640.0 | 896 | 44 | 600.2 | 0.0 | 2 | 1 |
ptau3d | 1 | 512.0 | 512.0 | 512.0 | 1024 | 72 | 343.8 | 0.0 | 3 | 3 |
DL_POLY | 8 | 64.0 | 4096.0 | 4096.0 | 4096 | 644 | 185.1 | 0.0 | 3 | 3 |
HemeLB | 1 | 768.0 | 768.0 | 768.0 | 2048 | 30 | 135.1 | 0.0 | 3 | 1 |
Arm Forge | 2 | 1024.0 | 1024.0 | 1024.0 | 2048 | 218 | 106.2 | 0.0 | 8 | 7 |
Smilei | 8 | 32.0 | 32.0 | 32.0 | 2048 | 275 | 55.4 | 0.0 | 3 | 1 |
Elk | 4 | 32.0 | 32.0 | 256.0 | 512 | 71 | 48.9 | 0.0 | 2 | 2 |
Fluidity | 8 | 8192.0 | 8192.0 | 8192.0 | 8192 | 7 | 39.6 | 0.0 | 1 | 1 |
AxiSEM3D | 4 | 48.0 | 48.0 | 48.0 | 480 | 140 | 32.9 | 0.0 | 1 | 1 |
GPAW | 32 | 32.0 | 64.0 | 64.0 | 64 | 4 | 24.0 | 0.0 | 1 | 1 |
Amber | 16 | 896.0 | 1280.0 | 1408.0 | 2048 | 45 | 22.0 | 0.0 | 1 | 1 |
ECOGEN | 256 | 256.0 | 256.0 | 256.0 | 256 | 2 | 17.7 | 0.0 | 1 | 1 |
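
As a worked check on the table above, the sketch below parses a few rows and recomputes `PercentUse`, assuming it is each software's share of the overall `Nodeh` total rounded to one decimal place. That interpretation, and the helper names `parse_rows` and `percent_use`, are assumptions for illustration; the rows themselves are copied from the table.

```python
COLUMNS = ["Software", "Min", "Q1", "Median", "Q3", "Max",
           "Jobs", "Nodeh", "PercentUse", "Users", "Projects"]

# A few rows copied from the table above.
TABLE = """\
Overall | 1 | 512.0 | 1024.0 | 3200.0 | 524288 | 3645902 | 3405862.2 | 100.0 | 889 | 132 |
VASP | 1 | 512.0 | 1024.0 | 1280.0 | 8192 | 457284 | 841757.0 | 24.7 | 148 | 17 |
Unknown | 1 | 512.0 | 2048.0 | 8192.0 | 262144 | 1187931 | 522632.1 | 15.3 | 444 | 91 |
Met Office UM | 1 | 1024.0 | 1152.0 | 2304.0 | 12544 | 69371 | 328121.3 | 9.6 | 51 | 3 |
"""


def parse_rows(text: str) -> dict:
    """Parse pipe-separated rows into a dict keyed by software name."""
    records = {}
    for line in text.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        record = dict(zip(COLUMNS, cells))
        records[record["Software"]] = record
    return records


def percent_use(records: dict, software: str) -> float:
    """Share of the overall node hours used by one software, in percent."""
    total = float(records["Overall"]["Nodeh"])
    return round(100.0 * float(records[software]["Nodeh"]) / total, 1)


records = parse_rows(TABLE)
for name in ("VASP", "Unknown", "Met Office UM"):
    print(name, percent_use(records, name), records[name]["PercentUse"])
```

For these three rows the recomputed values agree with the published `PercentUse` column, which is consistent with the assumption but does not confirm how the published figures were produced.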