Overview
The CMS Tier-2 center at Purdue consists of dedicated and opportunistic resources on six computing clusters: Hammer, Brown, Bell, Gilbreth, Geddes, and Negishi.
Hammer, Brown, Bell, and Negishi are large community clusters maintained by ITaP on which CMS has dedicated and opportunistic access to job slots. The Gilbreth cluster is exclusively dedicated to GPU-enabled workflows. The Geddes cluster is currently used to develop an Analysis Facility at the Purdue CMS Tier-2.
Instructions to access the Tier-2 clusters and recommendations about their usage are given here.
The technical details of the hardware installed at the Tier-2 clusters are listed below.
Resources
Network connections
- Wide area networking
- WAN link to Indiana GigaPOP: 100Gb/s dedicated with 100Gb/s backup path
- Data Center Core and LAN
- Storage cluster to Data Center Core: 400Gb/s
- Community clusters: 100Gb/s InfiniBand
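To put the link speeds above in perspective, the following minimal sketch estimates the ideal time needed to move a dataset over a link of a given speed. The function name and the 1 PB example size are illustrative assumptions, and the result is a lower bound that ignores protocol overhead, disk throughput, and competing traffic.

```python
def ideal_transfer_hours(size_tb: float, link_gbps: float) -> float:
    """Lower bound on transfer time in hours (hypothetical helper).

    Assumes decimal units (1 TB = 1e12 bytes, 1 Gb/s = 1e9 bits/s) and a
    fully saturated link with no overhead.
    """
    bits = size_tb * 1e12 * 8             # terabytes to bits
    seconds = bits / (link_gbps * 1e9)    # divide by link speed in bits/s
    return seconds / 3600


if __name__ == "__main__":
    # Link speeds taken from the list above; the 1000 TB (1 PB) size is an example.
    for label, gbps in [("WAN link", 100), ("storage to core", 400)]:
        print(f"{label}: {ideal_transfer_hours(1000, gbps):.1f} h per PB")
```

Under these assumptions, 1 PB takes about 22 hours at 100Gb/s and about 5.6 hours at 400Gb/s; real transfers will be slower.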
Computing/Storage nodes
Name | # nodes | job slots | Model | Compute | GPU | Storage |
cms-i | 7 (*) | - | KingStar | - | - | 144TB (*) |
cms-j | 4 (*) | - | Advanced HPC | - | - | 216TB (*) |
cms-k | 10 | - | KingStar | - | - | 216TB |
cms-l | 3 | - | Aspen | - | - | 288TB |
cms-m | 2 | - | Aspen | - | - | 360TB |
cms-n | 1 | - | KingStar | - | - | 192TB |
cms-jb00 | 1 | - | SuperMicro JBOD | - | - | 840TB |
cms-jb01 | 1 | | WD JBOD | - | - | 1428TB |
cms-jb02 | 1 | | WD JBOD | - | - | 1632TB (102x16TB) |
cms-jb03 | 1 | | WD JBOD + Dell R6515 | - | - | 1632TB (102x16TB) |
cms-jb04 | 1 | | WD JBOD + Dell R640 | - | - | 1428TB (102x14TB) |
cms-jb05 | 1 | | WD JBOD + Dell R640 | - | - | 1428TB (102x14TB) |
cms-jb06 | 1 | | WD JBOD + SM H12SSW-NT | | | 1428TB (102x14TB) |
dtn00,01 | 2 | | ASUS | | | |
dtn02-07 | 6 | | SM H12SSW-NT | Data Transfer Nodes (dtn06 is currently used as XCache server) | | |
ps-eos | 1 | | ASUS K14PA-U24 | PerfSONAR server (close to storage) | | |
hammer-d | 18 | 864 | DELL | 48-core (HT-on) Xeon Gold 6126 @ 2.60GHz, 192GB RAM | - | - |
hammer-e | 15 | 720 | Supermicro | 48-core (HT-on) Xeon Gold 6126 @ 2.60GHz, 96GB RAM | - | - |
hammer-f,g | 22 | 5632 | DELL | 256-core (HT-on) AMD EPYC 7702 @ 2GHz, 512GB RAM | 1x nVidia T4 | - |
bell | 9.5 (**)(#) | 1216 | DELL | 128-core (HT-off) AMD EPYC 7662 @ 2GHz, 256GB RAM | - | - |
gilbreth | 2 (**) | - | DELL | 40-core Xeon Gold 5218R CPU @ 2.10GHz, 192GB RAM | 2x nVidia V100 | - |
geddes CPU | 3 (**) | - | DELL | 128-core (HT-off) AMD EPYC 7662 @ 2.0GHz, 512GB RAM | - | 8TB SSD |
geddes GPU | 2 (**) | - | DELL PowerEdge R7525 | 128-core (HT-off) AMD EPYC 7662 @ 2.0GHz, 512GB RAM | 2x nVidia A100 | 8TB SSD |
Additional Geddes Storage | | | | | | 4TB SSD |
negishi-c | 16 (**) | 4096 | DELL | 256-core (HT-on) AMD EPYC 7763 @ 2.2GHz, 512GB RAM | - | - |
negishi-a | 8.5 (**) | 1088 (8.5x128) | DELL | 128-core (HT-on) AMD EPYC 7763 @ 2.2GHz, 512GB RAM | - | - |
(**) Nodes purchased through the Community Clusters Program. Hardware not owned by CMS.
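The job-slot counts in the table appear to follow the pattern spelled out for negishi-a (8.5x128), that is, number of nodes times cores per node. The sketch below checks that pattern against the rows that list a job-slot count. It assumes one job slot per listed core, which the table does not state explicitly; the names `ROWS`, `expected_slots`, and `mismatches` are illustrative.

```python
from fractions import Fraction

# (partition, nodes, cores per node, job slots), copied from the table above.
# Node counts are kept as strings so fractional values such as 9.5 stay exact.
ROWS = [
    ("hammer-d", "18", 48, 864),
    ("hammer-e", "15", 48, 720),
    ("hammer-f,g", "22", 256, 5632),
    ("bell", "9.5", 128, 1216),
    ("negishi-c", "16", 256, 4096),
    ("negishi-a", "8.5", 128, 1088),
]


def expected_slots(nodes: str, cores_per_node: int) -> int:
    """Job slots under the assumption of one slot per listed core."""
    total = Fraction(nodes) * cores_per_node
    if total.denominator != 1:
        raise ValueError(f"{nodes} nodes x {cores_per_node} cores is not a whole number")
    return int(total)


def mismatches(rows):
    """Return the partitions whose listed job slots differ from nodes x cores."""
    return [
        name
        for name, nodes, cores, slots in rows
        if expected_slots(nodes, cores) != slots
    ]


if __name__ == "__main__":
    bad = mismatches(ROWS)
    print("all rows consistent" if not bad else f"check these rows: {bad}")
```

A check like this is a quick way to catch a stale job-slot figure after nodes are added to or removed from a partition.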
- User interface
- cms-fe00/01: (cms.rcac.purdue.edu) 2x AMD EPYC 7702 64-Core Processors, 1TB RAM, T4 GPU
services: log in, grid UI, Condor + SLURM batch, afs client
Serve both the CMS and Hammer clusters.
- Gatekeepers/Schedulers
- Purdue-Hammer: (hammer-osg.rcac.purdue.edu) 4-cores, 8 GB RAM VM, services: HTCondor-CE
- Purdue-Bell: (bell-osg.rcac.purdue.edu) 4-cores, 8 GB RAM VM, services: HTCondor-CE
- Purdue-Negishi: (osg.negishi.rcac.purdue.edu) 4-cores, 8 GB RAM VM, services: HTCondor-CE
- Analysis Facility
- 3 nodes (af-a0[0-2].cms.rcac.purdue.edu): 2x AMD EPYC 7662 64-core CPUs, 512GB RAM
- Miscellaneous
- XRooTD: (xrootd.rcac.purdue.edu) 12-cores, AMD Opteron(tm) Processor 4280, 16 GB RAM, service: xrootd
- Squid: 6 squid instances in total, three on each of the two squid servers.
- squid1.rcac.purdue.edu: 8-cores, Intel(R) Xeon(R) CPU E31240 @ 2.1 GHz, 8 GB RAM, service: squid
- squid2.rcac.purdue.edu: 8-cores, Intel(R) Xeon(R) CPU E31240 @ 2.1 GHz, 8 GB RAM, service: squid
- Perfsonar:
- Perfsonar latency node: (perfsonar-cms1.itns.purdue.edu) 8-cores, AMD Opteron(tm) Processor 2380 @ 2.4 GHz, 16 GB RAM, service: latency
- Perfsonar bandwidth node: (perfsonar-cms2.itns.purdue.edu) 8-cores, AMD Opteron(tm) Processor 2380 @ 2.4GHz, 16 GB RAM, service: bandwidth
- GridFTP Redirector: (cms-gridftp.rcac.purdue.edu) 8-cores, Quad-Core AMD Opteron(tm) Processor 2380 @ 0.8 GHz, 16 GB RAM, service: srm
- GridFTP: 45 instances
- XRooTD: 45 instances
Storage:
- EOS storage: c,j,k,l nodes, plus all JBODs (c-nodes to be retired)
- HDFS storage: e,f,g,h,i nodes (currently read-only; to be retired soon)
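Since all JBODs belong to EOS, their combined raw capacity can be tallied from the Computing/Storage nodes table. The sketch below sums the listed JBOD capacities and cross-checks the rows that give a disk layout (for example 102x16TB). The figures are copied from the table; the names are illustrative, and the total is raw capacity as listed, not usable EOS space, which depends on the redundancy layout and is not given here.

```python
# name: (listed capacity in TB, (disk count, disk size in TB) or None if not given)
JBODS = {
    "cms-jb00": (840, None),
    "cms-jb01": (1428, None),
    "cms-jb02": (1632, (102, 16)),
    "cms-jb03": (1632, (102, 16)),
    "cms-jb04": (1428, (102, 14)),
    "cms-jb05": (1428, (102, 14)),
    "cms-jb06": (1428, (102, 14)),
}


def inconsistent_layouts(jbods):
    """Return JBODs whose disk count x disk size differs from the listed capacity."""
    bad = []
    for name, (capacity_tb, layout) in jbods.items():
        if layout is not None:
            disks, disk_tb = layout
            if disks * disk_tb != capacity_tb:
                bad.append(name)
    return bad


def total_raw_tb(jbods):
    """Sum of the listed raw capacities in TB."""
    return sum(capacity_tb for capacity_tb, _ in jbods.values())


if __name__ == "__main__":
    print("layout mismatches:", inconsistent_layouts(JBODS))
    print("total JBOD raw capacity (TB):", total_raw_tb(JBODS))
```

With the figures above, the seven JBODs add up to 9816TB of raw capacity. The c, j, k, and l nodes are left out because the table does not make clear whether their storage figures are per node or per group.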