Sr. HPC System Administrator
Newcastle Associates, Inc.
Chicago, IL
JOB DETAILS
SKILLS
Ansible, Automation, Customer Support/Service, Distributed Computing, File Systems, IBM Product Family, Identify Issues, Large-Scale Systems, MPI, Network Administration/Management, Network Support, Network Switching, Network Systems, OpenMP, Operating Systems, Performance Analysis, Performance Management, Performance Tuning/Optimization, Puppet (Configuration Management), Science Software, Scientific Research, Software Administration, System Operations, Systems Administration/Management, Technical Support
LOCATION
Chicago, IL
POSTED
11 days ago
As a Sr. HPC Systems Administrator, you will be a key member of a team that provides high-end research computing resources to researchers at a world-class university and scientific research institution.
The team is dedicated to enabling research by providing access to centrally managed High Performance Computing (HPC), storage, and visualization resources. These resources include hardware, software, high-level scientific and technical user support, and the education and training required to help researchers make full use of modern HPC technology and local and national super-computing resources.
You'll oversee day-to-day operations of the systems including systems administration, monitoring and storage performance up to and including network components. Also you'll manage the system’s network switch, parallel file system and HPC software stack and tools.
To be successful you should have the following qualifications:
- 5+ years of professional experience supporting HPC compilers and libraries.
- Installing, configuring, and maintaining job management tools (such as SLURM, Moab, TORQUE, PBS, etc.).
- Configuring, installing and troubleshooting MPI and OpenMP.
- Hands-on experience of at least one distributed file system (Spectrum Scale-GPFS, Lustre, BeeGFS, Gluster, IMRIX, PVFS, etc.).
- Operating system deployment tools (e.g. XCAT, ROCKS).
- Configuring, administering, and supporting network storage subsystems (e.g. IBM, NetAppl DataDirect Network, LSI, etc.).
- Direct experience working with Infiniband (must at least be able to demonstrate a working knowledge of Infiniband concepts, OFED layers, sub-net managers).
- Configuring, installing, tuning and maintaining scientific application software on large-scale systems.
- Experience with systems automation tools such as Ansible or Puppet.
- Configuring, installing, maintaining and/or using performance monitoring and optimization tools.
- Bachelors Degree
You are welcome to send your resume for quick consideration.
Eligibility to accept permanent employment without visa sponsorship is required.
Position is based onsite in the Chicago area
About the Company
N
Newcastle Associates, Inc.
o make the right fit, we consider all of the factors involved in a successful placement, such as immediate productivity, long-term growth potential, professional career development, compatibility with personal needs, as well as organizational fit.
We maintain an extensive network of contacts and business relationships that we leverage to identify high caliber candidates and unique career opportunities.
Our approach is thorough, respectful, honest and business oriented. We recognize the significance and complexity of hiring decisions, and provide the information, support and guidance to help everyone in the process make the right choice.
COMPANY SIZE
500 to 999 employeesINDUSTRY
Business Services - Other