Systems Engineer - HPC & GPU Infrastructure

Leidos Holdings Inc

Bethesda, MD

JOB DETAILS
SALARY
$87,100–$157,450 Per Year
SKILLS
Ansible, Apache, Automation, Benchmarking, Cloud Computing, Communication Skills, Computer Architecture, Computer Graphics, Computer Hardware, Computer Networks, Computer Science, DHCP (Dynamic Host Configuration Protocol), DNS (Domain Name System), Debugging Skills, Desktop PC, Distributed Computing, Electrical Engineering, Environmental Engineering, Firewalls, GPU (Graphics Processing Unit), Government Contracts, Graphics Algorithms, HSRP (Hot Standby Router Protocol), Hardware Architecture, Integrated Circuits (ICs), Intelligence Community, Intrusion Detection Systems, Intrusion Prevention Systems, Legal, Linux Operating System, Machine Tool, Network Cable, Network Operations Center, Network Security, Network System Hardware, Operating Systems, Operations Management, Parallel Computing, Performance Analysis, Performance Management, Performance Tuning/Optimization, Power Management, Problem Solving Skills, Puppet (Configuration Management), SNMP (Simple Network Management Protocol), Sensitive Compartmented Information (SCI), Strategic Planning, Stress Testing, System Integration (SI), Systems Engineering, TCP/IP (Transmission Control Protocol/Internet Protocol), Team Player, Test Design, Test Suite, Top Secret Clearance, United States Citizen, VLAN (Virtual Local Area Network), Validation Testing, Virtualization
LOCATION
Bethesda, MD
POSTED
18 days ago

Leidos is looking for a mid-career Systems Engineer (Mid-Career) - HPC & GPU Infrastructure with a deep understanding of operating systems, hardware, Kubernetes, and NVIDIA GPU products. As a Systems Engineer (Mid-Career) - HPC & GPU Infrastructure, you will play a pivotal role in designing, developing, and optimizing GPU clusters for the IC community customers.

This is a 100% on-site position. All work must be performed at the customer site in Bethesda at the Intelligence Community Campus.

Responsibilities:

  • HPC and GPU environment engineering: Contribute to the installation and maintenance of GPU and HPC hardware on-prem and in the cloud, providing insights into hardware performance to ensure efficient interaction with software components.

  • Performance Optimization: Analyze HPC/GPU cluster performance, identify bottlenecks, and develop strategies to enhance performance across various applications in Linux, addressing both hardware and software considerations. Regularly monitor and improve performance.

  • HPC/GPU tooling: Install and configure HPC/GPU job scheduling and workload management platforms such as Slurm , PBS , Apache Airflow , Kubernetes

  • Power Efficiency: Work on power management techniques to optimize GPU power consumption, ensuring efficient operation on both mobile and desktop Linux platforms. Continuously assess and enhance power efficiency strategies.

  • Testing and Validation: Design and execute tests to validate GPU performance and functionality on Linux, including stress testing, benchmarking, and debugging to ensure robust operation. Maintain and expand the testing suite.

  • Documentation: Maintain comprehensive technical documentation, including architectural specifications, code documentation, and Linux-specific best practices for GPU development. Keep documentation up to date with changes and improvements.

  • Industry Insight: Stay updated on the latest trends, innovations, and competitive landscapes within the GPU industry, contributing to research efforts and proposing Linux-specific approaches to GPU design and optimization. Share regular updates and insights with the team.

You Bring

  • Bachelor's or higher degree in Computer Science, Electrical Engineering, or a related field. Additional years of experience may be considered in lieu of a degree.

  • 4+ years of relevant systems engineering experience

  • Expertise in operating system integration for Linux.

  • Strong understanding of computer hardware architecture, particularly as it relates to Linux systems.

  • Knowledge of parallel computing, graphics algorithms, and real-time rendering in Linux environments.

  • Excellent problem-solving skills and the ability to collaborate within a team.

  • Strong communication skills for conveying technical information in a Linux context.

  • Proficiency with scripting languages such as Python or BASH.

  • Proficiency with automation tools such Ansible, Puppet, Salt, Terraform, etc.

  • Candidate must, at a minimum, meet DoD 8570.11- IAT Level II certification requirements (currently Security+ CE, CCNA-Security, GICSP, GSEC, or SSCP along with an appropriate computing environment (CE) certification). An IAT Level III certification would also be acceptable (CASP+, CCNP Security, CISA, CISSP, GCED, GCIH, CCSP).

Clearance

  • Active TS/SCI clearance with Polygraph required OR active TS/SCI and willingness to obtain and maintain a Poly.

  • US Citizenship is required due to the nature of the government contracts we support.

Preferred Qualifications

  • Knowledge of GPU virtualization, cloud computing, and emerging Linux-based technologies in the field.

  • Experience with container technologies (Docker, Kubernetes)

  • Experience with Prometheus/Grafana for monitoring

  • Knowledge of distributed resource scheduling systems

  • Understanding data center networking hardware and cabling concepts.

  • Understanding of networking technologies such as DHCP, DNS, TCP/IP, VLANs, HSRP, and SNMP.

  • Knowledge of data center networking security principles Firewall ACLs, IPS/IDS, and Policy Based Routing.

#NMEC DTP-leidos

If you're looking for comfort, keep scrolling. At Leidos, we outthink, outbuild, and outpace the status quo - because the mission demands it. We're not hiring followers. We're recruiting the ones who disrupt, provoke, and refuse to fail. Step 10 is ancient history. We're already at step 30 - and moving faster than anyone else dares.

Original Posting:

June 8, 2026

For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.

Pay Range:

Pay Range $87,100.00 - $157,450.00

The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.

About the Company

L

Leidos Holdings Inc

SAIC is a premier Fortune 500® technology integrator driving our nation's digital transformation. Our robust portfolio of offerings across the defense, space, civilian, and intelligence markets includes secure high-end solutions in engineering, IT modernization, and mission solutions. Using our expertise and understanding of existing and emerging technologies, we integrate the best components from our own portfolio and our partner ecosystem to deliver innovative, effective, and efficient solutions that are critical to achieving our customers' missions. We are a team of 26,000 strong driven by mission, united purpose, and inspired by opportunity. Headquartered in Reston, Virginia, SAIC has annual revenues of approximately $7.1 billion. For more information, visit saic.com.
COMPANY SIZE
10,000 employees or more
INDUSTRY
Computer/IT Services
FOUNDED
2013
WEBSITE
https://jobs.saic.com/