Role: HPC System Administrator
Location: Onsite in Salt Lake City, Utah, or Annapolis, MD
Type: Full-time
SUMMARY: HPC System Administrator, TS/SCI with Full Scope Poly clearance. Location: Annapolis Junction, MD
Job Description: Our direct client is seeking highly qualified, experienced, and motivated candidates for a High-Performance Computing (HPC) Systems Administrator (SA) to join its HPC team for our Federal Government customer. As the Federal HPC Systems Administrator, you will use your knowledge and expertise to support the customer’s software needs and to maintain the operational stability and performance of the HPC Cluster Solution and its integration into the existing customer computing infrastructure. Your work will be conducted on premises at the Federal Government facility in Maryland. TS/SCI with Full Scope Poly required.
In this role you will:
Work in partnership with the other HPE system administrators and Government system administrators to resolve issues and maximize system uptime
Provide system administration support and hands-on training to Government personnel
Provide expertise to establish the stability and performance of the Solution, software and applications programming support, and integration into the existing computing infrastructure
Take the actions necessary to ensure that the warranty remains in effect, maintain the system documentation under configuration management, and provide for the continual operation, stability, and performance of the Solution
Support, install, upgrade, and configure the software, and assist the Government in its use of software and software tools for the Solution
Mitigate any identified security vulnerabilities within a 30-day period
Track software tool errors and report them to the HPE quality control department
Provide expertise in system resource allocation and OS tuning and configuration
Write scripts for the use and configuration of standard OS utilities
Assist users with system operation and application design, optimization, and debugging
Assist users in finding performance issues in their applications and help users understand and measure the performance limits of the Solution
Education and Experience Required
A current, active TS/SCI with Full Scope Poly clearance is required.
US Citizenship is required
10+ years of experience as a system administrator in a Linux cluster environment
Bachelor’s degree in a technical discipline from an accredited college or university, or 15 years of relevant experience
Experience with High Availability systems
Experience writing and troubleshooting scripts
Desired
Experience with configuration management tools
Experience setting up and troubleshooting remote network OS installations
Experience in multiple scripting languages, such as Bash, Perl, Expect, or Python
Experience with CentOS or Red Hat Enterprise Linux
Experience with Pacemaker / Corosync High Availability systems
Experience deploying and/or supporting Lustre or other parallel file systems
InfiniBand experience
Experience with the Ansible configuration management tool