Linux Engineer - Contract Role, Gaithersburg, MD Linux engineer needed to help build and strengthen a rapidly expand Linux environment. The infrastructure supports cutting edge medical technology. The high-level view: 1) Add appropriate admin redundancy for local HPC and Linux systems. 2) Proactively manage local HPC and storage, including developing and maintaining a long-term roadmap/budget for necessary expansion and replacement of obsolete hardware, taking into account our ongoing migration to cloud. 3) Establish and execute a process for control and validation of system updates (e.g. OS, package/library updates). The detailed view:
Install, configure, maintain and refresh local HPC and various Linux systems in production and staging environments
Proactively manage local HPC and storage including developing and maintaining a long-term roadmap/budget for necessary expansion and replacement of obsolete hardware, taking into account our ongoing migration to cloud
Establish and execute a process for control and validation of system updates (e.g. OS, package/library updates)
Support large-scale, rapidly growing Linux server environment on-prem and cloud (Azure)
Reduce single points of failure in the server environment
Collaborate to build tools and scripts for automating various system administration tasks
Performance tuning and backup for both pre-prod and prod environments
Optimize hardware use for various applications, including in-house IO intensive applications and distributed databases
Tier 3 troubleshooting of hardware and system issues, including root cause analysis, incident reporting, communication and escalation
Evaluate cloud technology and hardware platforms - integrating them into our production environment
Maintain system security by remediating identified faults and vulnerable areas within the system or application
Provide documentation of supported systems, processes and access control
Work with internal customers and peers (Systems engineering, Networking, Cybersecurity etc.) through tickets / email / phone
Contribute to BRLI's Business Continuity Planning/Disaster Recovery efforts