The High Performance Computing Lead shall be responsible for:
Directing/Managing/Supervising the design, development, implementation, integration, and maintenance of research technology infrastructure and systems capabilities to support the organization's business objectives.
Leading the design, implementation, and management of security systems and redundant data backups. Ensuring the development of cost-effective systems and operations to meet current and future research requirements.
Overseeing the analysis of research problems and leading the evaluation, development, and recommendation of specific technology products and platforms to provide cost-effective solutions that meet business and technology requirements.
Ensuring design of best-fit infrastructure, network, database, and/or security architectures.
Guiding configuration, debugging, and support for infrastructure and ensuring all quality and change control standards are met.
Coordinating strategically aligned Research Computing support to the BCH research community, with a great deal of independence, by applying advanced informatics tools, methods, and technologies in support of computational biology
Evaluating new technologies and methods with feedback and guidance from internal and external colleagues.
Conducting analyses to identify and design solutions with limited information and based on trends; incorporating feedback from internal and external forums (conferences, presentations) and broadening use; architecting evolution of research computing tools, methods, and technologies, data migration, documentation, and processes.
Ensuring high level of customer service is being performed by Research Computing as per agreed upon metrics; advocating for Boston Children's Hospital research community and ensuring delivery of services; overseeing and optimizing customer-facing processes.
Developing and reviewing training curriculum; training staff and researchers; effectively tailoring presentations; developing, implementing, and maintaining knowledge management systems.
Presenting at steering committees, scientific meetings, and working groups; effectively conveying progress and asserting point of view; constructively discussing issues and providing facts.
In order to qualify you must have:
4-year STEM Bachelor's degree or 4-years of STEM experience
A minimum of one year of position-specific experience might potentially include experience acquired through MSc or Ph.D. studies
Deep expert knowledge in distributed computing (CUDA, OpenMP, MPI, Spark), high-performance and cloud computing (HPC, SLURM, SGE, AWS), data science software implementation (Docker, Podman, Singularity, Conda, Venv, Make, CMake), container orchestration (Kubernetes, Openshift), version control tools (SVN, git), common AWS services (EC2, S3, Lambda) and infrastructure automation (Ansible, Terraform, Puppet, Chef)
Deep expert knowledge of advanced computational workflows including distributed artificial intelligence and machine-learning models (Tensorflow, Keras, Horovod, PySpark).
Deep expert knowledge of technologies used in support of biomedical research, such as programming languages (R, Python, C, Java), scripting (bash, shell, Perl), databases (Postgres, MySQL), and biomedical software. Experience with using, customizing, and analyzing service request management software, knowledge management, and project tracking systems
Extensive knowledge of novel and innovative technologies used in support of biomedical research. Experience working with biomedical and/or patient-level data, and ability to follow stringent data privacy requirements.
Ability to resolve a wide range of complex and innovative Research Computing assignments in support of biomedical research; ability to routinely lead complex and innovative projects and coordinate multi-disciplinary project teams.