Senior Data Engineer

Senior Data Engineer

22 Jan 2024
California, Lajolla, 92037 Lajolla USA

Senior Data Engineer

Vacancy expired!

In partnership with ExxonMobil, Synthetic Genomics, Inc. (SGI) is growing algae biofuels to one day power planes, propel ships and fuel trucks - ultimately offering the potential to cut emissions in half. SGI's research spans from developing genetically engineered algae strains to cultivating acres of energy-rich algae at our state-of-the-art farm in California’s Imperial Valley. At the center of this research is SGI's Research Informatics Platform, which is responsible for the automated collection and analysis of IoT sensor data and sophisticated laboratory measurements. SGI's Research Informatics Platform provides a common operating picture that fosters cross team collaborations and provides an increased understanding of the factors driving performance variation across the scales from lab to farm.
To improve automation and reduce time to actionable insights within the Research Informatics Platform, SGI is looking for a Senior Data Engineer to join its development team. We are looking for creative problem solvers with both a passion for innovation and a focus on delivering technical solutions. As a Senior Data Engineer, your work will improve the quality, reliability, accuracy and consistency of our research data. You will also work with the team to design, build and deploy data science and analytic solutions at scale.

Responsibilities
  • Build project-specific data pipelines (ETL processes) and validation tools using Python, SQL and AWS cloud technologies
  • Partner with members of the Research Informatics team to define requirements for the Research Informatics Platform
  • Implement data models, database schemas, data structures and processing logic to support automated insights
  • Use best practices for code development, optimization and unit testing
  • Collaborate with data scientists to define SLAs for data availability, quality, usability and correctness.
  • Develop and maintain automated data availability, quality monitoring, and alerting for the Research Informatics Platform
  • Manage concurrent requests from multiple research teams and strategically, prioritize when necessary

Qualifications
  • BS in Computer Science, Information Architecture, Mathematics or similar field with a MS preferred
  • 7+ years of data engineering experience
  • 3+ years of experience building and operating scalable data pipelines
  • 3+ years of hands-on experience developing solutions with a MPP data warehouse (e.g., Redshift, Teradata, Vertica MPP)

Key Skills
  • Advanced programming skills, including object-oriented programming; Proficient with Python and JavaScript; willingness to learn other languages as needed.
  • Significant knowledge of database architecture, data modeling, SQL query solution design and coding, query optimization, and performance tuning
  • Experience building data pipelines sourced from both Web APIs and Web-Service APIs
  • Effectively communicate and collaborate with business and scientific leads from other organizations
  • Ability to work with ambiguous requirements and be comfortable exploring new technology and making your own tools when standard approaches don’t meet requirements
  • Passionate about delivering high quality, data solutions to further scientific research within an algal biofuels program

Job Details

  • ID
    JC8460035
  • State
  • City
  • Job type
    Permanent
  • Salary
    Depends on Experience
  • Hiring Company
    Synthetic Genomics
  • Date
    2021-01-19
  • Deadline
    2021-03-20
  • Category

Jocancy Online Job Portal by jobSearchi.