Vacancy expired!
This role can be 100% remote or full time in Denver, CO
The Role:
Site Reliability Engineers (SREs) at FullContact are focussed on the uptime, reliability and observability of all of the critical software systems used throughout the company. The SRE team works embedded with the various FullContact engineering teams, attending their planning, standups and contributing to day to day work. As an SRE you will help advance our services towards more automated and self healing systems that are easy to maintain. This requires working together with all of engineering to help define and implement metrics and alerting that increase the robustness, scalability, performance and cost efficiency of the underlying systems. In addition to metrics and stability SREs are also relied on as a source of knowledge and expertise for our underlying infrastructure, security, and CI/CD stack.
As an SRE on the FullContact team, one of the first projects will be to help drive our implementation of Spark on Kubernetes (EMR + EKS) and working with the teams to migrate jobs with an eye towards performance and cost.
Our engineering environment uses a vast variety of technologies and frameworks; a successful SRE doesn’t need to be an expert in all or most of them but does need to be open and eager to learn, contribute, and build great things that help move the team forward. Technologies used at FullContact include: