Cloud Operations Engineer

Cloud Operations Engineer

23 Jul 2024
California, Sanjose, 95101 Sanjose USA

Cloud Operations Engineer

Vacancy expired!

This Jobot Job is hosted by: Dee Nguyen
Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume.

A bit about us:

Our client has been helping service provider customers deliver the very best TV experiences to as many of their subscribers as possible. We constantly leverage the latest, technological innovations to help improve performance - and future-proof CapEx-heavy CPE investments.

Why join us?

Our people are a smart, nimble and talented bunch who work hard and have fun. We offer frequent all-hands meeting with the CEO, company sponsored events, celebrations and rewards as well as the opportunity to work in a small but dynamic environment that is changing the world of virtualization.

Our vision is any experience, any network, any device. We are uniquely positioned to make it happen, and we are always looking for the best talent that is key to our innovations.

Job Details

About the role:
As a Cloud Operations Engineer you will use leading edge technologies to build, deploy, operate, and maintain configuration management and orchestration routines to deliver and scale applications and services in virtualized environments and in the cloud as part of a small, geographically distributed Cloud Operations team. We are committed to delivering best-in-class system uptime and operations observability through automation and instrumentation.

This is a key technical role within the Cloud Platform and Operations organization that interfaces closely with the Cloud Infrastructure, Engineering, Product and Customer Engagement teams. The ideal candidate is a self-starter with a strong focus on collaboration, automation, continuous improvement, and an innovative mindset that will lead to operational efficiencies, and increased compliance. The candidate must have a track-record of managing live productions environments with strict availability targets and extensive experience building automation to support continuous software releases.

What you'll do:
Deliver configuration management and orchestration routines to deploy and scale applications and services in virtualized and cloud environments; operate and maintain these routines in production
Support product development teams in the delivery of continuous integration, continuous deployment, providing templates and patterns to follow to ensure code produced by product development teams can be deployed and scaled on standardized technologies and platforms
Perform root cause analysis for production issues where the root cause is in infrastructure, environment, configuration, or deployment routines; understand when to escalate to product development teams; remediate root causes and implement preventative actions
Participate in on-call rotation and afterhours maintenance when necessary, respond to major incidents, and participate in bridge calls when called upon in support of initiatives and incident response
Actively collaborate with the product, engineering and QA teams to build automated testing and monitoring of deployments
Revamp and continuously optimize application release cycles as production environments and product suite scales
Participate in Change Management activities which include reviews, approvals, rollback plans, and live operations transition
Applying automation where possible to reduce manual and repetitive tasks
Architect and implement monitoring, reporting and centralized dashboarding solutions with visibility to internal and external customers
What you'll need:
3 + years of experience as a DevOps/SRE Engineer operating an Public Cloud platform
2+ years experience working with configuration management and orchestration technologies such as Cloud Formation, Ansible or comparable
Knowledge of application performance monitoring
Knowledge of cloud infrastructure principles (load balancing, high availability, server-based and serverless architecture, database configurations)
Extensive knowledge of troubleshooting in a Linux environment
Experience managing cloud-native applications in Docker containers with Kubernetes orchestration
In-depth knowledge of Bamboo, Jenkins, Artifactory or similar CI/CD tools
Proficiency in Python, bash or other programming language
Ability to quickly learn new and existing technologies
Experience troubleshooting using monitoring and logging tools such as Splunk, DataDog, NewRelic, etc in complex cloud-based environments
Ability to work in fast paced and dynamic environment
Strong written and verbal communication skills
AWS Certified - Associate Certification

Interested in hearing more? Easy Apply now by clicking the "Apply Now" button.

Job Details

Jocancy Online Job Portal by jobSearchi.