Senior Site Reliability Engineer for Contract project in REMOTE
We are looking for-
Senior Site Reliability Engineer for Contract project in REMOTE (could be anywhere in US). Below is the detailed requirement. Title: Senior Site Reliability Engineer Location: REMOTE - North Carolina, US Duration: 12+ months Job Description:
Actively support and own the planning, design, implementation and integration of recently released software solutions in customer environments.
Drive solution delivery improvements through automation, testing and standard methodologies' implementation to optimize deployments, accelerate customer value-realization and improve overall online service reliability.
Practice SRE delivery principles, including building CI/CD pipelines and measure KPIs via Service Level Indicators (SLIs), Service Level Objectives (SLOs) and Service Level Agreement (SLAs).
Interact with professional services delivery, product management, and BU teams to integrate automation in our offers to improve quality and customer experience.
Collaborate in groundbreaking initiatives to define and improve new delivery methodologies in accordance with DevOps and SRE models
Roles & Responsibilities
Apply technical knowledge and customer insights to build a modernization roadmap add value and improve reliability on a software solution. Architect solutions to meet business and IT needs, ensuring technical viability of new projects and successful deployments, while orchestrating key resources and infusing key Infrastructure technologies (e.g. Windows and Linux IaaS, Security, Networking, etc.), and Application Development and DevOps technologies (e.g. App Service, containers, serverless, cloud native, etc.) as appropriate
Enterprise-scale technical experience with public and private cloud and hybrid infrastructures, architecture designs, migrations, and technology management required. Experience and understanding of large-scale application portfolios in enterprise-wide environments (including migration of on-premises workloads to the cloud) required.
Domain expertise across 2 or more of the following areas: release engineering, incident management, monitoring, self-service automation, change management, performance and chaos engineering.
Understanding of networking and software defined implementations (SDN).
Produced or supplied to public body of work around automation (GitHub repo, blog, open-source project).
Ability to define good SLI, SLO and error budget to maintain reliability of any system.
8-10+ years’ experience after a B.S or M.S in Computer Science, Electrical Engineering, or a related field.
Strong technical expertise in one of Cisco Security, Cloud or Data Center technologies is required.
Domain expertise in software development, programmability, automation and DevOps related tools with a minimum of 2 years of experience in 3 or more of the following areas:
Automation and Analytics tools: Ansible, Splunk
Linux and Virtualization: VMware, Docker, OpenStack, Kubernetes, KVM, Vagrant, LXC
Data base: MongoDB, PostgreSQL, Kafka, RabbitMQ, Cassandra, MySQL
APIs and Encoding: XML, JSON, YANG, YAML, REST, RESTCONF, NETCONF
CI/CD, DevOps, Agile, Jenkins
Orchestration : Camunda
Supervising solutions: AppDynamics, ThousandEyes, ELK, Prometheus, Grafana, Influx DB
Cloud : AWS, Google Cloud Platform, Azure
Data Center Technical Expertise – UCS, HX, ACI, Storage, and/or Nexus