Site Reliability Engineer

Site Reliability Engineer

23 Nov 2024
California, Los angeles, 90001 Los angeles USA

Site Reliability Engineer

Vacancy expired!

Hello Applicant,
I am

Avinash from

Mindgadget Inc., Please review the below role and let me know your interest.

Note: Citizens and GCs are applicable for these roles.

Location - Remote till 1/31/2022. After that candidate may need to join one of the locations in the Bay Area, LA, Seattle, NY

SRE for Infrastructure• Managing infrastructure services, responsible for including but not limited to deployment,
operation and troubleshooting;
• Maintain services to meet service-level-agreements (SLAs) or service-level-objective
(SLOs) by measuring and monitoring availability, performance, and overall system health;
• Provide user support, incident responses, and post-mortems;
• Participate in on-call rotation.

Minimum qualifications:
• Bachelor's degree or above, majoring in Computer Science or related fields
• 2+ years of experience in one or more of the following types of systems at their newest
versions:
• Kubernetes and Docker
• Redis and/or MongoDB
• Kafka and/or RocketMQ
• Flink
• MySQL
• ElasticSearch
• HDFS
• Mesos and/or Yarn
• Spark and/or Hive
• Familiar with Unix/Linux operating systems
• Experience in debugging and automating routine tasks;
• Strong skills in problem solving and communication
• Excellent team player
• Experience in supporting/managing systems at scale (10s thousands to 100s thousands
instances) is a big plus;

SRE for Cloud• Manage cloud infrastructure, provide resource allocation, system upgrades, user access
control etc.
• Perform deep dives on complex system issues ranging from software bugs, hardware
failures to network issues.
• Build tools and automation to improve operational efficiency.
• On-call responsibility

Minimum qualifications:
• Master's degree (or Bachelor's degree with 3+) years of experience in Computer
Engineering, Electrical Engineering, Computer Science or related major
• 3+ years experience working with Unix Linux systems from kernel to shell and beyond
• 3+ years of scripting experience in Shell and Python.
Preferred qualifications:
• Networking configuration and systems administration experience on a Public Cloud
platform (AWS, Google Cloud Platform, Azure, OCI, etc.)
• Hardware and system troubleshooting skills
• Experience in a large scale production environment
• Experience in L4/L7 load balancers
• Experience in Kerberos, LDAP, or other account management and access control
systems.

SRE for Services• Create, manage and integrate software to automate and secure public cloud
environments;
• Testing and examining code written by others and analyzing results;
• Develop and own the solutions that can support large capacity and scale reliability in a
24/7 environment;
• Monitor the system and respond to incidents to maintain system SLO/SLA, review and
follow up production incidents;
• Share on-call responsibility and Troubleshoot problems across a wide array of services
and functional areas.• Bachelor's degree or above, majoring in Computer Science or related fields, with at least
2 years of related work experience;
• Experience working with Unix Linux systems from kernel to shell and beyond;
• Familiar with system operation skills in Linux and network;
• Experience programming in at least one of the following languages: Python, Perl, Go, or
C/C.
• Experience in CI/CD, Kubernetes, Database experience, or setting up big data pipelines.
If you are interested and available, please share resumes to

avinashATmindgadgetDOTcom or reach me at

4o8-419-9494 directly.

Job Details

Jocancy Online Job Portal by jobSearchi.