Director of Cloud Operations. Our behavioral analytics / machine-learning platform is the leader in providing Fraud Detection solutions.
The position is based in Mountain View and reports to Vice President of Cloud Operations and Security.
Responsibilities:
Lead all 24 X 7 operations for all Guardian Analytics cloud offerings
Architect designs and strategies to help meet our technology and automation goals, including Cloud and SDN architectures
Design, scale, and maintain the Guardian Analytics SaaS infrastructure
Knowledge of modern development languages such as Python and other scripting languages to develop tools, scripts, and frameworks that drive efficiency in automation, monitoring, and management of our large-scale environments
Utilize instrumentation & metrics, and applications to automate and improve operational processes and availability, scaling, and security of the production and development environments
Develop and participate in infrastructure operations, release management, incident management, problem management, configuration management and change management processes for all cloud services
Design, enhance, and maintain development, management and monitoring systems
Ensure successful backup and/or replication of Customer Data in a secure manner
Collaborate with Engineering and Product teams to design and implement solutions to support Operations vision and strategy
Deploy and maintain product releases and customer configurations
Participate in SOC 2 and PCI audits, and ensure all controls are followed as per platform
Engage with industry and vendor partners to drive our requirements and product needs
Required Skills and Experience:
Previous experience in managing Operations teams, as well as 5+ years in technical leadership roles
Bachelor's degree in Computer Science or similar major or equivalent experience
10+ years operational experience managing critical tools and infrastructure with a strong focus on providing cloud-based services and technology
3+ years of experience with Cloud Technologies and Architecture (AWS, OpenStack, etc.), Cloud Orchestration/Automation Tools (TerraForm, Salt, Puppet, etc.), and virtualization/container technologies (KVM, Kubernetes, Docker, etc.)
Experience with tools for system, process, and environment monitoring (e.g. Nagios/Icinga, Graphana, Cacti, etc.), logging analysis (Logstash, Splunk, Elastic Search, etc.), and configuration management (e.g. Salt, Ansible, Puppet)
Understanding of source code control systems; experience with Subversion (SVN) and Git a plus, including DevOps experience with CI/CD using Jenkins, Artifactory, or similar technologies.
Experience with hyper-converged infrastructure
Proficient in MySQL support (replication, grants, operational procedures)
Familiarity with Java/JVM performance tuning
Knowledgeable in security fundamentals, including encryption, OpenSSL, SSL Certificates, Linux, system hardening, etc.
Experience with large-scale, clustered, and distributed storage and filesystems
Experience with network layer devices and functionality (e.g. Switching, Routing, Load Balancing, Proxying, NAT)
Expert level triage and troubleshooting skills
Additional Preferred Qualifications:
Strong interpersonal and communication skills; ability to collaborate across teams and skill levels
Experience developing, tracking and leveraging performance metrics for continual improvement
Expert level Linux system administration skills
Proficiency in modern development languages and frameworks such as Python, Jinja2 and other scripting languages (BASH, Perl, etc.)
Strong attention to detail and excellent documentation skills
Self-motivated individual who requires minimal supervision
Resourceful, persistent, flexible, and adaptable team player