about 4 hours ago
Base Salary
$161k - $242k/yr
Responsibilities
- Drive initiatives to implement and enforce best practices for data streaming and processing.
- Deploy and manage services on Kubernetes-based platforms like Amazon EKS and Google GKE.
- Provision and manage cloud infrastructure using Terraform.
- Maintain and optimize CI/CD pipelines using Jenkins, ArgoCD, and GitHub Actions.
- Work with cloud-native data services such as AWS Kinesis and Google Dataflow.
- Develop and maintain automation scripts using Python.
- Monitor system performance and troubleshoot issues.
- Implement SRE practices to improve service reliability.
- Analyze and optimize cloud costs.
- Ensure compliance with security policies in cloud environments.
- Collaborate with cross-functional teams to improve development workflows.
Requirements
- 7+ years of experience in a DevOps, Site Reliability Engineering, or Cloud Infrastructure role.
- Strong experience with AWS and GCP data services.
- Proficiency in deploying and managing workloads on Kubernetes.
- Hands-on experience with Infrastructure-as-Code using Terraform.
- Expertise in CI/CD pipeline management using Jenkins and ArgoCD.
- Programming skills in Python for automation.
- Experience with observability and monitoring tools.
- Strong understanding of SRE principles.
- Experience with cost optimization strategies for cloud infrastructure.
- Self-motivated with the ability to influence changes across teams.
- Ability to work collaboratively in an agile environment.
Benefits
- Comprehensive benefits including holistic mind, body, and lifestyle programs.
- Opportunities for career acceleration and personal growth.
Tech Stack
Apache AirflowApache FlinkApache KafkaApache SparkDatadogGitHub ActionsGoogle BigQueryGrafanaIstioJenkinsKubernetesPrometheusPythonRabbitMQSnowflakeTerraform