about 7 hours ago
Hyderābād, IndiaMid Level / Senior
H1B Sponsor
Responsibilities
- Ensure the reliability of software systems by designing and maintaining scalable infrastructure.
- Develop automation tools and scripts to streamline operational tasks.
- Monitor system performance and respond to incidents to minimize downtime.
- Analyze system usage patterns for capacity planning.
- Identify and address performance bottlenecks in software systems.
- Implement Infrastructure as Code practices using tools like Terraform.
- Maintain monitoring and logging solutions for system insights.
- Participate in an on-call rotation for 24/7 system availability.
- Collaborate with security teams to implement security best practices.
- Develop and maintain disaster recovery plans.
- Continuously analyze system performance for improvement opportunities.
Requirements
- 2-4 years of experience in site reliability engineering.
- B.Tech/M.Tech in computer science, information technology, or a related field.
- Experience in a product organization is a plus.
- Certifications from cloud service providers like AWS or Google Cloud are a plus.
- Proficiency in programming languages such as Python, Go, or Shell.
- Strong automation skills using tools like Ansible or Terraform.
- Experience with containerization technologies like Docker and Kubernetes.
- Proficiency in cloud platforms such as AWS, Azure, or Google Cloud.
- Familiarity with monitoring tools like Prometheus or Grafana.
- Understanding of networking concepts and security best practices.
- Experience with CI/CD pipelines and version control systems like Git.