
Senior Site Reliability Engineer
Blink Healthabout 2 hours ago
Delhi, IndiaSenior / Staff+
H1B Sponsor
Responsibilities
- Establish and evolve SRE best practices across the organization.
- Define and drive observability strategy for system health and performance.
- Design and implement software-driven solutions within the infrastructure domain.
- Act as a technical leader, influencing decision-making across core infrastructure.
- Take ownership of large, ambiguous initiatives from concept to delivery.
- Combine knowledge of software development, infrastructure, and security to improve platform resilience.
- Proactively identify risks and recommend platform upgrades.
- Partner with engineering teams to improve developer workflows and operational maturity.
- Provide technical mentorship and high-quality design reviews.
- Lead by example in documentation and knowledge sharing.
- Participate in and help mature incident response and post-incident learning.
Requirements
- Bachelor’s or Master’s degree in Computer Science or equivalent experience.
- 10+ years of experience in site reliability engineering or related roles.
- Expert-level troubleshooting across the entire stack.
- Strong command-line proficiency and expertise in Linux systems.
- Advanced understanding of networking concepts.
- Experience with multiple programming languages such as Python, Go, and Bash.
- Strong track record of automating operational work.
- Deep experience with cloud platforms, preferably AWS.
- Strong expertise in Kubernetes and container orchestration.
- Experience designing and maintaining company-wide IaC codebases.