GrepJob
Blink Health

Senior Site Reliability Engineer

Blink Health
Apply
about 2 hours ago
Delhi, IndiaSenior / Staff+
H1B Sponsor

Responsibilities

  • Establish and evolve SRE best practices across the organization.
  • Define and drive observability strategy for system health and performance.
  • Design and implement software-driven solutions within the infrastructure domain.
  • Act as a technical leader, influencing decision-making across core infrastructure.
  • Take ownership of large, ambiguous initiatives from concept to delivery.
  • Combine knowledge of software development, infrastructure, and security to improve platform resilience.
  • Proactively identify risks and recommend platform upgrades.
  • Partner with engineering teams to improve developer workflows and operational maturity.
  • Provide technical mentorship and high-quality design reviews.
  • Lead by example in documentation and knowledge sharing.
  • Participate in and help mature incident response and post-incident learning.

Requirements

  • Bachelor’s or Master’s degree in Computer Science or equivalent experience.
  • 10+ years of experience in site reliability engineering or related roles.
  • Expert-level troubleshooting across the entire stack.
  • Strong command-line proficiency and expertise in Linux systems.
  • Advanced understanding of networking concepts.
  • Experience with multiple programming languages such as Python, Go, and Bash.
  • Strong track record of automating operational work.
  • Deep experience with cloud platforms, preferably AWS.
  • Strong expertise in Kubernetes and container orchestration.
  • Experience designing and maintaining company-wide IaC codebases.