Service Reliability Engineer
ThoughtWorks
Posted about 12 hours ago
Singapore, Singapore
Mid Level / Senior
H1B Sponsor
Responsibilities
- Provide operational support for large-scale distributed environments.
- Quickly diagnose and resolve issues to minimize downtime.
- Troubleshoot and investigate across databases, web services, and applications.
- Handle production incidents and manage incident communication with clients.
- Monitor deliverables and ensure they meet technical and business expectations.
- Share ideas with team members and stakeholders to facilitate discussions.
- Maintain positive relationships with internal peers to deliver strategic objectives.
- Suggest innovative solutions to current constraints and business policies.
Requirements
- Hands-on experience in programming and scripting languages such as Python, Go, or Bash.
- Good understanding of at least one public cloud platform (AWS, Azure, or GCP).
- Exposure to observability tools like Grafana, Datadog, or ELK Stack.
- Familiarity with DevOps and GitOps practices.
- Knowledge of container-based architecture and orchestration tools like Kubernetes.
- Understanding of technical architecture and modern design patterns.
- Familiarity with creating infrastructure resources that follow a cloud provider's Well-Architected Framework principles.
Benefits
- Autonomy in career development supported by interactive tools and development programs.
- A collaborative culture that values helping each other grow.