SRE, Site Reliability Engineering

4 months ago

Dublin, Ireland

Mid Level / Senior

H1B Sponsor

Responsibilities

Build, operate, and improve production systems with a focus on reliability and performance.
Automate operational tasks to reduce manual toil.
Contribute to system design and implementation using SRE best practices.
Define and measure SLIs and SLOs for supported services.
Enhance observability through metrics, dashboards, and logging.
Participate in on-call rotations and respond to production incidents.
Assist with incident investigations and contribute to post-incident reviews.
Analyze system behavior and capacity usage.
Work with teammates to identify and address reliability issues.
Collaborate with engineers to ship reliable systems.
Write and maintain operational runbooks and documentation.