about 3 hours ago
Dublin, Ireland
Mid Level / Senior
H1B Sponsor
Responsibilities
- Coordinate with a global team to ensure uptime guarantees for Atlas customers.
- Implement and refine processes and tools for the Cloud Operations Engineering team.
- Design and deploy systems to reduce Mean Time to Resolve for customer incidents.
- Monitor and proactively resolve customer-facing incidents on the Atlas platform.
- Automate routine monitoring and troubleshooting tasks.
- Diagnose live incidents and differentiate between platform and usage issues.
- Conduct root cause analysis after incidents and recommend process improvements.
- Document troubleshooting workflows and standard operating procedures.
- Collaborate with product management and engineering teams to improve management applications.
- Inform leadership of major outages and coordinate on-call rotations.
Requirements
- At least 2 years of experience as a DevOps, SRE, or Cloud Operations engineer.
- Expertise in Linux system administration and troubleshooting.
- Experience in monitoring and analyzing system performance data.
- Knowledge of database operations and concepts.
- Familiarity with networking technologies like DNS and TCP/IP.
- Experience with cloud platforms such as AWS, GCP, or Azure.
- Ability to write scripts to solve systems problems.
- A degree in Computer Science or equivalent experience.
- Proficiency in at least one programming language such as Java, Go, or JavaScript.
- A keen interest in learning new technologies.
Benefits
- Competitive salary, equity, pension, and health insurance.
- Regular performance, compensation, and development reviews.
- 20 weeks of maternity and paternity leave.
Tech Stack
AWSAzureGoGoogle Cloud PlatformJavaJavaScriptKubernetesLinuxMongoDBSplunk
Categories
BackendDevOps