3 days ago
Responsibilities
- Manage and maintain large-scale distributed systems using an infrastructure-as-code approach.
- Develop and enhance tools to automate the deployment and management of large-scale services.
- Diagnose and resolve issues by editing code and adjusting infrastructure configurations.
- Develop automation solutions and manage services efficiently using version-controlled infrastructure-as-code.
- Support mission-critical services and participate in on-call rotations as needed.
Requirements
- 5+ years of relevant experience in site reliability or systems engineering.
- Proficiency with Python or Ansible for automation tasks.
- Demonstrated experience building and maintaining automation solutions.
- Strong background in systems administration, specifically with Linux or other major operating systems.
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
Benefits
- Various health plans.
- Time off plans for vacation and sick time.
- Parental leave options.
- Retirement options.
- Education reimbursement.
- In-office perks, and more.
