Site Reliability Engineer
NICE
2 months ago
Pune, India
Mid Level / Senior
Responsibilities
- Monitor availability and maintain system health in the production environment.
- Build software and systems to manage platform infrastructure and applications.
- Improve reliability, quality, and time-to-market of software solutions.
- Measure and optimize system performance to meet customer needs.
- Provide operational support for large distributed software applications.
- Analyze metrics for performance tuning and fault finding.
- Collaborate with development teams to enhance services through testing.
- Participate in system design consulting and capacity planning.
- Create sustainable systems through automation.
Requirements
- 2+ years of programming/scripting experience in languages like Go, Python, or .Net.
- Bachelor’s degree in computer science, engineering, or related field.
- 2-3 years of experience in systems engineering, automation, and reliability.
- Proficiency in at least one programming language and experience with scripting languages.
- Deep understanding of cloud computing platforms like AWS.
- Experience with infrastructure as code tools such as CloudFormation or Terraform.
- Knowledge of CI/CD concepts and tools like Jenkins or GitLab CI/CD.
- Strong knowledge of containerization technologies and microservices architecture.
- Experience with monitoring and observability tools.
- Excellent problem-solving skills for troubleshooting complex issues.
- Experience in incident management and driving incident response efforts.
- Readiness to work on Graveyard Shift.
Benefits
- Flexible hybrid work model with 2 days in the office and 3 days remote.
- Opportunities for internal career growth across multiple roles and locations.
- Collaborative and creative work environment.
Tech Stack
Amazon DynamoDBAWSAzureBashC#CircleCIDockerGitLab CI/CDGoGrafanaJenkinsKubernetes.NETNode.jsPowerShellPrometheusPythonTerraform
Categories
BackendDevOps