2 days ago
Responsibilities
- Deploy, maintain, and support a highly available cloud infrastructure on AWS and Azure.
- Build and manage cloud infrastructure using Infrastructure as Code tools like Terraform.
- Collaborate with Engineering and DevOps teams for smooth application deployments.
- Monitor production environments and resolve infrastructure and application issues.
- Implement monitoring, logging, and observability solutions for platform reliability.
- Participate in incident response and root cause analysis to minimize service disruptions.
- Contribute to automation initiatives to improve efficiency and reduce manual effort.
- Optimize resource utilization and support cloud cost management.
- Support security and compliance requirements through best practices.
- Participate in disaster recovery testing and platform maintenance activities.
- Provide timely support for production environments during on-call rotations.
- Document operational procedures and troubleshooting guides.
- Stay current with cloud technologies and contribute ideas for improvement.
- Mentor junior engineers and participate in knowledge-sharing sessions.
Requirements
- 3–6 years of experience in Linux environments.
- 3+ years of hands-on experience with cloud providers like AWS or Azure.
- Strong knowledge of network infrastructures.
- Experience with Terraform or other Infrastructure as Code tools.
- Familiarity with CI/CD pipelines and deployment automation.
- Experience with scripting languages such as Python or Shell.
- Strong troubleshooting and problem-solving skills.
- Knowledge of deploying containerized services with Docker or Kubernetes is a plus.
- SQL/Database knowledge is beneficial.
- Knowledge of cloud security best practices is advantageous.
- Cloud Certification, such as AWS Cloud Solution Architect – Associate, is preferred.
- Ability to learn new technologies quickly and multitask under pressure.
- Excellent verbal and written communication skills.