about 2 months ago
Rockville, MD, USAMid Level / Senior
H1B Sponsor
Base Salary
$75k - $80k/yr
Responsibilities
- Design and operate scalable, resilient, and secure infrastructure platforms across cloud and hybrid environments.
- Champion DevOps and SRE practices including Infrastructure as Code, CI/CD, observability, and reliability engineering.
- Build developer-friendly platforms that simplify deployments and improve velocity.
- Enable and optimize infrastructure for AI/ML workloads, including data pipelines and storage systems.
- Develop and maintain automated CI/CD pipelines for applications, data, and ML workflows.
- Implement observability frameworks to ensure system health and performance.
- Define and manage SLOs, SLIs, and error budgets to drive reliability improvements.
- Lead incident response, root cause analysis, and postmortems with a focus on continuous improvement.
- Automate provisioning, configuration, patching, and environment lifecycle management.
- Build and manage containerized and orchestrated platforms (Docker, Kubernetes).
- Support cloud migration, modernization, and platform standardization initiatives.
- Ensure systems meet security, compliance, backup, and disaster recovery requirements.
- Evangelize DevOps practices to the development community.
- Mentor engineers and promote best practices in DevOps, SRE, and platform engineering.
- Stay abreast of new technologies in AIOps, MLOps, cloud computing, and security best practices.
Requirements
- Must have 6+ years of hands-on Linux experience, including Ubuntu/CentOS/Red Hat.
- Must have 4+ years of experience automating Infrastructure-as-Code (IaC) deployments on cloud platforms.
- Must have 4+ years in DevOps/SRE roles supporting production systems.
- Must have 4+ years with CI/CD and automation tools such as Terraform, Ansible, and Jenkins.
- Strong scripting skills in Python, Bash, or PowerShell.
- Experience with monitoring and observability tools like Prometheus and Grafana.
- Proficient in using coding assistants for developing scripts and tools for DevOps use cases.
- Proficient in debugging or troubleshooting SQL/NoSQL databases and web servers.
- Must be willing to learn new technologies and adapt to project needs.
- Cloud certifications are preferred.
- Certifications in Docker, Kubernetes, or Networking are optional.
Benefits
- 100% Medical, Dental & Vision Coverage for Employees.
- Paid Time Off and Paid Holidays.
- 401K match up to 5%.
- Educational Benefits for Career Growth.
- Employee Referral Bonus.
- Flexible Spending Accounts for healthcare, parking, dependent care, and transportation.
