6 days ago
Beijing, ChinaMid Level / Senior
Responsibilities
- Design and maintain CI/CD pipelines for multiple services across the platform.
- Improve deployment automation, release strategies, and rollback mechanisms.
- Build and enhance monitoring, alerting, and observability systems across production services.
- Ensure system health visibility through metrics, logs, traces, and dashboards.
- Work with engineers to reduce deployment risk and improve release confidence.
- Implement safe deployment strategies such as canary, blue-green, or phased rollouts.
- Improve incident detection speed and reduce mean time to recovery (MTTR).
- Support infrastructure reliability for business-critical insurance workflows.
- Standardize deployment and monitoring practices across engineering teams.
- Continuously improve CI/CD performance, stability, and maintainability.
Requirements
- Experience in DevOps, SRE, platform engineering, or infrastructure roles.
- Strong understanding of CI/CD pipelines, deployment automation, and release engineering.
- Experience with monitoring, logging, and observability systems in production environments.
- Ability to troubleshoot deployment and production issues in a structured and calm manner.
- Strong understanding of system reliability, uptime, and operational risk.
- Experience supporting production systems with high availability requirements.
- Hands-on ownership mindset during incidents and deployment failures.
- Practical judgment on release safety, performance, and system stability.
- Strong collaboration with engineering teams in fast-paced environments.
- Low ego and disciplined approach to production operations.
Benefits
- Build Reliable Delivery Systems – Own CI/CD and monitoring for AI automation platforms.
- High-Impact Engineering – Solve real-world release engineering and observability challenges.
- Global Engineering Team – Work with experienced engineers across multiple countries.
- Fully Remote – Work remotely from China while collaborating with our Malaysia-based teams.
- International Exposure – Build systems used across Southeast Asia markets.
- Learning & Development Budget – Support continuous technical growth and DevOps expertise.
- High Ownership Environment – Strong autonomy over deployment and monitoring architecture.
- Modern Engineering Culture – Focus on reliability, speed, and engineering excellence.
- Competitive Compensation – Attractive salary package based on experience and impact.
Tech Stack
AnsibleAWSAzureDatadogDockerGitHub ActionsGitLab CI/CDGoogle Cloud PlatformGrafanaJenkinsKubernetesPrometheusTerraform
