OnHires

Lead Platform Engineer

about 2 months ago
Remote, Worldwide
Mid Level / Staff+

Responsibilities

  • Architect and implement scalable infrastructure for the trading platform.
  • Build and maintain internal tools to streamline developer workflows.
  • Champion Infrastructure as Code practices using Terraform or CloudFormation.
  • Manage and optimize platform-critical services like NATS, RabbitMQ, and AWS RDS.
  • Automate and optimize deployment processes for continuous integration and delivery.
  • Manage and scale containerized workloads using Kubernetes and Docker.
  • Define and maintain Service Level Objectives and Indicators.
  • Implement observability tools for real-time system monitoring.
  • Lead incident response efforts and conduct root cause analysis.
  • Architect and manage cloud-based systems for high-traffic applications.
  • Implement disaster recovery and business continuity strategies.
  • Collaborate with software engineers to design tailored infrastructure solutions.
  • Mentor junior engineers and document best practices.
  • Contribute to evolving backend microservices towards Go and Rust.
  • Evaluate and integrate critical third-party software and infrastructure.

Requirements

  • 5-8+ years of hands-on experience with AWS cloud services.
  • Proficiency with Docker and Kubernetes.
  • Strong experience with Infrastructure as Code tools like Terraform.
  • Proficiency in at least one programming language such as Python or Go.
  • Expertise in building and maintaining CI/CD workflows.
  • Experience with observability platforms like Prometheus or Datadog.
  • Proven ability to handle incident response and postmortem reviews.
  • Strong problem-solving skills and ability to deliver complex infrastructure solutions.
  • Experience collaborating with product engineers to improve workflows.
  • Ownership mindset with leadership and mentoring capabilities.

Benefits

  • Competitive salary with future equity options.
  • Opportunities to work with cutting-edge technologies.
  • Flexible working hours and a remote-friendly environment.
  • Professional growth through certifications, conferences, and internal training.

Tech Stack

AWS, C#, Datadog, Docker, GitHub Actions, GitLab CI/CD, Go, Grafana, Java, Jenkins, Kubernetes, Prometheus, Python, RabbitMQ, Redis, Rust, Terraform, TypeScript

Categories