about 4 hours ago
Responsibilities
- Debug and triage issues across a live fleet of Linux-based sensor nodes.
- SSH into field hardware to diagnose and recover systems with limited access.
- Own site bring-ups end to end to restore operations.
- Build and maintain fleet management systems for device health and updates.
- Identify and eliminate repeat issues through tooling and pre-deployment checks.
- Design and implement observability across edge devices and cloud infrastructure.
Requirements
- Strong Linux systems administration skills, especially in production environments.
- Experience with edge hardware and cloud infrastructure.
- Solid understanding of networking fundamentals like DNS and VPNs.
- Proficiency in scripting or programming with Python, Go, or Bash.
- Familiarity with containerization technologies like Docker and Kubernetes.
- Experience with embedded systems and low-level code in Rust or C is a plus.
