about 4 hours ago
Responsibilities
- Drive the technical vision and architecture for Postman's observability platform.
- Design and build scalable solutions for metrics, logging, tracing, alerting, and operational analytics.
- Investigate complex production issues, identify root causes, and drive long-term corrective actions.
- Partner with engineering teams to improve service reliability, availability, performance, and operational maturity.
- Establish observability standards, best practices, and instrumentation frameworks across the company.
- Build tooling and automation that enable faster incident detection, diagnosis, and resolution.
- Leverage telemetry data to uncover performance bottlenecks, capacity risks, and reliability gaps.
- Drive cross-functional initiatives focused on platform health, operational excellence, and engineering productivity.
- Mentor senior engineers and help raise the technical bar across the organization.
Requirements
- 10+ years of software engineering experience with significant exposure to distributed systems and cloud-native architectures.
- Strong expertise in observability domains including monitoring, logging, distributed tracing, telemetry pipelines, and incident management.
- Experience operating large-scale production systems with a strong focus on reliability, scalability, and performance.
- Deep understanding of system debugging, root-cause analysis, performance optimization, and production operations.
- Strong programming experience in one or more languages such as Go, Java, Python, Node.js, or similar.
- Experience with observability technologies such as OpenTelemetry, Prometheus, Grafana, Elasticsearch, Datadog, New Relic, Splunk, Honeycomb, or equivalent platforms.
- Ability to influence technical direction across teams without direct authority.
- Strong communication skills and a data-driven approach to problem-solving.
Benefits
- Comprehensive medical coverage.
- Flexible PTO and wellness reimbursement.
- Monthly lunch stipend.
- Wellness programs to support physical and mental health.
- Frequent team-building events.
- Donation-matching program for causes employees care about.
- In-person collaboration model with a focus on teamwork.