Graphcore

Observability, Telemetry Engineer

5 days ago
Gdańsk, Poland
Mid Level / Senior
H1B Sponsor

Responsibilities

  • Contribute to all phases of product development, from definition to early customer support.
  • Design and implement fault-remediation solutions at scale.
  • Implement multi-component integrations for seamless management and monitoring.
  • Create reference designs including documentation and source code.
  • Deploy solutions for engineering teams to aid in debugging and performance analysis.
  • Ensure comprehensive testing by collaborating with development and QA teams.
  • Mentor and guide junior engineers to foster continuous learning.

Requirements

  • BSc or MSc degree in Computer Engineering, Computer Science, or related field.
  • Experience in architecting and implementing scalable cluster management systems.
  • Good understanding of computer systems architecture including CPU, GPU, and DPU.
  • Programming and debugging skills for server platforms.
  • Expertise in management architectures and associated tools.
  • Detailed knowledge of Redfish APIs.
  • Experience with large-scale telemetry datasets and actionable dashboards.
  • Strong skills in C/C++/Go and Python.
  • Excellent written and verbal communication skills.

Benefits

  • Competitive salary and annual leave policy.
  • Medical and dental health plans.
  • Gym card and employee pension contributions matched up to 4%.
  • Yearly review of benefits to ensure value and reward.
  • Commitment to an inclusive work environment.

Tech Stack

  • C
  • C++
  • Datadog
  • Go
  • Python
  • Splunk

Categories

  • AI & ML
  • Backend
  • Data Engineering
  • Testing