GrepJob
Graphcore

Observability, Staff Telemetry Engineer

3 days ago
Gdańsk, Poland
Staff+
H1B Sponsor

Responsibilities

  • Contribute to all phases of product development, from definition to early customer support.
  • Design and implement fault-remediation solutions at scale.
  • Implement multi-component integrations for seamless management and monitoring.
  • Create reference designs including documentation and source code.
  • Deploy solutions internally for engineering teams to aid in various analyses.
  • Work with development and QA teams to enhance testing and ensure comprehensive test plans.
  • Mentor and guide junior engineers.

Requirements

  • BSc or MSc degree in Computer Engineering, Computer Science, or equivalent experience.
  • Demonstrated success in architecting and implementing scalable cluster management systems.
  • Good understanding of computer systems architecture including CPU, GPU, and DPU.
  • Experience with programming and debugging for server platforms.
  • Expertise in management architectures and associated tools.
  • Detailed knowledge of Redfish APIs.
  • Experience with large-scale telemetry datasets and actionable dashboards.
  • Strong skills in C/C++/Go and Python.
  • Excellent written and verbal communication skills.

Benefits

  • Competitive salary and annual leave policy.
  • Medical and dental health plans.
  • Gym card; employee pension contributions matched up to 4%.
  • Yearly review of benefits to ensure value and rewards.
  • Commitment to an inclusive work environment.

Tech Stack

C, C++, Datadog, Go, Python, Splunk

Categories

AI & ML, Data Engineering, Testing