Senior Research Engineer - Inference ML
Cerebras Systems
3 months ago
Sunnyvale, CA, USA or Toronto, Canada
Senior / Staff+
H1B Sponsor
Responsibilities
- Design, implement, and optimize transformer architectures for NLP and computer vision on Cerebras hardware.
- Research and prototype novel inference algorithms and model architectures that leverage Cerebras capabilities.
- Train models to convergence, perform hyperparameter sweeps, and analyze results.
- Bring up new models on the Cerebras system, validate correctness, and troubleshoot integration issues.
- Profile and optimize model code to maximize throughput and minimize latency.
- Develop diagnostic tools to identify performance bottlenecks and guide optimization strategies.
- Collaborate across teams to drive projects from inception through delivery.
Requirements
- Bachelor’s degree in a related technical field and 7+ years of ML software development experience, or equivalent experience.
- Master’s degree in Computer Science or related field and 4+ years of software development experience, or equivalent experience.
- PhD in Computer Science or related field with 2+ years of relevant experience, or equivalent practical experience.
- 4+ years of experience testing, maintaining, or launching software products, including 2+ years in software design and architecture.
- 3+ years of experience in software development focused on machine learning, such as deep learning or computer vision.
- Strong programming skills in C++ and/or Python.
- Experience with Generative AI and Machine Learning systems.
Benefits
- Opportunity to work on a breakthrough AI platform beyond GPU constraints.
- Ability to publish and open source cutting-edge AI research.
- Engagement with one of the fastest AI supercomputers in the world.
- Job stability with startup vitality.
- A simple, non-corporate work culture that respects individual beliefs.
Tech Stack
C++PythonPyTorch
Categories
AI & MLData Science