about 2 months ago
Cupertino, CA, USAMid Level / Senior
H1B Sponsor
Responsibilities
- Propose and conduct novel research to achieve results on Sohu that are unviable on GPUs.
- Translate core mathematical operations from popular Transformer-based models into performant instruction sequences for Sohu.
- Develop deep architectural knowledge to inform software performance on Sohu hardware.
- Co-design and finetune emerging model architectures for efficiency on Sohu.
- Guide and contribute to the Sohu software stack and performance characterization tools using Python and Rust.
Requirements
- An ML Research background with interests in hardware co-design.
- Experience with Python, Pytorch, and/or JAX.
- Familiarity with transformer model architectures and inference serving stacks.
- Experience working cross-functionally in diverse software and hardware organizations.
Benefits
- Full medical, dental, and vision packages, with 100% of premium covered.
- Housing subsidy of $2,000/month for those living within walking distance of the office.
- Daily lunch and dinner in the office.
- Relocation support for those moving to Cupertino.
