AI Intern – VLA Deployment

about 2 months ago

Santa Clara, CA, USAIntern / Entry Level

Responsibilities

Support model quantization and deployment efforts for large-scale multimodal models.
Assist with applying model optimization techniques under guidance from senior engineers.
Work with research and platform teams to improve model deployability.
Contribute to deployment tools, test pipelines, and runtime modules in C++ and Python.
Help analyze model performance, memory usage, latency, and numerical accuracy.
Participate in debugging and performance tuning across the model and system stack.
Support validation and testing workflows for stable deployment.

BS, MS, or PhD in Computer Science, Electrical Engineering, Robotics, or a related field.
Strong programming skills in C++ and/or Python.
Familiarity with deep learning frameworks such as PyTorch.
Basic understanding of model inference and optimization workflows.
Exposure to model compression or quantization concepts.
Interest in computer architecture and performance optimization.
Strong problem-solving skills and ability to learn quickly.
Good communication skills and ability to collaborate with teams.