about 3 hours ago
Santa Clara, CA, USASenior / Mid Level
Base Salary
$175k - $296k/yr
Responsibilities
- Design and construct core data closed loop pipelines.
- Develop toolchains for data cleaning, annotation quality inspection, and data mining.
- Provide data support for production and R&D processes.
- Optimize the performance of the entire data pipeline.
- Build a data management platform covering the entire process from data collection to model training.
- Collaborate with technical teams to understand business requirements.
Requirements
- Bachelor's degree or higher in Computer Science, Software Engineering, Artificial Intelligence, or related fields.
- 3-5+ years of experience in large-scale data processing or data platform development.
- Proficiency in at least one programming language among Python, Go, or Java.
- Hands-on project experience in designing and developing large-scale data pipelines.
- Production-level experience with distributed message queues.
- Experience with distributed data lake systems and performance tuning.
- Strong cross-team communication and collaboration skills.
Benefits
- A fun, supportive and engaging environment.
- Infrastructures and computational resources to support your work.
- Opportunity to work on cutting edge technologies with top talents.
- Opportunity to make a significant impact on the transportation revolution.
- Competitive compensation package.
- Snacks, lunches, dinners, and fun activities.
Tech Stack
Categories
AI & MLData Engineering