Staff Machine Learning Engineer
Tempus
10 months ago
Chicago, IL, USA +3 more
Staff+
H1B Sponsor
Base Salary
$170k - $230k/yr
Responsibilities
- Architect and build sophisticated data processing workflows for multimodal training data.
- Develop strategies for efficient data ingestion from various sources.
- Utilize and optimize frameworks for large-scale ML data loading and streaming.
- Collaborate with infrastructure teams to optimize cloud-native services.
- Engineer connectors and data loaders for diverse knowledge sources.
- Optimize data storage for large-scale training and knowledge access.
- Orchestrate and monitor complex data workflows using tools like Airflow.
- Establish monitoring and alerting systems for data pipeline health.
- Analyze and optimize data I/O performance bottlenecks.
- Manage costs associated with storing and processing datasets in the cloud.
Requirements
- Master's degree in Computer Science, AI, Software Engineering, or related field.
- 8+ years of industry experience in large-scale data pipelines and infrastructure.
- Experience with massive datasets and distributed data processing tools.
- Hands-on experience with ML data handling tools and libraries.
- Understanding of data challenges for training large models.
- Proficiency in Python and modern distributed data processing frameworks.
- Proven leadership and collaboration skills with cross-functional teams.
- Excellent communication skills for explaining complex concepts.
- Ability to thrive in a fast-paced, dynamic research environment.
Tech Stack
Apache Airflow, Apache Spark, Google Cloud Platform, MLflow, Python
Categories
AI & ML, Data Engineering, Data Science