GrepJob
EchoTwin AI

Vision Language Model Engineer

EchoTwin AI
Apply
8 months ago
San Francisco, CA, USAMid Level / Senior

Responsibilities

  • Design and implement state-of-the-art vision-language models using deep learning frameworks.
  • Develop and fine-tune models for image captioning, visual question answering, and text-to-image generation.
  • Collaborate with data scientists and software engineers to integrate models into production systems.
  • Optimize model performance for accuracy, latency, and scalability in real-world applications.
  • Conduct experiments to evaluate model performance and iterate on architectures and training pipelines.
  • Stay up-to-date with the latest research in vision-language models and incorporate advancements into projects.
  • Contribute to data preprocessing, augmentation, and annotation pipelines for multimodal datasets.
  • Document model development processes and present findings to technical and non-technical stakeholders.

Requirements

  • Bachelor’s, Master’s or Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or a related field.
  • 3+ years of experience in machine learning, focusing on vision-language models or multimodal AI.
  • Hands-on experience with deep learning frameworks such as PyTorch or TensorFlow.
  • Proven track record of building and deploying computer vision and/or NLP models.
  • Proficiency in Python and relevant ML libraries (e.g., Hugging Face, OpenCV, Transformers).
  • Experience with large-scale model training and optimization techniques.
  • Strong understanding of neural network architectures (e.g., CNNs, Transformers, CLIP).
  • Experience with multimodal datasets and preprocessing techniques for images and text.
  • Familiarity with cloud platforms (e.g., AWS, GCP, Azure) and model deployment workflows.
  • Strong problem-solving skills and ability to work in a fast-paced, collaborative environment.
  • Excellent communication skills to explain complex technical concepts to diverse audiences.

Benefits

  • Options for medical, dental, and vision coverage for employees and dependents.
  • Flexible Spending Account (FSA) and Dependent Care Flexible Spending Account (DCFSA).
  • 401(k) with 3% company matching.
  • Unlimited PTO.
  • Profit sharing.

Tech Stack

AWSAzureGoogle Cloud PlatformOpenCVPythonPyTorchTensorFlow

Categories

AI & MLData Science