Member of Technical Staff - ML Engineer / Scientist (JP Localization)

Tokyo, Japan | Mid Level / Senior

H1B Sponsor

Responsibilities

Identify, collect, and curate diverse high-quality Japanese text, audio, and multimodal datasets.
Design methods to synthetically generate or augment Japanese training data when needed.
Ensure datasets meet enterprise-grade quality, coverage, and compliance requirements.
Train and fine-tune language and vision models for Japanese enterprise use cases.
Adapt existing LFMs for Japanese language, culture, and enterprise-specific workflows.
Implement evaluation frameworks to benchmark model quality on Japanese datasets.
Design evaluation datasets and metrics for Japanese enterprise applications.
Conduct thorough error analysis and iteratively improve model performance.
Ensure robustness, fairness, and reliability in Japanese-language outputs.

Qualifications

Native Japanese speaker with a deep understanding of the Japanese model evaluation landscape.
Familiarity with Japanese pre-training data sources.
Experience using modeling and inference tools such as Hugging Face inference, vLLM, and cloud APIs.

Benefits

Hands-on experience with state-of-the-art technology at a leading AI company.
Opportunity to directly shape foundation model performance in Japanese.
Collaborative, fast-paced environment where your work drives the next generation of LFMs.