9 months ago
Tokyo, Japan
Mid Level / Senior
H1B Sponsor
Responsibilities
- Identify, collect, and curate diverse, high-quality Japanese text, audio, and multimodal datasets.
- Design methods to synthetically generate or augment Japanese training data when needed.
- Ensure datasets meet enterprise-grade quality, coverage, and compliance requirements.
- Train and fine-tune language and vision models for Japanese enterprise use cases.
- Adapt existing LFMs for Japanese language, culture, and enterprise-specific workflows.
- Implement evaluation frameworks to benchmark model quality on Japanese datasets.
- Design evaluation datasets and metrics for Japanese enterprise applications.
- Conduct thorough error analysis and iteratively improve model performance.
- Ensure robustness, fairness, and reliability in Japanese-language outputs.
Requirements
- Native Japanese speaker with a deep understanding of the Japanese-language model evaluation landscape.
- Familiarity with Japanese pre-training data sources.
- Experience using modeling and inference tools such as Hugging Face inference, vLLM, and cloud APIs.
Benefits
- Hands-on experience with state-of-the-art technology at a leading AI company.
- Opportunity to directly shape foundation model performance in Japanese.
- Collaborative, fast-paced environment where your work drives the next generation of LFMs.
Categories
AI & ML
Data Science
