Senior Software Engineer, Machine Learning (AIXON)
Appier
9 months ago
Taipei, Taiwan
Senior
Responsibilities
- Architect and operate resilient ML job execution frameworks covering training, inference, and post-processing workflows.
- Develop and maintain API services and developer tooling to orchestrate ML workflows on Kubernetes using Argo Workflows, Helm, Terraform.
- Build scalable, efficient batch pipelines with Apache Spark to support large-scale ML training and evaluation.
- Design and maintain robust data infrastructures using Trino, Databricks, and other modern database technologies.
- Develop tooling that streamlines ML experimentation and accelerates production workflows.
- Collaborate with ML scientists to transform research prototypes into reliable, scalable AI products.
- Lead cloud infrastructure design and operations on GCP, leveraging managed services.
- Define and implement CI/CD pipelines to enable seamless, automated deployments.
- Optimize system resource utilization and performance using distributed computing principles.
Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related technical field; Master’s degree preferred.
- 5+ years of hands-on experience in ML platform engineering, MLOps, or data infrastructure.
- Expert proficiency in Python, Java, or Go, with solid foundations in data structures and algorithm design.
- In-depth experience with cloud environments (AWS or GCP) and cloud-native service management.
- Proven mastery of Docker containers and Kubernetes cluster management.
- Strong understanding of the ML lifecycle from training to evaluation and feedback loops.
- Familiarity with Git workflows and Linux-based development environments.
- Passionate about continual learning and innovation in AI-powered developer tools.
Tech Stack
Apache AirflowApache SparkDatabricksDockerGitHub ActionsGoGoogle BigQueryGoogle Cloud PlatformGrafanaHelmJavaJenkinsKubernetesMLflowPrometheusPythonTerraform
Categories
AI & MLData ScienceDevOps