Senior Full-Stack AI ML Engineer
Salary
Skills
Location
Languages
Job Description
Senior Full-Stack AI ML Engineer
Direct Mission & Ownership Note
This role carries full End-to-End ownership: you are responsible for the entire project lifecycle - from identifying the business problem and engineering the data to deploying and maintaining the model in production.
- Work Model: Remote (work from anywhere in EU).
- Strategic Travel: Requires a one-week trip to the USA once per quarter to collaborate on-site and understand physical operations (all travel costs fully covered).
- Collaboration Model: Strictly B2B contract.
- Seniority: This level of autonomy requires a minimum of 7+ years of hands-on experience. We need a veteran who doesn't need a roadmap provided for them - they build it.
About the Role
We are seeking a Senior Full-Stack AI ML Engineer to design, build, deploy, and operate production-grade machine learning systems for manufacturing and operational use cases. This role is explicitly focused on classical and applied machine learning, not LLMcentric or prompt-engineering-only work. The successful candidate will demonstrate deep hands-on experience across the full ML lifecycle: from data engineering and statistical modeling to deployment, monitoring, and continuous improvement in production.
You will work in ambiguous, real-world problem spaces, translating operational and business challenges into measurable ML solutions. The role requires strong engineering fundamentals, applied ML expertise, and the ability to own systems end-to-end in a cloud environment. The ideal candidate combines deep technical skills with a consulting mindset and can occasionally act as a solution owner / product lead.
Key Responsibilities
End-to-End Machine Learning Delivery
- Lead the full ML lifecycle: data acquisition → feature engineering → model development → validation → deployment → monitoring and iteration.
- Design and implement predictive, prescriptive, and optimization models for manufacturing operations, predictive maintenance, quality, and supply-chain use cases.
- Apply classical ML and statistical techniques, including: time-series forecasting; anomaly and outlier detection; regression, classification, clustering; probabilistic and statistical modeling and; optimization and decision models.
Data & Feature Engineering
- Build and maintain robust ETL/ELT pipelines for batch and near-real-time ML workloads.
- Perform exploratory data analysis (EDA) to understand data quality, bias, distributions, and feature behavior.
- Design feature pipelines and feature stores, ensuring consistency between training and inference.
- Integrate data from heterogeneous sources (APIs, databases, event streams) with strong validation, schema enforcement, and observability.
Production AI ML Systems
- Deploy models as scalable services (APIs, batch jobs, streaming inference) in cloud environments.
- Implement model monitoring for performance, drift, data quality, and operational health.
- Collaborate with platform and software teams to ensure solutions are secure, reliable, and cost-efficient.
- Contribute to and enforce MLOps best practices, including CI/CD for ML, reproducibility, versioning, and rollback strategies.
Technical Leadership & Collaboration
- Translate ambiguous business problems into clear ML problem formulations, success metrics, and delivery plans.
- Lead technical decision-making around model choice, evaluation strategy, and trade-offs.
- Communicate model behavior, limitations, and results clearly to non-ML stakeholders using visualizations, dashboards, and written documentation.
- Mentor junior engineers and contribute to ML engineering standards and best practices.
Required Qualifications (Must-Have)
- 10+ years of combined experience across machine learning engineering, data engineering, and production system delivery.
- 5+ years of hands-on experience delivering end-to-end AI ML systems in production, beyond experimentation or notebooks.
- Demonstrated depth in classical machine learning and applied statistics (not limited to GenAI or LLMs).
- Strong understanding of model evaluation, validation strategies, bias/variance trade-offs, and failure modes.
- Strong proficiency in Python for production ML (clean code, testing, packaging, performance).
- Strong SQL skills and experience modeling data for analytics and ML.
- 3+ years of cloud experience (GCP strongly preferred; AWS acceptable).
- Hands-on experience with: BigQuery, Cloud Storage, Dataflow / Apache Beam, Airflow / Cloud Composer, Containerized deployments (Docker; Kubernetes or managed equivalents).
- Experience operationalizing ML with tools such as MLflow, Airflow/Dagster, CI/CD pipelines, and monitoring frameworks.
- Proven ability to monitor, debug, and improve models post-deployment, including drift detection and retraining strategies.
- Experience optimizing pipelines and models for performance, reliability, and cost in production environments.
- Proven ability to translate business problems into ML solutions, defining success metrics and delivery milestones.
- Strong communication skills: able to explain ML outcomes and trade-offs to technical and non-technical audiences.
- Experience collaborating with cross-functional teams (engineering, operations, product). Nice-to-Have
- Experience in manufacturing, industrial, or supply-chain domains.
- Prior ownership of a technical roadmap or AI product lifecycle, including planning, prioritization, and delivery.
- GCP or AI/ML professional certifications (e.g., Google Professional Machine Learning Engineer, Professional Data Engineer).
- Background in consulting or client-facing delivery roles.
- Limited GenAI/LLM experience is acceptable when complemented by strong classical ML depth (e.g., LLMs used as components, not the core skillset).
Location
Sumpfstrasse 26, Zug, Switzerland
