Senior MLOps Engineer (AI LLM Platform)
Xcede · Berlin, Germany
Apply & track with Apply EdgeXcede are proud to partner with a global energy and sustainability business operating across 20 countries with over 6000 employees and €1+ billion in revenue. Their products and services are used across 14 million homes and commercial properties worldwide. The business is now actively scaling its AI engineering function and building out the platform that will run its production AI systems at scale.The RoleYou will join the AI platform engineering team responsible for the infrastructure that powers the company's AI agents and LLM applications. Your focus is deployment, reliability, observability, and cost, making sure models (both managed and self-hosted) run in production safely, predictably, and efficiently. You will set the operational standards the wider AI engineering function builds on.What You Will Be DoingBuilding and operating the deployment infrastructure for LLMs, supporting both managed and self-hosted modelsManaging cloud environments through infrastructure-as-code with TerraformContainerisation and orchestration with KubernetesOwning observability, monitoring, logging, and model and data drift detection across production systemsAdvising on model selection, performance, and operational cost optimisationEstablishing production standards: versioning, rollbacks, compliance, and securityWorking closely with AI engineers to move prototypes and models into stable, scalable productionWhat We Are Looking ForSenior LLMOps or MLOps experience operating ML and LLM systems in productionStrong infrastructure-as-code skills with TerraformKubernetes and container orchestration experienceCloud experience, ideally Azure (AWS or GCP also relevant)Solid Python and software engineering best practices (CI/CD, testing, code quality)A production mindset around monitoring, cost, governance, and reliabilityJava exposure is a plus, not a requirementMinimum C1 German with strong English essentialDetails€85,000 - €110,00030 days holiday per yearHybrid working with up to 50% remoteFlexible working hoursStrong learning and development culture with ongoing training opportunitiesAgile working environment within a globally established businessInterested? Reach out directly to Norman Cistovas at Xcede or apply below.