Hybrid
AI Ops Engineer - up to £85,000 + Benefits - Hybrid - Derby
Derby, Derbyshire
£75000.00 - £85000.00 per annum
Permanent
AI Ops Engineer Salary: Up to £85,000 Location: Hybrid - 3 days per week onsite in Derby Working Hours: Full time - Monday to FridayA globally renowned organisation is seeking an AI Ops Engineer to join a growing AI and data function, taking ownership of the operational backbone that enables AI systems to run reliably, securely and at scale in production environments. This AI Ops Engineer role sits at the intersection of machine learning, cloud infrastructure and DevOps, supporting the full lifecycle of AI solutions from deployment through to ongoing optimisation.The AI Ops Engineer position is well suited to an engineer who enjoys building resilient platforms, improving operational maturity and working closely with data scientists and software engineers to deliver production-grade AI capability.Responsibilities for the AI Ops Engineer:Design, build and operate deployment pipelines for AI models, prompts and supporting artefactsOwn lifecycle management including versioning, promotion, rollback and retirement of AI solutionsImplement monitoring and observability covering performance, usage, drift and data qualityEnsure AI systems meet security, compliance and governance requirementsOptimise inference performance, scalability and cost efficiencyManage infrastructure supporting training and inference including cloud platforms, containers and GPU resourcesEnable reproducibility through experiment tracking and artefact managementSupport incident response, root-cause analysis and resolution of AI-related failuresCollaborate with data scientists and software engineers to design scalable, reliable machine learning infrastructureDevelop and maintain CI/CD pipelines for machine learning workloadsMaintain standards for version control, testing and technical documentationWork with cross-functional teams to integrate AI solutions into existing platforms and workflowsStay current with advancements in MLOps, DevOps and AI operations, driving continuous improvementEssential Skills for the AI Ops Engineer:Strong experience operating machine learning or AI systems in production environmentsHands-on experience with CI/CD pipelines for data or ML workloadsExperience managing cloud-based infrastructure for AI workloadsSolid understanding of monitoring, observability and operational resilienceStrong collaboration skills with the ability to work across engineering and data teamsExperience supporting secure, compliant and well-governed systemsDesirable Skills for the AI Ops Engineer:Experience integrating Python-based services with modern front-end frameworksFamiliarity with MLOps practices for deploying, monitoring and managing AI systemsExposure to large-scale enterprise data environments or knowledge management systemsUnderstanding of Agile delivery practices and collaborative toolingKnowledge of data security, compliance and responsible AI principlesDomain exposure within engineering or manufacturing environmentsIf you are an AI Ops Engineer looking to take ownership of AI operations within a complex, production-focused environment, please apply in the immediate instance. AI, Artificial Intelligence