
Staff AI Platform Engineer

Discovered MENA · Abu Dhabi, Abu Dhabi Emirate, United Arab Emirates

Staff AI Platform Engineer - Abu Dhabi

Discover the Opportunity:

We’re partnering with a major government entity in Abu Dhabi that is building large-scale AI infrastructure and next-generation AI platforms to support digital transformation across the public sector.

This role sits within a central AI engineering function responsible for enabling production-grade AI systems at scale, supporting multiple engineering teams through modern platform, infrastructure, and observability capabilities.

This is a staff-level engineering opportunity for someone who enjoys building the foundations that power high-performance AI systems, from GPU inference infrastructure and vector databases through to deployment platforms and developer tooling.

Discover the Responsibilities:

- Design, build, and operate scalable AI platform infrastructure, including model serving, vector databases, embedding pipelines, and compute environments
- Develop and maintain GPU-based inference infrastructure to support low-latency, high-throughput AI workloads in production
- Build and operate data infrastructure including ingestion pipelines, object storage, vector stores, and ETL processes
- Design and maintain deployment platforms using containerisation, CI/CD pipelines, and infrastructure-as-code practices
- Implement observability across AI systems, including telemetry, logging, tracing, alerting, and AI-specific performance monitoring
- Build reusable internal tooling, deployment patterns, and platform abstractions that improve engineering productivity
- Define and enforce platform standards across reliability, scalability, security, and operational excellence
- Partner closely with engineering teams to translate infrastructure requirements into scalable platform capabilities
- Evaluate and implement new platform technologies to ensure long-term scalability and operational efficiency

Discover the Requirements:

- 10+ years of experience in platform engineering, infrastructure engineering, or backend systems engineering
- Strong experience with cloud platforms such as Azure, AWS, or GCP, including compute, networking, storage, and cost optimisation
- Deep expertise in Docker, Kubernetes, and containerised infrastructure supporting AI or high-scale workloads
- Strong experience building and operating CI/CD pipelines and infrastructure-as-code environments
- Experience designing and operating production-grade data pipelines and distributed systems
- Strong programming skills in Python, Java, Go, or similar backend technologies
- Strong understanding of PostgreSQL and production-scale database operations
- Hands-on experience with observability tooling including tracing, logging, metrics, and alerting
- Experience with GPU inference infrastructure, vector databases, RAG pipelines, or AI platform environments is highly desirable
- Strong communication skills with the ability to explain technical decisions, trade-offs, and operational considerations clearly