Senior DevOps Engineer

Acceler8 Talent · San Francisco, CA

Senior DevOps EngineerAI Infrastructure for Superintelligence | Stealth Startup | San FranciscoWe're hiring first DevOps Engineer to join a well-funded AI infrastructure startup building the next generation of secure, scalable infrastructure for frontier AI systems.Backed by leaders from OpenAI, Anthropic, Google DeepMind, NVIDIA, CoreWeave, and Weights & Biases, the company is tackling one of the biggest challenges in AI today: building the infrastructure layer that enables safe, reliable, and efficient acceleration toward superintelligence.Founded by engineers with deep expertise across machine learning, distributed systems, security, and advanced computing, the team is building core infrastructure that leading AI companies will depend on in the coming years.This is an opportunity to join as one of the earliest engineers and take ownership of critical production systems, deployment infrastructure, reliability engineering, and developer platforms that power frontier AI workloads.You'll work on:CI/CD systems and developer productivity platformsKubernetes, container orchestration, and runtime infrastructureInfrastructure-as-Code and automated provisioning systemsCloud, on-prem, hybrid, and air-gapped deploymentsObservability, monitoring, and reliability engineering at scaleSecurity-focused infrastructure for advanced AI workloadsHigh-performance networking and distributed systems operationsInfrastructure automation across large-scale compute environmentsWhat We're Looking For:Strong experience building and operating production infrastructureDeep knowledge of Kubernetes, Docker, and containerized environmentsExperience with Infrastructure-as-Code tools such as Terraform, Pulumi, or CloudFormationStrong Linux systems expertise and automation skillsExperience designing CI/CD pipelines and developer workflowsAbility to operate independently and take ownership in a fast-moving environmentNice to Have:Experience with large-scale AI, ML, or HPC infrastructureBackground in distributed systems or systems programmingExperience with Prometheus, Grafana, Datadog, OpenTelemetry, or similar observability stacksKnowledge of networking, RDMA, RoCE, VPNs, and high-performance infrastructureExperience managing bare-metal environments and hardware-adjacent systemsProficiency in Go, Rust, C++, or PythonOpen-source contributions, technical writing, or public engineering projectsThis Role Is:100% onsite — Financial District, San FranciscoEarly-stage, high-ownership opportunityDirect collaboration with founders and world-class investorsMassive impact on the future of AI infrastructureCompensationCompetitive base salary + significant early-stage equity.