Senior Systems Engineer, OS Automation

CoreWeave · Livingston, NJ / New York City, NY / Sunnyvale, CA / Bellevue, WA

On-site · Senior
RAG · Observability · GPU · Latency · Throughput · Kubernetes · Docker · Kubeflow · CI/CD · Python · Go · OpenAI API · Anthropic API · Embeddings · AWS · Azure · GCP

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at www.coreweave.com.

About the Role:

SysEng HAVOCK (Hardware - Acceleration - Virtualization - Operating Systems - Containerization - Kernel)

CoreWeave is looking for a Senior Systems Engineer who is ready to evolve beyond traditional DevOps. You will start by stabilizing and scaling our Linux OS and Kernel build pipelines. Once the foundation is set, you will lead the transition to AI-native infrastructure, building "smart" workflows that don't just report errors, but understand and fix them. You are a Systems Engineer at heart, but you are ready to apply LLMs, RAG, and predictive modeling to solve infrastructure challenges at scale.

Our Team’s Stack:

Languages: Python, Go, bash/sh
Observability: Prometheus, VictoriaMetrics, Grafana
OS & Kernel: Linux Kernel (custom build), Ubuntu
Hardware: Intel/AMD/ARM CPUs, NVIDIA GPUs, DPUs, InfiniBand and Ethernet NICs
Containerization: Docker, Kubernetes (k8s), KubeVirt, containerd, kubelet

Responsibilities:

Pipeline Architecture: Design, maintain, and automate reproducible OS image build pipelines for our massive fleet of GPU-accelerated servers.
Kernel Distribution: Collaborate with kernel engineers to package, validate, and distribute custom Linux builds across Intel, AMD, and ARM architectures.
Dependency Management: Build tooling to manage dependencies, versioning, and release workflows, ensuring hermetic builds.
Telemetry & Metrics: Standardize the collection of build metrics to create a baseline