
Lead/Manager Together Cloud Infrastructure

Together AI · Amsterdam

On-site · Manager
GPU · Kubernetes · CI/CD · AWS · Azure · GCP · Docker · Python · Go

About the Role

Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastructure. As a Lead/Manager, you will play a key role in building the Together cloud platform engineering team in the Netherlands.

We run a highly available, global, blazing-fast cloud infrastructure that virtualizes cutting-edge ML hardware (GB200s/GB300s, BlueField DPUs) and provides state-of-the-art ML practitioners with self-serve AI cloud services, such as on-demand and managed Kubernetes and Slurm clusters. This platform serves both our internal SaaS products (inference, fine-tuning) and our external cloud customers, spanning dozens of data centers across the world.

Some of what you’ll work on:
Work on a distributed GPU scheduling system for the on-demand clusters product, Instant Clusters.
Build out a global management plane for managing our data center compute, networking, and storage.
Design and build new customer-facing cloud platform services, delivering killer enterprise AI cloud features.

Hybrid working, 2 days a week at our offices in Amsterdam.

Responsibilities

Lead and manage a team of 8 Together Cloud Infrastructure Engineers in Amsterdam
Identify, design, and develop foundational backend services that power Together’s commerce platform
Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
Partner with product teams to understand functional requirements and deliver solutions that meet business needs
Write clear, well-tested, and maintainable software and IaC for both new and existing systems
Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance
Participate in an on-call rotation to address critical incidents when necessary

Requirements

Ideally 1-2 years of experience leading an infrastructure team.
Demonstrable backgro
