Inference & model-serving jobs

Serving roles — GPU, Triton, TensorRT, quantization, latency and throughput.

599 open roles

← All jobs

Software Engineer, Cloud Agents

OpenAI · San Francisco
$293k–385k/yr
New Hybrid OpenAI APIOrchestrationObservabilityPython

Senior Software Engineer, Vector Index Research

Zilliz · Redwood City
New Hybrid Senior MilvusGPUQuantizationEmbeddings

Staff Software Engineer, Inference Platform

Cerebras · Headquarters/Sunnyvale Office
New On-site Staff OrchestrationObservabilityGPULatency

Software Engineer, Foundations Search

OpenAI · San Francisco
$380k–555k/yr
New On-site ObservabilityLatencyThroughputEmbeddings

Software Engineer, Agent Infrastructure

OpenAI · San Francisco
$230k–385k/yr
New On-site Tool useOrchestrationKubernetesOpenAI API

Member of Technical Staff (Machine Learning Engineer)

Reka AI · Remote
New Remote Staff LatencyPyTorchTensorFlowAWS

Member of Technical Staff - Imagine Model

xAI · Palo Alto, CA; Seattle, WA
New On-site Staff DistillationPyTorchRayPython

Site Reliability Engineer, Inference Infrastructure

Cohere · Toronto
New Remote ObservabilityGPULatencyThroughput

Research Engineer, Infrastructure

Cognition · San Francisco
New On-site OrchestrationGPUThroughputPyTorch

Senior Backend Software Engineer, AI Observability & Evals Platform (LangSmith)

LangChain · San Francisco, CA
New On-site Senior LangChainLangSmithObservabilityThroughput

Engineering Manager, Inference

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site Manager GPUAnthropic APIDeep learningPython

Manager, Partner AI Deployment Engineering - AWS

OpenAI · San Francisco
$251k–335k/yr
New Hybrid Manager OrchestrationObservabilityAWSPython

Principal ML Platform Engineer

Synthesia · Europe
New Remote Principal OrchestrationObservabilityGPUKubernetes

Head of Forward Deployed Engineering

Snorkel AI · New York City, NY (Hybrid); Redwood City, CA (Hybrid); San Francisco, CA (Hybrid)
New Hybrid Manager OrchestrationPythonMulti-agentTool use

Member of Technical Staff, Search

Cohere · United States
New Remote Staff PyTorchTensorFlowPythonC++

Staff Software Engineer, Full-Stack - Enterprise Gen AI

Scale AI · New York, NY; San Francisco, CA
New On-site Staff AWSAzureGCPTypeScript

Software Engineer, Machine Learning

Glean · Bangalore, India
New On-site PythonGoC++JavaScript

Software Engineer, Agent (Brazilian Portuguese speaking)

Sierra · San Francisco, CA
$180k–390k/yr
New On-site RAGTypeScriptGoOpenAI API

Member of Technical Staff - RL Infrastructure

xAI · Palo Alto, CA
New On-site Staff ObservabilityCI/CDGoEval harnesses

Staff Software Engineer, AI Reliability Engineering

Anthropic · Dublin, IE
New On-site Staff ObservabilityLatencyAnthropic APIThroughput

Senior Software Engineer

LanceDB · HQ
$180k–250k/yr
New Remote Senior RAGPyTorchRayRust

Senior Software Engineer, SmithDB

LangChain · San Francisco, CA
New On-site Senior LangChainLangSmithObservabilityLatency

Senior Machine Learning Engineer

Cresta · Canada (Remote)
New Remote Senior Hugging FaceRAGEmbeddingsMulti-agent

Software Engineer- BIS (Baseten Inference Stack)

Baseten · San Francisco
$180k–360k/yr
New Hybrid vLLMOrchestrationObservabilityGPU

Software Engineer, GenAI

Abridge · SF Office
$255k–300k/yr
New Hybrid LangChainLlamaIndexTool useOrchestration

Agentic Risk Analyst

OpenAI · San Francisco
$288k–425k/yr
New Hybrid Tool useMulti-agentLatencyPython

Member of Technical Staff, MLE (Korea)

Cohere · Korea
New Remote Staff TensorFlowPythonDeep learningGPU

Research Engineer, Code RL (Reinforcement Learning)

Anthropic · San Francisco, CA | New York City, NY
New On-site Eval harnessesGPURLHFPyTorch

Member of Engineering (Reinforcement Learning Infrastructure)

Poolside · Remote (EMEA/East Coast)
New Remote vLLMObservabilityThroughputPyTorch

Staff Software Engineer, Agents

Harvey · New York
$231k–340k/yr
New Hybrid Staff ObservabilityLatencyPythonRAG

Senior Software Engineer, Backend

Harvey · Bengaluru
New Hybrid Senior LatencyAWSAzureGCP

Software Engineer, AI Infrastructure

Glean · Mountain View, CA
New On-site RAGOrchestrationPythonGo

Member of Technical Staff (AI Inference Engineer)

Perplexity · London
New On-site Staff OrchestrationObservabilityGPUTriton

Senior Software Engineer, Inference

Anthropic · London, UK
New On-site Senior OrchestrationObservabilityKubernetesAWS

Senior AI Product Engineer, Fullstack

Arize AI · Remote (United States)
New Remote Senior ArizeObservabilityPythonTypeScript

Research Engineer, Machine Learning

Mistral AI · Palo Alto
New Hybrid PyTorchTensorFlowDeep learningNLP

Member of Technical Staff

xAI · Palo Alto, CA
New On-site Staff Deep learningComputer visionKubernetesAWS

Senior Machine Learning Engineer

Cresta · United States (Remote)
New Remote Senior Hugging FaceRAGEmbeddingsMulti-agent

Manager of Applied AI Architecture, Partnerships

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site Manager AWSAzureGCPAnthropic API

Software Engineer, AI Infrastructure

Fireworks AI · San Mateo
New On-site vLLMPyTorchKubernetesSageMaker

Finance Systems & Automations Manager

Scale AI · San Francisco, CA
New On-site Manager Tool useOrchestrationLLM-as-judgeOpenAI API

AI Deployment Engineer, Cyber

OpenAI · San Francisco
$234k–260k/yr
New Hybrid Tool useThroughputCI/CDPython

Inference Technical Lead, On-Device Transformers

OpenAI · San Francisco
$445k/yr
New Hybrid Lead LatencyThroughputOpenAI APIGPU

Applied AI Engineer, Global Public Sector

Scale AI · Doha, Qatar; London, UK
New On-site Deep learningAWSAzureGCP

Applied AI Engineer

Cognition · San Francisco
$180k–225k/yr
New On-site MCPPythonTypeScriptJavaScript

AI Deployment Engineer, Startups

OpenAI · San Francisco
$198k–280k/yr
New On-site OpenAI APIThroughputPythonJavaScript

Software Engineer, Agent (Dutch speaking)

Sierra · London
£150k–315k/yr
New On-site RAGTypeScriptGoOpenAI API

Software Engineer, Product

Braintrust · San Francisco
New On-site ObservabilityTypeScriptJavaScriptCI/CD

Forward Deployed Security Engineer

OpenAI · Washington, DC
$293k–385k/yr
New Hybrid KubernetesAWSAzurePython

Conversational Modelling Research Engineer

Tavus · Remote
New Remote LatencyPyTorchDeep learningNLP

Software Engineer, Agents

Mercor · San Francisco or NYC
$130k–500k/yr
New On-site LangChainOrchestrationLangSmithObservability

Software Engineer - Infrastructure

Baseten · San Francisco
$165k–330k/yr
New Hybrid OrchestrationKubernetesPythonGo

ML Research Engineer, ML Systems

Scale AI · San Francisco, CA; Seattle, WA; New York, NY
New On-site Tool useRLHFPyTorchGPU

Member of Engineering (Pre-training / Data Engineering)

Poolside · Remote (EMEA/East Coast)
New Remote vLLMOrchestrationObservabilityGPU

Applied AI, Technical Lead, Forward Deployed AI Engineer - EMEA

Mistral AI · Paris
New Hybrid Lead LangChainHugging FaceRAGPyTorch

Senior Data Intelligence Engineer

Deepgram · USA | Remote
$165k–230k/yr
New Remote Senior RAGLatencyPythonEmbeddings

Member of Technical Staff (AI Infrastructure Engineer)

Perplexity · San Francisco
$220k–405k/yr
New On-site Staff OrchestrationObservabilityGPUPyTorch

Senior Software Engineer - Together Cloud Platform

Together AI · San Francisco
New On-site Senior GPUKubernetesCI/CDAWS

Staff Software Engineer, Inference

CoreWeave · Sunnyvale, CA / Bellevue, WA
New On-site Staff vLLMOrchestrationGPUTriton

Research Engineer / Research Scientist- Personal AGI (Post Training)

OpenAI · San Francisco
$295k–555k/yr
New On-site OpenAI APIPythonDeep learningRLHF

Lead Member of Technical Staff, Inference Infrastructure

Cohere · San Francisco
New Remote Staff GPULatencyThroughputNLP

Backend Engineer - API

xAI · Palo Alto, CA
New On-site vLLMOrchestrationObservabilityTensorRT

IC Agentic Engineering Manager - Stargate

OpenAI · San Francisco
$293k–385k/yr
New On-site Manager OrchestrationObservabilityOpenAI APITool use

AI Scientist - Audio

Mistral AI · Paris
New Hybrid PyTorchKubernetesRayPython

Engineering Manager, Fraud & Compliance

Mercor · San Francisco
$250k–400k/yr
New On-site Manager Eval harnessesLLM-as-judgeObservabilityLatency

Software Engineer, Agent

Sierra · Toronto
$180k–390k/yr
New On-site RAGTypeScriptGoOpenAI API

Software Engineer, ML Systems & Training Architecture

OpenAI · San Francisco
$295k–380k/yr
New On-site OpenAI APIGPUPythonDocker

Research Engineer, AI Safety & Alignment

Character.AI · Redwood City, CA
$225k–400k/yr
New Hybrid OrchestrationRLHFKubernetesDocker

Software Engineer, Agents

Harvey · New York
$161k–242k/yr
New Hybrid ObservabilityLatencyPythonRAG

Software Engineer, Model Inference

OpenAI · San Francisco
$295k–555k/yr
New On-site GPULatencyThroughputPyTorch

Open-Source Software, Machine Learning Engineer

Mistral AI · Paris
New Hybrid vLLMHugging FaceDistillationPyTorch

Senior AI Infrastructure Engineer, Model Serving Platform

Scale AI · San Francisco, CA; New York, NY
New On-site Senior vLLMOrchestrationObservabilityTensorRT

Senior Platform Engineer, Voice AI

Together AI · San Francisco
New On-site Senior OrchestrationObservabilityLatencyKubernetes

Software Engineer, Applied AI

Mercor · San Francisco
$130k–500k/yr
New On-site AWSAzureGCPPython

Research Scientist, Agent Robustness

Scale AI · San Francisco, CA; New York, NY
New On-site RLHFDPOEval harnessesGPU

Staff+ Software Engineer, Backend

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site Staff Tool useOrchestrationOpenAI APIMulti-agent

Full Stack Software Engineer, Codex

OpenAI · San Francisco
$255k–405k/yr
New Hybrid OrchestrationObservabilityTypeScriptRust

Senior/Staff Applied AI, Machine Learning Engineer

Mistral AI · Seoul
New On-site Staff RAGLLM-as-judgeTool useMulti-agent

Senior/Staff Software Engineer, Search & Retrieval Infrastructure

Pinecone · Tel Aviv
New Hybrid Staff RAGPineconeOrchestrationObservability

Staff Software Engineer - Developer Experience (DevEx)

Harvey · Bengaluru
New Hybrid Staff ObservabilityKubernetesDockerCI/CD

Full Stack Software Engineer, ChatGPT ImageGen

OpenAI · San Francisco
$185k–385k/yr
New Hybrid OrchestrationObservabilityLatencyTypeScript

Staff Software Engineer, AI Reliability Engineering

Anthropic · London, UK
New On-site Staff ObservabilityLatencyGPUThroughput

AI Applications Ops Lead, GPS

Scale AI · Doha, Qatar; London, UK
New On-site Lead OrchestrationObservabilityKubernetesRAG

Staff+ Software Engineer, Public Sector

Anthropic · Remote-Friendly, United States; San Francisco, CA | New York City, NY | Washington, DC
New Remote Staff GoAnthropic APILLM-as-judgeObservability

Senior AI Product Engineer, Backend

Arize AI · Remote (United States)
New Remote Senior OrchestrationArizeObservabilityKubernetes

Agent Development Manager

Decagon · Toronto
$130k–180k/yr
New On-site Manager OpenAI APIPythonTypeScriptJavaScript

Senior Software Engineer, Agents

Harvey · New York
$193k–290k/yr
New Hybrid Senior ObservabilityLatencyPythonRAG

Member of Technical Staff, AI Training Infrastructure

Fireworks AI · San Mateo
New On-site Staff OrchestrationPyTorchKubernetesDocker

Research Engineer/Scientist - Human Alignment, Consumer Devices

OpenAI · San Francisco
$380k–445k/yr
New Hybrid RLHFOpenAI APIEmbeddingsReranking

Member of Technical Staff (ML)

Reka AI · US, UK, Remote
New Remote Staff PyTorchDeep learningComputer visionNLP

Applied Machine Learning Engineer

Fireworks AI · San Mateo
New On-site RLHFPythonDeep learningGPU

X Developer Platform – Forward Deployed Engineer, X API

xAI · New York, NY; Palo Alto, CA
New On-site PythonTypeScriptJavaScriptRust

Applied AI Architect, Partnerships

Anthropic · Paris, France
New On-site AWSGCPAnthropic APICI/CD

Software Engineer, Data Infrastructure - Research

OpenAI · San Francisco
$250k–380k/yr
New On-site GPUOpenAI APIPythonPyTorch

Researcher, Post Training

Lovable · Stockholm
New On-site OrchestrationGPULatencyPyTorch

Performance Engineer, GPU

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site OrchestrationGPUTritonQuantization

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Scale AI · San Francisco, CA; New York, NY
New On-site Multi-agentGPURLHFPyTorch

Senior Software Engineer - Developer Experience

Snorkel AI · Redwood City, CA (Hybrid); San Francisco, CA (Hybrid)
New Hybrid Senior KubernetesDockerCI/CDAWS

Applied Scientist / Research Engineer (Internship)

Mistral AI · Paris
New On-site Intern RAGPyTorchPythonGPU

Staff Software Engineer, Inference Cloud

Cerebras · Headquarters/Sunnyvale Office
New On-site Staff OrchestrationObservabilityGPULatency

Engineering Manager - Forward Deployed Engineering (LLM)

Baseten · San Francisco
$260k–380k/yr
New Hybrid Manager vLLMHugging FaceObservabilityGPU

Staff Software Engineer, AI Reliability

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site Staff ObservabilityLatencyThroughputGPU

Software Engineer, Inference - Multi Modal

OpenAI · San Francisco
$295k–555k/yr
New On-site vLLMGPUTensorRTLatency

Member of Technical Staff (Software Engineer, Applied AI)

Perplexity · San Francisco
$220k–405k/yr
New On-site Staff PythonLLM-as-judgeEval harnessesRAG

Senior Staff Software Engineer, AI Model Lifecycle

Crusoe · San Francisco, CA - US
New On-site Staff vLLMOrchestrationGPULoRA

Senior Software Engineer, Applied AI

CoreWeave · New York, NY / Sunnyvale, CA / Bellevue, WA
New On-site Senior ObservabilityKubernetesDockerCI/CD

AI Deployment Strategist, AI4Engineering - EMEA

Mistral AI · Paris
New On-site PythonJavaScriptGoRAG

Research Engineer, Production Model Post-Training

Anthropic · Zürich, CH
New On-site RLHFDeep learningPythonAnthropic API

Software Engineer (Backend), Enterprise

Scale AI · Budapest, Hungary
New On-site OrchestrationObservabilityLatencyThroughput

Applied Machine Learning Research Scientist

Cerebras · US and Canada Offices
New On-site GPURLHFPyTorchDeep learning

Senior/Staff Software Engineer, Search & Retrieval Infrastructure

Pinecone · US Remote
$190k–270k/yr
New Remote Staff RAGPineconeOrchestrationObservability

Staff Research Engineer - Video Post Training

Synthesia · Europe
New Remote Staff QuantizationDPODistillationDeep learning

Applied AI, Senior/Staff Forward Deployed Machine Learning Engineer - Munich

Mistral AI · Munich
New On-site Staff RAGMulti-agentDeep learningLLM-as-judge

Research Engineer, Codex

OpenAI · San Francisco
$295k–445k/yr
New On-site Tool useMulti-agentObservabilityLatency

Applied Scientist / Research Engineer, AI4Engineering - EMEA

Mistral AI · Paris
New Hybrid RAGPyTorchDeep learningPython

Generative AI Inference Engineer

Stability AI · United States
New On-site OrchestrationTritonTensorRTPyTorch

Member of Technical Staff (Software Engineer)

Cerebras · Headquarters/Sunnyvale Office
New On-site Staff OrchestrationGPULatencyKubernetes

Staff Software Engineer, Inference Infrastructure

Cohere · San Francisco
New Hybrid Staff GPULatencyThroughputNLP

Senior Production Engineer

Crusoe · San Francisco, CA - US
New On-site Senior OrchestrationObservabilityLatencyKubernetes

AI Deployment Engineer

OpenAI · Sydney, Australia
New Hybrid OpenAI APIThroughputPythonJavaScript

Member of Technical Staff - Pre-Training

xAI · Palo Alto, CA
New On-site Staff GPUDeep learningNLPPyTorch

Staff Software Engineer, Voice Agent

Decagon · San Francisco
$200k–400k/yr
New Hybrid Staff ObservabilityLatencyDeep learningNLP

Deployed Engineer (Las Vegas)

LangChain · Las Vegas, NV
New Remote LangChainOrchestrationLangSmithObservability

Software Engineer, GPU Infrastructure (HPC)

Cohere · Canada
New Hybrid ObservabilityGPULatencyThroughput

Member of Technical Staff (AI Researcher)

Perplexity · San Francisco
$220k–485k/yr
New On-site Staff DPOPyTorchDeep learningPython

Member of Engineering (Pre-training / Synthetic Data)

Poolside · Remote (US)
New Remote GPUDeep learningPythonNLP

Machine Learning Scientist (All Levels)

Abridge · SF Office
$205k–300k/yr
New Hybrid GPUPyTorchTensorFlowDeep learning

Senior Machine Learning Engineer - Voice Experience

Cresta · United States (Remote)
New Remote Senior Hugging FaceRAGEmbeddingsObservability

Manager of Technical Staff, Sovereign AI

Cohere · Toronto
New On-site Staff PyTorchTensorFlowPythonDeep learning

Forward Deployed Engineer, Deepgram for Restaurants

Deepgram · New York City, NY
$197k–246k/yr
New On-site LatencyNLPGoPython

Training: ML Framework Engineer

OpenAI · San Francisco
$205k–445k/yr
New On-site OrchestrationObservabilityThroughputPython

Staff+ Software Engineer, Inference Runtime

Anthropic · Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
New Remote Staff GPUTritonLatencyThroughput

Product Engineer, GTM Growth Engineering

OpenAI · San Francisco
$230k–385k/yr
New On-site ThroughputGoOpenAI APIRAG

Research Engineer, Frontier Speculative Decoding

Together AI · San Francisco, New York City
New On-site GPUPyTorchKubernetesPython

Solutions Architect (Inference)

Together AI · London
New On-site GPUKubernetesDockerPython

Senior Software Engineer, Agent Orchestration

Decagon · San Francisco
$200k–400k/yr
New On-site Senior OrchestrationObservabilityLatencyThroughput

Research Engineer, Applied AI Engineering

OpenAI · San Francisco
$250k–555k/yr
New On-site DistillationPyTorchTensorFlowDeep learning

Research Engineer/Research Scientist, Audio

Anthropic · San Francisco, CA
New On-site LatencyThroughputPyTorchKubernetes

Machine Learning Systems Engineer, Research Tools

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site PythonAnthropic APIRAGGPU

Senior Software Engineer, Observability Insights

CoreWeave · New York, NY / Sunnyvale, CA
New On-site Senior LangChainMCPObservabilityKubernetes

Software Engineer, ML Research

Anysphere (Cursor) · San Francisco
New On-site GPUPythonDeep learningRLHF

Principal Software Engineer, AI Observability & Evals Platform

LangChain · Boston, MA
New On-site Principal LangChainLangSmithObservabilityThroughput

Software Engineer, Voice

Sierra · San Francisco, CA
$230k–390k/yr
New On-site LatencyTypeScriptGoPython

Software Engineer, RL Training Infra

OpenAI · San Francisco
$295k–445k/yr
New Hybrid Multi-agentOrchestrationLatencyThroughput

Research Intern, Inference (Fall 2026)

Together AI · San Francisco
New On-site Intern LatencyPyTorchDeep learningPython

Research, Mid-Training

Cognition · San Francisco
New On-site PyTorchDeep learningPythonGPU

Staff Software Engineer, AI Model Lifecycle

Crusoe · San Francisco, CA - US
New On-site Staff vLLMOrchestrationGPULoRA

Software Engineer, Workload Enablement

OpenAI · San Francisco
$293k–385k/yr
New Hybrid GPUPyTorchKubernetesPython

Platform Engineer, Model Shaping

Together AI · San Francisco
New On-site OrchestrationObservabilityGPUKubernetes

AI Deployment Engineer, Startups

OpenAI · New York City
$198k–280k/yr
New Hybrid OpenAI APIThroughputPythonJavaScript

Member of Technical Staff (Data Intelligence)

Reka AI · US, UK, Singapore, Remote
New Remote Staff OrchestrationThroughputPyTorchDeep learning

Software Engineer, Inference AI/ML

CoreWeave · Sunnyvale, CA / Bellevue, WA
New On-site vLLMGPUTritonTensorRT

Research Software Engineer - Paris/London

Mistral AI · Paris
New Hybrid OrchestrationObservabilityGPUKubernetes

Member of Technical Staff - Inference

xAI · Palo Alto, CA
New On-site Staff vLLMOrchestrationGPUTriton

Member of Technical Staff, Integration/RL Team (Research Engineer)

Cohere · Paris
New Remote Staff PyTorchKubernetesRayPython

Software Engineer, Agent (German speaking)

Sierra · London
£150k–315k/yr
New On-site RAGTypeScriptGoOpenAI API

Performance Engineer, Inference Systems

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site ObservabilityGPUQuantizationLatency

Senior Member of Technical Staff, Safety for Agents

Cohere · London
New Hybrid Staff PyTorchTensorFlowPythonDeep learning

ML Systems Engineer, Robotics

Scale AI · San Francisco, CA
New On-site OrchestrationObservabilityGPULatency

Research Engineer, Knowledge Team

Anthropic · Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
New Remote RAGPythonAnthropic APIDeep learning

Software Engineer, Backend (Warsaw)

Mistral AI · Warsaw
New Hybrid MCPObservabilityKubernetesDocker

Multimodal LLM Researcher (MLLM)

Pika · Palo Alto HQ
$185k–400k/yr
New On-site OrchestrationDistillationPyTorchTensorFlow

AI Field Engineer - AI Natives

Fireworks AI · San Mateo
New On-site vLLMGPUTensorRTQuantization

Staff Software Engineer, AI Data Platform

Labelbox · San Francisco Bay Area
New On-site Staff ThroughputKubernetesGCPPython

Software Engineer, Fleet Infrastructure

OpenAI · San Francisco
$230k–490k/yr
New On-site GPUKubernetesCI/CDAzure

Full Stack Engineer

Comet ML · Israel Hybrid, Europe Remote
New Hybrid RAGObservabilityThroughputCI/CD

Post-Training Research Engineer

Baseten · San Francisco
$200k–275k/yr
New Hybrid GPUDistillationPyTorchTensorFlow

Staff Research Engineer

Decagon · New York City
$200k–400k/yr
New On-site Staff OrchestrationPythonMulti-agentRAG

Member of Technical Staff (Forward Deployed Engineer, Applied AI)

Perplexity · New York City
$205k–335k/yr
New Hybrid Staff RAGOrchestrationLatencyPython

Software Engineer, Agent (Korean Speaking)

Sierra · Singapore
$295k–495k/yr
New On-site RAGTypeScriptGoOpenAI API

Senior Software Engineer, Core Infrastructure

Decagon · San Francisco
$200k–400k/yr
New On-site Senior ObservabilityGPULatencyKubernetes

Staff Software Engineer, Agent Orchestration

Decagon · New York City
$200k–400k/yr
New Hybrid Staff OrchestrationObservabilityLatencyMulti-agent

Applied AI Inference Engineer

Baseten · San Francisco
$165k–330k/yr
New Hybrid LatencyDockerPythonThroughput

Performance & Systems Engineer, Codex

OpenAI · San Francisco
$295k–445k/yr
New On-site OrchestrationLatencyThroughputGPU

Software Engineer, Model Routing & Inference

Anysphere (Cursor) · New York
New On-site GPULatencyThroughputKubernetes

Senior Research Engineer

Decagon · New York City
$200k–400k/yr
New On-site Senior OrchestrationPythonMulti-agentRAG

Customer Engineer

Modal · New York
$150k–220k/yr
New On-site GPULatencyPythonDocker

Applied AI Engineer, ML Infrastructure Engineer / Devops - EMEA

Mistral AI · Paris
New On-site GPUPyTorchTensorFlowKubernetes

Software Engineer, Agent (Cantonese Speaking)

Sierra · Singapore
$295k–495k/yr
New On-site RAGTypeScriptGoOpenAI API

Engineering Manager, FDE Infrastructure (UK)

Cohere · United Kingdom
New Remote Manager GPUKubernetesCI/CDAWS

Senior/Staff Applied Scientist/Research Engineer

Mistral AI · Seoul
New Hybrid Staff RAGPyTorchPythonGPU

Member of Engineering (Pre-training / Data Acquisition)

Poolside · Remote (EMEA/East Coast)
New Remote OrchestrationObservabilityThroughputKubernetes

Applied AI, Forward Deployed Machine Learning Engineer - Palo Alto

Mistral AI · Palo Alto
New On-site LangChainRAGPyTorchDeep learning

Member of Engineering (Evaluations / Engineering)

Poolside · Remote (EMEA/East Coast)
New Remote AWSAzureGCPPython

Software Engineer, Agent Evaluation and Quality

Anysphere (Cursor) · San Francisco
New On-site Eval harnessesLLM-as-judgeObservabilityPython

Senior Software Engineer, GenAI

Scale AI · San Francisco, CA; New York, NY
New On-site Senior LatencyRLHFPythonTypeScript

Senior Software Engineer, Inference

CoreWeave · Sunnyvale, CA / Bellevue, WA
New On-site Senior vLLMOrchestrationObservabilityGPU

Software Engineer, Backend

Braintrust · San Francisco
New On-site ObservabilityDockerAWSPython

Applied Scientist / Research Engineer - Singapore

Mistral AI · Singapore
New On-site RAGPyTorchPythonGPU

AI Deployment Engineer, Startups

OpenAI · Sydney, Australia
New On-site OpenAI APITool useMulti-agentOrchestration

Principal Research Engineer

Synthesia · Europe
New Remote Principal RLHFDPODeep learningGPU

Applied AI Engineer, Site Reliability Engineer - EMEA

Mistral AI · Paris
New Hybrid ObservabilityKubernetesAWSAzure

Deployed Engineer, Federal

Cognition · Washington DC
New Hybrid PythonTypeScriptAWSAzure

Machine Learning Fellow - Human Frontier Collective (Canada)

Scale AI · Canada
New On-site LangChainGPUPyTorchTensorFlow

Staff Backend Engineer, Dubbing

Synthesia · Europe
New Remote Staff OrchestrationObservabilityGoEval harnesses

Software engineer, agents (UK)

Writer · London, UK
New Hybrid OrchestrationObservabilityThroughputKubernetes

Simulation Applications Engineer

OpenAI · San Francisco
$230k–385k/yr
New On-site OrchestrationObservabilityGPUThroughput

Machine Learning Scientist

Suno · Boston
$160k–280k/yr
New On-site PyTorchDeep learningGPUTensorRT

Backend Software Engineer - Active Learning Team

Deepgram · USA | Remote
$150k–220k/yr
New Remote OrchestrationLatencyPythonRust

Full-Stack Engineer, AI Data Platform

Labelbox · San Francisco Bay Area
New On-site ThroughputRLHFKubernetesAWS

Senior Software Engineer Together Cloud Infrastructure

Together AI · Amsterdam
New On-site Senior ObservabilityGPUKubernetesCI/CD

Research Engineer, Universes

Anthropic · Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
New Remote GoAnthropic APIOrchestrationEval harnesses

Lead Full Stack Machine Learning Engineer

Cerebras · India Office
New On-site Lead GPUPyTorchTensorFlowDeep learning

Staff Software Engineer, Storage

Crusoe · San Francisco, CA - US
New On-site Staff LatencyThroughputPyTorchTensorFlow

Software Engineer - Training Product

Baseten · San Francisco
$165k–330k/yr
New Hybrid vLLMGPULoRAPyTorch

Director of Customer Engineering, Agent Builder

Decagon · San Francisco
$250k–320k/yr
New On-site Manager GoLLM-as-judgeMulti-agentOrchestration

AI Deployment Engineer, Messenger Integrations

OpenAI · San Francisco
$197k–292k/yr
New Hybrid OpenAI APILatencyGoPython

Staff Software Engineer, Public Sector

Scale AI · San Francisco, CA; St. Louis, MO; New York, NY; Washington, DC
New On-site Staff RAGOrchestrationKubernetesDocker

Software engineer, agents

Writer · New York City, NY
$112k–218k/yr
New Hybrid OrchestrationObservabilityThroughputKubernetes

Software Engineer, AI Data & Evaluation

Mercor · San Francisco
$130k–500k/yr
New On-site ThroughputEmbeddingsRAGObservability

Member of Technical Staff, Post-Training

Cohere · London
New Hybrid Staff PyTorchKubernetesRayPython

Software Engineer, Core Services

Anysphere (Cursor) · San Francisco
New On-site ObservabilityThroughputLatency

Research Platform Engineer

Mistral AI · Paris
New Remote OrchestrationObservabilityGPUCI/CD

Member of Technical Staff, Senior/Staff MLE

Cohere · San Francisco
New Remote Staff PythonRAGLLM-as-judgeEval harnesses

Engineering Manager, Saudi Arabia

Scale AI · Riyadh, Saudi Arabia
New On-site Manager GoLLM-as-judgeTool useMulti-agent

Research Engineer, Machine Learning Systems

Deepgram · USA | Remote
$150k–250k/yr
New Remote OrchestrationLatencyThroughputKubernetes

Software Engineer, Security

Sierra · San Francisco, CA
$200k–330k/yr
New On-site Tool useCI/CDPythonTypeScript

Machine Learning Fellow - Human Frontier Collective (UK)

Scale AI · United Kingdom
New On-site LangChainGPUPyTorchTensorFlow

Senior Software Engineer, Agents

Decagon · New York City
$200k–400k/yr
New On-site Senior PythonTypeScriptDeep learningComputer vision

AI Deployment Strategist - Netherlands

Mistral AI · Amsterdam
New On-site PythonJavaScriptGoAWS

Applied AI Architect, Applied AI (Digital Natives Business)

Anthropic · Munich, Germany
New On-site PythonAnthropic APIOpenAI APIRAG

Agent Post-Training Research

OpenAI · San Francisco
$295k–445k/yr
New On-site Tool useMulti-agentObservabilityLatency

Senior Forward Deployed Engineer (AI Agent) - Germany

Cresta · Berlin, Germany (Hybird)
New On-site Senior RAGVertex AICI/CDAWS

Senior Frontier Agents Engineer

Scale AI · San Francisco, CA; New York, NY
New On-site Senior OpenAI APILangChainLlamaIndexRAG

Software Engineer, Agent

Sierra · Singapore
$295k–495k/yr
New On-site RAGTypeScriptGoOpenAI API

AI Researcher, Core ML (Turbo)

Together AI · San Francisco
New On-site vLLMGPUTensorRTQuantization

Researcher, Training

OpenAI · San Francisco
$360k–440k/yr
New On-site Deep learningOpenAI APINLPGPU

Internship - Search Machine Learning Engineer

Perplexity · London
New On-site Intern RAGPyTorchTensorFlowNLP

Member of Technical Staff (Software Engineer, Acceleration)

Perplexity · San Francisco
$250k–405k/yr
New On-site Staff PythonTypeScriptGoRust

Machine Learning Engineer, Global Public Sector

Scale AI · Doha, Qatar; London, UK
New On-site RAGMulti-agentGPULatency

Anthropic Fellows Program

Anthropic · London, UK; Ontario, CAN; Remote-Friendly, United States; San Francisco, CA
New Remote Deep learningPythonGoAnthropic API

Engineering Manager, FDE Infrastructure (NORAM)

Cohere · Canada
New Remote Manager GPUKubernetesCI/CDAWS

Senior ML Engineer, Dubbing

Synthesia · Europe
New Remote Senior NLPGoEval harnessesLLM-as-judge

Member of Technical Staff (AI Infrastructure Engineer)

Perplexity · London
New Hybrid Staff OrchestrationObservabilityGPUPyTorch

Software Engineer, Site Reliability (SRE)

Sierra · San Francisco, CA
$230k–390k/yr
New On-site OrchestrationObservabilityCI/CDAWS

AI Engineer, Model Quality and Performance

Cerebras · Headquarters/Sunnyvale Office
New On-site GPUDockerEval harnessesLLM-as-judge

Applied Scientist / Research Engineer

Mistral AI · Seoul
New On-site RAGPyTorchPythonDeep learning

Software Engineer, Safeguards Foundations (Internal Tooling)

Anthropic · London, UK
New On-site ThroughputCI/CDAWSGCP

Software Engineer, Model Performance Systems

Baseten · San Francisco
$160k–200k/yr
New Hybrid OrchestrationObservabilityGPUQuantization

Forward Deployed Engineering Intern (AI Agent)

Cresta · Toronto, Canada (Hybrid)
New Hybrid Intern Vertex AIPythonTool useOpenAI API

Staff Fullstack Engineer, Avatars

Synthesia · Europe
New Remote Staff GoPythonTypeScriptJavaScript

AI Solutions Engineer

Baseten · San Francisco
$165k–330k/yr
New Hybrid LatencyDockerPythonObservability

Software Engineer, Cybersecurity Products

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA; Washington, DC
New On-site GoAnthropic APIOpenAI APITool use

Machine Learning Infrastructure Engineer

Character.AI · Redwood City, CA
$150k–350k/yr
New Hybrid GPUPyTorchTensorFlowKubernetes

Principal Engineer, AI Inference Reliability

Cerebras · US and Canada Offices
New On-site Principal ObservabilityGPULatencyPython

Machine Learning Engineer - Inference

Together AI · San Francisco
New On-site vLLMTritonTensorRTPyTorch

Senior AI Infrastructure Engineer - Training Platform

Scale AI · San Francisco, CA; Seattle, WA; New York, NY
New On-site Senior OrchestrationObservabilityGPUThroughput

Senior Systems Engineer, OS Automation

CoreWeave · Livingston, NJ / New York City, NY/ Sunnyvale, CA/ Bellevue, WA
New On-site Senior RAGObservabilityGPULatency

Research Staff, LLMs

Deepgram · USA | Remote
$150k–250k/yr
New Remote Staff LatencyRLHFPyTorchDeep learning

Forward Deployed Engineer, Agentic Platform (Singapore)

Cohere · Singapore
New Remote RAGOrchestrationLatencyPython

Staff Frontier Agents Engineer

Scale AI · San Francisco, CA; New York, NY
New On-site Staff OpenAI APILangChainLlamaIndexRAG

Head of Decision Intelligence

Deepgram · USA | Remote
New Remote Manager LatencyPythonLLM-as-judgeTool use

Member of Technical Staff (GPU Performance Engineer)

Reka AI · US, UK, Singapore, Remote
New Remote Staff GPUPyTorchDeep learningKubernetes

Research Engineer - Environments, Data and Post-Training

Mercor · San Francisco
$130k–500k/yr
New On-site Tool useDeep learningNLPPython

Senior Software Engineer - AI / ML

Snorkel AI · Redwood City, CA (Hybrid); San Francisco, CA (Hybrid)
New Hybrid Senior Multi-agentOrchestrationAWSAzure

Technical Lead Manager, Physical AI

Scale AI · San Francisco, CA
New On-site Manager GPUPyTorchDeep learningPython

Software Engineer, Inference Platform

Cerebras · Headquarters/Sunnyvale Office
New On-site OrchestrationObservabilityGPULatency

Software Engineer, Accelerators

OpenAI · San Francisco
$295k–380k/yr
New On-site PyTorchGPUTensorRTQuantization

GenAI Strategic Projects Lead, Public Sector

Scale AI · Washington, DC
New On-site Lead RLHFRAGEmbeddingsReranking

Full Stack Software Engineer, ChatGPT Finances

OpenAI · San Francisco
$293k–325k/yr
New Hybrid ObservabilityPythonTypeScriptGo

Applied AI Engineer, Codex Core Agent

OpenAI · San Francisco
$230k–385k/yr
New On-site LatencyPythonOpenAI APIEval harnesses

Senior Software Engineer, Agent Orchestration

Decagon · New York City
$200k–400k/yr
New On-site Senior OrchestrationObservabilityLatencyMulti-agent

Engineering Manager (API Platform)

Perplexity · San Francisco
$300k–405k/yr
New On-site Manager OrchestrationLatencyPythonGo

Software Engineer, Codex Core Agents

OpenAI · San Francisco
$230k–385k/yr
New On-site OrchestrationLatencyCI/CDTool use

Software Engineer, API Multicloud

OpenAI · San Francisco
$293k–385k/yr
New Hybrid ObservabilityAWSPythonTypeScript

Senior Research Engineer - Interactive Avatars

Synthesia · Europe
New Remote Senior PyTorchComputer visionPythonGo

Software Engineer, Enterprise

Scale AI · London, UK
New On-site OrchestrationObservabilityLatencyThroughput

Forward Deployed Engineer - EMEA

Anysphere (Cursor) · London
New Remote LatencyPythonTypeScriptJavaScript

AI Scientist - Warsaw

Mistral AI · Warsaw
New Hybrid PyTorchKubernetesRayPython

Backend Software Engineer - Engine Team (Voice Agent)

Deepgram · USA | Remote
$150k–220k/yr
New Remote RAGOrchestrationObservabilityLatency

Senior Forward Deployed Engineer (AI Agent) - UK

Cresta · United Kingdom (Remote)
New Remote Senior RAGVertex AICI/CDAWS

Software Engineering Manager, AI Observability & Evals Platform (New York, NY)

LangChain · New York, NY
New On-site Manager LangChainLangSmithObservabilityThroughput

Tech Lead Manager, Agentic Runtime

Glean · Mountain View, CA
New On-site Manager OrchestrationObservabilityLatencyKubernetes

Senior Member of Technical Staff, MLE (Middle East)

Cohere · Riyadh
New Remote Staff TensorFlowPythonGPUDeep learning

Staff + Senior Software Engineer, Inference Deployment

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site Staff OrchestrationObservabilityGPUThroughput

Software Engineer, Agent Builder

Sierra · San Francisco, CA
$230k–390k/yr
New On-site TypeScriptGoOpenAI APIAnthropic API

Research Engineer, RL Scaling Science

Anthropic · London, UK
New On-site PythonAnthropic APIDeep learningGPU

Member of Technical Staff (Product Engineer)

Reka AI · US, UK, Remote
New Remote Staff EmbeddingsObservabilityLatencyPython

Staff Software Engineer, Backend

Harvey · Bengaluru
New Hybrid Staff LatencyAWSAzureGCP

Machine Learning Engineering Intern

Cresta · Toronto Canada
New On-site Intern RAGPyTorchTensorFlowDeep learning

Software Engineer, Ray Data

Anyscale · Bengaluru, Karnataka
New On-site RayPythonC++GPU

Research Engineer, Interpretability

Anthropic · San Francisco, CA
New On-site GPUThroughputPyTorchPython

RE/RS, Data Understanding - Foundations

OpenAI · San Francisco
$350k–555k/yr
New On-site Deep learningOpenAI APIPythonGPU

Research Engineer, Science of Scaling

Anthropic · London, UK
New On-site Deep learningKubernetesPythonComputer vision

Member of Technical Staff - Sovereign AI

Cohere · Canada
New Remote Staff GPUPythonDeep learning

GTM Systems Analyst

Scale AI · San Francisco, CA
New On-site MCPOpenAI APIAnthropic APIRAG

Senior Backend Engineer, Inference Platform

Together AI · San Francisco
New On-site Senior vLLMOrchestrationGPUTriton

Research Internship (Fall, 2026)

Cohere · Canada
New Remote Intern PyTorchTensorFlowDeep learningNLP

Senior Software Engineer, Backend

Harvey · San Francisco
$193k–290k/yr
New Hybrid Senior GPUDeep learningRAG

Forward Deployed Engineer - ML

Modal · New York
$180k–250k/yr
New On-site vLLMGPULatencyRLHF

Security Engineer, Agent Security

OpenAI · San Francisco
$234k–385k/yr
New On-site AWSAzureGCPPython

Applied AI, Technical Lead, Forward Deployed AI Engineer - Munich

Mistral AI · Munich
New On-site Lead LangChainHugging FaceRAGPyTorch

Research Engineer, Cybersecurity RL (Reinforcement Learning)

Anthropic · San Francisco, CA | New York City, NY
New On-site Anthropic APIDeep learningRLHFPython

Member of Technical Staff (AI Software Engineer, Multimodal)

Perplexity · San Francisco
$220k–405k/yr
New On-site Staff GoRustC++Computer vision

Forward Deployed Engineer

Tavus · San Francisco
New On-site PythonTypeScriptJavaScriptGo

Forward Deployed Engineer - ML

Modal · Stockholm
New On-site vLLMGPULatencyRLHF

Senior Software Engineer, Database Systems

Zilliz · Redwood City
New Hybrid Senior MilvusGPUKubernetesDocker

Research Engineer, RL Engineering

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site RLHFPythonAnthropic APIPyTorch

Software Engineer, Voice Agents / AI - Deepgram for Restaurants

Deepgram · San Francisco, CA
$154k–307k/yr
New Remote LatencyKubernetesDockerAWS

Software engineer, generative AI

Writer · San Francisco, CA
$112k–304k/yr
New Hybrid PineconeWeaviatepgvectorLatency

Member of Technical Staff - Imagine Product

xAI · Palo Alto, CA
New On-site Staff ObservabilityLatencyThroughputKubernetes

Software Engineer, Agent (Italian speaking)

Sierra · London
£150k–315k/yr
New On-site RAGTypeScriptGoOpenAI API

Software Engineer, Platform

Scale AI · London, UK
New On-site EmbeddingsKubernetesAWSAzure

Research Engineer, Data Infrastructure

Mistral AI · Paris
New Hybrid OrchestrationKubernetesPythonCI/CD

Staff Infrastructure Software Engineer, Enterprise AI

Scale AI · New York, NY; San Francisco, CA
New On-site Staff Multi-agentOrchestrationObservabilityLatency

Anthropic Fellows Program, ML Systems & Performance

Anthropic · London, UK; Ontario, CAN; Remote-Friendly, United States; San Francisco, CA
New Remote PythonGoAnthropic APIGPU

Research Engineer, Machine Learning (RL Velocity)

Anthropic · London, UK
New On-site PyTorchAnthropic APIDeep learningGPU

Staff + Senior Software Engineer, Inference

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site Staff OrchestrationObservabilityKubernetesAWS

Researcher, Training - London

OpenAI · London, UK
£170k–445k/yr
New On-site Deep learningOpenAI APINLPGPU

Applied AI, Forward Deployed Machine Learning Engineer, Critical and Sovereign Institutions, EMEA

Mistral AI · Paris
New On-site LangChainRAGPyTorchDeep learning

Member of Technical Staff

xAI · Palo Alto, CA
New On-site Staff OrchestrationQuantizationPyTorchTensorFlow

Applied Scientist / Domain Expert, AI4Engineering - EMEA

Mistral AI · Paris
New Hybrid Deep learningNLPComputer visionPython

Research Engineer, Machine Learning - Paris/London/Zurich/Warsaw

Mistral AI · Paris
New Hybrid PyTorchTensorFlowDeep learningNLP

Member of Technical Staff, Software Engineer

Fireworks AI · San Mateo
New On-site Staff OrchestrationPythonTool useMulti-agent

Manager, Forward Deployed Engineering

Snorkel AI · New York City, NY (Hybrid); Redwood City, CA (Hybrid); San Francisco, CA (Hybrid)
New Hybrid Manager OrchestrationThroughputPythonObservability

Software Engineer, ML Infrastructure

Anysphere (Cursor) · San Francisco
New On-site GPUThroughputKubernetesRay

Senior Software Engineer, Inference

Anthropic · Dublin, IE
New On-site Senior OrchestrationObservabilityKubernetesAWS

Software Engineer, Agent Data Platform

Sierra · San Francisco, CA
$230k–390k/yr
New On-site OrchestrationLatencyPythonTypeScript

Technical Program Manager, Inference Performance

Anthropic · San Francisco, CA | Seattle, WA
New On-site Manager Anthropic APILatencyThroughputGPU

Principal Engineer, Inference Cloud

Cerebras · Headquarters/Sunnyvale Office
New On-site Principal OrchestrationObservabilityGPULatency

Software Engineer, Bugbot

Anysphere (Cursor) · San Francisco
New On-site OrchestrationObservabilityLatencyCI/CD

Research Engineer / Research Scientist, Tokens

Anthropic · New York City, NY; New York City, NY | Seattle, WA; San Francisco, CA
New On-site ThroughputPyTorchKubernetesAnthropic API

AI Field Engineer - Enterprise

Fireworks AI · San Mateo
New On-site vLLMGPUTensorRTQuantization

Software Engineer - Deepgram for Restaurants

Deepgram · San Francisco, CA
$154k–307k/yr
New On-site LatencyKubernetesDockerAWS

Manager, Applied AI Engineering, Life Sciences (Beneficial Deployments)

Anthropic · San Francisco, CA | New York City, NY
New On-site Manager MCPGoAnthropic APIOrchestration

Staff + Sr. Software Engineer, Cloud Inference

Anthropic · San Francisco, CA
New On-site Staff OrchestrationObservabilityKubernetesCI/CD

AI Researcher (Multimodal Audio/Video Generation)

Tavus · San Francisco
New On-site GPUPyTorchPythonDeep learning

Member of Engineering (Reinforcement Learning)

Poolside · Remote (EMEA/East Coast)
New Remote PyTorchDeep learningPythonRLHF

Member of Technical Staff, Forward Deployed

Vapi · San Francisco
$200k–280k/yr
New Hybrid Staff RAGPineconepgvectorKubernetes

Field Engineering Intern - Summer 2026

Lambda · San Francisco Office (Second St)
New Hybrid Intern vLLMOrchestrationGPUTensorRT

Staff Software Engineer, Backend

Cresta · Canada (Remote)
New Remote Staff RAGOrchestrationObservabilityLatency

Member of Technical Staff - ML Training Systems

Modal · New York
$150k–350k/yr
New On-site Staff GPULatencyPythonPyTorch

Software Engineer, Agentic Runtime

Glean · Mountain View, CA
New On-site OrchestrationObservabilityLatencyKubernetes

AI Scientist - Robotics

Mistral AI · Paris
New Hybrid PyTorchPythonDeep learningComputer vision

Forward Deployed Engineer, Agentic Platform (UK/Europe)

Cohere · Europe
New Hybrid RAGOrchestrationLatencyPython

Senior Member of Technical Staff, Multimodal AI

Cohere · San Francisco
New Remote Staff GPUPyTorchTensorFlowDeep learning

Agent Engineer, TLM

Sierra · New York, NY
$280k–410k/yr
New On-site RAGTypeScriptGoPython

Support Operations Engineer

Harvey · Remote
$104k–156k/yr
New Remote RAGPythonTool useEmbeddings

Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI

Scale AI · San Francisco, CA; New York, NY
New On-site Deep learningNLPLLM-as-judgeEval harnesses

Applied Research Intern

Weaviate · Europe
New On-site Intern vLLMRAGWeaviateTensorRT

Customer Support Engineer (Inference), India

Together AI · India
New On-site GPUKubernetesPythonTypeScript

ML Ops Infrastructure Engineer

Deepgram · USA | Remote
$160k–220k/yr
New Remote ObservabilityGPUTritonTensorRT

Staff Inference ML Runtime Engineer

Cerebras · US and Canada Offices
New On-site Staff vLLMObservabilityGPUTensorRT

Principal ML Investigator

Cerebras · Headquarters/Sunnyvale Office
New On-site Principal GPUTritonThroughputTensorRT

Research Engineer, Machine Learning (Reinforcement Learning)

Anthropic · San Francisco, CA | New York City, NY
New On-site Tool useGPUPyTorchTensorFlow

Member of Technical Staff, Backend

Vapi · San Francisco
$200k–280k/yr
New Hybrid Staff ObservabilityThroughputTypeScriptPython

Product Engineer - Dedicated Inference

Baseten · San Francisco
$165k–330k/yr
New Hybrid ObservabilityKubernetesPythonJavaScript

Research Engineer, Discovery

Anthropic · San Francisco, CA
New On-site OrchestrationGPUThroughputPyTorch

Senior Software Engineer, Backend - Platform Team

Cresta · Canada (Remote)
New Remote Senior ObservabilityLatencyKubernetesDocker

Solutions Architect (NYC)

LangChain · New York, NY
New On-site LangChainRAGMulti-agentLangSmith

Member of Technical Staff, Core Backend

Vapi · San Francisco
$180k–265k/yr
New Hybrid Staff ObservabilityLatencyTypeScriptCI/CD

Senior Software Engineer (SDK)

Langfuse · Europe
€90k–160k/yr
New Hybrid Senior LangChainLlamaIndexObservabilityDocker

Forward Deployed Engineer - Systems

Modal · New York
$180k–240k/yr
New On-site OrchestrationGPULatencyKubernetes

AI Systems Engineer, Codex Agents

OpenAI · San Francisco
$230k–385k/yr
New On-site OrchestrationObservabilityGPULatency

AI Scientist - Zurich

Mistral AI · Zurich
New Hybrid PyTorchKubernetesRayPython

AI Engineer, Product

Mistral AI · Paris
New On-site OrchestrationObservabilityLatencyPython

Software Engineer, Intelligence

Sierra · San Francisco, CA
$230k–390k/yr
New On-site PythonDeep learningNLPRAG

AI Application Engineer, APJ

Arize AI · Remote (Singapore)
New Remote LangChainArizeObservabilityPython

Software Engineer, Product

Anysphere (Cursor) · San Francisco
New On-site Tool useMulti-agentOrchestrationRAG

Senior Full-Stack Software Engineer, (Forward Deployed), GPS

Scale AI · London, UK
New On-site Senior KubernetesDockerAWSAzure

Member of Technical Staff (Rust Engineer, Search)

Perplexity · Belgrade
New On-site Staff OrchestrationLatencyThroughputAWS

Lead/Manager Together Cloud Infrastructure

Together AI · Amsterdam
New On-site Manager GPUKubernetesCI/CDAWS

Member of Technical Staff (AI Software Engineer, Agents)

Perplexity · San Francisco
$220k–405k/yr
New On-site Staff PythonTypeScriptGoRust

Staff Software Engineer, AI Platform

Harvey · New York
$231k–340k/yr
New Hybrid Staff RAGEmbeddingsRerankingEval harnesses

Full-Stack Software Engineer, Reinforcement Learning

Anthropic · San Francisco, CA | New York City, NY
New On-site ObservabilityThroughputDockerCI/CD

Software Engineer, Productivity - Inference Runtime

OpenAI · San Francisco
$230k–385k/yr
New On-site ObservabilityGPULatencyCI/CD

Tokens-as-a-Service (Taas) Software Engineer

OpenAI · San Francisco
$293k–385k/yr
New Hybrid ObservabilityGPUThroughputOpenAI API

Research Scientist, AI Controls and Monitoring

Scale AI · San Francisco, CA; New York, NY
New On-site ObservabilityRLHFDPOLatency

Senior Backend Engineer (Data Infrastructure)

Langfuse · Europe
€90k–160k/yr
New Hybrid Senior ObservabilityLatencyThroughputDocker

Senior Forward-Deployed Engineer, Federal

Deepgram · Washington D.C.
$160k–200k/yr
New On-site Senior LatencyNLPKubernetesAWS

Principal Solutions Engineer, Enterprise

Scale AI · New York, NY; San Francisco, CA
New On-site Principal PythonDeep learningNLPLLM-as-judge

Full Stack Software Engineer, Codex Cloud Apps

OpenAI · San Francisco
$230k–325k/yr
New On-site PythonTypeScriptJavaScriptOpenAI API

Senior Software Engineer, Core Infrastructure

Decagon · New York City
$200k–400k/yr
New On-site Senior ObservabilityGPULatencyKubernetes

Software Engineer, Agent (Spanish speaking)

Sierra · London
£150k–315k/yr
New On-site RAGTypeScriptGoOpenAI API

Sr. Member of Technical Staff

Cerebras · Headquarters/Sunnyvale Office
New On-site Staff OrchestrationGPULatencyKubernetes

Member of Technical Staff - Product (Backend)

Modal · New York
$150k–300k/yr
New On-site Staff ObservabilityGPULatencyPython

Engineering - Internal AI Transformation

ElevenLabs · United States
New Remote RAGOrchestrationKubernetesDocker

Software Engineer, Agent - Healthcare

Sierra · San Francisco, CA
$180k–390k/yr
New On-site RAGTypeScriptGoOpenAI API

AI Deployment Strategist - Paris

Mistral AI · Paris
New On-site PythonJavaScriptGoGPU

Engineering Manager, Active Learning

Deepgram · USA | Remote
$180k–220k/yr
New Remote Manager LatencyPythonDeep learningNLP

Member of Technical Staff, Document Understanding

LlamaIndex · San Francisco
$180k–250k/yr
New Hybrid Staff LlamaIndexvLLMRAGTensorRT

Engineering Manager, Multimodal (API)

OpenAI · San Francisco
$293k–385k/yr
New On-site Manager Computer visionOpenAI APIGPUDeep learning

Engineering Site Lead

Perplexity · London
New On-site Lead OrchestrationGPUKubernetesCI/CD

Member of Technical Staff - Multimodal Understanding

xAI · Palo Alto, CA
New On-site Staff Tool useOrchestrationGPUThroughput

Forward Deployed Software Engineer - London

OpenAI · London, UK
New Hybrid OpenAI APIPythonTypeScriptJavaScript

Applied AI Inference - Forward Deployed Engineer

Baseten · San Francisco
$165k–330k/yr
New Hybrid LatencyDockerPythonThroughput

ML Research Engineer (Inference)

Cerebras · India Office
New On-site vLLMHugging FaceGPUQuantization

Member of Technical Staff - Voice Model

xAI · Palo Alto, CA
New On-site Staff LatencyPyTorchKubernetesRay

Research Engineer, Knowledge Foundations

Anthropic · San Francisco, CA
New On-site ObservabilityPythonAnthropic APIOpenAI API

Member of Technical Staff, Modeling

Cohere · London
New Remote Staff TensorFlowPythonGPUDeep learning

Engineering Manager, Defense

Scale AI · Doha, Qatar
New On-site Manager LLM-as-judgeEval harnessesTool useOrchestration

Applied AI, Senior/Staff Forward Deployed Machine Learning Engineer - Singapore

Mistral AI · Singapore
New On-site Staff RAGTool useGPUPyTorch

Software Engineer, Inference - Performance Optimization

OpenAI · San Francisco
$295k–555k/yr
New On-site LatencyThroughputGPUPython

Senior Research Engineer

Decagon · San Francisco
$200k–400k/yr
New On-site Senior OrchestrationPythonMulti-agentRAG

Infrastructure Solution Architect - EMEA

Mistral AI · Paris
New On-site GPUKubernetesDockerCI/CD

Applied AI Architect Lead, EMEA Commercial

Anthropic · Dublin, IE
New On-site Lead GoAnthropic APIPythonTypeScript

Senior Software Engineer, AI Platform

Harvey · New York
$220k–300k/yr
New On-site Senior Multi-agentOrchestrationEval harnessesLLM-as-judge

Forward Deployed Engineer - Dublin

OpenAI · Dublin, Ireland
New Hybrid PythonJavaScriptOpenAI APILLM-as-judge

Engineering Manager, Global Public Sector

Scale AI · London, UK
New On-site Manager RAGEmbeddingsEval harnessesLLM-as-judge

Staff Software Engineer- AI Workload Orchestration

CoreWeave · Sunnyvale, CA / Bellevue, WA
New On-site Staff OrchestrationObservabilityGPUKubernetes

Member of Engineering (Scalability)

Poolside · Remote (EMEA/East Coast)
New Remote ObservabilityGPUPyTorchDeep learning

Staff Machine Learning Engineer

Cresta · United States (Remote)
New Remote Staff Hugging FaceRAGEmbeddingsTool use

Product Design Intern for AI and Human Agents Platform

Cresta · Toronto, Canada
New On-site Intern Vertex AITool useOpenAI APIPython

AI Deployment Strategist - Munich, Germany

Mistral AI · Munich
New On-site PythonJavaScriptGoGPU

Senior Member of Technical Staff, Synthetic Data

Cohere · Toronto
New Remote Staff vLLMGPUTensorRTThroughput

Software Engineer, ML Data Systems

Anysphere (Cursor) · San Francisco
New On-site OrchestrationRayPythonCI/CD

Researcher, Agent Post-Training, API & Power-Users

OpenAI · San Francisco
$295k–445k/yr
New Hybrid Tool useMulti-agentObservabilityLatency

Research Engineer, Machine Learning (Reinforcement Learning)

Anthropic · London, UK
New On-site Tool useGPUPyTorchTensorFlow

Member of Technical Staff, Performance Optimization

Fireworks AI · San Mateo
New On-site Staff OrchestrationGPUTritonQuantization

Staff Engineer, Command Center Insights & Actions

Crusoe · San Francisco, CA - US
New On-site Staff ObservabilityGPUGoRust

Senior Solutions Engineer, Federal

Deepgram · Washington D.C.
$160k–200k/yr
New On-site Senior LatencyNLPKubernetesAWS

Software Engineer - Voice AI (Inference Runtime)

Baseten · San Francisco
$165k–330k/yr
New Hybrid vLLMOrchestrationGPUTensorRT

Analytics & Automation Lead, User Safety & Risk Operations

OpenAI · San Francisco
$302k–335k/yr
New Hybrid Lead OpenAI APIPythonTypeScriptJavaScript

Software Engineer

Cognition · San Francisco
New On-site Tool useOrchestrationLatencyPython

Software Engineering Manager, AI Observability & Evals Platform (San Francisco, CA)

LangChain · San Francisco, CA
New On-site Manager LangChainLangSmithObservabilityThroughput

Research Engineer, Pretraining

Anthropic · London, UK
New On-site ThroughputPyTorchDeep learningKubernetes

Machine Learning Fellow - Human Frontier Collective (US)

Scale AI · United States
New On-site LangChainGPUPyTorchTensorFlow

Technical Deployment Lead, Forward Deployed Engineering (FDE) - NYC

OpenAI · New York City
$198k–294k/yr
New Hybrid Lead OpenAI APIPythonTypeScriptJavaScript

Audio Inference Engineer, Model Efficiency

Cohere · New York
New Remote vLLMGPULatencyThroughput

Software Engineer Intern

Cresta · Toronto Canada
New On-site Intern ObservabilityLatencyThroughputVertex AI

Senior Backend Engineer, LangSmith Deployments

LangChain · Boston, MA
New On-site Senior LangChainMulti-agentOrchestrationLangSmith

Solutions Architect, National Security

Anthropic · Washington, DC
New On-site PythonAnthropic APITool useObservability

Engineering Manager, MLE

OpenAI · San Francisco
$293k–385k/yr
New On-site Manager DistillationPyTorchTensorFlowDeep learning

Senior Research Engineer - Video Foundation Models (Pre - Training)

Synthesia · Europe
New Remote Senior GPULatencyPyTorchDeep learning

Staff Research Engineer, Model Efficiency

Cohere · New York
New Remote Staff GPULatencyThroughputQuantization

AI Inference Internship

Perplexity · London
New Hybrid Intern EmbeddingsGPUTritonQuantization

Software Engineer, Agent (French speaking)

Sierra · London
£150k–315k/yr
New On-site RAGTypeScriptGoOpenAI API

Engineering Manager, Agent Orchestration

Decagon · San Francisco
$280k–430k/yr
New On-site Manager Tool useOrchestrationObservabilityLatency

ML Platform Engineer

Synthesia · Europe
New Remote OrchestrationObservabilityGPUKubernetes

TL, Research Inference

OpenAI · San Francisco
$380k–555k/yr
New On-site ObservabilityGPULatencyThroughput

Member of Technical Staff, Model Efficiency

Cohere · New York
New Remote Staff vLLMGPULatencyThroughput

Applied AI/ML Scientist

Cerebras · UAE
New On-site GPURLHFDPOPyTorch

Research Engineer, Performance RL (Reinforcement Learning)

Anthropic · San Francisco, CA
New On-site TritonPyTorchAnthropic APIGPU

Research Engineer/Research Scientist - Personal AGI, North Stars

OpenAI · San Francisco
$295k–555k/yr
New Hybrid OpenAI APIEval harnessesLLM-as-judgeObservability

Senior Performance Engineer, Inference

Cerebras · Headquarters/Sunnyvale Office
New On-site Senior vLLMTool useGPUTriton

Senior Forward Deployed Engineer (AI Agent)

Cresta · Canada (Remote)
New Remote Senior RAGVertex AICI/CDAWS

Software Engineer, Applied ML (Discovery, Recommendation & Search)

Character.AI · Redwood City, CA
$200k–300k/yr
New Hybrid GPUPyTorchTensorFlowCI/CD

Senior Software Engineer, Backend (AI Agent)

Cresta · United States (Remote)
New Remote Senior KubernetesDockerVertex AIAWS

VP of Product, Research and Training Infrastructure

CoreWeave · Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA
New On-site Manager OrchestrationGPURLHFKubernetes

Senior Machine Learning Engineer, Voice AI

Together AI · San Francisco
New On-site Senior vLLMGPUTensorRTLatency

Forward Deployed Engineer

Lovable · London
New On-site AWSGCPTypeScriptRust

Software engineer, connectors & MCP

Writer · San Francisco, CA
$155k–304k/yr
New Hybrid OrchestrationMCPObservabilityLatency

Offensive Security Engineer, Agent Products

OpenAI · San Francisco
$278k–490k/yr
New On-site Tool useKubernetesCI/CDAzure

Agent Engineer - NY

Vapi · New York
$160k–180k/yr
New Hybrid OrchestrationGoOpenAI APIPython

Senior Software Engineer, AI Model Lifecycle

Crusoe · San Francisco, CA - US
New On-site Senior vLLMOrchestrationGPULoRA

Senior Search Applications Performance Engineer

Cohere · Toronto
New Remote Senior vLLMWeaviateGPUTriton

Staff Machine Learning Engineer, Voice AI

Together AI · San Francisco
New On-site Staff vLLMGPUTensorRTLatency

Frontier Agents Engineer

Scale AI · San Francisco, CA; New York, NY
New On-site OpenAI APILangChainLlamaIndexRAG

Senior Research Engineer, Voice + Speech

Decagon · New York City
$200k–400k/yr
New On-site Senior OrchestrationPythonDeep learningNLP

Applied AI, Use-case, Software Engineer (Harness)

Mistral AI · Paris
New Hybrid OrchestrationKubernetesTool useMulti-agent

Software Engineer, Inference

Pika · Palo Alto HQ
$185k–250k/yr
New On-site GPUQuantizationThroughputDeep learning

Member of Technical Staff (Software Engineer, API Platform)

Perplexity · San Francisco
$220k–405k/yr
New On-site Staff OrchestrationLatencyThroughputKubernetes

Senior Software Engineer, Backend (AI Agent Integrations)

Cresta · Canada (Remote)
New Remote Senior OrchestrationObservabilityLatencyKubernetes

Agent Strategist - NYC

Vapi · New York
$130k–160k/yr
New Hybrid OpenAI APIAnthropic APIPythonTypeScript

Applied AI Engineer - AI Solutions

Snorkel AI · New York City, NY (Hybrid); Redwood City, CA (Hybrid); San Francisco, CA (Hybrid); United States (Remote)
New Hybrid LlamaIndexHugging FaceRAGWeaviate

Software Engineer, Gen AI Platform

Abridge · SF Office
$221k–300k/yr
New Hybrid LangChainLlamaIndexTool useOrchestration

Member of Technical Staff - Sandbox Service

xAI · London, UK
New On-site Staff PythonGoRustC++

Software engineer, generative AI (UK)

Writer · London, UK
New Hybrid PineconeWeaviatepgvectorLatency

Senior Software Engineer II, Applied Training

CoreWeave · Sunnyvale, CA / Bellevue, WA
New On-site Senior OrchestrationPyTorchKubernetesRay

Software Engineer, Frontier AI Infrastructure

Scale AI · San Francisco, CA; St. Louis, MO; New York, NY; Washington, DC
New On-site OrchestrationLatencyKubernetesDocker

AI Deployment Strategist, Cybersecurity - EMEA

Mistral AI · Paris
New On-site PythonGoGPUKubernetes

Software Engineer, New Grad

Mistral AI · Paris
New On-site Junior ObservabilityKubernetesDockerCI/CD

Senior Software Engineer, Agent Infrastructure

Cohere · Toronto
New Remote Senior Multi-agentOrchestrationKubernetesDocker

Senior Software Engineer I, Inference

CoreWeave · Sunnyvale, CA / Bellevue, WA
New On-site Senior vLLMOrchestrationObservabilityGPU

Software Engineer - Enterprise Platform

Baseten · San Francisco
$165k–330k/yr
New Hybrid OrchestrationObservabilityKubernetesDocker

Senior Software Engineer, Security Agents

Cohere · Toronto
New Remote Senior OrchestrationPythonMulti-agentTool use

Performance Engineer

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site GPULatencyThroughputQuantization

Applied AI Architect, Partnerships

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site AWSGCPAnthropic APIPython

STEM Fellow - Human Frontier Collective (US)

Scale AI · United States
New On-site PythonDeep learningNLPComputer vision

Engineering Manager, Runtime Fabric

Baseten · San Francisco
$165k–330k/yr
New Hybrid Manager GPUGoC++Kubernetes

Technical Program Manager, Cloud Inference

Anthropic · San Francisco, CA | New York City, NY
New On-site Manager Vertex AICI/CDAzureGo

Forward Deployed Engineer - UAE

OpenAI · Abu Dhabi, UAE
New Hybrid PythonJavaScriptGoOpenAI API

Member of Technical Staff - Recommendation Systems

xAI · Palo Alto, CA
New On-site Staff PyTorchDeep learningComputer visionGPU

Senior Forward Deployed Engineer (AI Agent)

Cresta · Australia (Remote)
New Remote Senior RAGVertex AICI/CDAWS

Applied AI Architect

Anthropic · Tokyo, Japan
New On-site PythonAnthropic APIOpenAI APITypeScript

Senior Research Engineer

Decagon · London
£200k–300k/yr
New On-site Senior OrchestrationPythonOpenAI APILangChain

AI Scientist - Palo Alto

Mistral AI · Palo Alto
New Hybrid PyTorchNLPKubernetesRay

Solution Architect (AI/LLM Inference)

Baseten · San Francisco
$165k–330k/yr
New Hybrid vLLMEmbeddingsGPULatency

Engineering Manager, Distillation & Detection Platform

OpenAI · San Francisco
$293k–385k/yr
New On-site Manager ObservabilityDistillationLLM-as-judgeLatency

Forward Deployed Engineer - Systems

Modal · Stockholm
New On-site OrchestrationGPULatencyKubernetes

Software Engineer, Human Data Interface

Anthropic · San Francisco, CA | New York City, NY
New On-site Anthropic APIGPUPythonTypeScript

Cloud Platform Engineer

Baseten · San Francisco Office
$165k–330k/yr
New Hybrid ObservabilityKubernetesCI/CDGPU

Staff Research Engineer

Decagon · San Francisco
$200k–400k/yr
New Hybrid Staff OrchestrationPythonMulti-agentEval harnesses

Frontier Agent Engineering Manager, Enterprise

Scale AI · San Francisco, CA; New York, NY
New On-site Manager OpenAI APILangChainLlamaIndexRAG

Member of Technical Staff, Training Performance Engineer

Cohere · London
New Hybrid Staff TritonThroughputPyTorchPython

Member of Technical Staff, Product

Vapi · San Francisco
$180k–280k/yr
New Hybrid Staff ObservabilityThroughputTypeScriptJavaScript

Forward Deployed Engineer, Agentic Platform

Cohere · United States
New Remote RAGOrchestrationLatencyPython

Defense / Edge Tech Lead

Deepgram · USA | Remote
$185k–245k/yr
New Remote Lead GPUTensorRTQuantizationLatency

Senior Staff Frontier Agents Engineer

Scale AI · San Francisco, CA; New York, NY
New On-site Staff OpenAI APILangChainLlamaIndexRAG

Advanced Technology: AI/ML Research Scientist

Cerebras · Headquarters/Sunnyvale Office
New On-site ObservabilityGPULatencyThroughput

Software Engineer, Agent

Sierra · Tokyo
$22000k–47000k/yr
New On-site RAGTypeScriptGoOpenAI API

ML Infrastructure Engineer, Safeguards

Anthropic · San Francisco, CA
New On-site OrchestrationObservabilityLatencyThroughput

Engineering Manager, Core Services

Anysphere (Cursor) · San Francisco
New On-site Manager ObservabilityThroughputLatency

Engineering Manager - Model Performance

Baseten · San Francisco
$260k–380k/yr
New Hybrid Manager EmbeddingsOrchestrationGPUTensorRT

Senior AI Support Engineer (US)

Dust · San Francisco
$150k–200k/yr
New On-site Senior MCPTool useOrchestrationMulti-agent

Forward Deployed Engineer, Sovereign AI

Cohere · Ottawa
New Hybrid RAGPythonOpenAI APIAWS

Backend Engineer - API

xAI · London, UK
New On-site vLLMOrchestrationObservabilityTensorRT

Software Engineer - Model Products

Baseten · San Francisco
$180k–360k/yr
New Hybrid vLLMObservabilityGPUTensorRT

Research Engineer - Speech & Realtime Models

OpenAI · San Francisco
$295k–445k/yr
New Hybrid DistillationPyTorchTensorFlowDeep learning

Forward Deployed Research Scientist

Labelbox · San Francisco Bay Area
New On-site Eval harnessesRLHFDPONLP

Software Engineer, Safeguards Evals

Anthropic · San Francisco, CA | New York City, NY
New On-site Tool usePythonAnthropic APIRAG

Research Engineer, Pretraining Scaling - London

Anthropic · London, UK
New On-site ObservabilityPyTorchAnthropic APIDeep learning

Member of Technical Staff - X Search

xAI · Palo Alto, CA
New On-site Staff PythonGoRustGPU

Research Engineer, Core ML

Together AI · San Francisco
New On-site vLLMGPUTensorRTQuantization

Software Engineer - Model Performance

Baseten · San Francisco
$180k–360k/yr
New Hybrid vLLMEmbeddingsGPUTensorRT

Research Engineer/Research Scientist, Pre-training

Anthropic · Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
New Remote ThroughputPyTorchDeep learningKubernetes

AI Deployment Engineer, Public Sector - Federal Civilian

OpenAI · Washington, DC
$137k–250k/yr
New Hybrid OpenAI APIThroughputPythonJavaScript

Member of Technical Staff, QA

Vapi · San Francisco
$180k–280k/yr
New Hybrid Staff ThroughputTypeScriptJavaScriptPython

Software Engineer, Agents

Harvey · San Francisco
$161k–242k/yr
New Hybrid ObservabilityLatencyPythonRAG

Applied AI Architect, Government Technology

Anthropic · Washington, DC
New On-site PythonAnthropic APIOpenAI APILLM-as-judge

Machine Learning Intern/Co-op (Fall, 2026)

Cohere · Canada
New Remote Intern TensorFlowNLPPythonDeep learning

Staff Python / PyTorch Developer — Frontend Inference Compiler – Dubai

Cerebras · UAE
New On-site Staff vLLMObservabilityGPUTensorRT

Staff Software Engineer, Agent Orchestration

Decagon · San Francisco
$200k–400k/yr
New Hybrid Staff OrchestrationObservabilityLatencyTool use

Engineering Manager (TLM, Agents)

Perplexity · San Francisco
$300k–405k/yr
New On-site Manager PythonTypeScriptGoRust

Research Engineering Lead

Lovable · Stockholm
New On-site Lead ObservabilityDistillationCI/CDAWS

Software Engineer - Internal Tools

xAI · Palo Alto, CA
New On-site OrchestrationEval harnessesObservabilityLatency

Member of Technical Staff - Ads

xAI · Palo Alto, CA
New On-site Staff ThroughputLatencyDeep learningNLP

Applied AI Engineer, Senior/Staff Devops/SRE

Mistral AI · Singapore
New On-site Staff GPUPyTorchTensorFlowKubernetes

Staff Software Engineer, Full Stack - NYC

Harvey · New York
$260k–310k/yr
New Hybrid Staff GPURAGPythonComputer vision

Research Intern RL & Post-Training Systems, Turbo (Fall 2026)

Together AI · San Francisco
New On-site Intern LatencyThroughputRLHFDPO

Senior Staff Software Engineer, Managed Orchestration

Crusoe · San Francisco, CA - US
New On-site Staff OrchestrationKubernetesCI/CDGCP

Developer Productivity

LangChain · San Francisco, CA
New On-site LangChainLangSmithObservabilityLatency

Forward Deployed Engineer, Agentic Platform (Korea)

Cohere · Korea
New Remote RAGTool useOrchestrationPython

Member of Technical Staff - ML Performance

Modal · New York
$200k–350k/yr
New On-site Staff vLLMGPUTensorRTLatency

Software Engineer - AI Enablement

Baseten · San Francisco
$165k–330k/yr
New Hybrid GoLLM-as-judgeEval harnessesOrchestration

Tech Lead Manager- MLRE, ML Systems

Scale AI · San Francisco, CA; New York, NY
New On-site Manager Tool useRLHFPyTorchDeep learning

Forward Deployed Engineer, Infrastructure Specialist (Public Sector)

Cohere · Ottawa
New Hybrid KubernetesCI/CDAWSAzure

Software Engineer, Ray Serve

Anyscale · Bengaluru, Karnataka
New On-site OrchestrationObservabilityLatencyPyTorch

Member of Technical Staff, Evals & Post-Training Product

Fireworks AI · San Mateo
New On-site Staff GPULLM-as-judgeEval harnessesLatency

Senior Software Engineer, Backend/Infra

Pika · Palo Alto HQ
$185k–300k/yr
New On-site Senior LangChainMulti-agentOrchestrationLatency

Full Stack Software Engineer, Agent Enablement

OpenAI · San Francisco
$255k–405k/yr
New Hybrid OpenAI APITool useMulti-agentOrchestration

Senior Platform Engineer, Ingestion

LangChain · Remote - Europe
New Remote Senior LangChainLangSmithObservabilityThroughput

Staff Software Engineer, Applied Training

CoreWeave · Sunnyvale, CA / Bellevue, WA
New On-site Staff OrchestrationPyTorchKubernetesRay

Senior+ Software Engineer - Research Platform, Consumer Devices

OpenAI · San Francisco
$293k–325k/yr
New Hybrid Senior ObservabilityGPURAGLatency

Forward Deployed Engineer, Agentic Platform (West Coast)

Cohere · San Francisco
New Remote RAGOrchestrationLatencyPython

ChatGPT Performance Engineer

OpenAI · San Francisco
$325k–405k/yr
New On-site OpenAI APIObservabilityGPULatency

Software Engineer, Agent

Sierra · Paris
€125k–265k/yr
New On-site RAGTypeScriptGoOpenAI API

Machine Learning Engineer

Together AI · San Francisco
New On-site vLLMPythonGoRust

Member of Technical Staff - Mid-training

xAI · Palo Alto, CA
New On-site Staff DockerRayDeep learningComputer vision

Customer Support Engineer (Inference)

Together AI · San Francisco, CA
New On-site GPUKubernetesPythonTypeScript

Research Engineer, Post-Training (All Industry Levels)

Character.AI · Redwood City or New York City
$225k–400k/yr
New Hybrid OrchestrationKubernetesDockerDeep learning

Engineering Manager, Model Inference

Abridge · SF Office
$220k–270k/yr
New Hybrid Manager vLLMOrchestrationObservabilityGPU

Senior Director, AI Model Lifecycle

Crusoe · San Francisco, CA - US
New On-site Manager vLLMOrchestrationGPULoRA

Senior Software Engineer, Full Stack - NYC

Harvey · New York
$200k–260k/yr
New Hybrid Senior GPUPythonTypeScriptJavaScript

Infrastructure Software Engineer, Enterprise GenAI

Scale AI · San Francisco, CA; New York, NY
New On-site KubernetesAWSAzureGCP

AI Deployment Strategist - Sweden

Mistral AI · Stockholm
New On-site PythonJavaScriptGoGPU

LLM Inference Frameworks and Optimization Engineer

Together AI · San Francisco, Singapore, Amsterdam
New On-site vLLMOrchestrationGPUTriton

Full Stack LLM Engineer

Cerebras · Toronto Office
New On-site GPUPyTorchTensorFlowDeep learning

Software Engineer, Product

Sierra · San Francisco, CA
$230k–390k/yr
New On-site LatencyTypeScriptGoOpenAI API

AI Deployment Engineer

OpenAI · New York City
$197k–278k/yr
New Hybrid PythonJavaScriptOpenAI APITypeScript

Tech Lead Manager, Agentic Runtime

Glean · San Francisco, CA
New On-site Manager OrchestrationObservabilityLatencyKubernetes

Engineering Manager, GPU (ML Accelerator)

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New On-site Manager GPUDeep learningPyTorchNLP

Senior ML Systems Engineer, Frameworks & Tooling

Cohere · London
New Remote Senior vLLMOrchestrationTensorRTThroughput

Software Engineer, Developer Productivity

OpenAI · San Francisco
$210k–490k/yr
New On-site EmbeddingsGPUKubernetesPython

Strategist, Agent Development

Sierra · London
£125k–245k/yr
New On-site LLM-as-judgeRAGTool useMulti-agent

Senior/Staff Applied Scientist/Research Engineer, EMEA

Mistral AI · Paris
New Hybrid Staff RAGPyTorchPythonDeep learning

Member of Technical Staff, Pre-Training Data

Cohere · Toronto
New Remote Staff ThroughputPythonDeep learningNLP

Staff Software Engineer, Inference

Anthropic · London, UK
New On-site Staff OrchestrationObservabilityKubernetesAWS

Staff Software Engineer, Inference

Anthropic · Dublin, IE
New On-site Staff OrchestrationObservabilityKubernetesAWS

Software Engineer, Robotics

Scale AI · Mexico City, MX
New On-site OrchestrationComputer visionKubernetesDocker

Member of Technical Staff, MLE (UK/EU)

Cohere · London
New Hybrid Staff TensorFlowPythonDeep learningGPU

Principal Research Engineer, Post-Training

Character.AI · Redwood City, CA
$275k–400k/yr
New On-site Principal OrchestrationObservabilityGPUKubernetes

Distributed LLM Inference Engineer

Anyscale · San Francisco
New Hybrid vLLMTritonTensorRTLatency

Machine Learning Infrastructure Engineer- Model Inference

Abridge · SF Office
$221k–260k/yr
New Hybrid vLLMOrchestrationGPUTriton

Solutions Engineer (Dublin)

Crusoe · Dublin - IE
New On-site OrchestrationGPUKubernetesDocker

Senior Research Engineer - Voice

Synthesia · Europe
New Remote Senior QuantizationLatencyDPODistillation

Member of Technical Staff - Imagine Safety

xAI · Palo Alto, CA
New On-site Staff ObservabilityLatencyThroughputKubernetes

Research, Post-Training

Cognition · San Francisco
New On-site RLHFGoDeep learningPython

Staff Software Engineer, Backend

Harvey · San Francisco
$231k–340k/yr
New Hybrid Staff GPURAGLatencyThroughput

Regional Director, Forward Deployed Engineering

Anysphere (Cursor) · San Francisco
New On-site Manager PythonTypeScriptJavaScriptGo

Software Engineer, APIs & Context Platform

Glean · Mountain View, CA
New On-site OrchestrationMCPLatencyPython

Member of Technical Staff

Fireworks AI · New York
New On-site Staff LatencyKubernetesDockerRay

Member of Technical Staff - Reasoning

xAI · London, UK
New On-site Staff EmbeddingsRerankingTool useMulti-agent

Member of Technical Staff, Training Infra Engineer

Cohere · Paris
New Remote Staff PyTorchKubernetesRayPython

Forward Deployed Engineer

Anysphere (Cursor) · San Francisco
New On-site LatencyPythonTypeScriptJavaScript

Inference Engineer, Robotics

OpenAI · San Francisco
$380k/yr
New Hybrid GPUThroughputLatency

Researcher, Alignment Training

OpenAI · San Francisco
$250k–445k/yr
New On-site Eval harnessesLLM-as-judgeObservabilityGPU

Senior Software Engineer, Backend (AI Agent Runtime)

Cresta · Canada (Remote)
New Remote Senior OrchestrationObservabilityLatencyKubernetes

QA Lead (ML Integration and Quality)

Cerebras · India Office
New On-site Lead GPUKubernetesPythonGo

Forward Deployed Engineer, Agentic Platform

Cohere · Middle East
New Hybrid RAGOrchestrationLatencyPython

Senior Staff Software Engineer, Backend

Harvey · Bengaluru
New Hybrid Staff LatencyAWSAzureGCP

Manager of Solutions Architecture, Applied AI (Enterprise Tech)

Anthropic · San Francisco, CA | New York City, NY
New On-site Manager GoAnthropic APIOpenAI APIPython

Software Engineer, RL Data

Anthropic · London, UK; San Francisco, CA | New York City, NY
New On-site MCPKubernetesDockerPython

Staff Software Engineer, AI Platform

Harvey · San Francisco
$231k–340k/yr
New Hybrid Staff RAGEmbeddingsRerankingOrchestration

AI Scientist - Paris/London - Onsite or Hybrid or Remote

Mistral AI · Paris
New Hybrid PyTorchKubernetesRayPython

Trust Engineer

Harvey · San Francisco
$220k–330k/yr
New Hybrid AWSAzureGCPPython

Senior Software Engineer, Agents

Harvey · San Francisco
$193k–290k/yr
New Hybrid Senior ObservabilityLatencyPythonRAG

Software Engineer, Backend (New-York)

Mistral AI · New York
New Hybrid ObservabilityKubernetesDockerCI/CD

Support Engineer

Sierra · San Francisco, CA
$250k–275k/yr
New On-site GoOpenAI APIAnthropic APIPython

Software Engineer, Inference – AMD GPU Enablement

OpenAI · San Francisco
$295k–555k/yr
New On-site vLLMOrchestrationGPUTriton

Senior Software Engineer - Together Cloud Infrastructure

Together AI · San Francisco
New On-site Senior ObservabilityGPUKubernetesCI/CD

Staff Software Engineer, Infrastructure

Decagon · San Francisco
$200k–400k/yr
New Hybrid Staff ObservabilityGPULatencyKubernetes

Software Engineer, Research - Human Data

OpenAI · San Francisco
$230k–385k/yr
New On-site OpenAI APIRAGLLM-as-judgeEval harnesses

Applied AI, Evaluation Engineer

Mistral AI · Paris
New On-site PyTorchPythonEval harnessesLLM-as-judge

Member of Technical Staff, Agent Code

Cohere · London
New Hybrid Staff PyTorchTensorFlowPythonDeep learning

Staff Software Engineer, Agents

Harvey · San Francisco
$231k–340k/yr
New Hybrid Staff ObservabilityLatencyPythonRAG

Research Engineer, Pretraining Scaling

Anthropic · San Francisco, CA
New On-site ObservabilityPyTorchAnthropic APIDeep learning

Staff Research Engineer, Discovery Team

Anthropic · San Francisco, CA
New On-site Staff KubernetesDockerAnthropic APIGPU

Member of Technical Staff (AI Inference Engineer)

Perplexity · San Francisco
$220k–485k/yr
New On-site Staff OrchestrationObservabilityGPUTriton