Fine-tuning engineer jobs

Model-adaptation roles — LoRA/PEFT, RLHF/DPO, distillation.

123 open roles

Member of Technical Staff - Imagine Model

xAI · Palo Alto, CA; Seattle, WA
New · On-site · Staff · Distillation, PyTorch, Ray, Python

Staff Software Engineer, Full-Stack - Enterprise Gen AI

Scale AI · New York, NY; San Francisco, CA
New · On-site · Staff · AWS, Azure, GCP, TypeScript

Research Engineer, Code RL (Reinforcement Learning)

Anthropic · San Francisco, CA | New York City, NY
New · On-site · Eval harnesses, GPU, RLHF, PyTorch

Member of Engineering (Reinforcement Learning Infrastructure)

Poolside · Remote (EMEA/East Coast)
New · Remote · vLLM, Observability, Throughput, PyTorch

Anthropic Fellows Program, Reinforcement Learning

Anthropic · London, UK; Ontario, CAN; Remote-Friendly, United States; San Francisco, CA
New · Remote · Python, Go, Anthropic API, RLHF

Frontier Agents Intern (Fall 2026)

Together AI · San Francisco
New · On-site · Intern · PyTorch, Deep learning, NLP, Python

ML Research Engineer, ML Systems

Scale AI · San Francisco, CA; Seattle, WA; New York, NY
New · On-site · Tool use, RLHF, PyTorch, GPU

AI Deployment Engineer - Startups

OpenAI · London, UK
New · On-site · OpenAI API, Tool use, Multi-agent, Orchestration

Research Engineer / Research Scientist - Personal AGI (Post Training)

OpenAI · San Francisco
$295k–555k/yr
New · On-site · OpenAI API, Python, Deep learning, RLHF

Applied Research Intern

Labelbox · San Francisco Bay Area
New · On-site · Intern · RLHF, DPO, PyTorch, TensorFlow

Research Engineer, AI Safety & Alignment

Character.AI · Redwood City, CA
$225k–400k/yr
New · Hybrid · Orchestration, RLHF, Kubernetes, Docker

Open-Source Software, Machine Learning Engineer

Mistral AI · Paris
New · Hybrid · vLLM, Hugging Face, Distillation, PyTorch

Research Scientist, Agent Robustness

Scale AI · San Francisco, CA; New York, NY
New · On-site · RLHF, DPO, Eval harnesses, GPU

Senior/Staff Applied AI, Machine Learning Engineer

Mistral AI · Seoul
New · On-site · Staff · RAG, LLM-as-judge, Tool use, Multi-agent

Research Engineer, Production Model Post-Training

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New · On-site · RLHF, Deep learning, Python, Anthropic API

Research Engineer/Scientist - Human Alignment, Consumer Devices

OpenAI · San Francisco
$380k–445k/yr
New · Hybrid · RLHF, OpenAI API, Embeddings, Reranking

Applied Machine Learning Engineer

Fireworks AI · San Mateo
New · On-site · RLHF, Python, Deep learning, GPU

Researcher, Post Training

Lovable · Stockholm
New · On-site · Orchestration, GPU, Latency, PyTorch

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Scale AI · San Francisco, CA; New York, NY
New · On-site · Multi-agent, GPU, RLHF, PyTorch

Senior Staff Software Engineer, AI Model Lifecycle

Crusoe · San Francisco, CA - US
New · On-site · Staff · vLLM, Orchestration, GPU, LoRA

Research Engineer, Production Model Post-Training

Anthropic · Zürich, CH
New · On-site · RLHF, Deep learning, Python, Anthropic API

Applied Machine Learning Research Scientist

Cerebras · US and Canada Offices
New · On-site · GPU, RLHF, PyTorch, Deep learning

Staff Research Engineer - Video Post Training

Synthesia · Europe
New · Remote · Staff · Quantization, DPO, Distillation, Deep learning

Research Engineer, Codex

OpenAI · San Francisco
$295k–445k/yr
New · On-site · Tool use, Multi-agent, Observability, Latency

Researcher, Synthetic RL

OpenAI · San Francisco
$295k–445k/yr
New · On-site · Deep learning, RLHF

[Expression of Interest] Research Scientist / Engineer, Honesty

Anthropic · New York City, NY; San Francisco, CA
New · On-site · RAG, RLHF, Python, Anthropic API

Member of Technical Staff (AI Researcher)

Perplexity · San Francisco
$220k–485k/yr
New · On-site · Staff · DPO, PyTorch, Deep learning, Python

Staff+ Software Engineer, Inference Runtime

Anthropic · Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
New · Remote · Staff · GPU, Triton, Latency, Throughput

Research Engineer, Applied AI Engineering

OpenAI · San Francisco
$250k–555k/yr
New · On-site · Distillation, PyTorch, TensorFlow, Deep learning

Senior Software Engineer, Observability Insights

CoreWeave · New York, NY / Sunnyvale, CA
New · On-site · Senior · LangChain, MCP, Observability, Kubernetes

Software Engineer, ML Research

Anysphere (Cursor) · San Francisco
New · On-site · GPU, Python, Deep learning, RLHF

Staff Software Engineer, AI Model Lifecycle

Crusoe · San Francisco, CA - US
New · On-site · Staff · vLLM, Orchestration, GPU, LoRA

Member of Technical Staff - Inference

xAI · Palo Alto, CA
New · On-site · Staff · vLLM, Orchestration, GPU, Triton

Member of Technical Staff, Integration/RL Team (Research Engineer)

Cohere · Paris
New · Remote · Staff · PyTorch, Kubernetes, Ray, Python

Machine Learning Engineer, Integrity

OpenAI · San Francisco
$266k–555k/yr
New · On-site · Distillation, PyTorch, TensorFlow, Deep learning

Multimodal LLM Researcher (MLLM)

Pika · Palo Alto HQ
$185k–400k/yr
New · On-site · Orchestration, Distillation, PyTorch, TensorFlow

AI Field Engineer - AI Natives

Fireworks AI · San Mateo
New · On-site · vLLM, GPU, TensorRT, Quantization

Post-Training Research Engineer

Baseten · San Francisco
$200k–275k/yr
New · Hybrid · GPU, Distillation, PyTorch, TensorFlow

Senior Software Engineer, GenAI

Scale AI · San Francisco, CA; New York, NY
New · On-site · Senior · Latency, RLHF, Python, TypeScript

Staff AI research scientist

Writer · San Francisco, CA
$234k–296k/yr
New · Hybrid · Staff · Tool use, LLM-as-judge, RLHF, DPO

Senior/Staff Machine Learning Research Engineer, General Agents, Enterprise GenAI

Scale AI · San Francisco, CA; New York, NY
New · On-site · Staff · Tool use, Multi-agent, Orchestration, Observability

Principal Research Engineer

Synthesia · Europe
New · Remote · Principal · RLHF, DPO, Deep learning, GPU

Software engineer, agents (UK)

Writer · London, UK
New · Hybrid · Orchestration, Observability, Throughput, Kubernetes

Full-Stack Engineer, AI Data Platform

Labelbox · San Francisco Bay Area
New · On-site · Throughput, RLHF, Kubernetes, AWS

Software Engineer - Training Product

Baseten · San Francisco
$165k–330k/yr
New · Hybrid · vLLM, GPU, LoRA, PyTorch

Applied AI Architect, Applied AI (Digital Natives Business)

Anthropic · Munich, Germany
New · On-site · Python, Anthropic API, OpenAI API, RAG

Agent Post-Training Research

OpenAI · San Francisco
$295k–445k/yr
New · On-site · Tool use, Multi-agent, Observability, Latency

AI Researcher, Core ML (Turbo)

Together AI · San Francisco
New · On-site · vLLM, GPU, TensorRT, Quantization

Software Engineer, Model Performance Systems

Baseten · San Francisco
$160k–200k/yr
New · Hybrid · Orchestration, Observability, GPU, Quantization

Machine Learning Research Engineer, Agents - Enterprise GenAI

Scale AI · San Francisco, CA; New York, NY
New · On-site · Multi-agent, RLHF, Deep learning, NLP

Research Staff, LLMs

Deepgram · USA | Remote
$150k–250k/yr
New · Remote · Staff · Latency, RLHF, PyTorch, Deep learning

Forward Deployed Engineer, Agentic Platform (Singapore)

Cohere · Singapore
New · Remote · RAG, Orchestration, Latency, Python

Research Engineer - Environments, Data and Post-Training

Mercor · San Francisco
$130k–500k/yr
New · On-site · Tool use, Deep learning, NLP, Python

Senior Software Engineer - AI / ML

Snorkel AI · Redwood City, CA (Hybrid); San Francisco, CA (Hybrid)
New · Hybrid · Senior · Multi-agent, Orchestration, AWS, Azure

GenAI Strategic Projects Lead, Public Sector

Scale AI · Washington, DC
New · On-site · Lead · RLHF, RAG, Embeddings, Reranking

Research Engineer, Interpretability

Anthropic · San Francisco, CA
New · On-site · GPU, Throughput, PyTorch, Python

GTM Systems Analyst

Scale AI · San Francisco, CA
New · On-site · MCP, OpenAI API, Anthropic API, RAG

Research Engineer / Research Scientist - Personal AGI, Proactivity

OpenAI · San Francisco
$295k–555k/yr
New · Hybrid · OpenAI API, Eval harnesses, LLM-as-judge, Observability

Forward Deployed Engineer - ML

Modal · New York
$180k–250k/yr
New · On-site · vLLM, GPU, Latency, RLHF

Research Engineer, Cybersecurity RL (Reinforcement Learning)

Anthropic · San Francisco, CA | New York City, NY
New · On-site · Anthropic API, Deep learning, RLHF, Python

Forward Deployed Engineer - ML

Modal · Stockholm
New · On-site · vLLM, GPU, Latency, RLHF

Research Engineer, RL Engineering

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New · On-site · RLHF, Python, Anthropic API, PyTorch

Researcher, Training - London

OpenAI · London, UK
£170k–445k/yr
New · On-site · Deep learning, OpenAI API, NLP, GPU

Manager, Forward Deployed Engineering

Snorkel AI · New York City, NY (Hybrid); Redwood City, CA (Hybrid); San Francisco, CA (Hybrid)
New · Hybrid · Manager · Orchestration, Throughput, Python, Observability

Research Engineer, Frontier Evals & Environments

OpenAI · San Francisco
$205k–380k/yr
New · On-site · Tool use, Multi-agent, RLHF, Go

Forward Deployed Engineer

Labelbox · San Francisco Bay Area
New · On-site · RLHF, Python, Computer vision, NLP

Member of Technical Staff, MLE [Singapore]

Cohere · Singapore
New · Remote · Staff · Python, Deep learning, NLP, RAG

AI Field Engineer - Enterprise

Fireworks AI · San Mateo
New · On-site · vLLM, GPU, TensorRT, Quantization

Member of Technical Staff, MLE

Cohere · San Francisco
New · Remote · Staff · Python, Deep learning, RAG, Embeddings

Member of Engineering (Reinforcement Learning)

Poolside · Remote (EMEA/East Coast)
New · Remote · PyTorch, Deep learning, Python, RLHF

Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI

Scale AI · San Francisco, CA; New York, NY
New · On-site · Deep learning, NLP, LLM-as-judge, Eval harnesses

Forward Deployed Engineering Manager

Labelbox · San Francisco Bay Area
New · On-site · Manager · LLM-as-judge, RLHF

Research Engineer, Machine Learning (Reinforcement Learning)

Anthropic · San Francisco, CA | New York City, NY
New · On-site · Tool use, GPU, PyTorch, TensorFlow

Member of Technical Staff, Backend

Vapi · San Francisco
$200k–280k/yr
New · Hybrid · Staff · Observability, Throughput, TypeScript, Python

Member of Technical Staff - Post-Training and RL

xAI · Palo Alto, CA
New · On-site · Staff · RLHF, DPO

Research Scientist, AI Controls and Monitoring

Scale AI · San Francisco, CA; New York, NY
New · On-site · Observability, RLHF, DPO, Latency

Member of Technical Staff, Document Understanding

LlamaIndex · San Francisco
$180k–250k/yr
New · Hybrid · Staff · LlamaIndex, vLLM, RAG, TensorRT

Member of Technical Staff - Voice Model

xAI · Palo Alto, CA
New · On-site · Staff · Latency, PyTorch, Kubernetes, Ray

Research Engineer, Knowledge Foundations

Anthropic · San Francisco, CA
New · On-site · Observability, Python, Anthropic API, OpenAI API

Applied AI, Senior/Staff Forward Deployed Machine Learning Engineer - Singapore

Mistral AI · Singapore
New · On-site · Staff · RAG, Tool use, GPU, PyTorch

Researcher, Agent Post-Training, API & Power-Users

OpenAI · San Francisco
$295k–445k/yr
New · Hybrid · Tool use, Multi-agent, Observability, Latency

Forward Deployed Engineer, GenAI

Scale AI · San Francisco, CA; New York, NY
New · On-site · RLHF, Eval harnesses, Observability

Engineering Manager, MLE

OpenAI · San Francisco
$293k–385k/yr
New · On-site · Manager · Distillation, PyTorch, TensorFlow, Deep learning

Applied AI/ML Scientist

Cerebras · UAE
New · On-site · GPU, RLHF, DPO, PyTorch

Research Engineer, Performance RL (Reinforcement Learning)

Anthropic · San Francisco, CA
New · On-site · Triton, PyTorch, Anthropic API, GPU

VP of Product, Research and Training Infrastructure

CoreWeave · Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA
New · On-site · Manager · Orchestration, GPU, RLHF, Kubernetes

Senior Software Engineer, AI Model Lifecycle

Crusoe · San Francisco, CA - US
New · On-site · Senior · vLLM, Orchestration, GPU, LoRA

Agent Strategist - NYC

Vapi · New York
$130k–160k/yr
New · Hybrid · OpenAI API, Anthropic API, Python, TypeScript

Applied AI Engineer - AI Solutions

Snorkel AI · New York City, NY (Hybrid); Redwood City, CA (Hybrid); San Francisco, CA (Hybrid); United States (Remote)
New · Hybrid · LlamaIndex, Hugging Face, RAG, Weaviate

Software Engineer, Gen AI Platform

Abridge · SF Office
$221k–300k/yr
New · Hybrid · LangChain, LlamaIndex, Tool use, Orchestration

Applied AI Architect, Partnerships

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
New · On-site · AWS, GCP, Anthropic API, Python

Applied Research Scientist, Agents

Labelbox · San Francisco Bay Area
New · On-site · Tool use, RLHF, PyTorch, TensorFlow

AI Deployment Engineer - Startups

OpenAI · Paris, France
New · Hybrid · OpenAI API, Tool use, Multi-agent, Orchestration

Engineering Manager, Distillation & Detection Platform

OpenAI · San Francisco
$293k–385k/yr
New · On-site · Manager · Observability, Distillation, LLM-as-judge, Latency

Staff Research Engineer

Decagon · San Francisco
$200k–400k/yr
New · Hybrid · Staff · Orchestration, Python, Multi-agent, Eval harnesses

Research Engineer, Multimodal

Character.AI · Redwood City, CA
$225k–400k/yr
New · Hybrid · Orchestration, LoRA, RLHF, PyTorch

Defense / Edge Tech Lead

Deepgram · USA | Remote
$185k–245k/yr
New · Remote · Lead · GPU, TensorRT, Quantization, Latency

Software Engineer, Agent

Sierra · Tokyo
$22000k–47000k/yr
New · On-site · RAG, TypeScript, Go, OpenAI API

Research Engineer - Speech & Realtime Models

OpenAI · San Francisco
$295k–445k/yr
New · Hybrid · Distillation, PyTorch, TensorFlow, Deep learning

Forward Deployed Research Scientist

Labelbox · San Francisco Bay Area
New · On-site · Eval harnesses, RLHF, DPO, NLP

Researcher, Agent Post-Training, Personality

OpenAI · San Francisco
$295k–445k/yr
New · On-site · Tool use, Multi-agent, RLHF, Eval harnesses

Research Engineer, Core ML

Together AI · San Francisco
New · On-site · vLLM, GPU, TensorRT, Quantization

Software Engineer - Model Performance

Baseten · San Francisco
$180k–360k/yr
New · Hybrid · vLLM, Embeddings, GPU, TensorRT

Director, Research - Human Data Systems

Snorkel AI · San Francisco, CA (Hybrid)
New · Hybrid · Manager · RLHF, LLM-as-judge, Multi-agent, Orchestration

Research Engineering Lead

Lovable · Stockholm
New · On-site · Lead · Observability, Distillation, CI/CD, AWS

Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI

Scale AI · San Francisco, CA; New York, NY
New · On-site · Staff · Multi-agent, RLHF, LLM-as-judge, Deep learning

Research Intern RL & Post-Training Systems, Turbo (Fall 2026)

Together AI · San Francisco
New · On-site · Intern · Latency, Throughput, RLHF, DPO

ML/Research Engineer, Safeguards

Anthropic · San Francisco, CA | New York City, NY
New · On-site · Python, Anthropic API, RLHF, Deep learning

Tech Lead Manager - MLRE, ML Systems

Scale AI · San Francisco, CA; New York, NY
New · On-site · Manager · Tool use, RLHF, PyTorch, Deep learning

Senior Software Engineer, Backend/Infra

Pika · Palo Alto HQ
$185k–300k/yr
New · On-site · Senior · LangChain, Multi-agent, Orchestration, Latency

Staff Software Engineer, Applied Training

CoreWeave · Sunnyvale, CA / Bellevue, WA
New · On-site · Staff · Orchestration, PyTorch, Kubernetes, Ray

Member of Technical Staff - Mid-training

xAI · Palo Alto, CA
New · On-site · Staff · Docker, Ray, Deep learning, Computer vision

Senior Director, AI Model Lifecycle

Crusoe · San Francisco, CA - US
New · On-site · Manager · vLLM, Orchestration, GPU, LoRA

Senior Software Engineer, Full Stack - NYC

Harvey · New York
$200k–260k/yr
New · Hybrid · Senior · GPU, Python, TypeScript, JavaScript

Research Lead, Training Insights

Anthropic · Remote-Friendly (Travel Required) | San Francisco, CA | New York City, NY
New · Remote · Lead · Tool use, Anthropic API, Eval harnesses, LLM-as-judge

Research Engineer

Cohere · Toronto
New · Remote · RLHF, PyTorch, Deep learning, NLP

Principal Research Engineer, Post-Training

Character.AI · Redwood City, CA
$275k–400k/yr
New · On-site · Principal · Orchestration, Observability, GPU, Kubernetes

Senior Research Engineer - Voice

Synthesia · Europe
New · Remote · Senior · Quantization, Latency, DPO, Distillation

Research, Post-Training

Cognition · San Francisco
New · On-site · RLHF, Go, Deep learning, Python

Regional Director, Forward Deployed Engineering

Anysphere (Cursor) · San Francisco
New · On-site · Manager · Python, TypeScript, JavaScript, Go

Member of Technical Staff - Reasoning

xAI · London, UK
New · On-site · Staff · Embeddings, Reranking, Tool use, Multi-agent

Software Engineer, RL Data

Anthropic · London, UK; San Francisco, CA | New York City, NY
New · On-site · MCP, Kubernetes, Docker, Python

Applied Research Engineer, Agents

Labelbox · San Francisco Bay Area
New · On-site · Tool use, RLHF, PyTorch, TensorFlow