Cerebras

19 open AI-engineering roles · late-stage · cerebras.net

Orchestration · Observability · GPU · Latency · Throughput · Kubernetes · CI/CD · Go · C++ · Python · RLHF · PyTorch · Deep learning · Docker

Staff Software Engineer, Inference Platform

Cerebras · Headquarters/Sunnyvale Office

New · On-site · Staff · Orchestration · Observability · GPU · Latency

Staff Software Engineer, Inference Cloud

Cerebras · Headquarters/Sunnyvale Office

New · On-site · Staff · Orchestration · Observability · GPU · Latency

Applied Machine Learning Research Scientist

Cerebras · US and Canada Offices

New · On-site · GPU · RLHF · PyTorch · Deep learning

Member of Technical Staff (Software Engineer)

Cerebras · Headquarters/Sunnyvale Office

New · On-site · Staff · Orchestration · GPU · Latency · Kubernetes

Lead Full Stack Machine Learning Engineer

Cerebras · India Office

New · On-site · Lead · GPU · PyTorch · TensorFlow · Deep learning

AI Engineer, Model Quality and Performance

Cerebras · Headquarters/Sunnyvale Office

New · On-site · GPU · Docker · Eval harnesses · LLM-as-judge

Principal Engineer, AI Inference Reliability

Cerebras · US and Canada Offices

New · On-site · Principal · Observability · GPU · Latency · Python

Software Engineer, Inference Platform

Cerebras · Headquarters/Sunnyvale Office

New · On-site · Orchestration · Observability · GPU · Latency

Principal Engineer, Inference Cloud

Cerebras · Headquarters/Sunnyvale Office

New · On-site · Principal · Orchestration · Observability · GPU · Latency

Staff Inference ML Runtime Engineer

Cerebras · US and Canada Offices

New · On-site · Staff · vLLM · Observability · GPU · TensorRT

Principal ML Investigator

Cerebras · Headquarters/Sunnyvale Office

New · On-site · Principal · GPU · Triton · Throughput · TensorRT

Sr. Member of Technical Staff

Cerebras · Headquarters/Sunnyvale Office

New · On-site · Staff · Orchestration · GPU · Latency · Kubernetes

ML Research Engineer (Inference)

Cerebras · India Office

New · On-site · vLLM · Hugging Face · GPU · Quantization

Applied AI/ML Scientist

Cerebras · UAE

New · On-site · GPU · RLHF · DPO · PyTorch

Senior Performance Engineer, Inference

Cerebras · Headquarters/Sunnyvale Office

New · On-site · Senior · vLLM · Tool use · GPU · Triton

Advanced Technology: AI/ML Research Scientist

Cerebras · Headquarters/Sunnyvale Office

New · On-site · Observability · GPU · Latency · Throughput

Staff Python / PyTorch Developer — Frontend Inference Compiler – Dubai

Cerebras · UAE

New · On-site · Staff · vLLM · Observability · GPU · TensorRT

Full Stack LLM Engineer

Cerebras · Toronto Office

New · On-site · GPU · PyTorch · TensorFlow · Deep learning

QA Lead (ML Integration and Quality)

Cerebras · India Office

New · On-site · Lead · GPU · Kubernetes · Python · Go