Staff Backend Engineer, Dubbing
Synthesia · Europe
Synthesia is the world’s leading AI video platform for business, used by over 90% of the Fortune 100. Founded in 2017, the company is headquartered in London, with offices and teams across Europe and the US. As AI continues to shape the way we live and work, Synthesia develops products to enhance visual communication and enterprise skill development, helping people work better and stay at the center of successful organizations. Following our recent Series E funding round, where we raised $200 million, our valuation stands at $4 billion. Our total funding exceeds $530 million from premier investors including Accel, NVentures (Nvidia's VC arm), Kleiner Perkins, GV, and Evantic Capital, alongside the founders and operators of Stripe, Datadog, Miro, and Webflow. ABOUT THE ROLE You will work on the engineering systems powering Synthesia's dubbing product, the multi-step pipeline that transforms existing videos into new-language versions while preserving lip sync, voice quality, timing, and overall video integrity. Your role centers on the core challenge: building a production system that orchestrates complex, long-running jobs (often taking tens of minutes to hours) with reliability, observability, and quality at every stage. You'll ensure that localized videos are indistinguishable from originals, working across transcription, speaker identification, translation, voice synthesis, and video rendering. You will be responsible for designing and evolving systems that handle: - End-to-end pipeline orchestration for long-running, multi-stage jobs - Quality layers across transcription accuracy, speaker diarization, lip-sync rendering, translation, voice cloning, and TTS - Integration of ML-driven components (providers and open-source models) into production workflows - Video and audio complexity (normalization, chunking, encoding, vocal separation, retiming) - Evaluation frameworks that prove measurable improvements in output quality You will own projects