How DeepL Built a Translation Powerhouse with AI

Explore the journey of building an AI-powered translation company that successfully competes with tech giants in this 43-minute podcast episode featuring DeepL's CEO and founder Jarek Kutylowski. Discover how DeepL launched neural machine translation in 2017, built custom GPU data centers, and developed strategies for small teams to challenge established players like Google Translate. Learn about the technical challenges of AI translation, including why high-quality translations still require human context and how models are tailored for enterprise applications. Examine DeepL's approach to training on curated multilingual datasets, their methods for measuring translation quality, and their stance on avoiding customer-specific fine-tuning. Gain insights into the evolution of speech translation technology, the growing demand for translation services, and the return on investment of incremental quality improvements. Understand how DeepL handles hallucinations in translation models, supports smaller languages and language pairs, and manages large-scale inference infrastructure. Delve into the future role of human translators, the broader impact of AI on global communication, and how translation companies adapt to the evolving large language model landscape while meeting enterprise needs.

Syllabus

[00:00:00] Introducing Jarek and DeepL’s mission
[00:01:46] Competing with Google Translate & LLMs
[00:04:14] Pretraining vs. proprietary model strategy
[00:06:47] Building GPU data centers in 2017
[00:08:09] The value of curated bilingual and monolingual data
[00:09:30] How DeepL measures translation quality
[00:12:27] Personalization and enterprise-specific tuning
[00:14:04] Why translation demand is growing
[00:16:16] ROI of incremental quality gains
[00:18:20] The role of human translators in the future
[00:22:48] Hallucinations in translation models
[00:24:05] DeepL’s work on speech translation
[00:28:22] The broader impact of global communication
[00:30:32] Handling smaller languages and language pairs
[00:32:25] Multi-language model consolidation
[00:35:28] Engineering infrastructure for large-scale inference
[00:39:23] Adapting to evolving LLM landscape & enterprise needs