Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore the challenges and solutions for deploying Transformer networks at scale in this insightful talk from the Toronto Machine Learning Series. Delve into the world of Natural Language Processing (NLP) as Pieter Luitjens, CTO of Private AI, shares his expertise on optimizing Transformer networks for resource-constrained environments. Learn how to improve latency and throughput for processing large amounts of data, particularly in de-identification tasks. Discover strategies for addressing the unique challenges posed by the size of Transformer networks, including achieving acceptable latency and unit economics. Gain valuable insights on how to effectively process terabytes of data in a cost-effective manner, drawing from real-world experiences in deploying deep learning algorithms in multi-billion dollar industrial projects.