Overview
Learn advanced techniques for deploying generative AI models on edge devices in this technical talk from Chen Lai, a lead on Meta's PyTorch Edge team. Explore how ExecuTorch addresses edge deployment challenges, including memory optimization and hardware compatibility across diverse platforms. Dive into technical collaborations with Apple, Arm, Qualcomm, and MediaTek that enable deployment of large language models such as Llama on mobile devices. Follow the process of converting PyTorch models into optimized executable programs within the ExecuTorch ecosystem, including key components such as torch.export for compute graph capture and torchao for quantization. Understand how torchchat enables large language model inference across a range of devices while maintaining compatibility with Hugging Face models. Gain insight into Meta's commitment to advancing edge computing through community-driven innovation and cross-industry collaboration.
Syllabus
GenAI Deployment EXPERT Shares Top Techniques
Taught by
tinyML