Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn to fine-tune the compact SmolVLM vision language model using Python and QLoRA techniques in this 17-minute tutorial video. Access free code examples and Jupyter notebooks that demonstrate how to optimize this small but powerful vision-language model on consumer-grade GPUs or Google Colab. Explore detailed explanations of the model's inner workings, implementation steps for visual question answering tasks, and integration with the Hugging Face ecosystem. Gain hands-on experience working with SmolVLM's instruction-tuned version while understanding the technical aspects of efficient model fine-tuning for computer vision applications.
Syllabus
CODE to Fine-Tune NEW SmolVLM on Consumer GPU w QLoRA
Taught by
Discover AI