Overview
Learn to use Qwen 2.5 VL, an advanced open-source vision-language model (VLM), in this 33-minute video from Roboflow. Discover practical ways to integrate vision-language models into real-world scenarios, with a focus on building a nutrition label scanning application. Explore the model's state-of-the-art capabilities in image and scene comprehension while gaining hands-on experience with VLM technology. Learn techniques for extracting and processing visual information from nutrition labels, and see how the same principles apply to other vision-based applications.
Syllabus
How to use Qwen 2.5 VL | Read Nutrition Labels (and More!) with VLMs
Taught by
Roboflow