Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Host Your Own Llama 3 Chatbot in 10 Minutes with Runpod and vLLM - Lecture 3

Data Centric via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to host a Llama 3 8B model chatbot in just 20 minutes using vLLM's inference server, Runpod GPUs, and Chainlit for the front end. Discover the process of hosting the Llama 3 model on Runpod and creating an efficient chatbot without relying on heavy frameworks. Follow along as the video guides you through building a Runpod template, deploying it, obtaining the endpoint, preparing the Python script, and finally launching the chatbot. Gain practical insights into AI engineering and model hosting, with additional resources provided for further learning and development.

Syllabus

Intro:
Build Runpod Template:
Deploy Runpod Template:
Getting the Endpoint:
Prepping the Python Script:
Launching the Chatbot:

Taught by

Data Centric

Reviews

Start your review of Host Your Own Llama 3 Chatbot in 10 Minutes with Runpod and vLLM - Lecture 3

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.