Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Host Your Own Llama 3 Chatbot in 10 Minutes with Runpod and vLLM - Lecture 3

Data Centric via YouTube

Start learning Write review

Details

Start learning

Provider

YouTube
Pricing

Free Video
Languages

English
Effort

20 minutes
Sessions

Self-Paced
Level

Intermediate

Found in

Learn how to host a Llama 3 8B model chatbot in just 20 minutes using vLLM's inference server, Runpod GPUs, and Chainlit for the front end. Discover the process of hosting the Llama 3 model on Runpod and creating an efficient chatbot without relying on heavy frameworks. Follow along as the video guides you through building a Runpod template, deploying it, obtaining the endpoint, preparing the Python script, and finally launching the chatbot. Gain practical insights into AI engineering and model hosting, with additional resources provided for further learning and development.