From Logs To Insights: Real-time Conversational Troubleshooting for Kubernetes With GenAI
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
This conference talk explores how to transform Kubernetes logs into actionable insights using generative AI for more efficient troubleshooting. Learn how to build an AI-driven observability solution that addresses the challenge of sifting through massive volumes of logs in distributed microservices environments. The presenters, Tiago Reichert and Lucas Duarte from AWS, demonstrate a step-by-step approach to implementing a conversational troubleshooting system using Large Language Models (LLMs). Discover how to configure Fluent Bit collectors to gather systemd logs, Kubernetes events, and application logs, stream them to scalable object storage, and construct a vector database that enables natural language interaction with log data. Gain practical knowledge to implement GenAI observability in your own Kubernetes clusters, reducing downtime and simplifying the troubleshooting process across multiple clusters.
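The last stage described above, querying log data in natural language through a vector index, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the talk: the `embed` function is a toy bag-of-words stand-in for a real embedding model, `LogIndex` is a hypothetical in-memory substitute for a vector database, and the sample log lines are invented. A real pipeline would embed logs collected by Fluent Bit and pass the retrieved lines to an LLM as context.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: lowercase bag-of-words counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class LogIndex:
    """Hypothetical in-memory substitute for a vector database."""

    def __init__(self):
        self.entries = []  # list of (log line, vector) pairs

    def add(self, line):
        self.entries.append((line, embed(line)))

    def search(self, question, k=2):
        """Return the k log lines most similar to the question."""
        q = embed(question)
        ranked = sorted(self.entries, key=lambda e: cosine(q, e[1]), reverse=True)
        return [line for line, _ in ranked[:k]]


# Invented sample lines standing in for systemd logs, Kubernetes events,
# and application logs.
SAMPLE_LOGS = [
    "kubelet: Back-off restarting failed container checkout in pod checkout-7d9f",
    "event: Pod payments-5c4b OOMKilled, memory limit exceeded",
    "app: GET /healthz 200 3ms",
]

index = LogIndex()
for line in SAMPLE_LOGS:
    index.add(line)

top = index.search("which pod was killed for exceeding its memory limit", k=1)
print(top[0])
```

With these sample lines, the question retrieves the OOMKilled event because it shares the most terms with it. A learned embedding model would also match paraphrases that share no words with the log line, which is the main reason to use one instead of keyword counts.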
Syllabus
From Logs To Insights: Real-time Conversational Troubleshooting for... Tiago Reichert & Lucas Duarte
Taught by
CNCF [Cloud Native Computing Foundation]