Lead AI Strategy with UCSB's Agentic AI Program — Microsoft Certified
AI Engineer - Learn how to integrate AI into software applications
Overview
Google, IBM & Meta Certificates – 40% Off
One plan covers every Professional Certificate on Coursera.
Unlock All Certificates
Dive into an informative 49-minute InfoQ talk where David Cheney from GitHub reveals the architectural secrets and innovative engineering solutions behind achieving sub-200 millisecond response times for GitHub Copilot's code completion service. Explore how the team overcame the unique challenges of building a cloud-hosted, interactive autocomplete system that competes with local IDE performance. Follow their journey from initial alpha to a globally distributed system, learning about critical technical decisions including the implementation of HTTP/2, custom load balancing with GLB, and intelligent request handling strategies. The presentation provides valuable insights for engineers working on low-latency applications and distributed systems at scale, with a complete transcript available on the InfoQ website.
Syllabus
GitHub Copilot's Latency Secrets: How They Built Sub-200ms Autocomplete
Taught by
InfoQ