QFactory - Accelerating Quantized Large Language Model Serving with Qtile Graphs

QFactory - Accelerating Quantized Large Language Model Serving with Qtile Graphs

USENIX via YouTube Direct link

USENIX ATC '25 - QFactory: Accelerating Quantized Large Language Model Serving with Qtile Graphs

1 of 1

1 of 1

USENIX ATC '25 - QFactory: Accelerating Quantized Large Language Model Serving with Qtile Graphs

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

QFactory - Accelerating Quantized Large Language Model Serving with Qtile Graphs

Automatically move to the next video in the Classroom when playback concludes

  1. 1 USENIX ATC '25 - QFactory: Accelerating Quantized Large Language Model Serving with Qtile Graphs

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.