Superdatascience

692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU

Author: Vários
Narrator: Vários
Publisher: Podcast
Duration: 0:07:39
More information

Add to list

Listen

Synopsis

Join Jon as he navigates listeners through the innovative SpQR approach—a cutting-edge, lossless LLM weight compression technique that harnesses the power of quantization. Tune in as Jon delves into the four steps behind this groundbreaking method in this week's episode.Additional materials: www.superdatascience.com/692Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Superdatascience

692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU

Synopsis

Need help

Install our app:

Superdatascience

692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU

Informações:

Synopsis

Need help

Install our app: