Building a Two-Node AMD Strix Halo Cluster for LLMs with llama.cpp RPC - MiniMax-M2 and GLM 4.6

Building a Two-Node AMD Strix Halo Cluster for LLMs with llama.cpp RPC - MiniMax-M2 and GLM 4.6

Donato Capitella via YouTube Direct link

00:00 – Intro

1 of 7

1 of 7

00:00 – Intro

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Building a Two-Node AMD Strix Halo Cluster for LLMs with llama.cpp RPC - MiniMax-M2 and GLM 4.6

Automatically move to the next video in the Classroom when playback concludes

  1. 1 00:00 – Intro
  2. 2 01:48 – Network Setup
  3. 3 04:04 – Llama.cpp RPC Setup
  4. 4 06:14 – Running MiniMax-M2 Q6_K_XL
  5. 5 16:56 – Running GLM 4.6 Q4_K_XL
  6. 6 22:37 – Llama-Bench Results
  7. 7 24:28 – Cluster with 4 Strix Halos?

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.