
Metal qmatmul mat-mat product #39

Merged
merged 8 commits into main from metal_qmatmul_mm on Nov 14, 2024

Conversation

EricLBuehler (Owner)

Before, we were only using an mv (matrix-vector) qmatmul kernel, whose cost scaled linearly with bs * seqlen. This PR uses an mm (matrix-matrix) kernel instead, which should speed things up!
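For context, a minimal Rust sketch of the kind of dispatch this change enables: pick between a mat-vec and a mat-mat quantized matmul path based on how many output rows (bs * seqlen) the call produces. The launcher functions and the row threshold below are hypothetical placeholders, not the project's actual Metal API; the commit history suggests the PR also experimented with always using the mm path and with mirroring llama.cpp's heuristic.

```rust
// Sketch only: illustrates the mv-vs-mm dispatch idea described above.
// `launch_qmv_kernel` / `launch_qmm_kernel` and MM_ROW_THRESHOLD are
// illustrative assumptions, not the real kernel entry points.

/// Illustrative threshold: with only a handful of rows (e.g. single-token
/// decoding), the mv kernel tends to win; for prompt processing with many
/// rows, the mm kernel amortizes the dequantization work across rows.
const MM_ROW_THRESHOLD: usize = 4;

fn qmatmul_dispatch(bs: usize, seqlen: usize) {
    let n_rows = bs * seqlen;
    if n_rows >= MM_ROW_THRESHOLD {
        // One mat-mat kernel launch covering all rows at once.
        launch_qmm_kernel(n_rows);
    } else {
        // Fall back to the mat-vec kernel, whose cost grows linearly in n_rows.
        launch_qmv_kernel(n_rows);
    }
}

// Placeholder launchers so the sketch compiles on its own.
fn launch_qmm_kernel(n_rows: usize) {
    println!("mm kernel: {n_rows} rows in one launch");
}

fn launch_qmv_kernel(n_rows: usize) {
    println!("mv kernel: {n_rows} row(s)");
}

fn main() {
    qmatmul_dispatch(1, 1);   // decoding a single token -> mv path
    qmatmul_dispatch(2, 512); // prompt processing -> mm path
}
```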

EricLBuehler merged commit 6be03dd into main on Nov 14, 2024
8 of 11 checks passed
EricLBuehler deleted the metal_qmatmul_mm branch on November 14, 2024 at 16:40
EricLBuehler added a commit that referenced this pull request Nov 14, 2024
* Test passes

* All tests pass

* Now all the tests really pass

* Try out always using mm

* Mirror llama.cpp metric

* Mirror llama.cpp metric

* Update test