hckrnws

Llama.cpp can do 40 tok/s on M2 Max, 0% CPU usage, using all 38 GPU cores

by samwillis

samwillis
11m
rcarmo
11m
senttoschool
11m
smoldesu
11m
wmf
11m
KeplerBoy
11m
sitkack
11m
sliken
11m
fomine3
11m
brandall10
11m
timschmidt
11m
senttoschool
11m
timschmidt
11m
senttoschool
11m
timschmidt
11m
senttoschool
11m
timschmidt
11m
gtirloni
11m
NicoJuicy
11m
MuffinFlavored
11m
viraptor
11m
kccqzy
11m
MuffinFlavored
11m
rini17
11m

Comment was deleted :(

emilsedgh
11m
8n4vidtmkvmk
11m
bobbylarrybobby
11m
rcme
11m
viraptor
11m
rcme
11m
viraptor
11m
senttoschool
11m
itake
11m
selalipop
11m
solarkraft
11m
r00fus
11m
slowmovintarget
11m
itg
11m
satysin
11m
smoldesu
11m
senttoschool
11m
smoldesu
11m
faeriechangling
11m
electric_mayhem
11m
jazzyjackson
11m
kristianp
11m
sitkack
11m
samwillis
11m
geek_at
11m
samwillis
11m
rvz
11m
rektide
11m
behnamoh
11m
senttoschool
11m

Comment was deleted :(

akomtu
11m
kurtoid
11m
airgapstopgap
11m

Crafted by Rajat

Source Code