hckrnws

Llama.cpp can do 40 tok/s on M2 Max, 0% CPU usage, using all 38 GPU cores

by samwillis

samwillis
2y
rcarmo
2y
senttoschool
2y
smoldesu
2y
wmf
2y
KeplerBoy
2y
sitkack
2y
sliken
2y
fomine3
2y
brandall10
2y
timschmidt
2y
senttoschool
2y
timschmidt
2y
senttoschool
2y
timschmidt
2y
senttoschool
2y
timschmidt
2y
gtirloni
2y
NicoJuicy
2y
MuffinFlavored
2y
viraptor
2y
kccqzy
2y
MuffinFlavored
2y
rini17
2y

Comment was deleted :(

emilsedgh
2y
8n4vidtmkvmk
2y
bobbylarrybobby
2y
rcme
2y
viraptor
2y
rcme
2y
viraptor
2y
senttoschool
2y
itake
2y
selalipop
2y
solarkraft
2y
r00fus
2y
slowmovintarget
2y
itg
2y
satysin
2y
smoldesu
2y
senttoschool
2y
smoldesu
2y
faeriechangling
2y
electric_mayhem
2y
jazzyjackson
2y
kristianp
2y
sitkack
2y
samwillis
2y
geek_at
2y
samwillis
2y
rvz
2y
rektide
2y
behnamoh
2y
senttoschool
2y

Comment was deleted :(

akomtu
2y
kurtoid
2y
airgapstopgap
2y

Crafted by Rajat

Source Code