hckrnws

Lossless LLM compression for efficient GPU inference via dynamic-length float

(arxiv.org)
411
22
14d

by CharlesW

jhj
14d
iandanforth
14d
VladVladikoff
14d
vessenes
14d
zorgmonkey
14d
eln1
12d
bjornsing
14d
ironbound
13d
refibrillator
14d
Dylan16807
13d
brookst
14d
liuliu
14d
hinkley
14d
boulos
14d
badmonster
14d
latchkey
14d
airstrike
14d
latchkey
14d
sundarurfriend
14d
latchkey
14d
Ringz
14d
latchkey
14d
saagarjha
14d
latchkey
14d
zarathustreal
13d
latchkey
13d
zarathustreal
8d
miohtama
14d
daveguy
14d
mhitza
14d
Der_Einzige
14d
LoganDark
14d
gunalx
14d
Der_Einzige
14d
danielmarkbruce
14d
jhj
14d
danielmarkbruce
13d
Dylan16807
10d
danielmarkbruce
10d
Dylan16807
10d
striking
14d
kadushka
14d
latchkey
14d
NBJack
14d
latchkey
14d
NBJack
12d
DrillShopper
14d
latchkey
14d
danielmarkbruce
14d
spoaceman7777
14d
danielmarkbruce
14d
loufe
14d
jonplackett
14d
loufe
14d
Animats
14d
eoerl
14d
aseligman
14d
yjftsjthsd-h
14d
brigade
13d
philjohn
14d
hnuser123456
14d
wills_forward
14d
moffkalast
14d
janalsncm
14d
danielmarkbruce
14d
BoorishBears
14d
imtringued
13d
danielmarkbruce
14d
BoorishBears
14d
danielmarkbruce
14d
BoorishBears
14d
danielmarkbruce
14d
moffkalast
14d
Der_Einzige
14d
moffkalast
13d
kridsdale3
14d
danielmarkbruce
14d
kadushka
14d
omneity
14d
throwaway314155
14d
thund
14d
thund
14d
jhj
14d
gitroom
14d
mountainriver
14d
jsemrau
14d
xmasotto
14d
buildbot
13d
firefoxd
14d
luotuoshangdui
14d
aazo11
14d
iamnotagenius
14d
sroussey
14d
spindump8930
14d
jasonjmcghee
14d
gojomo
14d
svachalek
14d
iamnotagenius
14d
marksimi
14d
newuser111
14d
fxegdfvbfds
14d
hchja
14d
spindump8930
14d
throwaway314155
14d
anticensor
14d
vessenes
14d
Havoc
14d
artemisart
14d
Vendan
14d
brokencode
14d
vintermann
14d
8ytecoder
14d
ziddoap
14d
ein0p
14d
timschmidt
14d
ein0p
14d
timschmidt
14d
ein0p
14d
bigyabai
10d
ow5
14d
ein0p
14d
