Hacker Next
A complete Llama2 inference engine that fits in 1356 bytes of x86 assembly (github.com)
27 points by monax 34 days ago | 0 comments