Posts: 0 · Comments: 3 · Joined: 8 mo. ago

Lisp enjoyer, occasional code golfer.
I self-host what I can, because I can.
As for the rest, I invite you to read my timeline.

  • @Treczoks @flemtone Thing is, the final LLM inference is usually done at reduced precision: typically 8 to 16 bits, but sometimes 4 bits or lower, with different layers running at different precisions.

  • @TheBlackLounge @kalkulat LLM inference is definitely theoretically possible on analog chips. They just may not scale :v
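
To make the reduced-precision point from the first comment concrete, here is a minimal sketch of symmetric integer quantization at 8 and 4 bits. The weight values and the helper names (`quantize`, `dequantize`, `max_error`) are made up for illustration, and a single scale per tensor is a simplifying assumption; real inference stacks tend to use per-channel or per-group scales and may mix bit widths across layers.

```python
def quantize(weights, bits):
    """Map floats to signed integers of the given width (symmetric, one scale)."""
    qmax = 2 ** (bits - 1) - 1              # 127 for 8 bits, 7 for 4 bits
    scale = max(abs(w) for w in weights) / qmax
    if scale == 0.0:                        # all-zero input: any scale works
        scale = 1.0
    q = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float values from the integer codes."""
    return [v * scale for v in q]


def max_error(weights, bits):
    """Largest absolute round-trip error for the given bit width."""
    q, scale = quantize(weights, bits)
    return max(abs(w - r) for w, r in zip(weights, dequantize(q, scale)))


# Hypothetical weights, chosen only for illustration.
weights = [0.82, -0.41, 0.05, -1.27, 0.33]

for bits in (8, 4):
    q, scale = quantize(weights, bits)
    print(f"{bits}-bit codes: {q}, max error: {max_error(weights, bits):.4f}")
```

The round-trip error is bounded by half the scale step, so dropping from 8 to 4 bits makes the grid roughly 18 times coarser for the same value range. That is the trade-off behind keeping some layers at higher precision than others.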