Show HN: TurboQuant-WASM – Google's vector quantization in the browser
SIMD vector compression — 3 bits/dim with fast dot product
Our Take
This tackles embedding bloat in a way that actually matters — the kind of thing that makes a 1.5GB index basically unusable on mobile. TeamChong squeezed float32 vectors 6x using WASM and Google's quantization trick, and the real flex is skipping the training entirely. Most quantization approaches need hours of preprocessing, but this just wants a dim and a seed, then you're ready to encode. Direct dot product on compressed data without decoding is the move here, and relaxed SIMD support in modern browsers finally makes it viable. 211 stars in a few weeks isn't bad signal for something this niche.
WASM-based vector quantization library that compresses float32 embeddings 6x (1.5GB → 240MB) and enables direct search on compressed data without decompression
Key Facts
The people behind Show HN: TurboQuant-WASM – Google's vector quantization in the browser
Links
Want products like this in your inbox every morning?
Five products. Every morning. Written by someone who actually cares whether they're good or not. Free forever, unsubscribe whenever.