32x Reduced Memory Usage With Binary Quantization
In-depth technical breakdown of how binary quantization works and how to use it in Weaviate.
April 2, 2024 · 18 min read
In-depth technical breakdown of how binary quantization works and how to use it in Weaviate.
Boosting Weaviate using SIMD-AVX512, Loop Unrolling and Compiler Optimizations
The details behind how you can compress vectors using PQ with little loss of recall!
Fine-tuning LlaMA 7B to use the Weaviate GraphQL APIs
Using the Weaviate Tile Encoder to compress vectors with Product Quantization.
Implementing HNSW + Product Quantization (PQ) vector compression in Weaviate.
Vector search on disks: How does Vamana compare to HNSW?
Self-Supervised Retrieval can surpass BM25 and Supervised techniques. This technique also pairs very well alongside BM25 in Hybrid Retrieval. Learn more about it.