Vector index performance tuning: latency, recall, and memory tradeoffs covering HNSW / IVF / quantization selection.
Install
npx skills add skillerdotdev/skills@vector-index-tuning -y -g Cross-tool: Claude Code ~/.claude/skills/ · Qoder ~/.qoder/skills/ · TRAE ~/.trae/skills/
SourceTrigger this skill when building RAG systems, optimizing vector search performance, or choosing vector index solutions.
| Method | Recall | Latency | Memory | Use Case |
|---|---|---|---|---|
| HNSW | High | Low | High | <1M vectors, low-latency priority |
| IVF-PQ | Medium | Medium | Low | Large scale, memory-constrained |
| Flat | Highest | High | Highest | <100K, precision priority |
M (connections): 16-64, higher recall but more memoryefConstruction (build search width): 128-256, affects index build timeefSearch (query search width): top-k to 2*M, tunable at runtime