vector-index-tuning
Optimize vector index performance for latency, recall, and memory. Use when tuning HNSW parameters, selecting quantization strategies, or scaling vector search infrastructure.
Guide to optimizing vector indexes for production performance.
| Data Size | Recommended Index |
|---|---|
| < 10K vectors | Flat (exact search) |
| 10K - 1M | HNSW |
| 1M - 100M | HNSW + Quantization |
| > 100M | IVF + PQ or DiskANN |
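The size thresholds above can be expressed as a small lookup. This is a minimal sketch, not part of any library: `recommend_index` is a hypothetical helper name, and because the table's ranges share their endpoints (1M and 100M), the sketch assumes each boundary value belongs to the smaller tier.

```python
def recommend_index(num_vectors: int) -> str:
    """Map a vector count to the index family suggested in the table above.

    Assumption: boundary values (exactly 1M or 100M) fall into the
    smaller tier, since the source table does not say which side wins.
    """
    if num_vectors < 0:
        raise ValueError("num_vectors must be non-negative")
    if num_vectors < 10_000:
        return "Flat (exact search)"
    if num_vectors <= 1_000_000:
        return "HNSW"
    if num_vectors <= 100_000_000:
        return "HNSW + Quantization"
    return "IVF + PQ or DiskANN"


if __name__ == "__main__":
    for n in (5_000, 250_000, 20_000_000, 500_000_000):
        print(f"{n:>12,} vectors: {recommend_index(n)}")
```

Treat the result as a starting point; latency, recall, and memory targets may justify a different choice near a boundary.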
| Parameter | Default | Effect |
|---|---|---|
| M | 16 | Connections per node; higher = better recall, more memory |
| efConstruction | 100 | Build quality; higher = better index, slower build |
| efSearch | 50 | Search quality; higher = better recall, slower search |
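Since efSearch trades recall against search speed, one common approach is to pick the smallest value that meets a recall target. The sketch below is library-agnostic and uses only the standard library; `HNSWParams`, `recall_at_k`, and `smallest_ef_search` are hypothetical names, and `measure_recall` is a callback you would supply that runs queries against your own index and compares them with exact results.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Optional, Sequence


@dataclass(frozen=True)
class HNSWParams:
    """Defaults copied from the parameter table above."""
    m: int = 16
    ef_construction: int = 100
    ef_search: int = 50


def recall_at_k(approx_ids: Sequence[int], exact_ids: Sequence[int]) -> float:
    """Fraction of the exact top-k neighbours that the approximate search returned."""
    if not exact_ids:
        raise ValueError("exact_ids must not be empty")
    return len(set(approx_ids) & set(exact_ids)) / len(exact_ids)


def smallest_ef_search(
    measure_recall: Callable[[int], float],
    target: float,
    candidates: Iterable[int] = (16, 32, 50, 64, 128, 256, 512),
) -> Optional[int]:
    """Return the lowest candidate efSearch whose measured recall meets the target.

    Returns None when no candidate reaches the target, which suggests
    raising M or efConstruction and rebuilding the index instead.
    """
    for ef in sorted(candidates):
        if measure_recall(ef) >= target:
            return ef
    return None
```

Choosing the lowest passing value keeps search latency as small as the recall target allows. The candidate list is an illustrative assumption; adjust it to the range your index supports.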
| Encoding | Memory per vector |
|---|---|
| Full Precision (FP32) | 4 bytes × dimensions |
| Half Precision (FP16) | 2 bytes × dimensions |
| INT8 Scalar | 1 byte × dimensions |
| Product Quantization | ~32-64 bytes total |
| Binary | dimensions/8 bytes |
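The per-vector sizes above translate directly into a memory estimate. This is a rough sketch with hypothetical function names; it counts raw vector storage only and ignores index overhead such as HNSW graph links. The `pq_bytes` default of 64 is an assumption taken from the upper end of the ~32-64 byte range.

```python
import math


def bytes_per_vector(dimensions: int, encoding: str, pq_bytes: int = 64) -> int:
    """Storage for one vector under the encodings listed above."""
    if dimensions <= 0:
        raise ValueError("dimensions must be positive")
    if encoding == "fp32":
        return 4 * dimensions
    if encoding == "fp16":
        return 2 * dimensions
    if encoding == "int8":
        return dimensions
    if encoding == "pq":
        # Product quantization stores a fixed-size code, independent of dimensions.
        return pq_bytes
    if encoding == "binary":
        # One bit per dimension, rounded up to whole bytes.
        return math.ceil(dimensions / 8)
    raise ValueError(f"unknown encoding: {encoding!r}")


def total_bytes(num_vectors: int, dimensions: int, encoding: str) -> int:
    """Raw vector storage for the whole collection, excluding index structures."""
    return num_vectors * bytes_per_vector(dimensions, encoding)


if __name__ == "__main__":
    for enc in ("fp32", "fp16", "int8", "pq", "binary"):
        gb = total_bytes(1_000_000, 768, enc) / 1e9
        print(f"{enc:>6}: {gb:.3f} GB for 1M x 768-dim vectors")
```

For example, one million 768-dimensional vectors take about 3.07 GB in FP32 and about 0.1 GB in binary form, before any index overhead.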
Full template library and detailed worked examples live in references/details.md. Read that file when you need the concrete templates.