text-embedding-3-small Notes
OpenAI's most cost-effective embedding model.
- Dimensions: 1536 (default), reducible via
dimensionsparam - Pricing: $0.02 / 1M tokens input
- Context: 8191 tokens
- Use case: semantic search, clustering, recommendation
Trade-offs vs -3-large
- Smaller = cheaper + faster
- Quality drop is real but small for general English
- For multilingual: -large pulls ahead
Why mdfy uses -small
50,000 docs × ~1k tokens/doc × $0.02/M = $1 to fully embed a hub. That's the right operating point for free-tier users.
Reduction trick
Truncate to 512 dims for storage savings — re-normalize before similarity search. mdfy doesn't do this yet; pgvector(1536) is fine under 100k docs/user.