text-embedding-3-small Notes

OpenAI's most cost-effective embedding model.

Dimensions: 1536 (default), reducible via dimensions param
Pricing: $0.02 / 1M tokens input
Context: 8191 tokens
Use case: semantic search, clustering, recommendation

Trade-offs vs -3-large

Smaller = cheaper + faster
Quality drop is real but small for general English
For multilingual: -large pulls ahead

Why mdfy uses -small

50,000 docs × ~1k tokens/doc × $0.02/M = $1 to fully embed a hub. That's the right operating point for free-tier users.

Reduction trick

Truncate to 512 dims for storage savings — re-normalize before similarity search. mdfy doesn't do this yet; pgvector(1536) is fine under 100k docs/user.