- semantic_cache.py: Semantic similarity matching for cache hits
- rag.py: RAG-based context selection with local embeddings
- compression.py: Conversation history summarization
- New endpoints: /cache/semantic-lookup, /cache/semantic-store, /context/rag, /compress
- Uses sentence-transformers (all-MiniLM-L6-v2); no external API calls
- No vector DB needed; cosine similarity on small datasets is fast enough
- Expected savings: 50-70% token reduction
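The semantic cache described above can be sketched roughly as follows: store an embedding per cached prompt and return the cached response when cosine similarity clears a threshold. This is a minimal sketch, not the actual semantic_cache.py; the `embed_fn` parameter, `SemanticCache` class name, and the 0.85 threshold are assumptions — in the real service the embedder would presumably be `SentenceTransformer("all-MiniLM-L6-v2").encode`, and only NumPy is needed for the similarity math.

```python
import numpy as np

class SemanticCache:
    """Hypothetical sketch of a semantic cache.

    Stores (unit-normalized embedding, response) pairs and answers a
    lookup from the closest entry when cosine similarity >= threshold.
    In the real service, embed_fn would likely be
    SentenceTransformer("all-MiniLM-L6-v2").encode (assumption).
    """

    def __init__(self, embed_fn, threshold=0.85):
        self.embed_fn = embed_fn
        self.threshold = threshold
        self.entries = []  # list of (unit vector, response)

    def store(self, prompt: str, response: str) -> None:
        vec = np.asarray(self.embed_fn(prompt), dtype=float)
        self.entries.append((vec / np.linalg.norm(vec), response))

    def lookup(self, prompt: str):
        if not self.entries:
            return None
        q = np.asarray(self.embed_fn(prompt), dtype=float)
        q = q / np.linalg.norm(q)
        # Cosine similarity reduces to a dot product on unit vectors.
        sims = np.array([emb @ q for emb, _ in self.entries])
        best = int(np.argmax(sims))
        if sims[best] >= self.threshold:
            return self.entries[best][1]
        return None  # cache miss: fall through to the LLM
```

With no vector DB, a linear scan over a NumPy dot product is O(n·d) per lookup, which is plenty fast for the small per-user datasets the commit message assumes.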
fastapi==0.109.0
uvicorn[standard]==0.27.0
sqlalchemy==2.0.25
pydantic==2.5.3
python-dotenv==1.0.0
aiosqlite==0.19.0
sentence-transformers==2.3.1
numpy==1.26.3
tiktoken==0.5.2