Commit graph

7 commits

Author SHA1 Message Date
e4dd4da188 Update MCP server (remove cache tool), fix readme endpoints, add template reference 2026-03-22 22:35:02 -04:00
3dce79e818 Add agent template for Forgejo 2026-03-22 22:33:39 -04:00
b8edf40010 Major refactor: remove semantic cache, add config, auth, improve RAG performance, fix tags JSON 2026-03-22 22:32:44 -04:00
62c875c9a6 Change API port from 8080 to 8675 across all configs and docs 2026-03-22 21:54:51 -04:00
82fd963577 Add token-saving patterns: semantic cache, RAG, compression
- semantic_cache.py: Semantic similarity matching for cache hits
- rag.py: RAG-based context selection with local embeddings
- compression.py: Conversation history summarization
- New endpoints: /cache/semantic-lookup, /cache/semantic-store, /context/rag, /compress
- Uses sentence-transformers (all-MiniLM-L6-v2) - no external API calls
- No vector DB needed - cosine similarity on small datasets is fast enough
- Expected savings: 50-70% token reduction
2026-03-22 21:32:08 -04:00
7f7699ff94 Initial commit: Skills API with MCP servers
- FastAPI backend with SQLite (ai.db)
- Tables: skills, snippets, conventions, cache, memory
- MCP servers: homelab, gameservers, skills
- Docker Compose setup
- Seed data with 8 skills, 2 conventions, 2 snippets
- Token savings patterns via context bundles and caching
2026-03-22 21:18:23 -04:00
114b3b1628 Initial commit 2026-03-22 21:13:03 -04:00