jeremylongshore/Hybrid-ai-stack-intent-solutions
Production AI orchestration that routes requests between local models (Ollama) and cloud APIs (Claude/OpenAI). Reduces AI costs by 60-80% by running lightweight models locally for simple tasks.
GitHub repository with 5 stars and 1 fork.
Language: Python
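
The description above mentions routing simple tasks to local models and other requests to cloud APIs. The following is a minimal sketch of that idea, not the repository's actual code: the scoring heuristic, the keyword list, the threshold, and the names `estimate_complexity`, `choose_route`, and `Route` are all hypothetical and chosen only for illustration. It makes no network calls; it only decides which backend a prompt would go to.

```python
from dataclasses import dataclass

# Hypothetical keyword list: words that hint a request needs a stronger model.
# These values are assumptions, not taken from the repository.
COMPLEX_KEYWORDS = ("analyze", "refactor", "prove", "design", "architecture")


@dataclass
class Route:
    backend: str  # "local" (e.g. an Ollama model) or "cloud" (e.g. Claude/OpenAI)
    score: float  # complexity score that led to the decision


def estimate_complexity(prompt: str) -> float:
    """Return a rough complexity score in [0, 1]; higher means harder."""
    text = prompt.lower()
    # Longer prompts score higher, saturating at 200 words.
    length_score = min(len(text.split()) / 200, 1.0)
    # A flat bonus if any keyword appears anywhere in the prompt.
    keyword_score = 0.5 if any(k in text for k in COMPLEX_KEYWORDS) else 0.0
    return min(length_score + keyword_score, 1.0)


def choose_route(prompt: str, threshold: float = 0.3) -> Route:
    """Send prompts scoring below the threshold to the local backend."""
    score = estimate_complexity(prompt)
    backend = "local" if score < threshold else "cloud"
    return Route(backend=backend, score=score)


if __name__ == "__main__":
    for p in (
        "What is 2+2?",
        "Analyze the architecture of this service and design a migration plan",
    ):
        route = choose_route(p)
        print(f"{route.backend:5s} score={route.score:.2f} prompt={p!r}")
```

Under these assumptions, a short factual question stays on the local model, while a prompt containing one of the listed keywords is sent to the cloud. Any real cost saving depends on how many requests fall below the threshold and on the relative prices of the two backends.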