What it does: Provides a rich CLI interface backed by a FastAPI server. The assistant can read files, write files, list directories, run terminal commands, and remember past sessions via persistent conversation memory. Runs fully offline via Ollama, or on free cloud APIs (Groq, Gemini, OpenRouter) when you need more power.
Provider chain: Requests route through Groq → Gemini → OpenRouter → Ollama in order of preference. If one provider is rate-limited or unavailable, the next one picks up automatically — so the assistant is always responsive.
How it was built: Python is the core language. FastAPI handles the backend API layer. Ollama runs local LLM inference (DeepSeek Coder or CodeLlama). The CLI uses
rich for styled terminal output. Memory is stored as
plain files so sessions persist across restarts.