USB-portable offline AI assistant for developers without reliable internet. Works on restricted networks, fully FOSS, zero cloud dependency.
A Zero-Setup, USB-Portable Agentic Coding Assistant for Offline Environments
License: MIT | 100% FOSS | No Internet Required | Linux ¡ macOS ¡ Windows
Over 2 billion people face unreliable internet monthly. But the deeper problem for developers isn't just connectivityâit's dependency.
Every modern AI coding tool assumes a working internet connection; a pre-configured machine, Ollama, or a runtime pre-installed; at least 16GB RAM; and a cloud API key. Students in rural India, engineers on air-gapped corporate networks, and developers at hackathon venues with broken WiFiânone of them fit that assumption.
Existing local tools like Goose, Cline, and Continue. Dev solved the privacy problem. They do not solve the accessibility and portability problem. They still require a full local dev environment just to run.
Hey Dongle solves a different problem: what if your entire AI coding assistant lived on a USB stick and just workedâon any machineâthe moment you plugged it in?
Hey Dongle is a self-contained, agentic AI coding assistant that runs entirely from a USB driveâno installation, no internet, no pre-configured environment required.
Plug it into any machine running Linux, macOS, or Windows with Python available, and within seconds you have a full agentic coding assistant that can read your codebase, write files, run code, and iterate autonomously.
It is not a chatbot. It is not a passive code explainer. It is an agent that takes instructions and acts on them.
Hey Dongle vs Goose / Cline / Aider:
Requires installationâHey Dongle: None. Others: Yes.
Requires Ollama or a runtimeâHey Dongle: Bundled inside. Others: Must be pre-installed.
Works on CPU-only machinesâHey Dongle: Yes. Others: Partial.
Works on low-RAM (8GB) machinesâHey Dongle: Optimized for it. Others: Struggles.
Zero internet at any pointâHey Dongle: Fully air-gapped. Others: Setup requires internet.
Target user â Hey Dongle: Developers anywhere. Others: Developers with good hardware.
Hey Dongle reads your project directory on startup and builds a lightweight index of your files. It doesn't just answer questions â it knows what you're working on.
The agent reads files, writes files, creates new ones, and applies targeted editsâwith your confirmation before any destructive action. This is a real agentic loop, not suggestions you copy-paste manually.
It runs your code, reads the output or error, and iteratesâup to N stepsâuntil the task is complete or it asks for your input.
"Fix this bug" â runs code â sees error â edits file â runs again â done.
Describe what you want in plain English. Hey Dongle breaks it into steps, executes them, and reports back. No memorizing commands.
Session history is stored locally on the USB. Come back the next day, plug in, and it remembers exactly where you left off.
Model: Qwen2.5-Coder 3B (Q4_K_M GGUF) Best coding performance at small size. Runs on 6GB RAM, CPU-only; no GPU needed.
Inference Runtime: llama-cpp-python (bundled) CPU-first, zero-install, cross-platform. Ships inside the USBânothing to download.
Agent Loop: Custom Python â no LangChain A minimal ~200 line tool-calling loop. Lean, fast, and fully auditable. No 40-dependency framework bloat.
Interface: Textual Python TUI terminal UI. No browser, no server, works on any machine.
Storage: SQLite (Python standard library) Session memory, file index, conversation history. Zero extra dependencies.
Packaging: PyInstaller + bundled model in a single folder on USB. One command to launch.
Minimum:
6 GB RAM free
Any x86_64 CPU (2015 or newer)
4 GB USB stick
No GPU required
No internet required
Recommended:
8 GB RAM or more
4+ CPU cores
8 GB USB stick
đž Developers in rural or low-connectivity areas
đ˘ Engineers on air-gapped corporate or government networks
đ Students at institutions with restricted or limited bandwidth
đ Competitive programmers at venues with unreliable WiFi
đ Anyone who needs a fully offline, privacy-first coding assistant
Zero-setup USB deployment
Qwen2.5-Coder 3B bundled, fully offline
Agentic loop: file read / write / execute / iterate
TUI interface via Textual
Persistent session memory via SQLite
RAG over local codebase for larger projects
User-swappable model support (bring your own GGUF)
Multi-file refactor planning
MCP (Model Context Protocol) tool integration
Support for even lower-spec hardware via further quantization
Community model registry for offline download packs
MIT License â fully open-source, no proprietary dependencies, no cloud, no tracking.
Built for FOSS Hack 2026 đŽđł