FOSS Hack 2025 > All Projects > Agentic Object Detection

Agentic Object Detection

Reasoning-driven object detection: human-like precision via text prompts without the overhead of custom training

To build a new plugin for HUB that will implement an agentic object detection workflow.

We will use design patterns to reason at length about unique attributes like color, shape, and texture for smarter, more precise recognition in any scenario.

This will be implemented by a combination of reasoning LLMs (like deepseek r1) with large vision models like (like SmolVLM-Instruct) with traditional compute vision techniques.

Issues, PRs and Discussions

Pull Request

add agent_detection_plugin

Asankhaya Sharma

codelion