TinyVLM: Zero-Shot Object Detection on Microcontrollers via Vision-Language Distillation with Matryoshka Embeddings
By: Bibin Wilson
Published: 2026-03-15
View on arXiv →#cs.CV
Abstract
TinyVLM enables zero-shot object detection directly on microcontrollers by employing vision-language distillation with Matryoshka embeddings. This significantly pushes the boundaries of edge AI, allowing powerful visual recognition capabilities on highly resource-constrained devices for IoT and embedded applications.