Efficient Generative AI on Edge Devices: A Distillation-Based Approach
By: Sarah Jones, Michael Brown, Emily White, James Taylor, Olivia Davis
Published: 2025-12-11
View on arXiv →#cs.AI
Abstract
Deploying powerful generative AI models on resource-constrained edge devices remains a significant challenge. This paper introduces a novel distillation-based framework that effectively compresses large generative models without sacrificing performance, enabling real-time image generation and natural language processing on smartphones and IoT devices. Experimental results show significant reductions in model size and latency.