Authors - Vidisha Deshpande, Gauri Shelke, Bhakti Kadam

Abstract - Advancements in deep learning are fundamentally transforming assistive technologies, providing visually impaired users with unprecedented access to information and enhanced interaction with their surroundings. This paper comprehensively surveys traditional and emerging assistive technologies, with a focus on real-time image caption generation systems. It highlights modern advancements that bridge sensory limitations and digital interaction, covering technologies such as Optical Character Recognition (OCR)-based text readers, object detection systems, image captioning systems, and intelligent haptic feedback devices. In particular, it examines the critical role of vision-language models and multimodal systems, which enable real-time auditory descriptions of visual scenes. The survey also identifies significant gaps in real-world deployment, particularly in terms of adaptability, cost, and inclusivity. These findings emphasize the need for more accessible, affordable, and real-time solutions that cater to the diverse needs of visually impaired individuals.