Loading…
Wednesday January 29, 2025 1:30pm - 1:45pm IST
Authors - Varalatchoumy M, Syed Hayath, Dinesh D, Dhanush C P, Manu R, V Sadhana
Abstract - This paper presents an advanced Generative AI-powered system for video-to-text summarization, leveraging state-of-the-art Computer Vision (CV) technologies and Natural Language Processing (NLP) techniques. The developed system addresses the growing need to extract key information efficiently from lengthy videos across diverse domains such as education, entertainment, sports, and instructional content. By integrating visual and textual data, it pinpoints essential moments and generates concise summaries that capture the core message of the video, reducing the time users spend understanding extensive media. At the heart of this system lies a robust, open-source large language model (LLM), finetuned to produce human-like summaries from video transcripts. The system processes visual cues using advanced CV techniques—such as keyframe extraction and scene segmentation—and textual cues via Automatic Speech Recognition (ASR), which converts audio into text. This dual approach facilitates a deep understanding of spoken and visual content, ensuring that summaries are precise, relevant, and contextually accurate. The system has been evaluated on a diverse dataset, comprising videos of various genres, qualities, and lengths, demonstrating its capability to generalize effectively across a wide spectrum of content. Applications of this video summarization tool include content management, video indexing, educational platforms, and beyond, offering significant time-saving benefits to users and organizations. By incorporating real-time feedback, the system continuously refines its summarization techniques, enhancing accuracy and ensuring that users quickly access the most relevant information, thereby promoting greater accessibility and usability of video content.
Paper Presenter
avatar for Dinesh D
Wednesday January 29, 2025 1:30pm - 1:45pm IST
Maple Hotel Crowne Plaza, Pune, India

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link