Loading…
Friday January 31, 2025 3:00pm - 5:00pm IST

Authors - Kavita Patil, Rohit Patil, Vedanti Koyande, Amaya Singh Thakur, Kshitij Kadam, Kavita Moholkar
Abstract - This paper evaluates a chatbot system designed for personalized business interactions using advanced Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG). The system combines proprietary business data with external databases to improve contextual relevance. A comparative analysis of leading LLMs—Gemini Pro, GPT-4, Claude 2, GPT-3.5, and LLaMA 2—was conducted across benchmarks like MMLU, GSM8K, BigBench Hard, HumanEval, and DROP. Gemini Pro outperformed the others, with scores of 88.9% on MMLU, 86.3% on GSM8K, 78.1% on BigBench Hard, 73.5% on HumanEval, and 79.2% on DROP, showcasing its strength in complex reasoning and long-context retrieval. Fine-tuned with business-specific data, Gemini Pro sets a new standard for high-accuracy, scalable chatbot solutions, ideal for enterprise applications.
Paper Presenter
Friday January 31, 2025 3:00pm - 5:00pm IST
Virtual Room B Pune, India

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link