📞 +91-7667918914 | ✉️ ijarcce@gmail.com
International Journal of Advanced Research in Computer and Communication Engineering
International Journal of Advanced Research in Computer and Communication Engineering A monthly Peer-reviewed & Refereed journal
ISSN Online 2278-1021ISSN Print 2319-5940Since 2012
IJARCCE adheres to the suggestive parameters outlined by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
← Back to VOLUME 15, ISSUE 4, APRIL 2026

Edu-Vision: An AI-Powered Multimodal Educational Assistant for Intelligent Content Understanding and Personalized Tutoring

Sura Reddy, George A, Abhishek M, Dr.Paavai Anand

👁 11 views📥 3 downloads
Share: 𝕏 f in
Abstract: Edu-Vision is an advanced AI-powered educational assistant designed to understand, interpret, and teach educational content from multiple formats including PDF files, Word documents, PowerPoint presentations, images, diagrams, and scanned notes. The system integrates large language models, optical character recognition, vision-language models, and real-time internet retrieval to deliver explanations, summaries, quizzes, flashcards, study plans, and personalized tutoring support. The implementation combines file-specific extraction modules, tutoring pipelines powered by language models, and diagram analysis using a vision encoder-decoder model. The platform also includes context- aware document indexing, page-by-page explanation, accessibility-oriented features such as braille-ready output and audio-friendly quizzes, and adaptive tutoring based on subject and learner needs. By following Universal Design for Learning principles, Edu-Vision improves inclusivity and supports flexible, student-centered learning. The project demonstrates how multimodal AI can transform educational assistance into an interactive, accessible, and personalized learning experience.

Keywords: Multimodal Learning, Educational Assistant, OCR, Large Language Models, Vision-Language Model, Personalized Tutoring, Universal Design for Learning.

How to Cite:

[1] Sura Reddy, George A, Abhishek M, Dr.Paavai Anand, “Edu-Vision: An AI-Powered Multimodal Educational Assistant for Intelligent Content Understanding and Personalized Tutoring,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2026.154225

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License.