Abstract: Sign Speak is an AI-driven system designed to enable real-time, bidirectional communication between hearing individuals and those with speech or hearing impairments. It translates speech into animated sign language and recognizes hand gestures to generate spoken output in multiple languages, including English, Kannada, Tamil, and Hindi. The system integrates Speech Recognition, Natural Language Processing, and Computer Vision using OpenCV and MediaPipe. It leverages Google Translate for multilingual support and gTTS for voice synthesis. Built on a Flask backend with a responsive HTML, CSS, and JavaScript frontend, Sign Speak performs reliably under varied conditions.Designed for scalability, the system allows easy integration of updates like dynamic gesture recognition and regional sign language support. Testing has shown high accuracy and seamless module coordination. Future enhancements include mobile and wearable versions, continuous gesture recognition, emotion detection, and AR/VR integration—advancing its mission of inclusive, accessible communication.
Keywords: Sign Language, Speech Translation, Computer Vision,, Accessibility, Multilingual Translation Speech Recognition, and Natural Language Processing.
|
DOI:
10.17148/IJARCCE.2025.14506