Skip to content

Neurality10/BAYES-WATCH

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

VidyaNetra — Empowering the Visually Impaired Through Technology

Hack36 | Theme: Community & Upliftment


Built at Hack36

🌐 Hack36 Website


Introduction

VidyaNetra is an inclusive educational platform designed for the blind and visually impaired in India. It enables users to independently access learning materials, take exams without scribes, and even learn braille using spatial and binaural audio. With multiple accessibility modules powered by AI, VidyaNetra aims to revolutionize inclusive education and uplift marginalized communities.


🎥 Demo Video

Watch Demo on Google Drive


📊 Presentation

View Presentation


📚 Table of Contents


🛠️ Technology Stack

  • Hugging Face Transformers
  • Streamlit
  • Google Text-to-Speech (gTTS)
  • Ollama
  • Qwen1.5-7B LLM
  • LangChain
  • Cloudinary (media upload & management)
  • Docling (PDF parsing & structuring)
  • Pillow (image processing)

🌟 Key Features

1. PDF/Text to Audio & Braille Converter

  • Converts digital text (PDFs, etc.) into synthesized audio
  • Generates digital Braille formats (e.g., .brf)
  • Makes existing written materials accessible for both auditory and tactile learners

2. Novel Braille Learning via Spatial Audio

  • Teaches Braille character patterns using spatial audio cues
  • Maps each Braille dot position to a specific 3D sound location
  • Supports intuitive, non-visual learning methods

3. AI-Powered Accessible Exam Portal

  • Secure login using facial and voice recognition
  • Reads questions aloud and accepts spoken responses
  • Enables independent, scribe-free exams for the visually impaired

4. Video Content Interaction (Transcript + Audio Q&A using RAG)

  • Users can upload educational videos
  • Transcribes content using Whisper
  • Interactive Q&A using Retrieval-Augmented Generation (RAG) via voice or text input
  • Responses are returned in both audio and text format

🌍 Real-World Impact

  • Empowers over 9 million visually impaired individuals in India
  • Enables independent access to education and examinations
  • Scalable integration with government and NGO programs
  • Promotes digital equity and accessibility in mainstream education

Future Enhancements

Voice-Driven Navigation

Implement a voice-powered AI agent for natural language-based platform control, eliminating the need to remember commands or menus.

Advanced Binaural Audio for Diagrams

Convert diagrams and images into textual representations, extract key insights, and deliver them through immersive binaural audio for a deeper auditory learning experience.

Interactive Data Analytics via Sound

Transform charts (bar graphs, line graphs, etc.) into audio experiences using pitch, volume, and spatial cues to represent data—navigated by voice or simple gestures.


Contributors

Team Name: Bayes Watch

  • Harshvardhan Patil
  • Darsh Shah
  • Vanshika Gautam
  • Parth Panchiwala

🏫 Made At

Built at Hack36

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages