# VidyaNetra

VidyaNetra is an inclusive educational platform designed for blind and visually impaired learners in India. It enables users to independently access learning materials, take exams without scribes, and learn Braille using spatial and binaural audio. With multiple AI-powered accessibility modules, VidyaNetra aims to revolutionize inclusive education and uplift marginalized communities.
## Table of Contents

- Introduction
- Demo Video
- Presentation
- Technology Stack
- Key Features
- Real-World Impact
- Future Enhancements
- Contributors
- Made At
## Technology Stack

- Hugging Face Transformers
- Streamlit
- Google Text-to-Speech (gTTS)
- Ollama
- Qwen1.5-7B LLM
- LangChain
- Cloudinary (media upload & management)
- Docling (PDF parsing & structuring)
- Pillow (image processing)
## Key Features

### Text-to-Audio & Braille Conversion

- Converts digital text (PDFs, etc.) into synthesized audio
- Generates digital Braille formats (e.g., .brf)
- Makes existing written materials accessible for both auditory and tactile learners
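At its core, Grade-1 Braille transcription is a per-character lookup of dot patterns. The sketch below illustrates the idea using Unicode Braille cells; a real .brf file uses the ASCII Braille encoding and contraction rules not shown here, so this is a simplified model, not VidyaNetra's actual converter.

```python
# Unicode Braille block starts at U+2800; dots 1-6 set bits 0-5 of the offset.
DOTS = {
    "a": (1,), "b": (1, 2), "c": (1, 4), "d": (1, 4, 5), "e": (1, 5),
    "f": (1, 2, 4), "g": (1, 2, 4, 5), "h": (1, 2, 5), "i": (2, 4),
    "j": (2, 4, 5), "k": (1, 3), "l": (1, 2, 3), "m": (1, 3, 4),
    "n": (1, 3, 4, 5), "o": (1, 3, 5), "p": (1, 2, 3, 4),
    "q": (1, 2, 3, 4, 5), "r": (1, 2, 3, 5), "s": (2, 3, 4),
    "t": (2, 3, 4, 5), "u": (1, 3, 6), "v": (1, 2, 3, 6), "w": (2, 4, 5, 6),
    "x": (1, 3, 4, 6), "y": (1, 3, 4, 5, 6), "z": (1, 3, 5, 6),
}

def to_braille(text: str) -> str:
    """Transcribe letters and spaces to Unicode Braille cells (Grade 1)."""
    cells = []
    for ch in text.lower():
        if ch == " ":
            cells.append("\u2800")  # blank cell
        elif ch in DOTS:
            offset = sum(1 << (dot - 1) for dot in DOTS[ch])
            cells.append(chr(0x2800 + offset))
    return "".join(cells)

print(to_braille("hello"))  # ⠓⠑⠇⠇⠕
```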
### Braille Learning via Spatial Audio

- Teaches Braille character patterns using spatial audio cues
- Maps each Braille dot position to a specific 3D sound location
- Supports intuitive, non-visual learning methods
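One way to picture the dot-to-location mapping: assign each of the six dot positions an azimuth and elevation mirroring the physical Braille cell, then pan a tone accordingly. The layout and constant-power panning below are illustrative assumptions, not VidyaNetra's actual binaural renderer.

```python
import math

# Assumed layout: dots 1-3 form the left column (top to bottom),
# dots 4-6 the right column, mirroring a physical Braille cell.
# Azimuth in degrees (negative = left); elevation conveys row height.
DOT_POSITIONS = {
    1: (-30, 20), 2: (-30, 0), 3: (-30, -20),
    4: (30, 20),  5: (30, 0),  6: (30, -20),
}

def pan_gains(azimuth_deg: float) -> tuple[float, float]:
    """Constant-power stereo panning: map azimuth in [-90, 90] degrees
    to (left, right) channel gains whose squares sum to 1."""
    theta = (azimuth_deg + 90) / 180 * (math.pi / 2)  # 0 rad = hard left
    return math.cos(theta), math.sin(theta)

for dot, (az, el) in DOT_POSITIONS.items():
    left, right = pan_gains(az)
    print(f"dot {dot}: az {az:+}, el {el:+} -> L={left:.2f}, R={right:.2f}")
```

A full binaural version would convolve each tone with head-related transfer functions (HRTFs) rather than simple stereo panning, but the dot-to-position mapping stays the same.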
### Scribe-Free Examination Module

- Secure login using facial and voice recognition
- Reads questions aloud and accepts spoken responses
- Enables independent, scribe-free exams for the visually impaired
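Accepting spoken responses implies grading transcribed answers. The sketch below shows only the answer-matching step, assuming an ASR model such as Whisper has already produced a transcript; the fuzzy-matching approach and threshold are illustrative assumptions, not the platform's actual grading logic.

```python
import difflib
import re

def normalize(text: str) -> str:
    """Lowercase and strip punctuation so ASR quirks don't fail a match."""
    return re.sub(r"[^a-z0-9 ]", "", text.lower()).strip()

def grade_spoken_answer(transcript: str, expected: str,
                        threshold: float = 0.8) -> bool:
    """Fuzzy-match a transcribed spoken answer against the expected answer."""
    ratio = difflib.SequenceMatcher(
        None, normalize(transcript), normalize(expected)
    ).ratio()
    return ratio >= threshold

print(grade_spoken_answer("Photo synthesis.", "photosynthesis"))  # True
print(grade_spoken_answer("respiration", "photosynthesis"))       # False
```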
### Video Q&A with RAG

- Users can upload educational videos
- Transcribes content using Whisper
- Interactive Q&A using Retrieval-Augmented Generation (RAG) via voice or text input
- Responses are returned in both audio and text format
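The RAG flow above can be sketched in miniature. VidyaNetra uses LangChain with Qwen1.5-7B via Ollama; the sketch below swaps embedding-based retrieval for plain word overlap to stay self-contained, but keeps the retrieve-then-generate shape: chunk the transcript, rank chunks against the question, and pass the best chunk to the LLM as context.

```python
def chunk_transcript(transcript: str, size: int = 40) -> list[str]:
    """Split a transcript into fixed-size word chunks."""
    words = transcript.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]

def retrieve(question: str, chunks: list[str], k: int = 1) -> list[str]:
    """Rank chunks by word overlap with the question. A real system
    would score with embeddings, but the shape is the same."""
    q_words = set(question.lower().split())
    scored = sorted(
        chunks,
        key=lambda c: len(q_words & set(c.lower().split())),
        reverse=True,
    )
    return scored[:k]

transcript = (
    "Photosynthesis converts light energy into chemical energy. "
    "The mitochondria is the powerhouse of the cell and produces ATP "
    "through cellular respiration."
)
chunks = chunk_transcript(transcript, size=10)
context = retrieve("what does the mitochondria produce", chunks)[0]
# `context` plus the question would then be sent to the LLM for the answer,
# which is finally rendered as both text and synthesized audio.
print(context)
```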
## Real-World Impact

- Empowers over 9 million visually impaired individuals in India
- Enables independent access to education and examinations
- Scalable integration with government and NGO programs
- Promotes digital equity and accessibility in mainstream education
## Future Enhancements

- Implement a voice-powered AI agent for natural-language platform control, eliminating the need to remember commands or menus.
- Convert diagrams and images into textual representations, extract key insights, and deliver them through immersive binaural audio for a deeper auditory learning experience.
- Transform charts (bar graphs, line graphs, etc.) into audio experiences using pitch, volume, and spatial cues to represent data, navigated by voice or simple gestures.
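One way the chart-sonification idea could be prototyped: map each data value linearly onto a frequency range, so higher values sound higher-pitched. The frequency range and linear mapping below are illustrative assumptions, not a committed design.

```python
def sonify(values: list[float], f_min: float = 220.0,
           f_max: float = 880.0) -> list[float]:
    """Map each data point to a pitch: higher value -> higher frequency.
    A renderer would play each frequency as a short tone, panned
    left-to-right to convey position along the x-axis."""
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0  # avoid division by zero for flat data
    return [f_min + (v - lo) / span * (f_max - f_min) for v in values]

print([round(f, 1) for f in sonify([10, 20, 40])])  # [220.0, 440.0, 880.0]
```

Volume could encode a second series and spatial panning the x-position, matching the pitch/volume/spatial-cue combination described above.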
## Contributors

Team Name: Bayes Watch
- Harshvardhan Patil
- Darsh Shah
- Vanshika Gautam
- Parth Panchiwala