A real-time object detection system with voice narration. The goal is a social media platform for visually impaired users: they can upload photos and hear a spoken description of each image, share photos with friends and family, and browse photos posted by others to keep up with what is happening around them.
- Real-time object detection: uses COCO-SSD (TensorFlow.js) for live object detection.
- Voice narration of detected objects
- Web-based interface for easy access
- Unique Social Media Platform for the visually impaired
- Unique profiles and verified users
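To illustrate how detection and narration could fit together, here is a minimal sketch. It assumes predictions in the shape the COCO-SSD model returns (`class`, `score`, `bbox`); the helper names `describeDetections` and `narrate`, the 0.5 score threshold, and the sentence wording are hypothetical and are not taken from this project's code.

```javascript
// Sketch only: turn COCO-SSD style predictions into one sentence and speak it.
// Assumed prediction shape: { class: "person", score: 0.9, bbox: [x, y, width, height] }.

// Hypothetical helper: count confident detections per class and build a sentence.
function describeDetections(predictions, minScore = 0.5) {
  const counts = new Map();
  for (const p of predictions) {
    if (p.score < minScore) continue; // skip low-confidence detections
    counts.set(p.class, (counts.get(p.class) || 0) + 1);
  }
  if (counts.size === 0) return "No objects detected.";
  // Naive pluralization (just appends "s"); good enough for a sketch.
  const parts = [...counts].map(([name, n]) => (n === 1 ? `1 ${name}` : `${n} ${name}s`));
  return `I can see ${parts.join(", ")}.`;
}

// Hypothetical helper: speak the text where the Web Speech API exists (browser only).
// Returns false when speech synthesis is unavailable, for example under Node.js.
function narrate(text) {
  if (typeof window === "undefined" || !("speechSynthesis" in window)) return false;
  window.speechSynthesis.cancel(); // drop queued speech so narration stays current
  window.speechSynthesis.speak(new SpeechSynthesisUtterance(text));
  return true;
}
```

In a browser, this could be wired up roughly as `narrate(describeDetections(await model.detect(video)))`, where `model` comes from `cocoSsd.load()` and `video` is the element showing the camera stream.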
Follow these instructions to set up the project on your local machine.
Ensure you have the following installed:
- Node.js (for web dependencies)
- MongoDB
- Clone the repository:

      git clone https://github.com/yourusername/fusion-vision.git
      cd fusion-vision

- Install dependencies:

      npm install

- Set up the database:
  - Ensure MongoDB is running locally, or configure a remote MongoDB instance.
  - Create a .env file with the following content:

        MONGODB_URI=mongodb://localhost:27017/camera-app
        PORT=5000
        DB_HOST=localhost
        DB_USER=root
        DB_PASSWORD=your_password_here
        DB_NAME=fusion_vision

- Start the server:

      node server.js

- Launch the web interface: open index.html in a modern web browser.
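The contents of `server.js` are not shown here, so the following is only a sketch of how the `.env` values from the setup steps might be read and validated using nothing beyond the Node.js standard library. The helper names `parseEnv` and `loadConfig` are hypothetical, and the sketch assumes the server needs only `MONGODB_URI` and `PORT`; whether the `DB_*` variables are also used is not stated in this document.

```javascript
// Sketch only: read KEY=VALUE settings and apply fallbacks.

// Hypothetical helper: parse .env-style text into a plain object.
function parseEnv(text) {
  const env = {};
  for (const rawLine of text.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (line === "" || line.startsWith("#")) continue; // skip blanks and comments
    const eq = line.indexOf("=");
    if (eq === -1) continue; // ignore lines without a KEY=VALUE pair
    env[line.slice(0, eq).trim()] = line.slice(eq + 1).trim();
  }
  return env;
}

// Hypothetical helper: pick the settings this sketch assumes the server needs.
// The fallback values mirror the sample .env from the setup steps.
function loadConfig(env) {
  const port = Number(env.PORT || 5000);
  if (!Number.isInteger(port) || port <= 0) {
    throw new Error(`Invalid PORT: ${env.PORT}`);
  }
  return {
    mongoUri: env.MONGODB_URI || "mongodb://localhost:27017/camera-app",
    port,
  };
}
```

A caller could combine the two as `loadConfig(parseEnv(require("fs").readFileSync(".env", "utf8")))` and fail fast at startup if a value is malformed.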
- TensorFlow (>=2.0.0)
  - Deep learning framework for object detection.
  - Used for running the COCO-SSD model.
- HTML5
  - Camera API
  - Canvas for drawing
  - Speech synthesis
- JavaScript
  - TensorFlow.js
  - WebRTC for camera access
  - Speech synthesis API
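As an illustration of the "Canvas for drawing" item, the sketch below outlines detected objects on a 2D canvas context. It assumes COCO-SSD style predictions whose `bbox` is `[x, y, width, height]` in pixels; the function name `drawDetections`, the colour, the font, and the 0.5 threshold are hypothetical choices and not taken from this project.

```javascript
// Sketch only: draw a box and a label for each confident detection.
// `ctx` is expected to be a CanvasRenderingContext2D (canvas.getContext("2d")).
function drawDetections(ctx, predictions, minScore = 0.5) {
  let drawn = 0;
  ctx.lineWidth = 2;
  ctx.strokeStyle = "#00ff00";
  ctx.fillStyle = "#00ff00";
  ctx.font = "16px sans-serif";
  for (const p of predictions) {
    if (p.score < minScore) continue; // skip low-confidence detections
    const [x, y, width, height] = p.bbox;
    ctx.strokeRect(x, y, width, height);
    const label = `${p.class} ${Math.round(p.score * 100)}%`;
    // Put the label above the box, or inside it when the box touches the top edge.
    ctx.fillText(label, x, y > 16 ? y - 4 : y + 16);
    drawn += 1;
  }
  return drawn; // number of boxes actually drawn
}
```

In a live view, the canvas would typically be cleared with `ctx.clearRect(0, 0, canvas.width, canvas.height)` before each call so boxes from the previous frame do not linger.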
- Modern web browser with:
  - WebRTC support
  - JavaScript enabled
  - Web Speech API support
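The browser requirements above could be checked at startup with a small feature-detection routine. This is a sketch under assumptions: the function name `missingFeatures` and the wording of the messages are hypothetical, and the check covers only camera access and speech synthesis.

```javascript
// Sketch only: report which required browser features are unavailable.
// `win` is the global object to inspect (pass `window` in a browser).
function missingFeatures(win) {
  const missing = [];
  const nav = win.navigator;
  const hasCamera =
    Boolean(nav && nav.mediaDevices) &&
    typeof nav.mediaDevices.getUserMedia === "function";
  if (!hasCamera) missing.push("camera access (getUserMedia)");
  const hasSpeech =
    "speechSynthesis" in win && typeof win.SpeechSynthesisUtterance === "function";
  if (!hasSpeech) missing.push("speech synthesis (Web Speech API)");
  return missing; // an empty array means every checked feature is present
}
```

Note that browsers expose `navigator.mediaDevices` only in secure contexts (HTTPS or localhost), so a missing camera feature can also mean the page was served over plain HTTP.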
To ensure the application functions correctly, you need to grant the following permissions in your web browser:
- Camera Access: Required for capturing images and performing real-time object detection.
- Microphone Access: Needed if the application includes voice input features.
- Speech Synthesis: Ensure your browser allows speech synthesis for voice narration of detected objects.
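Since the camera permission can be denied, it helps to turn the resulting error into a message that can be read aloud. The sketch below shows one way to request the camera and handle a refusal; `startCamera` and `explainCameraError` are hypothetical names, the messages are illustrative, and the rear-camera preference (`facingMode: "environment"`) is an assumption rather than something this project specifies.

```javascript
// Sketch only: map getUserMedia error names to user-facing messages.
function explainCameraError(err) {
  switch (err && err.name) {
    case "NotAllowedError":
      return "Camera permission was denied. Allow camera access in the site settings and reload.";
    case "NotFoundError":
      return "No camera was found on this device.";
    case "NotReadableError":
      return "The camera is already in use by another application.";
    default:
      return "The camera could not be started.";
  }
}

// Sketch only: request the camera and attach the stream to a <video> element.
// `mediaDevices` defaults to the browser's navigator.mediaDevices.
async function startCamera(videoElement, mediaDevices = navigator.mediaDevices) {
  try {
    const stream = await mediaDevices.getUserMedia({
      video: { facingMode: "environment" }, // assumption: prefer the rear camera
      audio: false,
    });
    videoElement.srcObject = stream;
    return { ok: true, message: "Camera started." };
  } catch (err) {
    return { ok: false, message: explainCameraError(err) };
  }
}
```

The returned `message` could be passed to the speech synthesis API so that users who cannot see an on-screen error still learn why detection did not start.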
Here are some screenshots of the application:
Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests.