Guide Mobile | Devpost

Inspiration

In today's fast-paced world, people on the road rarely have time to observe their surroundings and help those in need. However, technology has advanced to the point where blind people need not depend on another person to carry out their day-to-day activities. Guide Mobile is similar to a guide dog, but it also lets you experience the scene in front of you as if someone were describing it to you.

What it does

Guide Mobile is a web app that takes an image as input and produces speech as output. The spoken sentence is generated by an image captioning algorithm.
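The image-to-speech flow described above can be sketched in a few lines. This is only an illustration under stated assumptions: `describe_image`, `fake_captioner`, and `fake_speaker` are hypothetical names, and the stand-in functions replace the trained captioning model and the text-to-speech engine, neither of which is specified on this page.

```python
from typing import Callable


def describe_image(image_bytes: bytes,
                   caption_fn: Callable[[bytes], str],
                   speak_fn: Callable[[str], bytes]) -> dict:
    """Run image -> caption -> speech and return both outputs."""
    if not image_bytes:
        raise ValueError("empty image upload")
    caption = caption_fn(image_bytes).strip()
    if not caption:
        # Fall back to a fixed sentence so the user always hears something.
        caption = "No description available."
    audio = speak_fn(caption)
    return {"caption": caption, "audio": audio}


# Stand-ins (assumptions) so the sketch runs without a model or a TTS engine.
def fake_captioner(image_bytes: bytes) -> str:
    return "a dog runs across the grass"


def fake_speaker(text: str) -> bytes:
    # A real engine would return audio data; here we just encode the text.
    return text.encode("utf-8")


result = describe_image(b"\x89PNG...", fake_captioner, fake_speaker)
print(result["caption"])
```

Passing the captioner and the speech engine in as callables keeps the web layer independent of the model, so either piece could be swapped without touching the request handling.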

How we built it

The image captioning model was built with the TensorFlow library and trained on the Flickr8k image captioning dataset. The overall accuracy of the model was around 80%. Both the images and the corresponding text data required extensive preprocessing, and the model uses both convolutional and sequential networks. The model is deployed in a web app built with Django's MVT (Model-View-Template) architecture.
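As a rough illustration of the text side of that preprocessing, the sketch below cleans captions, wraps them in start and end markers, builds a vocabulary, and pads each caption to a fixed length. The exact cleaning steps, the `startseq`/`endseq` markers (a common convention in captioning tutorials), and all function names are assumptions; the page does not say which steps the team actually used.

```python
import string

# Marker and padding tokens (assumed convention, not confirmed by the authors).
START, END, PAD = "startseq", "endseq", "<pad>"


def clean_caption(text: str) -> str:
    """Lowercase, drop punctuation and non-alphabetic tokens, add markers."""
    table = str.maketrans("", "", string.punctuation)
    words = text.lower().translate(table).split()
    words = [w for w in words if w.isalpha()]
    return " ".join([START, *words, END])


def build_vocab(captions):
    """Map each word to an integer id; id 0 is reserved for padding."""
    vocab = {PAD: 0}
    for caption in captions:
        for word in caption.split():
            vocab.setdefault(word, len(vocab))
    return vocab


def encode(caption, vocab, max_len):
    """Convert a cleaned caption to a fixed-length, post-padded list of ids."""
    ids = [vocab[w] for w in caption.split() if w in vocab][:max_len]
    return ids + [0] * (max_len - len(ids))


# Two made-up captions in the style of Flickr8k annotations.
raw = ["A dog runs across the grass .", "Two children play in the snow!"]
cleaned = [clean_caption(c) for c in raw]
vocab = build_vocab(cleaned)
max_len = max(len(c.split()) for c in cleaned)
encoded = [encode(c, vocab, max_len) for c in cleaned]
print(cleaned[0])
print(encoded[0])
```

The resulting integer sequences are the kind of input a sequence network would consume alongside image features from the convolutional network.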

Challenges we ran into

The dataset contains a very limited number of images, so the output is not accurate for all real-world scenarios. Backend development was new to us, which made it challenging to build in the limited time.

Accomplishments that we're proud of

We were able to deploy the web app and test it successfully on images. Most test cases produced useful results.

What we learned

We learned to maintain ML models and deploy them. We also learned to version our progress and develop a robust backend for the website.

What's next for Guide Mobile

Adding more features for blind users, such as regional language support, networking, etc. We also plan to work on real-time scene description so that users don't need to take a picture to understand their surroundings.

Built With

django
python
tensorflow

Submitted to

TOHacks 2022

Created by

I worked on the ML model development.

Hemanth Harikrishnan
Machine learning enthusiast with knowledge and project experience, working on vivid problem statements, and delivering quality results
I worked on the back-end. I used Django MVT for the back-end, integrating the machine learning and OpenCV model into it.

Tejas Ambhore
I worked on the frontend - setting up the web app

Hussain Omer

Updates

Hemanth Harikrishnan started this project — May 29, 2022 08:19 AM EDT
