Inspiration
This project combines our love of music with our interest in computer vision, so we took the best of both worlds and built a project around Spotify.
What it does
Use hand gestures in front of your camera to control your Spotify playback.
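Our actual pipeline isn't reproduced here, but the core idea looks roughly like the sketch below. It assumes the spotipy client library for the Spotify calls and OpenCV for the webcam; classify_gesture() is a hypothetical placeholder for the gesture model, and the gesture names and mappings are illustrative, not the project's real set.

```python
# Minimal sketch: map recognized hand gestures to Spotify playback calls.
# Assumes the spotipy library; classify_gesture() is a hypothetical
# stand-in for the project's actual gesture-recognition model.
import cv2
import spotipy
from spotipy.oauth2 import SpotifyOAuth

sp = spotipy.Spotify(auth_manager=SpotifyOAuth(
    scope="user-modify-playback-state"))  # credentials read from env vars

def classify_gesture(frame):
    """Placeholder for the gesture model; returns a label string or None."""
    return None

ACTIONS = {
    "palm": sp.pause_playback,       # open palm  -> pause
    "fist": sp.start_playback,       # closed fist -> play
    "swipe_right": sp.next_track,    # swipe right -> next track
    "swipe_left": sp.previous_track, # swipe left  -> previous track
}

cap = cv2.VideoCapture(0)  # default webcam
while cap.isOpened():
    ok, frame = cap.read()
    if not ok:
        break
    gesture = classify_gesture(frame)
    if gesture in ACTIONS:
        ACTIONS[gesture]()  # trigger the matching Spotify action
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break
cap.release()
```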
Challenges we ran into
The visualizer was hard to build, merging all of our files was difficult, and finding datasets to ensure our model was accurate took a long time. We deliberated over which model to use, comparing and contrasting each one. We attempted to train a model from scratch, but it took excessively long, so we decided to use an existing model and modify the inputs. The challenge with this existing model was that it supported only a limited number of gestures, so we had to fine-tune it on additional data we gathered until it recognized every gesture we wanted the app to support.
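As a rough illustration of that fine-tuning step, here is a sketch that attaches a new classification head to a frozen pretrained backbone and trains only the head on newly gathered gesture data. A generic torchvision ResNet-18 stands in for the actual model (linked at the bottom of this page), and NUM_GESTURES is an assumed count.

```python
# Sketch: extend a pretrained classifier to recognize more gesture classes.
# ResNet-18 is a stand-in backbone; NUM_GESTURES is an assumed value.
import torch
import torch.nn as nn
from torchvision import models

NUM_GESTURES = 8  # total gesture count after adding the new ones (assumed)

model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
for p in model.parameters():
    p.requires_grad = False  # freeze the pretrained feature extractor
model.fc = nn.Linear(model.fc.in_features, NUM_GESTURES)  # fresh head

optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

def train_step(images, labels):
    """One fine-tuning step on a batch of newly gathered gesture data."""
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```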
Accomplishments that we're proud of
Getting the visualizer done, designing the UI, and getting the camera working and calling Spotify functions. We seamlessly integrated our AI model with the Spotify API to get the desired result and create the best experience for our future users.
What we learned
The Spotify API, Python's Tkinter library, OpenCV, machine learning fundamentals, PyTorch, the Windows SDK, NumPy, and PyAudio. We also learned to interact with hardware by accessing the computer's sound card and webcam.
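To show the kind of hardware access involved, here is a minimal sketch of reading audio from the sound card with PyAudio and computing a spectrum with NumPy, the sort of pipeline an audio visualizer sits on. The parameter values are common defaults, not the project's actual settings.

```python
# Sketch: capture audio with PyAudio and compute an FFT magnitude spectrum
# with NumPy. A visualizer would draw bars from `spectrum` instead of printing.
import numpy as np
import pyaudio

RATE, CHUNK = 44100, 1024  # sample rate (Hz) and frames per buffer (assumed)

pa = pyaudio.PyAudio()
stream = pa.open(format=pyaudio.paInt16, channels=1, rate=RATE,
                 input=True, frames_per_buffer=CHUNK)

try:
    while True:
        data = stream.read(CHUNK, exception_on_overflow=False)
        samples = np.frombuffer(data, dtype=np.int16)
        spectrum = np.abs(np.fft.rfft(samples))       # magnitude per bin
        freqs = np.fft.rfftfreq(CHUNK, d=1.0 / RATE)  # bin frequencies (Hz)
        peak = freqs[np.argmax(spectrum)]             # loudest frequency
        print(f"peak: {peak:7.1f} Hz")
finally:
    stream.stop_stream()
    stream.close()
    pa.terminate()
```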
What's next for MAISHacks
Collecting our own data, doing the training ourselves from start to finish, and delivering the app as a website or an installer.
Demo: https://drive.google.com/file/d/1Tk_j2XhFdSQyq7u8ZkqVTY8g6KN0smni/view
ML model: https://github.com/ai-forever/dynamic_gestures
Thank you all for collaborating on this project; we're all very excited to present it!