Inspiration

As someone whose second language is English, I often face challenges in communicating with people from different cultural and language backgrounds. I wanted a quick and seamless way to translate speech or text without the hassle of unlocking my phone, navigating to a website, or typing. A simple, on-the-go translator seemed like the perfect solution for situations like quick conversations with friends or colleagues.

What it does

The Omi Translator is a voice-activated translator that simplifies multilingual communication. You can set the target language and API key (currently using my personal key for easy setup). Then, say the word "translate," followed by the content you want to translate, and wait for a notification or chat response with the translation.

How we built it

I leveraged the Omi Dev Kit for its voice recognition capabilities and connected it to a language translation API. The setup included integrating a trigger phrase and handling the corresponding API call.
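As a rough illustration of the trigger-phrase step, here is a minimal sketch in Python. It assumes the transcript arrives as a plain string; the function name `extract_after_trigger` and its parameters are hypothetical and are not part of the Omi Dev Kit or of the actual project code.

```python
import re
from typing import Optional


def extract_after_trigger(transcript: str, trigger: str = "translate") -> Optional[str]:
    """Return the text spoken after the trigger word, or None if it is absent.

    Hypothetical helper: the real project may detect the trigger differently.
    """
    # Match the trigger as a whole word, ignoring case, and skip any
    # punctuation or whitespace that directly follows it.
    pattern = re.compile(r"\b" + re.escape(trigger) + r"\b[\s,.:;!?]*", re.IGNORECASE)
    match = pattern.search(transcript)
    if match is None:
        return None
    return transcript[match.end():].strip()
```

Matching on a word boundary means a word such as "translated" does not fire the trigger by accident.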

After you say the word "translate", the content that follows is recorded. I use two criteria to decide when to end the recording and start the translation:

  1. End of a sentence of at least 3 words: if a punctuation mark appears with at least 3 words before it, translation starts.
  2. Long pause: if no new text arrives for a long time, translation starts.

These two criteria have worked smoothly for me so far.
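The two stopping criteria can be sketched roughly as follows. This is an illustration under assumptions, not the project's actual code: the class name `RecordingBuffer`, the 5-second pause threshold, and the set of sentence-ending punctuation marks are all made up for the example (the write-up only says "a long pause").

```python
import re
from typing import Optional

MIN_WORDS = 3          # from the write-up: at least 3 words before the punctuation
PAUSE_SECONDS = 5.0    # assumed value; the write-up does not give a number
SENTENCE_END = re.compile(r"[.!?。！？]")  # assumed set of sentence-ending marks


class RecordingBuffer:
    """Collects transcript fragments and reports when to start translating."""

    def __init__(self, min_words: int = MIN_WORDS, pause_seconds: float = PAUSE_SECONDS):
        self.min_words = min_words
        self.pause_seconds = pause_seconds
        self.text = ""
        self.last_update: Optional[float] = None

    def add(self, fragment: str, now: float) -> None:
        """Append a non-empty fragment and remember when it arrived."""
        fragment = fragment.strip()
        if fragment:
            self.text = f"{self.text} {fragment}".strip()
            self.last_update = now

    def ready(self, now: float) -> bool:
        """Return True once either stopping criterion is met."""
        if not self.text or self.last_update is None:
            return False
        # Criterion 1: a punctuation mark with at least min_words words before it.
        for match in SENTENCE_END.finditer(self.text):
            if len(self.text[:match.start()].split()) >= self.min_words:
                return True
        # Criterion 2: no new text for at least pause_seconds.
        return now - self.last_update >= self.pause_seconds
```

Passing the timestamp in as `now` instead of reading the clock inside the class keeps the logic easy to test; a caller would typically pass `time.monotonic()`.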

Challenges we ran into

One challenge was triggering and stopping the translation process efficiently. Keeping the translation process fast, accurate, and light on resources required careful planning.

Another challenge was optimizing the integration with the Omi Dev Kit, especially since some features like speaker support are not yet available.

Accomplishments that we're proud of

I successfully created a functional, voice-activated translation tool that eliminates the need for additional steps like unlocking a phone or navigating through apps. This tool significantly improves the user experience for quick and efficient translations.

What we learned

I gained a deeper understanding of integrating APIs with the Omi Dev Kit and explored creative ways to handle real-time input and output. This project also highlighted the importance of designing for user convenience in fast-paced communication scenarios.

What's next for Omi Translator

In the future, if the Omi Dev Kit supports voice broadcasting, I aim to enable real-time translations, mimicking expensive services that typically cost over $90. Additionally, resolving this GitHub issue could allow us to trigger translations with a single click on the device. This enhancement would make communication even more seamless and accessible, improving lives globally.
