Inspiration
Karaoke is fun—but it becomes magic when it blends with immersive technology. We wanted to make karaoke more than just reading lyrics on a screen. We imagined singing inside an AR environment where lyrics float in front of you, allowing people to sing, entertain, and practice songs whenever, wherever. Our goal was simple: bring karaoke into the world of spatial computing and build the foundation for an XR-powered social music experience.
What it does
HoloSing is a karaoke experience built with Unity for the XREAL Beam to be used with the XREAL glasses that lets users:
Search for songs directly from YouTube
Extract clean audio automatically
Display synchronized lyrics pulled from captions (SRT, VTT, JSON)
It’s a full end-to-end karaoke pipeline—from YouTube search → backend processing → Unity playback → real-time lyric sync—all running in one seamless flow.
How we built it
Unity
Handles the karaoke experience—search UI, playback controls, subtitle parsing, and lyric display
Uses TextMesh Pro for crisp typography
Integrates AR Foundation + XREAL XR support for future AR overlays
Backend (Node.js / Express)
Uses ytdl-core to extract audio
Fetches subtitles via youtube-captions-scraper
Provides a REST API that Unity calls to retrieve audio + lyrics
Challenges we ran into
Syncing lyrics across formats — SRT, VTT, and YouTube captions each behave differently
Latency between backend and Unity — Extracting audio + captions takes time; we optimized pipelines by caching the files for demonstration.
API limitations — We first tried to find karaoke specific APIs to be used, but they all required permission to be given beforehand.
AR integration — Coordinating Unity's AR stack with the XREAL SDK took a lot of iteration
What we learned
- Best practices for clean backend → Unity pipelines
- AR Foundation + XREAL plugin setup
- Managing Unity’s Android build ecosystem
What’s next for Vibers
Karaoke is inherently a social event, and our next steps are to build out multiplayer integration and audio inputs. We want to allow users to connect with each other remotely, sing together with each other, and even have a karaoke scoring system for a little bit of competitive fun. Imagine singing with floating holographic lyrics anywhere—your room becomes the stage.
Built With
- 3d
- android
- ar
- c#
- cache
- javascript
- klingai
- nanobanana
- node.js
- unity
- xreal
- xrealsdk
- xrit
- youtubeapi

Log in or sign up for Devpost to join the conversation.