π Inspiration
Millions of professionals, students, and public speakers struggle to improve their presentation skills due to lack of real-time, objective feedback.
- π€ Speech Quality: Am I speaking clearly? Is my pacing correct?
- π₯ Body Language: Am I making eye contact? What does my posture convey?
- π§ Overall Impact: How can I sound more confident and persuasive?
Traditional feedback is subjective, delayed, and limited, leading to inefficient practice and slow improvement.
π‘ What It Does
Vocal Check is an AI-powered platform that analyzes both speech and body language from videos to deliver instant, multi-dimensional feedback.
π€ Speech Analysis
- Detects filler words (um, uh, like, etc.)
- Measures pace and pause consistency
- Evaluates tone modulation and vocal variety
- Checks pronunciation clarity
π₯ Body Language Analysis
- Tracks posture and alignment
- Measures eye contact consistency
- Analyzes gestures and movement
- Detects facial expressions
π§ AI-Powered Insights
- Personalized suggestions (Beginner β Advanced)
- Context-aware recommendations
- Goal-based improvement tips
- Text-to-Speech (TTS) summaries
π Detailed Reports
- JSON + human-readable reports
- Performance scoring system
- Confidence metrics and tracking
- Downloadable transcripts
π₯ Key Features
- β Multi-Modal Analysis (Speech + Video + AI)
- β‘ Instant Feedback from one upload
- π― Skill-Level Based Training
- π§ͺ Practice Mode with scoring
- π PPT + Video Integration
- π€ User Profiles & Progress Tracking
- π TTS Feedback Summaries
- π API-Ready Architecture
π οΈ How We Built It
Frontend
- HTML, CSS, JavaScript (Responsive UI)
Backend
- Python Flask (Modular architecture)
AI/ML Stack
- Google Gemini API β Intelligent feedback
- OpenAI Whisper β Speech-to-text
- Computer Vision β Body language analysis
Video Processing Pipeline
def process_video(video):
audio = extract_audio(video)
transcript = whisper.transcribe(audio)
speech_metrics = analyze_speech(transcript)
body_metrics = analyze_video_frames(video)
insights = gemini.generate_feedback(speech_metrics, body_metrics)
return insights
---
## π₯ NEXT STEP (IMPORTANT)
Replace these with your actual assets:
- `system-architecture.png`
- `pipeline.png`
---
## π― Want me to go one level higher?
I can generate for you:
- π¨ **Actual system architecture diagram (ready image)**
- π **Flowchart (clean + hackathon style)**
- π **UI mockups (dashboard, report screen)**
Just say: **"generate images"** and Iβll create them π
Built With
- advanced-cv-video-processing:-moviepy-with-ffmpeg-storage:-json-based-with-file-management-scalability:-rate-limiting
- async
- frontend:-javascript
- html/css-with-responsive-design-backend:-python-flask-with-modular-architecture-ai/ml:-google-gemini-api
- openai-whisper
- processing
Log in or sign up for Devpost to join the conversation.