Context-Aware Object Reader

Welcome to the dual-app accessibility platform for Apple Vision Pro and iOS companion devices. This repo contains the implementation scaffold for the "Context-Aware Object Reader" defined in the HackHarvard 2025 requirements.

Repo Layout

test

VisionProApp/ – visionOS spatial experience (SwiftUI, RealityKit, PHASE) with modular services and view models
MobileApp/ – iOS companion for 24/7 streaming (SwiftUI, AVFoundation, WebRTC skeleton)
SharedModels/ – Swift package hosting shared network and AI models plus service protocols
web-helper/ – Next.js web application for remote helpers to view Vision Pro stream and respond to assistance requests
Docs/ – living architecture docs and design notes
requirements/ – canonical product requirements and discovery artifacts

Getting Started

Vision Pro & Mobile Apps

Install Xcode 16+ with visionOS 26 SDK.
Open the workspace you create combining VisionProApp, MobileApp, and SharedModels.
Provide environment variables for Gemini API credentials before wiring real network calls.

Web Helper Application

Install Node.js 18+
Navigate to web-helper/ directory
Run npm install to install dependencies
Follow web-helper/QUICKSTART.md for detailed setup

See WEB-HELPER-SUMMARY.md for complete web helper documentation.

Development Status

The codebase currently provides:

Dependency-injected services respecting SOLID + DRY principles.
SwiftUI views and view models for both apps, ready to hook into real devices.
Stubs for WebRTC, Gemini, PHASE, and helper network integrations flagged via AppError.notImplemented.
Shared models and protocol definitions to keep both apps in sync.
NEW: Complete Next.js web application for remote helper video streaming and assistance response.

Next Steps

Integrate Web Helper: Connect Vision Pro WebRTC streaming with the web-helper app (see web-helper/INTEGRATION.md)
Implement actual WebRTC signaling and transmission using the SharedModels contracts.
Deploy Socket.IO signaling server and configure TURN server for production.
Replace placeholder Gemini service calls with Firebase AI Logic / Google Gemini SDK.
Wire ARKit spatial anchors, gaze tracking, and PHASE audio playback on VisionPro.
Connect AVFoundation capture output to WebRTC publisher on the mobile app.
Backfill unit/UI tests as modules gain real behavior.

The Zen of Python might be guiding the vibe, but the code stays Swift-clean and puppy-approved. 🐶

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
Docs		Docs
HarvardHack25		HarvardHack25
requirements		requirements
web-helper		web-helper
.DS_Store		.DS_Store
README.md		README.md
WEB-HELPER-SUMMARY.md		WEB-HELPER-SUMMARY.md
harvard hacks live server vision pro.pdf		harvard hacks live server vision pro.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Context-Aware Object Reader

Repo Layout

Getting Started

Vision Pro & Mobile Apps

Web Helper Application

Development Status

Next Steps

About

Uh oh!

Releases

Packages

Contributors 4

Uh oh!

Languages

michaelpeters-dev/Hack-Harvard

Folders and files

Latest commit

History

Repository files navigation

Context-Aware Object Reader

Repo Layout

Getting Started

Vision Pro & Mobile Apps

Web Helper Application

Development Status

Next Steps

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Uh oh!

Languages

Packages