kdr/video-screen-copilot

Video Screen Copilot

A tool to record your screen, index it with CloudGlue AI, and ask questions about what you've seen on your screen using the Model Context Protocol (MCP).

Demo videos: ▶️ YouTube Playlist

Prerequisites

  • CloudGlue API key
  • Node.js (for the MCP server)
  • Python 3.x (for the screen recording server)

Project Components

This project consists of two main components:

1. Screen Recording Server (cg-session-upload)

A Flask-based server that records your screen and sends the recordings to CloudGlue for indexing.

Setup

  1. Navigate to the cg-session-upload directory:

    cd cg-session-upload
  2. Install the required Python dependencies:

    pip install -r requirements.txt
  3. Create a .env file with your CloudGlue API key and collection ID:

    CLOUDGLUE_API_KEY=your_api_key_here
    TARGET_COLLECTION_ID=your_collection_id_here
    

Usage

Start the server with your preferred options:

    # Using environment variables from the .env file
    python continuous_server.py --percentage 70

    # Explicitly providing the API key and collection ID
    python continuous_server.py --percentage 70 --api-key your_api_key_here --collection-id your_collection_id_here

    # Running on a custom port
    python continuous_server.py --port 5002 --percentage 70
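The commands above suggest that command-line flags take precedence over values from the .env file. The following is a minimal sketch of that resolution logic, not the actual code in continuous_server.py. The function names, the default values, and the 1 to 100 range check are assumptions made for illustration.

```python
import argparse
import os


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags shown in this README."""
    parser = argparse.ArgumentParser(description="Screen recording server (sketch)")
    # Default values below are assumptions, not documented behavior.
    parser.add_argument("--port", type=int, default=5002)
    parser.add_argument("--percentage", type=int, default=70)
    parser.add_argument("--api-key", default=None)
    parser.add_argument("--collection-id", default=None)
    return parser


def resolve_settings(argv, env=os.environ):
    """Merge CLI flags with environment variables; flags win when both are set."""
    args = build_parser().parse_args(argv)
    api_key = args.api_key or env.get("CLOUDGLUE_API_KEY")
    collection_id = args.collection_id or env.get("TARGET_COLLECTION_ID")
    if not api_key or not collection_id:
        raise ValueError("CloudGlue API key and collection ID are both required")
    # Assumed valid range for the percentage option.
    if not 1 <= args.percentage <= 100:
        raise ValueError("percentage must be between 1 and 100")
    return {
        "port": args.port,
        "percentage": args.percentage,
        "api_key": api_key,
        "collection_id": collection_id,
    }
```

Calling `resolve_settings(["--percentage", "70"])` with both variables exported would return a dictionary holding the port, percentage, API key, and collection ID.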

API endpoints:

  • /start - Start recording
  • /stop - Stop recording
  • /set_percentage/<int:percentage> - Set screen recording percentage
  • /recent_recordings - Get list of recent recordings
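The endpoints above can also be called from a script instead of a browser. This is a minimal sketch using only the Python standard library. It assumes the server listens on localhost port 5002 (the port from the usage example) and that each endpoint accepts a plain GET request; the response format is not documented here, so the raw body is returned as text. The helper names are hypothetical.

```python
import urllib.request

BASE_URL = "http://localhost:5002"  # assumption: host and port of the recording server

SIMPLE_ACTIONS = ("start", "stop", "recent_recordings")


def endpoint_url(action, percentage=None, base_url=BASE_URL):
    """Build the URL for one of the endpoints listed in this README."""
    if action == "set_percentage":
        if percentage is None:
            raise ValueError("set_percentage requires a percentage value")
        # The route uses an integer path segment: /set_percentage/<int:percentage>
        return f"{base_url}/set_percentage/{int(percentage)}"
    if action in SIMPLE_ACTIONS:
        return f"{base_url}/{action}"
    raise ValueError(f"unknown action: {action}")


def call(action, percentage=None, base_url=BASE_URL, timeout=10):
    """Send a GET request (assumed method) and return the response body as text."""
    url = endpoint_url(action, percentage, base_url)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

With the server running, `call("start")` would begin a recording and `call("set_percentage", 50)` would change the recording percentage.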

2. MCP Server (visual-screen-copilot-mcp-server)

A Node.js server implementing the Model Context Protocol to allow LLMs to access your screen recording data.

Setup

  1. Navigate to the visual-screen-copilot-mcp-server directory:

    cd visual-screen-copilot-mcp-server
  2. Install dependencies:

    npm install
  3. Build the project:

    npm run build

Integration with MCP

To use this project with an MCP-compatible AI assistant, add the following configuration to your MCP setup:

{
  "visual-screen-copilot-mcp-server": {
    "command": "node",
    "args": [
      "/path/to/visual-screen-copilot-mcp-server/build/index.js",
      "--api-key",
      "your_cloudglue_api_key_here",
      "--target-collection-id",
      "your_screen_recording_collection_id_here"
    ]
  }
}
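If you prefer to generate this entry rather than edit it by hand, the sketch below builds the same structure in Python and serializes it as JSON. The function names are hypothetical. Where the entry belongs depends on your MCP client; some clients, Claude Desktop for example, nest server entries under an `mcpServers` key, so check your client's documentation before pasting.

```python
import json

SERVER_NAME = "visual-screen-copilot-mcp-server"


def mcp_server_entry(index_js_path, api_key, collection_id):
    """Return the configuration entry shown in this README as a dictionary."""
    return {
        SERVER_NAME: {
            "command": "node",
            "args": [
                index_js_path,
                "--api-key",
                api_key,
                "--target-collection-id",
                collection_id,
            ],
        }
    }


def to_json(entry):
    """Serialize the entry with indentation so it can be pasted into a config file."""
    return json.dumps(entry, indent=2)
```

For example, `print(to_json(mcp_server_entry("/path/to/visual-screen-copilot-mcp-server/build/index.js", "your_cloudglue_api_key_here", "your_screen_recording_collection_id_here")))` prints an entry equivalent to the one above.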

Workflow

  1. Start the screen recording server with python continuous_server.py
  2. Navigate to /start to begin recording your screen
  3. Use your computer normally
  4. Navigate to /stop when you're done
  5. The recording will be processed and indexed by CloudGlue
  6. Use the MCP server to allow AI assistants to access and answer questions about your screen recordings
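Steps 2 through 4 can be automated for a fixed-length recording. The sketch below is one possible approach, not part of the project. It assumes the server is reachable at localhost port 5002 and that the endpoints respond to GET requests; `record_session` and `http_get` are hypothetical names. The `try`/`finally` ensures /stop is requested even if the wait is interrupted.

```python
import time
import urllib.request


def http_get(url, timeout=10):
    """Fetch a URL with a GET request (assumed method) and return the body as text."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


def record_session(duration_seconds, base_url="http://localhost:5002",
                   fetch=http_get, sleep=time.sleep):
    """Start recording, wait, stop, then return the recent recordings response.

    `fetch` and `sleep` are injectable so the flow can be exercised without a server.
    """
    fetch(f"{base_url}/start")
    try:
        sleep(duration_seconds)
    finally:
        # Always ask the server to stop, even if the wait was interrupted.
        fetch(f"{base_url}/stop")
    return fetch(f"{base_url}/recent_recordings")
```

Calling `record_session(300)` would record for about five minutes and return whatever /recent_recordings responds with.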

License

YOLO

Screenshots

[Two screenshots]

Deep Research Over Screen Recording - Claude Conversation

[Five screenshots]

Hackathon

World's Biggest MCP Hackathon

  • Saturday, May 17, 2025

About

Code for an entry in the World's Biggest MCP Hackathon
