Plugins Ecosystem

Welcome to the FiftyOne Plugins ecosystem! 🚀

Here you’ll discover cutting-edge research, state-of-the-art models, and powerful add-ons that unlock new FiftyOne workflows.

FiftyOne plugins allow you to extend and customize the functionality of the core tool to suit your specific needs. From advanced computer vision models to integrations with other popular AI tools, this curated collection of plugins will transform FiftyOne into your bespoke visual AI development workbench.

Showcase your own plugin

Huggingface Hub ⭐ 2

by voxel51
Push FiftyOne datasets to the Hugging Face Hub, and load datasets from the Hub into FiftyOne!

Voxel51

Transformers ⭐ 2

by voxel51
Run inference on your datasets using Hugging Face Transformers models!

Voxel51

Pdf-loader ⭐ 4

by brimoor
Load your PDF documents into FiftyOne as per-page images

Community

Synthetic Gui Samples Plugins ⭐ 7

by harpreetsahota
A FiftyOne plugin for generating synthetic samples for datasets in COCO4GUI format

Community

Image Captioning ⭐ 10

by jacobmarks
Caption all your images with state-of-the-art vision-language models!

Community

Sam3 Images ⭐ 3

by harpreetsahota
Integration of Meta's SAM3 (Segment Anything Model 3) into FiftyOne, with full support of text prompts, keypoint prompts, bounding box prompts, auto segmentation, and image embeddings.

Community,Model

Semantic Video Search ⭐ 23

by danielgural
Search through your video datasets using FiftyOne Brain and Twelve Labs!

Community

Qwen3vl Video ⭐ 12

by harpreetsahota
A FiftyOne zoo model integration for Qwen3-VL that enables comprehensive video understanding with multiple label types in a single forward pass and computes video embeddings.

Community,Model

Annotation ⭐ 134

by voxel51
Utilities for integrating FiftyOne with annotation tools

Voxel51

Brain ⭐ 134

by voxel51
Utilities for working with the FiftyOne Brain

Voxel51

Dashboard ⭐ 134

by voxel51
Create your own custom dashboards from within the App

Voxel51

Io ⭐ 134

by voxel51
A collection of import/export utilities

Voxel51

Indexes ⭐ 134

by voxel51
Utilities for working with FiftyOne database indexes

Voxel51

Plugins ⭐ 134

by voxel51
Utilities for managing and building FiftyOne plugins

Voxel51

Delegated ⭐ 134

by voxel51
Utilities for managing your delegated operations

Voxel51

Runs ⭐ 134

by voxel51
Utilities for managing your custom runs

Voxel51

Utils ⭐ 134

by voxel51
Call your favorite SDK utilities from the App

Voxel51

Zoo ⭐ 134

by voxel51
Download datasets and run inference with models from the FiftyOne Zoo, all without leaving the App

Voxel51

Molmo2 ⭐ 0

by harpreetsahota
Molmo2 is a family of open vision-language models developed by the Allen Institute for AI (Ai2) that support image, video, and multi-image understanding and grounding.

Community,Model

Apple Sharp ⭐ 4

by harpreetsahota
SHARP is Apple's state-of-the-art model for predicting 3D Gaussian Splats from a single RGB image. This integration brings SHARP to FiftyOne, enabling batch inference on image datasets with 3D visualization.

Community,Model

Gemini-vision-plugin ⭐ 3

by AdonaiVera
This plugin integrates Google Gemini's multimodal vision models (e.g., gemini-2.5-flash) into your FiftyOne workflows. Prompt with text and one or more images; receive a text response grounded in visual inputs.

Community,Model

Mineru 2 5 ⭐ 6

by harpreetsahota
MinerU2.5 is a 1.2B-parameter vision-language model for efficient high-resolution document parsing. This model can support grounding OCR as well as free text OCR.

Community,Model

Nvlabs Cradiov3 ⭐ 25

by harpreetsahota
Implementing NVLabs C-RADIOv3 Embeddings Model as Remotely Sourced Zoo Model for FiftyOne

Community,Model

Caption Viewer ⭐ 2

by harpreetsahota
A plugin that intelligently displays and formats VLM (Vision Language Model) outputs and text fields. Perfect for viewing OCR results, receipt analysis, document processing, and any text-heavy computer vision workflows.

Community

Fiftyone-vlm-efficient ⭐ 4

by AdonaiVera
Improve VLM training data quality with state-of-the-art dataset pruning and quality techniques

Community

Nemotron Nano Vl ⭐ 3

by harpreetsahota
Implementing Llama-3.1-Nemotron-Nano-VL-8B-V1 as a Remote Zoo Model for FiftyOne

Community,Model

Model-comparison ⭐ 14

by allenleetc
Compare two object detection models!

Community

Fiftyone Wandb Plugin ⭐ 2

by harpreetsahota
This plugin connects FiftyOne datasets with Weights & Biases to enable reproducible, data-centric ML workflows.

Community

Moondream3 ⭐ 11

by harpreetsahota
Moondream 3 (Preview) is a vision language model with a mixture-of-experts architecture (9B total parameters, 2B active). This model makes no compromises, delivering state-of-the-art visual reasoning while still retaining our efficient and deployment-friendly ethos.

Community,Model

Siglip2 ⭐ 2

by harpreetsahota
A FiftyOne Remotely Sourced Zoo Model integration for Google's SigLIP2 model enabling natural language search across images in your FiftyOne Dataset

Community,Model

Vggt ⭐ 20

by harpreetsahota
Implementing Meta AI's VGGT as a FiftyOne Remote Zoo Model

Community,Model

Zero Shot Prediction ⭐ 36

by jacobmarks
Run zero-shot (open vocabulary) prediction on your data!

Community

Fast Vlm ⭐ 9

by harpreetsahota
Integrating FastVLM as a Remote Source Zoo Model for FiftyOne

Community,Model

Anonymize ⭐ 6

by swheaton
Anonymize/blur images based on a FiftyOne Detections field.

Community

Medsiglip ⭐ 2

by harpreetsahota
Implementing MedSigLIP as a Remote Zoo Model for FiftyOne

Community,Model

Text Evaluation Metrics ⭐ 1

by harpreetsahota
This plugin provides five text evaluation metrics for comparing predictions against ground truth: ANLS, Exact Match, Normalized Similarity, Character Error Rate, and Word Error Rate.

Community

Nanonets Ocr2 ⭐ 1

by harpreetsahota
Nanonets-OCR2 transforms documents into structured markdown with intelligent content recognition and semantic tagging, making it ideal for downstream processing by Large Language Models (LLMs).

Community,Model

Olmocr-2 ⭐ 1

by harpreetsahota
olmOCR-2 is a state-of-the-art OCR model built on Qwen2.5-VL architecture that extracts text from document images with high accuracy.

Community,Model

Deepseek Ocr ⭐ 3

by harpreetsahota
DeepSeek-OCR is a vision-language model designed for optical character recognition with a focus on "contextual optical compression."

Community,Model

Semantic Document Search ⭐ 9

by jacobmarks
Perform semantic search on text in your documents!

Community

Voxelgpt ⭐ 250

by voxel51
An AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions

Voxel51

Jina Embeddings V4 ⭐ 1

by harpreetsahota
Jina Embeddings v4 is a state-of-the-art Vision Language Model that generates embeddings for both images and text in a shared vector space.

Community,Model

Image Issues ⭐ 34

by jacobmarks
Find common image quality issues in your datasets

Community

Vlmrun-voxel51-plugin ⭐ 9

by vlm-run
Extract structured data from visual and audio sources including documents, images, and videos

Community

Kosmos2 5 ⭐ 3

by harpreetsahota
Kosmos-2.5 excels at two core tasks: generating spatially-aware text blocks (OCR) and producing structured markdown output from images.

Community,Model

Medgemma ⭐ 10

by harpreetsahota
Implementing MedGemma as a Remote Zoo Model for FiftyOne

Community,Model

Nomic-embed-multimodal ⭐ 1

by harpreetsahota
Nomic Embed Multimodal is a family of vision-language models built on Qwen2.5-VL that generates high-dimensional embeddings for both images and text in a shared vector space.

Community,Model

Bimodernvbert ⭐ 1

by harpreetsahota
BiModernVBert is a vision-language model built on the ModernVBert architecture that generates embeddings for both images and text in a shared 768-dimensional vector space.

Community,Model

Colmodernvbert ⭐ 1

by harpreetsahota
ColModernVBert is a multi-vector vision-language model built on the ModernVBert architecture that generates ColBERT-style embeddings for both images and text.

Community,Model

Colqwen2 5 V0 2 ⭐ 1

by harpreetsahota
ColQwen2.5 is a Vision Language Model based on Qwen2.5-VL-3B-Instruct that generates ColBERT-style multi-vector representations for efficient document retrieval. This version takes dynamic image resolutions (up to 768 image patches) and doesn't resize them, preserving aspect ratios for better accuracy.

Community,Model

Bddoia-fiftyone ⭐ 2

by AdonaiVera
Load and explore the BDDOIA Safe/Unsafe Action dataset via the FiftyOne Zoo

Community,Dataset

Active Learning ⭐ 17

by jacobmarks
Accelerate your data labeling with Active Learning!

Community

Ui Tars ⭐ 7

by harpreetsahota
Implementing UI-TARS-1.5 as a Remote Zoo Model for FiftyOne

Community,Model

Fiftyone-agents ⭐ 1

by AdonaiVera
A comprehensive FiftyOne plugin for testing and evaluating multiple Vision-Language Models (VLMs) with dynamic prompts and built-in evaluation capabilities

Community

Gui Actor ⭐ 2

by harpreetsahota
Implementing Microsoft's GUI Actor as a Remote Zoo Model for FiftyOne

Community,Model

Isaac0 1 ⭐ 4

by harpreetsahota
Isaac-0.1 is the first in Perceptron AI's family of models built to be the intelligence layer for the physical world. This integration supports various computer vision tasks including object detection, classification, OCR, visual question answering, and more.

Community,Model

Colpali V1 3 ⭐ 1

by harpreetsahota
ColPali is a Vision Language Model based on PaliGemma-3B that generates ColBERT-style multi-vector representations for efficient document retrieval.

Community,Model

Paligemma2 ⭐ 5

by harpreetsahota
Implementing PaliGemma-2-Mix as a Remote Zoo Model for FiftyOne

Community,Model

Minicpm-v ⭐ 4

by harpreetsahota
Integrating MiniCPM-V 4.5 as a Remote Source Zoo Model in FiftyOne

Community,Model

Multimodal Rag ⭐ 21

by jacobmarks
Create and test multimodal RAG pipelines with LlamaIndex, Milvus, and FiftyOne!

Community

Audio Retrieval ⭐ 11

by jacobmarks
Find the images in your dataset most similar to an audio file!

Community

Nemo Retriever Parse Plugin ⭐ 4

by harpreetsahota
Implementing NVIDIA NeMo Retriever Parse as a FiftyOne Plugin

Community

Clustering ⭐ 11

by jacobmarks
Cluster your images using embeddings with FiftyOne and scikit-learn!

Community

Clustering Algorithms ⭐ 4

by danielgural
Find the clusters in your data using some of the best algorithms available!

Community

Vitpose ⭐ 3

by harpreetsahota
Run ViTPose Models from Hugging Face on your FiftyOne Dataset

Community

Moondream2 ⭐ 3

by harpreetsahota
Moondream2 implementation as a remotely sourced zoo model for FiftyOne

Community,Model

Florence2 ⭐ 4

by harpreetsahota
Implementing Florence2 as a Remote Zoo Model for FiftyOne

Community,Model

Multi Annotator Toolkit ⭐ 5

by madave94
Tackle noisy annotation! Find and analyze annotation issues in datasets with multiple annotators per image.

Community

Kimi Vl A3b ⭐ 6

by harpreetsahota
FiftyOne Remotely Sourced Zoo Model integration for Moonshot AI's Kimi-VL-A3B models enabling object detection, keypoint localization, and image classification with strong GUI and document understanding capabilities.

Community,Model

Showui ⭐ 2

by harpreetsahota
Integrating ShowUI into FiftyOne as a Remote Source Zoo Model

Community,Model

Mimo Vl ⭐ 3

by harpreetsahota
Implementing MiMo-VL as a Remote Zoo Model for FiftyOne

Community,Model

Os Atlas ⭐ 5

by harpreetsahota
Integrating OS-Atlas Base into FiftyOne as a Remote Source Zoo Model

Community,Model

Vqa-plugin ⭐ 19

by jacobmarks
Ask (and answer) open-ended visual questions about your images!

Community

Fiftyone Lerobot Importer ⭐ 5

by harpreetsahota
Import your LeRobot format dataset into FiftyOne format

Community

Coco4gui Fiftyone ⭐ 3

by harpreetsahota
Implementing the COCO4GUI dataset type in FiftyOne with importers and exporters

Community

Segments-voxel51-plugin ⭐ 5

by segmentsai
Integrate FiftyOne with the Segments.ai annotation tool!

Community

Youtube Panel Plugin ⭐ 6

by jacobmarks
Play YouTube videos in the FiftyOne App!

Community

Mlflow ⭐ 5

by voxel51
Track model training experiments on your FiftyOne datasets with MLflow!

Voxel51

Edit Label Attributes ⭐ 3

by ehofesmann
Edit attributes of your labels directly in the FiftyOne App!

Community

Fiftyone-tile ⭐ 1

by mmoollllee
Tile your high resolution images to squares for training small object detection models

Community

Hiera Video Embeddings ⭐ 3

by harpreetsahota
Compute embeddings for video using Facebook Hiera Models

Community

Qwen2 5 Vl ⭐ 1

by harpreetsahota
Implementing Qwen2.5-VL as a Remote Zoo Model for FiftyOne

Community,Model

Pytesseract Ocr ⭐ 11

by jacobmarks
Run optical character recognition with PyTesseract!

Community

Reverse Image Search ⭐ 13

by jacobmarks
Find the images in your dataset most similar to an image from the filesystem or the internet!

Community

Audio Loader ⭐ 5

by danielgural
Import your audio datasets as spectrograms into FiftyOne!

Community

Visual Document Retrieval ⭐ 3

by harpreetsahota
A FiftyOne Remotely Sourced Zoo Model integration for LlamaIndex's VDR model enabling natural language search across document images, screenshots, and charts in your datasets.

Community,Model

Albumentations Augmentation ⭐ 13

by jacobmarks
Test out any Albumentations data augmentation transform with FiftyOne!

Community

Image Deduplication ⭐ 18

by jacobmarks
Find exact and approximate duplicates in your dataset!

Community

Emoji Search ⭐ 7

by jacobmarks
Semantically search emojis and copy to clipboard!

Community

Janus Vqa ⭐ 6

by harpreetsahota
Run the Janus Pro models from DeepSeek on your FiftyOne dataset

Community

Depth Pro Plugin ⭐ 2

by harpreetsahota
Perform zero-shot metric monocular depth estimation using the Apple Depth Pro model

Community

Optimal Confidence Threshold ⭐ 5

by danielgural
Find the optimal confidence threshold for your detection models automatically!

Community

Outlier Detection ⭐ 7

by danielgural
Find those troublesome outliers in your dataset automatically!

Community

Text To Image ⭐ 33

by jacobmarks
Add synthetic data from prompts with text-to-image models and FiftyOne!

Community

Plotly-map-panel ⭐ 0

by allenleetc
Plotly-based Map Panel with adjustable marker cosmetics!

Community

Concept Space Traversal ⭐ 5

by jacobmarks
Navigate concept space with CLIP, vector search, and FiftyOne!

Community

Concept Interpolation ⭐ 6

by jacobmarks
Find images that best interpolate between two text-based extremes!

Community

Gpt4 Vision ⭐ 9

by jacobmarks
Chat with your images using GPT-4 Vision!

Community

Fiftyone-timestamps ⭐ 1

by mmoollllee
Compute datetime-related fields (sunrise, dawn, evening, weekday, ...) from your samples' filenames or creation dates

Community

Keyword Search ⭐ 3

by jacobmarks
Perform keyword search on a specified field!

Community

Img To Video ⭐ 1

by danielgural
Bring images to life with image to video!

Community

Double Band Filter ⭐ 2

by jacobmarks
Filter on two numeric ranges simultaneously!

Community

Filter Values ⭐ 1

by ehofesmann
Filter a field of your FiftyOne dataset by one or more values.

Community

Line2d ⭐ 4

by wayofsamu
Visualize x,y-Points as a line chart.

Community

Twilio Automation ⭐ 2

by jacobmarks
Automate data ingestion with Twilio!

Community

Note

Community plugins are external projects maintained by their respective authors. They are not part of FiftyOne core and may change independently. Please review each plugin’s documentation and license before use.