Plugins Ecosystem#

Welcome to the FiftyOne Plugins ecosystem! 🚀

Here you’ll discover cutting-edge research, state-of-the-art models, and powerful add-ons that unlock new FiftyOne workflows.

FiftyOne plugins allow you to extend and customize the functionality of the core tool to suit your specific needs. From advanced computer vision models to integrations with other popular AI tools, this curated collection of plugins will transform FiftyOne into your bespoke visual AI development workbench.


Image
Huggingface Hub ⭐ 2

by voxel51
Push FiftyOne datasets to the Hugging Face Hub, and load datasets from the Hub into FiftyOne!

Voxel51

Image
Transformers ⭐ 2

by voxel51
Run inference on your datasets using Hugging Face Transformers models!

Voxel51

Image
Pdf-loader ⭐ 4

by brimoor
Load your PDF documents into FiftyOne as per-page images

Community

Image
Synthetic Gui Samples Plugins ⭐ 7

by harpreetsahota
A FiftyOne plugin for generating synthetic samples for datasets in COCO4GUI format

Community

Image
Image Captioning ⭐ 10

by jacobmarks
Caption all your images with state of the art vision-language models!

Community

Image
Sam3 Images ⭐ 3

by harpreetsahota
Integration of Meta's SAM3 (Segment Anything Model 3) into FiftyOne, with full support of text prompts, keypoint prompts, bounding box prompts, auto segmentation, and image embeddings.

Community,Model

Image
Semantic Video Search ⭐ 23

by danielgural
search through your video datasets using FiftyOne Brain and Twelve Labs!

Community

Image
Qwen3vl Video ⭐ 12

by harpreetsahota
A FiftyOne zoo model integration for Qwen3-VL that enables comprehensive video understanding with multiple label types in a single forward pass and for computing video embeddings.

Community,Model

Image
Annotation ⭐ 134

by voxel51
Utilities for integrating FiftyOne with annotation tools

Voxel51

Image
Brain ⭐ 134

by voxel51
Utilities for working with the FiftyOne Brain

Voxel51

Image
Dashboard ⭐ 134

by voxel51
Create your own custom dashboards from within the App

Voxel51

Image
Io ⭐ 134

by voxel51
A collection of import/export utilities

Voxel51

Image
Indexes ⭐ 134

by voxel51
Utilities working with FiftyOne database indexes

Voxel51

Image
Plugins ⭐ 134

by voxel51
Utilities for managing and building FiftyOne plugins

Voxel51

Image
Delegated ⭐ 134

by voxel51
Utilities for managing your delegated operations

Voxel51

Image
Runs ⭐ 134

by voxel51
Utilities for managing your custom runs

Voxel51

Image
Utils ⭐ 134

by voxel51
Call your favorite SDK utilities from the App

Voxel51

Image
Zoo ⭐ 134

by voxel51
Download datasets and run inference with models from the FiftyOne Zoo, all without leaving the App

Voxel51

Image
Molmo2 ⭐ 0

by harpreetsahota
Molmo2 is a family of open vision-language models developed by the Allen Institute for AI (Ai2) that support image, video, and multi-image understanding and grounding.

Community,Model

Image
Apple Sharp ⭐ 4

by harpreetsahota
SHARP is Apple's state-of-the-art model for predicting 3D Gaussian Splats from a single RGB image. This integration brings SHARP to FiftyOne, enabling batch inference on image datasets with 3D visualization.

Community,Model

Image
Gemini-vision-plugin ⭐ 3

by AdonaiVera
This plugin integrates Google Gemini's multimodal Vision models (e.g., gemini-2.5-flash) into your FiftyOne workflows. Prompt with text and one or more images; receive a text response grounded in visual inputs

Community,Model

Image
Mineru 2 5 ⭐ 6

by harpreetsahota
MinerU2.5 is a 1.2B-parameter vision-language model for efficient high-resolution document parsing. This model can support grounding OCR as well as free text OCR.

Community,Model

Image
Nvlabs Cradiov3 ⭐ 25

by harpreetsahota
Implementing NVLabs C-RADIOv3 Embeddings Model as Remotely Sourced Zoo Model for FiftyOne

Community,Model

Image
Caption Viewer ⭐ 2

by harpreetsahota
A plugin that intelligently displays and formats VLM (Vision Language Model) outputs and text fields. Perfect for viewing OCR results, receipt analysis, document processing, and any text-heavy computer vision workflows.

Community

Image
Fiftyone-vlm-efficient ⭐ 4

by AdonaiVera
Improve VLM training data quality with state-of-the-art dataset pruning and quality techniques

Community

Image
Nemotron Nano Vl ⭐ 3

by harpreetsahota
Implementing Llama-3.1-Nemotron-Nano-VL-8B-V1 as a Remote Zoo Model for FiftyOne

Community,Model

Image
Model-comparison ⭐ 14

by allenleetc
Compare two object detection models!

Community

Image
Fiftyone Wandb Plugin ⭐ 2

by harpreetsahota
This plugin connects FiftyOne datasets with Weights & Biases to enable reproducible, data-centric ML workflows.

Community

Image
Moondream3 ⭐ 11

by harpreetsahota
Moondream 3 (Preview) is an vision language model with a mixture-of-experts architecture (9B total parameters, 2B active). This model makes no compromises, delivering state-of-the-art visual reasoning while still retaining our efficient and deployment-friendly ethos.

Community,Model

Image
Siglip2 ⭐ 2

by harpreetsahota
A FiftyOne Remotely Sourced Zoo Model integration for Google's SigLIP2 model enabling natural language search across images in your FiftyOne Dataset

Community,Model

Image
Vggt ⭐ 20

by harpreetsahota
Implemeting Meta AI's VGGT as a FiftyOne Remote Zoo Model

Community,Model

Image
Zero Shot Prediction ⭐ 36

by jacobmarks
Run zero-shot (open vocabulary) prediction on your data!

Community

Image
Fast Vlm ⭐ 9

by harpreetsahota
Integrating FastVLM as a Remote Source Zoo Model for FiftyOne

Community,Model

Image
Anonymize ⭐ 6

by swheaton
Anonymize/blur images based on a FiftyOne Detections field.

Community

Image
Medsiglip ⭐ 2

by harpreetsahota
Implementing MedSigLIP as a Remote Zoo Model for FiftyOne

Community,Model

Image
Text Evaluation Metrics ⭐ 1

by harpreetsahota
This plugin provides five text evaluation metrics for comparing predictions against ground truth\: ANLS, Exact Match, Normalized Similarity, Character Error Rate, and Word Error Rate.

Community

Image
Nanonets Ocr2 ⭐ 1

by harpreetsahota
Nanonets-OCR2 transforms documents into structured markdown with intelligent content recognition and semantic tagging, making it ideal for downstream processing by Large Language Models (LLMs).

Community,Model

Image
Olmocr-2 ⭐ 1

by harpreetsahota
olmOCR-2 is a state-of-the-art OCR model built on Qwen2.5-VL architecture that extracts text from document images with high accuracy.

Community,Model

Image
Deepseek Ocr ⭐ 3

by harpreetsahota
DeepSeek-OCR is a vision-language model designed for optical character recognition with a focus on "contextual optical compression."

Community,Model

Image
Semantic Document Search ⭐ 9

by jacobmarks
Perform semantic search on text in your documents!

Community

Image
Voxelgpt ⭐ 250

by voxel51
An AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions

Voxel51

Image
Jina Embeddings V4 ⭐ 1

by harpreetsahota
Jina Embeddings v4 is a state-of-the-art Vision Language Model that generates embeddings for both images and text in a shared vector space.

Community,Model

Image
Image Issues ⭐ 34

by jacobmarks
Find common image quality issues in your datasets

Community

Image
Vlmrun-voxel51-plugin ⭐ 9

by vlm-run
Extract structured data from visual and audio sources including documents, images, and videos

Community

Image
Kosmos2 5 ⭐ 3

by harpreetsahota
Kosmos-2.5 excels at two core tasks\: generating spatially-aware text blocks (OCR) and producing structured markdown output from images.

Community,Model

Image
Medgemma ⭐ 10

by harpreetsahota
Implementing MedGemma as a Remote Zoo Model for FiftyOne

Community,Model

Image
Nomic-embed-multimodal ⭐ 1

by harpreetsahota
Nomic Embed Multimodal is a family of vision-language models built on Qwen2.5-VL that generates high-dimensional embeddings for both images and text in a shared vector space.

Community,Model

Image
Bimodernvbert ⭐ 1

by harpreetsahota
BiModernVBert is a vision-language model built on the ModernVBert architecture that generates embeddings for both images and text in a shared 768-dimensional vector space.

Community,Model

Image
Colmodernvbert ⭐ 1

by harpreetsahota
ColModernVBert is a multi-vector vision-language model built on the ModernVBert architecture that generates ColBERT-style embeddings for both images and text.

Community,Model

Image
Colqwen2 5 V0 2 ⭐ 1

by harpreetsahota
ColQwen2.5 is a Vision Language Model based on Qwen2.5-VL-3B-Instruct that generates ColBERT-style multi-vector representations for efficient document retrieval. This version takes dynamic image resolutions (up to 768 image patches) and doesn't resize them, preserving aspect ratios for better accuracy.

Community,Model

Image
Bddoia-fiftyone ⭐ 2

by AdonaiVera
Load and explore the BDDOIA Safe/Unsafe Action dataset via the FiftyOne Zoo

Community,Dataset

Image
Active Learning ⭐ 17

by jacobmarks
Accelerate your data labeling with Active Learning!

Community

Image
Ui Tars ⭐ 7

by harpreetsahota
Implementing UI-TARS-1.5 as a Remote Zoo Model for FiftyOne

Community,Model

Image
Fiftyone-agents ⭐ 1

by AdonaiVera
A comprehensive FiftyOne plugin for testing and evaluating multiple Vision-Language Models (VLMs) with dynamic prompts and built-in evaluation capabilities

Community

Image
Gui Actor ⭐ 2

by harpreetsahota
Implementing Microsoft's GUI Actor as a Remote Zoo Model for FiftyOne

Community,Model

Image
Isaac0 1 ⭐ 4

by harpreetsahota
Isaac-0.1 is the first in Perceptron AI's family of models built to be the intelligence layer for the physical world. This integration supports various computer vision tasks including object detection, classification, OCR, visual question answering, and more.

Community,Model

Image
Colpali V1 3 ⭐ 1

by harpreetsahota
ColPali is a Vision Language Model based on PaliGemma-3B that generates ColBERT-style multi-vector representations for efficient document retrieval.

Community,Model

Image
Paligemma2 ⭐ 5

by harpreetsahota
Implementing PaliGemma-2-Mix as a Remote Zoo Model for FiftyOne

Community,Model

Image
Minicpm-v ⭐ 4

by harpreetsahota
Integrating MiniCPM-V 4.5 as a Remote Source Zoo Model in FiftyOne

Community,Model

Image
Multimodal Rag ⭐ 21

by jacobmarks
Create and test multimodal RAG pipelines with LlamaIndex, Milvus, and FiftyOne!

Community

Image
Audio Retrieval ⭐ 11

by jacobmarks
Find the images in your dataset most similar to an audio file!

Community

Image
Nemo Retriever Parse Plugin ⭐ 4

by harpreetsahota
Implementing NVIDIA NeMo Retriever Parse as a FiftyOne Plugin

Community

Image
Clustering ⭐ 11

by jacobmarks
Cluster your images using embeddings with FiftyOne and scikit-learn!

Community

Image
Clustering Algorithms ⭐ 4

by danielgural
Find the clusters in your data using some of the best algorithms available!

Community

Image
Vitpose ⭐ 3

by harpreetsahota
Run ViTPose Models from Hugging Face on your FiftyOne Dataset

Community

Image
Moondream2 ⭐ 3

by harpreetsahota
Moondream2 implementation as a remotely sourced zoo model for FiftyOne

Community,Model

Image
Florence2 ⭐ 4

by harpreetsahota
Implementing Florence2 as a Remote Zoo Model for FiftyOne

Community,Model

Image
Multi Annotator Toolkit ⭐ 5

by madave94
Tackle noisy annotation! Find and analyze annotation issues in datasets with multiple annotators per image.

Community

Image
Kimi Vl A3b ⭐ 6

by harpreetsahota
FiftyOne Remotely Sourced Zoo Model integration for Moonshot AI's Kimi-VL-A3B models enabling object detection, keypoint localization, and image classification with strong GUI and document understanding capabilities.

Community,Model

Image
Showui ⭐ 2

by harpreetsahota
Integrating ShowUI into FiftyOne as a Remote Source Zoo Model

Community,Model

Image
Mimo Vl ⭐ 3

by harpreetsahota
Implementing MiMo-VL as a Remote Zoo Model for FiftyOne

Community,Model

Image
Os Atlas ⭐ 5

by harpreetsahota
Integrating OS-Atlas Base into FiftyOne as a Remote Source Zoo Model

Community,Model

Image
Vqa-plugin ⭐ 19

by jacobmarks
Ask (and answer) open-ended visual questions about your images!

Community

Image
Fiftyone Lerobot Importer ⭐ 5

by harpreetsahota
Import your LeRobot format dataset into FiftyOne format

Community

Image
Coco4gui Fiftyone ⭐ 3

by harpreetsahota
Implementing the COCO4GUI dataset type in FiftyOne with importers and exports

Community

Image
Segments-voxel51-plugin ⭐ 5

by segmentsai
Integrate FiftyOne with the Segments.ai annotation tool!

Community

Image
Youtube Panel Plugin ⭐ 6

by jacobmarks
Play YouTube videos in the FiftyOne App!

Community

Image
Mlflow ⭐ 5

by voxel51
Track model training experiments on your FiftyOne datasets with MLflow!

Voxel51

Image
Edit Label Attributes ⭐ 3

by ehofesmann
Edit attributes of your labels directly in the FiftyOne App!

Community

Image
Fiftyone-tile ⭐ 1

by mmoollllee
Tile your high resolution images to squares for training small object detection models

Community

Image
Hiera Video Embeddings ⭐ 3

by harpreetsahota
Compute embeddings for video using Facebook Hiera Models

Community

Image
Qwen2 5 Vl ⭐ 1

by harpreetsahota
Implementing Qwen2.5-VL as a Remote Zoo Model for FiftyOne

Community,Model

Image
Pytesseract Ocr ⭐ 11

by jacobmarks
Run optical character recognition with PyTesseract!

Community

Image
Reverse Image Search ⭐ 13

by jacobmarks
Find the images in your dataset most similar to an image from filesystem or the internet!

Community

Image
Audio Loader ⭐ 5

by danielgural
Import your audio datasets as spectograms into FiftyOne!

Community

Image
Visual Document Retrieval ⭐ 3

by harpreetsahota
A FiftyOne Remotely Sourced Zoo Model integration for LlamaIndex's VDR model enabling natural language search across document images, screenshots, and charts in your datasets.

Community,Model

Image
Albumentations Augmentation ⭐ 13

by jacobmarks
Test out any Albumentations data augmentation transform with FiftyOne!

Community

Image
Image Deduplication ⭐ 18

by jacobmarks
Find exact and approximate duplicates in your dataset!

Community

Image
Emoji Search ⭐ 7

by jacobmarks
Semantically search emojis and copy to clipboard!

Community

Image
Janus Vqa ⭐ 6

by harpreetsahota
Run the Janus Pro Models from Deepseek on your Fiftyone Dataset

Community

Image
Depth Pro Plugin ⭐ 2

by harpreetsahota
Perfom zero-shot metric monocular depth estimation using the Apple Depth Pro model

Community

Image
Optimal Confidence Threshold ⭐ 5

by danielgural
Find the optimal confidence threshold for your detection models automatically!

Community

Image
Outlier Detection ⭐ 7

by danielgural
Find those troublesome outliers in your dataset automatically!

Community

Image
Text To Image ⭐ 33

by jacobmarks
Add synthetic data from prompts with text-to-image models and FiftyOne!

Community

Image
Plotly-map-panel ⭐ 0

by allenleetc
Plotly-based Map Panel with adjustable marker cosmetics!

Community

Image
Concept Space Traversal ⭐ 5

by jacobmarks
Navigate concept space with CLIP, vector search, and FiftyOne!

Community

Image
Concept Interpolation ⭐ 6

by jacobmarks
Find images that best interpolate between two text-based extremes!

Community

Image
Gpt4 Vision ⭐ 9

by jacobmarks
Chat with your images using GPT-4 Vision!

Community

Image
Fiftyone-timestamps ⭐ 1

by mmoollllee
Compute datetime-related fields (sunrise, dawn, evening, weekday, ...) from your samples' filenames or creation dates

Community

Image
Keyword Search ⭐ 3

by jacobmarks
Perform keyword search on a specified field!

Community

Image
Img To Video ⭐ 1

by danielgural
Bring images to life with image to video!

Community

Image
Double Band Filter ⭐ 2

by jacobmarks
on two numeric ranges simultaneously!

Community

Image
Filter Values ⭐ 1

by ehofesmann
Filter a field of your FiftyOne dataset by one or more values.

Community

Image
Line2d ⭐ 4

by wayofsamu
Visualize x,y-Points as a line chart.

Community

Image
Twilio Automation ⭐ 2

by jacobmarks
Automate data ingestion with Twilio!

Community

Note

Community plugins are external projects maintained by their respective authors. They are not part of FiftyOne core and may change independently. Please review each plugin’s documentation and license before use.