OpenGVLab – Medium

OpenGVLab

Published in

Open-source vision-language model now comparable to GPT-4V

InternVL 1.5 VS GPT4V. Here are some real cases.

May 2, 2024

Open-source vision-language model now comparable to GPT-4V

May 2, 2024

Published in

Does Your Multi-model LLM Truly See The Diagrams In Visual Math Problems?

If you want to use an AI model to solve math problems, you need to make sure which one understands diagrams.

Apr 17, 2024

Does Your Multi-model LLM Truly See The Diagrams In Visual Math Problems?

Apr 17, 2024

Published in

InternVid: Video-Text Dataset to Empowering Video Creation and Understanding

A large-scale video-text dataset contains over 7 million videos.

Apr 3, 2024

InternVid: Video-Text Dataset to Empowering Video Creation and Understanding

Apr 3, 2024

Published in

VideoMamba: State Space Model for Efficient Video Understanding

Better, faster, cheaper method for Video understanding with AI

Mar 27, 2024

VideoMamba: State Space Model for Efficient Video Understanding

Mar 27, 2024

Published in

The All-Seeing Project: Towards Panoptic Visual Recognization and General Relation Comprehension…

Mar 26, 2024

The All-Seeing Project: Towards Panoptic Visual Recognization and General Relation Comprehension…

Mar 26, 2024

OmniQuant: Calibrated Quantization for LLMs, Has been Integrated with commercial APP

An open-source, efficient LLM Model quantization method.

Mar 12, 2024

OmniQuant: Calibrated Quantization for LLMs, Has been Integrated with commercial APP

Mar 12, 2024

OpenGVLab

OpenGVLab

Friend of Medium

General Vision Team of Shanghai AI Lab; https://github.com/OpenGVLab; https://twitter.com/opengvlab

Following

Help

Status

About

Careers

Press

Blog

Privacy

Rules

Terms

Text to speech