I'm a researcher at Shanghai AI Lab working on data-centric research.
Before that, I obtained my bachelor's and master's degrees at Sun Yat-Sen University (SYSU).
We found that 50% of LAION-2B samples contain Parrot Captions (text that appears both in the caption and printed in the image pixels),
and that Parrot Captions have a large impact on CLIP-style Vision-Language Alignment.