2023-10-05
Artificial Intelligence, Information Processing | Computing, Research
Haotian Liu, Chunyuan Li, Yuheng Li and Yong Jae Lee show that the fully-connected vision-language cross-modal connector in LLaVA is surprisingly powerful and data-efficient.