

Poster

X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning

Artemis Panagopoulou ⋅ Le Xue ⋅ Ning Yu ⋅ Junnan Li ⋅ Dongxu Li ⋅ Shafiq Joty ⋅ Ran Xu ⋅ Silvio Savarese ⋅ Caiming Xiong ⋅ Juan Carlos Niebles
2024 Poster

Abstract
