Skip to yearly menu bar Skip to main content


Poster

X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning

Artemis Panagopoulou · Le Xue · Ning Yu · LI JUNNAN · DONGXU LI · Shafiq Joty · Ran Xu · Silvio Savarese · Caiming Xiong · Juan Carlos Niebles
2024 Poster

Abstract

Chat is not available.