Abstract: Exploring complementary information between RGB and thermal/depth modalities is crucial for bi-modal salient object detection (BSOD). However, the distinct characteristics of different ...
Abstract: With the rapid development of imaging sensor technology in the field of remote sensing, multi-modal remote sensing data fusion has emerged as a crucial research direction for land cover ...
Video-MME applies to both image MLLMs, i.e., generalizing to multiple images, and video MLLMs. 🌟 Video-MME is only used for academic research. Commercial use in any form is prohibited. The copyright ...