Abstract: Exploring complementary information between RGB and thermal/depth modalities is crucial for bi-modal salient object detection (BSOD). However, the distinct characteristics of different ...
Abstract: With the rapid development of imaging sensor technology in the field of remote sensing, multi-modal remote sensing data fusion has emerged as a crucial research direction for land cover ...
Video-MME applies to both image MLLMs, i.e., generalizing to multiple images, and video MLLMs. 🌟 Video-MME is only used for academic research. Commercial use in any form is prohibited. The copyright ...