分享

Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs

热度