
VIMI: Grounding Video Generation through Multi-modal Instruction
