Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments