
Interpreting the linear structure of vision-language model embedding spaces
