Reading Between the Pixels: Linking Text-Image Embedding Alignment to Typographic Attack Success on Vision-Language Models<br>像素间的解读:将文本-图像嵌入对齐与视觉语言模型上的排版攻击成功率关联研究<br>[摘要](abstracts/2604.12371.html)
Abstract (EN)
Abstract not available.
摘要 (ZH)
未生成(可稍后重试翻译)
← Back