Reading Between the Pixels: Linking Text-Image Embedding Alignment to Typographic Attack Success on Vision-Language Models<br>[Abstract](abstracts/2604.12371.html)

Abstract (EN)

Abstract not available.

Abstract (ZH)

Not generated (the translation can be retried later).
