Reading Between the Pixels: Linking Text-Image Embedding Alignment to Typographic Attack Success on Vision-Language Models<br>像素间的解读：将文本-图像嵌入对齐与视觉语言模型上的排版攻击成功率关联研究<br>[摘要](abstracts/2604.12371.html)