Positional versus Symbolic Attention Heads: Learning Dynamics, RoPE Geometry, and Length Generalization<br>[Abstract](abstracts/2605.31558.html)
Abstract (EN)
Abstract not available.
Abstract (ZH)
Not generated (translation can be retried later).