Intraoral scanning is widely used for digital optical impressions in prosthodontic, implant, and orthodontic treatment, but full-arch and long-span scanning remain labor-intensive tasks with limited automation. In the confined oral cavity, operators must continuously adjust scanner motion while accumulating narrow field-of-view observations, making reconstruction quality sensitive to missing tooth surfaces and operator workload. We propose RobOralScan, which, to the best of our knowledge, is the first reinforcement learning (RL)-based pipeline for robotic automatic intraoral scanning. RobOralScan introduces a geometric memory-based observation space that accumulates partial scan observations into a tri-state geometric representation, allowing the policy to reason over scan history and insufficiently observed regions. It further introduces tooth-wise coverage learning, combining coverage-aware reward signals and a progressive training scheme to improve global reconstruction coverage while reducing uneven coverage across individual teeth. The learned policy selects relative scanner motions from accumulated geometric memory and robot proprioception for closed-loop scan control within the oral workspace. RobOralScan achieves a Chamfer Distance of 0.00838, an average coverage of 92.58%, a lower-tail per-tooth coverage of 88.45%, and a normalized AUC of 0.6674, completing the scan criterion in 8 of 10 evaluation episodes. Furthermore, zero-shot sim-to-real experiments demonstrate its practical feasibility on a physical robot-scanner setup.
口内扫描广泛应用于修复、种植和正畸治疗中的数字光学印模,但全牙弓和长跨度扫描仍是劳动密集型任务,自动化程度有限。在受限的口腔环境中,操作员必须持续调整扫描仪运动,同时积累窄视场观测,使得重建质量易受缺失牙面及操作员工作负荷影响。我们提出RobOralScan——据我们所知,这是首个基于强化学习(RL)的机器人自动口内扫描流程。RobOralScan引入了一种基于几何记忆的观测空间,将部分扫描观测累积为三态几何表示,使策略能够推理扫描历史及观测不足的区域。它还进一步引入了基于牙齿的覆盖学习,结合覆盖感知奖励信号与渐进式训练方案,在提升全局重建覆盖的同时减少各牙齿间覆盖不均。学习到的策略从累积的几何记忆与机器人本体感知中选取相对扫描仪运动,以实现口腔工作空间内的闭环扫描控制。RobOralScan实现了0.00838的倒角距离、92.58%的平均覆盖、88.45%的牙齿覆盖率下限、0.6674的归一化AUC,并在10次评估片段中有8次完成扫描标准。此外,零样本的仿真到实物实验证明了其在实际机器人-扫描仪装置上的可行性。