基于LLM引导符号回归的变构型AUV自适应控制

Adaptive control of a morphing AUV based on large language model-guided symbolic regression

  • 摘要:
    目的 针对爬游混合型自主水下航行器(AUV)在构型转换过程中面临的剧烈非线性动态变化及安全稳定控制难题,提出一种基于大型语言模型(LLM)驱动的双环符号自适应控制(DSAC)框架。
    方法 该框架由慢速自适应环(~0.1 Hz)和快速控制环(100 Hz)组成。在慢速环中,创新性地将LLM作为“物理推理器”,通过解析肢体构型语义,自动生成包含搜索空间、必要项与禁止项的结构先验,将盲目符号回归(SR)约4.32万种组合的搜索空间压缩至约256种(缩减169倍),并加速收敛2.1倍,引导符号回归(SR)引擎从残差数据中快速发现具备显式解析形式的阻力定律。在快速环中,设计基于李雅普诺夫的安全滤波器,在更新控制律前对生成模型的耗散性进行实时验证,确保系统在动力学突变下的理论稳定性。
    结果 在渐变、阶跃、正弦三种变形场景的高保真仿真中,DSAC框架均能准确辨识变形过程中的流体动力学结构,轨迹跟踪均方根误差(RMSE)较传统PID及鲁棒MRAC降至0.054 m/s,降低近25%。在快速构型转换阶段,传统方法常出现振荡行为,而该框架的性能提升尤为显著。安全滤波器在变形阶段共处理47个候选模型,成功拦截3个物理不可行的“幻觉”模型(含负阻尼项),在包含30%对抗性扰动的压力测试中实现100%安全合规率,验证了“LLM生成假设—物理约束验证”范式的有效性。湖泊试验数据的校准仿真进一步表明,DSAC的预测与真实非线性特征一致。这些结果凸显了该框架对模型误设和环境扰动两者的鲁棒性。
    结论 所提出的DSAC框架可有效解决变构型AUV动力学建模中未知模型结构、搜索效率低和物理合理性差三大挑战,通过LLM语义引导与李雅普诺夫安全验证的结合,实现了数据驱动学习能力与严格稳定性保障的统一。这为将LLM驱动的符号发现方法推广到其他具有复杂构型依赖动力学的机器人领域开辟了新的可能。

     

    Abstract:
    Objective Hybrid crawling-swimming autonomous underwater vehicles (AUVs) possess the unique capability to operate both in the water column and along the seabed. However, their operation is severely challenged by abrupt and highly nonlinear variations in system dynamics during configuration transitions (morphing), which pose significant challenges for ensuring safe and stable control. This paper proposes a dual-loop symbolic adaptive control (DSAC) framework driven by large language model (LLM)-guided symbolic regression.
    Method The proposed DSAC architecture comprises a slow adaptation loop (~0.1 Hz) and a fast control loop (100 Hz). In the slow adaptation loop, an LLM is innovatively employed as a "physics reasoner". By interpreting the semantic implications of limb configuration changes, the LLM automatically generates structural priors—namely, the search space, required terms, and forbidden terms—thereby reducing the blind symbolic regression (SR) search space from approximately 43 200 combinations to about 256 (a 169-fold reduction) and improving convergence speed by a factor of 2.1. Guided by these priors, the SR engine efficiently identifies drag laws in explicit analytical form from residual data. In the fast control loop (100 Hz), a Lyapunov-based safety filter is designed to verify the dissipativity of the identified model in real time before updating the control law. This mechanism ensures the theoretical stability of the system during dynamic transitions.
    Results High-fidelity simulations under three morphing scenarios (gradual, step, and sinusoidal) demonstrate that the proposed DSAC framework can accurately identify hydrodynamic structures during the deformation process. Compared with conventional PID control and robust MRAC, the DSAC framework reduces the tracking error root-mean-square error (RMSE) by approximately 25%, with the RMSE decreasing to 0.054 m/s. This performance gain is particularly pronounced during abrupt configuration transitions, where traditional controllers often struggle to maintain stability. During the morphing phases, the safety filter evaluated 47 candidate models and successfully rejected 3 physically implausible "hallucinated" models, including one containing negative damping terms. In adversarial stress tests, where 30% of the candidate models were intentionally corrupted with non physical terms, the framework maintained a 100% safety compliance rate, thereby validating the effectiveness of the "LLM-driven hypothesis generation—physical constraint verification" paradigm. Calibration simulations based on lake trial data further demonstrate that the DSAC predictions are consistent with real-world nonlinear characteristics. These findings confirm the robustness of the proposed framework against model misspecification and environmental disturbances.
    Conclusion The proposed DSAC framework effectively addresses three fundamental challenges in morphing AUV dynamics modeling: unknown model structures, inefficient search processes, and poor physical plausibility. By integrating LLM-based semantic guidance with Lyapunov-based stability verification, it achieves a unified balance between adaptability and stability. This methodology can be extended to other robotic systems with configuration-dependent dynamics.

     

/

返回文章
返回