乏信息下基于大模型的无人艇路径规划智能体框架设计与研究

LLM-based intelligent agent framework for USV path planning in information-scarce environments

  • 摘要: 摘 要:【目的】针对现有无人艇路径规划算法在认知盲区、感知缺失和语义模糊等“乏信息”环境下高度依赖规则库和高精度传感输入、难以应对突发未知威胁的局限性,提出一种基于大语言模型的智能体框架,探索其在“乏信息”下进行无人艇路径规划的可行性与优势。【方法】所构建的智能体框架以预训练大模型为智能中枢,集成多种工具函数(包括各类传感器调用接口)和人机交互接口,引入长短期记忆与在线和离线演进能力;设计结构化提示模板,将传感器观测及置信度、任务目标、先验常识、决策优先级和条件判断等信息组织为提示,引导大模型进行环境感知、动态推理与路径生成。【结果】实验结果表明,在静态障碍物环境下,智能体在路径长度、转弯次数、平均转弯角度等关键指标上均优于或不明显逊于A*、RRT、PSO和Hybrid A*等代表性传统算法,综合得分为1,高于其他传统算法(分别为0.47、0.47、0.77、0.76);在动态环境下,大模型能够有效调用工具函数,适时请求人机交互,并基于实时感知信息执行路径规划。在面临认知盲区、感知缺失和语义模糊等复杂情境时,模型还能结合置信度评估、历史记忆及常识性知识,自适应调整规划策略,展现出卓越的规划推理能力与泛化适应能力,体现出高度类人化的智能水平。【结论】可见,基于大模型的无人艇路径规划智能体无论是在简单避障还是在“乏信息”等复杂场景下均具有显著优势,有望推动无人艇向更高自主性发展。

     

    Abstract: Abstract:Objectives Existing unmanned surface vehicle (USV) path planning algorithms rely heavily on rule-based libraries and high-precision sensor inputs, making them ill-suited to “information-scarce” environments characterized by cognitive blind spots, perception loss, and semantic ambiguity, and thus unable to handle sudden unknown threats. This work proposes an intelligent agent framework based on large language models (LLMs) to investigate its feasibility and advantages for USV path planning under such information-scarce conditions. Methods The proposed agent framework centers on a pre-trained LLM as the planning core, integrating various tool functions (including sensor interfaces) and a human-machine interaction module, and incorporating short-term and long-term memories with both online and offline evolution capabilities. A structured prompt template is designed to organize sensor observations and their confidence levels, mission objectives, prior knowledge, decision priorities, and conditional logic into prompts that guide the LLM in environment perception, dynamic reasoning, and path generation. Results Experimental results demonstrate that, in environments with static obstacles, the proposed intelligent agent outperforms or is at least comparable to representative traditional algorithms such as A*, RRT, PSO, and Hybrid A* in key metrics including path length, number of turns, and average turning angle. The agent achieved a composite performance score of 1.00, significantly higher than those of the traditional algorithms (0.47, 0.47, 0.77, and 0.76, respectively). In dynamic environments, the LLM can effectively invoke tool functions, initiate human-in-the-loop interactions when appropriate, and perform real-time path planning based on sensor inputs. When faced with challenging conditions such as cognitive blind spots, perceptual gaps, and semantic ambiguity, the agent can adapt its planning strategy by leveraging confidence assessments, historical memory, and commonsense knowledge, demonstrating robust reasoning, generalization capabilities, and human-like intelligence. Conclusions These findings indicate that the LLM-based path planning agent exhibits significant advantages in both simple obstacle avoidance and complex, information-scarce scenarios. This approach offers strong potential to advance USVs toward higher levels of autonomy.

     

/

返回文章
返回