• 00 07-03 (4) Agentic Business Process Management: Practitioner Perspectives on Agent Governance in Business Processes Agentic Business Process Management: Praxisperspektiven zur Agenten-Governance in Unternehmensprozessen 代理业务流程管理:从业者对业务流程代理治理的看法 2504.03693v2
  • 01 07-03 KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs KERAP: Ein wissensbasierter Ansatz für genaue Null-Shot-Diagnose-Vorhersage mit Multi-Agent LLMs KERAP: 利用多智能体LLMs进行准确零样本诊断预测的知识增强推理方法 2507.02773v1
  • 02 07-03 A unifying approach to self-organizing systems interacting via conservation laws Ein vereinheitlichter Ansatz für selbstorganisierende Systeme, die über Erhaltungsgesetze interagieren 对通过养护法相互作用的自我组织系统采取统一办法 2507.02575v1
  • 03 07-03 Incorporating LLMs for Large-Scale Urban Complex Mobility Simulation Einschließlich LLMs für großräumige Urban Complex Mobility Simulation 大型城市综合流动模拟项目LLMs 2505.21880v2
  • 04 07-03 Benchmarking Generalizable Bimanual Manipulation: RoboTwin Dual-Arm Collaboration Challenge at CVPR 2025 MEIS Workshop Benchmarking Generalizable Bimanual Manipulation: RoboTwin Dual-Arm Collaboration Challenge bei CVPR 2025 MEIS Workshop 基准的可通用二手操纵:2025年欧洲气象和气象科学研究所讲习班上的机器人双臂双臂合作挑战 2506.23351v2
  • 05 07-03 Horus: A Protocol for Trustless Delegation Under Uncertainty Horus: Ein Protokoll für vertrauenslose Delegation unter Unsicherheit 荷鲁斯:不确定性下的无需信任委托协议 2507.00631v3
  • 06 07-02 (3) Synergizing Logical Reasoning, Knowledge Management and Collaboration in Multi-Agent LLM System Synergisieren von logischer Vernunft, Wissensmanagement und Zusammenarbeit im Multi-Agent LLM-System 多机构LLM系统统一逻辑理由、知识管理和协作 2507.02170v1
  • 07 07-02 Enhancing LLM-based Quantum Code Generation with Multi-Agent Optimization and Quantum Error Correction Verbesserung der LLM-basierten Quantencode-Generierung durch Multi-Agent-Optimierung und Quantenfehlerkorrektur 强化基于LLM的量制码生成,并采用多种物力优化和量度错误校正 2504.14557v2
  • 08 07-02 Distance-based Relative Orbital Transition for Satellite Swarm Array Deployment Under J2 Perturbation Distanzbasierter relativer Orbitalübergang für Satelliten-Swarm-Array-Einsatz unter J2 Perturbation 在J2扰动下部署卫星冲积阵列的相对轨道过渡 2507.01769v1
  • 09 07-02 Agent Ideate: A Framework for Product Idea Generation from Patents Using Agentic AI Agent-Idee: Ein Rahmen für Produkt-Ideen-Erzeugung aus Patenten mit Agent-KI Agent Ideate: 使用Agentic AI 专利产品创意一代框架 2507.01717v1
  • 10 07-02 Exploring Advanced LLM Multi-Agent Systems Based on Blackboard Architecture Erforschen von fortgeschrittenen LLM-Multi-Agent-Systemen auf der Basis von Tafelarchitektur 探索基于黑板架构的高级LLM多机构系统 2507.01701v1
  • 11 07-02 Co-Optimizing Reconfigurable Environments and Policies for Decentralized Multi-Agent Navigation Co-Optimierung neu konfigurierbarer Umgebungen und Politiken für dezentralisierte Multi-Agent-Navigation 共同优化可重新配置的环境和权力下放多机构导航政策 2403.14583v2
  • 12 07-02 Agent-as-Tool: A Study on the Hierarchical Decision Making with Reinforcement Learning Agent-as-Tool: Eine Studie über die hierarchische Entscheidungsfindung mit Verstärkungslernen Agent-as-Tool:关于以强化学习方式作出等级决策的研究 2507.01489v1
  • 13 07-02 BioMARS: A Multi-Agent Robotic System for Autonomous Biological Experiments BioMARS: Ein Multi-Agenten-Robotersystem für autonome biologische Experimente BioMARS:一个用于自主生物实验的多功能机器人系统 2507.01485v1
  • 14 07-02 RALLY: Role-Adaptive LLM-Driven Yoked Navigation for Agentic UAV Swarms RALLY: Rollenadaptive LLM-getriebene Yoked-Navigation für Agentische UAV-Swarme 用于UAV冲锋枪的 2507.01378v1
  • 15 07-02 Cooperative Target Capture in 3D Engagements over Switched Dynamic Graphs Kooperative Zielerfassung in 3D-Verpflichtungen über gewechselte dynamische Graphen 通过切换动态图表进行三维参与中的合作目标抓取 2507.01350v1
  • 16 07-02 Aitomia: Your Intelligent Assistant for AI-Driven Atomistic and Quantum Chemical Simulations Aitomia: Ihr intelligenter Assistent für KI-getriebene Atomistische und Quantum Chemical Simulationen Aitomia:您对AI-Driven原子学和量子化学模拟的智能助理 2505.08195v2
  • 17 07-02 Optimal Dispersion Under Asynchrony Optimale Dispersion unter Asynchronie Asynconsrony 下的优化分散 2507.01298v1
  • 18 07-02 Adaptive Traffic Signal Control based on Multi-Agent Reinforcement Learning. Case Study on a simulated real-world corridor Adaptive Verkehrssignalsteuerung auf Basis des Multi-Agenten-Verstärkungslernens. Fallstudie zu einem simulierten Real-World-Korridor 基于多机构强化学习的适应性交通信号控制,模拟现实世界走廊案例研究 2503.02189v4
  • 19 07-01 (2) Dynamic Strategy Adaptation in Multi-Agent Environments with Large Language Models Dynamische Strategieanpassung in Multi-Agent-Umgebungen mit großen Sprachmodellen 具有大语言模式的多机构环境中的动态战略适应 2507.02002v1
  • 20 07-01 Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Active Scout: Multi-Target-Tracking mit neuralen Strahlungsfeldern in dichten städtischen Umgebungen 活跃的童子军:在城市环境中使用神经辐射场进行多目标跟踪 2406.07431v3
  • 21 07-01 Large Language Model Powered Intelligent Urban Agents: Concepts, Capabilities, and Applications Large Language Model Powered Intelligent Urban Agents: Konzepte, Fähigkeiten und Anwendungen 大语言示范型大语言智能智能城市代表机构:概念、能力和应用 2507.00914v1
  • 22 07-01 TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation TransLaw: Benchmarking von großen Sprachmodellen in der Multi-Agenten-Simulation der Kollaborativen Übersetzung TransLaw:在多方代理模拟协作翻译时确定大语言模式基准 2507.00875v1
  • 23 07-01 Position: Emergent Machina Sapiens Urge Rethinking Multi-Agent Paradigms Position: Emergent Machina Sapiens Urge Rethinking Multi-Agent Paradigmen 职位: 新兴马奇纳·萨皮恩斯敦促重新思考多机构模式 2502.04388v3
  • 24 07-01 Robust Correlated Equilibrium: Definition and Computation Robustes korreliertes Gleichgewicht: Definition und Berechnung 强力Cor相关平衡:定义和计算 2311.17592v2
  • 25 07-01 Hierarchical Decentralized Stochastic Control for Cyber-Physical Systems Hierarchische dezentrale stochastische Steuerung für Cyber-Physische Systeme 网络物理系统等级分层存储控制 2506.22971v2
  • 26 07-01 Towards a Playground to Democratize Experimentation and Benchmarking of AI Agents for Network Troubleshooting Auf dem Weg zu einem Spielplatz zur Demokratisierung von Experimenten und Benchmarking von KI-Agenten zur Netzwerkfehlerbehebung 走向使AI 网络排除问题代理机构民主化试验和基准设定的竞技场 2507.01997v1
  • 27 07-01 Twill: Scheduling Compound AI Systems on Heterogeneous Mobile Edge Platforms Twill: Scheduling Compound AI-Systeme auf heterogenen mobilen Edge-Plattformen Twill: 异源移动边缘平台上排成不同式移动边缘平台的AI系统 2507.00491v1
  • 28 07-01 Novel Pigeon-inspired 3D Obstacle Detection and Avoidance Maneuver for Multi-UAV Systems Neuartige Pigeon-inspirierte 3D-Hördererkennung und Vermeidungsmanöver für Multi-UAV-Systeme 多无人驾驶航空器系统3D障碍探测和避免多功能、无人驾驶航空器系统新小鸽诱导的3D障碍操纵器 2507.00443v1
  • 29 06-30 (1) What Makes Local Updates Effective: The Role of Data Heterogeneity and Smoothness Was lokale Updates effektiv macht: Die Rolle von Daten Heterogenität und Glätte 是什么使本地更新有效:数据多样化和平稳的作用 2507.00195v1
  • 30 06-30 Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning Erlernbare Multi-Agent-Pathfinding-Lösemittel mit aktiver Feinsteuerung 具有积极微调功能的推进可学习多机构探索式解答器 2506.23793v1
  • 31 06-30 PokéAI: A Goal-Generating, Battle-Optimizing Multi-agent System for Pokemon Red PokéAI: Ein Ziel-Generierung, Schlacht-Optimierung Multi-Agenten-System für Pokemon Red PokéAI:波克蒙红公司目标启动、战斗优化多剂多试剂系统 2506.23689v1
  • 32 06-30 Curated Collaborative AI Edge with Network Data Analytics for B5G/6G Radio Access Networks Kuratierter Kollaborativer AI Edge mit Network Data Analytics für B5G/6G Radio Access Networks B5G/6G无线电接入网络与网络数据分析 2507.01994v1
  • 33 06-30 MGPRL: Distributed Multi-Gaussian Processes for Wi-Fi-based Multi-Robot Relative Localization in Large Indoor Environments MGPRL: Verteilte Multi-Gaussian-Prozesse für WLAN-basierte Multi-Roboter-relative Lokalisierung in großen Innenräumen MGPRL:大型室内环境中无线-基于无线-基于多机器人的多机器人相对本地化的分布式多盖日进程 2506.23514v1
  • 34 06-30 State and Memory is All You Need for Robust and Reliable AI Agents Zustand und Gedächtnis sind alles, was Sie für robuste und zuverlässige KI-Agenten brauchen 国家记忆是强力和可靠的AI代理所需要的一切 2507.00081v1
  • 35 06-29 (7) Automated Vehicles Should be Connected with Natural Language Automatisierte Fahrzeuge sollten mit natürlicher Sprache verbunden werden 自动车辆应与自然语言连接 2507.01059v1
  • 36 06-29 Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge Agentisches medizinisches Wissen Grafiken verbessern medizinische Frageantworten: Die Lücke zwischen LLMs und sich entwickelndem medizinischem Wissen überbrücken 药用知识图加强医疗问题的回答:缩小LLMM与不断发展的医学知识之间的差距 2502.13010v3
  • 37 06-29 Ad-Hoc Human-AI Coordination Challenge Ad-hoc-Koordinierungsherausforderung Mensch-AI A. 协调挑战 2506.21490v2
  • 38 06-29 Interaction Identification of a Heterogeneous NDS with Quadratic-Bilinear Subsystems Interaktionsidentifizierung eines Heterogenen NDS mit Quadratisch-Bilinearen Subsystemen 与赤道-双线亚系统对异基因 NDS的交互识别 2412.02547v2
  • 39 06-29 Research on Comprehensive Classroom Evaluation System Based on Multiple AI Models Forschung zum umfassenden Klassenraum-Bewertungssystem auf der Grundlage mehrerer KI-Modelle 基于多种AI模式的综合课堂评价系统研究 2506.23079v1
  • 40 06-28 (6) Evaluating Agents using Social Choice Theory Bewertung von Agenten anhand der Theorie der sozialen Wahl 使用社会选择理论评估代理人 2312.03121v4
  • 41 06-28 A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems Eine großsprachige modellfähige Steuerungsarchitektur für dynamische Ressourcenkapazitäts-Exploration in Multi-Agent-Produktionssystemen 多机构制造系统动态资源能力探索大语言模型化控制结构 2505.22814v2
  • 42 06-28 Resilient-Native and Intelligent Next-Generation Wireless Systems: Key Enablers, Foundations, and Applications Resilient-Native und intelligente Mobilfunksysteme der nächsten Generation: Key Enabler, Grundlagen und Anwendungen 具有弹性的、有弹性的和智能的下一级无线无线系统:关键启用器、基础和应用 2506.22991v1
  • 43 06-28 Detection of coordinated fleet vehicles in route choice urban games. Part I. Inverse fleet assignment theory Ermittlung koordinierter Flottenfahrzeuge bei der Routenwahl urbane Spiele. Teil I Inverse Flottenzuteilungstheorie 在选择路线选择城市游戏中发现协调一致的车队车辆。 2506.22966v1
  • 44 06-28 Agent-to-Agent Theory of Mind: Testing Interlocutor Awareness among Large Language Models Agent-to-Agent Theorie des Geistes: Testen Gesprächspartner Bewusstsein unter großen Sprachmodellen 精神感官理论:测试大语言模型间对话者的认识 2506.22957v1
  • 45 06-28 Neural Cellular Automata: From Cells to Pixels Neurale Zelluläre Automaten: Von Zellen zu Pixeln 神经细胞自定义数据: 从单元格到像素 2506.22899v1
  • 46 06-28 Cooperation as Black Box: Conceptual Fluctuation and Diagnostic Tools for Misalignment in MAS Kooperation als Black Box: Konzeptionelle Fluktuation und Diagnosetools für Fehlausrichtung in MAS 合作作为黑箱:MAS中不协调的概念波动和诊断工具 2506.22876v1
  • 47 06-28 Momentum-based Accelerated Algorithm for Distributed Optimization under Sector-Bound Nonlinearity Momentumbasierte beschleunigte Algorithmen zur verteilten Optimierung unter sektorübergreifender Nichtlinearität 部门-基于动力的在部门-健全非线性下分配的优化分配加速计算 2506.22855v1
  • 48 06-28 Consensus seeking in diffusive multidimensional networks with a repeated interaction pattern and time-delays Konsenssuche in diffusen multidimensionalen Netzwerken mit wiederholtem Interaktionsmuster und Zeitverzögerungen 寻求共识,在反复互动模式和拖延时间的多维网络中寻求共识 2402.15677v2
  • 49 06-27 (5) eCAV: An Edge-Assisted Evaluation Platform for Connected Autonomous Vehicles eCAV: Eine Edge Assisted Evaluation Platform für vernetzte autonome Fahrzeuge eCAV: 连接自治车辆的边缘辅助评价平台 2506.16535v2
  • 50 06-27 Toward Data Systems That Are Business Semantic Centric and AI Agents Assisted Auf dem Weg zu Datensystemen, die geschäftsführende semantische Centric- und KI-Agenten sind 建立具有商业语义中心和AI 辅助代理的数据系统 2506.05520v2
  • 51 06-27 Soft Condorcet Optimization for Ranking of General Agents Soft Condorcet Optimierung für das Ranking von General Agents 对一般代理人员排名的优化 2411.00119v4
  • 52 06-27 Exploring Modularity of Agentic Systems for Drug Discovery Erforschung der Modularität von Wirkstoffsystemen für die Drogenentdeckung 探索药物发现剂系统模式 2506.22189v1
  • 53 06-27 Programming Distributed Collective Processes in the eXchange Calculus Programmierung verteilter kollektiver Prozesse im eXchange Calculus eXchange Calculus 中的程序编程分配集体进程 2401.11212v4
  • 54 06-27 SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model SceneDiffuser++: City-Scale Verkehrssimulation über ein Generatives Weltmodell 景点Diffuser++:通过创世模式的城市规模交通量模拟 2506.21976v1
  • 55 06-27 Mitigating Metropolitan Carbon Emissions with Dynamic Eco-driving at Scale Mit dem dynamischen Öko-Fahren im Maßstab die Emissionen von Metropolitankohlenstoff mindern 减缓城市碳排放,在规模上进行动态生态驾驶 2408.05609v2
  • 56 06-27 Design of A* based heuristic algorithm for efficient interdiction in multi-Layer networks Entwurf eines auf A* basierenden heuristischen Algorithmus für effizientes Interdiction in Multi-Layer-Netzwerken 设计基于A* 的超值算法,以有效阻截多路网络 2506.10017v3
  • 57 06-27 ARAG: Agentic Retrieval Augmented Generation for Personalized Recommendation ARAG: Agentische Retrieval Augmented Generation für Personalisierte Empfehlung ARAG: 用于个性化推荐的智能体检索增强生成 2506.21931v1
  • 58 06-27 Cooperative Bearing-Only Target Pursuit via Multiagent Reinforcement Learning: Design and Experiment Cooperative Bearing-Only Target Pursuit über Multiagent-Verstärkung Lernen: Design und Experiment 通过多试剂强化学习,仅以合作定点追踪:设计和实验 2503.08740v2
  • 59 06-26 (4) Sequence Modeling for N-Agent Ad Hoc Teamwork Sequenzmodellierung für N-Agent Ad Hoc Teamwork N-代理特设团队工作的序列建模 2506.05527v2
  • 60 06-26 xChemAgents: Agentic AI for Explainable Quantum Chemistry xChemAgenten: Agentische KI für erklärbare Quantenchemie xchemAgents: 可解释量子化学的AAA剂 2505.20574v2
  • 61 06-26 Will LLMs be Professional at Fund Investment? DeepFund: A Live Arena Perspective Werden LLMs Professional bei Fund Investment sein? DeepFund: Eine Live Arena Perspektive LLM女士在基金投资方面是否具有专业性? 2503.18313v2
  • 62 06-26 Integrated Multimodal Sensing and Communication: Challenges, Technologies, and Architectures Integrierte multimodale Sensing und Kommunikation: Herausforderungen, Technologien und Architekturen 综合多式联运和通信:挑战、技术和结构 2506.22507v1

Article 0

Title@2025-07-03 (4): Agentic Business Process Management: Practitioner Perspectives on Agent Governance in Business Processes

Title: Agentic Business Process Management: Practitioner Perspectives on Agent Governance in Business Processes Agentic Business Process Management: Praxisperspektiven zur Agenten-Governance in Unternehmensprozessen 代理业务流程管理:从业者对业务流程代理治理的看法 2504.03693v2

Authors (5): Hoang Vu, Nataliia Klievtsova, Henrik Leopold, Stefanie Rinderle-Ma, Timotheus Kampik

With the rise of generative AI, industry interest in software agents is growing. Given the stochastic nature of generative AI-based agents, their effective and safe deployment in organizations requires robust governance, which can be facilitated by agentic business process management. However, given the nascence of this new-generation agent notion, it is not clear what BPM practitioners consider to be an agent, and what benefits, risks and governance challenges they associate with agent deployments. To investigate how organizations can effectively govern AI agents, we conducted a qualitative study involving semi-structured interviews with 22 BPM practitioners from diverse industries. They anticipate that agents will enhance efficiency, improve data quality, ensure better compliance, and boost scalability through automation, while also cautioning against risks such as bias, over-reliance, cybersecurity threats, job displacement, and ambiguous decision-making. To address these challenges, the study presents six key recommendations for the responsible adoption of AI agents: define clear business goals, set legal and ethical guardrails, establish human-agent collaboration, customize agent behavior, manage risks, and ensure safe integration with fallback options. Additionally, the paper outlines actions to align traditional BPM with agentic AI, including balancing human and agent roles, redefining human involvement, adapting process structures, and introducing performance metrics. These insights provide a practical foundation for integrating AI agents into business processes while preserving oversight, flexibility, and trust.

随着生成式AI的兴起,业界对软件代理的兴趣正在增加。鉴于基于生成式AI的代理具有随机性,在组织中有效和安全地部署它们需要强有力的治理,而这种治理可以通过代理业务流程管理加以促进。然而,鉴于这种新一代代理概念尚处于萌芽阶段,目前尚不清楚BPM从业人员认为什么是代理,以及他们认为代理部署会带来哪些好处、风险和治理挑战。为了调查各组织如何能够有效地治理AI代理,我们开展了一项定性研究,对来自不同行业的22名BPM从业人员进行了半结构化访谈。他们预计,代理将提高效率,改善数据质量,确保更好的合规,并通过自动化提高可扩展性,同时也告诫要警惕偏见、过度依赖、网络安全威胁、工作岗位流失和决策含糊不清等风险。为了应对这些挑战,该研究提出了负责任地采用AI代理的六项关键建议:确定明确的业务目标、设置法律和道德护栏、建立人与代理的协作、定制代理行为、管理风险,以及确保带有回退选项的安全集成。此外,论文概述了使传统BPM与代理式AI保持一致的行动,包括平衡人与代理的角色、重新定义人的参与、调整流程结构以及引入绩效指标。这些见解为将AI代理整合到业务流程中,同时保持监督、灵活性和信任,提供了实用的基础。


Article 1

Title@2025-07-03 (4): KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs

Title: KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs KERAP: Ein wissensbasierter Ansatz für genaue Null-Shot-Diagnose-Vorhersage mit Multi-Agent LLMs KERAP: 利用多智能体LLMs进行准确零样本诊断预测的知识增强推理方法 2507.02773v1

Authors (8): Yuzhang Xie, Hejie Cui, Ziyang Zhang, Jiaying Lu, Kai Shu, Fadi Nahab, Xiao Hu, Carl Yang

Medical diagnosis prediction plays a critical role in disease detection and personalized healthcare. While machine learning (ML) models have been widely adopted for this task, their reliance on supervised training limits their ability to generalize to unseen cases, particularly given the high cost of acquiring large, labeled datasets. Large language models (LLMs) have shown promise in leveraging language abilities and biomedical knowledge for diagnosis prediction. However, they often suffer from hallucinations, lack structured medical reasoning, and produce useless outputs. To address these challenges, we propose KERAP, a knowledge graph (KG)-enhanced reasoning approach that improves LLM-based diagnosis prediction through a multi-agent architecture. Our framework consists of a linkage agent for attribute mapping, a retrieval agent for structured knowledge extraction, and a prediction agent that iteratively refines diagnosis predictions. Experimental results demonstrate that KERAP enhances diagnostic reliability efficiently, offering a scalable and interpretable solution for zero-shot medical diagnosis prediction.

医学诊断预测在疾病检测和个性化医疗方面发挥着关键作用。虽然机器学习(ML)模型已被广泛用于这项任务,但它们对监督训练的依赖限制了其泛化到未见病例的能力,特别是考虑到获取大型标注数据集的成本很高。大语言模型(LLMs)在利用语言能力和生物医学知识进行诊断预测方面显示了潜力。然而,这些模型往往存在幻觉,缺乏结构化的医学推理,并产生无用的输出。为了应对这些挑战,我们提出了KERAP,一种知识图谱(KG)增强的推理方法,通过多智能体架构改进基于LLM的诊断预测。我们的框架包括用于属性映射的链接智能体、用于结构化知识提取的检索智能体,以及迭代完善诊断预测的预测智能体。实验结果表明,KERAP高效地提高了诊断可靠性,为零样本医学诊断预测提供了可扩展且可解释的解决方案。


Article 2

Title@2025-07-03 (4): A unifying approach to self-organizing systems interacting via conservation laws

Title: A unifying approach to self-organizing systems interacting via conservation laws Ein vereinheitlichter Ansatz für selbstorganisierende Systeme, die über Erhaltungsgesetze interagieren 对通过养护法相互作用的自我组织系统采取统一办法 2507.02575v1

Authors (8): Frank Barrows, Guanming Zhang, Satyam Anand, Zizi Chen, Jonathan Lin, Amman Desai, Stefano Martiniani, Francesco Caravelli

We present a unified framework for embedding and analyzing dynamical systems using generalized projection operators rooted in local conservation laws. By representing physical, biological, and engineered systems as graphs with incidence and cycle matrices, we derive dual projection operators that decompose network fluxes and potentials. This formalism aligns with principles of non-equilibrium thermodynamics and captures a broad class of systems governed by flux-forcing relationships and local constraints. We extend this approach to collective dynamics through the PRojective Embedding of Dynamical Systems (PrEDS), which lifts low-dimensional dynamics into a high-dimensional space, enabling both replication and recovery of the original dynamics. When systems fall within the PrEDS class, their collective behavior can be effectively approximated through projection onto a mean-field space. We demonstrate the versatility of PrEDS across diverse domains, including resistive and memristive circuits, adaptive flow networks (e.g., slime molds), elastic string networks, and particle swarms. Notably, we establish a direct correspondence between PrEDS and swarm dynamics, revealing new insights into optimization and self-organization. Our results offer a general theoretical foundation for analyzing complex networked systems and for designing systems that self-organize through local interactions.

我们提出了一个统一框架,利用植根于局部守恒定律的广义投影算子来嵌入和分析动力系统。通过将物理、生物和工程系统表示为带有关联矩阵和环路矩阵的图,我们推导出分解网络通量和势的对偶投影算子。这种形式体系符合非平衡热力学的原理,并涵盖了由通量-驱动力关系和局部约束所支配的一大类系统。我们通过动力系统的投影嵌入(PrEDS)将这一方法推广到集体动力学,它将低维动力学提升到高维空间,使原始动力学得以复制和恢复。当系统属于PrEDS类时,其集体行为可以通过投影到平均场空间来有效近似。我们展示了PrEDS在不同领域的通用性,包括电阻和忆阻电路、自适应流动网络(例如黏菌)、弹性弦网络和粒子群。值得注意的是,我们建立了PrEDS与群体动力学之间的直接对应关系,揭示了对优化和自组织的新见解。我们的结果为分析复杂网络系统以及设计通过局部相互作用实现自组织的系统提供了一般性的理论基础。
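
The dual projection operators mentioned in the abstract can be illustrated on a toy graph. The following is a minimal sketch, assuming the standard orthogonal projectors onto the row spaces of a reduced incidence matrix `B` and a cycle matrix `A`; the triangle graph, the helper names, and the sample flow are hypothetical illustrations and are not taken from the paper.

```python
from fractions import Fraction

def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)] for row in X]

def transpose(X):
    return [list(col) for col in zip(*X)]

def inverse(M):
    """Gauss-Jordan inverse of a small invertible square matrix, in exact arithmetic."""
    n = len(M)
    aug = [list(map(Fraction, row)) + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(M)]
    for c in range(n):
        pivot = next(r for r in range(c, n) if aug[r][c] != 0)
        aug[c], aug[pivot] = aug[pivot], aug[c]
        p = aug[c][c]
        aug[c] = [v / p for v in aug[c]]
        for r in range(n):
            if r != c and aug[r][c] != 0:
                f = aug[r][c]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[c])]
    return [row[n:] for row in aug]

def projector(M):
    """Orthogonal projector onto the row space of a full-row-rank matrix M."""
    Mt = transpose(M)
    return matmul(matmul(Mt, inverse(matmul(M, Mt))), M)

# Hypothetical triangle graph with oriented edges e0: 0->1, e1: 1->2, e2: 2->0.
# Reduced incidence matrix: rows for nodes 0 and 1 (node 2 dropped for full rank).
B = [[-1, 0, 1],
     [1, -1, 0]]
# Cycle matrix: the single loop that traverses all three edges.
A = [[1, 1, 1]]

omega_B = projector(B)  # projects edge quantities onto the cut (gradient) space
omega_A = projector(A)  # projects edge quantities onto the cycle space

# Decompose a sample edge flow into circulating and gradient parts.
flow = [[Fraction(3)], [Fraction(0)], [Fraction(0)]]
circulating = matmul(omega_A, flow)
gradient = matmul(omega_B, flow)
```

Because the rows of `B` are orthogonal to the row of `A`, the two projectors sum to the identity on the edge space, so `circulating` and `gradient` add back up to `flow`.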


Article 3

Title@2025-07-03 (4): Incorporating LLMs for Large-Scale Urban Complex Mobility Simulation

Title: Incorporating LLMs for Large-Scale Urban Complex Mobility Simulation Einschließlich LLMs für großräumige Urban Complex Mobility Simulation 大型城市综合流动模拟项目LLMs 2505.21880v2

Authors (8): Yu-Lun Song, Chung-En Tsern, Che-Cheng Wu, Yu-Ming Chang, Syuan-Bo Huang, Wei-Chu Chen, Michael Chia-Liang Lin, Yu-Ta Lin

This study presents an innovative approach to urban mobility simulation by integrating a Large Language Model (LLM) with Agent-Based Modeling (ABM). Unlike traditional rule-based ABM, the proposed framework leverages an LLM to enhance agent diversity and realism by generating synthetic population profiles, allocating routine and occasional locations, and simulating personalized routes. Using real-world data, the simulation models individual behaviors and large-scale mobility patterns in Taipei City. Key insights, such as route heat maps and mode-specific indicators, provide urban planners with actionable information for policy-making. Future work will focus on establishing robust validation frameworks to ensure accuracy and reliability in urban planning applications.

本研究通过将大语言模型(LLM)与基于智能体的建模(ABM)相结合,提出了一种城市流动模拟的创新方法。与传统的基于规则的ABM不同,拟议框架利用LLM,通过生成合成人口概况、分配常规和偶发地点以及模拟个性化路线,增强智能体的多样性和真实性。利用现实世界数据,该模拟对台北市的个体行为和大规模流动模式进行建模。路线热图和特定出行方式指标等关键见解,为城市规划者提供了可供决策使用的信息。未来工作的重点是建立强有力的验证框架,以确保城市规划应用的准确性和可靠性。


Article 4

Title@2025-07-03 (4): Benchmarking Generalizable Bimanual Manipulation: RoboTwin Dual-Arm Collaboration Challenge at CVPR 2025 MEIS Workshop

Title: Benchmarking Generalizable Bimanual Manipulation: RoboTwin Dual-Arm Collaboration Challenge at CVPR 2025 MEIS Workshop Benchmarking Generalizable Bimanual Manipulation: RoboTwin Dual-Arm Collaboration Challenge bei CVPR 2025 MEIS Workshop 基准的可通用二手操纵:2025年欧洲气象和气象科学研究所讲习班上的机器人双臂双臂合作挑战 2506.23351v2

Authors (99): Tianxing Chen, Kaixuan Wang, Zhaohui Yang, Yuhao Zhang, Zanxin Chen, Baijun Chen, Wanxi Dong, Ziyuan Liu, Dong Chen, Tianshuo Yang, Haibao Yu, Xiaokang Yang, Yusen Qin, Zhiqiang Xie, Yao Mu, Ping Luo, Tian Nian, Weiliang Deng, Yiheng Ge, Yibin Liu, Zixuan Li, Dehui Wang, Zhixuan Liang, Haohui Xie, Rijie Zeng, Yunfei Ge, Peiqing Cong, Guannan He, Zhaoming Han, Ruocheng Yin, Jingxiang Guo, Lunkai Lin, Tianling Xu, Hongzhe Bi, Xuewu Lin, Tianwei Lin, Shujie Luo, Keyu Li, Ziyan Zhao, Ke Fan, Heyang Xu, Bo Peng, Wenlong Gao, Dongjiang Li, Feng Jin, Hui Shen, Jinming Li, Chaowei Cui, Yu Chen, Yaxin Peng, Lingdong Zeng, Wenlong Dong, Tengfei Li, Weijie Ke, Jun Chen, Erdemt Bao, Tian Lan, Tenglong Liu, Jin Yang, Huiping Zhuang, Baozhi Jia, Shuai Zhang, Zhengfeng Zou, Fangheng Guan, Tianyi Jia, Ke Zhou, Hongjiu Zhang, Yating Han, Cheng Fang, Yixian Zou, Chongyang Xu, Qinglun Zhang, Shen Cheng, Xiaohe Wang, Ping Tan, Haoqiang Fan, Shuaicheng Liu, Jiaheng Chen, Chuxuan Huang, Chengliang Lin, Kaijun Luo, Boyu Yue, Yi Liu, Jinyu Chen, Zichang Tan, Liming Deng, Shuo Xu, Zijian Cai, Shilong Yin, Hao Wang, Hongshan Liu, Tianyang Li, Long Shi, Ran Xu, Huilin Xu, Zhengquan Zhang, Congsheng Xu, Jinchang Yang, Feng Xu

Embodied Artificial Intelligence (Embodied AI) is an emerging frontier in robotics, driven by the need for autonomous systems that can perceive, reason, and act in complex physical environments. While single-arm systems have shown strong task performance, collaborative dual-arm systems are essential for handling more intricate tasks involving rigid, deformable, and tactile-sensitive objects. To advance this goal, we launched the RoboTwin Dual-Arm Collaboration Challenge at the 2nd MEIS Workshop, CVPR 2025. Built on the RoboTwin Simulation platform (1.0 and 2.0) and the AgileX COBOT-Magic Robot platform, the competition consisted of three stages: Simulation Round 1, Simulation Round 2, and a final Real-World Round. Participants tackled a total of 17 dual-arm manipulation tasks, covering rigid, deformable, and tactile-based scenarios. The challenge attracted 64 global teams and over 400 participants, producing top-performing solutions such as SEM and AnchorDP3 and generating valuable insights into generalizable bimanual policy learning. This report outlines the competition setup, task design, evaluation methodology, key findings, and future directions, aiming to support future research on robust and generalizable bimanual manipulation policies. The Challenge Webpage is available at https://robotwin-benchmark.github.io/cvpr-2025-challenge/.

人工智能(Embodied AI)是机器人的新兴前沿,其驱动力是需要能够感知、理性和在复杂的物理环境中行动的自主系统。虽然单臂系统已经表现出很强的任务性能,但协作双臂系统对于处理涉及僵硬、变形和触摸敏感物体的更复杂的任务至关重要。为推进这一目标,我们在第二次MEIS研讨会上发起了机器人双臂双臂协作挑战(CVPR 2025 2025)。在RoboTwin模拟平台(1.0和2.0)和AgileX COBOT-Magic机器人平台(AgilX COBOT-Magic机器人平台)上建立起来,竞争由三个阶段组成:模拟回合1、模拟回合2和最后现实世界回合。参与者完全处理了17项双臂操纵任务,包括僵硬、变形和触角假设。这项挑战吸引了64个全球团队和400多名参与者,产生了像SEM和AnchorDP3这样的最优秀的解决方案,并对通用双体政策学习产生了宝贵的见解。这份报告概述了竞争设置、强有力设计任务设计、关键结果以及未来战略评估方法,这是未来研究的基础研究,目的是要达到总目标。


Article 5

Title@2025-07-03 (4): Horus: A Protocol for Trustless Delegation Under Uncertainty

Title: Horus: A Protocol for Trustless Delegation Under Uncertainty Horus: Ein Protokoll für vertrauenslose Delegation unter Unsicherheit 荷鲁斯:不确定性下的无需信任委托协议 2507.00631v3

Authors (2): David Shi, Kevin Joo

Correctness is an emergent property of systems where exposing error is cheaper than committing it. In dynamic, low-trust environments, autonomous AI agents benefit from delegating work to sub-agents, yet correctness cannot be assured through upfront specification or centralized oversight. We propose a protocol that enforces correctness through collateralized claims in a recursive verification game. Tasks are published as intents, and solvers compete to fulfill them. Selected solvers carry out tasks under risk, with correctness checked post hoc by verifiers. Any challenger can challenge a result by staking against it to trigger the verification process. Incorrect agents are slashed and correct opposition is rewarded, with an escalation path that penalizes erroneous verifiers themselves. When incentives are aligned across solvers, challengers, and verifiers, falsification conditions make correctness the Nash equilibrium.

正确性是暴露错误比实施错误更便宜的系统的一种新兴特性。 在动态的低信任环境中,自主的AI代理商从将工作委托给分代理人中受益,但无法通过先期规格或集中监督来保证正确性。 我们提议了一项协议,在循环性核查游戏中通过抵押债权强制执行正确性。 任务作为意图公布,解决者竞相完成。 选定的解决者执行有风险的任务,由核查者检查是否正确性。 任何挑战者都可以通过对它进行打击以触发核查进程来挑战结果。 错误的代理商被砍断,正确的反对者被奖励,而升级路径则惩罚错误的验证者本身。 当激励措施在解决者、挑战者和核查者之间一致时,伪造的条件可以使纳什平衡得到正确性。
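
The stake-and-challenge idea in the abstract can be sketched as a single settlement round. This is a minimal illustration under stated assumptions: the `Claim` type, the `settle` function, the stake amounts, and the rule that the loser's whole stake goes to the winner are hypothetical, and the recursive escalation path that penalizes erroneous verifiers is deliberately left out.

```python
from dataclasses import dataclass

@dataclass
class Claim:
    solver: str
    result_correct: bool  # ground truth, revealed only when a verifier checks it
    solver_stake: int     # collateral the solver posts behind the result

def settle(claim, challenger=None, challenger_stake=0):
    """Return balance changes after one (non-recursive) verification round.

    Assumption: the verifier is honest and the loser's stake goes to the winner.
    """
    if challenger is None:
        # Nobody challenged, so the solver simply gets the collateral back.
        return {claim.solver: 0}
    if claim.result_correct:
        # The challenge fails: the challenger is slashed in favour of the solver.
        return {claim.solver: challenger_stake, challenger: -challenger_stake}
    # The challenge succeeds: the solver is slashed and the challenger rewarded.
    return {claim.solver: -claim.solver_stake, challenger: claim.solver_stake}

# A wrong result backed by 10 units is challenged with a 4-unit stake.
bad = settle(Claim("solver", False, 10), "challenger", 4)
# A correct result survives the same challenge.
good = settle(Claim("solver", True, 10), "challenger", 4)
```

In this toy setting a challenger gains only by contesting results that are actually wrong, which is the incentive the abstract describes; whether that yields an equilibrium depends on verification costs and the escalation rules, which the sketch does not model.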


Article 6

Title@2025-07-02 (3): Synergizing Logical Reasoning, Knowledge Management and Collaboration in Multi-Agent LLM System

Title: Synergizing Logical Reasoning, Knowledge Management and Collaboration in Multi-Agent LLM System Synergisieren von logischer Vernunft, Wissensmanagement und Zusammenarbeit im Multi-Agent LLM-System 多机构LLM系统统一逻辑理由、知识管理和协作 2507.02170v1

Authors (2): Adam Kostka, Jarosław A. Chudziak

This paper explores the integration of advanced Multi-Agent Systems (MAS) techniques to develop a team of agents with enhanced logical reasoning, long-term knowledge retention, and Theory of Mind (ToM) capabilities. By uniting these core components with optimized communication protocols, we create a novel framework called SynergyMAS, which fosters collaborative teamwork and superior problem-solving skills. The system’s effectiveness is demonstrated through a product development team case study, where our approach significantly enhances performance and adaptability. These findings highlight SynergyMAS’s potential to tackle complex, real-world challenges.

本文件探讨了如何整合先进的多机构系统(MAS)技术,以发展一支具有强化逻辑推理、长期知识保留和思维理论(TOM)能力的代理团队。通过将这些核心组成部分与优化通信协议结合起来,我们创建了名为“协同MAS”的新颖框架,促进协作协作和高超解决问题技能。通过产品开发团队案例研究,我们的方法大大增强了绩效和适应能力,证明了该系统的有效性。这些结论凸显了协同MAS应对复杂、现实世界挑战的潜力。


Article 7

Title@2025-07-02 (3): Enhancing LLM-based Quantum Code Generation with Multi-Agent Optimization and Quantum Error Correction

Title: Enhancing LLM-based Quantum Code Generation with Multi-Agent Optimization and Quantum Error Correction Verbesserung der LLM-basierten Quantencode-Generierung durch Multi-Agent-Optimierung und Quantenfehlerkorrektur 强化基于LLM的量制码生成,并采用多种物力优化和量度错误校正 2504.14557v2

Authors (4): Charlie Campbell, Hao Mark Chen, Wayne Luk, Hongxiang Fan

Multi-agent frameworks with Large Language Models (LLMs) have become promising tools for generating code in general-purpose programming languages using test-driven development, allowing developers to create more accurate and robust code. However, their potential has not been fully unleashed for domain-specific programming languages, where each domain offers unique optimization opportunities for customized improvement. In this paper, we take the first step in exploring multi-agent code generation for quantum programs. By identifying the unique optimizations in quantum designs, such as quantum error correction, we introduce a novel multi-agent framework tailored to generating accurate, fault-tolerant quantum code. Each agent in the framework focuses on distinct optimizations, iteratively refining the code using a semantic analyzer with multi-pass inference, alongside an error correction code decoder. We also examine the effectiveness of inference-time techniques, such as Chain-of-Thought (CoT) and Retrieval-Augmented Generation (RAG), in the context of quantum programming, uncovering observations that differ from those for general-purpose code generation. To evaluate our approach, we develop a test suite to measure the impact each optimization has on the accuracy of the generated code. Our findings indicate that techniques such as structured CoT significantly improve the generation of quantum algorithms by up to 50%. In contrast, we have also found that certain techniques such as RAG show limited improvement, yielding an accuracy increase of only 4%. Moreover, we showcase examples of AI-assisted quantum error prediction and correction, demonstrating the effectiveness of our multi-agent framework in reducing the quantum errors of generated quantum programs.

包含大语言模型(LLMS)的多试剂框架已成为使用测试驱动开发生成通用编程语言的有希望的工具,使开发者能够创建更准确和更稳健的代码。 但是,它们的潜力尚未完全释放给特定域的编程语言, 具体域展示了独特的优化机会以进行定制改进。 在本文件中, 我们首先探索量子程序( 如量子错误校正) 的多试剂代码生成。 通过确定量子设计中独特的优化, 我们引入了一个新的多试剂框架, 以生成准确的、 容错的量代码代码。 框架中的每个代理剂都侧重于不同的优化, 利用带有多密码误差的语义分析器迭代之精度完善代码。 我们还检查了计算时间技术的有效性, 如量子系统链(COT) 和Retrievval- Auged Ping(RAG) 的生成, 揭示了与通用代码生成不同的观测结果。 为了评估我们的方法, 我们开发了一个测试套以测量每个精度的精确度分析器来测量每个精度的代码的精确度, 我们的精确度, 也展示了生成的精确度框架。


Article 8

Title@2025-07-02 (3): Distance-based Relative Orbital Transition for Satellite Swarm Array Deployment Under J2 Perturbation

Title: Distance-based Relative Orbital Transition for Satellite Swarm Array Deployment Under J2 Perturbation Distanzbasierter relativer Orbitalübergang für Satelliten-Swarm-Array-Einsatz unter J2 Perturbation 在J2扰动下部署卫星冲积阵列的相对轨道过渡 2507.01769v1

Authors (2): Yuta Takahashi, Shin-ichiro Sakai

This paper presents an autonomous guidance and control strategy for a satellite swarm that enables scalable distributed space structures for innovative science and business opportunities. The averaged $J_2$ orbital parameters that describe the drift and periodic orbital motion were derived along with their target values to achieve a distributed space structure in a decentralized manner. This enabled the design of a distance-based orbital stabilizer to ensure autonomous deployment into a monolithic formation of a coplanar equidistant configuration on a user-defined orbital plane. Continuous formation control was assumed to be achieved through fuel-free actuation, such as satellite magnetic field interaction and differential aerodynamic forces, thereby maintaining long-term formation stability without thruster usage. A major challenge for such actuation systems is the potential loss of control capability due to increasing inter-satellite distances resulting from unstable orbital dynamics, particularly for autonomous satellite swarms. To mitigate this risk, our decentralized deployment controller minimized drift distance during unexpected communication outages. As a case study, we consider the deployment of palm-sized satellites into a coplanar equidistant formation in a $J_2$-perturbed orbit. Moreover, centralized grouping strategies are presented.

本文提出了一种面向卫星集群的自主制导与控制策略,可为创新性的科学与商业机会构建可扩展的分布式空间结构。我们推导了描述漂移和周期性轨道运动的 $J_2$ 平均轨道参数及其目标值,以便以去中心化的方式实现分布式空间结构。在此基础上,我们设计了基于距离的轨道稳定器,确保卫星在用户定义的轨道平面上自主部署为共面等距构型的整体编队。假设连续编队控制通过无燃料驱动方式实现,例如卫星间磁场相互作用和差分气动力,从而在不使用推进器的情况下保持编队的长期稳定。此类驱动系统面临的一大挑战是:不稳定的轨道动力学会使星间距离不断增大,进而可能丧失控制能力,这对自主卫星集群尤为突出。为降低这一风险,我们的去中心化部署控制器可在意外通信中断期间将漂移距离降至最小。作为案例研究,我们考虑在受 $J_2$ 摄动的轨道上将手掌大小的卫星部署为共面等距编队。此外,本文还给出了集中式分组策略。
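
The drift that the abstract attributes to $J_2$ can be illustrated with the standard textbook expression for the secular (orbit-averaged) regression of the ascending node, $\dot{\Omega} = -\tfrac{3}{2} J_2 (R_E/p)^2 n \cos i$. The sketch below is not the paper's derivation or controller; the function names and the two-satellite comparison are hypothetical, and the constants are commonly quoted Earth values.

```python
import math

MU_EARTH = 398600.4418  # km^3/s^2, Earth's gravitational parameter
R_EARTH = 6378.137      # km, Earth's equatorial radius
J2 = 1.08263e-3         # Earth's second zonal harmonic (dimensionless)


def raan_drift_rate(a_km, ecc, inc_rad):
    """Secular RAAN drift in rad/s under J2 (textbook first-order formula)."""
    n = math.sqrt(MU_EARTH / a_km ** 3)  # mean motion, rad/s
    p = a_km * (1.0 - ecc ** 2)          # semi-latus rectum, km
    return -1.5 * J2 * (R_EARTH / p) ** 2 * n * math.cos(inc_rad)


def differential_drift_deg_per_day(a_km, ecc, inc_rad, delta_a_km):
    """Relative nodal drift between two satellites whose semi-major axes
    differ by delta_a_km (hypothetical helper, for illustration only)."""
    diff = raan_drift_rate(a_km + delta_a_km, ecc, inc_rad) - raan_drift_rate(a_km, ecc, inc_rad)
    return math.degrees(diff) * 86400.0


if __name__ == "__main__":
    # A roughly sun-synchronous orbit at about 700 km altitude.
    rate = raan_drift_rate(7078.137, 0.0, math.radians(98.19))
    print("nodal drift [deg/day]:", math.degrees(rate) * 86400.0)
    print("relative drift for 1 km offset [deg/day]:",
          differential_drift_deg_per_day(7000.0, 0.0, math.radians(51.6), 1.0))
```

Even a small mismatch in semi-major axis yields a nonzero relative drift, which is the kind of slow separation a formation controller has to counteract.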


Article 9

Title@2025-07-02 (3): Agent Ideate: A Framework for Product Idea Generation from Patents Using Agentic AI

Title: Agent Ideate: A Framework for Product Idea Generation from Patents Using Agentic AI Agent-Idee: Ein Rahmen für Produkt-Ideen-Erzeugung aus Patenten mit Agent-KI Agent Ideate: 使用Agentic AI 专利产品创意一代框架 2507.01717v1

Authors (4): Gopichand Kanumolu, Ashok Urlana, Charaka Vinayak Kumar, Bala Mallikarjunarao Garlapati

Patents contain rich technical knowledge that can inspire innovative product ideas, yet accessing and interpreting this information remains a challenge. This work explores the use of Large Language Models (LLMs) and autonomous agents to mine and generate product concepts from a given patent. In this work, we design Agent Ideate, a framework for automatically generating product-based business ideas from patents. We experimented with open-source LLMs and agent-based architectures across three domains: Computer Science, Natural Language Processing, and Material Chemistry. Evaluation results show that the agentic approach consistently outperformed standalone LLMs in terms of idea quality, relevance, and novelty. These findings suggest that combining LLMs with agentic workflows can significantly enhance the innovation pipeline by unlocking the untapped potential of business idea generation from patent data.

专利包含丰富的技术知识,可以激发创新产品想法,但获取和解释这种信息仍是一项挑战。这项工作探索了使用大语言模型和自主代理商来开采和从特定专利中产生产品概念。在这项工作中,我们设计了“设计”代理商,这是一个自动产生专利产品商业理念的框架。我们试验了在计算机科学、自然语言处理和材料化学三个领域(计算机科学、自然语言处理和材料化学)的开放源LLM和代理商建筑。评价结果显示,在思想质量、相关性和新颖性方面,该代理商做法一贯优于独立的LMS。这些研究结果表明,将LMS与代理工作流程相结合,通过从专利数据中释放尚未开发的产生商业理念的潜力,可以极大地增强创新管道。


Article 10

Title@2025-07-02 (3): Exploring Advanced LLM Multi-Agent Systems Based on Blackboard Architecture

Title: Exploring Advanced LLM Multi-Agent Systems Based on Blackboard Architecture Erforschen von fortgeschrittenen LLM-Multi-Agent-Systemen auf der Basis von Tafelarchitektur 探索基于黑板架构的高级LLM多机构系统 2507.01701v1

Authors (2): Bochen Han, Songmao Zhang

In this paper, we propose to incorporate the blackboard architecture into LLM multi-agent systems (MASs) so that (1) agents with various roles can share all the information and others’ messages during the whole problem-solving process, (2) agents that will take actions are selected based on the current content of the blackboard, and (3) the selection and execution round is repeated until a consensus is reached on the blackboard. We develop the first implementation of this proposal and conduct experiments on commonsense knowledge, reasoning and mathematical datasets. The results show that our system is competitive with the SOTA static and dynamic MASs, achieving the best average performance while spending fewer tokens. Our proposal has the potential to enable complex and dynamic problem-solving where well-defined structures or workflows are unavailable.

在本文中,我们提议将黑板结构纳入LLM多试剂系统,以便(1) 在整个解决问题的过程中,具有各种作用的代理人能够分享所有信息和其他信息;(2) 将根据黑板当前内容选择采取行动的代理人;(3) 选择和执行回合重复,直到就黑板达成共识;我们首次实施这一提议,并进行关于常识知识、推理和数学数据集的实验;结果显示,通过实现最佳平均性能,我们的系统能够与SOTA静态和动态的MAS具有竞争力,同时设法减少象征性开支;我们的建议有可能在没有明确界定的结构或工作流程的地方促成复杂和动态的解决问题。
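
The three-step loop in the abstract (shared blackboard, content-based agent selection, repeat until consensus) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the `Blackboard`, `Agent`, `run_blackboard`, and `unanimous` names are hypothetical, and plain Python callables stand in for LLM calls.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple


@dataclass
class Blackboard:
    """Shared store that every agent can read and append to."""
    messages: List[Tuple[str, str]] = field(default_factory=list)

    def post(self, author: str, content: str) -> None:
        self.messages.append((author, content))


@dataclass
class Agent:
    name: str
    is_relevant: Callable[[Blackboard], bool]  # stand-in for LLM-based selection
    act: Callable[[Blackboard], str]           # stand-in for an LLM response


def last_message_per_author(board: Blackboard) -> dict:
    latest = {}
    for author, content in board.messages:
        latest[author] = content
    return latest


def unanimous(board: Blackboard, min_agents: int = 2) -> Optional[str]:
    """Toy consensus rule: all agents' latest messages agree."""
    latest = last_message_per_author(board)
    values = set(latest.values())
    if len(latest) >= min_agents and len(values) == 1:
        return values.pop()
    return None


def run_blackboard(agents, board, consensus, max_rounds=10):
    """Repeat select-and-execute rounds until consensus or the round limit."""
    for round_no in range(1, max_rounds + 1):
        selected = [a for a in agents if a.is_relevant(board)]
        if not selected:
            return None, round_no - 1
        for agent in selected:
            board.post(agent.name, agent.act(board))
        answer = consensus(board)
        if answer is not None:
            return answer, round_no
    return None, max_rounds


def run_demo():
    solver = Agent("solver", lambda b: True, lambda b: "4")
    # The critic only joins once something is on the board, then echoes the solver.
    critic = Agent("critic", lambda b: bool(b.messages),
                   lambda b: last_message_per_author(b)["solver"])
    return run_blackboard([solver, critic], Blackboard(), unanimous)


if __name__ == "__main__":
    print(run_demo())
```

In this toy run the critic is not selected in the first round because the board is still empty, which mirrors the idea that selection depends on the blackboard's current content.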


Article 11

Title@2025-07-02 (3): Co-Optimizing Reconfigurable Environments and Policies for Decentralized Multi-Agent Navigation

Title: Co-Optimizing Reconfigurable Environments and Policies for Decentralized Multi-Agent Navigation Co-Optimierung neu konfigurierbarer Umgebungen und Politiken für dezentralisierte Multi-Agent-Navigation 共同优化可重新配置的环境和权力下放多机构导航政策 2403.14583v2

Authors (3): Zhan Gao, Guang Yang, Amanda Prorok

This work views the multi-agent system and its surrounding environment as a co-evolving system, where the behavior of one affects the other. The goal is to take both agent actions and environment configurations as decision variables, and optimize these two components in a coordinated manner to improve some measure of interest. Towards this end, we consider the problem of decentralized multi-agent navigation in a cluttered environment, where we assume that the layout of the environment is reconfigurable. By introducing two sub-objectives, multi-agent navigation and environment optimization, we propose an agent-environment co-optimization problem and develop a coordinated algorithm that alternates between these sub-objectives to search for an optimal synthesis of agent actions and environment configurations, ultimately improving navigation performance. Because the relation between the agents, the environment, and their performance therein is difficult to model explicitly, we leverage policy gradient to formulate a model-free learning mechanism within the coordinated framework. A formal convergence analysis shows that our coordinated algorithm tracks the local minimum solution of an associated time-varying non-convex optimization problem. Experiments corroborate theoretical findings and show the benefits of co-optimization. Interestingly, the results also indicate that optimized environments can offer structural guidance to de-conflict agents in motion.

这项工作将多试剂系统及其周围环境视为一个共同演变的系统,其中一方的行为会影响另一方的行为。目标是将代理行为和环境配置同时作为决定变量,并以协调的方式优化这两个组成部分,以提高某种程度的兴趣。为此,我们考虑在一种杂乱的环境中分散多试剂导航的问题,我们假设环境的布局是可以重新配置的。通过引入两个次级目标 – – 多试剂导航和环境优化 – – 我们提出一个代理-环境共同优化问题,并开发一种协调的算法,以替代这些次级目标,寻找最佳的代理行为和环境配置综合;最终,改进导航性能。由于明确模拟代理人、环境及其在其中的性能之间的关系的挑战,我们利用政策梯度来制定在协调框架内不使用模型的学习机制。正式的趋同分析表明,我们协调的算法追踪了相关时间分配非convex优化问题的当地最低解决办法。实验证实了理论性结论,并展示了冲突动态推动者提供最佳结构化指导的好处。
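
The alternating structure described in the abstract can be illustrated on a toy problem. The sketch below is only an assumption-laden stand-in: the quadratic `cost`, the scalar parameters `theta` (agent) and `env` (environment), and the finite-difference gradients are all hypothetical and replace the paper's policy-gradient machinery.

```python
def numerical_grad(f, x, h=1e-5):
    """Central finite-difference derivative; only cost evaluations are needed."""
    return (f(x + h) - f(x - h)) / (2 * h)


def cost(theta, env):
    """Toy cost: the agent parameter should match the environment parameter,
    and the environment parameter has its own preferred value of 3."""
    return (theta - env) ** 2 + (env - 3.0) ** 2


def co_optimize(theta=0.0, env=0.0, lr=0.1, iters=500):
    """Alternate one agent update and one environment update per iteration."""
    for _ in range(iters):
        # Sub-objective 1: improve the agent with the environment held fixed.
        theta -= lr * numerical_grad(lambda t: cost(t, env), theta)
        # Sub-objective 2: improve the environment with the agent held fixed.
        env -= lr * numerical_grad(lambda e: cost(theta, e), env)
    return theta, env


if __name__ == "__main__":
    theta, env = co_optimize()
    print(theta, env, cost(theta, env))
```

On this convex toy cost the alternation settles at `theta = env = 3`; the paper's setting is non-convex and time-varying, so only convergence to a local solution is claimed there.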


Article 12

Title@2025-07-02 (3): Agent-as-Tool: A Study on the Hierarchical Decision Making with Reinforcement Learning

Title: Agent-as-Tool: A Study on the Hierarchical Decision Making with Reinforcement Learning Agent-as-Tool: Eine Studie über die hierarchische Entscheidungsfindung mit Verstärkungslernen Agent-as-Tool:关于以强化学习方式作出等级决策的研究 2507.01489v1

Authors (1): Yanfei Zhang

Large Language Models (LLMs) have emerged as one of the most significant technological advancements in artificial intelligence in recent years. Their ability to understand, generate, and reason with natural language has transformed how we interact with AI systems. With the development of LLM-based agents and reinforcement-learning-based reasoning models, applying reinforcement learning in agent frameworks has become a new research focus. However, all previous studies face the challenge of deciding the tool-calling process and the reasoning process simultaneously, and their chain of reasoning relies solely on the unprocessed raw output of the tool, which contains redundant information and symbols unrelated to the task and imposes a heavy burden on the model’s ability to reason. Therefore, we propose Agent-as-tool, a hierarchical framework that detaches the tool-calling process from the reasoning process, enabling the model to focus on verbal reasoning while tool calling is handled by another agent. Our work achieves comparable results with only slight reinforcement fine-tuning on 180 samples, and performs exceptionally well on Bamboogle, with 63.2% exact match and 75.2% cover exact match, exceeding Search-R1 by 4.8% in exact match and 3.2% in cover exact match.

大型语言模型(LLMs)是近年来人工智能领域最重要的技术进步之一。它们理解、生成自然语言并据此推理的能力改变了我们与AI系统交互的方式。随着基于LLM的智能体和基于强化学习的推理模型的发展,在智能体框架中应用强化学习已成为新的研究焦点。然而,以往的研究都面临需要同时决定工具调用过程和推理过程的挑战,而且推理链完全依赖工具返回的未经处理的原始结果,其中包含冗余信息和与任务无关的符号,给模型的推理能力带来沉重负担。因此,我们提出了分层框架Agent-as-tool,将工具调用过程与推理过程分离,使模型能够专注于语言推理,而工具调用由另一个智能体处理。我们的工作仅在180个样本上进行了轻量的强化微调,就取得了可比的结果,并在Bamboogle上表现突出:精确匹配(exact match)为63.2%,覆盖精确匹配(cover exact match)为75.2%,分别比Search-R1高出4.8%和3.2%。
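
The two metrics reported above can be made concrete with a short sketch. The abstract does not define them, so the following assumes the common question-answering convention: exact match requires the normalized prediction to equal a reference answer, while cover exact match only requires a reference answer to appear inside the normalized prediction. The normalization steps and function names are assumptions.

```python
import re
import string


def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, references):
    """True if the normalized prediction equals any normalized reference."""
    pred = normalize(prediction)
    return any(pred == normalize(ref) for ref in references)


def cover_exact_match(prediction, references):
    """True if any normalized reference occurs inside the normalized prediction."""
    pred = normalize(prediction)
    return any(normalize(ref) in pred for ref in references)


if __name__ == "__main__":
    refs = ["Eiffel Tower"]
    print(exact_match("The Eiffel Tower.", refs))
    print(exact_match("It is the Eiffel Tower in Paris", refs))
    print(cover_exact_match("It is the Eiffel Tower in Paris", refs))
```

Under this convention cover exact match is never lower than exact match, which is consistent with the 75.2% versus 63.2% figures quoted in the abstract.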


Article 13

Title@2025-07-02 (3): BioMARS: A Multi-Agent Robotic System for Autonomous Biological Experiments

Title: BioMARS: A Multi-Agent Robotic System for Autonomous Biological Experiments BioMARS: Ein Multi-Agenten-Robotersystem für autonome biologische Experimente BioMARS:一个用于自主生物实验的多功能机器人系统 2507.01485v1

Authors (10): Yibo Qiu, Zan Huang, Zhiyu Wang, Handi Liu, Yiling Qiao, Yifeng Hu, Shu’ang Sun, Hangke Peng, Ronald X Xu, Mingzhai Sun

Large language models (LLMs) and vision-language models (VLMs) have the potential to transform biological research by enabling autonomous experimentation. Yet, their application remains constrained by rigid protocol design, limited adaptability to dynamic lab conditions, inadequate error handling, and high operational complexity. Here we introduce BioMARS (Biological Multi-Agent Robotic System), an intelligent platform that integrates LLMs, VLMs, and modular robotics to autonomously design, plan, and execute biological experiments. BioMARS uses a hierarchical architecture: the Biologist Agent synthesizes protocols via retrieval-augmented generation; the Technician Agent translates them into executable robotic pseudo-code; and the Inspector Agent ensures procedural integrity through multimodal perception and anomaly detection. The system autonomously conducts cell passaging and culture tasks, matching or exceeding manual performance in viability, consistency, and morphological integrity. It also supports context-aware optimization, outperforming conventional strategies in differentiating retinal pigment epithelial cells. A web interface enables real-time human-AI collaboration, while a modular backend allows scalable integration with laboratory hardware. These results highlight the feasibility of generalizable, AI-driven laboratory automation and the transformative role of language-based reasoning in biological research.

大型语言模型(LLMS)和视觉语言模型(VLMS)具有通过自主实验改造生物研究的潜力。然而,它们的应用仍然受到僵硬的协议设计、对动态实验室条件的适应性有限、错误处理不足和高度操作复杂性的限制。这里我们引入了生物MARS(生物多动机器人系统),这是一个将LLMS、VLMS和模块机器人整合到自主设计、规划和实施生物实验的智能平台。生物MARS使用一个等级结构:生物学剂通过回溯生成合成协议;技术员将协议转化为可执行的机器人伪代码;技术员代理将协议转化为可执行的机器人伪代码;以及检查员代理确保程序的完整性,通过多式联运的认知和异常检测。这个系统自主地执行细胞传承和文化任务,在可行性、一致性和形态完整性方面匹配或超过手工性性性能。它也支持环境觉醒优化、优于区分硬性皮质皮细胞细胞细胞细胞的常规战略。一个网络界面能够实时进行人类-AI合作,而模块后端则允许与实验室硬件进行可伸缩的整合。这些结果突出了生物革命性研究、AI驱动的实验室自动化工具的作用。


Article 14

Title@2025-07-02 (3): RALLY: Role-Adaptive LLM-Driven Yoked Navigation for Agentic UAV Swarms

Title: RALLY: Role-Adaptive LLM-Driven Yoked Navigation for Agentic UAV Swarms RALLY: Rollenadaptive LLM-getriebene Yoked-Navigation für Agentische UAV-Swarme RALLY:面向智能体无人机集群的角色自适应LLM驱动联动导航 2507.01378v1

Authors (7): Ziyao Wang, Rongpeng Li, Sizhao Li, Yuming Xiang, Haiping Wang, Zhifeng Zhao, Honggang Zhang

Intelligent control of Unmanned Aerial Vehicle (UAV) swarms has emerged as a critical research focus, and it typically requires the swarm to navigate effectively while avoiding obstacles and achieving continuous coverage over multiple mission targets. Although traditional Multi-Agent Reinforcement Learning (MARL) approaches offer dynamic adaptability, they are hindered by the semantic gap in numerical communication and the rigidity of homogeneous role structures, resulting in poor generalization and limited task scalability. Recent advances in Large Language Model (LLM)-based control frameworks demonstrate strong semantic reasoning capabilities by leveraging extensive prior knowledge. However, due to the lack of online learning and over-reliance on static priors, these works often struggle with effective exploration, leading to reduced individual potential and overall system performance. To address these limitations, we propose RALLY, a Role-Adaptive LLM-Driven Yoked navigation algorithm. Specifically, we first develop an LLM-driven semantic decision framework that uses structured natural language for efficient semantic communication and collaborative reasoning. Afterward, we introduce a dynamic role-heterogeneity mechanism for adaptive role switching and personalized decision-making. Furthermore, we propose a Role-value Mixing Network (RMIX)-based assignment strategy that integrates LLM offline priors with MARL online policies to enable semi-offline training of role selection strategies. Experiments in the Multi-Agent Particle Environment (MPE) and on a Software-In-The-Loop (SITL) platform demonstrate that RALLY outperforms conventional approaches in terms of task coverage, convergence speed, and generalization, highlighting its strong potential for collaborative navigation in agentic multi-UAV systems.

无人机(UAV)集群的智能控制已成为关键的研究焦点,通常要求集群在避障的同时有效导航,并对多个任务目标实现持续覆盖。传统的多智能体强化学习(MARL)方法虽然具有动态适应性,但受限于数值通信中的语义鸿沟和同质角色结构的僵化,泛化能力差,任务可扩展性有限。近期基于大语言模型(LLM)的控制框架利用丰富的先验知识,展现出强大的语义推理能力。然而,由于缺乏在线学习且过度依赖静态先验,这些工作往往难以有效探索,导致个体潜力和系统整体性能下降。为解决这些局限,我们提出了角色自适应的LLM驱动联动导航算法RALLY。具体而言,我们首先构建了LLM驱动的语义决策框架,使用结构化自然语言进行高效的语义通信和协同推理。随后,我们引入动态角色异质性机制,以实现自适应角色切换和个性化决策。此外,我们提出了基于角色价值混合网络(RMIX)的分配策略,将LLM离线先验与MARL在线策略相结合,实现角色选择策略的半离线训练。在多智能体粒子环境(MPE)和软件在环(SITL)平台上的实验表明,RALLY在任务覆盖率、收敛速度和泛化能力方面优于传统方法,凸显了其在智能体多无人机系统协同导航中的巨大潜力。


Article 15

Title@2025-07-02 (3): Cooperative Target Capture in 3D Engagements over Switched Dynamic Graphs

Title: Cooperative Target Capture in 3D Engagements over Switched Dynamic Graphs Kooperative Zielerfassung in 3D-Verpflichtungen über gewechselte dynamische Graphen 通过切换动态图表进行三维参与中的合作目标抓取 2507.01350v1

Authors (2): Abhinav Sinha, Shashi Ranjan Kumar

This paper presents a leaderless cooperative guidance strategy for simultaneous time-constrained interception of a stationary target when the interceptors exchange information over switched dynamic graphs. We specifically focus on scenarios when the interceptors lack radial acceleration capabilities, relying solely on their lateral acceleration components. This consideration aligns with their inherent kinematic turn constraints. The proposed strategy explicitly addresses the complexities of coupled 3D engagements, thereby mitigating performance degradation that typically arises when the pitch and yaw channels are decoupled into two separate, mutually orthogonal planar engagements. Moreover, our formulation incorporates modeling uncertainties associated with the time-to-go estimation into the derivation of cooperative guidance commands to ensure robustness against inaccuracies in dynamic engagement scenarios. To optimize control efficiency, we analytically derive the lateral acceleration components in the orthogonal pitch and yaw channels by solving an instantaneous optimization problem, subject to an affine constraint. We show that the proposed cooperative guidance commands guarantee consensus in time-to-go values within a predefined time, which can be prescribed as a design parameter, regardless of the interceptors’ initial configurations. We provide simulations to attest to the efficacy of the proposed method.

本文为在截击器交换换动图形信息时同时对固定目标进行有时间限制的拦截提供了一个没有领头的合作指导战略。 我们特别侧重于截击器缺乏辐射加速能力的情景, 仅依靠其横向加速组件。 这种考虑符合其固有的运动转变限制。 拟议的战略明确解决了3D结合的复杂问题, 从而缓解了通常在投球和亚乌渠道分解成两种不同的、 相互或交错的平面操作时产生的性能退化。 此外, 我们的提法将与时间到地面估计有关的不确定性模型纳入合作指导指令的衍生, 以确保动态参与情景中不准确的稳健性。 为了优化控制效率, 我们通过解决瞬间优化问题, 分析在垂直优化时空问题 。 我们表明, 拟议的合作指导指令保证在预定的时间内在时间到地面的数值上达成共识, 无论拦截器的初始配置如何, 我们提供模拟来证明拟议方法的有效性。


Article 16

Title@2025-07-02 (3): Aitomia: Your Intelligent Assistant for AI-Driven Atomistic and Quantum Chemical Simulations

Title: Aitomia: Your Intelligent Assistant for AI-Driven Atomistic and Quantum Chemical Simulations Aitomia: Ihr intelligenter Assistent für KI-getriebene Atomistische und Quantum Chemical Simulationen Aitomia:您对AI-Driven原子学和量子化学模拟的智能助理 2505.08195v2

Authors (6): Jinming Hu, Hassan Nawaz, Yuting Rui, Lijie Chi, Arif Ullah, Pavlo O. Dral

We have developed Aitomia - a platform powered by AI to assist in performing AI-driven atomistic and quantum chemical (QC) simulations. This evolving intelligent assistant platform is equipped with chatbots and AI agents to help experts and guide non-experts in setting up and running the atomistic simulations, monitoring their computation status, analyzing the simulation results, and summarizing them for the user in text and graphical forms. We achieve these goals by exploiting open-source large language models (LLMs, original and fine-tuned), rule-based agents, and a retrieval-augmented generation (RAG) system. Aitomia leverages the versatility of our MLatom ecosystem, supporting AI-enhanced computational chemistry tasks ranging from ground- to excited-state calculations such as geometry optimizations, thermochemistry, and spectra calculations. Aitomia is the first intelligent assistant publicly accessible online on a cloud computing platform for atomistic simulations of broad scope (Aitomistic Hub at https://aitomistic.xyz), while it may also be deployed locally as described at http://mlatom.com/aitomia. Aitomia is expected to lower the barrier to performing atomistic simulations, democratizing simulations, and accelerating research and development in the relevant fields.

我们开发了Aitimia,这是一个由AI提供动力的平台,用于协助进行AI驱动的原子学和量子化学模拟(QC),这个不断发展的智能助理平台配备了聊天机和AI代理器,帮助专家和指导非专家建立和运行原子学模拟,监测其计算状况,分析模拟结果,并以文字和图形形式为用户总结这些结果。我们通过利用开放源的大语言模型(LLLMS、原始和微调)、基于规则的代理器和检索型代(RAG)系统实现这些目标。Aitomia利用了我们MLatom生态系统的多功能,支持AI加强的计算化学任务,从地面到动力状态计算,如几何测量优化、热化和光谱计算。Aitomia是第一个在广域的原子学模拟(Aitomicicicic Hubs at https://aitomisticistic.xyz)上公开访问的云计算平台的智能助手,同时它也可能部署在以下领域进行加速的模拟。


Article 17

Title@2025-07-02 (3): Optimal Dispersion Under Asynchrony

Title: Optimal Dispersion Under Asynchrony Optimale Dispersion unter Asynchronie Asynconsrony 下的优化分散 2507.01298v1

Authors (5): Debasish Pattanayak, Ajay D. Kshemkalyani, Manish Kumar, Anisur Rahaman Molla, Gokarna Sharma

We study the dispersion problem in anonymous port-labeled graphs: $k \leq n$ mobile agents, each with a unique ID and initially located arbitrarily on the nodes of an $n$-node graph with maximum degree $\Delta$, must autonomously relocate so that no node hosts more than one agent. Dispersion serves as a fundamental task in distributed computing of mobile agents, and its complexity stems from key challenges in local coordination under anonymity and limited memory. The goal is to minimize both the time to achieve dispersion and the memory required per agent. It is known that any algorithm requires $\Omega(k)$ time in the worst case, and $\Omega(\log k)$ bits of memory per agent. A recent result [SPAA’25] gives an optimal $O(k)$-time algorithm in the synchronous setting and an $O(k \log k)$-time algorithm in the asynchronous setting, both using $O(\log(k+\Delta))$ bits. In this paper, we close the complexity gap in the asynchronous setting by presenting the first dispersion algorithm that runs in optimal $O(k)$ time using $O(\log(k+\Delta))$ bits of memory per agent. Our solution is based on a novel technique we develop in this paper that constructs a port-one tree in anonymous graphs, which may be of independent interest.

我们研究匿名端口标记图中的分散(dispersion)问题:$k \leq n$ 个移动智能体各自拥有唯一ID,初始时任意分布在一个最大度为 $\Delta$ 的 $n$ 节点图的节点上,它们必须自主迁移,使得每个节点至多容纳一个智能体。分散是移动智能体分布式计算中的一项基本任务,其复杂性源于匿名性和有限内存条件下局部协调的关键挑战。目标是同时最小化完成分散所需的时间和每个智能体所需的内存。已知任何算法在最坏情况下都需要 $\Omega(k)$ 的时间,且每个智能体需要 $\Omega(\log k)$ 比特的内存。最近的一项结果[SPAA’25]给出了同步设定下最优的 $O(k)$ 时间算法和异步设定下的 $O(k \log k)$ 时间算法,二者均使用 $O(\log(k+\Delta))$ 比特。本文提出了首个在异步设定下以最优 $O(k)$ 时间运行、每个智能体使用 $O(\log(k+\Delta))$ 比特内存的分散算法,从而弥合了异步设定下的复杂度差距。我们的解法基于本文提出的一种新技术,即在匿名图中构造端口一树(port-one tree),该技术本身可能具有独立的研究价值。
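
To make the problem statement concrete, the sketch below simulates a naive baseline for the rooted case, where all $k$ agents start on one node: the group performs a depth-first search and leaves one agent on each newly reached empty node. This is explicitly not the paper's algorithm (no port-one tree, no asynchrony, no memory accounting); the `ports` representation and the function names are hypothetical, and the simulation is centralized for readability.

```python
def rooted_dfs_dispersion(ports, start, k):
    """Simulate a group DFS that settles one agent per newly visited node.

    ports: dict mapping node -> list of neighbors, where the list index
           plays the role of the local port number (simple graphs only).
    Returns (settled, moves): settled maps node -> index of the agent left
    there, and moves counts edge traversals made by the moving group.
    """
    if k > len(ports):
        raise ValueError("dispersion needs k <= n")
    settled = {}
    moves = 0

    def visit(node, parent):
        nonlocal moves
        settled[node] = len(settled)  # next agent stays on this node
        for neighbor in ports[node]:  # try ports in increasing order
            if len(settled) == k:
                return                # every agent has a node
            if neighbor == parent:
                continue              # the settled agent remembers the parent port
            moves += 1                # traverse the port
            if neighbor in settled:
                moves += 1            # occupied node: step back
            else:
                visit(neighbor, node)
                if len(settled) < k:
                    moves += 1        # backtrack with the remaining agents

    visit(start, None)
    return settled, moves


def is_dispersed(positions):
    """Dispersion holds when no node hosts more than one agent."""
    return len(set(positions)) == len(positions)


if __name__ == "__main__":
    path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
    settled, moves = rooted_dfs_dispersion(path, start=0, k=3)
    print(settled, moves, is_dispersed(list(settled.keys())))
```

The recursion keeps the example short; a distributed version would have each settled agent store only its parent port, which is where the $O(\log(k+\Delta))$-bit memory budget mentioned in the abstract becomes relevant.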


Article 18

Title@2025-07-02 (3): Adaptive Traffic Signal Control based on Multi-Agent Reinforcement Learning. Case Study on a simulated real-world corridor

Title: Adaptive Traffic Signal Control based on Multi-Agent Reinforcement Learning. Case Study on a simulated real-world corridor Adaptive Verkehrssignalsteuerung auf Basis des Multi-Agenten-Verstärkungslernens. Fallstudie zu einem simulierten Real-World-Korridor 基于多机构强化学习的适应性交通信号控制,模拟现实世界走廊案例研究 2503.02189v4

Authors (3): Dickness Kakitahi Kwesiga, Angshuman Guin, Michael Hunter

Previous studies that have formulated multi-agent reinforcement learning (RL) algorithms for adaptive traffic signal control have primarily used value-based RL methods. However, recent literature has shown that policy-based methods may perform better in partially observable environments. Additionally, RL methods remain largely untested on real-world signal timing plans because of the simplifying assumptions common in the literature. The current study attempts to address these gaps and formulates a multi-agent proximal policy optimization (MA-PPO) algorithm to implement adaptive and coordinated traffic control along an arterial corridor. The formulated MA-PPO has a centralized-critic architecture under a centralized training and decentralized execution framework. Agents are designed to allow selection and implementation of up to eight signal phases, as commonly implemented in field controllers. The formulated algorithm is tested on a simulated real-world seven-intersection corridor. The speed of convergence for each agent was found to depend on the size of the action space, which in turn depends on the number and sequence of signal phases. The performance of the formulated MA-PPO adaptive control algorithm is compared with the field-implemented actuated-coordinated signal control (ASC), modeled using PTV-Vissim-MaxTime software-in-the-loop simulation (SILs). The trained MA-PPO performed significantly better than the ASC for all movements. Compared to ASC, the MA-PPO showed 2% and 24% improvements in travel time in the primary and secondary coordination directions, respectively. For cross-street movements, MA-PPO also showed significant crossing time reductions. Volume sensitivity experiments revealed that the formulated MA-PPO demonstrated good stability, robustness, and adaptability to changes in traffic demand.

以往为自适应交通信号控制设计多智能体强化学习(RL)算法的研究主要采用基于价值的RL方法。然而,近期文献表明,基于策略的方法在部分可观测环境中可能表现更好。此外,由于文献中常见的简化假设,RL方法在真实世界的信号配时方案上基本未经检验。本研究试图弥补这些不足,提出了一种多智能体近端策略优化(MA-PPO)算法,用于在一条干线走廊上实现自适应协调交通控制。所提出的MA-PPO在集中训练、分散执行的框架下采用集中式评论家(critic)架构。智能体可选择并执行多达八个信号相位,与现场控制器的常见做法一致。该算法在一条模拟的真实七交叉口走廊上进行了测试。结果发现,每个智能体的收敛速度取决于动作空间的大小,而动作空间的大小又取决于信号相位的数量和顺序。我们将MA-PPO自适应控制算法与现场实施的感应协调信号控制(ASC)进行了比较,后者使用PTV-Vissim-MaxTime软件在环仿真(SILs)建模。训练后的MA-PPO在所有流向上的表现均明显优于ASC。与ASC相比,MA-PPO在主要和次要协调方向上的行程时间分别改善了2%和24%。对于横向街道的流向,MA-PPO也显著缩短了通过时间。流量敏感性实验表明,所提出的MA-PPO具有良好的稳定性、鲁棒性以及对交通需求变化的适应能力。
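
For readers unfamiliar with proximal policy optimization, the sketch below computes the standard clipped surrogate objective that PPO maximizes for each agent's policy. It is a generic illustration, not the paper's MA-PPO: the abstract does not state which PPO variant or hyperparameters were used, so the clip range `eps=0.2` and the function name are assumptions.

```python
def ppo_clip_objective(ratios, advantages, eps=0.2):
    """Mean clipped surrogate objective.

    ratios:     new-policy / old-policy probability ratios per sample
    advantages: advantage estimates per sample (in a centralized-critic
                setup these would come from the shared critic)
    """
    if len(ratios) != len(advantages) or not ratios:
        raise ValueError("need equally sized, non-empty inputs")
    terms = []
    for ratio, adv in zip(ratios, advantages):
        clipped = max(1.0 - eps, min(ratio, 1.0 + eps))
        # Taking the minimum removes any incentive to push the ratio
        # outside the [1 - eps, 1 + eps] trust region.
        terms.append(min(ratio * adv, clipped * adv))
    return sum(terms) / len(terms)


if __name__ == "__main__":
    print(ppo_clip_objective([1.5], [1.0]))   # gain is capped by the clip
    print(ppo_clip_objective([0.5], [-1.0]))  # pessimistic branch is chosen
    print(ppo_clip_objective([1.0], [2.0]))   # unclipped at ratio 1
```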


Article 19

Title@2025-07-01 (2): Dynamic Strategy Adaptation in Multi-Agent Environments with Large Language Models

Title: Dynamic Strategy Adaptation in Multi-Agent Environments with Large Language Models Dynamische Strategieanpassung in Multi-Agent-Umgebungen mit großen Sprachmodellen 具有大语言模式的多机构环境中的动态战略适应 2507.02002v1

Authors (4): Shaurya Mallampati, Rashed Shelim, Walid Saad, Naren Ramakrishnan

Large language models (LLMs) demonstrate strong reasoning abilities across mathematical, strategic, and linguistic tasks, yet little is known about how well they reason in dynamic, real-time, multi-agent scenarios, such as collaborative environments in which agents continuously adapt to each other’s behavior, as in cooperative gameplay settings. In this paper, we bridge this gap by combining LLM-driven agents with strategic reasoning and real-time adaptation in cooperative, multi-agent environments grounded in game-theoretic principles such as belief consistency and Nash equilibrium. The proposed framework applies broadly to dynamic scenarios in which agents coordinate, communicate, and make decisions in response to continuously changing conditions. We provide real-time strategy refinement and adaptive feedback mechanisms that enable agents to dynamically adjust policies based on immediate contextual interactions, in contrast to previous efforts that evaluate LLM capabilities in static or turn-based settings. Empirical results show that our method achieves up to a 26% improvement in return over PPO baselines in high-noise environments, while maintaining real-time latency under 1.05 milliseconds. Our approach improves collaboration efficiency, task completion rates, and flexibility, illustrating that game-theoretic guidance integrated with real-time feedback enhances LLM performance, ultimately fostering more resilient and flexible strategic multi-agent systems.

大型语言模型(LLMS)在数学、战略和语言任务中表现出很强的推理能力,然而,在动态的、实时的、多试剂的情景中,例如合作游戏环境中,我们很少知道这些模型在动态的、实时的、多试剂的情景中的理由何在,例如代理不断适应彼此行为的协作环境,如合作游戏环境。在本文中,我们通过将LLM驱动的代理物与基于合作、基于信仰一致性和纳什平衡等游戏理论的合作、多试剂环境的战略推理和实时适应性适应性适应性环境相结合,缩小了这一差距。拟议框架广泛适用于代理物协调、沟通和根据不断变化的条件作出决定的动态情景。我们提供了实时战略改进和适应性反馈机制,使代理物能够根据即时环境互动动态调整政策,这与以往在静态或基于转机环境评价LLMM能力的努力形成对比。经验性结果表明,我们的方法在高噪音环境中比PPO基准回报率达到26的改进,同时将实时拉特度保持在1.05毫秒以下。我们的方法改进了协作效率、任务完成率和灵活性,从而最终提升了游戏和弹性的多动性战略反馈。


Article 20

Title@2025-07-01 (2): Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments

Title: Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments Active Scout: Multi-Target-Tracking mit neuralen Strahlungsfeldern in dichten städtischen Umgebungen 活跃的童子军:在城市环境中使用神经辐射场进行多目标跟踪 2406.07431v3

Authors (2): Christopher D. Hsu, Pratik Chaudhari

We study pursuit-evasion games in highly occluded urban environments, e.g., among tall buildings in a city, where a scout (quadrotor) tracks multiple dynamic targets on the ground. We show that we can build a neural radiance field (NeRF) representation of the city online, using RGB and depth images from different vantage points. This representation is used to calculate the information gain to both explore unknown parts of the city and track the targets, thereby giving a completely first-principles approach to actively tracking dynamic targets. We demonstrate, in a custom-built simulator based on OpenStreetMap data of Philadelphia and New York City, that we can explore and locate 20 stationary targets within 300 steps. This is slower than a greedy baseline, which does not use active perception. But for dynamic targets that actively hide behind occlusions, our approach maintains, at worst, a tracking error of 200m, whereas the greedy baseline can have a tracking error as large as 600m. We observe a number of interesting properties in the scout’s policies: for example, it periodically switches its attention to track a different target, and as the quality of the NeRF representation improves over time, the scout also becomes better at target tracking. Code is available at https://github.com/grasp-lyrl/ActiveScout.

我们在高度隐蔽的城市环境中,例如城市高楼建筑,一个侦察员(quadrotor)跟踪当地的多种动态目标。我们显示,我们可以在网上,使用RGB和不同偏向点的深度图像,在高度隐蔽的城市环境中,研究追逐和躲避游戏。这个代表用于计算信息,既探索城市未知部分,又跟踪目标,从而对积极跟踪动态目标采取完全第一原则的方法。我们用一个定制的模拟器,利用费城和纽约市的开放街道地图数据,显示我们可以探索20个固定目标,并将其定位在300个步骤之内。这比贪婪的基线要慢,没有使用积极的看法。但对于积极隐藏在隐蔽点后面的动态目标,我们显示我们的方法最差时保持了200米的跟踪错误;贪婪的基线可能有一个大至600米的跟踪错误。我们观察侦察政策中的一些有趣的属性,例如,它能将注意力转换到跟踪不同目标的跟踪系统/定位系统的质量,在NRFS上也定期改进。


Article 21

Title@2025-07-01 (2): Large Language Model Powered Intelligent Urban Agents: Concepts, Capabilities, and Applications

Title: Large Language Model Powered Intelligent Urban Agents: Concepts, Capabilities, and Applications Large Language Model Powered Intelligent Urban Agents: Konzepte, Fähigkeiten und Anwendungen 大语言示范型大语言智能智能城市代表机构:概念、能力和应用 2507.00914v1

Authors (7): Jindong Han, Yansong Ning, Zirui Yuan, Hang Ni, Fan Liu, Tengfei Lyu, Hao Liu

The long-standing vision of intelligent cities is to create efficient, livable, and sustainable urban environments using big data and artificial intelligence technologies. Recently, the advent of Large Language Models (LLMs) has opened new ways toward realizing this vision. With powerful semantic understanding and reasoning capabilities, LLMs can be deployed as intelligent agents capable of autonomously solving complex problems across domains. In this article, we focus on Urban LLM Agents, which are LLM-powered agents that are semi-embodied within the hybrid cyber-physical-social space of cities and used for system-level urban decision-making. First, we introduce the concept of urban LLM agents, discussing their unique capabilities and features. Second, we survey the current research landscape from the perspective of agent workflows, encompassing urban sensing, memory management, reasoning, execution, and learning. Third, we categorize the application domains of urban LLM agents into five groups: urban planning, transportation, environment, public safety, and urban society, presenting representative works in each group. Finally, we discuss trustworthiness and evaluation issues that are critical for real-world deployment, and identify several open problems for future research. This survey aims to establish a foundation for the emerging field of urban LLM agents and to provide a roadmap for advancing the intersection of LLMs and urban intelligence. A curated list of relevant papers and open-source resources is maintained and continuously updated at https://github.com/usail-hkust/Awesome-Urban-LLM-Agents.

智能城市的长期愿景是利用海量数据和人工智能技术创造高效、可居住和可持续的城市环境。最近,大语言模型(LLMs)的出现开辟了实现这一愿景的新途径。凭借强大的语义理解和推理能力,LLMs可以被部署为能自主解决跨领域复杂问题的智能剂。在本篇文章中,我们侧重于城市LLM代理,这是在城市混合网络-物理-社会空间内半吸收的LLM授权代理物,用于系统一级的城市决策。首先,我们引入了城市LLM代理商的概念,讨论其独特的能力和特点。第二,我们从代理商工作流程的角度,包括城市感测、记忆管理、推理学、执行和学习,来调查当前的研究场景色。第三,我们将城市LLMM代理商的应用领域分为五组:城市规划、交通、环境、公共安全和城市社会,在每组中介绍具有代表性的作品。最后,我们讨论了对现实世界部署至关重要的信任度和评估问题,并确定了一些开放的LLMMM代理商问题。我们从代理商的工作流程的角度考察当前和不断更新的LMA-LM公司的文件。这项调查的目的是为正在建立一个基础。


Article 22

Title@2025-07-01 (2): TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation

Title: TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation Translaw: Benchmarking von großen Sprachmodellen in der Multi-Agenten-Simulation der Kollaborativen Übersetzung TransLaw:在多方代理模拟协作翻译时确定大语言模式基准 2507.00875v1

Authors (4): Xi Xuan, King-kui Sin, Yufei Zhou, Chunyu Kit

Multi-agent systems empowered by large language models (LLMs) have demonstrated remarkable capabilities in a wide range of downstream applications, including machine translation. However, the potential of LLMs in translating Hong Kong legal judgments remains uncertain due to challenges such as intricate legal terminology, culturally embedded nuances, and strict linguistic structures. In this work, we introduce TransLaw, a novel multi-agent framework implemented for real-world Hong Kong case law translation. It employs three specialized agents, namely, Translator, Annotator, and Proofreader, to collaboratively produce translations for high accuracy in legal meaning, appropriateness in style, and adequate coherence and cohesion in structure. This framework supports customizable LLM configurations and achieves tremendous cost reduction compared to professional human translation services. We evaluated its performance using 13 open-source and commercial LLMs as agents and obtained interesting findings, including that it surpasses GPT-4o in legal semantic accuracy, structural coherence, and stylistic fidelity, yet trails human experts in contextualizing complex terminology and stylistic naturalness. Our platform website is available at CityUHK, and our bilingual judgment corpus used for the evaluation is available at Hugging Face.

由大型语言模型(LLMS)授权的多试剂系统在包括机器翻译在内的一系列下游应用中表现出了非凡的能力,然而,由于复杂的法律术语、文化内含的细微差别和严格的语言结构等挑战,LLMS在翻译香港法律判决方面的潜力仍然不确定,在这项工作中,我们采用了TransLaw,这是为香港真实世界判例法翻译工作实施的新颖的多试剂框架,它雇用了3个专业代理人,即笔译员、说明员和校对员,合作翻译法律意义高度准确、风格适当、结构具有充分一致性和一致性。这个框架支持可定制的LM配置,并实现了与专业人类翻译服务相比的巨大成本削减。我们用13个开放源和商业LLMs作为代理对它的业绩进行了评估,并获得了有趣的发现,包括它在法律语义准确性、结构一致性和文体贴性方面超过了GPT-4,但在复杂术语背景和自然性方面有历史线索的人类专家。我们的平台网站在城市UHK提供,我们用于评估的双语判决书可在Hugging Face上查阅。


Article 23

Title@2025-07-01 (2): Position: Emergent Machina Sapiens Urge Rethinking Multi-Agent Paradigms

Title: Position: Emergent Machina Sapiens Urge Rethinking Multi-Agent Paradigms Position: Emergent Machina Sapiens Urge Rethinking Multi-Agent Paradigmen 职位: 新兴马奇纳·萨皮恩斯敦促重新思考多机构模式 2502.04388v3

Authors (5): Hepeng Li, Yuhong Liu, Jun Yan, Jie Gao, Xiaoou Yang

Artificial Intelligence (AI) agents capable of autonomous learning and independent decision-making hold great promise for addressing complex challenges across various critical infrastructure domains, including transportation, energy systems, and manufacturing. However, the surge in the design and deployment of AI systems, driven by various stakeholders with distinct and unaligned objectives, introduces a crucial challenge: How can uncoordinated AI systems coexist and evolve harmoniously in shared environments without creating chaos or compromising safety? To address this, we advocate for a fundamental rethinking of existing multi-agent frameworks, such as multi-agent systems and game theory, which are largely limited to predefined rules and static objective structures. We posit that AI agents should be empowered to adjust their objectives dynamically, make compromises, form coalitions, and safely compete or cooperate through evolving relationships and social feedback. Through two case studies in critical infrastructure applications, we call for a shift toward the emergent, self-organizing, and context-aware nature of these multi-agentic AI systems.

有能力自主学习和独立决策的人工智能(AI)代理机构在应对运输、能源系统和制造业等各种关键基础设施领域的复杂挑战方面大有希望,然而,由不同和不结盟目标的不同利益攸关方驱动的人工智能系统设计和部署激增,带来了一个至关重要的挑战:在共同环境中,如何不协调的人工智能系统在不造成混乱或损害安全的情况下和谐地共存和演变?为了解决这个问题,我们主张从根本上重新思考现有的多试剂框架,如多剂系统和游戏理论,这些多剂系统和游戏理论基本上限于预先确定的规则和静态目标结构。我们主张应授权AI代理机构通过不断发展的关系和社会反馈,积极调整其目标,作出妥协,形成联盟,并进行安全竞争或合作。我们通过在关键基础设施应用方面的两个案例研究,呼吁向这些多剂性人工智能系统的新兴、自我组织和背景性质转变。


Article 24

Title@2025-07-01 (2): Robust Correlated Equilibrium: Definition and Computation

Title: Robust Correlated Equilibrium: Definition and Computation Robustes korreliertes Gleichgewicht: Definition und Berechnung 强力Cor相关平衡:定义和计算 2311.17592v2

Authors (4): Rahul Misra, Rafał Wisniewski, Carsten Skovmose Kallesøe, Manuela L. Bujorianu

We study N-player finite games whose costs are perturbed by time-varying disturbances in the underlying system. To that end, we propose the concept of Robust Correlated Equilibrium, which generalizes the definition of Correlated Equilibrium. Conditions under which the Robust Correlated Equilibrium exists are specified, and a decentralized algorithm for learning strategies that are optimal in the sense of Robust Correlated Equilibrium is proposed. The primary contribution of the paper is the convergence analysis of the algorithm. For this analysis, we propose a modification of the celebrated Blackwell's Approachability Theorem for games whose costs are not just time averages, as in the original theorem, but also include the time average of previous algorithm iterates. The designed algorithm is applied to a practical water distribution network in which pumps are the controllers and their costs are perturbed by the consumers' uncertain consumption. Simulation results show that each controller achieves no regret, and empirical distributions converge to the Robust Correlated Equilibrium.

我们研究N玩家有限游戏,其成本因基础系统的时间变化干扰而变化,为此,我们提议采用“硬盘相关平衡”概念,将“硬盘相关平衡”的定义概括为“平衡”的定义。在这种条件下,规定了“硬盘相关平衡”存在的条件,并提出了一种在Robust Cor相关平衡意义上最优化的学习战略的分散算法。文件的主要贡献是对算法进行趋同分析,并为此目的,我们提议修改著名的Blackwell可接近游戏的标语,其成本不光是时间平均的,正如最初的Blackwell的可接近性标语一样,而且还包括先前的算法替代词的时间平均。设计算法适用于实用的水分配网络,泵是控制器,其成本因消费者消费的不确定性而受到困扰。模拟结果显示,每个控制器都毫无遗憾,实验性分布会与Robust Cor Cor相关。
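
For orientation, the standard (non-robust) correlated equilibrium that the abstract above generalizes can be written for cost-minimizing players as follows. The notation ($\pi$ for the joint distribution over action profiles, $c_i$ for player $i$'s cost, $A_i$ for its action set) is assumed here for illustration and is not taken from the paper; the robust variant itself is not reproduced.

```latex
\[
\sum_{a_{-i} \in A_{-i}} \pi(a_i, a_{-i})
  \bigl[\, c_i(a_i, a_{-i}) - c_i(a_i', a_{-i}) \,\bigr] \le 0
\qquad \text{for all players } i \text{ and all } a_i, a_i' \in A_i .
\]
```

In words: when action profiles are drawn from $\pi$ and each player is told only its own recommended action $a_i$, no player can lower its expected cost by switching to some other action $a_i'$.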


Article 25

Title@2025-07-01 (2): Hierarchical Decentralized Stochastic Control for Cyber-Physical Systems

Title: Hierarchical Decentralized Stochastic Control for Cyber-Physical Systems Hierarchische dezentrale stochastische Steuerung für Cyber-Physische Systeme 网络物理系统等级分层存储控制 2506.22971v2

Authors (3): Kesav Kaza, Ramachandran Anantharaman, Rahul Meshram

This paper presents a two-timescale hierarchical decentralized architecture for control of Cyber-Physical Systems. The architecture consists of $N$ independent sub-processes, a global controller, and $N$ local controllers, each formulated as a Markov Decision Process (MDP). The global controller, operating at a slower timescale, optimizes the infinite-horizon discounted cumulative reward under budget constraints. For the local controllers, operating at a faster timescale, we propose two different optimization frameworks, namely COpt and FOpt. In the COpt framework, the local controller also optimizes an infinite-horizon MDP, while in the FOpt framework, the local controller optimizes a finite-horizon MDP. The FOpt framework mimics a federal structure, where the local controllers have more autonomy in their decision making. First, the existence of stationary deterministic optimal policies for both these frameworks is established. Then, various relationships between the two frameworks are studied, including a bound on the difference between the two optimal value functions. Additionally, sufficient conditions are provided under which the two frameworks lead to the same optimal values.

本文介绍了一个用于控制网络物理系统的两级级分权架构。该架构由独立子流程、全球控制器和当地控制器组成,每个流程都作为Markov决策程序(MDP)制定。全球控制器在较慢的时间尺度上优化了预算限制下无限和偏差的累积奖励。对于在较快的时间尺度上运作的本地控制器,我们建议两个不同的优化框架,即COpt和Fopt。在COpt框架内,地方控制器还优化了无限和偏差 MDP,而在Fopt框架内,地方控制器优化了限定和偏差 MDP。Fopt框架模拟了联邦结构,当地控制器在决策中拥有更大的自主权。首先,对这两个框架的固定和最佳政策的存在进行了研究,包括对两种最佳价值功能之间的差异进行约束。此外,提供了充足条件,使两个框架达到相同的最佳价值。
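
The contrast between an infinite-horizon and a finite-horizon MDP in the abstract above can be made concrete with a minimal sketch. It uses a hypothetical two-state, two-action toy MDP with no budget constraint and no hierarchy, so it is not the paper's COpt or FOpt formulation. The gap bound computed at the end is the textbook bound for nonnegative rewards, not the bound derived in the paper.

```python
def q_values(P, R, gamma, V):
    """Q[s][a] = R[s][a] + gamma * sum_t P[a][s][t] * V[t]."""
    n_states, n_actions = len(R), len(R[0])
    return [[R[s][a] + gamma * sum(P[a][s][t] * V[t] for t in range(n_states))
             for a in range(n_actions)]
            for s in range(n_states)]


def greedy(Q):
    """Pick, for every state, the action with the largest Q-value."""
    return [max(range(len(row)), key=row.__getitem__) for row in Q]


def value_iteration(P, R, gamma, tol=1e-12, max_iter=100_000):
    """Infinite-horizon discounted MDP: returns (V, stationary policy)."""
    V = [0.0] * len(R)
    for _ in range(max_iter):
        V_new = [max(row) for row in q_values(P, R, gamma, V)]
        done = max(abs(x - y) for x, y in zip(V_new, V)) < tol
        V = V_new
        if done:
            break
    return V, greedy(q_values(P, R, gamma, V))


def backward_induction(P, R, gamma, horizon):
    """Finite-horizon MDP: returns (V at step 0, one decision rule per step)."""
    V = [0.0] * len(R)
    rules = []
    for _ in range(horizon):
        Q = q_values(P, R, gamma, V)
        rules.append(greedy(Q))
        V = [max(row) for row in Q]
    rules.reverse()  # rules[t] is now the decision rule used at step t
    return V, rules


# Hypothetical toy problem: P[a][s][t] is the transition probability, R[s][a] the reward.
P = [
    [[0.9, 0.1], [0.2, 0.8]],  # action 0
    [[0.5, 0.5], [0.6, 0.4]],  # action 1
]
R = [[1.0, 0.0], [0.0, 2.0]]
GAMMA, HORIZON, R_MAX = 0.9, 20, 2.0

V_inf, pi_inf = value_iteration(P, R, GAMMA)
V_fin, pi_fin = backward_induction(P, R, GAMMA, HORIZON)
# Textbook bound for nonnegative rewards: 0 <= V_inf - V_fin <= gamma^H * R_MAX / (1 - gamma)
GAP_BOUND = GAMMA ** HORIZON * R_MAX / (1 - GAMMA)
```

The infinite-horizon solver returns a single stationary policy, whereas the finite-horizon solver returns one decision rule per remaining step, which is the structural difference between the two kinds of local controller described above.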


Article 26

Title@2025-07-01 (2): Towards a Playground to Democratize Experimentation and Benchmarking of AI Agents for Network Troubleshooting

Title: Towards a Playground to Democratize Experimentation and Benchmarking of AI Agents for Network Troubleshooting Auf dem Weg zu einem Spielplatz zur Demokratisierung von Experimenten und Benchmarking von KI-Agenten zur Netzwerkfehlerbehebung 走向使AI 网络排除问题代理机构民主化试验和基准设定的竞技场 2507.01997v1

Authors (6): Zhihao Wang, Alessandro Cornacchia, Franco Galante, Carlo Centofanti, Alessio Sacco, Dingde Jiang

Recent research has demonstrated the effectiveness of Artificial Intelligence (AI), and more specifically, Large Language Models (LLMs), in supporting network configuration synthesis and automating network diagnosis tasks, among others. In this preliminary work, we restrict our focus to the application of AI agents to network troubleshooting and elaborate on the need for a standardized, reproducible, and open benchmarking platform on which to build and evaluate AI agents with low operational effort.

最近的研究表明,人工智能(AI),更具体地说,大语言模型(LLMs),在支持网络配置合成和网络诊断任务自动化等方面是有效的。 在这一初步工作中,我们的重点仅限于应用AI代理机构解决网络故障,并阐明需要一个标准化的、可复制的和开放的基准平台,以便低投入地建立和评估AI代理机构。


Article 27

Title@2025-07-01 (2): Twill: Scheduling Compound AI Systems on Heterogeneous Mobile Edge Platforms

Title: Twill: Scheduling Compound AI Systems on Heterogeneous Mobile Edge Platforms Twill: Scheduling Compound AI-Systeme auf heterogenen mobilen Edge-Plattformen Twill: 异源移动边缘平台上排成不同式移动边缘平台的AI系统 2507.00491v1

Authors (5): Zain Taufique, Aman Vyas, Antonio Miele, Pasi Liljeberg, Anil Kanduri

Compound AI (cAI) systems chain multiple AI models to solve complex problems. cAI systems are typically composed of deep neural networks (DNNs), transformers, and large language models (LLMs), exhibiting a high degree of computational diversity and dynamic workload variation. Deploying cAI services on mobile edge platforms poses a significant challenge in scheduling concurrent DNN-transformer inference tasks, which arrive dynamically in an unknown sequence. Existing mobile edge AI inference strategies manage multi-DNN or transformer-only workloads, relying on design-time profiling, and cannot handle concurrent inference of DNNs and transformers required by cAI systems. In this work, we address the challenge of scheduling cAI systems on heterogeneous mobile edge platforms. We present Twill, a run-time framework to handle concurrent inference requests of cAI workloads through task affinity-aware cluster mapping and migration, priority-aware task freezing/unfreezing, and DVFS, while minimizing inference latency within power budgets. We implement and deploy our Twill framework on the Nvidia Jetson Orin NX platform. We evaluate Twill against state-of-the-art edge AI inference techniques over contemporary DNNs and LLMs, reducing inference latency by 54% on average, while honoring power budgets.

复合AI(cAI)系统将多个AI模型串联起来以解决复杂问题。cAI系统通常由深度神经网络(DNN)、Transformer和大型语言模型(LLM)组成,表现出高度的计算多样性和动态的工作负载变化。在移动边缘平台上部署cAI服务,对调度并发的DNN-Transformer推理任务构成重大挑战,因为这些任务以未知的顺序动态到达。现有的移动边缘AI推理策略只能管理多DNN或仅含Transformer的工作负载,依赖设计时的性能剖析,无法处理cAI系统所需的DNN与Transformer并发推理。在这项工作中,我们应对在异构移动边缘平台上调度cAI系统的挑战。我们提出Twill,这是一个运行时框架,通过任务亲和性感知的集群映射与迁移、优先级感知的任务冻结/解冻以及DVFS来处理cAI工作负载的并发推理请求,同时在功耗预算内尽量降低推理延迟。我们在Nvidia Jetson Orin NX平台上实现并部署了Twill框架。我们在当代DNN和LLM上将Twill与最先进的边缘AI推理技术进行了比较评估,Twill在遵守功耗预算的同时将推理延迟平均降低了54%。


Article 28

Title@2025-07-01 (2): Novel Pigeon-inspired 3D Obstacle Detection and Avoidance Maneuver for Multi-UAV Systems

Title: Novel Pigeon-inspired 3D Obstacle Detection and Avoidance Maneuver for Multi-UAV Systems Neuartige Pigeon-inspirierte 3D-Hördererkennung und Vermeidungsmanöver für Multi-UAV-Systeme 多无人驾驶航空器系统3D障碍探测和避免多功能、无人驾驶航空器系统新小鸽诱导的3D障碍操纵器 2507.00443v1

Authors (3): Reza Ahmadvand, Sarah Safura Sharif, Yaser Mike Banad

Recent advances in multi-agent systems manipulation have demonstrated a rising demand for the implementation of multi-UAV systems in urban areas, which are always subjected to the presence of static and dynamic obstacles. Inspired by the collective behavior of tilapia fish and pigeons, the focus of the presented research is on the introduction of a nature-inspired collision-free formation control for a multi-UAV system, considering the obstacle avoidance maneuvers. The developed framework in this study utilizes a semi-distributed control approach, in which, based on a probabilistic Lloyd’s algorithm, a centralized guidance algorithm works for optimal positioning of the UAVs, while a distributed control approach has been used for the intervehicle collision and obstacle avoidance. Further, the presented framework has been extended to the 3D space with a novel definition of 3D maneuvers. Finally, the presented framework has been applied to multi-UAV systems in 2D and 3D scenarios, and the obtained results demonstrated the validity of the presented method in dynamic environments with stationary and moving obstacles.

多试剂系统操纵方面最近的进展表明,对在城市地区实施多无人驾驶航空器系统的需求不断增加,城市地区始终存在静态和动态障碍,受罗非鱼和鸽子集体行为的影响,所述研究的重点是对多无人驾驶航空器系统采用自然驱动的无碰撞形成控制,同时考虑到避免障碍的动作;本研究的发达框架采用了半分散控制办法,根据劳埃德的概率算法,对无人驾驶航空器的最佳定位采用集中指导算法,同时对车辆间碰撞和避免障碍也采用了分散控制法;此外,提出的框架已扩大到3D空间,对3D操作作了新的定义;最后,在2D和3D情景中,对多无人驾驶航空器系统适用了介绍的框架,获得的结果表明在有固定和移动障碍的动态环境中采用的方法的有效性。
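
The centralized positioning step mentioned in the abstract above relies on a probabilistic Lloyd's algorithm. The sketch below shows a generic sampled Lloyd iteration in 2D: each site moves to the centroid of the sample points closest to it. The sample-based approximation of the Voronoi cells, the unit-square workspace, and all names are assumptions made for illustration; the paper's actual guidance law, its 3D extension, and its collision avoidance are not reproduced.

```python
import random


def nearest(sites, point):
    """Index of the site closest to the given point (squared Euclidean distance)."""
    px, py = point
    return min(range(len(sites)),
               key=lambda i: (sites[i][0] - px) ** 2 + (sites[i][1] - py) ** 2)


def lloyd_step(sites, samples):
    """One Lloyd iteration: assign samples to their nearest site, then move
    every site to the centroid of its samples (sites with no samples stay put)."""
    acc = [[0.0, 0.0, 0] for _ in sites]
    for x, y in samples:
        k = nearest(sites, (x, y))
        acc[k][0] += x
        acc[k][1] += y
        acc[k][2] += 1
    return [(sx / n, sy / n) if n else site
            for (sx, sy, n), site in zip(acc, sites)]


def coverage_cost(sites, samples):
    """Mean squared distance from each sample to its nearest site."""
    return sum(min((sx - x) ** 2 + (sy - y) ** 2 for sx, sy in sites)
               for x, y in samples) / len(samples)


rng = random.Random(0)
# Monte Carlo samples of the workspace (hypothetical unit square, uniform density).
samples = [(rng.random(), rng.random()) for _ in range(2000)]
# Four vehicles start clustered in one corner.
sites = [(rng.random() * 0.2, rng.random() * 0.2) for _ in range(4)]

costs = [coverage_cost(sites, samples)]
for _ in range(15):
    sites = lloyd_step(sites, samples)
    costs.append(coverage_cost(sites, samples))
```

Because the sample set is held fixed here, the coverage cost cannot increase from one iteration to the next; drawing fresh samples at every step, as a fully probabilistic variant might, would make the cost noisy rather than monotone.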


Article 29

Title@2025-06-30 (1): What Makes Local Updates Effective: The Role of Data Heterogeneity and Smoothness

Title: What Makes Local Updates Effective: The Role of Data Heterogeneity and Smoothness Was lokale Updates effektiv macht: Die Rolle von Daten Heterogenität und Glätte 是什么使本地更新有效:数据多样化和平稳的作用 2507.00195v1

Authors (1): Kumar Kshitij Patel

This thesis contributes to the theoretical understanding of local update algorithms, especially Local SGD, in distributed and federated optimization under realistic models of data heterogeneity. A central focus is on the bounded second-order heterogeneity assumption, which is shown to be both necessary and sufficient for local updates to outperform centralized or mini-batch methods in convex and non-convex settings. The thesis establishes tight upper and lower bounds in several regimes for various local update algorithms and characterizes the min-max complexity of multiple problem classes. At its core is a fine-grained consensus-error-based analysis framework that yields sharper finite-time convergence bounds under third-order smoothness and relaxed heterogeneity assumptions. The thesis also extends to online federated learning, providing fundamental regret bounds under both first-order and bandit feedback. Together, these results clarify when and why local updates offer provable advantages, and the thesis serves as a self-contained guide for analyzing Local SGD in heterogeneous environments.

该论文有助于在现实的数据异质模型下对本地更新算法,特别是本地 SGD 进行分布式和联合优化的理论理解,在分布式和联合式的数据异质模型下,有助于对本地更新算法,特别是本地 SGD 进行理论理解。中心重点是受约束的二级异质假设,这证明既必要,又足以使本地更新在 convex 和非 convex 设置中超越集中式或微型批量法。该论文在若干制度中为各种本地更新算法规定了严格的上下限,并说明了多种问题类别的微量复杂程度。其核心是一个精细的基于共识的基于共识的分析框架,在三级平稳和宽松的异质性假设下产生更敏锐的定时趋同界限。该论文还延伸至在线的联邦学习,提供了一级和带宽度反馈的基本遗憾界限。这些结果共同澄清了本地更新在何时和为什么提供可证实的优势,该论文是用于在混杂环境中分析本地 SGD 的自成指南。
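
To make the role of second-order heterogeneity in the abstract above tangible, here is a toy sketch of local updates on one-dimensional quadratics $f_m(x) = \tfrac{1}{2} a_m (x - b_m)^2$. Exact gradients are used instead of stochastic ones so the run is reproducible, and the objectives, step size, and function names are all assumptions for illustration rather than anything taken from the thesis.

```python
def local_gd_round(x, curvatures, optima, lr, local_steps):
    """One communication round: every client starts from x, takes
    `local_steps` gradient steps on its own objective, then results are averaged."""
    finals = []
    for a, b in zip(curvatures, optima):
        x_local = x
        for _ in range(local_steps):
            x_local -= lr * a * (x_local - b)  # gradient of 0.5 * a * (x - b)^2
        finals.append(x_local)
    return sum(finals) / len(finals)


def run_local_gd(curvatures, optima, lr=0.1, local_steps=10, rounds=100, x0=5.0):
    """Repeat communication rounds and return the final averaged iterate."""
    x = x0
    for _ in range(rounds):
        x = local_gd_round(x, curvatures, optima, lr, local_steps)
    return x


def global_minimizer(curvatures, optima):
    """Minimizer of the average objective (1/M) * sum_m f_m."""
    return sum(a * b for a, b in zip(curvatures, optima)) / sum(curvatures)


# Identical Hessians (no second-order heterogeneity), different optima.
same_hessian = run_local_gd([2.0, 2.0], [0.0, 1.0])
# Different Hessians: the local steps pull the average away from the true minimizer.
mixed_hessian = run_local_gd([1.0, 4.0], [0.0, 1.0])
```

In this toy, clients with identical curvature reach the global minimizer 0.5 even with ten local steps per round, while clients with curvatures 1 and 4 settle near 0.60 instead of the true minimizer 0.8. Setting `local_steps=1` removes that bias, since each round then reduces to one gradient step on the average objective.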


Article 30

Title@2025-06-30 (1): Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning

Title: Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning Erlernbare Multi-Agent-Pathfinding-Lösemittel mit aktiver Feinsteuerung 具有积极微调功能的推进可学习多机构探索式解答器 2506.23793v1

Authors (4): Anton Andreychuk, Konstantin Yakovlev, Aleksandr Panov, Alexey Skrynnik

Multi-agent pathfinding (MAPF) is a common abstraction of multi-robot trajectory planning problems, where multiple homogeneous robots simultaneously move in the shared environment. While solving MAPF optimally has been proven to be NP-hard, scalable and efficient solvers are vital for real-world applications such as logistics and search-and-rescue. To this end, decentralized suboptimal MAPF solvers that leverage machine learning have emerged. Building on the success of the recently introduced MAPF-GPT, a pure imitation learning solver, we introduce MAPF-GPT-DDG. This novel approach effectively fine-tunes the pre-trained MAPF model using centralized expert data. Leveraging a novel delta-data generation mechanism, MAPF-GPT-DDG accelerates training while significantly improving performance at test time. Our experiments demonstrate that MAPF-GPT-DDG surpasses all existing learning-based MAPF solvers, including the original MAPF-GPT, regarding solution quality across many testing scenarios. Remarkably, it can work with MAPF instances involving up to 1 million agents in a single environment, setting a new milestone for scalability in MAPF domains.

多试剂路由调查(MAPF)是多机器人轨迹规划问题的共同抽象,多同质机器人同时在共享环境中移动。尽管最优化地解决MAPF是NP-硬、可缩放和高效的,但解决方案对于物流、搜索和救援等现实世界应用至关重要。为此,分散的利用机器学习的亚最佳MAPF解答器已上台。在最近引入的MAPF-GPT(纯仿造学习解析器)的成功基础上,我们引入了MAPF-GPT-DDG。这一新颖办法有效地微调了使用集中专家数据经过预先训练的MAPFF模式。利用新的三角数据生成机制,MAPF-GPT-DDG加速培训,同时在测试时间大大改进了绩效。我们的实验表明,MAPF-GPT-DGGS-DG超越了所有现有的基于学习的MAPFFS解答器,包括最初的MAPF-GPT,涉及许多测试情景的解决方案质量。值得注意的是,它可以与MAPFFS(MAPF-GPF)公司一起在新领域建立一个具有里程碑的、在100万个新环境上建立一个里程碑。


Article 31

Title@2025-06-30 (1): PokéAI: A Goal-Generating, Battle-Optimizing Multi-agent System for Pokemon Red

Title: PokéAI: A Goal-Generating, Battle-Optimizing Multi-agent System for Pokemon Red PokéAI: Ein Ziel-Generierung, Schlacht-Optimierung Multi-Agenten-System für Pokemon Red PokéAI:波克蒙红公司目标启动、战斗优化多剂多试剂系统 2506.23689v1

Authors (4): Zihao Liu, Xinhang Sui, Yueran Song, Siwen Wang

We introduce PokéAI, the first text-based, multi-agent large language model (LLM) framework designed to autonomously play and progress through Pokémon Red. Our system consists of three specialized agents (Planning, Execution, and Critique), each with its own memory bank, role, and skill set. The Planning Agent functions as the central brain, generating tasks to progress through the game. These tasks are then delegated to the Execution Agent, which carries them out within the game environment. Upon task completion, the Critique Agent evaluates the outcome to determine whether the objective was successfully achieved. Once verification is complete, control returns to the Planning Agent, forming a closed-loop decision-making system. As a preliminary step, we developed a battle module within the Execution Agent. Our results show that the battle AI achieves an average win rate of 80.8% across 50 wild encounters, only 6% lower than the performance of an experienced human player. Furthermore, we find that a model's battle performance correlates strongly with its LLM Arena score on language-related tasks, indicating a meaningful link between linguistic ability and strategic reasoning. Finally, our analysis of gameplay logs reveals that each LLM exhibits a unique playstyle, suggesting that individual models develop distinct strategic behaviors.

我们引入了Pok'eAI, 这是第一个基于文本的多试剂大型语言模型(LLM)框架, 旨在通过 Pok'emon Red 自动游戏和进步。 我们的系统由三个专门的代理器组成: 规划、 执行和 Critique- each 及其自己的记忆库、 角色和技能组。 规划代理机作为中央大脑发挥功能, 通过游戏产生进步的任务。 这些任务随后被委托给执行代理器, 后者在游戏环境中执行。 任务完成后, Critique Agenter 评估结果, 以确定目标是否成功实现。 一旦核查完成, 控制返回到规划代理, 形成一个闭环决策系统。 作为第一步, 我们开发了一个执行代理的战斗模块。 我们的结果表明, 作战代理器在50次野战中平均赢得80.8%的胜利率, 仅比有经验的人类玩家的表现低6 % 。 此外, 我们发现, 模型的战斗性表现与其LM Arena 与语言相关任务的成绩密切相关, 显示语言能力和战略思维模式之间的有意义的联系, 展示了我们独特的游戏行为模式。


Article 32

Title@2025-06-30 (1): Curated Collaborative AI Edge with Network Data Analytics for B5G/6G Radio Access Networks

Title: Curated Collaborative AI Edge with Network Data Analytics for B5G/6G Radio Access Networks Kuratierter Kollaborativer AI Edge mit Network Data Analytics für B5G/6G Radio Access Networks B5G/6G无线电接入网络与网络数据分析 2507.01994v1

Authors (7): Sardar Jaffar Ali, Syed M. Raza, Duc-Tai Le, Rajesh Challa, Min Young Chung, Ness Shroff, Hyunseung Choo

Despite advancements, Radio Access Networks (RAN) still account for over 50% of the total power consumption in 5G networks. Existing RAN split options do not fully harness data potential, presenting an opportunity to reduce operational expenditures. This paper addresses this opportunity through a twofold approach. First, highly accurate network traffic and user predictions are achieved using the proposed Curated Collaborative Learning (CCL) framework, which selectively collaborates with relevant correlated data for traffic forecasting. CCL optimally determines whom, when, and what to collaborate with, significantly outperforming state-of-the-art approaches, including global, federated, personalized federated, and cyclic institutional incremental learning by 43.9%, 39.1%, 40.8%, and 31.35%, respectively. Second, the Distributed Unit Pooling Scheme (DUPS) is proposed, leveraging deep reinforcement learning and prediction inferences from CCL to reduce the number of active DU servers efficiently. DUPS dynamically redirects traffic from underutilized DU servers to optimize resource use, improving energy efficiency by up to 89% over conventional strategies, translating into substantial monetary benefits for operators. By integrating CCL-driven predictions with DUPS, this paper demonstrates a transformative approach for minimizing energy consumption and operational costs in 5G RANs, significantly enhancing efficiency and cost-effectiveness.

尽管取得了进展,但无线电接入网络(RAN)仍然占5G网络总电耗量的50 %以上。现有的RAN分割选项没有充分利用数据潜力,为减少业务支出提供了机会。本文件通过双管齐下的方法探讨了这一机会。首先,利用拟议的Curate合作学习框架实现了高度准确的网络流量和用户预测,该框架有选择地与相关的交通预报相关数据合作。CCL最妥善地确定谁、何时和什么与谁合作,大大超过最先进的方法,包括全球、联邦化、个性化联合和周期化机构递增学习,分别占43.9%、39.1%、40.8%和31.35%。第二,建议采用分配单位集成计划(DUPS),利用CCL的深度强化学习和预测,以高效地减少活跃的DU服务器的数量。DUPS动态地将交通从未充分利用的DU服务器转向优化资源使用,提高能源效率,超过常规战略,将能源效率提高到89%,从而大大降低RAPS的成本效益,同时大幅提高DA-AS运营商成本。


Article 33

Title@2025-06-30 (1): MGPRL: Distributed Multi-Gaussian Processes for Wi-Fi-based Multi-Robot Relative Localization in Large Indoor Environments

Title: MGPRL: Distributed Multi-Gaussian Processes for Wi-Fi-based Multi-Robot Relative Localization in Large Indoor Environments MGPRL: Verteilte Multi-Gaussian-Prozesse für WLAN-basierte Multi-Roboter-relative Lokalisierung in großen Innenräumen MGPRL:大型室内环境中无线-基于无线-基于多机器人的多机器人相对本地化的分布式多盖日进程 2506.23514v1

Authors (2): Sai Krishna Ghanta, Ramviyas Parasuraman

Relative localization is a crucial capability for multi-robot systems operating in GPS-denied environments. Existing approaches for multi-robot relative localization often depend on costly or short-range sensors like cameras and LiDARs. Consequently, these approaches face challenges such as high computational overhead (e.g., map merging) and difficulties in disjoint environments. To address this limitation, this paper introduces MGPRL, a novel distributed framework for multi-robot relative localization using the convex hull of multiple Wi-Fi access points (APs). To accomplish this, we employ co-regionalized multi-output Gaussian Processes for efficient Radio Signal Strength Indicator (RSSI) field prediction and perform uncertainty-aware multi-AP localization, which is further coupled with weighted convex hull-based alignment for robust relative pose estimation. Each robot predicts the RSSI field of the environment by an online scan of APs in its environment, which is then used to estimate the positions of multiple APs. To perform relative localization, each robot aligns the convex hull of its predicted AP locations with that of the neighbor robots. This approach is well-suited for devices with limited computational resources and operates solely on widely available Wi-Fi RSSI measurements without necessitating any dedicated pre-calibration or offline fingerprinting. We rigorously evaluate the performance of the proposed MGPRL in ROS simulations and demonstrate it with real-world experiments, comparing it against multiple state-of-the-art approaches. The results show that MGPRL outperforms existing methods in terms of localization accuracy and computational efficiency. Finally, we open source MGPRL as a ROS package at https://github.com/herolab-uga/MGPRL.

相对定位是多机器人系统在无GPS环境中运行的一项关键能力。现有的多机器人相对定位方法往往依赖于相机和LiDAR等昂贵或短距离的传感器。因此,这些方法面临计算开销高(例如地图合并)以及在不相连环境中难以工作等挑战。为了解决这一局限,本文提出MGPRL,这是一个利用多个Wi-Fi接入点(AP)的凸包进行多机器人相对定位的新型分布式框架。为此,我们采用协同区域化的多输出高斯过程来高效预测无线电信号强度指示(RSSI)场,并进行不确定性感知的多AP定位,再结合基于加权凸包的对齐,实现稳健的相对位姿估计。每个机器人通过在线扫描其环境中的AP来预测环境的RSSI场,并据此估计多个AP的位置。为了进行相对定位,每个机器人将其预测的AP位置的凸包与相邻机器人的凸包对齐。该方法非常适合计算资源有限的设备,仅依赖广泛可得的Wi-Fi RSSI测量,无需任何专门的预校准或离线指纹采集。我们在ROS仿真中对MGPRL的性能进行了严格评估,并通过真实世界实验加以验证,同时与多种最先进的方法进行了比较。结果表明,MGPRL在定位精度和计算效率方面优于现有方法。最后,我们将MGPRL作为ROS软件包开源:https://github.com/herolab-uga/MGPRL。


Article 34

Title@2025-06-30 (1): State and Memory is All You Need for Robust and Reliable AI Agents

Title: State and Memory is All You Need for Robust and Reliable AI Agents Zustand und Gedächtnis sind alles, was Sie für robuste und zuverlässige KI-Agenten brauchen 国家记忆是强力和可靠的AI代理所需要的一切 2507.00081v1

Authors (15): Matthew Muhoberac, Atharva Parikh, Nirvi Vakharia, Saniya Virani, Aco Radujevic, Savannah Wood, Meghav Verma, Dimitri Metaxotos, Jeyaraman Soundararajan, Thierry Masquelin, Alexander G. Godfrey, Sean Gardner, Dobrila Rudnicki, Sam Michael, Gaurav Chopra

Large language models (LLMs) have enabled powerful advances in natural language understanding and generation. Yet their application to complex, real-world scientific workflows remains limited by challenges in memory, planning, and tool integration. Here, we introduce SciBORG (Scientific Bespoke Artificial Intelligence Agents Optimized for Research Goals), a modular agentic framework that allows LLM-based agents to autonomously plan, reason, and achieve robust and reliable domain-specific task execution. Agents are constructed dynamically from source code documentation and augmented with finite-state automata (FSA) memory, enabling persistent state tracking and context-aware decision-making. This approach eliminates the need for manual prompt engineering and allows for robust, scalable deployment across diverse applications by maintaining context across extended workflows and recovering from tool or execution failures. We validate SciBORG through integration with both physical and virtual hardware, such as microwave synthesizers for executing user-specified reactions with context-aware decision making, and demonstrate its use in autonomous multi-step bioassay retrieval from the PubChem database utilizing multi-step planning, reasoning, agent-to-agent communication and coordination for execution of exploratory tasks. Systematic benchmarking shows that SciBORG agents achieve reliable execution, adaptive planning, and interpretable state transitions. Our results show that memory and state awareness are critical enablers of agentic planning and reliability, offering a generalizable foundation for deploying AI agents in complex environments.

大型语言模型(LLMS)在自然语言理解和生成方面带来了巨大的进步;然而,在复杂的现实世界科学工作流程中的应用仍然受到记忆、规划和工具整合挑战的限制;在这里,我们引入了SciBORG(科学专用人工智能代理,为研究目标优化优化了人工智能工具),这是一个模块化的代理框架,使基于LLM的代理商能够自主规划、理性并实现可靠和可靠的具体领域任务执行;代理商是从源代码文档中动态地构建的,并辅之以有限的州自动自动自动图像(FSA)记忆,从而能够持续进行国家跟踪和有环境意识的决策;这一方法消除了对人工快速工程的需求,并允许通过在扩展的工作流程中保持背景,在各种应用程序中进行强有力的、可扩展的部署;通过工具或执行失败来恢复;我们验证SciBORG的模块,通过与实施用户指定反应的微波合成器等整合器,同时作出符合环境意识的一般决策,并展示其在从Pubchem数据库中自主进行多步生物测定的检索,从而能够持续进行国家跟踪、推理学、代理商-代理商-代理机构进行标准化的过渡规划,并展示了我们的国家基准化、系统化、系统化基础基础分析、系统化分析、系统化的系统化分析、测试、测试、系统化的系统化的系统化的系统化的系统化的系统化、测试、测试、测试、测试、测试、测试、测试、测试、测试、测试、测试、测试、测试、测试、测试、测试的代理商执行结果,以显示国家分析、测试、测试、测试、测试、测试、测试、测试、测试、测试结果。
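
The idea of augmenting an agent with finite-state automata (FSA) memory, as described in the abstract above, can be sketched as a small transition table that records the current state and refuses actions that are invalid in that state. The class, the state names, and the action names below are hypothetical and chosen only for illustration; they are not SciBORG's actual interface.

```python
class FSAMemory:
    """Tracks a tool's state as a finite-state automaton and logs every transition."""

    def __init__(self, transitions, start):
        self.transitions = transitions  # maps (state, action) -> next state
        self.state = start
        self.history = [start]

    def allowed_actions(self):
        """Actions that are valid from the current state."""
        return sorted(action for (state, action) in self.transitions
                      if state == self.state)

    def apply(self, action):
        """Apply an action; return False and leave the state unchanged if it is invalid."""
        key = (self.state, action)
        if key not in self.transitions:
            return False
        self.state = self.transitions[key]
        self.history.append(self.state)
        return True


# Hypothetical states and actions for a microwave-synthesizer-like instrument.
TRANSITIONS = {
    ("idle", "load_vial"): "loaded",
    ("loaded", "start_heating"): "heating",
    ("heating", "finish"): "idle",
    ("heating", "fault"): "error",
    ("error", "reset"): "idle",
}

memory = FSAMemory(TRANSITIONS, "idle")
rejected = memory.apply("start_heating")  # invalid: nothing is loaded yet
memory.apply("load_vial")
memory.apply("start_heating")
memory.apply("fault")   # simulated execution failure
memory.apply("reset")   # recovery path back to a known state
```

An agent could consult `allowed_actions()` before each tool call so that a planner only proposes steps valid in the current state, and the `history` list gives an interpretable trace of state transitions.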


Article 35

Title@2025-06-29 (7): Automated Vehicles Should be Connected with Natural Language

Title: Automated Vehicles Should be Connected with Natural Language Automatisierte Fahrzeuge sollten mit natürlicher Sprache verbunden werden 自动车辆应与自然语言连接 2507.01059v1

Authors (6): Xiangbo Gao, Keshu Wu, Hao Zhang, Kexin Tian, Yang Zhou, Zhengzhong Tu

Multi-agent collaborative driving promises improvements in traffic safety and efficiency through collective perception and decision making. However, existing communication media – including raw sensor data, neural network features, and perception results – suffer limitations in bandwidth efficiency, information completeness, and agent interoperability. Moreover, traditional approaches have largely ignored decision-level fusion, neglecting critical dimensions of collaborative driving. In this paper we argue that addressing these challenges requires a transition from purely perception-oriented data exchanges to explicit intent and reasoning communication using natural language. Natural language balances semantic density and communication bandwidth, adapts flexibly to real-time conditions, and bridges heterogeneous agent platforms. By enabling the direct communication of intentions, rationales, and decisions, it transforms collaborative driving from reactive perception-data sharing into proactive coordination, advancing safety, efficiency, and transparency in intelligent transportation systems.

多试剂协作驾驶有望通过集体认识和决策改善交通安全和效率。然而,现有的通信媒体 – – 包括原始传感器数据、神经网络特征和认知结果 – – 在带宽效率、信息完整性和代理互操作性方面受到限制。此外,传统做法在很大程度上忽视了决策层面的融合,忽视了协作驱动的关键层面。在本文中,我们认为,应对这些挑战需要从纯粹面向认知的数据交换过渡到使用自然语言的明确意图和逻辑交流。自然语言平衡语义密度和通信带宽,灵活适应实时条件,以及桥梁的混合代理平台。通过促成意图、原理和决定的直接沟通,将协作驱动从被动的认知数据共享转变为智能运输系统的主动协调,提高安全性、效率和透明度。


Article 36

Title@2025-06-29 (7): Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge

Title: Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge Agentisches medizinisches Wissen Grafiken verbessern medizinische Frageantworten: Die Lücke zwischen LLMs und sich entwickelndem medizinischem Wissen überbrücken 药用知识图加强医疗问题的回答:缩小LLMM与不断发展的医学知识之间的差距 2502.13010v3

Authors (5): Mohammad Reza Rezaei, Reza Saadati Fard, Jayson L. Parker, Rahul G. Krishnan, Milad Lankarany

Large Language Models (LLMs) have significantly advanced medical question-answering by leveraging extensive clinical data and medical literature. However, the rapid evolution of medical knowledge and the labor-intensive process of manually updating domain-specific resources pose challenges to the reliability of these systems. To address this, we introduce Agentic Medical Graph-RAG (AMG-RAG), a comprehensive framework that automates the construction and continuous updating of medical knowledge graphs, integrates reasoning, and retrieves current external evidence, such as PubMed and WikiSearch. By dynamically linking new findings and complex medical concepts, AMG-RAG not only improves accuracy but also enhances interpretability in medical queries. Evaluations on the MEDQA and MEDMCQA benchmarks demonstrate the effectiveness of AMG-RAG, achieving an F1 score of 74.1 percent on MEDQA and an accuracy of 66.34 percent on MEDMCQA, outperforming both comparable models and those 10 to 100 times larger. Notably, these improvements are achieved without increasing computational overhead, highlighting the critical role of automated knowledge graph generation and external evidence retrieval in delivering up-to-date, trustworthy medical insights.

大型语言模型(LLM)通过利用广泛的临床数据和医学文献,大大推进了医学问答;然而,医学知识的迅速发展以及人工更新特定领域资源的劳动密集型过程,给这些系统的可靠性带来了挑战。为此,我们引入了Agentic Medical Graph-RAG(AMG-RAG),这是一个综合框架,能够自动构建并持续更新医学知识图谱,整合推理,并检索PubMed和WikiSearch等最新的外部证据。通过动态关联新发现与复杂的医学概念,AMG-RAG不仅提高了准确性,还增强了医学查询的可解释性。在MEDQA和MEDMCQA基准上的评估证明了AMG-RAG的有效性:在MEDQA上F1得分为74.1%,在MEDMCQA上准确率为66.34%,优于同等规模的模型以及规模大10至100倍的模型。值得注意的是,这些改进是在不增加计算开销的情况下实现的,凸显了自动化知识图谱生成和外部证据检索在提供最新、可信的医学见解方面的关键作用。


Article 37

Title@2025-06-29 (7): Ad-Hoc Human-AI Coordination Challenge

Title: Ad-Hoc Human-AI Coordination Challenge Ad-hoc-Koordinierungsherausforderung Mensch-AI A. 协调挑战 2506.21490v2

Authors (10): Tin Dizdarević, Ravi Hammond, Tobias Gessler, Anisoara Calinescu, Jonathan Cook, Matteo Gallici, Andrei Lupu, Darius Muglich, Johannes Forkel, Jakob Nicolaus Foerster

Achieving seamless coordination between AI agents and humans is crucial for real-world applications, yet it remains a significant open challenge. Hanabi is a cooperative card game featuring imperfect information, constrained communication, theory of mind requirements, and coordinated action – making it an ideal testbed for human-AI coordination. However, its use for human-AI interaction has been limited by the challenges of human evaluation. In this work, we introduce the Ad-Hoc Human-AI Coordination Challenge (AH2AC2) to overcome the constraints of costly and difficult-to-reproduce human evaluations. We develop human proxy agents on a large-scale human dataset that serve as robust, cheap, and reproducible human-like evaluation partners in AH2AC2. To encourage the development of data-efficient methods, we open-source a dataset of 3,079 games, deliberately limiting the amount of available human gameplay data. We present baseline results for both two- and three-player Hanabi scenarios. To ensure fair evaluation, we host the proxy agents through a controlled evaluation system rather than releasing them publicly. The code is available at https://github.com/FLAIROx/ah2ac2.

AI代理和人类之间实现无缝协调对于现实应用至关重要,但它仍然是一个重大的开放性挑战。Hanabi是一个合作的纸牌游戏,其特点是信息不完善、沟通受限、需要心智理论和协调行动,这使它成为人类-AI协调的理想测试平台。然而,其在人类-AI互动中的应用一直受到人类评估难题的限制。在这项工作中,我们提出了Ad-Hoc Human-AI Coordination Challenge(AH2AC2),以克服人类评估成本高昂且难以复现的制约。我们在大规模人类数据集上开发了人类代理智能体(human proxy agents),在AH2AC2中充当稳健、廉价且可复现的类人评估伙伴。为了鼓励开发数据高效的方法,我们开源了一个包含3,079局游戏的数据集,有意限制可用的人类游戏数据量。我们给出了两人和三人Hanabi场景的基线结果。为了确保公平评估,我们通过受控的评估系统托管代理智能体,而不是将其公开发布。代码可在 https://github.com/FLAIROx/ah2ac2 获取。


Article 38

Title@2025-06-29 (7): Interaction Identification of a Heterogeneous NDS with Quadratic-Bilinear Subsystems

Title: Interaction Identification of a Heterogeneous NDS with Quadratic-Bilinear Subsystems Interaktionsidentifizierung eines Heterogenen NDS mit Quadratisch-Bilinearen Subsystemen 与赤道-双线亚系统对异基因 NDS的交互识别 2412.02547v2

Authors (2): Tong Zhou, Yubing Li

This paper addresses time-domain identification of the interaction parameters of a heterogeneous networked dynamic system (NDS), with each of its subsystems being described by a continuous-time descriptor quadratic-bilinear time-invariant (QBTI) model. The obtained results can also be applied to parameter estimation for a lumped QBTI system. No restrictions are put on the sampling rate. Explicit formulas are derived for the transient and the steady-state responses of the NDS, provided that the probing signal is generated by a linear time-invariant (LTI) system. Some relations have been derived between the NDS steady-state response and its frequency-domain input-output mappings. These relations reveal that the value of some NDS-associated generalized TFMs can in principle be estimated at almost any point of interest on the imaginary axis from time-domain input-output experimental data, as well as its derivatives and a right tangential interpolation along an arbitrary direction. Based on these relations, estimation algorithms are suggested for the parameters of the NDS and for the values of these generalized TFMs. A numerical example is included to illustrate characteristics of the suggested estimation algorithms.

这一纸张攻击时间- 确定一个多式网络动态系统(NDS)的互动参数,其每个子系统都用一个连续时间描述描述描述标注的二次曲线- 分贝- 时间变化模型( QBTI) 。 所获得的结果还可用于对一个拼凑的 QBTI 系统的参数估计。 对取样率没有限制。 NDS 的瞬态和稳定状态反应将分别得出明确的公式,条件是探测信号是由一个线性变异( LTI) 系统生成的。 NDS 稳定状态反应与其频率域域内输入- 输出图绘制之间的某些关系已经产生。 这些关系表明,一些与NDS 相关的通用 TFM 原则上可以在时间- 部输入- 输出实验数据的想象轴几乎任何感兴趣的点上估计其衍生物和任意方向的右切线间推算值。 根据这些关系,建议对 NDS 参数和这些通用 TFMM 值的数值分别进行估算算算。


Article 39

Title@2025-06-29 (7): Research on Comprehensive Classroom Evaluation System Based on Multiple AI Models

Title: Research on Comprehensive Classroom Evaluation System Based on Multiple AI Models Forschung zum umfassenden Klassenraum-Bewertungssystem auf der Grundlage mehrerer KI-Modelle 基于多种AI模式的综合课堂评价系统研究 2506.23079v1

Authors (4): Cong Xie, Li Yang, Daben Wang, Jing Xiao

The promotion of the national education digitalization strategy has facilitated the development of teaching quality evaluation towards all-round, process-oriented, precise, and intelligent directions, inspiring explorations into new methods and technologies for educational quality assurance. Classroom teaching evaluation methods dominated by teaching supervision and student teaching evaluation suffer from issues such as low efficiency, strong subjectivity, and limited evaluation dimensions. How to further advance intelligent and objective evaluation remains a topic to be explored. This paper, based on image recognition technology, speech recognition technology, and AI large language models, develops a comprehensive evaluation system that automatically generates evaluation reports and optimization suggestions from two dimensions: teacher teaching ability and classroom teaching effectiveness. This study establishes a closed-loop classroom evaluation model that comprehensively evaluates student and teaching conditions based on multi-dimensional data throughout the classroom teaching process, and further analyzes the data to guide teaching improvement. It meets the requirements of all-round and process-oriented classroom evaluation in the era of digital education, effectively solves the main problems of manual evaluation methods, and provides data collection and analysis methods as well as technologies for relevant research on educational teaching evaluation.

国家教育数字化战略的推广促进了面向全方位、过程导向、准确和智能方向的教学质量评估的发展,激励了对教育质量保证新方法和技术的探索;以教学监督和学生教学评估为主的课堂教学评估方法存在低效率、强主观性和有限的评估层面等问题;如何进一步推动智能和客观评估仍然是一个有待探讨的专题;该文件以图像识别技术、语音识别技术和全方位语言模型为基础,开发了一个全面评估系统,从两个方面自动生成评估报告和优化建议:教师教学能力和课堂教学效力;该研究建立了一个封闭式课堂评估模式,根据整个课堂教学过程的多维数据全面评估学生和教学条件,并进一步分析数据以指导教学的改进;它满足了数字教育时代全方位和面向进程的课堂评估的要求,有效解决了手工评估方法的主要问题,提供了数据收集和分析方法以及教育教学评估相关研究的技术。


Article 40

Title@2025-06-28 (6): Evaluating Agents using Social Choice Theory

Title: Evaluating Agents using Social Choice Theory Bewertung von Agenten anhand der Theorie der sozialen Wahl 使用社会选择理论评估代理人 2312.03121v4

Authors (9): Marc Lanctot, Kate Larson, Yoram Bachrach, Luke Marris, Zun Li, Avishkar Bhoopchand, Thomas Anthony, Brian Tanner, Anna Koop

We argue that many general evaluation problems can be viewed through the lens of voting theory. Each task is interpreted as a separate voter, which requires only ordinal rankings or pairwise comparisons of agents to produce an overall evaluation. By viewing the aggregator as a social welfare function, we are able to leverage centuries of research in social choice theory to derive principled evaluation frameworks with axiomatic foundations. These evaluations are interpretable and flexible, while avoiding many of the problems currently facing cross-task evaluation. We apply this Voting-as-Evaluation (VasE) framework across multiple settings, including reinforcement learning, large language models, and humans. In practice, we observe that VasE can be more robust than popular evaluation frameworks (Elo and Nash averaging), can discover properties in the evaluation data that are not evident from scores alone, and can predict outcomes better than Elo in a complex seven-player game. We identify one particular approach, maximal lotteries, that satisfies important consistency properties relevant to evaluation, is computationally efficient (polynomial in the size of the evaluation data), and identifies game-theoretic cycles.

我们认为,许多一般性评价问题可以通过投票理论的透镜来看待。每个任务都被解释为一个单独的选民,只需要对代理人进行分级或对称比较,才能产生总体评价。通过将聚合体视为社会福利功能,我们能够利用社会选择理论的数百年研究,得出有原则的评价框架,并具有分义基础。这些评价是可解释的和灵活的,同时避免目前跨任务评价所面临的许多问题。我们在多个场合,包括强化学习、大语言模型和人类,应用这个“投票时估价”框架。在实践中,我们观察到“VASE”比大众评价框架(Elo和Nash平均)更强大,能够发现评价数据中的特性,不能单从分数中看出来,在复杂的七人游戏中可以比Elo更好地预测结果。我们确定了一种特定的方法,即最大彩票,能够满足与评价相关的重要一致性特性,是计算有效的(评价数据规模的极多数值),并查明游戏-理论周期。
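
The "tasks as voters" idea above can be illustrated with a minimal sketch. It assumes each task yields a full ranking of the same agents and uses the Copeland rule, a standard and simple voting rule swapped in here for illustration; it is not the maximal-lotteries method the abstract highlights. The agent names and rankings are hypothetical.

```python
from itertools import combinations


def pairwise_margins(rankings, agents):
    """margin[a][b] = (# tasks ranking a above b) - (# tasks ranking b above a)."""
    margin = {a: {b: 0 for b in agents} for a in agents}
    for ranking in rankings:
        position = {agent: i for i, agent in enumerate(ranking)}
        for a, b in combinations(agents, 2):
            sign = 1 if position[a] < position[b] else -1
            margin[a][b] += sign
            margin[b][a] -= sign
    return margin


def copeland_scores(margin):
    """Copeland score: pairwise wins minus pairwise losses for each agent."""
    return {
        a: sum((margin[a][b] > 0) - (margin[a][b] < 0) for b in margin if b != a)
        for a in margin
    }


def condorcet_winner(margin):
    """Return the agent that beats every other agent head-to-head, or None."""
    for a in margin:
        if all(margin[a][b] > 0 for b in margin if b != a):
            return a
    return None


# Hypothetical data: three tasks (voters), each ranking three agents best-first.
agents = ["A", "B", "C"]
task_rankings = [["A", "B", "C"], ["A", "C", "B"], ["B", "A", "C"]]
margin = pairwise_margins(task_rankings, agents)
scores = copeland_scores(margin)
winner = condorcet_winner(margin)

# A cyclic profile (A beats B, B beats C, C beats A) has no Condorcet winner.
cyclic = [["A", "B", "C"], ["B", "C", "A"], ["C", "A", "B"]]
cyclic_winner = condorcet_winner(pairwise_margins(cyclic, agents))
```

Only ordinal information enters the aggregation, so tasks with incomparable score scales can be combined without normalization. The cyclic profile shows the kind of game-theoretic cycle that a score-based rating would hide.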


Article 41

Title@2025-06-28 (6): A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems

Title: A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems Eine großsprachige modellfähige Steuerungsarchitektur für dynamische Ressourcenkapazitäts-Exploration in Multi-Agent-Produktionssystemen 多机构制造系统动态资源能力探索大语言模型化控制结构 2505.22814v2

Authors (2): Jonghan Lim, Ilya Kovalenko

Manufacturing environments are becoming more complex and unpredictable due to factors such as demand variations and shorter product lifespans. This complexity requires real-time decision-making and adaptation to disruptions. Traditional control approaches show limited responsiveness in dynamic industrial settings, which highlights the need for advanced control strategies capable of overcoming unforeseen challenges. Multi-agent systems address these challenges through decentralization of decision-making, enabling systems to respond dynamically to operational changes. However, current multi-agent systems encounter challenges related to real-time adaptation, context-aware decision-making, and the dynamic exploration of resource capabilities. Large language models provide the possibility to overcome these limitations through context-aware decision-making capabilities. This paper introduces a large language model-enabled control architecture for multi-agent manufacturing systems to dynamically explore resource capabilities in response to real-time disruptions. A simulation-based case study demonstrates that the proposed architecture improves system resilience and flexibility. The case study findings show improved throughput and efficient resource utilization compared to existing approaches.

由于需求变化和产品寿命缩短等因素,制造环境变得更加复杂和不可预测。这种复杂性要求实时决策和适应干扰。传统控制方法强调,需要制定能够克服意外挑战的先进控制战略,因为这些战略表明在动态工业环境中反应能力有限。多试剂系统通过下放决策权来应对这些挑战,使系统能够对业务变化作出动态反应。然而,目前的多试剂系统遇到与实时适应、环境意识决策和动态开发资源能力有关的挑战。大型语言模型提供了通过具备环境意识的决策能力克服这些限制的可能性。本文为多剂制造系统引入了大型语言模型化控制结构,以动态地探索资源能力以应对实时干扰。模拟案例研究表明,拟议的结构提高了系统的复原力和灵活性。案例研究结果表明,与现有方法相比,吞吐量和高效利用资源的情况有所改善。


Article 42

Title@2025-06-28 (6): Resilient-Native and Intelligent Next-Generation Wireless Systems: Key Enablers, Foundations, and Applications

Title: Resilient-Native and Intelligent Next-Generation Wireless Systems: Key Enablers, Foundations, and Applications Resilient-Native und intelligente Mobilfunksysteme der nächsten Generation: Key Enabler, Grundlagen und Anwendungen 具有弹性的、有弹性的和智能的下一级无线无线系统:关键启用器、基础和应用 2506.22991v1

Authors (6): Mehdi Bennis, Sumudu Samarakoon, Tamara Alshammari, Chathuranga Weeraddana, Zhoujun Tian, Chaouki Ben Issaid

Just like power, water, and transportation systems, wireless networks are a crucial societal infrastructure. As natural and human-induced disruptions continue to grow, wireless networks must be resilient. This requires them to withstand and recover from unexpected adverse conditions, shocks, unmodeled disturbances and cascading failures. Unlike robustness and reliability, resilience is based on the understanding that disruptions will inevitably happen. Resilience as elasticity focuses on the ability to bounce back to favorable states, while resilience as plasticity involves agents and networks that can flexibly expand their states and hypotheses through real-time adaptation and reconfiguration. This situational awareness and active preparedness, adapting world models and counterfactually reasoning about potential system failures and the best responses, is a core aspect of resilience. This article will first disambiguate resilience from reliability and robustness, before delving into key mathematical foundations of resilience grounded in abstraction, compositionality and emergence. Subsequently, we focus our attention on a plethora of techniques and methodologies pertaining to the unique characteristics of resilience, as well as their applications through a comprehensive set of use cases. Ultimately, the goal of this paper is to establish a unified foundation for understanding, modeling, and engineering resilience in wireless communication systems, while laying a roadmap for the next generation of resilient-native and intelligent wireless systems.

与电力、水和运输系统一样,无线网络是一个至关重要的社会基础设施。随着自然和人为干扰的继续增长,无线网络必须具有抗御力。这要求它们抵御和从意外的不利条件、冲击、未建模的扰动和连锁故障中恢复过来。与强健和可靠性不同,复原力的基础在于破坏将不可避免地发生。作为弹性,复原力侧重于向有利国家回弹的能力,而作为可塑性,复原力涉及能够通过实时适应和重组灵活扩展其状态和假设的代理和网络。这种态势感知和积极准备、调整世界模型和反事实推论潜在系统故障和最佳反应,是复原力的一个核心方面。本文章首先将脱离可靠性和稳健性,然后根据抽象性、构成性和出现情况,进入复原力的关键数学基础。随后,我们集中关注与复原力独特特点有关的大量技术和方法,并通过综合使用案例来应用这些技术和方法。最终,本文的目标是为理解、建模、建模和建模、建模和建模性系统建立统一的基础。


Article 43

Title@2025-06-28 (6): Detection of coordinated fleet vehicles in route choice urban games. Part I. Inverse fleet assignment theory

Title: Detection of coordinated fleet vehicles in route choice urban games. Part I. Inverse fleet assignment theory Ermittlung koordinierter Flottenfahrzeuge bei der Routenwahl urbane Spiele. Teil I Inverse Flottenzuteilungstheorie 在选择路线选择城市游戏中发现协调一致的车队车辆。 2506.22966v1

Authors (2): Grzegorz Jamróz, Rafał Kucharski

Detection of collectively routing fleets of vehicles in future urban systems may become important for the management of traffic, as such routing may destabilize urban networks leading to deterioration of driving conditions. Accordingly, in this paper we discuss the question whether it is possible to determine the flow of fleet vehicles on all routes given the fleet size and behaviour as well as the combined total flow of fleet and non-fleet vehicles on every route. We prove that the answer to this Inverse Fleet Assignment Problem is ‘yes’ for myopic fleet strategies which are more ‘selfish’ than ‘altruistic’, and ‘no’ otherwise, under mild assumptions on route/link performance functions. To reach these conclusions we introduce the forward fleet assignment operator and study its properties, proving that it is invertible for ‘bad’ objectives of fleet controllers. We also discuss the challenges of implementing myopic fleet routing in the real world and compare it to Stackelberg and Nash routing. Finally, we show that optimal Stackelberg fleet routing could involve highly variable mixed strategies in some scenarios, which would likely cause chaos in the traffic network.

因此,我们在本文件中讨论这样一个问题:鉴于车队的规模和行为,是否可以确定所有航线的车队流动情况,以及每条航线上车队和非车队车辆总流动情况,是否可以确定车队车辆的流动情况。我们证明,这一反车队任务问题的答案是“是”远洋车队战略的“是”而不是“不rutic”和“否”战略,这些战略在路线/连接性功能的微小假设下更“自私”而不是“不透明”。为了达成这些结论,我们介绍远洋车队分配操作员并研究其特性,证明这对机队控制员的“坏”目标来说是不可忽略的。我们还讨论在现实世界中执行我的机动车队路线和不随风车车辆总流动的挑战,并将它与斯塔克勒伯格和纳什路由作比较。最后,我们表明,最佳斯塔克尔堡车队的路线安排可能在某些情形中涉及高度不同的混合战略,这可能造成交通网络混乱。


Article 44

Title@2025-06-28 (6): Agent-to-Agent Theory of Mind: Testing Interlocutor Awareness among Large Language Models

Title: Agent-to-Agent Theory of Mind: Testing Interlocutor Awareness among Large Language Models Agent-to-Agent Theorie des Geistes: Testen Gesprächspartner Bewusstsein unter großen Sprachmodellen 精神感官理论:测试大语言模型间对话者的认识 2506.22957v1

Authors (4): Younwoo Choi, Changling Li, Yongjin Yang, Zhijing Jin

As large language models (LLMs) are increasingly integrated into multi-agent and human-AI systems, understanding their awareness of both self-context and conversational partners is essential for ensuring reliable performance and robust safety. While prior work has extensively studied situational awareness, which refers to an LLM’s ability to recognize its operating phase and constraints, it has largely overlooked the complementary capacity to identify and adapt to the identity and characteristics of a dialogue partner. In this paper, we formalize this latter capability as interlocutor awareness and present the first systematic evaluation of its emergence in contemporary LLMs. We examine interlocutor inference across three dimensions (reasoning patterns, linguistic style, and alignment preferences) and show that LLMs reliably identify same-family peers and certain prominent model families, such as GPT and Claude. To demonstrate its practical significance, we develop three case studies in which interlocutor awareness both enhances multi-LLM collaboration through prompt adaptation and introduces new alignment and safety vulnerabilities, including reward-hacking behaviors and increased jailbreak susceptibility. Our findings highlight the dual promise and peril of identity-sensitive behavior in LLMs, underscoring the need for further understanding of interlocutor awareness and new safeguards in multi-agent deployments. Our code is open-sourced at https://github.com/younwoochoi/InterlocutorAwarenessLLM.

由于大型语言模型(LLMs)日益被纳入多代理人和人类-投资者系统,因此了解他们对自己和对话伙伴的认识对于确保可靠业绩和稳健安全至关重要。虽然先前的工作已广泛研究情况认识,其中提到LLM有能力认识其运作阶段和制约因素,但在很大程度上忽视了确定和适应对话伙伴特性和特点的补充能力。在本文件中,我们将后一种能力正式确定为对话者认识,并介绍对当代LMs出现的第一次系统评价。我们研究了对话者在三个层面的判断模式、语言风格和调整偏好方面的判断,并表明LLMs可靠地识别了同一家庭同龄人和某些突出的模范家庭,如GPT和Claude。为了表明其实际意义,我们编写了三个案例研究,其中对话者认识通过迅速适应加强多LLM的协作,并引入新的调整和安全脆弱性,包括奖励性行为和增加破狱风险。我们的调查结果强调了LMSMs的双重承诺和风险,强调需要进一步理解多代理人部署中的中间意识和新保障措施。我们的准则是开放的。


Article 45

Title@2025-06-28 (6): Neural Cellular Automata: From Cells to Pixels

Title: Neural Cellular Automata: From Cells to Pixels Neurale Zelluläre Automaten: Von Zellen zu Pixeln 神经细胞自定义数据: 从单元格到像素 2506.22899v1

Authors (6): Ehsan Pajouheshgar, Yitao Xu, Ali Abbasi, Alexander Mordvintsev, Wenzel Jakob, Sabine Süsstrunk

Neural Cellular Automata (NCAs) are bio-inspired systems in which identical cells self-organize to form complex and coherent patterns by repeatedly applying simple local rules. NCAs display striking emergent behaviors including self-regeneration, generalization and robustness to unseen situations, and spontaneous motion. Despite their success in texture synthesis and morphogenesis, NCAs remain largely confined to low-resolution grids. This limitation stems from (1) training time and memory requirements that grow quadratically with grid size, (2) the strictly local propagation of information which impedes long-range cell communication, and (3) the heavy compute demands of real-time inference at high resolution. In this work, we overcome this limitation by pairing NCA with a tiny, shared implicit decoder, inspired by recent advances in implicit neural representations. Following NCA evolution on a coarse grid, a lightweight decoder renders output images at arbitrary resolution. We also propose novel loss functions for both morphogenesis and texture synthesis tasks, specifically tailored for high-resolution output with minimal memory and computation overhead. Combining our proposed architecture and loss functions brings substantial improvement in quality, efficiency, and performance. NCAs equipped with our implicit decoder can generate full-HD outputs in real time while preserving their self-organizing, emergent properties. Moreover, because each MLP processes cell states independently, inference remains highly parallelizable and efficient. We demonstrate the applicability of our approach across multiple NCA variants (on 2D, 3D grids, and 3D meshes) and multiple tasks, including texture generation and morphogenesis (growing patterns from a seed), showing that with our proposed framework, NCAs seamlessly scale to high-resolution outputs with minimal computational overhead.

神经细胞自闭器(NCAs)是具有生物启发性的系统,在这种系统中,相同的细胞自我组织起来,通过反复应用简单的当地规则,形成复杂和一致的模式。 NCA表现出惊人的突发行为,包括自我再生、一般化和对不可见环境的稳健性和自发运动。尽管在质合成和摩擦形成方面取得了成功, NCA(NCA)在很大程度上仍然局限于低分辨率网格,这种限制源于:(1) 培训时间和记忆要求随着网格大小的二次增长,(2) 严格的当地信息传播,阻碍远程细胞通信,(3) 高分辨率实时推断的强烈需求。在此工作中,我们克服了这种限制,将NCA与微小的、共同的隐含式变异器配对在一起,受最近隐含的内心结构表现的启发。 在低分辨率电网格上,轻度解解码解码解码解码使图像得到任意解析。 我们还提议了多项新的损失功能,具体针对高分辨率的高分辨率的内存和计算输出,以及高分辨率的内径推断。 将我们的拟议结构与高分辨率的内置的内置的内置的内置的内径和内置的内置的内存和内存和内置的内置的内置的内置的内置的内置的内置的内存和内置的内置的内置的内置的内置的内置的内置的内置的内置的内存和内置的内置的内存和内置的内置的内存和内置的内置的内置的内存和内置性能。


Article 46

Title@2025-06-28 (6): Cooperation as Black Box: Conceptual Fluctuation and Diagnostic Tools for Misalignment in MAS

Title: Cooperation as Black Box: Conceptual Fluctuation and Diagnostic Tools for Misalignment in MAS Kooperation als Black Box: Konzeptionelle Fluktuation und Diagnosetools für Fehlausrichtung in MAS 合作作为黑箱:MAS中不协调的概念波动和诊断工具 2506.22876v1

Authors (2): Shayak Nandi, Fernanda M. Eliott

Misalignment in multi-agent systems (MAS) is often treated as a technical failure; yet many such failures originate upstream, during the conceptual design phase, where semantic ambiguity and normative projection take place. This paper identifies a foundational source of interpretive misalignment in MAS: the systemic conflation of cooperation and coordination, and the moral overreading that follows. Using the Rabbit-Duck illusion, we illustrate how perspective-dependent readings of agent behavior can create epistemic instability. To address this, we introduce the Misalignment Mosaic, a framework for diagnosing meaning-level misalignment in MAS. It comprises four components: (1) Terminological Inconsistency, (2) Concept-to-Code Decay, (3) Morality as Cooperation, and (4) Interpretive Ambiguity. The Mosaic enables researchers to examine how misalignment arises not only through policy or reward structures but also through language, framing, and design assumptions. While this paper focuses on the specific ambiguity between coordination and cooperation, the Mosaic generalizes to other overloaded concepts in MAS, such as alignment, autonomy, and trust. Rather than define cooperation once and for all, we offer a framework to diagnose meaning itself as a source of misalignment.

多试剂系统(MAS)的不匹配往往被视为技术故障;然而,许多此类不匹配通常被视为技术故障;然而,在概念设计阶段,在概念设计阶段,即语义模糊和规范性预测发生时,许多此类不匹配起源于上游;本文件确定了MAS中解释性不匹配的基本根源:合作与协调的系统性整合,以及随后的道德过度解读;我们利用Rabbit-Duck的幻觉,说明对代理行为进行基于视角的解读,不仅通过政策或奖赏结构,而且通过语言、框架和设计假设,可以造成认知性不稳定;为了解决这个问题,我们引入了Mosaliment Mosaic,这是一个诊断性框架,用以诊断MAS中意义上意义层次不匹配的诊断性判断性诊断性框架;它由四个部分组成:1. 术语不一致,2. 概念-Code Decay,3. 道德作为合作的系统组合,以及4. 道德上的过度解读性模糊性。MACS使研究人员能够研究,不仅通过政策或奖赏结构,而且通过语言、框架和设计假设,而且通过语言、框架和设计来分析,如何区分不协调性,而且一旦成为我们自主性定义了其他过度性概念的源。


Article 47

Title@2025-06-28 (6): Momentum-based Accelerated Algorithm for Distributed Optimization under Sector-Bound Nonlinearity

Title: Momentum-based Accelerated Algorithm for Distributed Optimization under Sector-Bound Nonlinearity Momentumbasierte beschleunigte Algorithmen zur verteilten Optimierung unter sektorübergreifender Nichtlinearität 部门-基于动力的在部门-健全非线性下分配的优化分配加速计算 2506.22855v1

Authors (2): Mohammadreza Doostmohammadian, Hamid R. Rabiee

Distributed optimization advances centralized machine learning methods by enabling parallel and decentralized learning processes over a network of computing nodes. This work provides an accelerated consensus-based distributed algorithm for locally non-convex optimization using the gradient-tracking technique. The proposed algorithm (i) improves the convergence rate by adding momentum towards the optimal state using the heavy-ball method, while (ii) addressing general sector-bound nonlinearities over the information-sharing network. The link nonlinearity includes any sign-preserving odd sector-bound mapping, for example, log-scale data quantization or clipping in practical applications. For admissible momentum and gradient-tracking parameters, using perturbation theory and eigen-spectrum analysis, we prove convergence even in the presence of sector-bound nonlinearity and for locally non-convex cost functions. Further, in contrast to most existing weight-stochastic algorithms, we adopt a weight-balanced (WB) network design. This WB design and the perturbation-based analysis make it possible to handle dynamic directed networks of agents and thus to address possible time-varying setups due to link failures or packet drops.

通过在计算节点网络上建立平行和分散的学习过程,分散优化优化的中央机器学习方法,从而在计算节点网络上实现平行和分散的学习过程。这项工作提供了一种加速的基于共识的分布算法,用于使用梯度跟踪技术实现当地非曲线优化。提议的算法(一)通过利用重球法增加向最佳状态发展的势头,提高趋同率,同时(二)解决在信息共享网络上一般部门非线性的问题。链接非线性包括任何标值保留奇异的部门分布绘图,例如,对日志尺度数据进行定量或对实际应用进行剪切。对于可接受的势头和梯度跟踪参数,我们利用扰动理论和eigen光谱分析,证明即使在存在部门性非直线性和当地非碳度成本功能的情况下,我们也会达到趋同率。此外,与大多数现有的加权组合算法相比,我们采用了权重平衡(WB)网络设计。这种WB设计和基于扰动的分析可以处理动态引导的代理网络,以解决因链接失败或包装下降而可能造成的时间波动的设置。
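
To make the notion of a sign-preserving, odd, sector-bound link nonlinearity concrete, here is a minimal sketch of a log-scale quantizer. The function names and the step parameter `rho` are assumptions for illustration, not taken from the paper. The quantizer rounds ln|x| to the nearest multiple of `rho`, so the ratio q(x)/x always lies in the sector [e^(-rho/2), e^(rho/2)].

```python
import math


def log_quantize(x: float, rho: float = 0.25) -> float:
    """Logarithmic quantizer: round ln|x| to the nearest multiple of rho.

    The map is odd and sign-preserving, and q(0) = 0.
    """
    if x == 0.0:
        return 0.0
    level = round(math.log(abs(x)) / rho)
    return math.copysign(math.exp(rho * level), x)


def sector_bounds(rho: float):
    """Lower and upper slopes of the sector containing the quantizer graph."""
    return math.exp(-rho / 2), math.exp(rho / 2)


rho = 0.25
lower, upper = sector_bounds(rho)
samples = [0.01, 0.5, 1.0, 3.7, 120.0]
ratios = [log_quantize(x, rho) / x for x in samples]
```

Because the rounding error in ln|x| is at most rho/2, every ratio stays between `lower` and `upper`, which is the sector condition the convergence analysis in the abstract relies on.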


Article 48

Title@2025-06-28 (6): Consensus seeking in diffusive multidimensional networks with a repeated interaction pattern and time-delays

Title: Consensus seeking in diffusive multidimensional networks with a repeated interaction pattern and time-delays Konsenssuche in diffusen multidimensionalen Netzwerken mit wiederholtem Interaktionsmuster und Zeitverzögerungen 寻求共识,在反复互动模式和拖延时间的多维网络中寻求共识 2402.15677v2

Authors (5): Hoang Huy Vu, Quyen Ngoc Nguyen, Tuynh Van Pham, Chuong Van Nguyen, Minh Hoang Trinh

This paper studies a consensus problem in multidimensional networks having the same agent-to-agent interaction pattern under both intra- and cross-layer time delays. Several conditions for the agents to asymptotically reach a consensus are derived, which involve the overall network’s structure, the local interacting pattern, and the assumptions specified on the time delays. The validity of these conditions is proved by direct eigenvalue evaluation and supported by numerical simulations.

本文研究的是多个网络的共识问题,这些网络在系统内和跨层时间延误的情况下都存在着相同的代理和代理相互作用模式,由此得出了代理不时达成共识的若干条件,涉及整个网络的结构、当地互动模式和关于时间延误的假设,这些条件的有效性通过直接的基因价值评估得到证明,并得到数字模拟的支持。
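
As a rough illustration of consensus under time delays, the sketch below simulates a much simpler setting than the one studied in the paper: scalar states, a single layer, a discrete-time update, and one uniform delay on neighbour information. The ring topology, step size `eps`, delay, and initial values are all assumptions chosen for the example.

```python
def simulate_delayed_consensus(x0, neighbors, eps=0.25, delay=2, steps=300):
    """Iterate x_i(t+1) = x_i(t) + eps * sum_j (x_j(t - delay) - x_i(t)).

    The state is held constant at x0 for all times before the start.
    """
    # history[-1] is the current state, history[0] is the state `delay` steps ago.
    history = [list(x0) for _ in range(delay + 1)]
    for _ in range(steps):
        current = history[-1]
        delayed = history[0]
        nxt = [
            xi + eps * sum(delayed[j] - xi for j in neighbors[i])
            for i, xi in enumerate(current)
        ]
        history.append(nxt)
        history.pop(0)
    return history[-1]


# Hypothetical ring of four agents; each agent has two neighbours.
ring = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
x0 = [1.0, -2.0, 4.0, 0.5]
final = simulate_delayed_consensus(x0, ring)
spread = max(final) - min(final)
```

With `eps` times the node degree below 1, each update is a convex combination of current and delayed values, so the states remain within the initial range while the disagreement shrinks. The agreed value generally differs from the plain average of `x0` because of the delay.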


Article 49

Title@2025-06-27 (5): eCAV: An Edge-Assisted Evaluation Platform for Connected Autonomous Vehicles

Title: eCAV: An Edge-Assisted Evaluation Platform for Connected Autonomous Vehicles eCAV: Eine Edge Assisted Evaluation Platform für vernetzte autonome Fahrzeuge eCAV: 连接自治车辆的边缘辅助评价平台 2506.16535v2

Authors (7): Tyler Landle, Jordan Rapp, Dean Blank, Chandramouli Amarnath, Abhijit Chatterjee, Alexandros Daglis, Umakishore Ramachandran

As autonomous vehicles edge closer to widespread adoption, enhancing road safety through collision avoidance and minimization of collateral damage becomes imperative. Vehicle-to-everything (V2X) technologies, which include vehicle-to-vehicle (V2V), vehicle-to-infrastructure (V2I), and vehicle-to-cloud (V2C), are being proposed as mechanisms to achieve this safety improvement. Simulation-based testing is crucial for early-stage evaluation of Connected Autonomous Vehicle (CAV) control systems, offering a safer and more cost-effective alternative to real-world tests. However, simulating large 3D environments with many complex single- and multi-vehicle sensors and controllers is computationally intensive. There is currently no evaluation framework that can effectively evaluate realistic scenarios involving large numbers of autonomous vehicles. We propose eCAV – an efficient, modular, and scalable evaluation platform to facilitate both functional validation of algorithmic approaches to increasing road safety, as well as performance prediction of algorithms of various V2X technologies, including a futuristic Vehicle-to-Edge control plane and correspondingly designed control algorithms. eCAV can model up to 256 vehicles running individual control algorithms without perception enabled, which is $8\times$ more vehicles than what is possible with state-of-the-art alternatives.

随着自治车辆接近广泛采用,通过避免碰撞和尽量减少附带损害而加强道路安全就势在必行。车辆对一切技术,包括车辆对车辆、车辆对车辆、车辆对基础设施(V2V)、车辆对车辆对库(V2C),以及车辆对库(V2C),正在作为实现这一安全改善的机制提出建议。模拟测试对于及早评价连接的自动车辆控制系统(CAV)至关重要,为实际世界测试提供了更安全、更具有成本效益的替代方法。然而,以许多复杂的单一和多车辆传感器和控制器模拟大型三维环境是计算密集型的。目前没有能够有效评价涉及大量自主车辆的现实情景的评价框架。我们提议eCAV – – 一个高效、模块化和可扩展的评价平台,以便利对提高道路安全的算法方法进行功能验证,以及对各种V2X技术的算法进行性预测,包括一种更安全、更具有成本效益的车辆对地控制平面和相应设计的控制算法。eCAVAV-AV-AV-A-A-C-可以使个人对可能采用的代谢方法进行比其他车辆更接近于256-A型的车辆的代控制。


Article 50

Title@2025-06-27 (5): Toward Data Systems That Are Business Semantic Centric and AI Agents Assisted

Title: Toward Data Systems That Are Business Semantic Centric and AI Agents Assisted Auf dem Weg zu Datensystemen, die geschäftsführende semantische Centric- und KI-Agenten sind 建立具有商业语义中心和AI 辅助代理的数据系统 2506.05520v2

Authors (1): Cecil Pang

Contemporary businesses operate in dynamic environments requiring rapid adaptation to achieve goals and maintain competitiveness. Existing data platforms often fall short by emphasizing tools over alignment with business needs, resulting in inefficiencies and delays. To address this gap, I propose the Business Semantics Centric, AI Agents Assisted Data System (BSDS), a holistic system that integrates architecture, workflows, and team organization to ensure data systems are tailored to business priorities rather than dictated by technical constraints. BSDS redefines data systems as dynamic enablers of business success, transforming them from passive tools into active drivers of organizational growth. BSDS has a modular architecture that comprises curated data linked to business entities, a knowledge base for context-aware AI agents, and efficient data pipelines. AI agents play a pivotal role in assisting with data access and system management, reducing human effort, and improving scalability. Complementing this architecture, BSDS incorporates workflows optimized for both exploratory data analysis and production requirements, balancing speed of delivery with quality assurance. A key innovation of BSDS is its incorporation of the human factor. By aligning data team expertise with business semantics, BSDS bridges the gap between technical capabilities and business needs. Validated through real-world implementation, BSDS accelerates time-to-market for data-driven initiatives, enhances cross-functional collaboration, and provides a scalable blueprint for businesses of all sizes. Future research can build on BSDS to explore optimization strategies using complex systems and adaptive network theories, as well as developing autonomous data systems leveraging AI agents.

现有数据平台往往不尽人意,因为强调与商业需求相匹配的工具,从而导致效率低下和延误。为弥补这一差距,我提议企业语义中心、AI代理辅助数据系统(BSDS),这是一个综合结构、工作流程和团队组织的整体系统,它综合了结构、工作流程和团队组织,以确保数据系统符合业务优先事项,而不是技术制约因素的制约。 工商安全数据系统将数据系统重新定义为动态的促进企业成功因素,将其从被动工具转变为积极的组织增长驱动因素。 工商安全数据系统有一个模块架构,由与商业实体相联系的整理数据、具备环境意识的AI代理商的知识库以及高效的数据管道组成。 AI代理商在协助数据存取和系统管理、减少人类努力和提高可扩展性方面发挥着关键作用。 对这一架构进行补充,工商安全数据系统将优化工作流程用于探索性数据分析和生产要求,平衡交付速度和质量保证。 BSDS的一项关键创新是纳入人的因素。 通过将数据团队专门知识与企业语义学、BSDS连接,从而弥合技术访问和系统之间的鸿沟,从而加快了技术流流化战略,从而加快了全球数据流流化战略的执行。


Article 51

Title@2025-06-27 (5): Soft Condorcet Optimization for Ranking of General Agents

Title: Soft Condorcet Optimization for Ranking of General Agents Soft Condorcet Optimierung für das Ranking von General Agents 对一般代理人员排名的优化 2411.00119v4

Authors (10): Marc Lanctot, Kate Larson, Michael Kaisers, Quentin Berthet, Ian Gemp, Manfred Diaz, Roberto-Rafael Maura-Rivero, Yoram Bachrach, Anna Koop, Doina Precup

Driving progress of AI models and agents requires comparing their performance on standardized benchmarks; for general agents, individual performances must be aggregated across a potentially wide variety of different tasks. In this paper, we describe a novel ranking scheme inspired by social choice frameworks, called Soft Condorcet Optimization (SCO), to compute the optimal ranking of agents: the one that makes the fewest mistakes in predicting the agent comparisons in the evaluation data. This optimal ranking is the maximum likelihood estimate when evaluation data (which we view as votes) are interpreted as noisy samples from a ground truth ranking, a solution to Condorcet’s original voting system criteria. SCO ratings are maximal for Condorcet winners when they exist, which we show is not necessarily true for the classical rating system Elo. We propose three optimization algorithms to compute SCO ratings and evaluate their empirical performance. When serving as an approximation to the Kemeny-Young voting method, SCO rankings are on average 0 to 0.043 away from the optimal ranking in normalized Kendall-tau distance across 865 preference profiles from the PrefLib open ranking archive. In a simulated noisy tournament setting, SCO achieves accurate approximations to the ground truth ranking and the best among several baselines when 59% or more of the preference data is missing. Finally, SCO ranking provides the best approximation to the optimal ranking, measured on held-out test sets, in a problem containing 52,958 human players across 31,049 games of the classic seven-player game of Diplomacy.

驱动AI模型和代理商的进展需要比较其在标准化基准上的业绩; 对于普通代理商来说,个人业绩必须集中在可能多种多样的不同任务中。 在本文中,我们描述一个由社会选择框架(名为Soft Condorcet Optimization (SCO))启发的新型排名方案,以计算最佳代理商的排名:在预测评价数据中的代理商比较方面,最差错最小的排序。这种最佳排名是当评价数据(我们认为是选票)被解释为来自地面真理排名的杂音样本时,个人业绩必须被综合起来,这是Condorcet最初投票系统标准的一种解决方案。在存在时,SCO的评级是康多赢家的最高级游戏,我们并不表示对典型的评级系统Elo(SCO)具有一定的真实性。我们提出三种优化算法来计算代理商的最佳排名:在预测评估数据中的代理商比较中,在与Kemey-Young投票方法相比, 上,SCO的排名平均为0至0.043,在标准级排名中,在从PreLib公开排名的865位排名中,优者为最佳的优级排名中达到8,在SCO级排名中,最差级排名中,在排序中则提供最佳的排名中,最差的排名为39的排名为最差的排名为最低级排名。
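
The "optimal ranking" that SCO approximates, and the normalized Kendall-tau distance used to measure the gap, can be sketched by brute force on a tiny example. This is not the SCO algorithm itself, only the exact Kemeny-Young target it is compared against, and it is feasible only for a handful of agents. The votes below are hypothetical.

```python
from itertools import combinations, permutations


def kendall_tau_distance(r1, r2, normalized=True):
    """Count agent pairs ordered differently by two rankings of the same agents."""
    pos1 = {a: i for i, a in enumerate(r1)}
    pos2 = {a: i for i, a in enumerate(r2)}
    discordant = sum(
        1
        for a, b in combinations(r1, 2)
        if (pos1[a] - pos1[b]) * (pos2[a] - pos2[b]) < 0
    )
    n_pairs = len(r1) * (len(r1) - 1) // 2
    return discordant / n_pairs if normalized else discordant


def kemeny_ranking(votes):
    """Exhaustively find the ranking with the fewest pairwise disagreements."""
    agents = votes[0]
    return min(
        permutations(agents),
        key=lambda cand: sum(
            kendall_tau_distance(cand, vote, normalized=False) for vote in votes
        ),
    )


# Hypothetical votes: each evaluation ranks three agents best-first.
votes = [["A", "B", "C"], ["A", "C", "B"], ["B", "A", "C"]]
best = kemeny_ranking(votes)
gap = kendall_tau_distance(list(best), ["A", "C", "B"])
```

The exhaustive search grows factorially with the number of agents, which is the practical motivation for an optimization-based approximation such as the one the abstract describes.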


Article 52

Title@2025-06-27 (5): Exploring Modularity of Agentic Systems for Drug Discovery

Title: Exploring Modularity of Agentic Systems for Drug Discovery Erforschung der Modularität von Wirkstoffsystemen für die Drogenentdeckung 探索药物发现剂系统模式 2506.22189v1

Authors (4): Laura van Weesep, Samuel Genheden, Ola Engkvist, Jens Sjölund

Large language models (LLMs) and agentic systems present exciting opportunities to accelerate drug discovery and design. In this study, we critically examine the modularity of LLM-based agentic systems for drug discovery, i.e., whether parts of the agentic system such as the LLM are interchangeable, a topic that has received limited attention in drug discovery applications. We compare the performance of different LLMs and the effectiveness of tool-calling agents versus code-generating agents in this domain. Our case study, comparing performance in orchestrating tools for chemistry and drug discovery using an LLM-as-a-judge score, shows that Claude-3.5-Sonnet, Claude-3.7-Sonnet and GPT-4o outperform alternative language models such as Llama-3.1-8B, Llama-3.1-70B, GPT-3.5-Turbo, and Nova-Micro. Although we confirm that code-generating agents outperform the tool-calling ones on average, we show that this is highly question- and model-dependent. Furthermore, the impact of replacing system prompts is dependent on the specific question asked and the model used, underscoring that – even in this particular domain – one cannot just replace language models without considering prompt re-engineering. Our study highlights the necessity of further research into the modularity of agentic systems to enable the development of stable and scalable solutions for real-world problems.

大型语言模型(LLMS)和药剂系统为加速药物发现和设计提供了令人兴奋的机会。在本研究中,我们批判地检查了LLM药物发现制剂系统的模块化,即LLM等药剂系统的某些部分是否可互换,这是一个在药物发现应用方面受到关注有限的主题。我们比较了不同大语言模型(LLMS)的性能以及工具调用剂相对于该领域代码生成剂的效能。我们进行的案例研究比较,比较了使用LLM-as-a-judge分的化学和药物发现协调工具的性能。我们发现,LLM-as-a-a-judge分比较了LM-Sonnet、Claude-3.7-Sonnet和GPT-4o-PT-sperformat等药物发现系统,显示Claude-3.5-Sonnet、Claude-3.1-8B、Llama-3.1-370B、GPT-3.5-T-Turbo和Nova-Mic-Mic-Mic)替代语言模型模型模型模型模型的成型模型,但我们确认代码生成的代号系统比工具系统平均的特性系统要超越了工具的系统,我们所使用的一种周期性研究重点研究,因此无法再研究。我们所使用的一种周期性研究。我们确认的模型研究的模型研究的模型研究需要,无法再强调这一模型的精确性研究的精确性研究的精确性研究需要。我们所使用的一种特定的模型。


Article 53

Title@2025-06-27 (5): Programming Distributed Collective Processes in the eXchange Calculus

Title: Programming Distributed Collective Processes in the eXchange Calculus Programmierung verteilter kollektiver Prozesse im eXchange Calculus eXchange Calculus 中的程序编程分配集体进程 2401.11212v4

Authors (5): Giorgio Audrito, Roberto Casadei, Ferruccio Damiani, Gianluca Torta, Mirko Viroli

Recent trends like the Internet of Things (IoT) suggest a vision of dense and multi-scale deployments of computing devices in nearly all kinds of environments. A prominent engineering challenge revolves around programming the collective adaptive behaviour of such computational ecosystems. This requires abstractions able to capture concepts like ensembles (dynamic groups of cooperating devices) and collective tasks (joint activities carried out by ensembles). In this work, we consider collections of devices that interact with neighbours and execute in nearly-synchronised sense-compute-interact rounds, where the computation is given by a single program mapping sensing values and incoming messages to output and outgoing messages. To support programming whole computational collectives, we propose the abstraction of a distributed collective process, which can be used to define at once the ensemble formation logic and its collective task. We formalise the abstraction in the eXchange Calculus (XC), a core functional language based on neighbouring values (maps from neighbours to values) where state and interaction are handled through a single primitive, exchange, and provide a corresponding implementation in the FCPP language. Then, we exercise distributed collective processes using two case studies: multi-hop message propagation and distributed monitoring of spatial properties. Finally, we discuss the features of the abstraction and its suitability for different kinds of distributed computing applications.

在这项工作中,我们考虑与邻居发生互动的装置的集成,这些装置以近同步的感知和计算互动周期执行,计算方法是由一个单一程序绘制感测值和发送信息到输出和流出信息。为了支持整个计算集体的编程,我们提议一个分布式集体过程的抽象化,这个过程可以用来立即界定共性形成逻辑及其集体任务。我们把电子Xchange Calculus(XC)中的抽象化,这是一个基于相邻价值的核心功能语言(从邻居到价值观的图解),通过单一原始、交换处理国家和互动,并在FCPP语言中提供相应的执行。最后,我们利用两种案例研究,进行分布式集成的集体进程,并传播各种空间信息。最后,我们用两种案例研究的形式,进行集体分布式的数学特性。我们用两种案例研究来传播其空间信息。最后,我们用两种案例研究来传播空间信息。


Article 54

Title@2025-06-27 (5): SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model

Title: SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model SceneDiffuser++: City-Scale Verkehrssimulation über ein Generatives Weltmodell 景点Diffuser++:通过创世模式的城市规模交通量模拟 2506.21976v1

Authors (9): Shuhan Tan, John Lambert, Hong Jeon, Sakshum Kulshrestha, Yijing Bai, Jing Luo, Dragomir Anguelov, Mingxing Tan, Chiyu Max Jiang

The goal of traffic simulation is to augment a potentially limited amount of manually-driven miles that is available for testing and validation, with a much larger amount of simulated synthetic miles. The culmination of this vision would be a generative simulated city, where given a map of the city and an autonomous vehicle (AV) software stack, the simulator can seamlessly simulate the trip from point A to point B by populating the city around the AV and controlling all aspects of the scene, from animating the dynamic agents (e.g., vehicles, pedestrians) to controlling the traffic light states. We refer to this vision as CitySim, which requires an agglomeration of simulation technologies: scene generation to populate the initial scene, agent behavior modeling to animate the scene, occlusion reasoning, dynamic scene generation to seamlessly spawn and remove agents, and environment simulation for factors such as traffic lights. While some key technologies have been separately studied in various works, others such as dynamic scene generation and environment simulation have received less attention in the research community. We propose SceneDiffuser++, the first end-to-end generative world model trained on a single loss function capable of point A-to-B simulation on a city scale integrating all the requirements above. We demonstrate the city-scale traffic simulation capability of SceneDiffuser++ and study its superior realism under long simulation conditions. We evaluate the simulation quality on an augmented version of the Waymo Open Motion Dataset (WOMD) with larger map regions to support trip-level simulation.

交通仿真的目标是用数量大得多的合成仿真里程,来补充可用于测试和验证的、数量可能有限的人工驾驶里程。这一愿景的最终形态是一座生成式仿真城市:给定城市地图和自动驾驶汽车(AV)软件栈,仿真器能够在AV周围填充城市并控制场景的各个方面,从驱动动态智能体(如车辆、行人)到控制交通信号灯状态,从而无缝地仿真从A点到B点的行程。我们将这一愿景称为CitySim,它需要多种仿真技术的集成:用于填充初始场景的场景生成、用于驱动场景的智能体行为建模、遮挡推理、用于无缝生成和移除智能体的动态场景生成,以及针对交通信号灯等因素的环境仿真。虽然部分关键技术已在不同工作中被分别研究,但动态场景生成和环境仿真等技术在研究界受到的关注较少。我们提出SceneDiffuser++,这是首个以单一损失函数训练、能够在城市尺度上进行A点到B点仿真并集成上述全部要求的端到端生成式世界模型。我们展示了SceneDiffuser++的城市级交通仿真能力,并研究了其在长时间仿真条件下更高的真实度。我们在Waymo Open Motion Dataset(WOMD)的增强版本上评估仿真质量,该版本具有更大的地图区域以支持行程级仿真。


Article 55

Title@2025-06-27 (5): Mitigating Metropolitan Carbon Emissions with Dynamic Eco-driving at Scale

Title: Mitigating Metropolitan Carbon Emissions with Dynamic Eco-driving at Scale Mit dem dynamischen Öko-Fahren im Maßstab die Emissionen von Metropolitankohlenstoff mindern 减缓城市碳排放,在规模上进行动态生态驾驶 2408.05609v2

Authors (9): Vindula Jayawardana, Baptiste Freydt, Ao Qu, Cameron Hickert, Edgar Sanchez, Catherine Tang, Mark Taylor, Blaine Leonard, Cathy Wu

The sheer scale and diversity of transportation make it a formidable sector to decarbonize. Here, we consider an emerging opportunity to reduce carbon emissions: the growing adoption of semi-autonomous vehicles, which can be programmed to mitigate stop-and-go traffic through intelligent speed commands and, thus, reduce emissions. But would such dynamic eco-driving move the needle on climate change? A comprehensive impact analysis has been out of reach due to the vast array of traffic scenarios and the complexity of vehicle emissions. We address this challenge with large-scale scenario modeling efforts and by using multi-task deep reinforcement learning with a carefully designed network decomposition strategy. We perform an in-depth prospective impact assessment of dynamic eco-driving at 6,011 signalized intersections across three major US metropolitan cities, simulating a million traffic scenarios. Overall, we find that vehicle trajectories optimized for emissions can cut city-wide intersection carbon emissions by 11-22%, without harming throughput or safety, and with reasonable assumptions, equivalent to the national emissions of Israel and Nigeria, respectively. We find that 10% eco-driving adoption yields 25%-50% of the total reduction, and nearly 70% of the benefits come from 20% of intersections, suggesting near-term implementation pathways. However, the composition of this high-impact subset of intersections varies considerably across different adoption levels, with minimal overlap, calling for careful strategic planning for eco-driving deployments. Moreover, the impact of eco-driving, when considered jointly with projections of vehicle electrification and hybrid vehicle adoption remains significant. More broadly, this work paves the way for large-scale analysis of traffic externalities, such as time, safety, and air quality, and the potential impact of solution strategies.

交通运输的巨大规模和多样性使其成为难以脱碳的部门。在此,我们考虑一个减少碳排放的新机遇:半自动驾驶车辆的日益普及,这些车辆可以通过智能速度指令来缓解走走停停的交通,从而减少排放。但这种动态生态驾驶能否对气候变化产生实质影响?由于交通场景种类繁多且车辆排放十分复杂,全面的影响分析一直难以实现。我们通过大规模场景建模,并采用多任务深度强化学习和精心设计的网络分解策略来应对这一挑战。我们对美国三个主要大都市的6,011个信号交叉口的动态生态驾驶进行了深入的前瞻性影响评估,仿真了一百万个交通场景。总体而言,我们发现针对排放优化的车辆轨迹可将全市交叉口碳排放减少11-22%,且不损害通行能力或安全;在合理假设下,这分别相当于以色列和尼日利亚的全国排放量。我们发现,10%的生态驾驶采用率可带来总减排量的25%-50%,并且近70%的收益来自20%的交叉口,这提示了近期可行的实施路径。然而,这一高影响交叉口子集的构成在不同采用水平下差异很大,重叠极少,因此生态驾驶的部署需要审慎的战略规划。此外,即使与车辆电动化和混合动力车辆普及的预测一并考虑,生态驾驶的影响仍然显著。更广泛地说,这项工作为大规模分析交通外部性(如时间、安全和空气质量)以及各类解决策略的潜在影响铺平了道路。


Article 56

Title@2025-06-27 (5): Design of A* based heuristic algorithm for efficient interdiction in multi-Layer networks

Title: Design of A* based heuristic algorithm for efficient interdiction in multi-Layer networks Entwurf eines auf A* basierenden heuristischen Algorithmus für effizientes Interdiction in Multi-Layer-Netzwerken 设计基于A* 的超值算法,以有效阻截多路网络 2506.10017v3

Authors (1): Sukanya Samanta

Intercepting a criminal using limited police resources presents a significant challenge in dynamic crime environments, where the criminal’s location continuously changes over time. The complexity is further heightened by the vastness of the transportation network. To tackle this problem, we propose a layered graph representation, in which each time step is associated with a duplicate of the transportation network. For any given set of attacker strategies, a near-optimal defender strategy is computed using the A-Star heuristic algorithm applied to the layered graph. The defender’s goal is to maximize the probability of successful interdiction. We evaluate the performance of the proposed method by comparing it with a Mixed-Integer Linear Programming (MILP) approach used for the defender. The comparison considers both computational efficiency and solution quality. The results demonstrate that our approach effectively addresses the complexity of the problem and delivers high-quality solutions within a short computation time.

在动态犯罪环境中,罪犯的位置随时间不断变化,利用有限的警力拦截罪犯是一项重大挑战。交通网络的庞大规模进一步加剧了问题的复杂性。为解决这一问题,我们提出一种分层图表示,其中每个时间步都对应交通网络的一个副本。对于任意给定的攻击者策略集合,我们在分层图上应用A*启发式算法来计算近似最优的防御者策略。防御者的目标是最大化成功拦截的概率。我们将所提方法与用于防御者的混合整数线性规划(MILP)方法进行比较,以评估其性能。比较同时考虑计算效率和解的质量。结果表明,我们的方法能有效应对问题的复杂性,并在较短的计算时间内给出高质量的解。
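
The layered-graph idea in the abstract above can be illustrated with a small sketch. It is not the paper's algorithm: it assumes a 4-connected grid instead of a real transportation network, a single known attacker path instead of a set of attacker strategies, and earliest interception as the objective instead of interception probability. The function name `intercept` and the heuristic are hypothetical choices made for this illustration. Each search node is a `(cell, time)` pair, which corresponds to one copy of the network per time step.

```python
import heapq

def intercept(width, height, defender, attacker_path):
    """A* over (cell, time) nodes: earliest step at which the defender
    stands on the attacker's cell. Returns the defender path or None."""
    horizon = len(attacker_path) - 1

    def h(cell, t):
        # Both sides move at most one cell per step, so the Manhattan gap
        # shrinks by at most 2 per step: ceil(gap / 2) is a lower bound.
        ax, ay = attacker_path[t]
        gap = abs(cell[0] - ax) + abs(cell[1] - ay)
        return (gap + 1) // 2

    # Heap entries: (f = t + h, elapsed steps t, cell, path so far).
    frontier = [(h(defender, 0), 0, defender, [defender])]
    seen = set()
    while frontier:
        _, t, cell, path = heapq.heappop(frontier)
        if cell == attacker_path[t]:
            return path  # interception in layer t
        if (cell, t) in seen or t == horizon:
            continue
        seen.add((cell, t))
        x, y = cell
        # Edges to the next layer: wait in place or move to a neighbour.
        for nx, ny in ((x, y), (x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
            if 0 <= nx < width and 0 <= ny < height:
                nxt = (nx, ny)
                heapq.heappush(
                    frontier,
                    (t + 1 + h(nxt, t + 1), t + 1, nxt, path + [nxt]),
                )
    return None
```

The sketch only detects interception when both parties occupy the same cell at the same step; it does not detect the two swapping cells along an edge, which a fuller model would need to handle.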


Article 57

Title@2025-06-27 (5): ARAG: Agentic Retrieval Augmented Generation for Personalized Recommendation

Title: ARAG: Agentic Retrieval Augmented Generation for Personalized Recommendation ARAG: Agentische Retrieval Augmented Generation für Personalisierte Empfehlung ARAG:面向个性化推荐的智能体检索增强生成 2506.21931v1

Authors (10): Reza Yousefi Maragheh, Pratheek Vadla, Priyank Gupta, Kai Zhao, Aysenur Inan, Kehui Yao, Jianpeng Xu, Praveen Kanumala, Jason Cho, Sushant Kumar

Retrieval-Augmented Generation (RAG) has shown promise in enhancing recommendation systems by incorporating external context into large language model prompts. However, existing RAG-based approaches often rely on static retrieval heuristics and fail to capture nuanced user preferences in dynamic recommendation scenarios. In this work, we introduce ARAG, an Agentic Retrieval-Augmented Generation framework for Personalized Recommendation, which integrates a multi-agent collaboration mechanism into the RAG pipeline. To better understand the long-term and session behavior of the user, ARAG leverages four specialized LLM-based agents: a User Understanding Agent that summarizes user preferences from long-term and session contexts, a Natural Language Inference (NLI) Agent that evaluates semantic alignment between candidate items retrieved by RAG and inferred intent, a Context Summary Agent that summarizes the findings of the NLI Agent, and an Item Ranker Agent that generates a ranked list of recommendations based on contextual fit. We evaluate ARAG across three datasets. Experimental results demonstrate that ARAG significantly outperforms standard RAG and recency-based baselines, achieving up to 42.1% improvement in NDCG@5 and 35.5% in Hit@5. We also conduct an ablation study to analyze the effect of the different components of ARAG. Our findings highlight the effectiveness of integrating agentic reasoning into retrieval-augmented recommendation and provide new directions for LLM-based personalization.

检索增强生成(RAG)通过将外部上下文纳入大语言模型提示,在增强推荐系统方面展现出潜力。然而,现有基于RAG的方法往往依赖静态检索启发式,难以在动态推荐场景中捕捉细微的用户偏好。在这项工作中,我们提出ARAG,一个面向个性化推荐的智能体检索增强生成框架,它将多智能体协作机制集成到RAG流程中。为了更好地理解用户的长期行为和会话行为,ARAG利用四个基于LLM的专用智能体:用户理解智能体,从长期和会话上下文中总结用户偏好;自然语言推理(NLI)智能体,评估RAG检索到的候选物品与推断意图之间的语义一致性;上下文摘要智能体,总结NLI智能体的结论;以及物品排序智能体,根据上下文契合度生成推荐排序列表。我们在三个数据集上评估ARAG。实验结果表明,ARAG显著优于标准RAG和基于时效性的基线,在NDCG@5上最高提升42.1%,在Hit@5上最高提升35.5%。我们还进行了消融研究,以分析ARAG各组件的作用。我们的结果凸显了将智能体推理融入检索增强推荐的有效性,并为基于LLM的个性化提供了新方向。
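
For readers unfamiliar with the two metrics reported above, the following is a minimal sketch of how Hit@k and NDCG@k are commonly computed. It assumes binary relevance (an item is either relevant or not); the abstract does not state the exact definitions used in the paper, so the function names and this convention are assumptions for illustration only.

```python
import math

def hit_at_k(ranked, relevant, k=5):
    """1.0 if any of the top-k ranked items is relevant, else 0.0."""
    return 1.0 if any(item in relevant for item in ranked[:k]) else 0.0

def ndcg_at_k(ranked, relevant, k=5):
    """Normalized discounted cumulative gain with binary relevance."""
    # Gain of 1 for each relevant item, discounted by log2(rank + 1),
    # where rank is 1-based (so index i maps to log2(i + 2)).
    dcg = sum(
        1.0 / math.log2(i + 2)
        for i, item in enumerate(ranked[:k])
        if item in relevant
    )
    # Ideal ordering places all relevant items at the top of the list.
    ideal = sum(1.0 / math.log2(i + 2) for i in range(min(k, len(relevant))))
    return dcg / ideal if ideal else 0.0
```

Both functions score one recommendation list; a dataset-level figure would typically be the mean over users or sessions.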


Article 58

Title@2025-06-27 (5): Cooperative Bearing-Only Target Pursuit via Multiagent Reinforcement Learning: Design and Experiment

Title: Cooperative Bearing-Only Target Pursuit via Multiagent Reinforcement Learning: Design and Experiment Cooperative Bearing-Only Target Pursuit über Multiagent-Verstärkung Lernen: Design und Experiment 通过多试剂强化学习,仅以合作定点追踪:设计和实验 2503.08740v2

Authors (5): Jianan Li, Zhikun Wang, Susheng Ding, Shiliang Guo, Shiyu Zhao

This paper addresses the multi-robot pursuit problem for an unknown target, encompassing both target state estimation and pursuit control. First, in state estimation, we focus on using only bearing information, as it is readily available from vision sensors and effective for small, distant targets. Challenges such as instability due to the nonlinearity of bearing measurements and singularities in the two-angle representation are addressed through a proposed uniform bearing-only information filter. This filter integrates multiple 3D bearing measurements, provides a concise formulation, and enhances stability and resilience to target loss caused by limited field of view (FoV). Second, in target pursuit control within complex environments, where challenges such as heterogeneity and limited FoV arise, conventional methods like differential games or Voronoi partitioning often prove inadequate. To address these limitations, we propose a novel multiagent reinforcement learning (MARL) framework, enabling multiple heterogeneous vehicles to search, localize, and follow a target while effectively handling those challenges. Third, to bridge the sim-to-real gap, we propose two key techniques: incorporating adjustable low-level control gains in training to replicate the dynamics of real-world autonomous ground vehicles (AGVs), and proposing spectral-normalized RL algorithms to enhance policy smoothness and robustness. Finally, we demonstrate the successful zero-shot transfer of the MARL controllers to AGVs, validating the effectiveness and practical feasibility of our approach. The accompanying video is available at https://youtu.be/HO7FJyZiJ3E.

本文论述对一个未知目标的多机器人追踪问题,包括目标国家估计和追逐控制。首先,在州估计中,我们侧重于仅使用随带信息,因为它很容易从视觉传感器获得,并且对小型、远处目标有效。挑战,如两角代表体中的随身测量和独一性没有线性,因此存在不稳定性,通过拟议的一个单一的只进行随身携带的信息过滤器来解决。这个过滤器将多重三维承载测量器整合在一起,提供一个简明的配方,并增强稳定性和抵御目标损失的能力。第二,在复杂环境中的目标追踪控制中,出现异质性和有限的FOV等挑战往往证明不充分。为了应对这些限制,我们建议采用新的多试剂强化学习框架,使多种混合工具能够搜索、本地化和遵循一个目标,同时有效地应对这些挑战。第三,为了弥合现实的视野差距,我们建议采用两种关键技术:在培训中纳入可调整的低级别控制成果,以复制真实世界的自主定位系统/ARCRRRSl的成功性。我们提议,AVAV的成功性向地面飞行器的成功性转移。
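
To make the bearing-only setting concrete, here is a minimal sketch of a textbook least-squares triangulation from several 3D bearing measurements. It is not the uniform bearing-only information filter proposed in the paper: it is a static, noise-agnostic estimate, and the function name `triangulate_from_bearings` is hypothetical. Each bearing defines a line through the observer, and the estimate is the point minimizing the summed squared distance to all lines.

```python
import numpy as np

def triangulate_from_bearings(positions, bearings):
    """Least-squares target position from observer positions and
    bearing vectors pointing from each observer toward the target."""
    A = np.zeros((3, 3))
    b = np.zeros(3)
    for p, g in zip(positions, bearings):
        g = np.asarray(g, dtype=float)
        g = g / np.linalg.norm(g)           # unit bearing vector
        P = np.eye(3) - np.outer(g, g)      # projector orthogonal to g
        A += P                              # normal equations: (sum P) x
        b += P @ np.asarray(p, dtype=float) #                 = sum P p
    # A is singular when all bearings are parallel (no depth information).
    return np.linalg.solve(A, b)
```

With a single observer, or with parallel bearings, the system is singular, which reflects the fact that range is unobservable from one bearing alone.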


Article 59

Title@2025-06-26 (4): Sequence Modeling for N-Agent Ad Hoc Teamwork

Title: Sequence Modeling for N-Agent Ad Hoc Teamwork Sequenzmodellierung für N-Agent Ad Hoc Teamwork N-代理特设团队工作的序列建模 2506.05527v2

Authors (6): Caroline Wang, Di Yang Shi, Elad Liebman, Ishan Durugkar, Arrasy Rahman, Peter Stone

N-agent ad hoc teamwork (NAHT) is a newly introduced challenge in multi-agent reinforcement learning, where controlled subteams of varying sizes must dynamically collaborate with varying numbers and types of unknown teammates without pre-coordination. The existing learning algorithm (POAM) considers only independent learning for its flexibility in dealing with a changing number of agents. However, independent learning fails to fully capture the inter-agent dynamics essential for effective collaboration. Based on our observation that transformers deal effectively with sequences with varying lengths and have been shown to be highly effective for a variety of machine learning problems, this work introduces a centralized, transformer-based method for N-agent ad hoc teamwork. Our proposed approach incorporates historical observations and actions of all controlled agents, enabling optimal responses to diverse and unseen teammates in partially observable environments. Empirical evaluation on a StarCraft II task demonstrates that MAT-NAHT outperforms POAM, achieving superior sample efficiency and generalization, without auxiliary agent-modeling objectives.

N智能体临时团队协作(NAHT)是多智能体强化学习中新提出的挑战:规模不一的受控子团队必须在没有预先协调的情况下,与数量和类型各异的未知队友动态协作。现有学习算法(POAM)出于应对智能体数量变化的灵活性,只考虑独立学习。然而,独立学习无法充分刻画有效协作所必需的智能体间动态。基于Transformer能有效处理变长序列、且已被证明对多种机器学习问题非常有效这一观察,本文提出一种面向N智能体临时团队协作的集中式、基于Transformer的方法。我们的方法纳入所有受控智能体的历史观测和动作,使其能在部分可观测环境中对多样且未见过的队友作出最优响应。在StarCraft II任务上的实证评估表明,MAT-NAHT优于POAM,在不使用辅助智能体建模目标的情况下取得了更高的样本效率和泛化能力。


Article 60

Title@2025-06-26 (4): xChemAgents: Agentic AI for Explainable Quantum Chemistry

Title: xChemAgents: Agentic AI for Explainable Quantum Chemistry xChemAgenten: Agentische KI für erklärbare Quantenchemie xchemAgents: 可解释量子化学的AAA剂 2505.20574v2

Authors (5): Can Polat, Mehmet Tuncel, Mustafa Kurban, Erchin Serpedin, Hasan Kurban

Recent progress in multimodal graph neural networks has demonstrated that augmenting atomic XYZ geometries with textual chemical descriptors can enhance predictive accuracy across a range of electronic and thermodynamic properties. However, naively appending large sets of heterogeneous descriptors often degrades performance on tasks sensitive to molecular shape or symmetry, and undermines interpretability. xChemAgents proposes a cooperative agent framework that injects physics-aware reasoning into multimodal property prediction. xChemAgents comprises two language-model-based agents: a Selector, which adaptively identifies a sparse, weighted subset of descriptors relevant to each target, and provides a natural language rationale; and a Validator, which enforces physical constraints such as unit consistency and scaling laws through iterative dialogue. On standard benchmark datasets, xChemAgents achieves up to a 22% reduction in mean absolute error over the state-of-the-art baselines, while producing faithful, human-interpretable explanations. Experiment results highlight the potential of cooperative, self-verifying agents to enhance both accuracy and transparency in foundation-model-driven materials science. The implementation and accompanying dataset are available at https://github.com/KurbanIntelligenceLab/xChemAgents.

多式联运图形神经网络的近期进展表明,以文本化学描述器增强原子XYZ的地形特征可以提高一系列电子和热力特性的预测准确性;然而,天真地附加大量不同描述器往往会降低对分子形状或对称敏感的任务的性能,并损害可解释性。 xChemAgents提出一个合作剂框架,将物理觉知推理注入多式联运属性预测。 xChemAgents 提议一个合作剂框架,将物理觉识推入到多式属性预测中。 xChemAgents 由两种语言模型构成的代理物组成:一个选择器,该选择器适应性地识别出与每个目标相关的稀有加权描述器子,并提供自然语言理由;以及一个验证器,通过迭代对话强制实施单位一致性和扩展法律等物理限制。关于标准基准数据集, xchemagenents 实现比最新基准基线的绝对误差高达22%,同时提出忠实、人际的解释。实验结果突出表明合作、自我验证的代理物的潜力,以提高基础建模材料的准确性和透明度。


Article 61

Title@2025-06-26 (4): Will LLMs be Professional at Fund Investment? DeepFund: A Live Arena Perspective

Title: Will LLMs be Professional at Fund Investment? DeepFund: A Live Arena Perspective Werden LLMs Professional bei Fund Investment sein? DeepFund: Eine Live Arena Perspektive LLM能否胜任基金投资?DeepFund:实时竞技场视角 2503.18313v2

Authors (4): Changlun Li, Yao Shi, Yuyu Luo, Nan Tang

Large Language Models (LLMs) have demonstrated impressive capabilities across various domains, but their effectiveness in financial decision-making remains inadequately evaluated. Current benchmarks primarily assess LLMs’ understanding of financial documents rather than their ability to manage assets or uncover trading opportunities in dynamic market conditions. Despite the release of new benchmarks for evaluating diversified tasks in the financial domain, we identify four major problems in these benchmarks: data leakage, navel-gazing, over-intervention, and difficulty of maintenance. To bridge this research gap, we introduce DeepFund, a comprehensive arena platform for evaluating LLM-based trading strategies in a live environment. Our approach implements a multi-agent framework in which LLMs serve in multiple key roles that realize real-world investment decision processes. Moreover, we provide a web interface that visualizes LLMs’ performance with fund investment metrics across different market conditions, enabling detailed comparative analysis. Through DeepFund, we aim to provide a more realistic and fair assessment of LLMs’ capabilities in fund investment, offering diversified insights and revealing their potential applications in real-world financial markets. Our code is publicly available at https://github.com/HKUSTDial/DeepFund.

大型语言模型(LLMS)在各个领域表现出了令人印象深刻的能力,但在金融决策方面的效力仍然没有得到充分的评价。目前的基准主要评估LLMS对金融文件的理解,而不是在活跃的市场条件下管理资产或挖掘贸易机会的能力。尽管为评价金融领域的多样化任务发布了新的基准,但我们查明了这些基准中的四个主要问题,即数据泄漏、收缩、过度干预和维护。为填补研究空白,我们引入了DeepFund,这是一个在现实环境中评价以LLM为基础的贸易战略的全面的舞台平台。我们的方法是实施一个多试办框架,作为实现现实世界投资决策过程的多重关键作用。此外,我们提供了一个网络界面,通过在不同市场条件下的基金投资指标将LLMs的业绩形象化,进行详细的比较分析。我们通过EmepFund,旨在对LM在资金投资方面的能力进行更现实和公正的评估,提供多样化的洞察,并揭示其在现实世界金融市场的潜在应用。我们的代码在https://github.com/HKustDIal/DIFO。


Article 62

Title@2025-06-26 (4): Integrated Multimodal Sensing and Communication: Challenges, Technologies, and Architectures

Title: Integrated Multimodal Sensing and Communication: Challenges, Technologies, and Architectures Integrierte multimodale Sensing und Kommunikation: Herausforderungen, Technologien und Architekturen 综合多式联运和通信:挑战、技术和结构 2506.22507v1

Authors (6): Yubo Peng, Luping Xiang, Kun Yang, Feibo Jiang, Kezhi Wang, Christos Masouros

The evolution towards 6G networks requires the intelligent integration of communication and sensing capabilities to support diverse and complex applications, such as autonomous driving and immersive services. However, existing integrated sensing and communication (ISAC) systems predominantly rely on single-modal sensors as primary participants, which leads to a limited representation of environmental features and significant performance bottlenecks under the emerging requirements of 6G applications. This limitation motivates a paradigm shift from single-modal to multimodal ISAC. In this article, we first analyze the key challenges in realizing multimodal ISAC, including the fusion of heterogeneous multimodal data, the high communication overhead among distributed sensors, and the design of efficient and scalable system architectures. We then introduce several enabling technologies, such as large AI models, semantic communication, and multi-agent systems, that hold promise for addressing these challenges. To operationalize these technologies, we zoom into three architectural paradigms: fusion-based multimodal ISAC (F-MAC), interaction-based multimodal ISAC (I-MAC), and relay-based multimodal ISAC (R-MAC), each tailored to organize devices and modalities for efficient collaboration in different scenarios. Thereafter, a case study is presented based on the F-MAC scheme, demonstrating that the scheme achieves more comprehensive sensing and improves sensing accuracy by approximately 80% compared to conventional single-modal ISAC systems. Finally, we discuss several open issues to be addressed in the future.

向6G网络的演变要求明智地整合通信和遥感能力,以支持多种复杂应用,如自主驱动和沉浸式服务;然而,现有的综合遥感和通信系统(ISAC)主要依赖单一式传感器作为主要参与者,这导致环境特征的代表性有限,6G应用新要求下的重大绩效瓶颈;这一限制促使从单一模式转向多式ISAC的范式转变;在本条中,我们首先分析实现多式ISAC的主要挑战,包括多种多式联运数据的融合、分布式传感器之间的高通信间接费用以及高效和可扩缩的系统结构的设计;然后,我们引入一些有利的技术,例如大型AI模型、语义通信和多剂系统,这些技术对于应对这些挑战很有希望;为了实施这些技术,我们将规模化成三个建筑范式:基于单一模式的ISAC(F-MAC)、基于互动的多式联运ISAC(I-MAC)和基于中继式的多式ISAC(R-MAC),每个都适合在不同情景下组织高效合作的装置和模式;然后,我们引入了几个有利的技术,例如大型AI模型、语义性通信和多剂系统。