cs.PL @ 2025-06-27: 022
-
00 06-26 (4) Large Language Model-Powered Agent for C to Rust Code Translation Large Language Model-Powered Agent für C to Rust Code Übersetzung C至Rust 代码翻译的大型语言示范授权代理 2505.15858v2 -
01 06-25 (3) PhasePoly: An Optimization Framework forPhase Polynomials in Quantum Circuits PhasePoly: Ein Optimierungsrahmen fürphasige Polynome in Quantenkreisläufen 阶段性:量子电路阶段多元性优化框架 2506.20624v1 -
02 06-25 Deadlock-free Context-free Session Types Deadlock-freie Kontext-freie Sitzungstypen 无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无 2506.20356v1 -
03 06-25 Unfolding Iterators: Specification and Verification of Higher-Order Iterators, in OCaml Entfaltungs-Iteratoren: Spezifikation und Überprüfung von höher angeordneten Iteratoren, in OCaml 卸载迭代器: OCaml 中高分迭代器的规格和核查 2506.20310v1 -
04 06-25 Dynamic Race Detection With O(1) Samples Dynamische Race-Erkennung mit O(1)-Proben 有O(1)样本的动态竞赛探测样本 2506.20127v1 -
05 06-24 (2) QHackBench: Benchmarking Large Language Models for Quantum Code Generation Using PennyLane Hackathon Challenges QHackBench: Benchmarking großer Sprachmodelle für die Quantencode-Generation mit PennyLane Hackathon-Herausforderungen QHackBench:利用PennyLane Hackathon挑战为量制代码生成量设定大语言模式基准 2506.20008v1 -
06 06-24 Free Doubly-Infinitary Distributive Categories are Cartesian Closed Kostenlose Doppel-Unendliche Verteilung Kategorien sind kartesisch geschlossen 免费Doubly-Infinite分配类别为笛卡尔人封闭类别 2403.10447v6 -
07 06-24 A Certified Proof Checker for Deep Neural Network Verification in Imandra Ein Certified Proof Checker für Deep Neural Network Verification in Imandra Imandra 深神经网络核查认证校验器 2405.10611v2 -
08 06-24 A Sound Type System for Secure Currency Flow Ein Sound Type System für sicheren Währungsfluss 安全货币流动的健全类型系统 2405.12976v2 -
09 06-24 The Autonomous Data Language – Concepts, Design and Formal Verification Die autonome Datensprache – Konzepte, Design und formale Überprüfung 自主数据语言 – – 概念、设计和正式核查 2506.19457v1 -
10 06-24 The Elements of Differentiable Programming Die Elemente der differenzierbaren Programmierung 不同方案拟订要素 2403.14606v3 -
11 06-23 (1) AutoPDL: Automatic Prompt Optimization for LLM Agents AutoPDL: Automatische Prompt-Optimierung für LLM-Agenten AAUPDL:LLM代理器自动快速优化 2504.04365v2 -
12 06-23 Piloting Copilot, Codex, and StarCoder2: Hot Temperature, Cold Prompts, or Black Magic? Pilotieren von Copilot, Codex und StarCoder2: Heiße Temperatur, kalte Prompts oder schwarze Magie? 联合飞行员 代码代码和星际代码2: 热温、冷感或黑魔法? 2210.14699v3 -
13 06-21 (6) HPVM-HDC: A Heterogeneous Programming System for Accelerating Hyperdimensional Computing HPVM-HDC: Ein heterogenes Programmiersystem zur Beschleunigung des Hyperdimensionalen Rechnens HPVM-HDC:加速超多维电子化的多样化规划系统 2410.15179v3 -
14 06-21 ASC-Hook: fast and transparent system call hook for Arm ASC-Hook: Schneller und transparenter Systemrufhaken für Arm ASC-Hook: 快速和透明的系统调用武器钩 2412.05784v4 -
15 06-20 (5) $Δ$-Nets: Interaction-Based System for Optimal Parallel $λ$-Reduction $Δ$-Nets: Interaktionsbasiertes System für eine optimale parallele $λ$-Reduktion \(-净额:最佳平行互动系统\)$美元-削减 2505.20314v3 -
16 06-20 Low Overhead Allocation Sampling in a Garbage Collected Virtual Machine Low Overhead Allocation Probenahme in einem Garbage gesammelte virtuelle Maschine 在垃圾收集的虚拟机器中进行低间接费用分配抽样 2506.16883v1 -
17 06-20 CodeV-R1: Reasoning-Enhanced Verilog Generation CodeV-R1: Grundlegende Verilog-Generierung 代码V-R1:有理性的增强性性性性性性性生殖器生成 2505.24183v2 -
18 06-20 Zoozve: A Strip-Mining-Free RISC-V Vector Extension with Arbitrary Register Grouping Compilation Support (WIP) Zoozve: Eine Strip-Mining-freie RISC-V-Vektorerweiterung mit arbitrary Register-Gruppierungsunterstützung (WIP) Zooozve:利用任意登记册组汇编支助,扩大一个带分解自由的RISC-V矢量扩展区(WIP) 2504.15678v2 -
19 06-19 (4) ChatDBG: Augmenting Debugging with Large Language Models ChatDBG: Augmenting Debugging mit großen Sprachmodellen 聊天DBG: 使用大语言模式加强调试 2403.16354v5 -
20 06-19 Floating-Point Neural Networks Are Provably Robust Universal Approximators Floating-Point-Neural-Netzwerke sind wahrscheinlich robuste Universal-Annäherung 浮动点神经网络具有可可预见强健的通用通用近似器 2506.16065v1 -
21 06-19 WAMI: Compilation to WebAssembly through MLIR without Losing Abstraction WAMI: Kompilation zu WebAssembly durch MLIR ohne Abstraktion zu verlieren WAMI:通过MLIR通过网络大会汇编,不失抽象 2506.16048v1
Article 0
Title@2025-06-26 (4): Large Language Model-Powered Agent for C to Rust Code Translation
Title: Large Language Model-Powered Agent for C to Rust Code Translation | Large Language Model-Powered Agent für C to Rust Code Übersetzung | C至Rust 代码翻译的大型语言示范授权代理 2505.15858v2 |
Authors (6): HoHyun Sim, Hyeonjoong Cho, Yeonghyeon Go, Zhoulai Fu, Ali Shokri, Binoy Ravindran
The C programming language has been foundational in building system-level software. However, its manual memory management model frequently leads to memory safety issues. In response, a modern system programming language, Rust, has emerged as a memory-safe alternative. Moreover, automating the C-to-Rust translation empowered by the rapid advancements of the generative capabilities of LLMs is gaining growing interest for large volumes of legacy C code. Despite some success, existing LLM-based approaches have constrained the role of LLMs to static prompt-response behavior and have not explored their agentic problem-solving capability. Applying the LLM agentic capability for the C-to-Rust translation introduces distinct challenges, as this task differs from the traditional LLM agent applications, such as math or commonsense QA domains. First, the scarcity of parallel C-to-Rust datasets hinders the retrieval of suitable code translation exemplars for in-context learning. Second, unlike math or commonsense QA, the intermediate steps required for C-to-Rust are not well-defined. Third, it remains unclear how to organize and cascade these intermediate steps to construct a correct translation trajectory. To address these challenges in the C-to-Rust translation, we propose a novel intermediate step, the Virtual Fuzzing-based equivalence Test (VFT), and an agentic planning framework, the LLM-powered Agent for C-to-Rust code translation (LAC2R). The VFT guides LLMs to identify input arguments that induce divergent behaviors between an original C function and its Rust counterpart and to generate informative diagnoses to refine the unsafe Rust code. LAC2R uses the MCTS to systematically organize the LLM-induced intermediate steps for correct translation. We experimentally demonstrated that LAC2R effectively conducts C-to-Rust translation on large-scale, real-world benchmarks.
C编程语言是建立系统级软件的基础语言。然而,其人工内存管理模式经常导致记忆安全问题。作为回应,现代系统编程语言Rust(Rust)已成为一种耐记忆的替代方案。此外,由于LLM的基因化能力迅速提高,使得C-Rust翻译自动化起来。尽管取得了一些成功,但基于LLMst的现有方法限制了LLM(LLM)的作用,使其成了静态的快速反应行为,而没有探索其中间解决问题的能力。在C-Rst翻译中应用LLM(LM)代理能力,带来了不同的挑战,因为这一任务不同于传统的LLM代理应用程序,例如数学或普通QA域。首先,平行C-Rust数据集的缺乏,阻碍了适当的C(LM)代码翻译的检索。第二,与数学或普通QA(Commerical-R)的QA(C-RM),为C-RM(LM)的原始-RM(LM-R-RM)的解算法解算的中间步骤没有很好地界定。第三,它仍然不清楚如何组织和不断组织和升级的翻译。
Article 1
Title@2025-06-25 (3): PhasePoly: An Optimization Framework forPhase Polynomials in Quantum Circuits
Title: PhasePoly: An Optimization Framework forPhase Polynomials in Quantum Circuits | PhasePoly: Ein Optimierungsrahmen fürphasige Polynome in Quantenkreisläufen | 阶段性:量子电路阶段多元性优化框架 2506.20624v1 |
Authors (9): Zihan Chen, Henry Chen, Yuwei Jin, Minghao Guo, Enhyeok Jang, Jiakang Li, Caitlin Chan, Won Woo Ro, Eddy Z. Zhang
Quantum computing has transformative computational power to make classically intractable computing feasible. As the algorithms that achieve practical quantum advantage are beyond manual tuning, quantum circuit optimization has become extremely important and integrated into today’s quantum software stack. This paper focuses on a critical type of quantum circuit optimization – phase-polynomial optimization. Phase polynomials represents a class of building-block circuits that appear frequently in quantum modular exponentials (the most time-consuming component in Shor’s factoring algorithm), in quantum approximation optimization algorithms (QAOA), and in Hamiltonian simulations. Compared to prior work on phase polynomials, we focus more on the impact of phase polynomial synthesis in the context of whole-circuit optimization, from single-block phase polynomials to multiple block phase polynomials, from greedy equivalent sub-circuit replacement strategies to a systematic parity matrix optimization approach, and from hardware-oblivious logical circuit optimization to hardware-friendly logical circuit optimization. We also provide a utility of our phase polynomial optimization framework to generate hardware-friendly building blocks. Our experiments demonstrate improvements of up to 50%-with an average total gate reduction of 34.92%-and reductions in the CNOT gate count of up to 48.57%, averaging 28.53%, for logical circuits. Additionally, for physical circuits, we achieve up to 47.65% CNOT gate reduction with an average reduction of 25.47% across a representative set of important benchmarks.
量子计算具有变革性的计算能力, 使古老的棘手计算变得可行。 由于实现实际量子优势的算法超出了手工调整的范围, 量子电路优化已经变得极为重要, 并被纳入今天的量子软件堆。 本文侧重于量子电路优化的关键类型 – – 阶段- 分球优化。 阶段多元分子代表了一组建筑块电路, 其表现经常出现在量子模块指数( Shor的保理算法中最耗时的成分)、 量子近似优化算法(QAOAA) 和汉密尔顿模拟中。 与以前在多环形模型中的工作相比, 量子电路优化已经变得极其重要, 并被纳入今天的量子软件堆。 从单块电路优化到多个区段的聚相优化。 从贪婪的次电路替换策略到系统的平价矩阵优化, 从硬件模糊的逻辑电路流优化到硬件友好逻辑精度精度优化65 逻辑优化(QAAAAAAAAAAAA) 我们还利用了我们的阶段多式优化框架框架框架框架, 创建对硬件友好重要建筑块建筑块进行硬件友好分析。 我们实验在5057%的大门中, 将平均削减了50% 将平均削减, 削减到平均削减了25%的门, , 削减, 将达到平均削减了25%
Article 2
Title@2025-06-25 (3): Deadlock-free Context-free Session Types
Title: Deadlock-free Context-free Session Types | Deadlock-freie Kontext-freie Sitzungstypen | 无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无无 2506.20356v1 |
Authors (2): Andreia Mordido, Jorge A. Pérez
We consider the problem of statically ensuring that message-passing programs never run into deadlocks. We focus on concurrent functional programs governed by context-free session types, which can express rich tree-like structures not expressible in standard session types. Existing techniques based on context-free session types enforce protocol conformance but not deadlock freedom. We propose a type system that enhances context-free session types with a priority-based approach to deadlock freedom, considering polymorphic and recursive types. Interestingly, the notions needed for avoiding deadlocks fit nicely into this expressive setting. We prove that well-typed programs respect their protocols and never run into deadlocks at run-time; we also demonstrate the expressiveness gains with respect to prior work by means of examples.
我们考虑了静态地确保信息传递程序永远不会陷入僵局的问题。我们侧重于由无背景会议类型管理的并行功能方案,这些方案可以表达在标准会议类型中无法体现的丰富树型结构。基于无背景会议类型的现有技术可以强制执行协议的合规性,而不是陷入僵局自由。我们提出了一个类型的系统,用基于优先事项的方法加强无背景会议类型,解决僵局自由,考虑到多种变化和循环类型。有趣的是,避免僵局所需的概念适合这种表达式环境。我们证明,完善型方案尊重其协议,在运行时从未陷入僵局;我们还以实例的方式展示了先前工作在明确性方面的成果。
Article 3
Title@2025-06-25 (3): Unfolding Iterators: Specification and Verification of Higher-Order Iterators, in OCaml
Title: Unfolding Iterators: Specification and Verification of Higher-Order Iterators, in OCaml | Entfaltungs-Iteratoren: Spezifikation und Überprüfung von höher angeordneten Iteratoren, in OCaml | 卸载迭代器: OCaml 中高分迭代器的规格和核查 2506.20310v1 |
Authors (2): Ion Chirica, Mário Pereira
Albeit being a central notion of every programming language, formally and modularly reasoning about iteration proves itself to be a non-trivial feat, specially in the context of higher-order iteration. In this paper, we present a generic approach to the specification and deductive verification of higher-order iterators, written in the OCaml language. Our methodology follows two key principles: first, the usage of the Gospel specification language to describe the general behaviour of any iteration schema; second, the usage of the Cameleer framework to deductively verify that every iteration client is correct with respect to its logical specification. To validate our approach we develop a set of verified case studies, ranging from classic list iterators to graph algorithms implemented in the widely used OCamlGraph library.
尽管这是每种编程语言的中心概念,但关于迭代的正规和模块化推理本身证明是一种非三重性表演,特别是在较高顺序迭代的背景下。我们在本文件中提出了对以OCaml语言书写的较高顺序迭代器的规格和推算核查的通用方法。我们的方法遵循两个主要原则:第一,使用福音规格语言来描述任何迭代系统的一般行为;第二,使用卡默勒框架来推断每个迭代客户在逻辑规格方面是否正确。为了验证我们的方法,我们开发了一套经过核实的案例研究,从经典列表迭代器到在广泛使用的OCamlGraph图书馆中实施的图表算法。
Article 4
Title@2025-06-25 (3): Dynamic Race Detection With O(1) Samples
Title: Dynamic Race Detection With O(1) Samples | Dynamische Race-Erkennung mit O(1)-Proben | 有O(1)样本的动态竞赛探测样本 2506.20127v1 |
Authors (4): Mosaad Al Thokair, Minjian Zhang, Umang Mathur, Mahesh Viswanathan
Happens before-based dynamic analysis is the go-to technique for detecting data races in large scale software projects due to the absence of false positive reports. However, such analyses are expensive since they employ expensive vector clock updates at each event, rendering them usable only for in-house testing. In this paper, we present a sampling-based, randomized race detector that processes only constantly many events of the input trace even in the worst case. This is the first sub-linear time (i.e., running in o(n) time where n is the length of the trace) dynamic race detection algorithm; previous sampling based approaches like Pacer run in linear time (i.e., O(n)). Our algorithm is a property tester for HB-race detection – it is sound in that it never reports any false positive, and on traces that are far, with respect to hamming distance, from any race-free trace, the algorithm detects an HB-race with high probability. Our experimental evaluation of the algorithm and its comparison with state-of-the-art deterministic and sampling based race detectors shows that the algorithm does indeed have significantly low running time, and detects races quite often.
之前发生的动态分析是发现大型软件项目数据竞赛的技术,因为没有虚假的正面报告。然而,这种分析费用昂贵,因为每次活动都采用昂贵的矢量时钟更新,只能用于内部测试。在本文中,我们提出了一个基于抽样的随机随机的种族探测器,该探测器即使在最坏的情况下也只能不断地处理许多输入痕量的事件。这是第一个次线性时间(即运行时间在o(n)时间里,是追踪时间的长度)动态种族探测算法;以前的基于取样的方法,如Pacer在线性时间(即O(n)时间运行)。我们的算法是一个用于HB-色谱检测的财产测试器,它从不报告任何虚假的正数,在距离方面,从任何无种族的踪迹来看,算法从任何无种族的踪迹中,探测出一个HB-种族的高度概率。我们对算法的实验性评估及其与州-艺术确定性和基于抽样的种族探测器的比较表明,算法确实非常低的时间和种族的测算法。
Article 5
Title@2025-06-24 (2): QHackBench: Benchmarking Large Language Models for Quantum Code Generation Using PennyLane Hackathon Challenges
Title: QHackBench: Benchmarking Large Language Models for Quantum Code Generation Using PennyLane Hackathon Challenges | QHackBench: Benchmarking großer Sprachmodelle für die Quantencode-Generation mit PennyLane Hackathon-Herausforderungen | QHackBench:利用PennyLane Hackathon挑战为量制代码生成量设定大语言模式基准 2506.20008v1 |
Authors (7): Abdul Basit, Minghao Shao, Haider Asif, Nouhaila Innan, Muhammad Kashif, Alberto Marchisio, Muhammad Shafique
Recent advances in Large Language Models (LLMs) have demonstrated strong potential in code generation, yet their effectiveness in quantum computing remains underexplored. This paper benchmarks LLMs for PennyLane-based quantum code generation using real-world challenges from the Quantum Hackathon (QHack). We introduce QHackBench, a novel benchmark dataset derived from QHack competitions, and evaluate model performance under vanilla prompting and Retrieval-Augmented Generation (RAG). Our structured evaluation framework assesses functional correctness, syntactic validity, and execution success across varying challenge difficulties. Results indicate that RAG-enhanced models, supplemented with an augmented PennyLane dataset, approximately generate similar results as the standard prompting, particularly in complex quantum algorithms. Additionally, we introduce a multi-agent evaluation pipeline that iteratively refines incorrect solutions, further enhancing execution success rates. To foster further research, we commit to publicly releasing QHackBench, along with our evaluation framework and experimental results, enabling continued advancements in AI-assisted quantum programming.
大语言模型(LLMS)最近的进展表明,在代码生成方面有巨大的潜力,但其量子计算方面的效力仍未得到充分探讨。本文基准了利用Qantum Hackathon(QHack)(Quantum Hackathon)(Qantum Hackathon)(QHackbench)(QHackBench)(QHackBench)(QHackBench)(QHack Bench)(QHackBench)(QHackBench)(QHack Bench)(QHackBench)(QHackBench)(QHackBench)(QHack Bench)(QHack Bench)(QHack Bench)(LLLLLM)(LLLM)(LLLLLM)(LLLM)(LLLLLM)(LM)(LM)(LLLLLLM)(LLM)(LLLLM)(LM)(LLLLLM)(LLLM)(LM)(LM)(LLLLM)(LLLLM)(LLM(LM)(LM)(DM)(LLLLLM)(DM)(LM)(DM)(DM)(LLM)(LLLLLM)(LLLLLLLLLM)(LLLLM)(LLLLLLLM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LLLLLLLLLM)(LM)(LM)(LM)(LM)(LM)(LLLM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)(LM)
Article 6
Title@2025-06-24 (2): Free Doubly-Infinitary Distributive Categories are Cartesian Closed
Title: Free Doubly-Infinitary Distributive Categories are Cartesian Closed | Kostenlose Doppel-Unendliche Verteilung Kategorien sind kartesisch geschlossen | 免费Doubly-Infinite分配类别为笛卡尔人封闭类别 2403.10447v6 |
Authors (2): Fernando Lucatelli Nunes, Matthijs Vákár
We investigate categories in which products distribute over coproducts, a structure we call doubly-infinitary distributive categories. Through a range of examples, we explore how this notion relates to established concepts such as extensivity, infinitary distributivity, and cartesian closedness. We show that doubly-infinitary distributivity strictly strengthens the classical notion of infinitary distributivity. Moreover, we prove that free doubly-infinitary distributive categories are cartesian closed, unlike free distributive categories. The paper concludes with observations on non-canonical isomorphisms, alongside open questions and directions for future research.
我们调查了产品在共同产品上分配的类别,我们称之为双重至无穷分配的类别。我们通过一系列实例探索了这一概念与诸如扩展、无穷分配和碳水化合物封闭等既定概念的关系。我们表明,双重至无穷分配严格地强化了无穷分配的经典概念。此外,我们证明,免费的双重至无穷分配的类别是封闭的,不同于免费分配的类别。文件最后提出了关于非卡尼氏异形态的观察意见,并提出了供未来研究的开放问题和方向。
Article 7
Title@2025-06-24 (2): A Certified Proof Checker for Deep Neural Network Verification in Imandra
Title: A Certified Proof Checker for Deep Neural Network Verification in Imandra | Ein Certified Proof Checker für Deep Neural Network Verification in Imandra | Imandra 深神经网络核查认证校验器 2405.10611v2 |
Authors (6): Remi Desmartin, Omri Isac, Grant Passmore, Ekaterina Komendantskaya, Kathrin Stark, Guy Katz
Recent advances in the verification of deep neural networks (DNNs) have opened the way for a broader usage of DNN verification technology in many application areas, including safety-critical ones. However, DNN verifiers are themselves complex programs that have been shown to be susceptible to errors and numerical imprecision; this, in turn, has raised the question of trust in DNN verifiers. One prominent attempt to address this issue is enhancing DNN verifiers with the capability of producing certificates of their results that are subject to independent algorithmic checking. While formulations of Marabou certificate checking already exist on top of the state-of-the-art DNN verifier Marabou, they are implemented in C++, and that code itself raises the question of trust (e.g., in the precision of floating point calculations or guarantees for implementation soundness). Here, we present an alternative implementation of the Marabou certificate checking in Imandra – an industrial functional programming language and an interactive theorem prover (ITP) – that allows us to obtain full proof of certificate correctness. The significance of the result is two-fold. Firstly, it gives stronger independent guarantees for Marabou proofs. Secondly, it opens the way for the wider adoption of DNN verifiers in interactive theorem proving in the same way as many ITPs already incorporate SMT solvers.
最近对深神经网络(DNN)的核查进展为在许多应用领域更广泛地使用DNN核查技术开辟了道路,包括安全临界领域。然而,DNN核查员本身是复杂的程序,已经证明容易出错和数字不精确;这反过来又提出了对DNN核查员的信任问题。解决这一问题的一个显著尝试是提高DNN核查员的能力,使其能够产生其结果的证书,并接受独立算法检查。虽然马拉布证书检查的配方已经存在于最先进的DNNN核查员Marabou之上,但在C+++执行,而且该编码本身也提出了信任问题(例如,浮动点计算精确性或保证执行的正确性)。在这里,我们提出了在Imandra(一种工业功能性编程语言和交互式标本证明(ITP))中进行马拉布证书检查的另一种备选做法,使我们能够获得证书正确性的充分证明。结果在C+++中具有双重意义,而该结果本身也提出了信任问题(例如,在浮动点计算或保证执行的精确性保证方面)在Imandregrom 中为许多互动证据打开。
Article 8
Title@2025-06-24 (2): A Sound Type System for Secure Currency Flow
Title: A Sound Type System for Secure Currency Flow | Ein Sound Type System für sicheren Währungsfluss | 安全货币流动的健全类型系统 2405.12976v2 |
Authors (3): Luca Aceto, Daniele Gorla, Stian Lybech
In this paper we focus on TinySol, a minimal calculus for Solidity smart contracts, introduced by Bartoletti et al. We start by rephrasing its syntax (to emphasise its object-oriented flavour) and give a new big-step operational semantics. We then use it to define two security properties, namely call integrity and noninterference. These two properties have some similarities in their definition, in that they both require that some part of a program is not influenced by the other part. However, we show that the two properties are actually incomparable. Nevertheless, we provide a type system for noninterference and show that well-typed programs satisfy call integrity as well; hence, programs that are accepted by our type system satisfy both properties. We finally discuss the practical usability of the type system and its limitations by means of some simple examples.
在本文中,我们关注的焦点是TinySol, 这是Bartoletti等人介绍的“团结”智能合同的一个微小微积分。 我们首先修改其语法(强调其面向对象的口味), 并给出一个新的大步操作语义。 然后我们用它来定义两个安全属性, 即“ 完整性” 和“ 不干涉” 。 这两种属性在定义上有一些相似之处, 因为它们都要求程序的某些部分不受另一部分的影响 。 然而, 我们表明这两个属性实际上是无法比较的。 然而, 我们提供了一个不干预类型系统, 并且显示, 类型精良的程序满足了要求的完整性; 因此, 我们类型系统所接受的程序既满足了两种属性。 我们最后通过一些简单的例子来讨论类型系统的实用性及其局限性。
Article 9
Title@2025-06-24 (2): The Autonomous Data Language – Concepts, Design and Formal Verification
Title: The Autonomous Data Language – Concepts, Design and Formal Verification | Die autonome Datensprache – Konzepte, Design und formale Überprüfung | 自主数据语言 – – 概念、设计和正式核查 2506.19457v1 |
Authors (3): Tom T. P. Franken, Thomas Neele, Jan Friso Groote
Nowadays, the main advances in computational power are due to parallelism. However, most parallel languages have been designed with a focus on processors and threads. This makes dealing with data and memory in programs hard, which distances the implementation from its original algorithm. We propose a new paradigm for parallel programming, the data-autonomous paradigm, where computation is performed by autonomous data elements. Programs in this paradigm are focused on making the data collaborate in a highly parallel fashion. We furthermore present AuDaLa, the first data autonomous programming language, and provide a full formalisation that includes a type system and operational semantics. Programming in AuDaLa is very natural, as illustrated by examples, albeit in a style very different from sequential and contemporary parallel programming. Additionally, it lends itself for the formal verification of parallel programs, which we demonstrate.
目前,计算能力的主要进步是平行的。然而,大多数平行语言的设计重点是处理器和线条。这使得处理程序的数据和记忆变得困难,使执行与原始算法相去甚远。我们提出了平行程序的新模式,即数据自主模式,即由自主数据元素进行计算。这个模式中的方案侧重于使数据以高度平行的方式协作。我们还介绍了第一种数据自主程序语言AuDaLa,这是第一个数据自主程序语言,并提供了一种包括类型系统和操作语义的全面正规化。AuDaLa的编程非常自然,例如实例所展示的,尽管其风格与顺序和当代平行程序非常不同。此外,它也有助于对平行程序进行正式核查,我们展示了这一点。
Article 10
Title@2025-06-24 (2): The Elements of Differentiable Programming
Title: The Elements of Differentiable Programming | Die Elemente der differenzierbaren Programmierung | 不同方案拟订要素 2403.14606v3 |
Authors (2): Mathieu Blondel, Vincent Roulet
Artificial intelligence has recently experienced remarkable advances, fueled by large models, vast datasets, accelerated hardware, and, last but not least, the transformative power of differentiable programming. This new programming paradigm enables end-to-end differentiation of complex computer programs (including those with control flows and data structures), making gradient-based optimization of program parameters possible. As an emerging paradigm, differentiable programming builds upon several areas of computer science and applied mathematics, including automatic differentiation, graphical models, optimization and statistics. This book presents a comprehensive review of the fundamental concepts useful for differentiable programming. We adopt two main perspectives, that of optimization and that of probability, with clear analogies between the two. Differentiable programming is not merely the differentiation of programs, but also the thoughtful design of programs intended for differentiation. By making programs differentiable, we inherently introduce probability distributions over their execution, providing a means to quantify the uncertainty associated with program outputs.
人工智能最近取得了显著的进步,这得益于大型模型、庞大的数据集、加速硬件,以及最后但并非最不重要的、不同编程的变革力量。这个新的编程模式使得复杂的计算机程序(包括具有控制流程和数据结构的计算机程序)最终到终端的差别化成为可能,使基于梯度的程序参数优化成为可能。作为一个新兴模式,不同的编程建立在计算机科学和应用数学的若干领域之上,包括自动区分、图形模型、优化和统计。这本书全面审查了可用于不同编程的基本概念。我们采用了两种主要观点,即优化和概率,在两种方案之间有明确的相似之处。可区分的编程不仅仅是程序的不同,而且还是旨在区分程序的深思熟虑设计。通过使方案具有差异性,我们必然引入了执行这些方案的概率分布,为量化与方案产出相关的不确定性提供了一种手段。
Article 11
Title@2025-06-23 (1): AutoPDL: Automatic Prompt Optimization for LLM Agents
Title: AutoPDL: Automatic Prompt Optimization for LLM Agents | AutoPDL: Automatische Prompt-Optimierung für LLM-Agenten | AAUPDL:LLM代理器自动快速优化 2504.04365v2 |
Authors (4): Claudio Spiess, Mandana Vaziri, Louis Mandel, Martin Hirzel
The performance of large language models (LLMs) depends on how they are prompted, with choices spanning both the high-level prompting pattern (e.g., Zero-Shot, CoT, ReAct, ReWOO) and the specific prompt content (instructions and few-shot demonstrations). Manually tuning this combination is tedious, error-prone, and specific to a given LLM and task. Therefore, this paper proposes AutoPDL, an automated approach to discovering good LLM agent configurations. Our approach frames this as a structured AutoML problem over a combinatorial space of agentic and non-agentic prompting patterns and demonstrations, using successive halving to efficiently navigate this space. We introduce a library implementing common prompting patterns using the PDL prompt programming language. AutoPDL solutions are human-readable, editable, and executable PDL programs that use this library. This approach also enables source-to-source optimization, allowing human-in-the-loop refinement and reuse. Evaluations across three tasks and seven LLMs (ranging from 3B to 70B parameters) show consistent accuracy gains ($9.06\pm15.3$ percentage points), up to 68.9pp, and reveal that selected prompting strategies vary across models and tasks.
大型语言模型(LLMS)的性能取决于它们是如何推动的,其选择范围既包括高层次的促动模式(例如零热、Cot、ReAct、ReWOO),也包括具体的快速内容(指令和短片演示),人工调整这种组合是乏味的、容易出错的,并且是特定LLM和任务所特有的。因此,本文件提议AutopDL,这是一个发现良好的LLM代理配置的自动化方法。我们的方法将这一问题作为结构化的AutML问题,它覆盖在代理人和非代理人的促动模式和演示的组合空间中,使用连续的减半来高效率地浏览这个空间。我们引入了一个图书馆,使用PDL快速编程语言来实施共同的提示模式。AutPDL的解决方案是人类可读、可编辑和可执行的PDL程序,使用这个图书馆。这个方法还有助于源源到源的优化,允许在操作中改进和再利用。在三个任务和七个LMSMS(从3美元到70美元参数)的组合中,我们引入了一致的精确度(9.06)战略。
Article 12
Title@2025-06-23 (1): Piloting Copilot, Codex, and StarCoder2: Hot Temperature, Cold Prompts, or Black Magic?
Title: Piloting Copilot, Codex, and StarCoder2: Hot Temperature, Cold Prompts, or Black Magic? | Pilotieren von Copilot, Codex und StarCoder2: Heiße Temperatur, kalte Prompts oder schwarze Magie? | 联合飞行员 代码代码和星际代码2: 热温、冷感或黑魔法? 2210.14699v3 |
Authors (5): Jean-Baptiste Döderlein, Nguessan Hermann Kouadio, Mathieu Acher, Djamel Eddine Khelladi, Benoit Combemale
Language models are promising solutions for tackling increasing complex problems. In software engineering, they recently gained attention in code assistants, which generate programs from a natural language task description (prompt). They have the potential to save time and effort but remain poorly understood, limiting their optimal use. In this article, we investigate the impact of input variations on two configurations of a language model, focusing on parameters such as task description, surrounding context, model creativity, and the number of generated solutions. We design specific operators to modify these inputs and apply them to three LLM-based code assistants (Copilot, Codex, StarCoder2) and two benchmarks representing algorithmic problems (HumanEval, LeetCode). Our study examines whether these variations significantly affect program quality and how these effects generalize across models. Our results show that varying input parameters can greatly improve performance, achieving up to 79.27% success in one-shot generation compared to 22.44% for Codex and 31.1% for Copilot in default settings. Actioning this potential in practice is challenging due to the complex interplay in our study - the optimal settings for temperature, prompt, and number of generated solutions vary by problem. Reproducing our study with StarCoder2 confirms these findings, indicating they are not model-specific. We also uncover surprising behaviors (e.g., fully removing the prompt can be effective), revealing model brittleness and areas for improvement.
语言模型是解决日益复杂的问题的有希望的解决方案。在软件工程中,它们最近得到了代码助理的注意,代码助理从自然语言任务描述(即速)中生成了程序。它们具有节省时间和精力的潜力,但是仍然无法很好地理解,限制了它们的最佳使用。在本条中,我们调查了投入差异对语言模型两种配置的影响,侧重于任务描述、周围环境、模型创造力和生成的解决方案的数量等参数。我们设计了具体操作员来修改这些投入,并将其应用到三个基于LLLM的代码助理(Coitol、codex、StarCoder2)和两个代表算法问题的基准(HumanEval、LeetCode)。我们的研究审查了这些差异是否显著地影响方案质量和这些影响如何贯穿各种模型。我们的研究结果表明,不同的投入参数可以大大改善业绩,在一集中达到79.27%的成功率,而在代码x为22.44 %,在默认环境下为Codor 模型设计为31.1%。由于我们的研究中复杂的相互作用,这种潜力在实际中的行动具有挑战性――温度、迅速和生成的解决方案的最佳环境和数量因问题而不同而变化。我们的研究,我们用的是:用StarderStarder2 正在充分地展示这些研究。我们的研究,我们的研究可以确认这些研究。我们的研究,要揭示这些结果。我们的研究,我们的行为可以迅速地表明这些结果。
Article 13
Title@2025-06-21 (6): HPVM-HDC: A Heterogeneous Programming System for Accelerating Hyperdimensional Computing
Title: HPVM-HDC: A Heterogeneous Programming System for Accelerating Hyperdimensional Computing | HPVM-HDC: Ein heterogenes Programmiersystem zur Beschleunigung des Hyperdimensionalen Rechnens | HPVM-HDC:加速超多维电子化的多样化规划系统 2410.15179v3 |
Authors (10): Russel Arbore, Xavier Routh, Abdul Rafae Noor, Akash Kothari, Haichao Yang, Weihong Xu, Sumukh Pinge, Minxuan Zhou, Tajana Rosing, Vikram Adve
Hyperdimensional Computing (HDC), a technique inspired by cognitive models of computation, has been proposed as an efficient and robust alternative basis for machine learning. HDC programs are often manually written in low-level and target specific languages targeting CPUs, GPUs, and FPGAs – these codes cannot be easily retargeted onto HDC-specific accelerators. No previous programming system enables productive development of HDC programs and generates efficient code for several hardware targets. We propose a heterogeneous programming system for HDC: a novel programming language, HDC++, for writing applications using a unified programming model, including HDC-specific primitives to improve programmability, and a heterogeneous compiler, HPVM-HDC, that provides an intermediate representation for compiling HDC programs to many hardware targets. We implement two tuning optimizations, automatic binarization and reduction perforation, that exploit the error resilient nature of HDC. Our evaluation shows that HPVM-HDC generates performance-competitive code for CPUs and GPUs, achieving a geomean speed-up of 1.17x over optimized baseline CUDA implementations with a geomean reduction in total lines of code of 1.6x across CPUs and GPUs. Additionally, HPVM-HDC targets an HDC Digital ASIC and an HDC ReRAM accelerator simulator, enabling the first execution of HDC applications on these devices.
由认知计算模型所启发的一种技术,即超超度计算(HDC),已被提议为机器学习的一个高效和稳健的替代基础。HDC程序通常以低级别和目标特定语言手工编写,针对CPU、GPUs和FPGAs,这些代码无法轻易地重新锁定到HDC专用加速器上。以前没有一种编程系统能够促进HDC程序的生产发展,并为若干硬件目标生成高效代码。我们建议HDC使用一种多样化的编程系统:一种新型编程语言HDC++,用于使用统一的编程模型编写应用程序,包括HDC特定原始生物(HDC),以提高程序可操作性,以及一个混杂的HPVM-HDC编程,为将HDC程序编集到许多硬件目标提供中间代表。我们实施了两个调整优化、自动双倍化和递减、利用HDC的适应性能性能特性,我们的评价显示,HPVMC为CPU和GSDDC首次执行系统,在GHPLA目标上实现1.6的地平流速度,这些CSDDDDDC总的CSLULA目标。
Article 14
Title@2025-06-21 (6): ASC-Hook: fast and transparent system call hook for Arm
Title: ASC-Hook: fast and transparent system call hook for Arm | ASC-Hook: Schneller und transparenter Systemrufhaken für Arm | ASC-Hook: 快速和透明的系统调用武器钩 2412.05784v4 |
Authors (4): Yang Shen, Min Xie, Wenzhe Zhang, Tao Wu
Intercepting system calls is crucial for tools that aim to modify or monitor application behavior. However, existing system call interception tools on the ARM platform still suffer from limitations in terms of performance and completeness. This paper presents an efficient and comprehensive binary rewriting framework, ASC-Hook, specifically designed for intercepting system calls on the ARM platform. ASC-Hook addresses two key challenges on the ARM architecture: the misalignment of the target address caused by directly replacing the SVC instruction with br x8, and the return to the original control flow after system call interception. This is achieved through a hybrid replacement strategy and our specially designed trampoline mechanism. By implementing multiple completeness strategies specifically for system calls, we ensured comprehensive and thorough interception. Experimental results show that ASC-Hook reduces overhead to at least 1/29 of that of existing system call interception tools. We conducted extensive performance evaluations of ASC-Hook, and the average performance loss for system call-intensive applications is 3.7\% .
截取系统呼叫对于旨在修改或监测应用行为的工具至关重要,然而,在ARM平台上现有的系统呼叫截取工具仍然在性能和完整性方面受到限制。本文件介绍了一个高效而全面的二进制重写框架(ASC-Hook,专门设计用于截取系统呼叫ARM平台的ASC-Hook,专为截取系统而设计的ASC-Hook)。ASC-Hook应对了ARM架构的两大挑战:直接用brx8取代SVC指令导致目标地址的错配,以及系统呼叫拦截后返回原控制流程。这是通过混合替换战略和我们专门设计的蹦床机制实现的。我们通过执行针对系统呼叫的多重完整战略,确保了全面彻底的截取。实验结果表明,ASC-Hook将现有系统呼叫拦截工具的间接费用减少到至少1/29。我们对ASC-Hook进行了广泛的绩效评估,而系统呼叫密集应用的平均性能损失是3.7。
Article 15
Title@2025-06-20 (5): $Δ$-Nets: Interaction-Based System for Optimal Parallel $λ$-Reduction
Title: $Δ$-Nets: Interaction-Based System for Optimal Parallel $λ$-Reduction | $Δ$-Nets: Interaktionsbasiertes System für eine optimale parallele $λ$-Reduktion | \(-净额:最佳平行互动系统\)$美元-削减 2505.20314v3 |
Authors (1): Daniel Augusto Rizzi Salvadori
I present a model of universal parallel computation called $\Delta$-Nets, and a method to translate $\lambda$-terms into $\Delta$-nets and back. Together, the model and the method constitute an algorithm for optimal parallel $\lambda$-reduction, solving the longstanding enigma with groundbreaking clarity. I show that the $\lambda$-calculus can be understood as a projection of $\Delta$-Nets$-$one that severely restricts the structure of sharing, among other drawbacks. Unhindered by these restrictions, the $\Delta$-Nets model opens the door to new parallel programming language implementations and computer architectures that are more efficient and performant than previously possible.
我提出了一个称为$Delta$-Nets的通用平行计算模型,以及将$lambda$-terms 转换成$Delta$-nets和回调的方法。模型和方法共同构成一个优化平行$\lambda$降值的算法,以突破性清晰度解决长期谜题。我表明,$Limbda$-calculus可以被理解为一个严重限制共享结构的$Delta$-Nets-one的预测,除其他缺陷外。由于这些限制,$Delta$-Nets模型为平行语言实施和计算机结构打开了大门,这些语言实施和计算机结构比以前可能更有效和更实用。
Article 16
Title@2025-06-20 (5): Low Overhead Allocation Sampling in a Garbage Collected Virtual Machine
Title: Low Overhead Allocation Sampling in a Garbage Collected Virtual Machine | Low Overhead Allocation Probenahme in einem Garbage gesammelte virtuelle Maschine | 在垃圾收集的虚拟机器中进行低间接费用分配抽样 2506.16883v1 |
Authors (2): Christoph Jung, C. F. Bolz-Tereick
Compared to the more commonly used time-based profiling, allocation profiling provides an alternate view of the execution of allocation heavy dynamically typed languages. However, profiling every single allocation in a program is very inefficient. We present a sampling allocation profiler that is deeply integrated into the garbage collector of PyPy, a Python virtual machine. This integration ensures tunable low overhead for the allocation profiler, which we measure and quantify. Enabling allocation sampling profiling with a sampling period of 4 MB leads to a maximum time overhead of 25% in our benchmarks, over un-profiled regular execution.
与更常用的基于时间的特征分析相比,分配概况分析提供了一种对分配使用大量动态打字语言的替代观点,然而,对方案内每项分配的特征分析非常低效。我们展示了一个抽样分配剖面分析器,深入地融入了PyPy的垃圾收集器,PyPy, Python虚拟机器。这种整合确保了分配剖面分析器的低金枪鱼管理费,我们对其进行测量和量化。在4MB取样期内进行分配抽样分析,导致我们基准中的最大时间管理费为25 % , 高于正常执行。
Article 17
Title@2025-06-20 (5): CodeV-R1: Reasoning-Enhanced Verilog Generation
Title: CodeV-R1: Reasoning-Enhanced Verilog Generation | CodeV-R1: Grundlegende Verilog-Generierung | 代码V-R1:有理性的增强性性性性性性性生殖器生成 2505.24183v2 |
Authors (19): Yaoyu Zhu, Di Huang, Hanqi Lyu, Xiaoyun Zhang, Chongxiao Li, Wenxuan Shi, Yutong Wu, Jianan Mu, Jinghua Wang, Yang Zhao, Pengwei Jin, Shuyao Cheng, Shengwen Liang, Xishan Zhang, Rui Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen
Large language models (LLMs) trained via reinforcement learning with verifiable reward (RLVR) have achieved breakthroughs on tasks with explicit, automatable verification, such as software programming and mathematical problems. Extending RLVR to electronic design automation (EDA), especially automatically generating hardware description languages (HDLs) like Verilog from natural-language (NL) specifications, however, poses three key challenges: the lack of automated and accurate verification environments, the scarcity of high-quality NL-code pairs, and the prohibitive computation cost of RLVR. To this end, we introduce CodeV-R1, an RLVR framework for training Verilog generation LLMs. First, we develop a rule-based testbench generator that performs robust equivalence checking against golden references. Second, we propose a round-trip data synthesis method that pairs open-source Verilog snippets with LLM-generated NL descriptions, verifies code-NL-code consistency via the generated testbench, and filters out inequivalent examples to yield a high-quality dataset. Third, we employ a two-stage “distill-then-RL” training pipeline: distillation for the cold start of reasoning abilities, followed by adaptive DAPO, our novel RLVR algorithm that can reduce training cost by adaptively adjusting sampling rate. The resulting model, CodeV-R1-7B, achieves 68.6% and 72.9% pass@1 on VerilogEval v2 and RTLLM v1.1, respectively, surpassing prior state-of-the-art by 12~20%, while matching or even exceeding the performance of 671B DeepSeek-R1. We will release our model, training pipeline, and dataset to facilitate research in EDA and LLM communities.
通过可核实的奖励(RLVR)强化学习培训的大型语言模型(LLM)在明确、可自动化的核查任务(如软件编程和数学问题)方面取得了突破。将RLVR扩大到电子设计自动化(EDA),特别是自动生成硬件描述语言(HDLs),如来自自然语言(NL)规格的Verilog(HDLL),然而,这带来了三大挑战:缺乏自动和准确的核查环境,缺少高质量的NLVR码配对,以及RLVR的计算成本过高。为此,我们引入了代码V-R1,一个用于培训VRR1的RV-7框架,用于培训Verilog的LLLMM。首先,我们开发了一个基于规则的测试引擎,对金参考进行强有力的等值检查。第二,我们提出了一个圆柱数据合成方法,将开放源的 Verilog布片与LM生成的NLLS描述配对,通过生成的测试对代码-NLF20码一致性进行校准,并通过等示例进行过滤, 提供高品质数据集。第三,我们开始两个阶段的“DLVR-R-R-R-R-R-R-R-R-R-d-d-d-d-drodustris-d-d-d-d-dal”的升级的测试,然后进行成本的升级的升级的校程的校程的校程的校程的校程的校制算算法。
Article 18
Title@2025-06-20 (5): Zoozve: A Strip-Mining-Free RISC-V Vector Extension with Arbitrary Register Grouping Compilation Support (WIP)
Title: Zoozve: A Strip-Mining-Free RISC-V Vector Extension with Arbitrary Register Grouping Compilation Support (WIP) | Zoozve: Eine Strip-Mining-freie RISC-V-Vektorerweiterung mit arbitrary Register-Gruppierungsunterstützung (WIP) | Zooozve:利用任意登记册组汇编支助,扩大一个带分解自由的RISC-V矢量扩展区(WIP) 2504.15678v2 |
Authors (7): Siyi Xu, Limin Jiang, Yintao Liu, Yihao Shen, Yi Shi, Shan Cao, Zhiyuan Jiang
Vector processing is crucial for boosting processor performance and efficiency, particularly with data-parallel tasks. The RISC-V “V” Vector Extension (RVV) enhances algorithm efficiency by supporting vector registers of dynamic sizes and their grouping. Nevertheless, for very long vectors, the static number of RVV vector registers and its power-of-two grouping can lead to performance restrictions. To counteract this limitation, this work introduces Zoozve, a RISC-V vector instruction extension that eliminates the need for strip-mining. Zoozve allows for flexible vector register length and count configurations to boost data computation parallelism. With a data-adaptive register allocation approach, Zoozve permits any register groupings and accurately aligns vector lengths, cutting down register overhead and alleviating performance declines from strip-mining. Additionally, the paper details Zoozve’s compiler and hardware implementations using LLVM and SystemVerilog. Initial results indicate Zoozve yields a minimum 10.10$\times$ reduction in dynamic instruction count for fast Fourier transform (FFT), with a mere 5.2\% increase in overall silicon area.
矢量处理对于提高处理器的性能和效率至关重要,特别是在数据平行任务方面。RISC-V“V”矢量扩展(RVVV)通过支持动态尺寸的矢量登记册及其分组来提高算法效率。然而,对于非常长的矢量而言,RV矢量登记册的静数及其2组的功率可能导致性能限制。为了消除这一限制,这项工作引入了Zooozve,一个RISC-V矢量指示扩展,消除了脱衣采掘的需要。Zoozve允许灵活矢量登记长度和计数配置,以推进数据的平行计算。在数据适应性登记册分配方法下,Zoozve允许任何登记组并准确地调整矢量长度,削减登记册的间接费用并减缓脱衣采伐的性能下降。此外,文件细节Zozve使用LVM和系统Verilog, 显示Zozve能产生最低10.10美元的时间减少动态指示值,用于快速四变换(FFT),在一个小的面积上增加了一个5.2个区域。
Article 19
Title@2025-06-19 (4): ChatDBG: Augmenting Debugging with Large Language Models
Title: ChatDBG: Augmenting Debugging with Large Language Models | ChatDBG: Augmenting Debugging mit großen Sprachmodellen | 聊天DBG: 使用大语言模式加强调试 2403.16354v5 |
Authors (4): Kyla H. Levin, Nicolas van Kempen, Emery D. Berger, Stephen N. Freund
Debugging is a critical but challenging task for programmers. This paper proposes ChatDBG, an AI-powered debugging assistant. ChatDBG integrates large language models (LLMs) to significantly enhance the capabilities and user-friendliness of conventional debuggers. ChatDBG lets programmers engage in a collaborative dialogue with the debugger, allowing them to pose complex questions about program state, perform root cause analysis for crashes or assertion failures, and explore open-ended queries like “why is x null?”. To handle these queries, ChatDBG grants the LLM autonomy to “take the wheel”: it can act as an independent agent capable of querying and controlling the debugger to navigate through stacks and inspect program state. It then reports its findings and yields back control to the programmer. By leveraging the real-world knowledge embedded in LLMs, ChatDBG can diagnose issues identifiable only through the use of domain-specific reasoning. Our ChatDBG prototype integrates with standard debuggers including LLDB and GDB for native code and Pdb for Python. Our evaluation across a diverse set of code, including C/C++ code with known bugs and a suite of Python code including standalone scripts and Jupyter notebooks, demonstrates that ChatDBG can successfully analyze root causes, explain bugs, and generate accurate fixes for a wide range of real-world errors. For the Python programs, a single query led to an actionable bug fix 67% of the time; one additional follow-up query increased the success rate to 85%. ChatDBG has seen rapid uptake; it has already been downloaded more than 75,000 times.
调试对于程序员来说是一项关键但具有挑战性的任务。 本文提议了 CHATDBG , 是一个 AI 驱动调试助手 。 聊天DBG 整合了大语言模型( LLMS) , 以大大增强常规调试器的能力和用户友好性。 聊天DBG 允许程序员与调试器进行协作对话, 允许他们提出有关程序状态的复杂问题, 对崩溃或主张失败进行根源分析, 并探索“ 为何是无效的” 等开放式查询。 要处理这些查询, 聊天DBG 授予LDB 自主权, 以“ 掌握方向盘 ” : 它可以作为一个能够查询和控制调试器的大型调试器( LLLMDBG ) , 以快速调试器( LLDBDG ) 自动调试“ ” : 它可以作为一个独立的代理代理机构, 包括C/C+G 快速调试算器, 将一个真实的调序算法解算器解算出一个已知的PBDBDBDB 错误, , 可以成功地解算算出一个系统。
Article 20
Title@2025-06-19 (4): Floating-Point Neural Networks Are Provably Robust Universal Approximators
Title: Floating-Point Neural Networks Are Provably Robust Universal Approximators | Floating-Point-Neural-Netzwerke sind wahrscheinlich robuste Universal-Annäherung | 浮动点神经网络具有可可预见强健的通用通用近似器 2506.16065v1 |
Authors (5): Geonho Hwang, Wonyeol Lee, Yeachan Park, Sejun Park, Feras Saad
The classical universal approximation (UA) theorem for neural networks establishes mild conditions under which a feedforward neural network can approximate a continuous function $f$ with arbitrary accuracy. A recent result shows that neural networks also enjoy a more general interval universal approximation (IUA) theorem, in the sense that the abstract interpretation semantics of the network using the interval domain can approximate the direct image map of $f$ (i.e., the result of applying $f$ to a set of inputs) with arbitrary accuracy. These theorems, however, rest on the unrealistic assumption that the neural network computes over infinitely precise real numbers, whereas their software implementations in practice compute over finite-precision floating-point numbers. An open question is whether the IUA theorem still holds in the floating-point setting. This paper introduces the first IUA theorem for floating-point neural networks that proves their remarkable ability to perfectly capture the direct image map of any rounded target function $f$, showing no limits exist on their expressiveness. Our IUA theorem in the floating-point setting exhibits material differences from the real-valued setting, which reflects the fundamental distinctions between these two computational models. This theorem also implies surprising corollaries, which include (i) the existence of provably robust floating-point neural networks; and (ii) the computational completeness of the class of straight-line programs that use only floating-point additions and multiplications for the class of all floating-point programs that halt.
古典通用近似( UA) 神经网络的古典通用近似( UA) 理论建立了温和的条件, 进料神经网络可以在这种条件下任意精确地估计一个连续功能$f美元。 最近的结果显示,神经网络也享有更普遍的间隙通用近近近( IUA) 理论, 也就是说, 使用间域网络的抽象解释语解可以任意精确地接近美元直接图像地图( 即对一组输入应用$f美元的结果 ) 。 然而, 这些理论依据的假设是不切实际的, 神经网络可以对无限精确的多数值进行计算, 而它们在实践中的软件执行则对有限精准浮点浮点数字进行计算。 一个公开的问题就是, 使用间距网的抽象解释语解, 浮点神经网络的第一个IUA理论, 证明其完全能够精确地捕捉到任何圆形目标的直成像图, 美元, 显示其清晰度是没有限制的。 我们的IUA理论网络在实践中的软件执行过程对有限, 相对于有限, 有限地, 浮点的计算, 浮点模型的精确的计算, 也反映了这些浮点 和 水平 的 的模型的精确值 的 , , 和 的 的 的 , 的 代表了这些 的 的 和 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 和 的 的 的 的 的 的 的 的 的 的 和 的 的 的 的 的 的 的 的 的 的 的 的 和 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的
Article 21
Title@2025-06-19 (4): WAMI: Compilation to WebAssembly through MLIR without Losing Abstraction
Title: WAMI: Compilation to WebAssembly through MLIR without Losing Abstraction | WAMI: Kompilation zu WebAssembly durch MLIR ohne Abstraktion zu verlieren | WAMI:通过MLIR通过网络大会汇编,不失抽象 2506.16048v1 |
Authors (4): Byeongjee Kang, Harsh Desai, Limin Jia, Brandon Lucia
WebAssembly (Wasm) is a portable bytecode format that serves as a compilation target for high-level languages, enabling their secure and efficient execution across diverse platforms, including web browsers and embedded systems. To improve support for high-level languages without incurring significant code size or performance overheads, Wasm continuously evolves by integrating high-level features such as Garbage Collection and Stack Switching. However, existing compilation approaches either lack reusable design – requiring redundant implementation efforts for each language – or lose abstraction by lowering high-level constructs into low-level shared representations like LLVM IR, which hinder the adoption of high-level features. MLIR compiler infrastructure provides the compilation pipeline with multiple levels of abstraction, preserving high-level abstractions throughout the compilation pipeline, yet the current MLIR pipeline relies on the LLVM backend for Wasm code generation, thereby inheriting LLVM’s limitations. This paper presents a novel compilation pipeline for Wasm, featuring Wasm dialects explicitly designed to represent high-level Wasm constructs within MLIR. Our approach enables direct generation of high-level Wasm code from corresponding high-level MLIR dialects without losing abstraction, providing a modular and extensible way to incorporate high-level Wasm features. We illustrate this extensibility through a case study that leverages Stack Switching, a recently introduced high-level feature of Wasm. Performance evaluations on PolyBench benchmarks show that our pipeline, benefiting from optimizations within the MLIR and Wasm ecosystems, produces code with at most 7.7\% slower, and faster in some execution environments, compared to LLVM-based compilers.
瓦西姆(Wasm)是一个便携式字码格式,可以作为高语言的汇编目标,使其能够在包括网络浏览器和嵌入系统在内的不同平台上安全高效地安全高效地执行。为了在不引起大量代码大小或性能间接费用的情况下更好地支持高层次语言,瓦西姆(Wasm)通过整合垃圾收集和斯塔克切换等高层次特征不断演进。然而,现有的汇编方法要么缺乏可重复使用的设计 – – 需要为每种语言做出多余的执行努力 – – 要么通过降低高层次结构,在低层次共同代表机构,例如LLLVM IR(LLLLVM IR),这妨碍了采用高层次标准。MLIR(LIR)编集基础设施提供多层次抽象的编集管道,在整个编集管道中保留高层次的抽象内容,而目前的MLVM(LVM)编集依靠LLVM的后端版本生成,从而继承LLLVM的局限性。本文为WMMM(W)提供了一个新的编集管道,明确设计用于在MLLLLLLIIR(高层次)内代表高层次的低层次的瓦标准结构。我们从高层次直接生成的版本的版本的版本,在高层次平流流路路路段和高层次平流数据中可以直接生成一个高层次的版本的版本的版本,在高层次上,在高层次的版本的版本中提供高层次的版本的版本,在高层次上进行。