IndustryarXiv:2605.05409v1 Announce Type: new Abstract: Financial document question answering (QA) demands complex multi-step numerical reasoning over heterogeneous evidence--structured tables, textual narratives, and…
cs.AI updates on arXiv.org·May 9·Score 7.0
IndustryarXiv:2605.05402v1 Announce Type: new Abstract: Artificial intelligence (AI) and computer vision are transforming transportation data collection. This study introduces an AI-enabled analytics framework leveraging…
cs.AI updates on arXiv.org·May 9·Score 7.0
IndustryarXiv:2605.05217v1 Announce Type: new Abstract: We propose a self-supervised physics-informed neural network (PINN) framework that adaptively balances physics-based and data-driven supervision for scientific machine…
cs.LG updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05209v1 Announce Type: new Abstract: Neural networks that land in flat regions of the loss landscape tend to generalise better than those in sharp regions. Sharpness-Aware Minimisation exploits this to…
cs.LG updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05278v1 Announce Type: new Abstract: Resource-efficient machine learning increasingly uses sparse Mixture-of-Experts (MoE) architectures, where the gate acts as both a learning component and a routing…
cs.LG updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05213v1 Announce Type: new Abstract: Chronic rhinosinusitis (CRS) is a common heterogeneous inflammatory disorder that causes substantial morbidity and healthcare costs. CRS is difficult to identify early…
cs.LG updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05226v1 Announce Type: new Abstract: The central challenge of reinforcement learning for reasoning lies not only in the sparsity of outcome-level supervision, but more fundamentally in how to transform…
cs.LG updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05216v1 Announce Type: new Abstract: Large language models (LLMs) with a large number of parameters achieve strong performance but are often prohibitively expensive to deploy. Recent work explores using teams…
cs.LG updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05219v1 Announce Type: new Abstract: Prefix caching is a key latency optimization for autoregressive LLM serving, yet existing systems assume dense per-token key/value reuse. State-space models change the…
cs.LG updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05224v1 Announce Type: new Abstract: The unauthorized use of personal data in model training has emerged as a growing privacy threat. Unlearnable examples (UEs) address this issue by embedding imperceptible…
cs.LG updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05218v1 Announce Type: new Abstract: Predictive multiplicity and chaotic dynamics represent two fundamental challenges in machine learning that have evolved independently despite their conceptual connections.…
cs.LG updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05443v1 Announce Type: new Abstract: LLM watermarks must be detectable without compromising text quality, yet most existing schemes bias the next-token distribution and pay for detection with measurable…
cs.CL updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05758v1 Announce Type: new Abstract: Despite the success of large language models (LLMs) on general-purpose tasks, their performance in highly specialized domains such as biomedicine remains unsatisfactory. A…
cs.CL updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05835v1 Announce Type: new Abstract: Large reasoning models (LRMs) sometimes note in their chain of thought (CoT) that they may be under evaluation. Researchers worry that this verbalised evaluation awareness…
cs.CL updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05532v1 Announce Type: new Abstract: This paper evaluates whether a domain trained Small Language Model (SLM) can outperform frontier Large Language Models on structured contract extraction at radically lower…
cs.CL updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05485v1 Announce Type: new Abstract: LLMs can solve program synthesis tasks but remain inefficient and unreliable on hard instances requiring large combinatorial search. Given a small set of reasoning traces,…
cs.CL updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05626v1 Announce Type: new Abstract: Large Language Models (LLMs) excel at generating contextually appropriate responses but remain poorly calibrated for multi-party conversations, where deciding when to…
cs.CL updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05676v1 Announce Type: new Abstract: Recently, the prominent performance of large language models (LLMs) has been largely driven by multi-task instruct-tuning. Unfortunately, this training paradigm suffers…
cs.CL updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05245v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) remains brittle on multi-hop questions in realistic deployment settings, where retrieved evidence may be noisy or redundant and only…
cs.CL updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05392v1 Announce Type: new Abstract: Large-scale datasets are widely used to perform summarization tasks, but they may not include queries alongside documents and summaries. In the search for suitable…
cs.CL updates on arXiv.org·May 8·Score 7.0
IndustryarXiv:2605.05892v1 Announce Type: new Abstract: Activation steering has emerged as a promising alternative for controlling language-model behavior at inference time by modifying intermediate representations while…
cs.CL updates on arXiv.org·May 8·Score 7.0
IndustryThis paper proposes a clinical chatbot that grounds answers in official guidelines using prioritized evidence retrieval and verifiable citations. It is relevant for builders working on high-stakes RAG systems where…
cs.AI updates on arXiv.org·May 6·Score 9.6
IndustryThis paper gives a formal algebraic semantics for governed execution, backed by a substantial Rocq mechanization. It is technically rigorous, but the abstraction is far from the day-to-day concerns of most AI builders.
cs.AI updates on arXiv.org·May 6·Score 8.2
IndustryThis paper connects an AI interface to battery experimentation infrastructure to speed up formation-protocol optimization for sodium-ion cells. It is most relevant as an example of closed-loop materials discovery, where…
cs.AI updates on arXiv.org·May 6·Score 8.4
IndustryarXiv:2605.00242v1 Announce Type: cross Abstract: Millimetre-wave (mmWave) radar offers a more privacy-preserving alternative to RGB-based human pose estimation. However, existing methods typically rely on pre-extracted…
cs.AI updates on arXiv.org·May 5
SafetyarXiv:2605.00236v1 Announce Type: cross Abstract: Safety-aligned large language models rely on RLHF and instruction tuning to refuse harmful requests, yet the internal mechanisms implementing safety behavior remain…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00111v1 Announce Type: cross Abstract: Person re-identification (Re-ID) aims to match images of the same individual across non-overlapping camera views and remains challenging due to domain shifts caused by…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00087v1 Announce Type: cross Abstract: Many recent news reports have claimed that content generated by large language models (LLMs) is taking over the web. However, these claims are typically not based on a…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00133v1 Announce Type: cross Abstract: Modern crop advisory systems exhibit a critical limitation termed \textit{economic blindness}. These systems primarily optimize for biological yield, often overlooking…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00074v1 Announce Type: cross Abstract: DNA-synthesis providers screen incoming orders by searching the requested sequence against curated hazard lists. We show that this baseline collapses to a 100%…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00068v1 Announce Type: cross Abstract: Inertial Confinement Fusion (ICF) holds transformative promise for sustainable, near-limitless clean energy, yet remains constrained by prohibitively high costs and…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00082v1 Announce Type: cross Abstract: The Forward-Forward (FF) algorithm presents a compelling, bio-inspired alternative to backpropagation. However, while efficient in training, it has a computationally…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00197v1 Announce Type: cross Abstract: Studies attempting to simulate human behavior with grow in numbers while LLM-only social networks have started appearing outside of controlled settings. However, the…
cs.AI updates on arXiv.org·May 5
AgenticarXiv:2605.00055v1 Announce Type: cross Abstract: We report a safety incident in a deployed multi-agent research system in which a primary AI agent installed 107 unauthorized software components, overwrote a system…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00015v1 Announce Type: cross Abstract: Time Series Foundation Models (TSFMs) advance generalization and data efficiency in time series forecasting by unified large-scale pretraining. But TSFMs remain lacking…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00056v1 Announce Type: cross Abstract: Groundwater in the Densu Basin is increasingly threatened by heavy metal contamination, but conventional methods fail to capture the statistical complexity and spatial…
cs.AI updates on arXiv.org·May 5
AgenticarXiv:2605.00737v1 Announce Type: new Abstract: Agentic AI architectures augment LLMs with external tools, unlocking strong capabilities. However, tool use is not always beneficial; some calls may be redundant or even…
cs.AI updates on arXiv.org·May 5
PhysicalarXiv:2605.00412v1 Announce Type: new Abstract: World models have recently re-emerged as a central paradigm for embodied intelligence, robotics, autonomous driving, and model-based reinforcement learning. However,…
cs.AI updates on arXiv.org·May 5
AgenticarXiv:2605.00334v1 Announce Type: new Abstract: Production agentic systems make many model calls per user request, and most of those calls are short, structured, and routine. This raises a practical routing question…
cs.AI updates on arXiv.org·May 5
PhysicalarXiv:2605.00438v1 Announce Type: new Abstract: Long-horizon robotic manipulation requires plans that are both logically coherent and geometrically grounded. Existing Vision-Language-Action policies usually hide…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00440v1 Announce Type: new Abstract: The evolution of artificial intelligence (AI) has rendered the boundary between humanity and computational machinery increasingly ambiguous. In the presence of more…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00572v1 Announce Type: new Abstract: Algorithm performance in combinatorial optimization is highly sensitive to parameter settings, while a single globally tuned configuration often fails to exploit the…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00005v1 Announce Type: cross Abstract: The increasing deployment of deep neural networks (DNNs) in cyber-physical systems (CPS) enhances perception fidelity, but imposes substantial computational demands on…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00011v1 Announce Type: cross Abstract: Federated Learning (FL) enables collaborative intelligence across decentralized data source devices in a privacy-preserving way. While substantial research attention has…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00033v1 Announce Type: cross Abstract: Remote and webcam-based eye tracking in multi-line reading suffers from various noise factors and layout ambiguity, precisely where real-time reading support needs…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00043v1 Announce Type: cross Abstract: Big data platforms are widely used in modern enterprises, and an in-production intelligent assistant is increasingly important to help users quickly find actionable…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00742v1 Announce Type: new Abstract: LLMs excel at predictive tasks and complex reasoning tasks, but many high-value deployments rely on decisions under uncertainty, for example, which tool to call, which…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00059v1 Announce Type: cross Abstract: Deep reinforcement learning (DRL) finds extensive application in autonomous drone navigation within complex, high-risk environments. However, its practical deployment…
cs.AI updates on arXiv.org·May 5
IndustryarXiv:2605.00071v1 Announce Type: cross Abstract: Agentic payment systems extend delegated action to financial transfers, but scaling them on stablecoin rails in regulated settings requires safeguards that remain…
cs.AI updates on arXiv.org·May 5
SafetyarXiv:2605.00123v1 Announce Type: new Abstract: Safety trained large language models (LLMs) can often be induced to answer harmful requests through jailbreak prompts. Because we lack a robust understanding of why LLMs…
cs.AI updates on arXiv.org·May 5