←── back to feed
/topics/arxiv-cs-ai-papers-june-11-2026
arXiv cs.AI papers June 11 2026
43 items●1 sources●updated 6d ago●trend 0
On June 11, 2026, arXiv cs.AI published 20 papers spanning agent reasoning, memory systems, scientific synthesis, and trustworthy AI. Key topics include multi-agent orchestration, behavioral forecasting, financial reasoning agents, tactile commonsense, and spatial reasoning in multimodal models.
- SemantiClean framework prioritizes auditability and reproducibility over accuracy in e-commerce semantic inference
- SciConBench benchmark contains 9.11K questions from systematic reviews to evaluate AI scientific conclusion synthesis
- MoCA-Agent uses market-of-claims mechanism for financial and tabular reasoning with claim-level verification
- INFRAMIND addresses infrastructure-blind multi-agent orchestration on shared GPU clusters under concurrent load
- RecToM models nested beliefs via recursive perspective construction for Theory of Mind reasoning tasks
[BLG]blog/rss43
From Explicit Elements to Implicit Intent: A Predefined Library for Auditable Behavioral Inference
Position: Hippocampal Explicit Memory Is the Cornerstone for AGI
Can AI Agents Synthesize Scientific Conclusions?
Knowing When to Ask: Self-Gated Clarification for Hierarchical Language Agents
Automated Mediator for Human Negotiation: Pre-Mediation via a Structured LLM Pipeline
INFRAMIND: Infrastructure-Aware Multi-Agent Orchestration
Forecasting Future Behavior as a Learning Task
Search Discipline for Long-Horizon Research Agents
MoCA-Agent: A Market-of-Claims Code Agent for Financial and Numerical Reasoning
SkillJuror: Measuring How Agent Skill Organization Changes Runtime Behavior
HERO: Hindsight-Enhanced Reflection from Environment Observations for Agentic Self-Distillation
Architecture-Aware Reinforcement Learning Makes Sliding-Window Attention Competitive in Math Reasoning
TouchThinker: Scaling Tactile Commonsense Reasoning to the Open World with Large-scale Data and Action-aware Representation
TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search
Lung-R1: A Knowledge Graph-Guided LLM for Pulmonary Diagnostic Reasoning
Organize then Retrieve: Hierarchical Memory Navigation for Efficient Agents
Mind the Perspective: Let's Reason Recursively for Theory of Mind
When Do Data-Driven Systems Exhibit the Capability to Infer?
SVoT: State-aware Visualization-of-Thought for Spatial Reasoning via Reinforcement Learning
Toward Trustworthy AI: Multi-Target Adversarial Attacks and Robust Defenses for Continuous Data Summarization
Skill-Augmented AI Agents for Medical Research Analysis: An Exploratory Multi-Model Human Evaluation in an NSCLC Transcriptomic Biomarker Task
StatefulDiscovery: Evidence-Calibrated Claim Formation in Open-Ended Scientific Discovery
AutoMine Solution for AV2 2026 Scenario Mining Challenge
Embodied-BenchClaw: An Autonomous Multi-Agent System for Embodied Spatial Intelligence Benchmark Construction
The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning
MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning
Human-Enhanced Loop Modeling (HELM): Agent-Based Finite Element Modeling of Concrete Bridge Barriers
Existential Indifference: Self-Nonpreservation as a Necessary Architectural Condition for Aligned Superintelligence (or: The Suicidal AI)
A Lightweight Multi-Agent Framework for Automated Concrete Barrier Design
Automating Geometry-Intensive Compliance Checking in BIM: Graph-Based Semantic Reasoning Framework
IntElicit: Eliciting and Assessing Contextualized Creativity via Dialogue Policy Optimization
Towards Responsibly Non-Compliant Machines
The Impossibility of Eliciting Latent Knowledge
A Five-Plane Reference Architecture for Runtime Governance of Production AI Agents
PROJECTMEM: A Local-First, Event-Sourced Memory and Judgment Layer for AI Coding Agents
Nonslop: A Gamified Experiment in Human-AI Collaborative Writing
From Architecture to Output: Structural Origins of Hallucination in Large Language Models and the Amplifying Role of Data
From Consumption to Reflection: Designing Human-AI Relations for Stable Reasoning
MA-DLE: Speech-based Automatic Depression Level Estimation via Memory Augmentation
To Intervene or Not: Guiding Inference-time Alignment with Probabilistic Model Blending
Dual-Stance Evaluation of Sycophancy: The Structure of Agreement and the Limits of Intervention
From Awareness to Action: Understanding and Overcoming the Research-Practice Gap in Algorithmic Fairness for Public Health
The Environmental Cost of LLMs in AIED: Reporting and Practices