arXiv cs.CL papers May 26 2026
For May 26, 2026, this feed lists 29 papers from arXiv's computational linguistics section (cs.CL), spanning neural speech decoding, harmful content detection, retrieval-augmented generation, legal NLP, multimodal document processing, and LLM interpretability. Topics include end-to-end intracortical speech decoding without external language models, dialect bias in language models, long-context memory diagnostics, and medical reasoning in Hindi.
End-to-End Intracortical Speech Decoding from Neural Activity
Distinguishing Right from Wrong in Debates: Attribution Analysis of Chinese Harmful Memes
How Much Structure Do LLMs Need? Evaluating LLMs for Bibliometric Cluster Description
Structure-Aware RAG: Structured Retrieval Augmented Generation from Noisy Data for Conversational Agents
Side-by-side Comparison Amplifies Dialect Bias in Language Models
SEAL: Synergistic Co-Evolution of Agents and Learning Environments
Found in Conversation: LLMs Teach Themselves to Close the Multi-Turn Gap
Phonetic Modeling of Dialectal Variation in Vietnamese Speech
Temporal Concept Drift in Legal Judgment Prediction: Neural Baselines Across Three Epochs of Ukrainian Court Decisions
Decompose-and-Refine: Structured Legal Question Answering with Parametric Retrieval
Grammatically-Guided Sparse Attention for Efficient and Interpretable Transformers
Unveil: Unified Visual-Textual Integration and Distillation for Multi-modal Document Retrieval
Generating Legal Commentaries from Case Databases via Retrieval, Clustering, and Generation
AstroMind: A High-Fidelity Benchmark for Spacecraft Behavior Reasoning Based on Large Language Models
WhenLoss: Diagnosing Write and Retrieval Bottlenecks in Long-Context Memory Systems
Word Class Representations Spontaneously Emerge from Successor Representations Trained on Natural Language
CSP-Atlas: Concept-Specific Neural Circuits in a Sparse Python Transformer
Guarded Repair for Harm-Aware Post-hoc Replacement of LLM Mathematical Reasoning
Measuring the Depth of LLM Unlearning via Activation Patching
HiMed: Incentivizing Hindi Reasoning in Medical LLMs
Know You Before You Speak: User-State Modeling for LLM Personalization in Multi-Turn Conversation
Mix-MoE: Improving Multilingual Machine Translation of Large Language Models through Mixed MoEs
CP-Agent: A Calibrated Risk-Controlled Agent for Feedback-Driven Competitive Programming
The Path Matters: Learning a Token-Commitment Policy for Diffusion Language Models
TS-Skill: A Benchmark for Evaluating Analytical Skills in Time-Series Question Answering
The Tokenizer Tax Across 25 European Languages: Domain Invariance, Cross-Lingual Few-Shot Effects, and the Ukrainian Penalty
World-State Transformations for Neuro-symbolic Interactive Storytelling
ROC Analysis for Evaluating Translation Quality Estimation Systems
StepGap: A Hybrid NLI-LLM Checker for Step-Level Evidence-Gap Detection in Multi-Hop Question Answering