←── back to feed
/topics/arxiv-cs-cl-papers-june-8-2026

arXiv cs.CL papers June 8 2026

50 items1 sourcesupdated 9d agotrend 0

On June 8, 2026, arXiv's cs.CL section published 20 papers spanning multilingual factual consistency, LLM personalization, reasoning failure diagnosis, retrieval-augmented generation, and cultural alignment. Topics ranged from cross-lingual QA datasets and web agent architectures to behavioral biometrics in prompts, evidence utilization diagnostics, and tone-aware health communication systems.

[BLG]blog/rss50
Improving Cross-Lingual Factual Recall via Consistency-Driven Reinforcement Learning
arXiv cs.CL · Jonathan von Rad, Louis Arts, George Burgess, Eleftheria Kolokytha, Harry O'Donnell, Ektor Oikonomidis Doumpas, Eduardo Sanchez, Yao Lu, Pontus Stenetorp · 9d
Re-Centering Humans in LLM Personalization
arXiv cs.CL · Lechen Zhang, Jiarui Liu, Tal August · 9d
UnpredictaBench: A Benchmark for Evaluating Distributional Randomness in LLMs
arXiv cs.CL · Amirhossein Abaskohi, Amirhossein Dabiriaghdam, Liang Luo, Ellie Dingqiao Wen, Lele Wang, Giuseppe Carenini, Peter West · 9d
How Language Models Fail: Token-Level Signatures of Committed and Persistent Reasoning Failures
arXiv cs.CL · Tanvi Thoria, Kiana Jafari, Marc R. Schlichting, Mykel J. Kochenderfer · 9d
CAF-Gen: A Multi-Agent System for Enriching Argumentation Structures
arXiv cs.CL · Jakub B\k{a}ba, Jaros{\l}aw Chudziak · 9d
The Piggyback Hypothesis of Generalization: Explaining and Mitigating Emergent Misalignment
arXiv cs.CL · Jiachen Zhao, Zhengxuan Wu, Aryaman Arora, Yiyou Sun, David Bau, Weiyan Shi · 9d
What Do People Actually Want From AI? Mapping Preference Plurality
arXiv cs.CL · Julia Sep\'ulveda Coelho, Scott A. Hale · 9d
HKJudge: A Legal Discourse-Annotated Corpus for Interpreting What Courts Find, How They Reason, and What They Rule
arXiv cs.CL · Xi Xuan, Wenxin Zhang, Yufei Zhou, King-kui Sin, Chunyu Kit · 9d
Signal-Driven Observation for Long-Horizon Web Agents
arXiv cs.CL · Shubham Gaur, Ian Lane · 9d
Data-Efficient Autoregressive-to-Diffusion Language Models via On-Policy Distillation
arXiv cs.CL · Xingyu Su, Jacob Helwig, Shubham Parashar, Atharv Chagi, Lakshmi Jotsna, Degui Zhi, James Caverlee, Dileep Kalathil, Shuiwang Ji · 9d
Does Topic Sentiment Cause Perceived Ideology? Comparing Human and LLM Annotations in Political News Articles
arXiv cs.CL · Upasana Chatterjee · 9d
Modular Monolingual Adaptation using Pretrained Language Models
arXiv cs.CL · Nalin Kumar, Ond\v{r}ej Du\v{s}ek · 9d
When to Think Deeply: Inhibitory Deliberation for LLM Reasoning
arXiv cs.CL · Zhixuan He, Yue Feng · 9d
Evidence Graph Consistency in Retrieval-Augmented Generation: A Model-Dependent Analysis of Hallucination Detection
arXiv cs.CL · Jianru Shen · 9d
PromptPrint: Behavioral Biometrics Through Natural Language Prompting in LLMs
arXiv cs.CL · Shaiv Patel, Kartik Narayan, Vishal Patel · 9d
A Four-Condition Diagnostic Protocol for Evidence Utilization in Long-Context and Retrieval-Augmented Language Models
arXiv cs.CL · Haizhou Xia · 9d
When Better Codebooks Are Not Enough: Predictive Performance and Behavioral Reliability in LLM Political Event Coding
arXiv cs.CL · Zixian He, Bharath Raahul Murugesan, Patrick Brandt, Yibo Hu · 9d
Explain Like I'm 5 or Whatever I Choose: Evaluating the Interactive Potential of Language Model Responses
arXiv cs.CL · Indu Panigrahi, Tal August · 9d
TA-RAG: Tone-Aware Retrieval-Augmented Generation for Peer-Support Health Communication
arXiv cs.CL · Yong-Bin Kang, Anthony McCosker · 9d
Korean Culture into LLM Alignment: Toward Cultural Coherence
arXiv cs.CL · MinJae Jung, Minwoo Kim · 9d
Quantifying Media Representation Dynamics Across 25 Years of News Reporting on Policing-related Deaths
arXiv cs.CL · Farhan Samir, Jappun Dhillon, Meghna Ravikumar, Syed Ishtiaque Ahmed, Vered Shwartz · 9d
Progress-SQL: Improving Reinforcement Learning for Text-to-SQL via Progressive Rewards
arXiv cs.CL · Shihao Zhang, Xiaoman Wang, Yuan Liu, Yunshi Lan, Weining Qian · 9d
The Dark Regulome: Disentangling Predictability from Regulation in Genomic Foundation Models
arXiv cs.CL · Chahat Baranwal, Aadtya Baranwal, Lakshya Nitin Tandon · 9d
Translate-R1: Cost-Aware Translation Tool Use via Reinforcement Learning
arXiv cs.CL · Pratik Jayarao, Chaitanya Dwivedi, Himanshu Gupta, Neeraj Varshney, Adithya M Devraj, Meet Vadera, Priyanka Nigam, Bing Yin · 9d
Characterize Then Distill: Mechanistic Reasoning in Large Output Spaces
arXiv cs.CL · Debjyoti Saha Roy, Byron C. Wallace, Javed A. Aslam · 9d
CRAFT: A Unified Counterfactual Reasoning Framework for Tabular Question Answering and Fact Verification
arXiv cs.CL · Chenshuo Pan, Yu Zhao, Jie Zhang, Changzai Pan, Zhenhe Wu, Jiayi Liang, Yujie Mao, Shuangyong Song, Yongxiang Li, Zhongjiang He · 9d
Interpreting Brain Responses to Language with Sparse Features from Language Models
arXiv cs.CL · Michael A. Lepori, Kendrick Kay, Greta Tuckute · 9d
Are Large Language Models Suitable for Graph Computation? Progress and Prospects
arXiv cs.CL · Yuting Zhang, Yi Han, Kai Wang, Wei Ni, Angela Bonifati, Wenjie Zhang · 9d
An Expanded Synthetic Conversation Dataset for Multi-Turn Smishing Detection
arXiv cs.CL · Carl Lochstampfor, Ayan Roy · 9d
EASE-TTT: Evidence-Aligned Selective Test-Time Training for Long-Context Question Answering
arXiv cs.CL · Xiaopeng Yuan, Zebin Wang, Suwen Wang, Zongxin Yang, Haohan Wang, Yushun Dong · 9d
ThinkBooster: A Unified Framework for Seamless Test-Time Scaling of LLM Reasoning
arXiv cs.CL · Vladislav Smirnov (MBZUAI), Chieu Nguyen (MBZUAI), Sergey Senichev (Independent Researcher), Minh Ngoc Ta (MBZUAI), Ekaterina Fadeeva (ETH Z\"urich), Artem Vazhentsev (MBZUAI), Daria Galimzianova (MBZUAI), Nikolai Rozanov (MBZUAI, Imperial College London), Viktor Mazanov (Innopolis University), Jingwei Ni (ETH Z\"urich), Tianyi Wu (NUS), Igor Kiselev (Accenture), Mrinmaya Sachan (ETH Z\"urich), Iryna Gurevych (MBZUAI), Preslav Nakov (MBZUAI), Timothy Baldwin (MBZUAI), Artem Shelmanov (MBZUAI) · 9d
Didact: A Cross-Domain Capability Discovery System for Defence
arXiv cs.CL · Aarya Bodhankar, Aditya Joshi, Bao Gia Doan, Thomas Marchant, Oscar Leslie, Flora Salim · 9d
Auditing Training Data in Domain-adapted LLMs: LoRA-MINT
arXiv cs.CL · Gonzalo Mancera, Daniel DeAlcala, Aythami Morales, Julian Fierrez, Ruben Tolosana, Francisco Jurado · 9d
OpenHalDet: A Unified Benchmark for Hallucination Detection across Diverse Generation Scenarios
arXiv cs.CL · Xinyi Li, Zhen Fang, Yongxin Deng, Jinyuan Luo, Hongnan Ma, Changdae Oh, Zijing Shi, Shanshan Ye, Hanchen Wang, Shu-Lin Chen, Yadan Luo, Mengyue Yang, Sean Du, Sharon Li, Ling Chen · 9d
Tree-of-Experience: A Structured Experience-Management Solution for Self-Evolving Agents under Low-Repetition and Implicit-Reward Environments
arXiv cs.CL · Zihao Deng, Yining Zhu, Leiming Wang, Jingfei Lu, Junbo Wang, Chuncheng Ran, Yu Yang, Dixuan Yang, Jikun Shen · 9d
Contrastive Training with LLM-generated Near-Misses for Robust Code-Switching Speech Recognition
arXiv cs.CL · Tung X. Nguyen, Hieu Minh Truong, Giang-Son Nguyen, Nhu Vo, Wray Buntine, Dung D. Le · 9d
Principles of Concept Representation in Sentence Encoders
arXiv cs.CL · Isabelle Mohr, John Dujany, Jonathan Souquet, Andre Freitas · 9d
MADE: Beyond Scoring via a Multilingual Agentic Diagnosing Engine for Fine-Grained Evaluation Insights
arXiv cs.CL · Yilun Liu, Miao Zhang, Shimin Tao, Minggui He, Chunguang Zhao, Chenxin Liu, Li Zhang, Chen Liu, Cheng Qian, Liqun Deng, Xiaojun Meng, Daimeng Wei · 9d
Beyond Rubrics: Exploration-Guided Evaluation Skills for Reward Modeling
arXiv cs.CL · Xing Yue, Linjuan Wu, Daoxin Zhang, Yongliang Shen, Weiming Lu · 9d
TRACE: Trajectory Reasoning through Adaptive Cross-Step Evidence Aggregation for LLM Agents
arXiv cs.CL · Vijitha Mittapalli, Shreyaa Jayant Dani, Satya Srujana Pilli, Snigdha Ansu, Mohammadreza Teymoorianfard, Franck Dernoncourt, Hongjie Chen, Yu Wang, Ryan A. Rossi, Nesreen K. Ahmed · 9d
Modeling semantic association in self-paced reading with language model embeddings
arXiv cs.CL · Sara M{\o}ller {\O}stergaard, Kenneth Enevoldsen, Afra Alishahi, Bruno Nicenboim · 9d
mmPISA-bench: Do LLMs Reason Equally Well Across 43 Languages?
arXiv cs.CL · Yerzhan Sapenov, Jaromir Savelka · 9d
SigmaScale: LLM Compression with SVD-based Low-Rank Decomposition and Learned Scaling Matrices
arXiv cs.CL · Ernests Lavrinovics, Marco Letizia, Roy Janco, Shai Segal, Johannes Bjerva, Maurizio Pierini · 9d
Style or Content? Evaluating Style Classifiers with Controlled Content Overlap
arXiv cs.CL · Zhuo Liu, Haozheng Du, Xiangxiang Xu, Hangfeng He · 9d
Learning Perspectivist Social Meaning via Demographic-Conditioned Fusion Embeddings
arXiv cs.CL · Amanda Cercas Curry, Lucio La Cava, Luca Maria Aiello, Gianmarco De Francisci Morales · 9d
Explicit Evidence Grounding via Structured Inline Citation Generation
arXiv cs.CL · Anar Yeginbergen, Amelie W\"uhrl, Anna Rogers, Rodrigo Agerri · 9d
UrduMMLU: A Massive Multitask Benchmark for Urdu Language Understanding
arXiv cs.CL · Ahmer Tabassum, Sarfraz Ahmad, Hasan Iqbal, Owais Aijaz, Momina Ahsan, Preslav Nakov · 9d
Geometry of Semantic Space: Comparative Study of Discrete and Continuous Models
arXiv cs.CL · Gabriel Bounias, Sabine Ploux · 9d
From Correctness to Utility: Gain-Based Prefix Evaluation for LLM Reasoning
arXiv cs.CL · Yuhang Zhou, Yixin Cao, Guangnan Ye · 9d
Adversarial Creation and Detection of AI-Generated Social Bot Content
arXiv cs.CL · Mykola Trokhymovych, Ricardo Baeza-Yates, Alessandro Flammini, Diego Saez-Trumper, Filippo Menczer · 9d