arXiv cs.CL papers June 2 2026
On June 2, 2026, arXiv's cs.CL section published 20 papers spanning dialogue parsing, LLM robustness, AI-generated text detection, Chinese grammar correction, speculative decoding, humor generation, language diffusion models, medical LLM safety, knowledge-grounded generation, financial sentiment analysis, self-supervised learning, LLM evaluation metrics, legal document processing, AI disclosure, fake news detection, knowledge base question answering, and LLM effects on student writing.
- DraDDP: the first multimodal multi-party dialogue discourse parsing dataset, with 495 segments, 6,374 utterances, and 9.1 hours of video from TV dramas
- CSRP: framework for Chinese grammatical error correction that uses continual pre-training on 5.9M samples plus reinforcement learning with efficiency-aware rewards
- TrustLDM: benchmark evaluating safety, privacy, and fairness across language diffusion model architectures
- RealityTest: large-scale multimodal multilingual benchmark testing whether AI systems disclose their identity when asked
- BOUTEF: multilingual corpus for fake news in North Africa, covering Algeria and Tunisia, with fake narratives, genuine narratives, and debunking information
DraDDP: A Multimodal Multi-Party Dialogue Discourse Parsing Dataset
Toward Robust In-Context Learning: Leveraging Out-of-distribution Proxies for Target Inaccessible Demonstration Retrieval
AEyeDE: An Attention-Based Attribution Framework for AI-Generated Text Detection
CSRP: Chain-of-Thought Reasoning for Chinese Text Correction via Reinforcement Learning with Efficiency-Aware Rewards
SENSE: Semantic Embedding Navigation with Soft-gated Evaluation for Retrieval-based Speculative Decoding
lmfaoooo at SemEval-2026 Task 1: Humor Is an Audience. Preference Modeling for Constrained Humor Generation
TrustLDM: Benchmarking Trustworthiness in Language Diffusion Models
ART: Attention Run-time Termination for Efficient Large Language Model Decoding
Cognitive-Linguistic Indicators of Depression in Online Communities: Analysed by DistilBERT and Holographic Reduced Representation
A Multi-Domain Red Teaming Framework for Safety, Robustness, and Fairness Evaluation of Medical Large Language Models
TCAR-Gen: Temporal Graph Retrieval with Evidence Fusion for Knowledge-Grounded Generation
LLMs for Cardiovascular Risk Prediction from Structured Clinical Data
Graph-Augmented Retrieval for Cross-Entity Financial Sentiment Analysis: A Comparative Study
DLLM-JEPA: Joint Embedding Predictive Architectures for Masked Diffusion Language Models
Agreement Metrics for LLM-as-Judge Evaluation: What to Report and Why
Enhancing BiGRU with a KAN Block for Legal Document Classification and Summarization
RealityTest: How People Probe AI Identity and Whether Models Disclose It
BOUTEF: A Multilingual Corpus for FakeNews in North Africa -- Language as a Weapon
DeSQ: Decomposition-based SPARQL Query Generation
Effects of Varying LLM Access on Essay Writing Behavior
Parameter Alignment Mitigates Catastrophic Forgetting in Multilingual Expert Language Models
Model-Based Quality Assessment for Massively Multilingual Parallel Data
Uncovering Temporal Framing in the News
Bridging Reasoning Trajectories in On-Policy Distillation via Near-Future Guidance
Which Institutional Frameworks Do Chatbots Assume? Auditing Jurisdictional Defaults in Multilingual LLMs
Isolating LLM Lexical Bias: A Curation-Free Triangulated Metric for Preference-Stage Learning
How Far Do Auto-Interpretation Labels Generalize: A Controlled Study Across Languages, Scripts, and Rewordings
Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism
ProtStructQA: A Denotation Threshold in Protein Structural Reasoning
SALSA: Speech Aware LLM Adaptation via Learned Steering Activation Vectors
Short-form Text Rewriting with Phi Silica
On the Limits of LLM Adaptability: Impact of Model-Internalized Priors on Annotation Task Performance
Do Text Edits Generalize to Visual Generation? Benchmarking Cross-Modal Knowledge Editing in UMMs
LaSR: Context-Aware Speech Recognition via Latent Reasoning
Skill or Skip? Learning Selective Skill Invocation in Agentic Tasks via Dual-Granularity Preference Learning
ProactiveLLM: Learning Active Interaction for Streaming Large Language Models
Learning to Retrieve: Dual-Level Long-Term Memory for Text-to-SQL Agents
Revisiting Parameter-Based Knowledge Editing in Large Language Models: Theoretical Limits and Empirical Evidence
Sandboxed Coding Agents are Competitive Omni-modal Task Solvers
SPADER: Step-wise Peer Advantage with Diversity-Aware Exploration Rewards for Multi-Answer Question Answering
Toward Responsible and Epistemically Grounded Multilingual LLMs for Computational Social Science and Humanities
Linguistics-Aware Non-Distortionary LLM Watermarking
MemPro: Agentic Memory Systems as Evolvable Programs
Robust Reasoning via Dynamic Token Selection for Distribution-Aligned Self-Distillation
French parsing enhanced with a word clustering method based on a syntactic lexicon
LinguIUTics at PsyDefDetect: Iterative Imbalance-Aware Fine-tuning of Qwen3-8B for Psychological Defense Mechanism Classification
FineVerify: Scaling Test-Time Compute with Fine-Grained Self-Verification for Agentic Search
OCC-RAG: Optimal Cognitive Core for Faithful Question Answering
EPIC: Efficient and Parallel Inference under CFG Constraints for Diffusion Language Models
WaveFilter: Enhancing the Long-Context Capability of Diffusion LLMs via Wavelet-Guided KV Cache Filtering