←── back to feed
/topics/arxiv-cs-cl-papers-june-5-2026
arXiv cs.CL papers June 5 2026
50 items●1 sources●updated 11d ago●trend 0
On June 5, 2026, arXiv's cs.CL section published 20 papers spanning model collapse dynamics, self-supervised learning objectives, medical LLM fine-tuning, multimodal safety benchmarks, streaming ASR punctuation, interpretability frameworks, long-context memory, sycophancy audits, reasoning distillation, persuasion tracing, personalization, language processing dynamics, reasoning trace analysis, early failure detection, schema discovery, machine translation complexity, and multilingual coreference resolution.
- Bilayer SIR/SIRS framework models cross-contamination of synthetic data across AI ecosystem populations (arXiv:2606.05168)
- JEPA-inspired hybrid pre-training combines latent-space prediction with masked language modeling for deeper semantic representations (arXiv:2606.05173)
- GRPO post-training with variance-aware rubric rewards applied to Qwen2.5-3B for heart-focused medical QA (arXiv:2606.05174)
- MCBench introduces 1,196 multimodal safety scenarios across vision, audio, and text for Omni LLMs (arXiv:2606.05177)
- LANTERN memory layer recovers 78.3% of ground-truth facts in compacted conversations with <25ms latency per turn (arXiv:2606.05182)
- Sycophancy audit across Gemini 2.0, 2.5, 3.0 variants reveals social-compliance behaviors beyond binary false outputs (arXiv:2606.05183)
[BLG]blog/rss50
Epidemiology of Model Collapse: Modeling Synthetic Data Contamination via Bilayer SIR Dynamics
Predict and Reconstruct: Joint Objectives for Self-Supervised Language Representation Learning
Improving Heart-Focused Medical Question Answering in LLMs via Variance-Aware Rubric Rewards with GRPO
Generic Triple-Latent Compression with Gated Associative Retrieval
PEFT of SLM for Telecommunications Customer Support: A Comparative Study of LoRA Configurations with Energy Consumption Analysis
MCBench: A Multicontext Safety Assessment Benchmark for Omni Large Language Models
Efficient Punctuation Restoration via Weighted Lookahead Scoring Method for Streaming ASR Systems
From Scoring to Explanations: Evaluating SHAP and LLM Rationales for Rubric-based Teaching Quality Assessment
Multi-Granularity Reasoning for Natural Language Inference
LANTERN: Layered Archival and Temporal Episodic Retrieval Network for Long-Context LLM Conversations
The Granularity Gap: A Multi-Dimensional Longitudinal Audit of Sycophancy in Gemini Models
LoRi: Low-Rank Distillation for Implicit Reasoning
A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing
Self-supervised User Profile Generation for Personalization
Trajectory Dynamics in Language Model Hidden States Predict Human Processing Costs Beyond Surprisal
ReasoningFlow: Discourse Structures for Understanding LLM Reasoning Traces
When Evidence is Sparse: Weakly Supervised Early Failure Alerting in Dialogs and LLM-Agent Trajectories
Executable Schema Contracts: From Automatic Ingestion to Multi-Source Retrieval
ComplexityMT: Benchmarking the Interaction Between Text Complexity and Machine Translation
Multilingual Coreference Resolution via Cycle-Consistent Machine Translation
Localizing Prompt Ambiguity in Large Language Models with Probe-Targeted Attribution
MASF: A Multi-Model Adaptive Selection Framework for Abstractive Text summarization
CHASE: Adversarial Red-Blue Teaming for Improving LLM Safety using Reinforcement Learning
Multilingual Detection of Alzheimer's Disease from Speech: A Cross-Linguistic Transfer Learning Approach
ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?
AURA: Intent-Directed Probing for Implicit-Need Surfacing in Situated LLM Agents
InfoShield: Privacy-Preserving Speech Representations for Mental Health Screening via Information-Theoretic Optimization
Using Large Language Models to Support High Volume Application Review for an Undergraduate Research Program
Domain-Aware Mispronunciation Detection and Diagnosis Using Language-Specific Statistical Graphs
TensorBench: Benchmarking Coding Agents on a Compiler-Based Tensor Framework
Predictable Scaling Laws of Optimal Hyperparameters for LLM Continued Pre-training
What's in a Name? Morphological Shortcuts by LLMs in Pharmacology
An ERP Study on Recursive Locative Processing in Mandarin-Speaking Children with Autism
AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints
When New Generators Arrive: Lifelong Machine-Generated Text Attribution via Ridge Feature Transfer
Bootstrapping Semantic Layer from Execution for Text-to-SQL
QueryAgent-R1: Bridging Query Generation and Product Retrieval for E-Commerce Query Recommendation
Value-and-Structure Alignment for Routing-Consistent Quantization of Mixture-of-Experts Models
Rethinking LoRA Memory Through the Lens of KV Cache Compression
Beyond tokens: a unified framework for latent communication in LLM-based multi-agent systems
Interpreting Style Representations via Style-Eliciting Prompts
Narrative Knowledge Weaver: Narrative-Centric Retrieval-Augmented Reasoning for Long-Form Text Understanding
AdaPLD: Adaptive Retrieval and Reuse for Efficient Model-Free Speculative Decoding
PlanBench-V: A Spatial Planning Map Benchmark for Vision-Language Models
MARDoc: A Memory-Aware Refinement Agent Framework for Multimodal Long Document QA
CollabBench: Benchmarking and Unleashing Collaborative Ability of LLMs with Diverse Players via Proactive Engagement
Can LLMs Be Constrained to the Past? Improving Knowledge Cutoff through Recall-Based Prompting
ProSPy: A Profiling-Driven SQL-Python Agentic Framework for Enterprise Text-to-SQL
Mechanistic Insights into Functional Sparsity in Multimodal LLMs via CoRe Heads
Towards Truly Multilingual ASR: Generalizing Code-Switching ASR to Unseen Language Pairs