Computation and Language

Authors and titles for recent submissions

  • Thu, 8 Jan 2026
  • Wed, 7 Jan 2026
  • Tue, 6 Jan 2026
  • Mon, 5 Jan 2026
  • Thu, 1 Jan 2026

Total of 483 entries (entries 1-50 shown here)

Thu, 8 Jan 2026 (showing first 50 of 131 entries)

[1] arXiv:2601.04160 [pdf, other]
Title: All That Glisters Is Not Gold: A Benchmark for Reference-Free Counterfactual Financial Misinformation Detection
Yuechen Jiang, Zhiwei Liu, Yupeng Cao, Yueru He, Ziyang Xu, Chen Xu, Zhiyang Deng, Prayag Tiwari, Xi Chen, Alejandro Lopez-Lira, Jimin Huang, Junichi Tsujii, Sophia Ananiadou
Comments: 39 pages; 24 figures
Subjects: Computation and Language (cs.CL); Computational Engineering, Finance, and Science (cs.CE); Computational Finance (q-fin.CP)
[2] arXiv:2601.04157 [pdf, html, other]
Title: FLEx: Language Modeling with Few-shot Language Explanations
Adar Avsian, Christopher Richardson, Anirudh Sundar, Larry Heck
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[3] arXiv:2601.04135 [pdf, html, other]
Title: LLMberjack: Guided Trimming of Debate Trees for Multi-Party Conversation Creation
Leonardo Bottona, Nicolò Penzo, Bruno Lepri, Marco Guerini, Sara Tonelli
Comments: 9 pages, 3 figures
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[4] arXiv:2601.04131 [pdf, html, other]
Title: ContextFocus: Activation Steering for Contextual Faithfulness in Large Language Models
Nikhil Anand, Shwetha Somasundaram, Anirudh Phukan, Apoorv Saxena, Koyel Mukherjee
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[5] arXiv:2601.04126 [pdf, html, other]
Title: InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training
Ziyun Zhang, Zezhou Wang, Xiaoyi Zhang, Zongyu Guo, Jiahao Li, Bin Li, Yan Lu
Comments: Work In Progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[6] arXiv:2601.04098 [pdf, html, other]
Title: Layer-wise Positional Bias in Short-Context Language Modeling
Maryam Rahimi, Mahdi Nouri, Yadollah Yaghoobzadeh
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[7] arXiv:2601.04093 [pdf, html, other]
Title: SearchAttack: Red-Teaming LLMs against Real-World Threats via Framing Unsafe Web Information-Seeking Tasks
Yu Yan, Sheng Sun, Mingfeng Li, Zheming Yang, Chiwei Zhu, Fei Ma, Benfeng Xu, Min Liu
Comments: We find that the key to jailbreak the LLM is objectifying its safety responsibility, thus we delegate the open-web to inject harmful semantics and get the huge gain from unmoderated web resources
Subjects: Computation and Language (cs.CL)
[8] arXiv:2601.04086 [pdf, html, other]
Title: KDCM: Reducing Hallucination in LLMs through Explicit Reasoning Structures
Jinbo Hao, Kai Yang, Qingzhen Su, Yifan Li, Chao Jiang
Subjects: Computation and Language (cs.CL)
[9] arXiv:2601.04056 [pdf, html, other]
Title: Bridging the Discrete-Continuous Gap: Unified Multimodal Generation via Coupled Manifold Discrete Absorbing Diffusion
Yuanfeng Xu, Yuhao Chen, Liang Lin, Guangrun Wang
Comments: 10 pages, 5 figures
Subjects: Computation and Language (cs.CL)
[10] arXiv:2601.04055 [pdf, html, other]
Title: Modular Prompt Optimization: Optimizing Structured Prompts with Section-Local Textual Gradients
Prith Sharma, Austin Z. Henley
Subjects: Computation and Language (cs.CL)
[11] arXiv:2601.04043 [pdf, html, other]
Title: When Helpers Become Hazards: A Benchmark for Analyzing Multimodal LLM-Powered Safety in Daily Life
Xinyue Lou, Jinan Xu, Jingyi Yin, Xiaolong Wang, Zhaolu Kang, Youwei Liao, Yixuan Wang, Xiangyu Shi, Fengran Mo, Su Yao, Kaiyu Huang
Subjects: Computation and Language (cs.CL)
[12] arXiv:2601.04036 [pdf, other]
Title: Analyzing and Improving Cross-lingual Knowledge Transfer for Machine Translation
David Stap
Comments: PhD dissertation defended on November 26th, 2025
Subjects: Computation and Language (cs.CL)
[13] arXiv:2601.04029 [pdf, html, other]
Title: SpeakerSleuth: Evaluating Large Audio-Language Models as Judges for Multi-turn Speaker Consistency
Jonggeun Lee, Junseong Pyo, Gyuhyeon Seo, Yohan Jo
Comments: 28 pages
Subjects: Computation and Language (cs.CL)
[14] arXiv:2601.04025 [pdf, html, other]
Title: Simulated Students in Tutoring Dialogues: Substance or Illusion?
Alexander Scarlatos, Jaewook Lee, Simon Woodhead, Andrew Lan
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[15] arXiv:2601.03997 [pdf, html, other]
Title: VotIE: Information Extraction from Meeting Minutes
José Pedro Evans, Luís Filipe Cunha, Purificação Silvano, Alípio Jorge, Nuno Guimarães, Sérgio Nunes, Ricardo Campos
Subjects: Computation and Language (cs.CL)
[16] arXiv:2601.03986 [pdf, html, other]
Title: Benchmark^2: Systematic Evaluation of LLM Benchmarks
Qi Qian, Chengsong Huang, Jingwen Xu, Changze Lv, Muling Wu, Wenhao Liu, Xiaohua Wang, Zhenghua Wang, Zisu Huang, Muzhao Tian, Jianhan Xu, Kun Hu, He-Da Wang, Yao Hu, Xuanjing Huang, Xiaoqing Zheng
Subjects: Computation and Language (cs.CL)
[17] arXiv:2601.03981 [pdf, html, other]
Title: RADAR: Retrieval-Augmented Detector with Adversarial Refinement for Robust Fake News Detection
Song-Duo Ma, Yi-Hung Liu, Hsin-Yu Lin, Pin-Yu Chen, Hong-Yan Huang, Shau-Yung Hsu, Yun-Nung Chen
Subjects: Computation and Language (cs.CL)
[18] arXiv:2601.03940 [pdf, html, other]
Title: Large-Scale Aspect-Based Sentiment Analysis with Reasoning-Infused LLMs
Paweł Liskowski, Krzysztof Jankowski
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[19] arXiv:2601.03926 [pdf, html, other]
Title: Doc-PP: Document Policy Preservation Benchmark for Large Vision-Language Models
Haeun Jang, Hwan Chang, Hwanhee Lee
Subjects: Computation and Language (cs.CL)
[20] arXiv:2601.03914 [pdf, html, other]
Title: When Models Decide and When They Bind: A Two-Stage Computation for Multiple-Choice Question-Answering
Hugh Mee Wong, Rick Nouwen, Albert Gatt
Comments: Under review
Subjects: Computation and Language (cs.CL)
[21] arXiv:2601.03908 [pdf, html, other]
Title: Decide Then Retrieve: A Training-Free Framework with Uncertainty-Guided Triggering and Dual-Path Retrieval
Wang Chen, Guanqiang Qi, Weikang Li, Yang Li, Deguo Xia, Jizhou Huang
Subjects: Computation and Language (cs.CL)
[22] arXiv:2601.03874 [pdf, html, other]
Title: Evaluating Small Decoder-Only Language Models for Grammar Correction and Text Simplification
Anthony Lamelas
Comments: 9 pages, 12 figures
Subjects: Computation and Language (cs.CL)
[23] arXiv:2601.03872 [pdf, html, other]
Title: Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning
Jinyang Wu, Guocheng Zhai, Ruihan Jin, Jiahao Yuan, Yuhao Shen, Shuai Zhang, Zhengqi Wen, Jianhua Tao
Subjects: Computation and Language (cs.CL)
[24] arXiv:2601.03868 [pdf, html, other]
Title: What Matters For Safety Alignment?
Xing Li, Hui-Ling Zhen, Lihao Yin, Xianzhi Yu, Zhenhua Dong, Mingxuan Yuan
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[25] arXiv:2601.03860 [pdf, html, other]
Title: PartisanLens: A Multilingual Dataset of Hyperpartisan and Conspiratorial Immigration Narratives in European Media
Michele Joshua Maggini, Paloma Piot, Anxo Pérez, Erik Bran Marino, Lúa Santamaría Montesinos, Ana Lisboa, Marta Vázquez Abuín, Javier Parapar, Pablo Gamallo
Subjects: Computation and Language (cs.CL)
[26] arXiv:2601.03858 [pdf, html, other]
Title: What Does Loss Optimization Actually Teach, If Anything? Knowledge Dynamics in Continual Pre-training of LLMs
Seyed Mahed Mousavi, Simone Alghisi, Giuseppe Riccardi
Subjects: Computation and Language (cs.CL)
[27] arXiv:2601.03851 [pdf, html, other]
Title: Rethinking Table Pruning in TableQA: From Sequential Revisions to Gold Trajectory-Supervised Parallel Search
Yu Guo, Shenghao Ye, Shuangwu Chen, Zijian Wen, Tao Zhang, Qirui Bai, Dong Jin, Yunpeng Hou, Huasen He, Jian Yang, Xiaobin Tan
Comments: 16 pages, 5 figures
Subjects: Computation and Language (cs.CL)
[28] arXiv:2601.03823 [pdf, html, other]
Title: Step Potential Advantage Estimation: Harnessing Intermediate Confidence and Correctness for Efficient Mathematical Reasoning
Fei Wu, Zhenrong Zhang, Qikai Chang, Jianshu Zhang, Quan Liu, Jun Du
Subjects: Computation and Language (cs.CL)
[29] arXiv:2601.03812 [pdf, html, other]
Title: AI Generated Text Detection
Adilkhan Alikhanov, Aidar Amangeldi, Diar Demeubay, Dilnaz Akhmetzhan, Nurbek Moldakhmetov, Omar Polat, Galymzhan Zharas
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[30] arXiv:2601.03798 [pdf, html, other]
Title: Where meaning lives: Layer-wise accessibility of psycholinguistic features in encoder and decoder language models
Taisiia Tikhomirova, Dirk U. Wulff
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[31] arXiv:2601.03792 [pdf, html, other]
Title: VietMed-MCQ: A Consistency-Filtered Data Synthesis Framework for Vietnamese Traditional Medicine Evaluation
Huynh Trung Kiet, Dao Sy Duy Minh, Nguyen Dinh Ha Duong, Le Hoang Minh Huy, Long Nguyen, Dien Dinh
Comments: 11 pages, 4 figures. Dataset and code released
Subjects: Computation and Language (cs.CL)
[32] arXiv:2601.03791 [pdf, html, other]
Title: Do LLMs Really Memorize Personally Identifiable Information? Revisiting PII Leakage with a Cue-Controlled Memorization Framework
Xiaoyu Luo, Yiyi Chen, Qiongxiu Li, Johannes Bjerva
Comments: 20 pages, 13 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[33] arXiv:2601.03790 [pdf, html, other]
Title: NeoAMT: Neologism-Aware Agentic Machine Translation with Reinforcement Learning
Zhongtao Miao, Kaiyan Zhao, Masaaki Nagata, Yoshimasa Tsuruoka
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[34] arXiv:2601.03786 [pdf, html, other]
Title: Compact Example-Based Explanations for Language Models
Loris Schoenegger, Benjamin Roth
Comments: 8 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[35] arXiv:2601.03785 [pdf, html, other]
Title: Membox: Weaving Topic Continuity into Long-Range Memory for LLM Agents
Dehao Tao, Guoliang Ma, Yongfeng Huang, Minghu Jiang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[36] arXiv:2601.03783 [pdf, html, other]
Title: HearSay Benchmark: Do Audio LLMs Leak What They Hear?
Jin Wang, Liang Lin, Kaiwen Luo, Weiliu Wang, Yitian Chen, Moayad Aloqaily, Xuehai Tang, Zhenhong Zhou, Kun Wang, Li Sun, Qingsong Wen
Subjects: Computation and Language (cs.CL)
[37] arXiv:2601.03779 [pdf, html, other]
Title: Tracing the complexity profiles of different linguistic phenomena through the intrinsic dimension of LLM representations
Marco Baroni, Emily Cheng, Iria deDios-Flores, Francesca Franzon
Subjects: Computation and Language (cs.CL)
[38] arXiv:2601.03775 [pdf, html, other]
Title: Do LLM Self-Explanations Help Users Predict Model Behavior? Evaluating Counterfactual Simulatability with Pragmatic Perturbations
Pingjun Hong, Benjamin Roth
Subjects: Computation and Language (cs.CL)
[39] arXiv:2601.03752 [pdf, html, other]
Title: Evaluation of Multilingual LLMs Personalized Text Generation Capabilities Targeting Groups and Social-Media Platforms
Dominik Macko
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[40] arXiv:2601.03746 [pdf, html, other]
Title: Whose Facts Win? LLM Source Preferences under Knowledge Conflicts
Jakob Schuster, Vagrant Gautam, Katja Markert
Comments: Data and code: this https URL
Subjects: Computation and Language (cs.CL)
[41] arXiv:2601.03743 [pdf, html, other]
Title: O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL
Yi Yao, He Zhu, Piaohong Wang, Jincheng Ren, Xinlong Yang, Qianben Chen, Xiaowan Li, Dingfeng Shi, Jiaxian Li, Qiexiang Wang, Sinuo Wang, Xinpeng Liu, Jiaqi Wu, Minghao Liu, Wangchunshu Zhou
Comments: 22 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[42] arXiv:2601.03727 [pdf, html, other]
Title: Stuttering-Aware Automatic Speech Recognition for Indonesian Language
Fadhil Muhammad, Alwin Djuliansah, Adrian Aryaputra Hamzah, Kurniawati Azizah
Comments: Preprint
Subjects: Computation and Language (cs.CL)
[43] arXiv:2601.03717 [pdf, html, other]
Title: MIND: From Passive Mimicry to Active Reasoning through Capability-Aware Multi-Perspective CoT Distillation
Jin Cui, Jiaqi Guo, Jiepeng Zhou, Ruixuan Yang, Jiayi Lu, Jiajun Xu, Jiangcheng Song, Boran Zhao, Pengju Ren
Comments: 13 pages, 8 figures
Subjects: Computation and Language (cs.CL)
[44] arXiv:2601.03714 [pdf, html, other]
Title: Visual Merit or Linguistic Crutch? A Close Look at DeepSeek-OCR
Yunhao Liang, Ruixuan Ying, Bo Li, Hong Li, Kai Yan, Qingwen Li, Min Yang, Okamoto Satoshi, Zhe Cui, Shiwen Ni
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[45] arXiv:2601.03707 [pdf, other]
Title: AirNav: A Large-Scale Real-World UAV Vision-and-Language Navigation Dataset with Natural and Diverse Instructions
Hengxing Cai, Yijie Rao, Ligang Huang, Zanyang Zhong, Jinhan Dong, Jingjun Tan, Wenhao Lu, Renxin Zhong
Subjects: Computation and Language (cs.CL)
[46] arXiv:2601.03700 [pdf, html, other]
Title: ADEPT: Adaptive Dynamic Early-Exit Process for Transformers
Sangmin Yoo, Srikanth Malla, Chiho Choi, Wei D. Lu, Joon Hee Choi
Comments: 11 figures, 8 tables, 22 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[47] arXiv:2601.03699 [pdf, html, other]
Title: RedBench: A Universal Dataset for Comprehensive Red Teaming of Large Language Models
Quy-Anh Dang, Chris Ngo, Truong-Son Hy
Subjects: Computation and Language (cs.CL)
[48] arXiv:2601.03698 [pdf, html, other]
Title: Evaluation Framework for AI Creativity: A Case Study Based on Story Generation
Pharath Sathya, Yin Jou Huang, Fei Cheng
Comments: Work in progress
Subjects: Computation and Language (cs.CL)
[49] arXiv:2601.03682 [pdf, html, other]
Title: From Implicit to Explicit: Token-Efficient Logical Supervision for Mathematical Reasoning in LLMs
Shaojie Wang, Liang Zhang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[50] arXiv:2601.03676 [pdf, html, other]
Title: Towards Compositional Generalization of LLMs via Skill Taxonomy Guided Data Synthesis
Yifan Wei, Li Du, Xiaoyan Yu, Yang Feng, Angsheng Li
Comments: The code and data for our methods and experiments are available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)