Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CL

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computation and Language

Authors and titles for recent submissions

  • Fri, 11 Jul 2025
  • Thu, 10 Jul 2025
  • Wed, 9 Jul 2025
  • Tue, 8 Jul 2025
  • Fri, 4 Jul 2025

See today's new changes

Total of 462 entries : 1-50 51-100 101-150 151-200 ... 451-462
Showing up to 50 entries per page: fewer | more | all

Fri, 11 Jul 2025 (showing first 50 of 66 entries )

[1] arXiv:2507.07998 [pdf, other]
Title: PyVision: Agentic Vision with Dynamic Tooling
Shitian Zhao, Haoquan Zhang, Shaoheng Lin, Ming Li, Qilong Wu, Kaipeng Zhang, Chen Wei
Comments: 26 Pages, 10 Figures, Technical report
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2] arXiv:2507.07988 [pdf, other]
Title: Automating Expert-Level Medical Reasoning Evaluation of Large Language Models
Shuang Zhou, Wenya Xie, Jiaxi Li, Zaifu Zhan, Meijia Song, Han Yang, Cheyenna Espinoza, Lindsay Welton, Xinnie Mai, Yanwei Jin, Zidu Xu, Yuen-Hei Chung, Yiyun Xing, Meng-Han Tsai, Emma Schaffer, Yucheng Shi, Ninghao Liu, Zirui Liu, Rui Zhang
Comments: 22 pages,6 figures
Subjects: Computation and Language (cs.CL)
[3] arXiv:2507.07983 [pdf, other]
Title: Performance and Practical Considerations of Large and Small Language Models in Clinical Decision Support in Rheumatology
Sabine Felde, Rüdiger Buchkremer, Gamal Chehab, Christian Thielscher, Jörg HW Distler, Matthias Schneider, Jutta G. Richter
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[4] arXiv:2507.07981 [pdf, html, other]
Title: Why is Your Language Model a Poor Implicit Reward Model?
Noam Razin, Yong Lin, Jiarui Yao, Sanjeev Arora
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[5] arXiv:2507.07957 [pdf, html, other]
Title: MIRIX: Multi-Agent Memory System for LLM-Based Agents
Yu Wang, Xi Chen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[6] arXiv:2507.07939 [pdf, html, other]
Title: SAGE: A Visual Language Model for Anomaly Detection via Fact Enhancement and Entropy-aware Alignment
Guoxin Zang, Xue Li, Donglin Di, Lanshun Nie, Dechen Zhan, Yang Song, Lei Fan
Comments: Accepted by ACMMM2025
Subjects: Computation and Language (cs.CL)
[7] arXiv:2507.07910 [pdf, html, other]
Title: DTECT: Dynamic Topic Explorer & Context Tracker
Suman Adhya, Debarshi Kumar Sanyal
Comments: Code: this http URL | Demo: this http URL | Video: this http URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[8] arXiv:2507.07887 [pdf, html, other]
Title: Automating MD simulations for Proteins using Large language Models: NAMD-Agent
Achuth Chandrasekhar, Amir Barati Farimani
Comments: 34 pages
Subjects: Computation and Language (cs.CL)
[9] arXiv:2507.07870 [pdf, html, other]
Title: DocCHA: Towards LLM-Augmented Interactive Online diagnosis System
Xinyi Liu, Dachun Sun, Yi R. Fung, Dilek Hakkani-Tür, Tarek Abdelzaher
Subjects: Computation and Language (cs.CL)
[10] arXiv:2507.07868 [pdf, html, other]
Title: Alpay Algebra V: Multi-Layered Semantic Games and Transfinite Fixed-Point Simulation
Bugra Kilictas, Faruk Alpay
Comments: 18 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[11] arXiv:2507.07847 [pdf, html, other]
Title: From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems
Youngjoon Jang, Seongtae Hong, Junyoung Son, Sungjin Park, Chanjun Park, Heuiseok Lim
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[12] arXiv:2507.07824 [pdf, html, other]
Title: Conditional Unigram Tokenization with Parallel Data
Gianluca Vico, Jindřinch Libovický
Comments: 21 pages, 4 figures, submitted to Tokenization Workshop (TokShop) at ICML 2025
Subjects: Computation and Language (cs.CL)
[13] arXiv:2507.07817 [pdf, other]
Title: On the Effect of Instruction Tuning Loss on Generalization
Anwoy Chatterjee, H S V N S Kowndinya Renduchintala, Sumit Bhatia, Tanmoy Chakraborty
Comments: Transactions of the Association for Computational Linguistics (TACL)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[14] arXiv:2507.07810 [pdf, html, other]
Title: Understanding and Controlling Repetition Neurons and Induction Heads in In-Context Learning
Nhi Hoai Doan, Tatsuya Hiraoka, Kentaro Inui
Subjects: Computation and Language (cs.CL)
[15] arXiv:2507.07808 [pdf, html, other]
Title: Bridging Logic and Learning: Decoding Temporal Logic Embeddings via Transformers
Sara Candussio, Gaia Saveri, Gabriele Sarti, Luca Bortolussi
Comments: 16 pages, 3 figures, to be published in ECML-PKDD
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[16] arXiv:2507.07803 [pdf, html, other]
Title: StreamUni: Achieving Streaming Speech Translation with a Unified Large Speech-Language Model
Shoutao Guo, Xiang Li, Shaolei Zhang, Mengge Liu, Wei Chen, Yang Feng
Comments: The code is at this http URL The model is at this http URL
Subjects: Computation and Language (cs.CL)
[17] arXiv:2507.07748 [pdf, html, other]
Title: When Large Language Models Meet Law: Dual-Lens Taxonomy, Technical Advances, and Ethical Governance
Peizhang Shao, Linrui Xu, Jinxi Wang, Wei Zhou, Xingyu Wu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[18] arXiv:2507.07741 [pdf, html, other]
Title: Code-Switching in End-to-End Automatic Speech Recognition: A Systematic Literature Review
Maha Tufail Agro, Atharva Kulkarni, Karima Kadaoui, Zeerak Talat, Hanan Aldarmaki
Subjects: Computation and Language (cs.CL)
[19] arXiv:2507.07725 [pdf, html, other]
Title: Not All Preferences are What You Need for Post-Training: Selective Alignment Strategy for Preference Optimization
Zhijin Dong
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[20] arXiv:2507.07700 [pdf, html, other]
Title: Rethinking the Privacy of Text Embeddings: A Reproducibility Study of "Text Embeddings Reveal (Almost) As Much As Text"
Dominykas Seputis, Yongkang Li, Karsten Langerak, Serghei Mihailov
Comments: This paper has been accepted for oral presentation in the reproducibility track at RecSys 2025
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[21] arXiv:2507.07695 [pdf, html, other]
Title: KeyKnowledgeRAG (K^2RAG): An Enhanced RAG method for improved LLM question-answering capabilities
Hruday Markondapatnaikuni, Basem Suleiman, Abdelkarim Erradi, Shijing Chen
Comments: 21 pages, 14 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[22] arXiv:2507.07694 [pdf, html, other]
Title: SAS: Simulated Attention Score
Chuanyang Zheng, Jiankai Sun, Yihang Gao, Yuehao Wang, Peihao Wang, Jing Xiong, Liliang Ren, Hao Cheng, Janardhan Kulkarni, Yelong Shen, Atlas Wang, Mac Schwager, Anderson Schneider, Xiaodong Liu, Jianfeng Gao
Comments: Tech Report
Subjects: Computation and Language (cs.CL)
[23] arXiv:2507.07653 [pdf, html, other]
Title: An Automated Length-Aware Quality Metric for Summarization
Andrew D. Foland
Subjects: Computation and Language (cs.CL)
[24] arXiv:2507.07640 [pdf, html, other]
Title: Lost in Pronunciation: Detecting Chinese Offensive Language Disguised by Phonetic Cloaking Replacement
Haotan Guo, Jianfei He, Jiayuan Ma, Hongbin Na, Zimu Wang, Haiyang Zhang, Qi Chen, Wei Wang, Zijing Shi, Tao Shen, Ling Chen
Comments: In progress
Subjects: Computation and Language (cs.CL)
[25] arXiv:2507.07634 [pdf, html, other]
Title: FrugalRAG: Learning to retrieve and reason for multi-hop QA
Abhinav Java, Srivathsan Koundinyan, Nagarajan Natarajan, Amit Sharma
Comments: Accepted at ICML Workshop: Efficient Systems for Foundation Models
Subjects: Computation and Language (cs.CL)
[26] arXiv:2507.07630 [pdf, html, other]
Title: Exploring the Limits of Model Compression in LLMs: A Knowledge Distillation Study on QA Tasks
Joyeeta Datta, Niclas Doll, Qusai Ramadan, Zeyd Boukhers
Comments: Accepted four publication at the 26th Meeting of the Special Interest on Discourse and Dialogue
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[27] arXiv:2507.07586 [pdf, html, other]
Title: Bayesian Discrete Diffusion Beats Autoregressive Perplexity
Cooper Doyle
Comments: 12 pages, 2 figures, 2 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[28] arXiv:2507.07572 [pdf, other]
Title: Single-to-mix Modality Alignment with Multimodal Large Language Model for Document Image Machine Translation
Yupu Liang, Yaping Zhang, Zhiyang Zhang, Yang Zhao, Lu Xiang, Chengqing Zong, Yu Zhou
Comments: Accepted by ACL 2025 Main
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[29] arXiv:2507.07562 [pdf, html, other]
Title: The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Jierun Chen, Tiezheng Yu, Haoli Bai, Lewei Yao, Jiannan Wu, Kaican Li, Fei Mi, Chaofan Tao, Lei Zhu, Manyi Zhang, Xiaohui Li, Lu Hou, Lifeng Shang, Qun Liu
Subjects: Computation and Language (cs.CL)
[30] arXiv:2507.07543 [pdf, html, other]
Title: The Cross-Lingual Cost: Retrieval Biases in RAG over Arabic-English Corpora
Chen Amiraz, Yaroslav Fyodorov, Elad Haramaty, Zohar Karnin, Liane Lewin-Eytan
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[31] arXiv:2507.07539 [pdf, html, other]
Title: CEA-LIST at CheckThat! 2025: Evaluating LLMs as Detectors of Bias and Opinion in Text
Akram Elbouanani, Evan Dufraisse, Aboubacar Tuo, Adrian Popescu
Comments: Notebook for the CheckThat! Lab at CLEF 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[32] arXiv:2507.07518 [pdf, html, other]
Title: Triadic Multi-party Voice Activity Projection for Turn-taking in Spoken Dialogue Systems
Mikey Elmers, Koji Inoue, Divesh Lala, Tatsuya Kawahara
Comments: Accepted to Interspeech 2025
Subjects: Computation and Language (cs.CL)
[33] arXiv:2507.07509 [pdf, html, other]
Title: Toward Real-World Chinese Psychological Support Dialogues: CPsDD Dataset and a Co-Evolving Multi-Agent System
Yuanchen Shi, Longyin Zhang, Fang Kong
Comments: 10pages,8 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[34] arXiv:2507.07505 [pdf, other]
Title: Hallucination Stations: On Some Basic Limitations of Transformer-Based Language Models
Varin Sikka, Vishal Sikka
Comments: 6 pages; to be submitted to AAAI-26 after reviews
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[35] arXiv:2507.07499 [pdf, html, other]
Title: Extracting ORR Catalyst Information for Fuel Cell from Scientific Literature
Hein Htet, Amgad Ahmed Ali Ibrahim, Yutaka Sasaki, Ryoji Asahi
Comments: 28 pages, 12 figures, 6 tables
Subjects: Computation and Language (cs.CL); Data Analysis, Statistics and Probability (physics.data-an)
[36] arXiv:2507.07498 [pdf, other]
Title: Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code
Keqin Bao, Nuo Chen, Xiaoyuan Li, Binyuan Hui, Bowen Yu, Fuli Feng, Junyang Lin, Xiangnan He, Dayiheng Liu
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[37] arXiv:2507.07495 [pdf, other]
Title: PLAN-TUNING: Post-Training Language Models to Learn Step-by-Step Planning for Complex Problem Solving
Mihir Parmar, Palash Goyal, Xin Liu, Yiwen Song, Mingyang Ling, Chitta Baral, Hamid Palangi, Tomas Pfister
Comments: 15 Pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[38] arXiv:2507.07484 [pdf, other]
Title: Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models
Kaiqu Liang, Haimin Hu, Xuandong Zhao, Dawn Song, Thomas L. Griffiths, Jaime Fernández Fisac
Comments: Project page, code & data: this http URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[39] arXiv:2507.07451 [pdf, html, other]
Title: RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning
Hongzhi Zhang, Jia Fu, Jingyuan Zhang, Kai Fu, Qi Wang, Fuzheng Zhang, Guorui Zhou
Comments: this http URL
Subjects: Computation and Language (cs.CL)
[40] arXiv:2507.07441 [pdf, other]
Title: SAND: Boosting LLM Agents with Self-Taught Action Deliberation
Yu Xia, Yiran Jenny Shen, Junda Wu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Lina Yao, Julian McAuley
Subjects: Computation and Language (cs.CL)
[41] arXiv:2507.07439 [pdf, html, other]
Title: Towards Interpretable Time Series Foundation Models
Matthieu Boileau, Philippe Helluy, Jeremy Pawlus, Svitlana Vyetrenko
Comments: International Conference on Machine Leaning (ICML) 2025 Workshop on Foundation Models for Structured Data
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[42] arXiv:2507.07421 [pdf, html, other]
Title: SynthEHR-Eviction: Enhancing Eviction SDoH Detection with LLM-Augmented Synthetic EHR Data
Zonghai Yao, Youxia Zhao, Avijit Mitra, David A. Levy, Emily Druhl, Jack Tsai, Hong Yu
Comments: Equal contribution for the first two authors
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[43] arXiv:2507.07419 [pdf, html, other]
Title: MedReadCtrl: Personalizing medical text generation with readability-controlled instruction learning
Hieu Tran, Zonghai Yao, Won Seok Jang, Sharmin Sultana, Allen Chang, Yuan Zhang, Hong Yu
Comments: Equal contribution for the first two authors. arXiv admin note: text overlap with arXiv:2406.09205
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[44] arXiv:2507.07414 [pdf, html, other]
Title: GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation
Fardin Rastakhiz
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[45] arXiv:2507.07307 [pdf, html, other]
Title: Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation
Anirban Saha Anik, Xiaoying Song, Elliott Wang, Bryan Wang, Bengisu Yarimbas, Lingzi Hong
Subjects: Computation and Language (cs.CL)
[46] arXiv:2507.07280 [pdf, html, other]
Title: The Impact of Background Speech on Interruption Detection in Collaborative Groups
Mariah Bradford, Nikhil Krishnaswamy, Nathaniel Blanchard
Comments: Long Paper AIED 2025
Subjects: Computation and Language (cs.CL)
[47] arXiv:2507.07248 [pdf, other]
Title: Medical Red Teaming Protocol of Language Models: On the Importance of User Perspectives in Healthcare Settings
Minseon Kim, Jean-Philippe Corbeil, Alessandro Sordoni, Francois Beaulieu, Paul Vozila
Subjects: Computation and Language (cs.CL)
[48] arXiv:2507.07229 [pdf, html, other]
Title: SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains
Krithika Ramesh, Daniel Smolyak, Zihao Zhao, Nupoor Gandhi, Ritu Agarwal, Margrét Bjarnadóttir, Anjalie Field
Subjects: Computation and Language (cs.CL)
[49] arXiv:2507.07188 [pdf, html, other]
Title: Prompt Perturbations Reveal Human-Like Biases in LLM Survey Responses
Jens Rupprecht (1), Georg Ahnert (1), Markus Strohmaier (1 and 2 and 3) ((1) University of Mannheim, (2) GESIS - Leibniz Institute for the Social Sciences, (3) Complexity Science Hub)
Comments: 18 pages, 17 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[50] arXiv:2507.07186 [pdf, html, other]
Title: Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs
Itay Itzhak, Yonatan Belinkov, Gabriel Stanovsky
Comments: CoLM 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Total of 462 entries : 1-50 51-100 101-150 151-200 ... 451-462
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack