  • MERLIN: A Testbed for Multilingual Multimodal Entity Recognition and Linking
    Sathyanarayanan Ramamoorthy, Vishwa Shah, Simran Khanuja, Zaid Sheikh, Shan Jie, Ann Chia, Shearman Chua, Graham Neubig

  • What Can String Probability Tell Us About Grammaticality?
    Jennifer Hu, Ethan Wilcox, Siyuan Song, Kyle Mahowald, Roger Levy

  • Benchmarking Linguistic Diversity of Large Language Models
    Yanzhu Guo, Guokan Shang, Chloé Clavel

  • Simulating Hard Attention Using Soft Attention
    Andy Yang, Lena Strobl, David Chiang, Dana Angluin

  • Objectifying the Subjective: Cognitive Biases in Topic Interpretations
    Swapnil Hingmire, Ze Shi Li, Shiyu (Vivienne) Zeng, Ahmed Musa Awon, Luiz Franciscatto Guerra, Neil Ernst

  • A Systematic Review of NLP for Dementia: Tasks, Datasets and Opportunities
    Roi Reichart, Lotem Peled-Cohen

  • Cross-layer Attention Sharing for Pre-trained Large Language Models
    Yongyu Mu, Yuzhang Wu, Yuchun Fan, Chenglong Wang, Hengyu Li, Jiali Zeng, Qiaozhi He, Murun Yang, Fandong Meng, Jie Zhou, Tong Xiao, Jingbo Zhu

  • Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts
    Chenghua Lin, Hanhua Hong, Chenghao Xiao, Yang Wang, Yiqi Liu, Wenge Rong

  • Generative Induction of Dialogue Task Schemas with Streaming Refinement and Simulated Interactions
    James Finch, Yasasvi Josyula, Jinho Choi

  • On the Effect of Instruction Tuning Loss on Generalization
    Anwoy Chatterjee, H S V N S Kowndinya Renduchintala, Sumit Bhatia, Tanmoy Chakraborty

  • Can LLMs Automate Fact-Checking Article Writing?
    Dhruv Sahnan, David Corney, Irene Larraz, Giovanni Zagni, Ruben Miguez, Zhuohan Xie, Iryna Gurevych, Elizabeth Churchill, Tanmoy Chakraborty, Preslav Nakov

  • Self-Consistency Falls Short!: The Adverse Effects of Positional Bias on Long-Context Problems
    Adam Byerly, Daniel Khashabi

  • Localizing Factual Inconsistencies in Attributable Text Generation
    Arie Cattan, Paul Roit, Shiyue Zhang, David Wan, Roee Aharoni, Idan Szpektor, Mohit Bansal, Ido Dagan

  • On the Limitations of Language Targeted Pruning: Investigating the Calibration Language Impact in Multilingual LLM Pruning
    Simon Kurz, Jian-Jia Chen, Lucie Flek, Zhixue Zhao

  • Inferring Scientific Cross-Document Coreference and Hierarchy with Definition-Augmented Relational Reasoning
    Lior Forer, Tom Hope

  • Dissecting GraphRAG: A Modular Analysis of Knowledge Structuring for Factoid Question Answering
    Noriki Nishida, Rumana Ferdous Munne, Shanshan Liu, Narumi Tokunaga, Yuki Yamagata, Fei Cheng, Koiji Kozaki, Yuji Matsumoto

  • PsyMem: Fine-grained psychological alignment and Explicit Memory Control for Advanced Role-Playing LLMs
    Yunxiao Qin, Xilong Cheng, Yuting Tan, Zhengnan Li, Ye Wang, Hongjiang Xiao, Yuan Zhang, Yikang Liu

  • MoNaCo: More Natural and Complex Questions for Reasoning Across Dozens of Documents
    Tomer Wolfson, Harsh Trivedi, Mor Geva, Dan Roth, Tushar Khot, Ashish Sabharwal, Reut Tsarfaty

  • Goal Alignment in LLM-Based User Simulators for Conversational AI
    Shuhaib Mehri, Xiaocheng Yang, Takyoung Kim, Gokhan Tur, Shikib Mehri, Dilek Hakkani-Tür

  • Text-to-SQL Task-oriented Dialogue Ontology Construction
    Renato Vukovic, Carel van Niekerk, Michael Heck, Benjamin Ruppik, Hsien-Chin Lin, Shutong Feng, Nurul Lubis, Milica Gasic

  • A Survey on Memory-Efficient Fine-Tuning for Large Language Models
    Yeachan Kim, Mingyu Lee, SangKeun Lee

  • In-N-Out: A Parameter-Level API Graph Dataset for Tool Agents
    Seungkyu Lee, Nalim Kim, Yohan Jo

  • Universal Jailbreak Suffixes Are Strong Attention Hijackers
    Matan Ben-Tov, Mor Geva, Mahmood Sharif

  • Retain or Reframe? A Computational Framework for the Analysis of Framing in News Articles and Reader Comments
    Matteo Guida, Yulia Otmakhova, Eduard Hovy, Lea Frermann

  • From Belief Entrenchment to Robust Reasoning in LLM Agents
    Jihwan Oh, Minchan Jeong, Jongwoo Ko, Se-Young Yun

  • Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators
    Sungjib Lim, Woojung Song, Yohan Jo, Eun-Ju Lee

  • ResearchQA: Evaluating Scholarly Question Answering at Scale Across 75 Fields with Survey-Mined Questions and Rubrics
    Li S. Yifei, Allen Chang, Chaitanya Malaviya, Mark Yatskar

  • Start Making Sense(s): A Developmental Probe of Attention Specialization Using Lexical Ambiguity
    Pamela Riviere, Sean Trott