Tutorials

Morning: 9:00-12:30

Towards Effective and Efficient Multi-Agent Language Model Systems: Foundations, Prospects, and Applications

Organizers: Xuan Wang, Shuxiang Cao, Yuchen Zhuang and Wenqi Shi

Description: Multi-agent systems powered by large language models (LLMs) offer a promising paradigm for tackling complex reasoning, decision-making, and problem-solving tasks. However, achieving both effectiveness and efficiency in such systems remains a critical challenge. This tutorial introduces recent advances in building effective and efficient multi-agent LLM systems, focusing on three core components. First, we discuss the design of individual LLM agents. We present state-of-the-art techniques for enabling capable agents using efficient and compact LLMs, including model distillation, dynamic routing, and memory- and compute-efficient serving, providing a foundation for scalable and responsive agent design under resource constraints. Second, we cover coordination and communication among agents, crucial for collective performance, highlighting methods for improving multi-agent reasoning and decision-making through prompt and graph optimization, sycophancy mitigation, and structured LLM-based frameworks. Last, we explore real-world applications of LLM agents in areas such as industry, healthcare, quantum computing, and various scientific domains.

Future of Work in the Age of LLMs

Organizers: Zhiruo Wang, Yijia Shao, David Nguyen and Diyi Yang

Description: The recent development of large language models (LLMs) has revolutionized the landscape of human work. These models possess the ability to follow complex human instructions and operate versatile computer software, enabling them to participate in, augment, or even automate realistic occupational tasks that once thought to be exclusive to humans. As LLMs are increasingly integrated into workplaces, they are already reshaping labor dynamics and raising urgent concerns about job displacement, diminished human agency, and overreliance on automation. As a result, the future of work is undergoing a profound transformation: How will human occupations and task requirements evolve over time? And what roles will LLM-based systems play, as they become increasingly capable collaborators and autonomous workers? And how can we build technological and data infrastructures to support human-AI collaboration?

This tutorial will provide an overview of the future of work shaped by the interplay of LLMs and humans, examining the emerging challenges, opportunities, and ethical considerations in this dynamic landscape. We begin by examining the economic landscape of work and how NLP technologies drive automation, followed by methods for developing LLMs that augment human labor and recent advances in LLM-based agents. We then cover evaluation approaches for workplace contexts, including datasets, benchmarks, and metrics, and conclude with open questions on technical, human, and societal implications.

The Data Frontier for Large Language Models: Selection, Synthesis, and Tools

Organizers: Lijun Wu, Wentao Zhang and Conghui He

Description: As the development of Large Language Models (LLMs) matures, the focus of the research community is undergoing a critical shift from a purely model-centric to a data-centric paradigm. It is now evident that the quality, diversity, and composition of training data—not merely its scale—are the primary drivers of a model's advanced capabilities, from complex reasoning to reliable instruction following. However, acquiring and curating such high-quality data remains a significant bottleneck.

This tutorial provides a comprehensive and practical guide to the state-of-the-art in data research directions for LLMs. We structure the tutorial around the two core pillars of modern data strategy: intelligent data selection and advanced data synthesis.

In the first part, we delve into methods for curating the most valuable information from vast, noisy datasets, covering techniques like LLM-as-a-judge for automated quality filtering and active learning for maximizing annotation efficiency.

The second part explores the synthetic data revolution, detailing paradigms that range from generating complex reasoning traces (e.g., Chain-of-Thought) to deploying sophisticated multi-agent workflows that can autonomously create high-quality, diverse instruction data from raw seeds.

Finally, we will conclude with a practical overview of open-source tools and platforms that facilitate these data-centric workflows, empowering researchers and practitioners to build better models through better data.

Attendees will leave with a principled framework and actionable insights for designing and implementing the advanced data strategies required to build the next generation of powerful, specialized, and aligned LLMs.

Afternoon: 14:00-17:30

The Interplay between Metaphors and NLP

Organizers: Joanne Boisson, Jose Camacho-Collados, Elisa Sanchez-Bayona and Rodrigo Agerri

Description: This tutorial will provide an overview of the field of metaphor processing in NLP. We will discuss the influence of various metaphor theories on the creation of annotated resources and on the research projects and algorithm designs. We will particularly focus on recent research made possible by the last generation of transformer-based models and LLMs for metaphor interpretation in multilingual and multimodal settings. Two types of audience may benefit from this tutorial: researchers in the humanities and computational social sciences interested in automatic or semi automatic metaphor analysis, and NLP researchers interested in understanding metaphor processing and improving metaphor modeling.

Knowledge Control for Responsible Generative AI: Bridging Academia,Industry, and Society

Organizers: Zheyuan Liu, Yixin Wan, Kai-Wei Chang, Meng Jiang, Jieyu Zhao, Nouha Dziri, Yuning Mao, Jiachen Gu, and Jindong Gu

Description: This tutorial explores post-training knowledge control as a practical and responsible framework for shaping the behavior of large generative models. It brings together perspectives from academia, industry, and society to address the growing need for scalable, interpretable, and ethically grounded interventions in AI systems. Through discussions of diverse knowledge control techniques like machine unlearning, knowledge editing, and inference-time control, the tutorial illustrates how these approaches can remove or adjust undesirable model behaviors while preserving integrity and usefulness. It emphasizes bridging research, deployment, and societal perspectives, offering attendees a holistic understanding of how advances across academia, industry, and society can jointly promote safer, fairer, and more accountable generative AI.

Current Advances in LLM Reasoning

Organizers: Akhil Arora, Vishrav Chaudhary, Julia Kreutzer, Nearchos Potamitis, Nouha Dziri and Niket Tandon

Description: As Large language models (LLMs) increasingly tackle reasoning-heavy tasks, from mathematics to commonsense to multilingual understanding, researchers face three pressing questions: How well do models reason? How can we make them reason better? What are the next frontiers in LLM reasoning? This tutorial answers these questions through a unified view of LLM reasoning. This tutorial explores comprehensive evaluation strategies to assess the reasoning abilities of models and discusses two types of methods to improve models' reasoning: advanced inference time methods, such as structured and self-improvement inference methods, and (ii) post-training methods, such as RLHF, DPO, and GRPO that aim to make LLMs think more like humans. The tutorial explores these technical discussions while maintaining a practical outlook through illustrative demos and short guided hands-on exercises. The tutorial is designed for both researchers and practitioners seeking practical insights into LLM reasoning.