📢 Outstanding Paper Award @ ICML 2025
CollabLLM transforms traditional language models from passive responders to active collaborators in multi-turn conversations. This repository provides the complete framework for computing multiturn-aware rewards and training collaborative language models.
To get started, create a new environment and install collabllm via pip:

```bash
conda create -n collabllm python=3.10
conda activate collabllm
pip install collabllm
```
If you need distributed training:

```bash
pip install deepspeed
conda install mpi4py
```
You may install additional packages (e.g., `pip install bigcodebench matplotlib`) for task-specific metrics or evaluation.
- **Lightweight usage**: Compute Multiturn-aware Rewards (MRs) for any model responses and construct datasets following `notebook_tutorials/` (a conceptual sketch of MRs follows this list).
- **Synthetic data generation**: Generate high-quality synthetic conversational data following `scripts/engine/build_dataset.py`. (Include your API keys in a `.env` file.)
- **Train CollabLLM**: Run SFT/DPO training to maximize MRs following the examples under `scripts/train/*.py`.
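To make the idea behind MRs concrete, here is a minimal, self-contained sketch of what a multiturn-aware reward computes: the expected downstream task reward of conversations that continue from a candidate response. The function names, the toy user simulator, and the random task reward are illustrative stand-ins, not the `collabllm` package API; see `notebook_tutorials/` for the actual interface.

```python
# Conceptual sketch of a Multiturn-aware Reward (MR): score a candidate response by
# the average task reward of sampled conversations that continue from it.
# All names below are illustrative stand-ins, NOT the collabllm package API.
import random
from statistics import mean

def simulate_continuation(history, response, extra_turns=2):
    """Stand-in for a user simulator + model rollout that extends the dialogue."""
    future = [("user", "..."), ("assistant", "...")] * extra_turns
    return history + [("assistant", response)] + future

def task_reward(conversation):
    """Stand-in for a task-specific metric (e.g., accuracy or a helpfulness score)."""
    return random.random()

def multiturn_aware_reward(history, response, num_rollouts=4):
    """Average the task reward over sampled futures that start from `response`."""
    rollouts = [simulate_continuation(history, response) for _ in range(num_rollouts)]
    return mean(task_reward(conv) for conv in rollouts)

history = [("user", "Help me write a sorting function.")]
print(multiturn_aware_reward(history, "Should it be stable, and which language do you prefer?"))
```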
To apply CollabLLM to a new task:
- **Add a Dataset**: Place your single-turn dataset in `examples/single_turn_ds/` and register it in `__init__.py`.
- **(Optional) Add Metrics**: Add new metrics to `examples/metrics/` and register them in `__init__.py` (see the sketch after this list).
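As an illustration of the metrics step, a new metric can be a small callable that scores a full conversation. The class name, call signature, and conversation format below are assumptions made for the sketch; check the existing files in `examples/metrics/` for the interface this repo actually expects.

```python
# examples/metrics/my_task_metric.py -- illustrative sketch only; the base class and
# registration hook used by this repo may differ from what is shown here.
class MyTaskMetric:
    """Hypothetical metric: fraction of assistant turns that end with a question."""
    name = "my_task_metric"

    def __call__(self, conversation):
        # `conversation` is assumed to be a list of (role, text) pairs.
        assistant_turns = [text for role, text in conversation if role == "assistant"]
        if not assistant_turns:
            return 0.0
        return sum(t.strip().endswith("?") for t in assistant_turns) / len(assistant_turns)
```

Then import the new class in `examples/metrics/__init__.py` so it can be referenced by name from your configs.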
You can now run data generation, reward computation, and model training using your customized setup.
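For example, the end-to-end flow might look like the commands below; the flags and the exact training script name are placeholders, so consult each script's `--help` (or the files under `scripts/`) for the real arguments.

```bash
# Illustrative commands only -- argument names and the train script name are assumptions.
python scripts/engine/build_dataset.py --dataset my_task   # synthetic data generation + MR computation
python scripts/train/train_dpo.py --dataset my_task        # SFT/DPO training to maximize MRs
```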
If you find our work useful in your research, please cite the following:
```bibtex
@inproceedings{collabllm2025,
  title={CollabLLM: From Passive Responders to Active Collaborators},
  author={Shirley Wu and Michel Galley and Baolin Peng and Hao Cheng and
          Gavin Li and Yao Dou and Weixin Cai and James Zou and
          Jure Leskovec and Jianfeng Gao},
  booktitle={International Conference on Machine Learning (ICML)},
  year={2025}
}
```