This is the code repository for the ICLR paper Weighted Training for Cross-Task Learning. If you use this code for your work, please cite
@article{chen2022weighted,
title={Weighted Training for Cross-Task Learning},
author={Chen, Shuxiao and Crammer, Koby and He, Hangfeng and Roth, Dan and Su, Weijie J},
journal={International Conference on Learning Representations},
year={2022}
}
Use virtual environment tools (e.g miniconda) to install packages and run experiments
python==3.6.7
pip install -r requirements.txt
The code is organized as follows:
- data/preprocess_max_len.py (preprocess the data with the max sentence length of BERT, it's almost the same as the preprocess.py in the huggingfacce ner.)
- data/process_data.py (preprocess the datasets for different settings)
- testing/significance_testing.py (test of statistical significance)
- utils_cross_task.py (prepare data for BERT based models)
- modeling_multi_bert.py (multitask models based on BERT)
- weighted_training_basics.py (some basic functions for weighted training)
- other python files (core learning paradigms for our experiments, including single-task learning, (weighted) pre-training, (weighted) joint training, and (weighted) normalized joint training, and some variants, such as pre training with fixed weights. The corresponding learning paradigms can be easily distinguished by their names.)
The scripts are organized as follows:
- run_experiments.sh (running experiments for our main results and analysis)
- other scripts (core learning paradigms for our experiments, including single-task learning, (weighted) pre-training, (weighted) joint training, and (weighted) normalized joint training, and some variants, such as pre training with fixed weights. The corresponding learning paradigms can be easily distinguished by their names.)
Change the /path/to/working/dir to the path to your working directory.
To reproduce the experiments for our main results and analysis:
sh scripts/run_experiments.sh
Note that you may need to divide the whole scripts into several parts to reproduce all experiments.