Skip to content

WilliamG-LORA/NLP_multi_task

Repository files navigation

环境:

  1. cuda cuda 11.0,pytorch 1.8.0
  2. apex 应该用cuda 10.2,cuda 11.0 不兼容,参考这里修改apex,成功编译
  3. deepspeed pip install deepspeed==0.3.15

任务

zero-shot classication

dataset: tnews,data/tnews_RawData_example.json
train: scripts/zero-shot-tnews_small.sh

fill-in-the-blank

dataset: chid, data/chid_RawData_example.json
dataprocess: preprocess_chid_finetune.py
train: scripts/chid/finetune_chid_small.sh

dialog

dataset: STC, data/STC_RawData_example.json
dataprocess: preprocess_stc_finetune.py
train: finetune_lm_small.sh

待改进:

  1. fill-in-the-blank和dialog任务的预训练模型加载不成功
  2. 大模型显存占用过大

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors