- cuda: CUDA 11.0, PyTorch 1.8.0
- apex: should be built against CUDA 10.2 and is not compatible with CUDA 11.0; it compiled successfully after modifying apex as described here
- deepspeed: pip install deepspeed==0.3.15
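
The pinned versions above can be checked before launching a script. The following is a minimal sketch, not part of this repository: the helper name `check_versions` is hypothetical, and it assumes the packages are installed under the PyPI names `torch` and `deepspeed`.

```python
from importlib import metadata

# Versions taken from the list above. The PyPI package names are an assumption.
EXPECTED = {"torch": "1.8.0", "deepspeed": "0.3.15"}


def check_versions(expected, get_version=metadata.version):
    """Return {package: (expected, found_or_None)} for every mismatch."""
    mismatches = {}
    for name, want in expected.items():
        try:
            found = get_version(name)
        except metadata.PackageNotFoundError:
            found = None  # package is not installed
        # A local build tag such as "1.8.0+cu110" still counts as 1.8.0.
        if found is None or found.split("+")[0] != want:
            mismatches[name] = (want, found)
    return mismatches


if __name__ == "__main__":
    for name, (want, found) in check_versions(EXPECTED).items():
        print(f"{name}: expected {want}, found {found}")
```

An empty result means both packages match. The CUDA version that PyTorch was built with is not covered here; it can be read from `torch.version.cuda` once PyTorch is installed.
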
dataset: tnews, data/tnews_RawData_example.json
train: scripts/zero-shot-tnews_small.sh
dataset: chid, data/chid_RawData_example.json
dataprocess: preprocess_chid_finetune.py
train: scripts/chid/finetune_chid_small.sh
dataset: STC, data/STC_RawData_example.json
dataprocess: preprocess_stc_finetune.py
train: finetune_lm_small.sh
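
The dataset, preprocessing, and training entries above can be kept in one lookup table. This is only a sketch: the `TASKS` table and the `steps` helper are hypothetical, and running the scripts through `python` and `bash` without arguments is an assumption, since the source does not show how they are invoked or which arguments they need.

```python
# Paths are copied from the entries above; the table itself is hypothetical.
TASKS = {
    "tnews": {
        "data": "data/tnews_RawData_example.json",
        "preprocess": None,  # no preprocessing script is listed for tnews
        "train": "scripts/zero-shot-tnews_small.sh",
    },
    "chid": {
        "data": "data/chid_RawData_example.json",
        "preprocess": "preprocess_chid_finetune.py",
        "train": "scripts/chid/finetune_chid_small.sh",
    },
    "STC": {
        "data": "data/STC_RawData_example.json",
        "preprocess": "preprocess_stc_finetune.py",
        "train": "finetune_lm_small.sh",
    },
}


def steps(task):
    """List the commands for one task, preprocessing first if there is any."""
    entry = TASKS[task]
    commands = []
    if entry["preprocess"]:
        # Assumption: the real script probably needs arguments not shown here.
        commands.append(["python", entry["preprocess"]])
    commands.append(["bash", entry["train"]])
    return commands


if __name__ == "__main__":
    for name in TASKS:
        print(name, steps(name))
```

The function only builds the command lists and does not run anything, so it is safe to call before the script arguments are known.
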
- Loading the pretrained model fails for the fill-in-the-blank and dialog tasks
- Large models use too much GPU memory
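
For the GPU memory issue, a rough lower bound can be computed from the parameter count alone. The sketch below is not a measurement of these models: the function name and the example parameter count are hypothetical, and it assumes fp16 weights (2 bytes per parameter) for inference and mixed-precision Adam (about 16 bytes per parameter for fp16 weights, fp16 gradients, and fp32 weights, momentum, and variance) for training. Activations and framework overhead are ignored.

```python
GIB = 2 ** 30

# Bytes per parameter under the assumptions stated above.
BYTES_INFERENCE_FP16 = 2            # fp16 weights only
BYTES_TRAINING_MIXED_ADAM = 16      # 2 + 2 (fp16 weights, grads) + 4 * 3 (fp32 states)


def estimate_gib(num_params, training=True):
    """Estimate memory in GiB for model states, excluding activations."""
    per_param = BYTES_TRAINING_MIXED_ADAM if training else BYTES_INFERENCE_FP16
    return num_params * per_param / GIB


if __name__ == "__main__":
    params = 1_000_000_000  # hypothetical model size, not taken from the source
    print(f"inference: {estimate_gib(params, training=False):.1f} GiB")
    print(f"training:  {estimate_gib(params, training=True):.1f} GiB")
```

If the training estimate exceeds the memory of a single GPU, the model states have to be split across devices or offloaded, which is what DeepSpeed's ZeRO stages are intended for.
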