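Environment:

cuda: CUDA 11.0, PyTorch 1.8.0
apex: expects CUDA 10.2 and is not compatible with CUDA 11.0; after modifying apex as described in the referenced guide, it compiled successfully
deepspeed: pip install deepspeed==0.3.15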
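
Before running the scripts, it may help to confirm that the installed packages match the versions listed above. The following is a minimal sketch, not part of this repository: the helper name `check_versions` and the `EXPECTED` table are hypothetical, and it assumes the packages were installed under the PyPI names `torch` and `deepspeed`. It only reads installed package metadata and does not verify the CUDA toolkit or the apex build.

```python
from importlib import metadata

# Versions taken from the environment notes above.
# The PyPI distribution names are an assumption.
EXPECTED = {"torch": "1.8.0", "deepspeed": "0.3.15"}


def check_versions(expected=EXPECTED):
    """Return {package: (installed_version_or_None, matches_expected)}."""
    report = {}
    for package, wanted in expected.items():
        try:
            installed = metadata.version(package)
        except metadata.PackageNotFoundError:
            installed = None
        # A local build tag such as "1.8.0+cu110" still counts as a match.
        ok = installed is not None and installed.split("+")[0] == wanted
        report[package] = (installed, ok)
    return report


if __name__ == "__main__":
    for package, (installed, ok) in check_versions().items():
        status = "OK" if ok else "MISMATCH"
        print(f"{package}: installed={installed}, status={status}")
```

A package that is not installed is reported with `installed=None` and `MISMATCH` rather than raising an error, so the check can be run before the environment is complete.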

Tasks

zero-shot classification

dataset: tnews, data/tnews_RawData_example.json
train: scripts/zero-shot-tnews_small.sh

fill-in-the-blank

dataset: chid, data/chid_RawData_example.json
dataprocess: preprocess_chid_finetune.py
train: scripts/chid/finetune_chid_small.sh

dialog

dataset: STC, data/STC_RawData_example.json
dataprocess: preprocess_stc_finetune.py
train: finetune_lm_small.sh

To be improved:

Loading the pretrained model fails for the fill-in-the-blank and dialog tasks
Large models use too much GPU memory

