Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

数据集 #12

Open
1984chen opened this issue Apr 30, 2020 · 2 comments
Open

数据集 #12

1984chen opened this issue Apr 30, 2020 · 2 comments

Comments

@1984chen
Copy link

请问现在训练用的数据集能上传了吗?

@1984chen
Copy link
Author

还有就是corpus.pkl

@XiaoyuanYi
Copy link
Member

@1984chen
你好,我们已经上传了一个古诗数据集用于测试比较各个模型:
https://github.com/THUNLP-AIPoet/Datasets/tree/master/CCPC
该数据集存储为了json个是,用的时候需要根据各个model重新预处理一下。corpus.pkl应该是预处理后的文件。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants