Paper ID | HLT-18.2 | ||
Paper Title | A Large-Scale Chinese Long-text Extractive Summarization Corpus | ||
Authors | Kai Chen, Guanyu Fu, Qingcai Chen, Baotian Hu, Harbin Institute of Technology, Shenzhen, China | ||
Session | HLT-18: Language Understanding 6: Summarization and Comprehension | ||
Location | Gather.Town | ||
Session Time: | Friday, 11 June, 13:00 - 13:45 | ||
Presentation Time: | Friday, 11 June, 13:00 - 13:45 | ||
Presentation | Poster | ||
Topic | Human Language Technology: [HLT-LRES] Language Resources and Systems | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | Recently, large-scale datasets have vastly facilitated the development in nearly domains of Natural Language Processing. However, lacking large scale Chinese corpus is still a critical bottleneck for further research on deep text summarization methods. In this paper, we publish a large-scale Chinese Long-text Extractive Summarization corpus named CLES. The CLES contains about 104K |