This is a comprehensive archive of newswire text data that has been acquired from Chinese news sources by the LDC over several years. ... There are 286 files, totalling approximately 1.5GB in compressed form."--LDC catalog.
Title from disc label. Data source(s): Newswire. Application(s): Natural language processing, language modeling, information retrieval. Author(s): David Graff, Ke Chen.