Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/5502
Title: A grammar-informed corpus-based sentence database for linguistic and computational studies
Authors: Xu, H
Chen, HK
Huang, CR 
Lu, Q 
Chiu, TS
Shi, DT 
Issue Date: May-2012
Source: LREC 2012 : proceedings of the 8th International Conference on Language Resources and Evaluation : Istanbul, Turkey, 23-25 May 2012, p. 3140-3144
Abstract: We adopt a corpus-informed approach to selecting example sentences for the construction of a reference grammar. In the process, we construct a database of sentences carefully selected by linguistic experts, covering the full range of linguistic facts described in an authoritative Chinese reference grammar and structured according to that grammar. A search engine is developed to help users find the most typical examples they need to study a linguistic problem or prove their hypotheses. The database can also be used by computational linguists as a training corpus for models of Chinese word segmentation, POS tagging, and sentence parsing.
Keywords: Chinese reference grammar
Sentence database
Linguistic study
Publisher: European Language Resources Association (ELRA)
Rights: Reproduced with permission of the author.
The conference paper is available at http://www.lrec-conf.org/proceedings/lrec2012/summaries/401.html
Appears in Collections: Conference Paper

Files in This Item:
File: PublishedPaper_401_Paper[1].pdf
Size: 409.35 kB
Format: Adobe PDF
Open Access Status: open access
File Version: Version of Record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.