Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/5202
DC Field | Value | Language
dc.contributor | Department of Computing | -
dc.creator | Li, W | -
dc.creator | Lu, Q | -
dc.date.accessioned | 2014-12-11T08:26:11Z | -
dc.date.available | 2014-12-11T08:26:11Z | -
dc.identifier.isbn | 978-4-905166-02-3 | -
dc.identifier.uri | http://hdl.handle.net/10397/5202 | -
dc.language.iso | en | en_US
dc.publisher | Institute for Digital Enhancement of Cognitive Development, Waseda University | en_US
dc.rights | © 2011 The PACLIC 25 Organizing Committee and PACLIC Steering Committee | en_US
dc.rights | Copyright of contributed papers reserved by respective authors | en_US
dc.rights | Copyright 2011 by Wanyin Li, Qin Lu | en_US
dc.subject | Collocation extraction | en_US
dc.subject | Statistical model | en_US
dc.subject | Syntactic rules | en_US
dc.subject | Semantic relationship | en_US
dc.subject | Similarity calculation | en_US
dc.subject | HowNet | en_US
dc.title | A hybrid extraction model for Chinese noun/verb synonym bi-gram | en_US
dc.type | Conference Paper | en_US
dcterms.abstract | Statistics-based collocation extraction approaches suffer from (1) low precision, because bi-grams with high co-occurrence may be syntactically unrelated and are thus not true collocations; and (2) low recall, because some true collocations with low occurrence counts cannot be identified by statistical models. Integrating both syntactic rules and semantic knowledge into a statistical model for collocation extraction is one way to achieve high precision while keeping reasonable recall. This paper designs a cascade system that employs a hybrid model, integrating both syntactic and semantic knowledge into a statistical model, to extract Chinese synonymous noun/verb collocations. The grammatically bounded noun/verb collocations are first extracted by a syntactic-rule-based module; its output is then passed to a semantics-based module for further retrieval of low-frequency bi-gram collocations. | -
dcterms.accessRights | open access | en_US
dcterms.bibliographicCitation | Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation (PACLIC 25), 16-18 Dec, Nanyang Technological University, Singapore, p. 430-439 | -
dcterms.issued | 2011-12-16 | -
dc.identifier.scopus | 2-s2.0-84863869937 | -
dc.identifier.rosgroupid | r60560 | -
dc.description.ros | 2011-2012 > Academic research: refereed > Refereed conference paper | -
dc.description.oa | Version of Record | en_US
dc.identifier.FolderNumber | OA_IR/PIRA | en_US
dc.description.pubStatus | Published | en_US
Appears in Collections: Conference Paper
Files in This Item:
File | Description | Size | Format
Li_Hybrid_Extraction_Bi-gram.pdf | | 135.34 kB | Adobe PDF
Open Access Information
Status: open access
File Version: Version of Record

Page views: 117 (last week: 1), as of Mar 24, 2024

Downloads: 119, as of Mar 24, 2024

Scopus citations: 1 (last week: 0), as of Mar 29, 2024

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.