Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/40011
DC Field | Value | Language
dc.contributor | Department of Computing | -
dc.creator | Li, S | -
dc.creator | Song, T | -
dc.creator | Gao, J | -
dc.creator | Yao, P | -
dc.creator | Li, W | -
dc.date.accessioned | 2016-05-17T10:08:52Z | -
dc.date.available | 2016-05-17T10:08:52Z | -
dc.identifier.issn | 1003-0077 | -
dc.identifier.uri | http://hdl.handle.net/10397/40011 | -
dc.language.iso | zh | en_US
dc.publisher | 中国中文信息学会 ; 北京信息工程学院 | en_US
dc.rights | © 2009 China Academic Journal Electronic Publishing House. It is to be used strictly for educational and research use. | en_US
dc.subject | Artificial intelligence | en_US
dc.subject | Natural language processing | en_US
dc.subject | Domain analysis | en_US
dc.subject | Domain term | en_US
dc.subject | Domain term component | en_US
dc.subject | Link analysis | en_US
dc.subject | Usage discrepancy | en_US
dc.title | A method of lexical domain analysis based on usage discrepancy | en_US
dc.type | Journal/Magazine Article | en_US
dc.identifier.spage | 72 | -
dc.identifier.epage | 78 | -
dc.identifier.volume | 23 | -
dc.identifier.issue | 6 | -
dcterms.abstract | (Translated from the Chinese abstract.) Domain knowledge is ultimately expressed through the domain specificity of vocabulary, so analyzing the domain degree of domain terms and their components is a key step. Building on word segmentation, this paper analyzes corpora from several domains, exploits the relations between words, and introduces a link analysis method to assess how important each word's usage is in each domain. It then computes a word's domain degree from the discrepancy of its usage across domains, thereby performing domain analysis and obtaining the domain component words of a given domain. Experiments on domains such as military and entertainment show that, compared with the commonly used tf×idf and Bootstrapping methods, this approach performs domain analysis and acquires domain component words more effectively. | -
dcterms.abstract | The representation of domain knowledge usually centers on domain lexicons, so domain analysis of terms and term components is a natural task. In this paper, we propose a novel domain analysis method based on the discrepancy of lexical usage. Starting from word segmentation results, we introduce a link analysis method to compute the usage degree of each word in several typical domain corpora. Then, by analyzing the discrepancy of word usage across domains, we acquire the domain term components with larger usage discrepancy. The method was evaluated on several domains, including military and entertainment, and achieved better results than the commonly used tf×idf and Bootstrapping methods. | -
dcterms.accessRights | open access | en_US
dcterms.alternative | 一种基于使用差异的词语领域性分析方法 | -
dcterms.bibliographicCitation | 中文信息学报 (Journal of Chinese information processing), 2009, v. 23, no. 6, p. 72-78 | -
dcterms.isPartOf | 中文信息学报 (Journal of Chinese information processing) | -
dcterms.issued | 2009 | -
dc.identifier.rosgroupid | r48014 | -
dc.description.ros | 2009-2010 > Academic research: refereed > Publication in refereed journal | -
dc.description.oa | Version of Record | en_US
dc.identifier.FolderNumber | OA_IR/PIRA | en_US
dc.description.pubStatus | Published | en_US
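
The abstract in the record above outlines two steps: a link analysis pass that scores how heavily each word is used within a domain corpus, and a comparison of those scores across domains. The Python sketch below is only one possible reading of that outline, not the authors' implementation. The record does not give the paper's formulas, so everything here is an assumption made for illustration: the co-occurrence window, the PageRank-style iteration standing in for "link analysis", the share-of-total definition of domain degree, and the helper names `build_graph`, `usage_scores`, and `domain_degree`. The toy corpora are invented and assumed to be already segmented into tokens.

```python
from collections import defaultdict


def build_graph(sentences, window=2):
    """Build an undirected, weighted co-occurrence graph.

    Assumption: two words are linked when they appear within
    `window` tokens of each other in a segmented sentence.
    """
    graph = defaultdict(lambda: defaultdict(float))
    for tokens in sentences:
        for i, word in enumerate(tokens):
            for other in tokens[i + 1 : i + 1 + window]:
                if other != word:
                    graph[word][other] += 1.0
                    graph[other][word] += 1.0
    return graph


def usage_scores(graph, damping=0.85, iterations=50):
    """PageRank-style power iteration; the scores sum to 1 per domain.

    Assumption: this stands in for the paper's link analysis step.
    """
    nodes = list(graph)
    if not nodes:
        return {}
    n = len(nodes)
    score = {w: 1.0 / n for w in nodes}
    # Every node was created by at least one edge, so weights are > 0.
    out_weight = {w: sum(graph[w].values()) for w in nodes}
    for _ in range(iterations):
        new = {w: (1.0 - damping) / n for w in nodes}
        for w in nodes:
            for v, weight in graph[w].items():
                new[v] += damping * score[w] * weight / out_weight[w]
        score = new
    return score


def domain_degree(word, target, scores_by_domain):
    """Share of a word's total usage score that falls in `target`.

    Assumption: a hypothetical discrepancy measure in [0, 1];
    1.0 means the word is used only in the target domain.
    """
    total = sum(s.get(word, 0.0) for s in scores_by_domain.values())
    if total == 0.0:
        return 0.0
    return scores_by_domain[target].get(word, 0.0) / total


# Invented toy corpora, already segmented into tokens.
corpora = {
    "military": [
        ["tank", "fires", "missile"],
        ["missile", "hits", "tank"],
        ["the", "tank", "moves"],
    ],
    "entertainment": [
        ["the", "singer", "releases", "album"],
        ["the", "album", "tops", "chart"],
    ],
}
scores = {d: usage_scores(build_graph(s)) for d, s in corpora.items()}

print(domain_degree("tank", "military", scores))  # 1.0 (absent elsewhere)
print(domain_degree("the", "military", scores))   # strictly between 0 and 1
```

In this toy setup a word seen in only one domain gets degree 1.0, while a word shared across domains gets a lower value. Raw scores from corpora with very different vocabulary sizes are not directly comparable, so a real experiment would need some normalization that this sketch leaves out.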
Appears in Collections: Journal/Magazine Article
Files in This Item:
File | Description | Size | Format
r48014.pdf | | 222.17 kB | Adobe PDF
Open Access Information
Status: open access
File Version: Version of Record
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.