Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/98202
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Chinese and Bilingual Studiesen_US
dc.creatorLiu, Hen_US
dc.creatorNeergaard, Ken_US
dc.creatorSantus, Een_US
dc.creatorHuang, CRen_US
dc.date.accessioned2023-04-17T07:31:17Z-
dc.date.available2023-04-17T07:31:17Z-
dc.identifier.isbn978-2-9517408-9-1en_US
dc.identifier.urihttp://hdl.handle.net/10397/98202-
dc.descriptionTenth International Conference on Language Resources and Evaluation (LREC'16), May 23-28, 2016, Portorož, Sloveniaen_US
dc.language.isoenen_US
dc.publisherEuropean Language Resources Association (ELRA)en_US
dc.rightsCopyright by the European Language Resources Associationen_US
dc.rightsThe LREC 2016 Proceedings are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/)en_US
dc.rightsThe following publication Liu Hongchao, Karl Neergaard, Enrico Santus, and Chu-Ren Huang. 2016. EVALution-MAN: A Chinese Dataset for the Training and Evaluation of DSMs. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 4583–4587, Portorož, Slovenia. European Language Resources Association (ELRA) is available at https://aclanthology.org/L16-1726.en_US
dc.subjectDataseten_US
dc.subjectDistributioanl Semantic Modelsen_US
dc.subjectTariningen_US
dc.subjectEvaluatingen_US
dc.titleEVALution-MAN : a Chinese dataset for the training and evaluation of DSMsen_US
dc.typeConference Paperen_US
dc.identifier.spage4583en_US
dc.identifier.epage4587en_US
dcterms.abstractDistributional semantic models (DSMs) are currently being used in the measurement of word relatedness and word similarity. One shortcoming of DSMs is that they do not provide a principled way to discriminate different semantic relations. Several approaches have been adopted that rely on annotated data either in the training of the model or later in its evaluation. In this paper, we introduce a dataset for training and evaluating DSMs on semantic relations discrimination between words, in Mandarin, Chinese. The construction of the dataset followed EVALution 1.0, which is an English dataset for the training and evaluating of DSMs. The dataset contains 360 relation pairs, distributed in five different semantic relations, including antonymy, synonymy, hypernymy, meronymy and nearsynonymy. All relation pairs were checked manually to estimate their quality. In the 360 word relation pairs, there are 373 relata. They were all extracted and subsequently manually tagged according to their semantic type. The relatas’frequency was calculated in a combined corpus of Sinica and Chinese Gigaword. To the best of our knowledge, EVALution-MAN is the first of its kind for Mandarin, Chinese.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn N Calzolari, K Choukri, T Declerck, S Goggi, M Grobelnik, B Maegaard, J Mariani, H Mazo, A Moreno, J Odijk & S Piperidis (Eds.), Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), p. 4583-4587. Portorož, Slovenia : European Language Resources Association (ELRA), 2016en_US
dcterms.issued2016-05-
dc.relation.ispartofbookProceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)en_US
dc.relation.conferenceInternational Conference on Language Resources and Evaluation [LREC]en_US
dc.publisher.placePortorož, Sloveniaen_US
dc.description.validate202304 bcwwen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberCBS-0390-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work is partially funded by the Hong Kong PhD Fellowship Scheme for Enrico Santus under PF12-13656.en_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS9585651-
dc.description.oaCategoryCCen_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Huang_Evalution-Man_Chinese_Dataset.pdf146.48 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

70
Citations as of May 11, 2025

Downloads

28
Citations as of May 11, 2025

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.