Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/98189
PIRA download icon_1.1View/Download Full Text
Title: A parallel corpus-driven approach to bilingual oenology term banks: how culture differences influence wine tasting terms
Authors: Wang, VX
Chen, X 
Quan, S 
Huang, CR 
Issue Date: Oct-2020
Source: In ML Nguyen, MC Luong & S Song (Eds.), Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation, 24-26 October, 2020, University of Science, Vietnam National University Hanoi, Vietnam, p. 318-328. Association for Computational Linguistics, 2020
Abstract: This paper describes the construction of an English-Chinese Parallel Corpus of wine reviews and elaborates on one of its applications – i.e. an E-C bilingual oenology term bank of wine tasting terms. The corpus is sourced from Decanter China, containing 1211 aligned wine reviews in both English and Chinese with 149,463 Chinese characters and 66,909 English words. It serves as a dataset for investigating crosslingual and cross-cultural differences in describing the sensory properties of wines. Our log-likelihood tests revealed good candidates for the Chinese translations of the English words in wine reviews. One of the most challenging features of this domain-specific bilingual term bank is the dominant many-to-many nature of term mapping. We focused on the one-to-many English-Chinese mapping relations of two major types: (a) the words without a single precise translation (e.g. “palate”) and (b) the words that are underspecified and involve ‘place-holder’ translation (e.g. “aroma”). Our study differs from previous bilingual CompuTerm studies by focusing on an area where cultural and sensory experiences favour many-to-many mappings instead of the default one-to-one mapping preferred in scientific and jurisprudential areas. This necessity for many-to-many mappings in turn challenges the basic design feature of many state-of-the-art automatic bilingual term-extraction approaches.
Publisher: Association for Computational Linguistics
Description: 34th Pacific Asia Conference on Language, Information and Computation, Oct. 2020, Hanoi, Vietnam
Rights: Copyright of contributed papers reserved by respective authors.
Posted with permission of the author.
The following publication Vincent Xian Wang, Xi Chen, Songnan Quan, and Chu-Ren Huang. 2020. A Parallel Corpus-driven Approach to Bilingual Oenology Term Banks: How Culture Differences Influence Wine Tasting Terms. In Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation, pages 318–328, Hanoi, Vietnam. Association for Computational Linguistics is available at https://aclanthology.org/2020.paclic-1.37.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
Quan_Parallel_Corpus-Driven_Approach.pdf2.03 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

111
Citations as of May 11, 2025

Downloads

59
Citations as of May 11, 2025

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.