Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/98194
PIRA download icon_1.1View/Download Full Text
Title: MYCanCor : a video corpus of spoken Malaysian Cantonese
Authors: Liesenfeld, A 
Issue Date: May-2018
Source: In N Calzolari, K Choukri, C Cieri, T Declerck, K Hasida, H Isahara, B Maegaard, J Mariani, A Moreno, J Odijk, S Piperidis & T Tokunaga (Eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, p. 764-767. European Language Resources Association (ELRA), 2018.
Abstract: The Malaysia Cantonese Corpus (MYCanCor) is a collection of recordings of Malaysian Cantonese speech mainly collected in Perak, Malaysia. The corpus consists of around 20 hours of video recordings of spontaneous talk-in-interaction (56 settings) typically involving 2-4 speakers. A short scene description as well as basic speaker information is provided for each recording. The corpus is transcribed in CHAT (minCHAT) format and presented in traditional Chinese characters (UTF8) using the Hong Kong Supplementary Character Set (HKSCS). MYCanCor is expected to be a useful resource for researchers interested in any aspect of spoken language processing or Chinese multimodal corpora.
Keywords: Malaysian Cantonese
Spoken corpora
Naturally-occurring talk-in-interaction
Publisher: European Language Resources Association (ELRA)
ISBN: 979-10-95546-00-9
Description: Eleventh International Conference on Language Resources and Evaluation (LREC 2018), May 7-12, 2018, Miyazaki, Japan
Rights: Copyright by the European Language Resources Association
The LREC 2018 Proceedings are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/)
The following publication Liesenfeld, A. (2018, May). MYCanCor: A Video Corpus of spoken Malaysian Cantonese. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) is available at https://aclanthology.org/L18-1122.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
Liesenfeld_Mycancor_Video_Corpus.pdf558.75 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

168
Last Week
17
Last month
Citations as of Nov 10, 2025

Downloads

35
Citations as of Nov 10, 2025

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.