Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/31289
Title: Conditional Entropy Measures Intelligibility among Related Languages
Contributor(s): Moberg, Jens (author); Gooskens, Charlotte (author); Nerbonne, John (author); Vaillette, Nathan (author)
Publication Date: 2007
Open Access: Yes
Handle Link: https://hdl.handle.net/1959.11/31289
Open Access Link: https://clinjournal.org/clinj/CLIN17
Abstract: The Scandinavian languages are so alike that their speakers often communicate, each using their own language, which Haugen (1966) dubbed SEMICOMMUNICATION. The success of semi-communication depends on the languages involved, and, moreover, can be asymmetric: for example, Swedish is more easily understandable for a Dane than Danish for a Swede. It has been argued that non-linguistic factors could explain intelligibility, including its asymmetry. Gooskens (2006), however, found a high correlation between linguistic distance and intelligibility. This suggests that we need to seek linguistic factors that influence intelligibility, and that potentially asymmetric factors would be particularly interesting. Gooskens' distance techniques cannot capture asymmetry. The present paper attempts to develop a model of the success of semi-communication based on conditional entropy, in particular using the conditional entropy of the phoneme mapping in corresponding (cognate) words. Semantically corresponding words were taken from frequency lists and aligned, and the conditional entropy of the phoneme mapping in aligned word pairs was calculated. This gives us information about the difficulty of predicting a phoneme in a native language given a corresponding phoneme in the foreign language. We also examine the conditional entropy of selected word classes, such as native/loan and function/content words.
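Illustrative note: the quantity described in the abstract can be sketched as follows. This is a minimal sketch, not the authors' implementation. It assumes the aligned word pairs have already been reduced to a list of (native phoneme, foreign phoneme) pairs; the function name `conditional_entropy` and the toy pairs are hypothetical and are not data from the paper.

```python
import math
from collections import Counter


def conditional_entropy(pairs):
    """Return H(X | Y) in bits for a list of (x, y) pairs.

    Here x is taken to be the native phoneme and y the corresponding
    foreign phoneme (an assumption made for this illustration).
    """
    pairs = list(pairs)
    total = len(pairs)
    joint = Counter(pairs)                # counts of (x, y)
    given = Counter(y for _, y in pairs)  # counts of the conditioning symbol y
    h = 0.0
    for (x, y), n in joint.items():
        p_xy = n / total                  # joint probability p(x, y)
        p_x_given_y = n / given[y]        # conditional probability p(x | y)
        h -= p_xy * math.log2(p_x_given_y)
    return h


# Hypothetical toy alignment: (native, foreign) phoneme pairs.
toy_pairs = [
    ("a", "a"), ("a", "a"), ("e", "a"), ("o", "a"),
    ("t", "t"), ("t", "t"), ("d", "t"), ("d", "t"),
]

h_native_given_foreign = conditional_entropy(toy_pairs)
# Swapping the roles of the two languages gives the opposite direction.
h_foreign_given_native = conditional_entropy([(y, x) for x, y in toy_pairs])
```

In this toy data each foreign phoneme maps to several native phonemes, while each native phoneme maps to exactly one foreign phoneme, so the two directions yield different values. That difference is the kind of asymmetry a symmetric distance measure cannot express.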
Publication Type: Conference Publication
Conference Details: CLIN17: 17th Meeting of Computational Linguistics in the Netherlands, Leuven, Belgium, 12th January, 2007
Source of Publication: Proceedings of the 17th Meeting of Computational Linguistics in the Netherlands, p. 51-66
Publisher: Landelijke Onderzoekschool Taalwetenschap, Netherlands Graduate School of Linguistics
Place of Publication: Netherlands
ISSN: 2211-4009
Fields of Research (FoR) 2008: 200310 Other European Languages
200406 Language in Time and Space (incl. Historical Linguistics, Dialectology)
Socio-Economic Objective (SEO) 2008: 970120 Expanding Knowledge in Language, Communication and Culture
950201 Communication Across Languages and Culture
Peer Reviewed: Yes
HERDC Category Description: E1 Refereed Scholarly Conference Publication
Appears in Collections: Conference Publication
School of Humanities, Arts and Social Sciences

Files in This Item:
3 files

Page view(s): 1,316 (checked on Mar 8, 2023)
Download(s): 8 (checked on Mar 8, 2023)

Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.