Predicting intelligibility and perceived linguistic distance by means of the Levenshtein algorithm

Title
Predicting intelligibility and perceived linguistic distance by means of the Levenshtein algorithm
Publication Date
2008-01
Author(s)
Beijering, Karin
Gooskens, Charlotte
Heeringa, Wilbert
Type of document
Journal Article
Language
en
Entity Type
Publication
Publisher
John Benjamins Publishing Co
Place of publication
Netherlands
DOI
10.1075/avt.25.05bei
UNE publication id
une:1959.11/30112
Abstract
In this article, we investigate the predictive value of so-called Levenshtein distances for both intelligibility scores and perceived linguistic distances. Additionally, we compare two measuring methods, namely normalised and non-normalised Levenshtein distances. The Levenshtein algorithm is a string edit distance measure that quantifies the distance between the pronunciations of corresponding words in different dialects or closely related languages. It calculates the minimal costs required to change a string of segments into another by means of insertions, deletions or substitutions. Kessler (1995) introduced the algorithm for measuring distances between Irish Gaelic dialects. Since then it has been applied successfully to Dutch dialects (Heeringa 2004, 213–278), Sardinian dialects (Bolognesi & Heeringa 2002) and German dialects (Nerbonne & Siedle 2005).
Link
Citation
Linguistics in the Netherlands, 25(1), p. 13-24
ISSN
1569-9919
0929-7332
Start page
13
End page
24

Files:

NameSizeformatDescriptionLink