Predicting intelligibility and perceived linguistic distance by means of the Levenshtein algorithm

Beijering, Karin; Gooskens, Charlotte; Heeringa, Wilbert

doi:10.1075/avt.25.05bei

Title

Predicting intelligibility and perceived linguistic distance by means of the Levenshtein algorithm

Publication Date

2008-01

Author(s)

Beijering, Karin

Gooskens, Charlotte

Heeringa, Wilbert

Type of document

Journal Article

Language

en

Entity Type

Publication

Publisher

John Benjamins Publishing Co

Place of publication

Netherlands

DOI

10.1075/avt.25.05bei

UNE publication id

une:1959.11/30112

Abstract

In this article, we investigate the predictive value of so-called Levenshtein distances for both intelligibility scores and perceived linguistic distances. Additionally, we compare two measuring methods, namely normalised and non-normalised Levenshtein distances. The Levenshtein algorithm is a string edit distance measure that quantifies the distance between the pronunciations of corresponding words in different dialects or closely related languages. It calculates the minimal costs required to change a string of segments into another by means of insertions, deletions or substitutions. Kessler (1995) introduced the algorithm for measuring distances between Irish Gaelic dialects. Since then it has been applied successfully to Dutch dialects (Heeringa 2004, 213–278), Sardinian dialects (Bolognesi & Heeringa 2002) and German dialects (Nerbonne & Siedle 2005).

Link

link

Citation

Linguistics in the Netherlands, 25(1), p. 13-24

ISSN

1569-9919

0929-7332

Start page

13

End page

24

Predicting intelligibility and perceived linguistic distance by means of the Levenshtein algorithm

Files: