Challenging lexical coverage conventions: Evaluating the vocabulary demands of family-genre film and television

Title
Challenging lexical coverage conventions: Evaluating the vocabulary demands of family-genre film and television
Publication Date
2025-12-01
Author(s)
Milliner, Brett
Pinchbeck, Geoffrey
Type of document
Journal Article
Language
en
Entity Type
Publication
Publisher
Elsevier Ltd
Place of publication
The Netherland
DOI
10.1016/j.rmal.2025.100230
UNE publication id
une:1959.11/70978
Abstract

The contribution of studies investigating lexical coverage to the field of applied linguistics cannot be understated. Lexical coverage research has helped establish the vocabulary knowledge most essential for second language (L2) comprehension and elevate the importance of high-frequency vocabulary knowledge acquisition. Approaches to lexical coverage research have, however, begun to come under closer scrutiny in recent studies, with some experts questioning the accuracy of coverage estimates. Understanding these limitations, the current study applies an alternative approach to evaluating the lexical knowledge required to comprehend the OPUS-family-genre corpus, a collection of closed captions from 1597 family-genre films and television programs (10,744,767 tokens). In contrast to previous conventions that used band-based (1000-word) predictions of lexical coverage, in this study, coverage is evaluated at the individual word-unit level. It compares the coverage provided by four word lists: (1) a lemma list derived from tagging the OPUS-family-genre corpus, (2) a flemma list, and two word-family lists, (3) the BNC, and (4) the BNC/COCA. The study also models how a part-of-speech lexical tagger (TagAnt) can be used to evaluate lemma-based lexical coverage. The analysis revealed that English language learners will know 90, 95, and 98% of the running words appearing in family-genre films and television if they know the first 855, 2005, and 4393 flemmas, from the attached word lists. More simply, knowing the first 900 words from our supplementary word frequency lists would enable English language learners to start viewing family-genre films and television.

Link
Citation
Research Methods in Applied Linguistics, 4(3), p. 1-12
ISSN
2772-7661
Start page
1
End page
12
Rights
Attribution 4.0 International

Files:

NameSizeformatDescriptionLink