ISO 24614-1:2010

Language resource management — Word segmentation of written texts — Part 1: Basic concepts and general principles
ISO 24614-1:2010 presents the basic concepts and general principles of word segmentation, and provides language-independent guidelines to enable written texts to be segmented, in a reliable and reproducible manner, into word segmentation units (WSU). The many applications and fields that need to segment texts into words — and thus to which ISO 24614-1:2010 can be applied — include translation, content management, speech technologies, computational linguistics and lexicography.
OEN:
ISO
Langue:
English
Code(s) de l'ICS:
01.140.10
Statut:
Publié
Date de Publication:
2010-10-24
Numéro Standard:
ISO 24614-1:2010