Title
Automatic Term Extraction Combining Different Information
Other title
Extraction automatique de termes combinant différentes informations
Date Issued
01 January 2014
Access level
metadata only access
Resource Type
conference paper
Author(s)
Université de Montpellier 2
Publisher(s)
Association pour le Traitement Automatique des Langues
Abstract
Comprehensive terminology is essential for a community to describe, exchange, and retrieve data. In multiple domain, the explosion of text data produced has reached a level for which automatic terminology extraction and enrichment is mandatory. Automatic Term Extraction (or Recognition) methods use natural language processing to do so. Methods featuring linguistic and statistical aspects as often proposed in the literature, rely some problems related to term extraction as low frequency, complexity of the multi-word term extraction, human effort to validate candidate terms. In contrast, we present two new measures for extracting and ranking muli-word terms from domain-specific corpora, covering the all mentioned problems. In addition we demonstrate how the use of the Web to evaluate the significance of a multi-word term candidate, helps us to outperform precision results obtain on the biomedical GENIA corpus with previous reported measures such as C-value.
Start page
407
End page
412
Volume
2
Language
French
OCDE Knowledge area
LingĂ¼Ăstica
Estudios generales de idiomas
Subjects
Scopus EID
2-s2.0-85123701454
ISBN
9780000000002
Conference
Proceedings of TALN 2014
Sources of information:
Directorio de ProducciĂ³n CientĂfica
Scopus