Title
Focusing synonymy on source code repositories
Other title
[Enfocando la sinonimia sobre repositorios de código fuente]
Date Issued
01 July 2020
Access level
metadata only access
Resource Type
journal article
Publisher(s)
Associacao Iberica de Sistemas e Tecnologias de Informacao
Abstract
We expect that a huge corpus of code could be rich in patterns, and the corpus of software has statistical properties, which are similar to the corpus of natural language. In spite of code and text are similar, written code is a new problem domain for the techniques of natural language processing. Taking in account that synonymy is the base for building a WordNet, this work addresses the synonymy in source code repositories, presenting an approach and techniques based on naming patterns and term frequency for that. The proposal has been evaluated in popular repositories of Apache, Eclipse and Red Hat. The results show that the proposal does not require an intensive processing.
Start page
573
End page
589
Volume
2020
Issue
E31
Language
Spanish
OCDE Knowledge area
Ciencias de la información
Scopus EID
2-s2.0-85089946122
Source
RISTI - Revista Iberica de Sistemas e Tecnologias de Informacao
ISSN of the container
16469895
Sources of information: Directorio de Producción Científica Scopus