Title
Exploring scientific literature by textual and image content using DRIFT
Date Issued
01 April 2022
Access level
metadata only access
Resource Type
journal article
Author(s)
Getulio Vargas FoundationPraia de Botafogo
Publisher(s)
Elsevier Ltd
Abstract
Digital libraries represent the most valuable resource for storing, querying, and retrieving scientific literature. Traditionally, the reader/analyst aims to compose a set of articles based on keywords, according to his/her preferences, and manually inspect the resulting list of documents. Except for the articles which share citations or common keywords, the results retrieved will be limited to those which fulfill a syntactic match. Besides, if instead of having an article as a reference, the user has an image, the process of finding and exploring articles with similar content becomes infeasible. This paper proposes a visual analytic methodology for exploring and analyzing scientific document collections that consider both textual and image content. The proposed technique relies on combining multiple Content-Based Image Retrieval (CBIR) components and multidimensional projection to map the documents to a visual space based on their similarity, thus enabling an interactive exploration. Moreover, we extend its analytical capabilities with visual resources to display complementary information on selected documents that uncover hidden patterns and semantic relations. We evidence the effectiveness of our methodology through three case studies and a user evaluation, which attest to its usefulness during the process of scientific collections exploration.
Start page
140
End page
152
Volume
103
Language
English
OCDE Knowledge area
Matemáticas aplicadas Ciencias de la computación
Scopus EID
2-s2.0-85125507564
Source
Computers and Graphics (Pergamon)
ISSN of the container
00978493
Sponsor(s)
This work was supported by CONCYTEC-Peru through its executing unit ProCiencia (grant #419-2019 ), Universidad Católica San Pablo, CNPq-Brazil (grants #303552/2017-4 , #312483/2018-0 ), São Paulo Research Foundation (FAPESP)-Brazil (grant #2013/ 07375-0 ) and Getulio Vargas Foundation . The views expressed are those of the authors and do not reflect the official policy or position of the São Paulo Research Foundation. This work was supported by CONCYTEC-Peru through its executing unit ProCiencia (grant #419-2019), Universidad Cat?lica San Pablo, CNPq-Brazil (grants #303552/2017-4, #312483/2018-0), S?o Paulo Research Foundation (FAPESP)-Brazil (grant #2013/ 07375-0) and Getulio Vargas Foundation. The views expressed are those of the authors and do not reflect the official policy or position of the S?o Paulo Research Foundation.
Sources of information: Directorio de Producción Científica Scopus