Title
Similarity preserving snippet-based visualization of web search results
Date Issued
01 January 2014
Access level
metadata only access
Resource Type
journal article
Author(s)
Roman F.S.
Pagliosa P.
Casaca W.
Helou E.S.
De Oliveira M.C.F.
Nonato L.G.
Publisher(s)
IEEE Computer Society
Abstract
Internet users are very familiar with the results of a search query displayed as a ranked list of snippets. Each textual snippet shows a content summary of the referred document (or webpage) and a link to it. This display has many advantages, for example, it affords easy navigation and is straightforward to interpret. Nonetheless, any user of search engines could possibly report some experience of disappointment with this metaphor. Indeed, it has limitations in particular situations, as it fails to provide an overview of the document collection retrieved. Moreover, depending on the nature of the query for example, it may be too general, or ambiguous, or ill expressed the desired information may be poorly ranked, or results may contemplate varied topics. Several search tasks would be easier if users were shown an overview of the returned documents, organized so as to reflect how related they are, content wise. We propose a visualization technique to display the results of web queries aimed at overcoming such limitations. It combines the neighborhood preservation capability of multidimensional projections with the familiar snippet-based representation by employing a multidimensional projection to derive two-dimensional layouts of the query search results that preserve text similarity relations, or neighborhoods. Similarity is computed by applying the cosine similarity over a 'bag-of-wordsâ' vector representation of collection built from the snippets. If the snippets are displayed directly according to the derived layout, they will overlap considerably, producing a poor visualization. We overcome this problem by defining an energy functional that considers both the overlapping among snippets and the preservation of the neighborhood structure as given in the projected layout. Minimizing this energy functional provides a neighborhood preserving two-dimensional arrangement of the textual snippets with minimum overlap. The resulting visualization conveys both a global view of the query results and visual groupings that reflect related results, as illustrated in several examples shown. © 2014 IEEE.
Start page
457
End page
470
Volume
20
Issue
3
Language
English
OCDE Knowledge area
Ingeniería de sistemas y comunicaciones
Informática y Ciencias de la Información
Subjects
Scopus EID
2-s2.0-84897377620
Source
IEEE Transactions on Visualization and Computer Graphics
ISSN of the container
10772626
DOI of the container
10.1109/TVCG.2013.242
Sources of information:
Directorio de Producción Científica
Scopus