Interactive query expansion with automatically generated category-specific thesauri

Book
Publication Date:
2001
abstract:
The categorization of documents into subject-specific categories is a useful enhancement for large document collections addressed by information retrieval systems, as a user can first browse a category tree in search of the category that best matches her interests, and then issue a query for more specific documents "from within the category". This approach combines two modalities in information seeking that are most popular in Web-based search engines, i.e. category-based site browsing (as exemplified by Yahoo™) and keyword-based document querying (as exemplified by AltaVista™). Appropriate query expansion tools need to be provided, though, in order to allow the user to incrementally refine her query through further retrieval passes, thus allowing the system to produce a series of subsequent document rankings that hopefully converge to the user's expected ranking. In this work we propose that automatically generated, category-specific "associative" thesauri be used for this purpose. We discuss a method for their generation, and show how the thesaurus specific to a given category may usefully be endowed with "gateways" to the thesauri specific to its parent and child categories.
Iris type:
03.01 Monograph or scientific treatise
Keywords:
Categorization; Information search and retrieval
List of contributors:
Sebastiani, Fabrizio
Authors of the University:
SEBASTIANI FABRIZIO
Handle:
https://iris.cnr.it/handle/20.500.14243/97846
Book title:
Text Databases and Document Management: Theory and Practice