Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Design, Implementation and Test of a Flexible Tor-oriented Web Mining Toolkit

Contributo in Atti di convegno
Data di Pubblicazione:
2017
Abstract:
Searching and retrieving information from the Web is a primary activity needed to monitor the development and usage of Web resources. Possible benefits include improving user experience (e.g. by optimizing query results) and enforcing data/user security (e.g. by identifying harmful websites). Motivated by the lack of ready-to-use solutions, in this paper we present a flexible and accessible toolkit for structure and content mining, able to crawl, download, extract and index resources from the Web. While being easily configurable to work in the "surface" Web, our suite is specifically tailored to explore the Tor dark Web, i.e. the ensemble of Web servers composing the world's most famous darknet. Notably, the toolkit is not just a Web scraper, but it includes two mining modules, respectively able to prepare content to be fed to an (external) semantic engine, and to reconstruct the graph structure of the explored portion of the Web. Other than discussing in detail the design, features and performance of our toolkit, we report the findings of a preliminary run over Tor, that clarify the potential of our solution.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
dark web; tor web graph
Elenco autori:
Guarino, Stefano; Celestini, Alessandro
Autori di Ateneo:
CELESTINI ALESSANDRO
GUARINO STEFANO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/391925
  • Dati Generali

Dati Generali

URL

http://doi.acm.org/10.1145/3102254.3102266
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)