Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

WebDocs: a real-life huge transactional dataset

Conference Paper
Publication Date:
2004
abstract:
This short note describes the main characteristics of WebDocs, a huge real-life transactional dataset we made publicly available to the Data Mining community through the FIMI repository. We built WebDocs from a spidered collection of web html documents. The whole collection contains about 1.7 millions documents, mainly written in English, and its size is about 5GB.
Iris type:
04.01 Contributo in Atti di convegno
Keywords:
Frequent itemsets mining datasets
List of contributors:
Orlando, Salvatore; Silvestri, Fabrizio; Lucchese, Claudio; Perego, Raffaele
Authors of the University:
PEREGO RAFFAELE
Handle:
https://iris.cnr.it/handle/20.500.14243/58442
  • Overview

Overview

URL

http://ftp.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-126/
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)