Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

Improving Estimation Accuracy of Aggregate Queries on Data Cubes

Academic Article
Publication Date:
2010
abstract:
In this paper, we investigate the problem of estimation of a target database from summary databases derived from a base datacube. We show that such estimates can be derived by choosing a primary database with the desired target measure but not the desired dimensions, and use a proxy database to estimate the results. This technique is common in statistics, but an important issue we are addressing is the accuracy of these estimates. Specifically, given multiple primary and multiple proxy databases, the problem is how to select the primary and proxy databases that will generate the most accurate target database estimation possible. We propose an algorithmic approach which makes use of the principles of information entropy for determining the steps to select or compute the primary and proxy databases that provide the most accurate target database. We show that the primary database with the largest number of cells in common with the target database and the proxy database provides the more accurate estimates. We prove that this is consistent with maximizing the entropy. We provide some experimental results on the accuracy of the target database estimation in order to verify our results. Furthermore, we investigate the accuracy results in cases where the dimensions are defined over a hierarchy of categories and roll-up and drill-down operations are needed to generate the desired target results.
Iris type:
01.01 Articolo in rivista
Keywords:
Query estimation; Entropy; Accuracy analysis
List of contributors:
POURABBAS DOLATABAD, Elaheh
Handle:
https://iris.cnr.it/handle/20.500.14243/170312
Published in:
DATA & KNOWLEDGE ENGINEERING
Journal
  • Overview

Overview

URL

http://www.sciencedirect.com/science/article/pii/S0169023X09001281
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)