An analysis based on F-discrepancy for sampling in regression tree learning

Conference Paper
Publication Date:
2014
abstract:
When the problem of learning from data is solved through a regression tree estimator, the quality of the available observations is an important issue, since it directly influences the accuracy of the resulting model. It becomes particularly relevant when there is freedom to sample the input space arbitrarily to build the tree model or, alternatively, when a subsample must be selected, either to train the tree estimator on a computationally feasible input set or to evaluate the goodness of the estimation on a test set. Here the accuracy of estimation based on regression trees is analyzed from the point of view of the geometric properties of the available input data. In particular, the concept of F-discrepancy, a quantity that measures how well a set of points represents the distribution underlying the input generation process, is applied to derive conditions for convergence to the optimal piecewise-constant estimator of the unknown function to be learned. The analysis is constructive, making it possible to select good input sets in practice for the problem at hand, as shown in a simulation example involving a real data set.
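As a rough illustration of the quantity named in the abstract, the sketch below computes a one-dimensional F-discrepancy, assuming the common definition as the largest gap between the empirical distribution function of the points and the target distribution function F. It is not the procedure of the paper: the exponential distribution, the sample size, and the helper names `f_discrepancy_1d`, `exp_cdf`, and `exp_quantile` are assumptions chosen only for this example.

```python
import math
import random


def f_discrepancy_1d(points, cdf):
    """Return sup_x |F_n(x) - F(x)| for a one-dimensional point set.

    F_n is the empirical distribution function of `points`;
    `cdf` is the target distribution function F (assumed continuous).
    """
    xs = sorted(points)
    n = len(xs)
    worst = 0.0
    for i, x in enumerate(xs, start=1):
        fx = cdf(x)
        # F_n jumps from (i-1)/n to i/n at x, so both sides of the jump
        # must be compared with F(x).
        worst = max(worst, i / n - fx, fx - (i - 1) / n)
    return worst


def exp_cdf(x, rate=1.0):
    """Distribution function of an exponential law (illustrative choice)."""
    return 1.0 - math.exp(-rate * x) if x > 0 else 0.0


def exp_quantile(u, rate=1.0):
    """Inverse of exp_cdf on (0, 1)."""
    return -math.log(1.0 - u) / rate


n = 50
rng = random.Random(0)

# Points drawn at random from the target distribution.
random_sample = [rng.expovariate(1.0) for _ in range(n)]

# Points placed deterministically at the quantiles (2i - 1) / (2n).
quantile_sample = [exp_quantile((2 * i - 1) / (2 * n)) for i in range(1, n + 1)]

d_random = f_discrepancy_1d(random_sample, exp_cdf)
d_quantile = f_discrepancy_1d(quantile_sample, exp_cdf)

print(f"random sample:   {d_random:.4f}")
print(f"quantile points: {d_quantile:.4f}")
```

In this one-dimensional setting the quantile-based points attain the smallest possible value, 1/(2n), so a randomly drawn sample of the same size can never score lower. This is the sense in which a low F-discrepancy point set represents the underlying distribution well.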
Iris type:
04.01 Contributo in Atti di convegno
List of contributors:
Cervellera, Cristiano; Maccio', Danilo; Gaggero, Mauro
Authors of the University:
CERVELLERA CRISTIANO
GAGGERO MAURO
MACCIO' DANILO
Handle:
https://iris.cnr.it/handle/20.500.14243/287308
Published in:
PROCEEDINGS OF ... INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (PRINT)
Series

URL:
http://www.scopus.com/inward/record.url?eid=2-s2.0-84908469557&partnerID=q2rCbXpz