Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

SparkBOOST

Software
Publication Date:
2016
abstract:
SparkBOOST is a Java library built over Apache Spark that provides a distributed implementation of AdaBoost.MH and MP-Boost machine learning algorithms. These boosting algorithms are known to be very effective and robust to overfitting in many application domains, e.g. in natural language processing contexts. SparkBOOST offers to developers a fast way to scale these algorithms to large scale problems, where one needs to build classifiers from very large training datasets or simply needs to quickly classify huge stream of documents. The library can be integrated into custom programs by using a simple API. The SparkBOOST implementation also provides some command line tools to perform learning and classification on data sources available in LibSVM format.
Iris type:
05.11 Software
Keywords:
Classification; Boosting; Spark; Big data; SOFTWARE ENGINEERING. Design Tools and Techniques; Software Architectures; Language Classifications; Database Applications
List of contributors:
Fagni, Tiziano
Authors of the University:
FAGNI TIZIANO
Handle:
https://iris.cnr.it/handle/20.500.14243/323156
  • Overview

Overview

URL

https://github.com/tizfa/sparkboost
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)