Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

Clustering protein structures with Hadoop

Conference Paper
Publication Date:
2016
abstract:
Machine learning is a widely used technique in structural biology, since the analysis of large conformational ensembles originated from single protein structures (e.g. derived from NMR experiments or molecular dynamics simulations) can be approached by partitioning the original dataset into sensible subsets, revealing important structural and dynamics behaviours. Clustering is a good unsupervised approach for dealing with these ensembles of structures, in order to identify stable conformations and driving characteristics shared by the different structures. A common problem of the applications that implement protein clustering is the scalability of the performance, in particular concerning the data load into memory. In this work we show how it is possible to improve the parallel performance of the GROMOS clustering algorithm by using Hadoop. The preliminary results show the validity of this approach, providing a hint for future development in this field.
Iris type:
04.01 Contributo in Atti di convegno
Keywords:
Hadoop Clustering; protein structures; Molecular dynamics; Data parallel
List of contributors:
Chiappori, Federica; Paschina, Giacomo; Roverelli, Luca; D'Agostino, Daniele; Merelli, Ivan
Authors of the University:
CHIAPPORI FEDERICA CATERINA
MERELLI IVAN
Handle:
https://iris.cnr.it/handle/20.500.14243/320436
Book title:
Computational Intelligence Methods for Bioinformatics and Biostatistics. CIBB 2015
  • Overview

Overview

URL

http://link.springer.com/chapter/10.1007/978-3-319-44332-4_11
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)