Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Natural vs. Random Protein Sequences: Discovering Combinatorics Properties on Amino Acid Words

Articolo
Data di Pubblicazione:
2015
Abstract:
Casual mutations and natural selection have driven the evolution of protein amino acid sequences that we observe at present in nature. The question about which is the dominant force of proteins evolution is still lacking of an unambigu- ous answer. Casual mutations tend to randomize protein sequences while, in order to have the correct functionality, one expects that selection mechanisms impose rigid contraints on amino acid sequences. Moreover, one also has to consider that the space of all possible amino acid sequences is so astonishingly large that it could be reasonable to have a well tuned amino acid sequence in- distinguishable from a random one. In order to study the possibility to discriminate between random and natural amino acid sequences, we introduce different measures of association between pairs of amino acids in a sequence, and apply them to a dataset of 1, 047 nat- ural protein sequences and 10, 470 random sequences, carefully generated in order to preserve the relative length and amino acid distribution of the natu- ral proteins. We analize the multidimensional measures with machine learning techniques and show that, to a reasonable extent, natural protein sequences can be differentiated from random ones
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Protein sequence; Random sequence; Combinatorics of words; Amino acid association
Elenco autori:
Felici, Giovanni; Vergni, Davide; Santoni, Daniele
Autori di Ateneo:
SANTONI DANIELE
VERGNI DAVIDE
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/302725
Pubblicato in:
JOURNAL OF THEORETICAL BIOLOGY
Journal
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)