A fuzzy method for RNA-Seq differential expression analysis in presence of multireads

Article
Publication date:
2016
Abstract:
Background: When the reads obtained from high-throughput RNA sequencing are mapped against a reference database, a significant proportion of them, known as multireads, can map to more than one reference sequence. These multireads originate from gene duplications, repetitive regions or overlapping genes. In RNA-Seq analyses, removing multireads from the mapping results causes an underestimation of the read counts, while estimating the real read count can lead to false positives during the detection of differentially expressed sequences.

Results: We present an innovative approach to deal with multireads and evaluate differential expression events, entirely based on fuzzy set theory. Since multireads cause uncertainty in the estimation of read counts during gene expression computation, they can also reduce the reliability of differential expression analysis results by producing false positives. Our method manages the uncertainty in gene expression estimation by defining fuzzy read counts, and evaluates the possibility that a gene is differentially expressed using three fuzzy concepts: over-expression, same-expression and under-expression. The output of the method is a list of differentially expressed genes enriched with information about the uncertainty of the results due to the presence of multireads. We have tested the method on RNA-Seq data designed for case-control studies and compared the results with those of other existing tools for read count estimation and differential expression analysis.

Conclusions: Managing multireads with fuzzy sets yields a list of differential expression events that takes into account the uncertainty in the results caused by the presence of multireads. Biologists can use this additional information when selecting the most relevant differential expression events to validate with laboratory assays. Our method can be used to compute reliable differential expression events and to highlight possible false positives in the lists of differentially expressed genes computed with other tools.
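
The abstract names fuzzy read counts and three fuzzy concepts but gives no formulas, so the following is only a minimal Python sketch of the general idea, not the authors' published method. Everything in it is an assumption made for illustration: the triangular shape of the fuzzy count, the hypothetical `share` parameter that places its peak, the crisp log2 fold-change `threshold`, the pseudocount, and all function and class names.

```python
from dataclasses import dataclass
import math


@dataclass(frozen=True)
class FuzzyCount:
    """Triangular fuzzy read count (assumed shape, low <= peak <= high)."""
    low: float   # reads that map uniquely: the count cannot be lower
    peak: float  # most plausible count (hypothetical estimate)
    high: float  # unique reads plus every multiread: the count cannot be higher

    def alpha_cut(self, alpha):
        """Interval of counts whose membership degree is at least alpha."""
        return (self.low + alpha * (self.peak - self.low),
                self.high - alpha * (self.high - self.peak))


def fuzzy_count(unique, multi, share=0.5):
    """Build a fuzzy count; `share` is an assumed fraction of multireads kept."""
    return FuzzyCount(unique, unique + share * multi, unique + multi)


def possibilities(case, control, threshold=1.0, steps=100, pseudo=1.0):
    """Possibility of over-, same- and under-expression (case vs control).

    For each alpha level, interval arithmetic on the alpha-cuts gives the
    range of log2 fold changes still considered plausible. Alpha-cuts shrink
    as alpha grows, so the last alpha at which a concept is still reachable
    is its possibility degree (on a grid of `steps` levels).
    """
    poss = {"over": 0.0, "same": 0.0, "under": 0.0}
    for i in range(steps + 1):
        alpha = i / steps
        c_lo, c_hi = case.alpha_cut(alpha)
        k_lo, k_hi = control.alpha_cut(alpha)
        lfc_lo = math.log2((c_lo + pseudo) / (k_hi + pseudo))
        lfc_hi = math.log2((c_hi + pseudo) / (k_lo + pseudo))
        if lfc_hi >= threshold:
            poss["over"] = alpha
        if lfc_lo <= -threshold:
            poss["under"] = alpha
        if lfc_lo < threshold and lfc_hi > -threshold:
            poss["same"] = alpha
    return poss


# A gene without multireads gives a clear-cut answer ...
clean = possibilities(fuzzy_count(400, 0), fuzzy_count(100, 0))
# ... while a gene whose case count depends on 300 multireads does not.
uncertain = possibilities(fuzzy_count(100, 300), fuzzy_count(100, 0))
```

In this sketch the second gene is still possibly over-expressed, but same-expression also keeps a non-zero possibility, which is the kind of uncertainty flag the abstract describes attaching to each differential expression event.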
CRIS type:
01.01 Journal article
Keywords:
RNA-Seq; Differential expression; Multireads; Fuzzy sets; Possibilistic modeling
Author list:
Caratozzolo, Mariano Francesco; Marzano, Flavia; Consiglio, Arianna; Grillo, Giorgio; Liuni, Sabino
Institutional authors:
CARATOZZOLO MARIANO FRANCESCO
CONSIGLIO ARIANNA
GRILLO GIORGIO
Link to the full record:
https://iris.cnr.it/handle/20.500.14243/333567
Published in:
BMC BIOINFORMATICS
Journal
General data

URL

https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1195-2