Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel

Academic Article
Publication Date:
2014
abstract:
A major use of the 1000 Genomes Project (1000GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants. © 2014 Macmillan Publishers Limited. All rights reserved.
Iris type:
01.01 Articolo in rivista
List of contributors:
Maschio, Andrea; Porcu, Eleonora; Angius, Andrea; Cucca, Francesco; Colonna, Vincenza; Sanna, Serena; Busonero, Fabio; Sidore, Carlo
Authors of the University:
ANGIUS ANDREA
BUSONERO FABIO
COLONNA VINCENZA
MASCHIO ANDREA
SANNA SERENA
SIDORE CARLO
Handle:
https://iris.cnr.it/handle/20.500.14243/221569
Published in:
NATURE COMMUNICATIONS
Journal
  • Overview

Overview

URL

http://www.scopus.com/record/display.url?eid=2-s2.0-84902504030&origin=inward
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)