TY - JOUR
T1 - Resequencing of 200 human exomes identifies an excess of low-frequency non-synonymous coding variants
AU - Li, Yingrui
AU - Vinckenbosch, Nicolas
AU - Tian, Geng
AU - Huerta-Sanchez, Emilia
AU - Jiang, Tao
AU - Jiang, Hui
AU - Albrechtsen, Anders
AU - Andersen, Gitte
AU - Cao, Hongzhi
AU - Korneliussen, Thorfinn
AU - Grarup, Niels
AU - Guo, Yiran
AU - Hellman, Ines
AU - Jin, Xin
AU - Li, Qibin
AU - Liu, Jiangtao
AU - Liu, Xiao
AU - Sparsø, Thomas
AU - Tang, Meifang
AU - Wu, Honglong
AU - Wu, Renhua
AU - Yu, Chang
AU - Zheng, Hancheng
AU - Astrup, Arne
AU - Bolund, Lars
AU - Holmkvist, Johan
AU - Jørgensen, Torben
AU - Kristiansen, Karsten
AU - Schmitz, Ole
AU - Schwartz, Thue W
AU - Zhang, Xiuqing
AU - Li, Ruiqiang
AU - Yang, Huanming
AU - Wang, Jian
AU - Hansen, Torben
AU - Pedersen, Oluf
AU - Nielsen, Rasmus
AU - Wang, Jun
PY - 2010/11
Y1 - 2010/11
N2 - Targeted capture combined with massively parallel exome sequencing is a promising approach to identify genetic variants implicated in human traits. We report exome sequencing of 200 individuals from Denmark with targeted capture of 18,654 coding genes and sequence coverage of each individual exome at an average depth of 12-fold. On average, about 95% of the target regions were covered by at least one read. We identified 121,870 SNPs in the sample population, including 53,081 coding SNPs (cSNPs). Using a statistical method for SNP calling and an estimation of allelic frequencies based on our population data, we derived the allele frequency spectrum of cSNPs with a minor allele frequency greater than 0.02. We identified a 1.8-fold excess of deleterious, non-syonomyous cSNPs over synonymous cSNPs in the low-frequency range (minor allele frequencies between 2% and 5%). This excess was more pronounced for X-linked SNPs, suggesting that deleterious substitutions are primarily recessive.
AB - Targeted capture combined with massively parallel exome sequencing is a promising approach to identify genetic variants implicated in human traits. We report exome sequencing of 200 individuals from Denmark with targeted capture of 18,654 coding genes and sequence coverage of each individual exome at an average depth of 12-fold. On average, about 95% of the target regions were covered by at least one read. We identified 121,870 SNPs in the sample population, including 53,081 coding SNPs (cSNPs). Using a statistical method for SNP calling and an estimation of allelic frequencies based on our population data, we derived the allele frequency spectrum of cSNPs with a minor allele frequency greater than 0.02. We identified a 1.8-fold excess of deleterious, non-syonomyous cSNPs over synonymous cSNPs in the low-frequency range (minor allele frequencies between 2% and 5%). This excess was more pronounced for X-linked SNPs, suggesting that deleterious substitutions are primarily recessive.
KW - Base Sequence
KW - Chromosomes, Human, X/genetics
KW - Exons/genetics
KW - Gene Conversion/genetics
KW - Gene Frequency/genetics
KW - Genes, Recessive/genetics
KW - Genetic Variation
KW - Genetics, Population
KW - Human Genome Project
KW - Humans
KW - Introns/genetics
KW - Polymorphism, Single Nucleotide
KW - Untranslated Regions/genetics
U2 - 10.1038/ng.680
DO - 10.1038/ng.680
M3 - Article
C2 - 20890277
SN - 1061-4036
VL - 42
SP - 969
EP - 972
JO - Nature Genetics
JF - Nature Genetics
IS - 11
ER -