University of Tasmania
Browse
76019 Journal Article.pdf (655.81 kB)

Comparing genotyping algorithms for Illumina's Infinium whole-genome SNP BeadChips

Download (655.81 kB)
journal contribution
posted on 2023-05-17, 10:33 authored by Ritchie, ME, Liu, R, Carvalho, BS, Bahlo, M, Booth, DR, Broadley, SA, Brown, MA, Simon James FooteSimon James Foote, Griffiths, LR, Kilpatrick, TJ, Lechner-Scott, J, Moscato, P, Perreau, VM, Rubio, JP, Scott, RJ, Jim Stankovich, Stewart, GJ, Bruce TaylorBruce Taylor, Wiley, J, Clarke, G, Cox, MB, Csurhes, PA, Danoy, P, Joanne DickinsonJoanne Dickinson, Drysdale, K, Field, J, Greer, JM, Guru, P, Hadler, J, Hoban, E, McMorran, BJ, Jensen, CJ, Johnson, LJ, McCallum, R, Merriman, M, Merriman, T, Polanowski, A, Pryce, K, Tajouri, L, Whittock, L, Wilkins, EJ, Browning, BL, Browning, SR, Perera, D, Butzkueven, H, Carroll, WM, Chapman, C, Kermode, AG, Marriott, M, Mason, D, Heard, RN, Pender, MP, Slee, M, Tubridy, N, Willoughby, E, Irizarry, RA
Background: Illumina’s Infinium SNP BeadChips are extensively used in both small and large-scale genetic studies. A fundamental step in any analysis is the processing of raw allele A and allele B intensities from each SNP into genotype calls (AA, AB, BB). Various algorithms which make use of different statistical models are available for this task. We compare four methods (GenCall, Illuminus, GenoSNP and CRLMM) on data where the true genotypes are known in advance and data from a recently published genome-wide association study. Results: In general, differences in accuracy are relatively small between the methods evaluated, although CRLMM and GenoSNP were found to consistently outperform GenCall. The performance of Illuminus is heavily dependent on sample size, with lower no call rates and improved accuracy as the number of samples available increases. For X chromosome SNPs, methods with sex-dependent models (Illuminus, CRLMM) perform better than methods which ignore gender information (GenCall, GenoSNP). We observe that CRLMM and GenoSNP are more accurate at calling SNPs with low minor allele frequency than GenCall or Illuminus. The sample quality metrics from each of the four methods were found to have a high level of agreement at flagging samples with unusual signal characteristics. Conclusions: CRLMM, GenoSNP and GenCall can be applied with confidence in studies of any size, as their performance was shown to be invariant to the number of samples available. Illuminus on the other hand requires a larger number of samples to achieve comparable levels of accuracy and its use in smaller studies (50 or fewer individuals) is not recommended.

History

Publication title

BMC Bioinformatics

Volume

12

Article number

68

Number

68

Pagination

1-12

ISSN

1471-2105

Department/School

Menzies Institute for Medical Research

Publisher

Biomed Central Ltd

Place of publication

Middlesex House, 34-42 Cleveland St, London, England, W1T 4Lb

Rights statement

Licensed under Creative Commons Attribution 2.0 Generic (CC BY 2.0) http://creativecommons.org/licenses/by/2.0/

Repository Status

  • Open

Socio-economic Objectives

Expanding knowledge in economics

Usage metrics

    University Of Tasmania

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC