SAQC
SAQC evaluates genome-wide single-nucleotide polymorphism (SNP) array data quality by computing allele-frequency (AF) deviation indices and applying empirical distribution–based confidence-interval methods to detect poor-quality arrays and samples.
Key Features:
- Quality indices development: Introduces quality indices that measure deviation of estimated individual-level allele frequencies (AFs) from expected values using standardized distances.
- Statistical properties evaluation: Investigates statistical properties of the quality indices and reports empirical lognormal distributions across large genomic studies.
- Reference data establishment: Provides AF reference data and quality-index reference data for multiple SNP array platforms derived from diverse reference populations.
- Confidence interval methodology: Implements a confidence-interval method that leverages empirical distributions of quality indices to identify poor-quality SNP arrays or DNA samples.
- Data visualization and evaluation: Provides visualization tools to interpret data quality metrics for SNP array assessments.
Scientific Applications:
- Genome-wide association studies (GWAS): Ensures accuracy and precision of downstream analyses in GWAS by identifying and mitigating poor-quality SNP array data.
- SNP array–based genetic research: Supports quality assessment across genetic studies that use genome-wide SNP arrays and reference-population comparisons.
Methodology:
Quantifies deviations in allele frequencies via standardized distances, establishes empirical distributions for these quality indices (noting lognormal behavior), and applies an empirical confidence-interval method to detect poor-quality arrays or samples.
Topics
Details
- Tool Type:
- desktop application
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- R
- Added:
- 12/18/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Yang H, Lin H, Kang M, Chen C, Lin C, Li L, Wu J, Chen Y, Pan W. SAQC: SNP Array Quality Control. BMC Bioinformatics. 2011;12(1). doi:10.1186/1471-2105-12-100. PMID:21501472. PMCID:PMC3101186.
Documentation
Links
Software catalogue
http://www.mybiosoftware.com/saqc-1-0-snp-array-quality-control.html