AbCD
AbCD calculates pre-estimated effective sample sizes for sequencing-based genetic study design across minor allele frequency (MAF) categories, sequencing coverages (0.5–30X), sample sizes (20–20,000), and four ethnic groups (Europeans, Africans, Asians, African Americans).
Key Features:
- Pre-estimated effective sample sizes: Provides effective sample size estimates stratified by MAF categories.
- Coverage and sample size range: Covers arbitrary sequencing depths from 0.5–30X and sample sizes from 20 to 20,000.
- Ethnic group-specific estimates: Supplies estimates tailored to Europeans, Africans, Asians, and African Americans.
- ShotGun short-read simulation: Simulates short reads with user-specified read length, average depth, and cycle-specific sequencing error rates and generates realistic read depth distributions for low- and high-depth scenarios.
- DesignPlanner integration and LD-aware genotype calling: Integrates ShotGun-generated sequence data with initial SNP discovery and employs a linkage disequilibrium-aware method to call genotypes and produce MAF-specific effective sample sizes.
- Evaluation of combined sequencing and genotyping data: Supports assessment of combined designs such as whole-genome low-depth with exonic high-depth sequencing and integration with existing GWAS genotyping data.
Scientific Applications:
- Study design optimization: Optimizes allocation of sequencing coverage and sample size to achieve target statistical power across MAF strata.
- Low-coverage sequencing strategies: Informs use of low-depth sequencing in large cohorts by providing MAF-specific effective sample size estimates.
- Genome-wide association studies (GWAS): Guides design choices for large-scale GWAS and other association studies relying on sequencing data.
- Combined sequencing and integration with GWAS: Assists planning of studies combining whole-genome low-depth and exonic high-depth sequencing or integrating sequence data with existing GWAS genotypes.
Methodology:
ShotGun performs short-read simulation with parameters for read length, average depth, and cycle-specific sequencing error rates to generate read depth distributions; DesignPlanner integrates ShotGun output with initial SNP discovery and uses a linkage disequilibrium-aware genotype-calling method to produce MAF-specific effective sample sizes.
Topics
Collections
Details
- License:
- Not licensed
- Tool Type:
- desktop application, web application
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- Java
- Added:
- 8/20/2017
- Last Updated:
- 1/19/2020
Operations
Data Inputs & Outputs
Data retrieval
Publications
Kang J, Huang K, Xu Z, Wang Y, Abecasis GR, Li Y. AbCD: arbitrary coverage design for sequencing-based genetic studies. Bioinformatics. 2013;29(6):799-801. doi:10.1093/bioinformatics/btt041. PMID:23357921. PMCID:PMC3597143.