SpaTemHTP
"SpaTemHTP" is an analytical pipeline to address the challenges of processing and analyzing the vast amounts of data generated by high-throughput phenotyping (HTP) platforms, particularly in outdoor and field-based settings. With the rapid advancement of phenotyping technologies, accurately estimating the genotypic component of plant phenotype over time has become crucial and challenging, mainly when data inaccuracies, failures, and environmental factors substantially affect phenotype measurements.
Core Modules and Functionalities:
- Detection of Outliers: The first module in the SpaTemHTP pipeline focuses on identifying and handling outliers in the dataset, ensuring that anomalous data points do not skew subsequent analyses.
- Imputation of Missing Values: Recognizing the common issue of missing data in outdoor HTP platform measurements, this module efficiently imputes missing values, allowing for a more complete and accurate analysis of the plant growth data.
- Mixed-Model Genotype Adjusted Means Computation with Spatial Adjustment: The final module computes mixed-model genotype adjusted means, incorporating spatial adjustment to account for the environmental and positional effects on phenotype expression. This step is crucial for accurately estimating genotypic values and growth curves.
- Smooth Genotype Growth Curves Estimation: By sequentially applying these three steps, SpaTemHTP is particularly effective in estimating smooth genotype growth curves from raw data, even when such data contain significant noise.
- Change-Point Analysis and Growth Phase Modeling: A pipeline extension includes modeling the genotype time series data and performing change-point analysis to identify growth phases and the optimal timing for distinguishing genotypic differences.
- Genotype Clustering and ANOVA: The estimated genotypic values during the optimal growth phase are used to cluster genotypes, with two-way ANOVA confirming the consistency of these clusters throughout the growth duration.
Topic
Genotype and phenotype;Plant biology;Sequencing;Workflows;Agricultural science
Detail
Operation: Imputation;Genotyping;Phasing
Software interface: Command-line interface
Language: R
License: The GNU General Public License v3.0
Cost: Free with restrictions
Version name: 1.0.4.9999
Credit: The Bill and Melinda Gates Foundation to the Donald Danforth Plant Science Center (“Sorghum Genomics Toolbox”), CRP-GLDC ICRISAT, PSX.
Input: -
Output: -
Contact: Soumyashree Kar ksoumya2301@gmail.com ,Jana Kholová J.Kholova@cgiar.org
Collection: -
Maturity: -
Publications
- SpaTemHTP: A Data Analysis Pipeline for Efficient Processing and Utilization of Temporal High-Throughput Phenotyping Data.
- Kar S, et al. SpaTemHTP: A Data Analysis Pipeline for Efficient Processing and Utilization of Temporal High-Throughput Phenotyping Data. SpaTemHTP: A Data Analysis Pipeline for Efficient Processing and Utilization of Temporal High-Throughput Phenotyping Data. 2020; 11:552509. doi: 10.3389/fpls.2020.552509
- https://doi.org/10.3389/FPLS.2020.552509
- PMID: 33329623
- PMC: PMC7714717
Download and documentation
Documentation: https://github.com/ICRISAT-GEMS/SpaTemHTP/blob/master/README.md
Home page: https://github.com/ICRISAT-GEMS/SpaTemHTP
< Back to DB search