SpaTemHTP

"SpaTemHTP" is an analytical pipeline to address the challenges of processing and analyzing the vast amounts of data generated by high-throughput phenotyping (HTP) platforms, particularly in outdoor and field-based settings. With the rapid advancement of phenotyping technologies, accurately estimating the genotypic component of plant phenotype over time has become crucial and challenging, mainly when data inaccuracies, failures, and environmental factors substantially affect phenotype measurements.

Core Modules and Functionalities:

- Detection of Outliers: The first module in the SpaTemHTP pipeline focuses on identifying and handling outliers in the dataset, ensuring that anomalous data points do not skew subsequent analyses.

- Imputation of Missing Values: Recognizing the common issue of missing data in outdoor HTP platform measurements, this module efficiently imputes missing values, allowing for a more complete and accurate analysis of the plant growth data.

- Mixed-Model Genotype Adjusted Means Computation with Spatial Adjustment: The final module computes mixed-model genotype adjusted means, incorporating spatial adjustment to account for the environmental and positional effects on phenotype expression. This step is crucial for accurately estimating genotypic values and growth curves.

- Smooth Genotype Growth Curves Estimation: By sequentially applying these three steps, SpaTemHTP is particularly effective in estimating smooth genotype growth curves from raw data, even when such data contain significant noise.

- Change-Point Analysis and Growth Phase Modeling: A pipeline extension includes modeling the genotype time series data and performing change-point analysis to identify growth phases and the optimal timing for distinguishing genotypic differences.

- Genotype Clustering and ANOVA: The estimated genotypic values during the optimal growth phase are used to cluster genotypes, with two-way ANOVA confirming the consistency of these clusters throughout the growth duration.

Topic

Genotype and phenotype;Plant biology;Sequencing;Workflows;Agricultural science

Detail

  • Operation: Imputation;Genotyping;Phasing

  • Software interface: Command-line interface

  • Language: R

  • License: The GNU General Public License v3.0

  • Cost: Free with restrictions

  • Version name: 1.0.4.9999

  • Credit: The Bill and Melinda Gates Foundation to the Donald Danforth Plant Science Center (“Sorghum Genomics Toolbox”), CRP-GLDC ICRISAT, PSX.

  • Input: -

  • Output: -

  • Contact: Soumyashree Kar ksoumya2301@gmail.com ,Jana Kholová J.Kholova@cgiar.org

  • Collection: -

  • Maturity: -

Publications

  • SpaTemHTP: A Data Analysis Pipeline for Efficient Processing and Utilization of Temporal High-Throughput Phenotyping Data.
  • Kar S, et al. SpaTemHTP: A Data Analysis Pipeline for Efficient Processing and Utilization of Temporal High-Throughput Phenotyping Data. SpaTemHTP: A Data Analysis Pipeline for Efficient Processing and Utilization of Temporal High-Throughput Phenotyping Data. 2020; 11:552509. doi: 10.3389/fpls.2020.552509
  • https://doi.org/10.3389/FPLS.2020.552509
  • PMID: 33329623
  • PMC: PMC7714717

Download and documentation


< Back to DB search