*This README.txt file was generated on 2025-01-02 by ELE SALTMARSH*

General Information
------------------

### Title: *Data and Code for "Algorithm-Driven Root Optimization for Maize Yield in the Midwest"*

### Author Information:
- Principal Investigator: Ele Saltmarsh
- Co-investigator: Armen Kemanian
- Corresponding Author: Armen Kemanian (kxa15@psu.edu)

### Date of data collection/creation: Jan 2025

### Funders and sponsors of data collection:
Funding for this research was provided by USDA-NIFA Award No. 2021-67034-35044. Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the authors and should not be construed to represent any official USDA or U.S. Government determination or policy.

Data & File Overview
--------------------

### File list:

"Source Code.r" - Contains step-by-step code and annotations to reproduce all analyses carried out in the manuscript.

"max_evo.csv" - Presents results from the optimization, including county (County), latitude (lat), longitude (lon), optimized yield in Mg/ha (Yield), the model (Model), and the root distribution parameters A (PAR_0), B (PAR_1), C (PAR_2), and D (PAR_3) that produce this yield.

"max_evo_root_frac.csv" - As above, with the root fraction in each soil layer.

"max_evo_root_frac_clustered.csv" - As above, with the cluster assigned based on the optimal root distribution.

"env_vars.csv" - Environmental data showing total precipitation (PPT) and extreme degree days (EDD), and the mean daily maximum (TX) and minimum (TN) temperature, solar radiation (SOLAR), daytime vapor pressure deficit (VPD), and plant available water between 0-1 m (PAW.0_1) and 1-2 m (PAW.1_2) in the soil, for the whole growing season and for the vegetative and reproductive growth periods.

"balanced_rf_data.csv" - Combined location, cluster, and environmental data upscaled using SMOTE balancing techniques.

"paw_data.csv" - Contains date (Date), daily transpiration (Transpiration), daily soil water content in each layer (seq of 0.025 to 1.975 by 0.05, representing the depth of each layer), rooting phenotype (Roots), year (Year), and county (County).

"cluster_comparisons.csv" - Contains County, Year, Cluster, Yield, Transpiration, lat, and lon.

"convergence_data.csv" - Contains Generation, Individual, Yield (FITNESS), Population, and Model.

"repeatability_data.csv" - Contains County, the maximum yield identified by the comprehensive search (comp_max), the maximum yield identified by optimization search repeat 1 (opt_max_1), the difference between comp_max and opt_max_1 (diff_1), diff_1 as a percentage of comp_max (perc_1), the maximum yield identified by optimization search repeat 2 (opt_max_2), the difference between comp_max and opt_max_2 (diff_2), diff_2 as a percentage of comp_max (perc_2), and Model.