Sunbelt Computer Software

PhysioFlow — Crop Physiology Data Analysis Platform

An open-source interactive web platform for exploratory data analysis, spatial modeling, and machine learning of crop ecophysiological parameters (soybean and sugarcane). Built within the Goiás Verde Project.

🌐 Live demo →

Deployed on Streamlit Community Cloud. Login-protected via Supabase — request credentials from the maintainer.

English | Português | Español

Version: 1.0
Initiative: Goiás Verde Project (Instituto Federal Goiano – Campus Rio Verde & Center of Excellence in Exponential Agriculture – CEAGRE)

📖 System Operation Manual →

End-to-end walkthrough: installation, data upload, cleaning pipeline, EDA, modeling, spatial analysis, time series, group comparison, experimental statistics, statistical glossary, and FAQ. 16 chapters, 26 screenshots. Extended abstract and introductory chapter available in English; complete edition currently in Portuguese — see manual.pt.md. Spanish edition: incremental translation in progress.

📋 Overview

This repository contains the Streamlit application developed to consolidate the cleaning, descriptive analysis, predictive modeling, and geospatial modeling workflow of Crop Physiology field datasets. The application validates data collected by photosynthesis systems (IRGA), chlorophyll meters, and ceptometers, applying an automated pipeline for cleaning and advanced analysis.

The tool is dataset-agnostic, provided the spreadsheet columns map correctly to the expected variables (e.g., photosynthesis rate A, transpiration E, stomatal conductance gs, chlorophylls, leaf area index LAI/IAF, etc.).

🛠️ Main Features

Ingestion & Data Profile — Ingestion of Excel (.xlsx, .xls), CSV, or TXT/TSV files with a delimiter selector. An automatic data profile (Physiology / Generic) adapts the whole interface: physiology datasets are validated against a 31-column schema (priority tiers Required / Recommended / Optional, type coercion and coordinate-range checks); any other dataset gets a neutral summary, with no physiology-specific assumptions.
Cleaning Pipeline — Transparent, reactive cleaning. In the physiology profile: column removal, dropping records without essential metadata, removing empty grid points, and 5 modes of replicate treatment (mean, median, replicate expansion via melt, or single replicate). In the generic profile: pass-through with optional repetition aggregation by mean/median.
Exploratory Data Analysis (EDA) — Complete descriptive statistics, data completeness checks, histograms, boxplots, correlation heatmaps (Pearson, Spearman, Kendall), category composition, normality tests (Shapiro-Wilk, Anderson-Darling, D'Agostino-Pearson), multicollinearity (VIF), hotspot rankings, and a multi-method outlier consensus audit.
Regression — Bivariate regression fitting with presets tailored for crop physiology (e.g., gs vs. A, Ci vs. A, gs vs. E) with support for confidence intervals and faceting.
Predictive Modeling (Regression & Classification) — Training and comparison of scikit-learn models with cross-validation, holdout metrics, and feature-importance charts. Regression (Linear, Ridge, Random Forest, Gradient Boosting, HistGradientBoosting, Decision Tree, KNN) and Classification (Logistic, Random Forest, Decision Tree, Gradient Boosting, HistGradientBoosting, KNN, SVM, Naive Bayes) with accuracy/F1/precision/recall, confusion matrix, optional GroupKFold and feature scaling.
Experimental Statistics (designs) — Design-aware ANOVA for CRD, RCBD, Latin square, factorial (2–3 factors), split-plot, strip-plot and nested designs; assumption tests (Shapiro–Wilk, Levene) with Q–Q plots; mean-comparison procedures (Tukey, Scott-Knott, Duncan, Scheffé, LSD, Dunnett vs. control); ANCOVA; dose/polynomial regression; correlation (Pearson, Spearman, partial). Exports a reproducible Python script. Validated number-by-number against R (aov, car::Anova, emmeans, ScottKnott) — see docs/validacao_externa.md.
Spatial Analysis — Inverse Distance Weighting (IDW) interpolation, spatial autocorrelation via global Moran's I and local LISA (map of significant clusters), Getis-Ord Gi* hotspots, regular UTM grid cells aggregation, Ordinary Kriging with a fitted spherical variogram model, and an optional point-distribution map overlaid on Rio Verde (GO) municipal boundaries (via the optional geobr library, installed locally).
Time Series — Daily aggregation and STL time series decomposition (Trend, Seasonal, and Residuals).
Group Comparison — Statistical comparisons between groups (e.g., soybean vs. sugarcane) using the Mann-Whitney U test, log-linear regression, and cumulative hourly profiles.

📂 Project Layout

fisiologia-streamlit/
├── app.py                      # Streamlit app entry point
├── requirements.txt            # Production dependencies
├── pyproject.toml              # Pytest configuration
├── src/
│   ├── auth.py                 # Login integration (Supabase)
│   ├── state.py                # Session state helpers
│   ├── schema.py               # Schema specs, ranges, and profile detection
│   ├── profile.py              # Data profile (Physiology / Generic) resolution
│   ├── pipeline.py             # Cleaning and replicates pipeline
│   ├── stats_utils.py          # Experimental-design engine (ANOVA, tests, designs)
│   ├── components/             # Reusable UI components (Sidebar, Top-Filters)
│   │   ├── sidebar.py
│   │   └── filters.py
│   ├── config/
│   │   └── settings.py         # Custom style configurations (CSS) and constants
│   ├── i18n/                   # Translation module (PT, EN, ES)
│   └── pages/                  # Modules rendering individual pages
├── tests/                      # Automated unit tests (pytest)
└── docs/                       # Documentation (Architecture, Data dictionary)

🚀 How to Run Locally

1. Create and Activate Virtual Environment

python -m venv .venv

# Linux/macOS
source .venv/bin/activate

# Windows (PowerShell)
# .venv\Scripts\Activate.ps1

2. Install Dependencies

pip install -U pip wheel
pip install -r requirements.txt

3. Run the Streamlit Application

python -m streamlit run app.py

The application will open in your default browser at http://localhost:8501.

🧪 Running Unit Tests

The test suite validates data pipeline transformations and schema validation integrity:

PYTHONPATH=. .venv/bin/pytest tests/

👥 Support and Acknowledgments

This project was developed with support and funding from CNPq, CAPES, FAPEG, the Federal Institute Goiano (IF Goiano – Campus Rio Verde), and the Center of Excellence in Exponential Agriculture (CEAGRE).

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
.github/workflows		.github/workflows
.streamlit		.streamlit
Paper		Paper
assets		assets
data/sample/test		data/sample/test
docs		docs
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
.python-version		.python-version
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.es.md		README.es.md
README.md		README.md
README.pt.md		README.pt.md
app.py		app.py
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt

Sunbelt Computer Software

PL/B Language Development and Support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PhysioFlow — Crop Physiology Data Analysis Platform

🌐 Live demo →

📖 System Operation Manual →

📋 Overview

🛠️ Main Features

📂 Project Layout

🚀 How to Run Locally

1. Create and Activate Virtual Environment

2. Install Dependencies

3. Run the Streamlit Application

🧪 Running Unit Tests

👥 Support and Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Sunbelt Computer Software

PL/B Language Development and Support

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

PhysioFlow — Crop Physiology Data Analysis Platform

🌐 Live demo →

📖 System Operation Manual →

📋 Overview

🛠️ Main Features

📂 Project Layout

🚀 How to Run Locally

1. Create and Activate Virtual Environment

2. Install Dependencies

3. Run the Streamlit Application

🧪 Running Unit Tests

👥 Support and Acknowledgments

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages