Machine-Learning Crop-Type Mapping Sensitivity to Feature Selection and Hyperparameter Tuning
| dc.contributor.author | Mayra Silvia Pérez-Flores | |
| dc.contributor.author | Frédéric Satgé | |
| dc.contributor.author | Paul Montesano | |
| dc.contributor.author | Renaud Hostache | |
| dc.contributor.author | Ramiro Pillco-Zolá | |
| dc.contributor.author | Diego Tola | |
| dc.contributor.author | Elvis Uscamayta-Ferrano | |
| dc.contributor.author | Lautaro Bustillos | |
| dc.contributor.author | Marie‐Paule Bonnet | |
| dc.contributor.author | Céline Duwig | |
| dc.coverage.spatial | Bolivia | |
| dc.date.accessioned | 2026-03-22T20:02:50Z | |
| dc.date.available | 2026-03-22T20:02:50Z | |
| dc.date.issued | 2026 | |
| dc.description.abstract | To improve crop yields and incomes, farmers consistently adapt their practices to climate and market fluctuations, resulting in highly variable crop field distribution and coverage in space and time. As these dynamics illustrate farmers’ challenges, up-to-date crop-type mapping is essential for understanding farmers’ needs and supporting their adoption of sustainable practices. With global coverage and frequent temporal observations, remote sensing data are generally integrated into machine learning models to monitor crop dynamics. Unlike physical-based models that rely on straightforward use, implementing machine learning models requires extensive user interaction. In this context, this study assesses how sensitive the models’ outputs are to feature selection and hyperparameter tuning, as both processes rely on user judgment. To achieve this, Sentinel-1 (S1) and Sentinel-2 (S2) features are integrated into five distinct models (Random Forest (RF), Support Vector Machine (SVM), Light Gradient Boosting (LGB), Histogram-based Gradient Boosting (HGB), and Extreme Gradient Boosting (XGB)), considering several features selection (Variance Inflation Factor (VIF) and Sequential Feature Selector (SFS)) and hyperparameter tuning (Grid-Search) setup. Results show that the preprocess modeling feature selection (VIF) discards the features that the wrapped method (SFS) keeps, resulting in less reliable crop-type mapping. Additionally, hyperparameter tuning appears to be sensitive to the input features, and considering it after any feature selection improved the crop-type mapping. In this context a three-step nested modeling setup, including first hyperparameter tuning, followed by a wrapped feature selection (SFS) and additional hyperparameter tuning, leads to the most reliable model outputs. For the study region, LGB and XGB (SVM) are the most (least) suitable models for crop-type mapping, and model reliability improves when integrating S1 and S2 features rather than considering S1 or S2 alone. Finally, crop-type maps are derived across different regions and time periods to highlight the benefits of the proposed method for monitoring crop dynamics in space and time. | |
| dc.identifier.doi | 10.3390/rs18040563 | |
| dc.identifier.uri | https://doi.org/10.3390/rs18040563 | |
| dc.identifier.uri | https://andeanlibrary.org/handle/123456789/79668 | |
| dc.language.iso | en | |
| dc.publisher | Multidisciplinary Digital Publishing Institute | |
| dc.relation.ispartof | Remote Sensing | |
| dc.source | Institut polytechnique de Grenoble | |
| dc.subject | Hyperparameter | |
| dc.subject | Feature selection | |
| dc.subject | Computer science | |
| dc.subject | Machine learning | |
| dc.subject | Random forest | |
| dc.subject | Artificial intelligence | |
| dc.subject | Gradient boosting | |
| dc.subject | Boosting (machine learning) | |
| dc.subject | Feature (linguistics) | |
| dc.subject | Feature vector | |
| dc.title | Machine-Learning Crop-Type Mapping Sensitivity to Feature Selection and Hyperparameter Tuning | |
| dc.type | article |