Machine-Learning Crop-Type Mapping Sensitivity to Feature Selection and Hyperparameter Tuning

Mayra Silvia Pérez-Flores; Frédéric Satgé; Paul Montesano; Renaud Hostache; Ramiro Pillco-Zolá; Diego Tola; Elvis Uscamayta-Ferrano; Lautaro Bustillos; Marie‐Paule Bonnet; Céline Duwig

doi:10.3390/rs18040563

Machine-Learning Crop-Type Mapping Sensitivity to Feature Selection and Hyperparameter Tuning

dc.contributor.author	Mayra Silvia Pérez-Flores
dc.contributor.author	Frédéric Satgé
dc.contributor.author	Paul Montesano
dc.contributor.author	Renaud Hostache
dc.contributor.author	Ramiro Pillco-Zolá
dc.contributor.author	Diego Tola
dc.contributor.author	Elvis Uscamayta-Ferrano
dc.contributor.author	Lautaro Bustillos
dc.contributor.author	Marie‐Paule Bonnet
dc.contributor.author	Céline Duwig
dc.coverage.spatial	Bolivia
dc.date.accessioned	2026-03-22T20:02:50Z
dc.date.available	2026-03-22T20:02:50Z
dc.date.issued	2026
dc.description.abstract	To improve crop yields and incomes, farmers consistently adapt their practices to climate and market fluctuations, resulting in highly variable crop field distribution and coverage in space and time. As these dynamics illustrate farmers’ challenges, up-to-date crop-type mapping is essential for understanding farmers’ needs and supporting their adoption of sustainable practices. With global coverage and frequent temporal observations, remote sensing data are generally integrated into machine learning models to monitor crop dynamics. Unlike physical-based models that rely on straightforward use, implementing machine learning models requires extensive user interaction. In this context, this study assesses how sensitive the models’ outputs are to feature selection and hyperparameter tuning, as both processes rely on user judgment. To achieve this, Sentinel-1 (S1) and Sentinel-2 (S2) features are integrated into five distinct models (Random Forest (RF), Support Vector Machine (SVM), Light Gradient Boosting (LGB), Histogram-based Gradient Boosting (HGB), and Extreme Gradient Boosting (XGB)), considering several features selection (Variance Inflation Factor (VIF) and Sequential Feature Selector (SFS)) and hyperparameter tuning (Grid-Search) setup. Results show that the preprocess modeling feature selection (VIF) discards the features that the wrapped method (SFS) keeps, resulting in less reliable crop-type mapping. Additionally, hyperparameter tuning appears to be sensitive to the input features, and considering it after any feature selection improved the crop-type mapping. In this context a three-step nested modeling setup, including first hyperparameter tuning, followed by a wrapped feature selection (SFS) and additional hyperparameter tuning, leads to the most reliable model outputs. For the study region, LGB and XGB (SVM) are the most (least) suitable models for crop-type mapping, and model reliability improves when integrating S1 and S2 features rather than considering S1 or S2 alone. Finally, crop-type maps are derived across different regions and time periods to highlight the benefits of the proposed method for monitoring crop dynamics in space and time.
dc.identifier.doi	10.3390/rs18040563
dc.identifier.uri	https://doi.org/10.3390/rs18040563
dc.identifier.uri	https://andeanlibrary.org/handle/123456789/79668
dc.language.iso	en
dc.publisher	Multidisciplinary Digital Publishing Institute
dc.relation.ispartof	Remote Sensing
dc.source	Institut polytechnique de Grenoble
dc.subject	Hyperparameter
dc.subject	Feature selection
dc.subject	Computer science
dc.subject	Machine learning
dc.subject	Random forest
dc.subject	Artificial intelligence
dc.subject	Gradient boosting
dc.subject	Boosting (machine learning)
dc.subject	Feature (linguistics)
dc.subject	Feature vector
dc.title	Machine-Learning Crop-Type Mapping Sensitivity to Feature Selection and Hyperparameter Tuning
dc.type	article

Collections

Artículo Científico Publicado

Machine-Learning Crop-Type Mapping Sensitivity to Feature Selection and Hyperparameter Tuning

Files

Collections