Identifying, attributing, and overcoming common data quality issues of manned station observations

dc.contributor.authorStefan Hunziker
dc.contributor.authorStefanie Gubler
dc.contributor.authorJuan Marcos Calle
dc.contributor.authorIsabel Moreno
dc.contributor.authorMarcos Andrade
dc.contributor.authorFernando Velarde
dc.contributor.authorLaura Ticona
dc.contributor.authorGualberto Carrasco
dc.contributor.authorYaruska Castellón
dc.contributor.authorClara Oria
dc.coverage.spatialBolivia
dc.date.accessioned2026-03-22T13:54:30Z
dc.date.available2026-03-22T13:54:30Z
dc.date.issued2017
dc.descriptionCitaciones: 98
dc.description.abstractABSTRACT In situ climatological observations are essential for studies related to climate trends and extreme events. However, in many regions of the globe, observational records are affected by a large number of data quality issues. Assessing and controlling the quality of such datasets is an important, often overlooked aspect of climate research. Besides analysing the measurement data, metadata are important for a comprehensive data quality assessment. However, metadata are often missing, but may partly be reconstructed by suitable actions such as station inspections. This study identifies and attributes the most important common data quality issues in Bolivian and Peruvian temperature and precipitation datasets. The same or similar errors are found in many other predominantly manned station networks worldwide. A large fraction of these issues can be traced back to measurement errors by the observers. Therefore, the most effective way to prevent errors is to strengthen the training of observers and to establish a near real‐time quality control ( QC ) procedure. Many common data quality issues are hardly detected by usual QC approaches. Data visualization, however, is an effective tool to identify and attribute those issues, and therefore enables data users to potentially correct errors and to decide which purposes are not affected by specific problems. The resulting increase in usable station records is particularly important in areas where station networks are sparse. In such networks, adequate selection and treatment of time series based on a comprehensive QC procedure may contribute to improving data homogeneity more than statistical data homogenization methods.
dc.identifier.doi10.1002/joc.5037
dc.identifier.urihttps://doi.org/10.1002/joc.5037
dc.identifier.urihttps://andeanlibrary.org/handle/123456789/43421
dc.language.isoen
dc.publisherWiley
dc.relation.ispartofInternational Journal of Climatology
dc.sourceOeschger Centre for Climate Change Research
dc.subjectMetadata
dc.subjectComputer science
dc.subjectData quality
dc.subjectData mining
dc.subjectData collection
dc.subjectQuality (philosophy)
dc.subjectData science
dc.subjectMissing data
dc.subjectUSable
dc.titleIdentifying, attributing, and overcoming common data quality issues of manned station observations
dc.typearticle

Files