To Örebro University

oru.seÖrebro University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Estimation with probability edited survey data under nonresponse
Örebro University, Örebro University School of Business.
2025 (English)Report (Other academic)
Abstract [en]

Probabilistic editing has been introduced to enable valid inference using established survey sampling theory in situations when some of the collected data points may have measurement errors and are therefore submitted to an editing process. To reduce the editing effort anavoid over-editing, in current practice selective editing is most often used, which is a form of editing that limits the edit checks to those potential errors that, if indeed in error, are likely to have the biggest impact on estimates to be produced. However, selective editing is not grounded in probability theory associated with survey sampling, and cannot provide expressions for point and variance estimates that account for the uncertainties introduced by selective editing.

In the spirit of the total survey error paradigm, this paper extends the previous work on probabilistic editing by proposing an estimation procedure that provides valid inference when two kinds of nonsampling error are simultaneously present, in addition to the sampling error: the measurement error, requiring an editing step, and the practically unavoidable nonresponse error which also needs to be taken into account when producing unbiased estimates.

In a three-phase selection setup, bias due to measurement error is estimated through probabilistic editing while weight adjustment employing auxiliary information is used to deal with nonresponse. An estimator based on calibration for nonresponse and corrected for bias due to measurement error is introduced. Its theoretical variance and an estimator of the variance are derived. A simulation study illustrates the three-phase selection setup and the practical performance of the derived point and variance estimators.

Place, publisher, year, edition, pages
Örebro: Örebro University School of Business , 2025. , p. 43
Series
Working Papers, School of Business, ISSN 1403-0586 ; 3/2025
Keywords [en]
nonsampling errors, probabilistic editing, selective editing, calibration estimator, measurement bias estimation
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:oru:diva-120398OAI: oai:DiVA.org:oru-120398DiVA, id: diva2:1949604
Available from: 2025-04-03 Created: 2025-04-03 Last updated: 2025-04-03Bibliographically approved
In thesis
1. Probabilistic Approach to Data Editing: Contributions to Editing in Survey Sampling
Open this publication in new window or tab >>Probabilistic Approach to Data Editing: Contributions to Editing in Survey Sampling
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The efficiency and quality of data editing processes are challenges for National Statistical Institutes (NSIs) in producing reliable official statistics. The traditional approach to data editing, heavily reliant on manual interventions, is resource-intensive and may introduce biases, impacting the overall accuracy of statistical estimates. This thesis aims to address these challenges by developing an in-novative editing framework based on probabilistic theory, allowing for a more resource-efficient editing process while providing accurate estimates of data quality. Furthermore, the thesis proposes an estimation procedure that accounts for various error sources, offering unbiased estimates of population parameters with appropri-ate measures of accuracy.

In addition to the introductory part, the thesis is structured around four key papers, each contributing to the overall objective of improving data editing and estimation processes in official statistics. Paper I presents a combined selective and probabilistic editing approach that maintains data quality while reducing resource demands. Paper II explores the integration of probabilistic editing with generalized regression (GREG) estimation, demonstrating improved accuracy in population parameter estimation. Paper III extends the framework to address nonresponse errors alongside measurement errors, using a three-phase sampling setup. Paper IV investigates the impact of various score functions in the probabilis-tic editing framework, emphasizing the importance of selecting effective score functions to minimize variance and improve estimate accuracy. Each paper contains, in addition to a theoretical part, an empirical section where concepts are numerically illustrated based on either real data or synthetic data.

Place, publisher, year, edition, pages
Örebro: Örebro University, 2025. p. 27
Series
Örebro Studies in Statistics, ISSN 1651-8608 ; 10
Keywords
data editing, selective editing, measurement error, survey statistics
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:oru:diva-120183 (URN)9789175296395 (ISBN)9789175296401 (ISBN)
Public defence
2025-04-15, Örebro universitet, Långhuset, Hörsal L3, Fakultetsgatan 1, Örebro, 13:15 (English)
Opponent
Supervisors
Available from: 2025-03-24 Created: 2025-03-24 Last updated: 2025-04-09Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Free full text

Authority records

Ilves, Maiki

Search in DiVA

By author/editor
Ilves, Maiki
By organisation
Örebro University School of Business
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 41 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf