oru.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Kernel imputation: a method to reduce bias of hot deck imputation
Department of Statistics, Stockholm University, Stockholm, Sweden.ORCID iD: 0000-0002-3888-4695
2011 (English)Conference paper, Oral presentation with published abstract (Other academic)
Abstract [en]

Consider a sample from a finite population with missing values, where the goal is to estimate some finite population characteristic, typically a mean or a total. One way of handling the missing values is hot deck imputation. Each donee unit that has some missing values is then matched up with a pool of donor units, based on the similarity between values that are observed both on the donee and its potential donors. The missing values are then filled in by copies from corresponding observed values on units that are (randomly) drawn from the donor pool.

Hot deck imputation is good at preserving distributions among variables, and therefore provides robustness to nonlinear relationships. Estimates may however suffer from bias, if the continuity of the observed variables is not sufficiently accounted for in the matching of the donee to its potential donors, for example if continuous variables are categorized. The bias is especially evident if the donee is located at the boundary of the observed data.

By incorporating several ideas from kernel density estimation, we propose how to reduce the bias of hot deck imputation. Also, as a way of accounting for imputation uncertainty through multiple imputation, we base our method on Lo’s (1988) finite population Bayesian bootstrap.

Results from simulations show that our method performs at least as well as competing methods for the estimation of means and confidence intervals, especially given a larger sample size and nonlinear relationships among the variables.

Place, publisher, year, edition, pages
2011.
Keywords [en]
Kernel imputation; Finite population; Bayesian bootstrap; Hot deck imputation; Boundary bias
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:oru:diva-66543OAI: oai:DiVA.org:oru-66543DiVA, id: diva2:1197025
Conference
Third Baltic-Nordic Conference on Survey Statistics, Norrfällsviken, Sweden, June 13-17, 2011
Available from: 2018-04-11 Created: 2018-04-11 Last updated: 2018-04-12Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Kernel imputation

Authority records BETA

Pettersson, Nicklas

Search in DiVA

By author/editor
Pettersson, Nicklas
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 123 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf