HYBRID SIS AND MARKOV CHAIN MONTE CARLO SAMPLING METHODOLOGY FOR GOODNESS-OF-FIT TESTS ON CONTINGENCY TABLES
Loading...
Authors
Saluke, Patrick M.
Advisors
Yoshida, Ruriko
Second Readers
Carlyle, W M.
Subjects
Markov Chain Monte Carlo
MCMC
Sequential Importance Sampling
SIS
sparse
multidimensional contingency table
MCMC
Sequential Importance Sampling
SIS
sparse
multidimensional contingency table
Date of Issue
2018-09
Date
Publisher
Monterey, CA; Naval Postgraduate School
Language
Abstract
Logistic regression is one of the most popular means of modeling contingency table data due to its ease of use. Simple asymptotic inference (like a χ2 approximation) for evaluating goodness-of-fit tests, however, may not be valid for sparse datasets having cell counts less than 5. In these cases, we often attempt exact conditional inference via a sampler, such as Markov Chain Monte Carlo (MCMC) or Sequential Importance Sampling (SIS). This paper proposes a hybrid sampling scheme that combines MCMC and SIS to sample sparse, multidimensional contingency tables satisfying fixed marginals when MCMC alone does not guarantee an exhaustive sampling of the conditional state space. To investigate its suitability, the proposed hybrid scheme is applied to an observational dataset from Alzheimer's researcher JA Mortimer measuring the cognitive states of nuns over a 15-year period beginning in 1991. Through the application of our proposed scheme, we find the estimated p-values via a hybrid MCMC and SIS sampler are remarkably similar to the χ2 asymptotic approximation p-values, even for sparse contingency tables.
Type
Thesis
Description
Series/Report No
Department
Operations Research (OR)
Organization
Identifiers
NPS Report Number
Sponsors
Funding
Format
Citation
Distribution Statement
Approved for public release; distribution is unlimited.
Rights
This publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States.
