Fusion of Hard and Soft Information in Nonparametric Density Estimation
Authors
Royset, Johannes O.
Wets, Roger J-B
Subjects
density estimation
data analytics
data fusion
epi-splines
data analytics
data fusion
epi-splines
Advisors
Date of Issue
2015-06-10
Date
June 10, 2015
Publisher
Language
Abstract
This article discusses univariate density estimation in situations when the sample (hard
information) is supplemented by “soft” information about the random phenomenon. These situations
arise broadly in operations research and management science where practical and computational reasons
severely limit the sample size, but problem structure and past experiences could be brought in. In
particular, density estimation is needed for generation of input densities to simulation and stochastic
optimization models, in analysis of simulation output, and when instantiating probability models. We
adopt a constrained maximum likelihood estimator that incorporates any, possibly random, soft information
through an arbitrary collection of constraints. We illustrate the breadth of possibilities by
discussing soft information about shape, support, continuity, smoothness, slope, location of modes,
symmetry, density values, neighborhood of known density, moments, and distribution functions. The
maximization takes place over spaces of extended real-valued semicontinuous functions and therefore
allows us to consider essentially any conceivable density as well as convenient exponential transformations.
The infinite dimensionality of the optimization problem is overcome by approximating splines
tailored to these spaces. To facilitate the treatment of small samples, the construction of these splines
is decoupled from the sample. We discuss existence and uniqueness of the estimator, examine consistency
under increasing hard and soft information, and give rates of convergence. Numerical examples
illustrate the value of soft information, the ability to generate a family of diverse densities, and the
effect of misspecification of soft information.
Type
Article
Description
Series/Report No
Department
Operations Research (OR)
Organization
Naval Postgraduate School (U.S.)
Identifiers
NPS Report Number
Sponsors
U.S. Army Research Laboratory and the U.S. Army Research Office grant 00101-80683
U.S. Army Research Laboratory and the U.S. Army Research Office grant W911NF-10-1-0246
U.S. Army Research Laboratory and the U.S. Army Research Office grant W911NF-12-1-0273
U.S. Army Research Laboratory and the U.S. Army Research Office grant W911NF-10-1-0246
U.S. Army Research Laboratory and the U.S. Army Research Office grant W911NF-12-1-0273
Funder
U.S. Army Research Laboratory and the U.S. Army Research Office grant 00101-80683
U.S. Army Research Laboratory and the U.S. Army Research Office grant W911NF-10-1-0246
U.S. Army Research Laboratory and the U.S. Army Research Office grant W911NF-12-1-0273
U.S. Army Research Laboratory and the U.S. Army Research Office grant W911NF-10-1-0246
U.S. Army Research Laboratory and the U.S. Army Research Office grant W911NF-12-1-0273
Format
39 p.
Citation
J.O. Royset and R. J-B Wets, 2015, "Fusion of Hard and Soft Information in Nonparametric Density Estimation," European J. Operational Research, Vol. 247, No. 2, pp. 532-547.
Distribution Statement
Rights
This publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States.