Absolute bounds on set intersection and union sizes from distribution information
Abstract
Estimation of set intersection and union sizes is important for access method selection for a database. Absolute bounds on sizes are often much easier to compute than size estimates, requiring no distributional or independence assumptions, and can answer many of the same needs. We present a large compendium of quick closed-form bounds on set intersection and union sizes, each applying to a different situation; they can be expressed as rules, and managed by rule-based or 'knowledge-base' architecture. These methods use general-purpose statistics precomputed on the data, and exploit homomorphisms (onto mappings) of the data items onto distributions that can be more easily analyzed. Our methods can be used anytime, but tend to work best when there are strong or complex correlations in the data. This circumstance is poorly addressed by the standard methods of independence-assumption and distributional-assumption estimates, and hence our methods fill a need
NPS Report Number
NPS-52-85-014Related items
Showing items related by title, author, creator and subject.
-
Absolute bounds on set intersection and union sizes from distribution information
Rowe, Neil C. (Monterey, California. Naval Postgraduate School, 1988-07);Estimation of set intersection and union sizes is important for access method selection for a database and other data retrieval problems. Absolute bounds on sizes are often easier to compute than estimates, requiring no ... -
Nonparametric cost evaluation of the Department of Defense's small parcel shipping procedures
Murphy, Douglas M. (Monterey, California. Naval Postgraduate School, 1995-09);The current Department of Defense shipping procedures do not encourage Installation Transportation Officers (ITOs) to use small parcel carriers (SPCs) for domestic ground transportation. There is still a significant amount ... -
Stress distribution in two circular cylinders intersecting at right angles under the influence of internal pressure
Harmon, Leonard Edward (Pasadena, California; California Institute of Technology, 1949);This investigation is an experimental study of the stress distribution in two circular cylinders intersecting at right angles and acted on by internal pressure. Two specimens of the thick-wall category were tested to ...