Detection of erroneous payments utilizing supervised and utilizing supervised and unsupervised data mining techniques

Download
Author
Yanik, Todd E.
Date
2004Advisor
Buttrey, Samuel E.
Second Reader
Whitaker, Lyn R.
Metadata
Show full item recordAbstract
In this thesis we develop a procedure for detecting erroneous payments in the Defense Finance Accounting Service, Internal Review's (DFAS IR) Knowledge Base Of Erroneous Payments (KBOEP), with the use of supervised (Logistic Regression) and unsupervised (Classification and Regression Trees (C & RT)) modeling algorithms. S-Plus software was used to construct a supervised model of vendor payment data using Logistic Regression, along with the Hosmer-Lemeshow Test, for testing the predictive ability of the model. The Clementine Data Mining software was used to construct both supervised and unsupervised model of vendor payment data using Logistic Regression and C & RT algorithms. The Logistic Regression algorithm, in Clementine, generated a model with predictive probabilities, which were compared against the C & RT algorithm. In addition to comparing the predictive probabilities, Receiver Operating Characteristic (ROC) curves were generated for both models to determine which model provided the best results for a Coincidence Matrix's True Positive, True Negative, False Positive and False Negative Fractions. The best modeling technique was C & RT and was given to DFAS IR to assist in reducing the manual record selection process currently being used. A recommended ruleset was provided, along with a detailed explanation of the algorithm selection process.
Rights
This publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States.Collections
Related items
Showing items related by title, author, creator and subject.
-
An improved unsupervised modeling methodology for detecting fraud in vendor payment transactions
Rouillard, Gregory W. (Monterey, California. Naval Postgraduate School, 2003);(DFAS) vendor payment transactions through Unsupervised Modeling (cluster analysis). Clementine Data Mining software is used to construct unsupervised models of vendor payment data using the K-Means, Two Step, and Kohonen ... -
Using WorldView-2 to determine bottom-type and bathymetry
Lee, Krista R.; Kim, Angela M.; Olsen, R.C.; Kruse, Fred A. (SPIE, 2011);Observations taken from DigitalGlobeâ s WorldView-2 (WV-2) sensor were analyzed for bottom-type and bathymetry for data taken at Guam and Tinian in late February and early March of 2010. Classification of bottom type was ... -
A Method for Automated Cavitation Detection with Adaptive Thresholds
Gregg, Seth W.; Steele, John P.H.; Van Bossuyt, Douglas L. (Wiley, 2018);Hydroturbine operators who wish to collect cavitation intensity data to estimate cavitation erosion rates and calculate remaining useful life (RUL) of the turbine runner face several practical challenges related to long ...