Testing effectiveness of genetic algorithms for exploratory data analysis
Carter, Jason W.
Bhargava, Hemant K.
Haga, William J.
MetadataShow full item record
Heuristic methods of solving exploratory data analysis problems suffer from one major weakness - uncertainty regarding the optimality of the results. The developers of DaMI (Data Mining Initiative), a genetic algorithm designed to mine the CCEP (Comprehensive Clinical Evaluation Program) database in the search for a Persian Gulf War syndrome, proposed a method to overcome this weakness: reproducibility -- the conjecture that consistent convergence on the same solutions is both necessary and sufficient to ensure a genetic algorithm has effectively searched an unknown solution space. We demonstrate the weakness of this conjecture in light of accepted genetic algorithm theory. We then test the conjecture by modifying the CCEP database with the insertion of an interesting solution of known quality and performing a discovery session using DaMI on this modified database. The necessity of reproducibility as a terminating condition is falsified by the algorithm finding the optimal solution without yielding strong reproducibility. The sufficiency of reproducibility as a terminating condition is analyzed by manual examination of the CCEP database in which strong reproducibility was experienced. Ex post facto knowledge of the solution space is used to prove that DaMI had not found the optimal solutions though it gave strong reproducibility, causing us to reject the conjecture that strong reproducibile is a sufficient terminating condition.
RightsThis publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States.
Showing items related by title, author, creator and subject.
Timmerman, Michael Jay (Monterey, California. Naval Postgraduate School, 1993-09);This research was aimed at improving the genetic algorithm used in an earlier anti-submarine warfare simulator. The problem with the earlier work was that it focused on the development of the environmental model, and did ...
Johnson, Donald C. (Monterey, California. Naval Postgraduate School, 1995-09);Human resource management is gaining recognition as being one of the core attributes of effective software project management. With the great dependency the Department of Defense (DoD) has on systems software, the development ...
Santos, Almir Garnier (Monterey, California. Naval Postgraduate School, 1993-09);The need to search effectively for objects presents itself in many civilian and military applications. This thesis develops and tests six heuristics and an optimal branch and bound procedure to solve the heretofore ...