Megacity analysis: A clustering approach to classification
Housley, Jimmy L.
Whitaker, Lyn R.
Appleget, Jeffrey A.
MetadataShow full item record
Megacities are characterized by large populations (at least 10 million) and interdependent infrastructure, demographic, economic, and government networks (the four pillars). To be successful in future operations, the military must expand its understanding of megacities and their networks. In particular the Joint Warfare Analysis Center (JWAC) is interested in these megacity networks and their implications for potential urban operations. We develop a methodology to group like megacities into five clusters. With 33 variables describing the four pillars, we construct a data set using over 90 data sources for 41 large urban areas. This work greatly expands previous work in both the number of cities studied and the number of variables used. We also study clustering sensitivity to missing values by generating an ensemble of 5,000 clusterings based on randomly imputed missing values. We compare these to clustering without imputation, the ensemble consensus or average clustering, and clusterings from previous studies in addition to identifying which cities are sensitive to missing values. Our work not only informs JWAC of the similarities and differences between the 41 cities studied, it provides a method to identify for which cities, more data collection is warranted, and it provides a blueprint for future work in this area.
RightsThis publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States.
Showing items related by title, author, creator and subject.
Miller, Gregory A.; Moore, Scott M.; Beery, Paul T.; Parker, Gary W. (2012-12);This paper analyzes faculty scholarly collaboration networks at the Naval Postgraduate School (NPS). A brief description of the mission and organization of NPS provides background. The source of the network data is ...
Buttrey, Samuel E.; Whitaker, Lyn R. (2015);This paper describes treeClust, an R package that produces dissimilarities useful for clustering. These dissimilarities arise from a set of classification or regression trees, one with each variable in the data acting ...
Breckon, Thomas Joseph (Monterey, California. Naval Postgraduate School, 1970-06);The partition problem is that step in the layout problem in which it must be decided which of the elementary digital circuits are to be coalesced into a single, electronic package. A solution of the partition problem must ...