Developing a Dark Web Collection and Infrastructure for Computational and Social Sciences

Loading...
Thumbnail Image
Authors
Zhang, D.
Zeng, S.
Huang, C-N
Fan, L.
Yu, X.
Dang, Y.
Larson, C.
Denning, D.
Roberts, Nancy C.
Chen, H.
Subjects
Incremental forum spidering
Multilingual translation
Social network visualization
Dark Web archive
Advisors
Date of Issue
2010
Date
2010
Publisher
Language
Abstract
In recent years, there have been numerous studies from a variety of perspectives analyzing the Internet presence of hate and extremist groups. Yet the websites and forums of extremist and terrorist groups have long remained an underutilized resource for terrorism researchers due to their ephemeral nature and access and analysis problems. The purpose of the Dark Web archive is to provide a research infrastructure for use by social scientists, computer and information scientists, policy and security analysts, and others studying a wide range of social and organizational phenomena and computational problems. The Dark Web Forum Portal provides web enabled access to critical international jihadist and other extremist web forums. The focus of this paper is on the significant extensions to previous work including: increasing the scope of data collection, adding an incremental spidering component for regular data updates; enhancing the searching and browsing functions; enhancing multilingual machine-translation for Arabic, French, German and Russian; and advanced Social Network Analysis. A case study on identifying active participants is shown at the end.
Type
Article
Description
Proceedings of the 2010 IEEE International Conference on Intelligence and Security Informatics (ISI 2010).
Series/Report No
Department
Defense Analysis (DA)
Organization
Naval Postgraduate School (U.S.)
Identifiers
NPS Report Number
Sponsors
Funder
Format
Citation
Zhang, D., Zeng, S., Huang, C-N, Fan, L., Yu, X., Dang, Y., Larson, C., Denning, D., Roberts, N., Chen, H., “Developing a Dark Web Collection and Infrastructure for Computational and Social Sciences,” Proceedings of the 2010 IEEE International Conference on Intelligence and Security Informatics (ISI 2010).
Distribution Statement
Rights
This publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States.
Collections