Blog fingerprinting: identifying anonymous posts written by an author of interest using word and character frequency analysis

Author
Dreier, David J.
Date
2009-09
Advisor
Martell, Craig H.
Second Reader
Schein, Andrew I.
Abstract
Internet blogs are an easily accessible means of global communication. Monitoring blogs for criminal and terrorist activity is a serious challenge because of blogs' anonymous nature and the sheer volume of data. The intelligence community is often faced with more information than it can process. The need exists to develop methods for processing the massive amounts of data this medium presents without a significant increase in manpower. An automated tool capable of identifying posts written by an individual, given a sample of his writing, would allow law enforcement and intelligence agencies to gather evidence that would otherwise be overlooked due to manpower and time constraints. This research focuses on identifying blog posts written by a particular author when we do not have a model of every potential author. Previous research either builds a distinct model for every possible author or limits itself to large documents. Neither approach is appropriate for processing blog posts. Blog posts tend to be short documents, and building a distinct model of each author is unreasonable if you are looking for one author among millions. We address this problem by combining sample posts by other authors to create a model of an "average author."
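The "average author" idea in the abstract can be illustrated with a minimal sketch. The abstract does not say which features or classifier the thesis uses, so everything here is an assumption made for illustration: character trigrams as the frequency features, add-one smoothing, and a log-likelihood ratio as the score. The function names (`char_ngrams`, `build_profile`, `log_prob`, `score_post`) are hypothetical and do not come from the thesis.

```python
import math
from collections import Counter


def char_ngrams(text, n=3):
    """Return the overlapping character n-grams of the lowercased text."""
    text = text.lower()
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def build_profile(posts, n=3):
    """Pool n-gram counts over a collection of posts into one profile."""
    counts = Counter()
    for post in posts:
        counts.update(char_ngrams(post, n))
    return counts


def log_prob(ngrams, profile, vocab_size):
    """Add-one smoothed log-probability of the n-grams under a profile."""
    total = sum(profile.values())
    return sum(
        math.log((profile[g] + 1) / (total + vocab_size)) for g in ngrams
    )


def score_post(post, author_profile, average_profile, n=3):
    """Log-likelihood ratio (assumed scoring rule, not the thesis's).

    A positive score means the post looks more like the author of
    interest than like the pooled "average author".
    """
    # Shared vocabulary size, plus one slot for n-grams seen in neither profile.
    vocab_size = len(set(author_profile) | set(average_profile)) + 1
    grams = char_ngrams(post, n)
    return (log_prob(grams, author_profile, vocab_size)
            - log_prob(grams, average_profile, vocab_size))


if __name__ == "__main__":
    # Toy data, invented purely to exercise the functions.
    author_posts = ["zzz qqq zzz qqq zzz", "qqq zzz qqq zzz"]
    other_posts = ["the cat sat on the mat", "a dog ran in the park"]
    author = build_profile(author_posts)
    average = build_profile(other_posts)  # one pooled model for everyone else
    for post in ["zzz qqq zzz", "the dog sat in the park"]:
        print(post, score_post(post, author, average))
```

The point of the design is that only two profiles are ever built, one for the author of interest and one pooled from everyone else, so no per-author model is needed for the millions of other writers. A decision threshold on the score would have to be chosen on held-out data.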
Related items
Showing items related by title, author, creator and subject.
- Designing a binary counter
  Weiss, Arnim Mark (Annapolis, Maryland: Naval Postgraduate School, 1949)
  As part of the curriculum of the Electronics Engineering Course at the U. S. Naval Postgraduate School the author was assigned as a junior engineer to the Engineering Products Department, RCA Victor Division, Radio Corporation ...
- Department of Operations Research technical report list, 1966-1976
  Marshall, Kneale T. (Monterey, California. Naval Postgraduate School, 1977-08); NPS55-77-35
  This report contains an alphabetic listing by author of the unclassified technical reports written in the Department of Operations Research for the period 1966-1976 inclusive. Each citation lists the title, technical report ...
- Performance review of the officer rate generator, version 1.0
  Read, Robert R. (Monterey, California. Naval Postgraduate School, 1992-10); NPS-OR-93-002
  Manpower attrition rate generator, Marine Corps, Manpower planning models, Personnel inventory cells, Shrinkage methods, Statistically stable attrition rates. Abstract: The report deals with the author's review of the ...