Language identification by statistical analysis.

Author
Rau, Morton David
Date
1974-09
Advisor
Weitzman, R.A.
Power, V.M.
Abstract
An analysis was conducted of English and Spanish text. The statistical analysis determined the independent probability of letters and the joint probability of various letter combinations for large samples of each language. Various methods were then tested for using these characteristics to identify the language of a short sample text. Using the joint probability of various vowel-consonant relationships and the Kolmogorov-Smirnov goodness-of-fit test, an identification system was defined that provided a significance level of 0.0077 for a sample of 107 letters (approximately 21 words). Investigation also showed that the space rate, or the interword structure, in each language contains a measure of intelligence and was useful in identification.
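The abstract does not say which vowel-consonant relationships were used or how the goodness-of-fit test was applied, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes the features are the four within-word letter-pair classes (vowel-vowel, vowel-consonant, consonant-vowel, consonant-consonant), that the reference distributions are estimated from caller-supplied text, and that the language with the smallest Kolmogorov-Smirnov-style statistic (the largest gap between cumulative distributions) is chosen. All function names, the vowel set, and the category order are hypothetical and are not taken from the thesis.

```python
from collections import Counter
from itertools import accumulate

# Assumption: vowel set covering English and accented Spanish vowels.
VOWELS = set("aeiouáéíóúü")
# Assumption: a fixed (arbitrary) category order for the cumulative sums.
CATEGORIES = ("VV", "VC", "CV", "CC")


def vc_distribution(text):
    """Relative frequency of vowel/consonant letter-pair classes within words."""
    counts = Counter()
    for word in text.lower().split():
        letters = [ch for ch in word if ch.isalpha()]
        classes = ["V" if ch in VOWELS else "C" for ch in letters]
        # Count adjacent pairs inside the word only, never across spaces.
        for a, b in zip(classes, classes[1:]):
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("text contains no letter pairs")
    return [counts[c] / total for c in CATEGORIES]


def ks_statistic(p, q):
    """Largest absolute gap between the two cumulative distributions."""
    return max(abs(a - b) for a, b in zip(accumulate(p), accumulate(q)))


def identify(sample, references):
    """Return the closest reference language and the per-language statistics.

    `references` maps a language name to a large reference text.
    """
    sample_dist = vc_distribution(sample)
    scores = {
        lang: ks_statistic(sample_dist, vc_distribution(ref))
        for lang, ref in references.items()
    }
    return min(scores, key=scores.get), scores
```

A caller would pass a short sample and a dictionary of reference corpora, for example `identify(sample_text, {"english": english_corpus, "spanish": spanish_corpus})`, where the corpus variables are placeholders for large texts. Because the pair classes have no natural order, the statistic depends on the chosen category order; turning it into a significance level such as the one quoted in the abstract would additionally require the sample size and a Kolmogorov-Smirnov critical-value table, which this sketch omits.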
Rights
This publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States.
Related items
Showing items related by title, author, creator and subject.
- UNDERSTANDING THE ROLE OF OVERT AND COVERT ONLINE COMMUNICATION IN INFORMATION OPERATIONS
  Dimitrov, Boyan I. (Monterey, CA; Naval Postgraduate School, 2022-03); This thesis combines regression, sentiment, and social network analysis to explore how Russian online media agencies, both overt and covert, affect online communication on Twitter when North Atlantic Treaty Organization ...
- Bi-spectral method for radar target recognition
  Yeo, Jiunn Wah (Monterey California. Naval Postgraduate School, 2006-12); Target recognition and identification in battlefields has been a crucial determinant to the ultimate success or failure of modern military campaigns. Since World War II, the Identification of Friend or Foe (IFF) systems ...
- Language Translation for File Paths
  Rowe, Neil C.; Schwamm, Riqui; Garfinkel, Simson L. (Monterey, California. Naval Postgraduate School, 2013); Forensic examiners are frequently confronted with content in languages that they do not understand, and they could benefit from machine translation into their native language. But automated translation of file paths is a ...