Automated metadata extraction

Download
Author
Migletz, James J.
Date
2008-06Advisor
Garfinkel, Simson
Second Reader
Squire, Kevin
Metadata
Show full item recordAbstract
Metadata is data that describes data. There are many computer forensic uses of metadata and being able to extract metadata automatically provides positive forensic implications. This thesis presents a new technique for batch processing disk images and automatically extracting metadata from files and file contents. The technique is embodied in a program called fiwalk that has a plug-in architecture allowing new metadata extractors to be readily incorporated. Output from fiwalk can be provided in multiple formats such as ARFF and text. The plug-ins created for this thesis include one created by Simson Garfinkel for extracting metadata from .jpeg files, two for Microsoft Office documents (one for prior to Office 2007 release and one for Office 2007 release), and a default plug-in for extracting metadata from .gif, .pdf, and .mp3 files. To better understand the metadata available in common file formats such as .doc, .docx, .odt, .pdf, .mp3, .mp4, .jpeg, tiff, and .gif, an examination of these formats is provided.
Collections
Related items
Showing items related by title, author, creator and subject.
-
Expeditionary Mine Countermeasures (ExMCM) C4I Requirements (Continuation)
Das, Arijit (Monterey, California: Naval Postgraduate SchoolMonterey, California. Naval Postgraduate School, 2019-12); NPS-19-N065-AThe ExMCM is a broad program providing an innovative approach to the Mine Warfare mission area, required to operate with both U.S Navy and U.S. Marine Corps forces. The large number of sonar imagery files (from the MK18 ... -
Robodata Archive for Visualizing CRUSER Unmanned System Field Experimentation [video]
Brutzman, Don (Naval Postgraduate School, Monterey, California, 2017-04-11);NPS performs many experiments with unmanned systems but few projects are able record results in a systematic reusable way. The robodata.nps.edu project is designed to establish NPS data-collection capabilities for a wide ... -
Discovery and Reuse of Modeling and Simulation Assets / Paper 10S-SIW-048
Gustavson, Paul; Dumanoir, Paul; Blais, Curtis; Daehler-Wilking, Richard (2010);The ability to discover existing modeling and simulation (M&S) assets is a critical need for enabling effective reuse and for reducing the duplication of capabilities. Such visibility and accessibility is key to optimizing the ...