Leaking Sensitive Information in Complex Document Files - and How to Prevent It
Abstract
Complex document formats such as PDF and Microsoft’s Compound File Binary Format can contain
information that is hidden but recoverable, as a result of text highlighting, cropping, or the embedding
of high-resolution JPEG images. Private information can be released inadvertently if these fi les are
distributed in electronic form. Simple experiments involving the creation of test documents can
determine whether a particular program embeds hidden information.