Using Context to Disambiguate Web Captions
Rowe, Neil C.
The easiest way to index multimedia from ordinary Web pages is to find their captions. However, captions are not used consistently, and retrieval effectiveness for caption-based multimedia browsers is significantly poorer than that for text retrieval. We show that statistical "context" information about the Web pages at a site can help recognize image captions by quantifying their "representativeness". Experiments were conducted on a random sample of 5010 image captions from 3.2 million candidates from 5 million Web pages, and 1220 audio and video captions from 720,000 candidates from those same Web pages. They showed that while statistical context information was definitely a good clue, it usually did not appear to add much beyond what good local clues in the candidate caption-image pair itself provide, and provided no help for caption-audio and caption-video pairs.
This paper appeared in the Internet Computing Conference, Las Vegas, NV, June 2004.
Showing items related by title, author, creator and subject.
Rowe, Neil C.; Guglielmo, Eugene J. (Monterey, California. Naval Postgraduate School, 1992-07); NPS-CS-92-011. Descriptive natural-language captions can help organize multimedia data. We described our MARIE system that interprets English queries directing the fetch of media objects. It is novel in the extent to which it exploits ...
Rowe, Neil C. (Monterey, California. Naval Postgraduate School, 2004); We survey research on using captions in data mining from the Web. Captions are text that describes some other information (typically, multimedia). Since text is considerably easier to index and manipulate than non-text ...
Exploiting Captions for Multimedia Data Mining / Chapter in Encyclopedia of Multimedia Technology and Networking. Rowe, Neil C. (Monterey, California. Naval Postgraduate School, 2005); Captions are essential accompaniments to multimedia data objects as a way to facilitate their data mining. This article describes the kinds of possible captions and the task of recognizing them. It then discusses the forms ...