Reading between the lines : object localization using implicit cues from image tags

dc.contributor.advisorLease, Matthew A.en
dc.contributor.advisorGrauman, Kristen Lorraine, 1979-en
dc.creatorHwang, Sung Juen
dc.date.accessioned2010-11-10T15:14:19Zen
dc.date.available2010-11-10T15:14:19Zen
dc.date.available2010-11-10T15:14:30Zen
dc.date.issued2010-05en
dc.date.submittedMay 2010en
dc.date.updated2010-11-10T15:14:30Zen
dc.descriptiontexten
dc.description.abstractCurrent uses of tagged images typically exploit only the most explicit information: the link between the nouns named and the objects present somewhere in the image. We propose to leverage “unspoken” cues that rest within an ordered list of image tags so as to improve object localization. We define three novel implicit features from an image’s tags—the relative prominence of each object as signified by its order of mention, the scale constraints implied by unnamed objects, and the loose spatial links hinted by the proximity of names on the list. By learning a conditional density over the localization parameters (position and scale) given these cues, we show how to improve both accuracy and efficiency when detecting the tagged objects. We validate our approach with 25 object categories from the PASCAL VOC and LabelMe datasets, and demonstrate its effectiveness relative to both traditional sliding windows as well as a visual context baseline.en
dc.description.departmentComputer Sciencesen
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttp://hdl.handle.net/2152/ETD-UT-2010-05-1514en
dc.language.isoengen
dc.subjectComputer visionen
dc.subjectObject recognitionen
dc.subjectObject detectionen
dc.titleReading between the lines : object localization using implicit cues from image tagsen
dc.type.genrethesisen
thesis.degree.departmentComputer Sciencesen
thesis.degree.disciplineComputer Sciencesen
thesis.degree.grantorUniversity of Texas at Austinen
thesis.degree.levelMastersen
thesis.degree.nameMaster of Artsen

Access full-text files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
HWANG-THESIS.pdf
Size:
1.67 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.12 KB
Format:
Plain Text
Description: