Image captioning algorithms for images taken by people with visual impairments
dc.contributor.advisor | Gurari, Danna | |
dc.creator | Zhang, Meng, M.S. in Information Studies | |
dc.creator.orcid | 0000-0003-4505-6983 | |
dc.date.accessioned | 2020-05-07T18:49:24Z | |
dc.date.available | 2020-05-07T18:49:24Z | |
dc.date.created | 2019-05 | |
dc.date.issued | 2019-07-08 | |
dc.date.submitted | May 2019 | |
dc.date.updated | 2020-05-07T18:49:25Z | |
dc.description.abstract | People with visual impairments regularly encounter the challenge that their visual impairments expose them to a time-consuming, or even impossible, task: what content is presented in an image without assistance. One method to address this problem is image captioning with machine learning. With the help of image captioning algorithms together with artificial intelligence speech system, people who are blind can instantly learn what is in an image, since such systems can automatically generate text captions. In this work, we analyze the new VizWiz dataset and compare it to the MSCOCO dataset, which is widely used for evaluating the performance of image captioning algorithms. We also implement and evaluate two state-of-the-art image caption models with accuracy, runtime, and resource analysis. Hopefully, our research will help the improvement of image captioning algorithms which focus on fulfilling the everyday needs of people with visual impairments | |
dc.description.department | Information | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | https://hdl.handle.net/2152/81211 | |
dc.identifier.uri | http://dx.doi.org/10.26153/tsw/8224 | |
dc.language.iso | en | |
dc.subject | Image captioning | |
dc.subject | Computer vision | |
dc.subject | Deep learning | |
dc.subject | People with visual impairments | |
dc.title | Image captioning algorithms for images taken by people with visual impairments | |
dc.type | Thesis | |
dc.type.material | text | |
thesis.degree.department | Information | |
thesis.degree.discipline | Information Studies | |
thesis.degree.grantor | The University of Texas at Austin | |
thesis.degree.level | Masters | |
thesis.degree.name | Master of Science in Information Studies |
Access full-text files
Original bundle
1 - 1 of 1
Loading...
- Name:
- ZHANG-MASTERSREPORT-2019.pdf
- Size:
- 4.27 MB
- Format:
- Adobe Portable Document Format