Factorized adapters for robust domain adaptation in ASR
dc.contributor.advisor | Harwath, David | |
dc.contributor.committeeMember | Mooney, Raymond J | |
dc.creator | Maddela, Sai Kiran | |
dc.date.accessioned | 2023-08-22T01:08:38Z | |
dc.date.available | 2023-08-22T01:08:38Z | |
dc.date.created | 2023-05 | |
dc.date.issued | 2023-04-20 | |
dc.date.submitted | May 2023 | |
dc.date.updated | 2023-08-22T01:08:39Z | |
dc.description.abstract | Automatic Speech Recognition (ASR) performance is readily degraded by environmental noise, reverberation, and other forms of distortion. A significant challenge in noise robust ASR is dealing with forms of environmental noise that were unseen during model training, especially when multiple types of distortion are present simultaneously. In this thesis, we propose the use of factorized adapters for robust ASR. By utilizing a framework that enables us to compose multiple adapters that have each been trained to model one type of noise, we demonstrate that our models can generalize in a zero-shot fashion to distortion combinations that were unseen during training. | |
dc.description.department | Computer Science | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | https://hdl.handle.net/2152/121172 | |
dc.identifier.uri | http://dx.doi.org/10.26153/tsw/48002 | |
dc.language.iso | en | |
dc.subject | Robust speech recognition | |
dc.subject | Domain adaptation | |
dc.title | Factorized adapters for robust domain adaptation in ASR | |
dc.type | Thesis | |
dc.type.material | text | |
thesis.degree.department | Computer Sciences | |
thesis.degree.discipline | Computer Science | |
thesis.degree.grantor | The University of Texas at Austin | |
thesis.degree.level | Masters | |
thesis.degree.name | Master of Science in Computer Sciences |
Access full-text files
Original bundle
1 - 1 of 1