Multi-timescale representation learning in LSTM Language Models
dc.contributor.advisor | Huth, Alexander G. | |
dc.creator | Mahto, Shivangi | |
dc.date.accessioned | 2022-11-21T22:57:51Z | |
dc.date.available | 2022-11-21T22:57:51Z | |
dc.date.created | 2020-05 | |
dc.date.issued | 2022-10-06 | |
dc.date.submitted | May 2020 | |
dc.date.updated | 2022-11-21T22:57:52Z | |
dc.description.abstract | Representations within Language Models (LMs) are difficult to interpret. For example, how different layers of an LSTM LM retain information over different periods of time is unclear. In this paper, we present methods to interpret and control the timescale of information routing through an LSTM unit. We found out that a standard LSTM LM favors representations of small timescale information (up to 20 tokens). We then introduce a prior based on statistical properties of natural language, which is applied on the distribution of timescale across LSTM units to achieve an effective multi-timescale LM. The proposed model learns representations of both short as well long timescale. It also achieves better prediction performance than a standard LSTM LM on Penn Treebank and WikiText-2 datasets, especially on rare words. | |
dc.description.department | Computer Science | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | https://hdl.handle.net/2152/116770 | |
dc.identifier.uri | http://dx.doi.org/10.26153/tsw/43665 | |
dc.language.iso | en | |
dc.subject | Language Models | |
dc.subject | Timescale learning | |
dc.subject | LSTMs | |
dc.subject | Natural language processing | |
dc.subject | Machine Learning | |
dc.title | Multi-timescale representation learning in LSTM Language Models | |
dc.type | Thesis | |
dc.type.material | text | |
thesis.degree.department | Computer Sciences | |
thesis.degree.discipline | Computer Science | |
thesis.degree.grantor | The University of Texas at Austin | |
thesis.degree.level | Masters | |
thesis.degree.name | Master of Science in Computer Sciences |
Access full-text files
Original bundle
1 - 1 of 1