Topic modeling via scatter/gather clustering
Access full-text files
Date
2015-05
Authors
Tyler, Marcus Mitchell
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Latent variable models such as Latent Dirichlet Allocation provide rich tools for analyzing large document corpora. They can uncover a wide range of hidden information such as topics in text, communities in social networks, and patterns in images. Scatter/Gather is a clustering technique that allows users to interactively combine and split groups. When joined with latent variable models, Scatter/Gather organizes topics into themes, enables topic browsing, and improves processing time for large numbers of topics.
Department
Description
text