Topic modeling via scatter/gather clustering

Date

2015-05

Authors

Tyler, Marcus Mitchell

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Latent variable models such as Latent Dirichlet Allocation provide rich tools for analyzing large document corpora. They can uncover a wide range of hidden information such as topics in text, communities in social networks, and patterns in images. Scatter/Gather is a clustering technique that allows users to interactively combine and split groups. When joined with latent variable models, Scatter/Gather organizes topics into themes, enables topic browsing, and improves processing time for large numbers of topics.

Description

text

LCSH Subject Headings

Citation