Abstract:
|
Using the Stanford Encyclopedia of Philosophy as a corpus, we compare latent Dirichlet allocation (LDA) and information retrieval for topic detection and document clustering. We focus on the relative strengths and weaknesses of each technique for data analysis rather than on a theoretical description of either. Code implementing both approaches will be provided via an external link.
|