Topic extractionSubscribe to RSS - Topic extraction

Topic extraction from Message Text using external scripts/library

Introduction

Topic modeling means detecting “abstract” topics from a collection of texts. There are various techniques like: statistical technique, LDA, Hierarchical Dirichlet process etc. Amongst all of the techniques, Latent Dirichlet Allocation(LDA) has got more success based on accuracy and usability.

This statistics based text processing algorithm takes text as input and produces a list of topics based on inner implementation of LDA variant. Number of topics to be detected should be pre-defined before applying LDA method.