gkaramanolakis / disclda

Supervised Topic modeling with Discriminative Latent Dirichlet Allocation using Collapsed Gibbs sampling

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This fork is intended to extend lda to a form of disclda.

It amounts to a simple matrix manipulation of the topic vector for a document for that is based on the label of a document

In DIscLDA, each document has a label $y_d$ and there is a stochastic matrix $T^y$ associated with each label.

Where in vanilla LDA you draw the word topics $z{d,i} ~ Cat(theta_d)$, in DiscLDA you draw $z{d,i} ~ Cat(T^{y_d}theta_d)$

About

Supervised Topic modeling with Discriminative Latent Dirichlet Allocation using Collapsed Gibbs sampling

License:Mozilla Public License 2.0


Languages

Language:HTML 61.0%Language:Python 28.1%Language:PowerShell 4.9%Language:C 3.1%Language:Batchfile 2.3%Language:Shell 0.6%Language:Makefile 0.1%