Contextualized Topic Modeling with Python (EACL2021)

Combining BERT and friends with Neural Variational Topic Models

CTM Architecture

Overview

Suppose we have a small set of documents in Portuguese that is not large enough to reliably run standard topic modeling algorithms. However, we have enough English documents in the same domain. With our cross-lingual zero-shot topic model (ZeroShotTM), we can first learn topics on English and then predict topics for Portuguese documents (as long as we use pre-trained representations that account for both English and Portuguese).

Read More

Related