The system, generated using Google Takeout's Watch History Data, specifically the titles,
is modeled using Latent Dirichlet Allocation (LDA), a generative probabilistic model of a corpus.
Intertopic Distance Map (L)
The circle size indicates the extent of word inclusion within the topic cluster. The circle distance represents similarity between topics.
If two circles overlap, it signifies similarity between the corresponding topics.
Top-30 Most Relevant Terms for Topics (R)
Each bar refers to the list of leading keywords shaping the topics.
For keywords extraction, salience and discriminative power serve as criteria and can be adjusted through a lambda parameter (λ).
This is a data-driven life-logging visualization project done by master's students of ViBA Lab.
Feel free to ask questions.