Interpreting LogLikelihood For LDA Topic Modeling

New Altair Community Member

Jun 19, 2018

Updated Nov 5, 2024 by Jocelyn

Hi RM Community,

Based on the attached picture, how should I interpret Loglikelihood values changing with number of topics. Is higher better or lower better. Does it needs to be squared to be positive?

Thanks!

Find more posts tagged with

AI Studio

Sort by:

1 - 15 of 151

MartinLiebig

Altair Employee

Jun 19, 2018

Hi,

it's the negative LLH. The lower the better.

BR,
Martin

svtorykh

New Altair Community Member

Jun 19, 2018

Thanks for prompt reply, so in this case -230000 is better than -240000 or vice versa?

MartinLiebig

Altair Employee

Accepted Answer

Jun 19, 2018

Hi @svtorykh,

-240000 is better.

BR,

Martin

svtorykh

New Altair Community Member

Jun 19, 2018

Thanks so much Martin!

MartinLiebig

Altair Employee

Jun 19, 2018

By the way, @svtorykh,

one of the next updates will have more performance measures for LDA. Just need to find time to implement it. LLH by itself is always tricky, because it naturally falls down for more topics.

BR,

Martin

svtorykh

New Altair Community Member

Jun 19, 2018

That would be very nice to have! Please keep us posted Martin!

jozeftomas_2020

Banned

Jun 20, 2018

Hello. I want to find the optimal K-number for KMEANS with the LDA Loglikelihood value

For me, using alpha and beta as heuristics for the top 5 is the highest. Now, how to use K optimally. Does anyone know how to help? Thanks a lot I searched a lot, but I did not find anything:smileysad:

MartinLiebig

Altair Employee

Jun 20, 2018

Hey @jozeftomas_2020,

i am fairly confused. KMeans and LDA are fairly different models. Why and how do you want to mix them?

~Martin

jozeftomas_2020

Banned

Jun 20, 2018

In the articles I have seen using the LDA to find optimal k, but I do not know how?
And how can I understand which LDA has a better result? Alpha and beta need to be adjusted a little or too high to get a better result?

I'm so sorry
Thanks a lot