Great problem in converting text to feature vector

m_keshavarz_com
m_keshavarz_com New Altair Community Member
edited November 5 in Community Q&A
Hello
Sorry
I have a thousand sentences
I want to turn text into a feature vector using word2vec instead of tfidf
And then, clustering the texts based on the words
But do not know how
This is my process but
Word2vec is now clustered based on the words extracted, not according to the words in the sentences. And then clustering sentences
Please help me what to do
How to extract wordwords with word2vec and cluster them according to them?
Thanks
Any help is sure to be useful to me and thanks
Waiting
Vc.zip 31.3K

Answers

  • MartinLiebig
    MartinLiebig
    Altair Employee

    Hi,

    word2vec maps words to vectors. Not sentences. You need to do an aggregation in order to get sentence vectors. This is not trivially done.

     

    BR,

    Martin