🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

"count syllables based on a predefined dictionary"

User: "markus_dressel"
New Altair Community Member
Updated by Jocelyn

Hi community,

I want to count the syllables within in document. Therefore, I have a predefined dictionary (in excel) which contains 85000 words and its corresponding syllable values. 

Now I want to tokenize the document and and count the sylables within the document. As result I would to retrieve the number of syllables in the text. What operators do I need ?

I hope you can help me with that topic.

 

Best regards and thanks in advance,

 

Markus 

Find more posts tagged with

Sort by:
1 - 1 of 11
    User: "Telcontar120"
    New Altair Community Member
    Accepted Answer

    Sure, this is actually pretty straightforward.  First tokenize the document using "Process Document" and then output the wordlist using the "Wordlist to Data" operator, which will give you an exampleset of the wordlist with the counts.  Then you should be able to join your syllable count in using "Join" (you'll join on the words/tokens) and use "Generate Attributes" to compute the product of the word count and the syllable count per word, and then use "Aggregate" to get the sum of that product.  And you should have total syllables in the document!