TermFrequency : how are WV values normalized?

Question

Hello all, I'm trying to understand the word vectors generated by the TextInput operator using Term Frequency. I believe the value is based on the normalized number of occurrences of terms in the document. How is this done? To illustrate my question I have a parsimonious process below. The directory specified in the text parameter simply contains one .txt document that reads as: The word this occurs twice in this document. (I'm aware that using TFIDF will return 0's in the word vector due to log(1/1)=0 in this setup, but TermFrequency should be ok.) The resulting ExampleSet (rounded) isRow NumberIDLabelThewordthisoccurstwiceindocument11ClassLabelTest10.3160.3160.6320.3160.3160.3160.316 Noting 0.316 (well, without rounding) = 1/(square root of 10) and 0.632... = 2/(square root of 10), the WV[term i] appears to be the flat number of occurrences of term i, divided by square root of 10. Perhaps I was expecting 8 (the number of occurrences of all terms in the document) in the denominator. If someone could provide a formula for the denominator or explain where this square root of 10 comes from, I would greatly appreciate it. Thank you so much in advance, Miwa