Well, to answer the question I posed in my last post: presence is indeed better than frequency! My previous experiments had led me to the opposite conclusion, contradicting the results published by Pang et al. (2002). It seems the voodoo I was missing was removing stop words and then length-normalizing each vector. This combination boosted [...]
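For anyone who wants to see the recipe spelled out, here is a minimal sketch of the two feature schemes with stop-word removal and length normalization. It is not the code from my experiments: the `featurize` function, the tiny `STOP_WORDS` set, whitespace tokenization, and the choice of the L2 norm for "length normalizing" are all assumptions made for illustration.

```python
import math
from collections import Counter

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "is", "it", "was"}


def featurize(text, presence=True):
    """Turn a document into a length-normalized sparse feature vector.

    presence=True  -> each remaining term gets weight 1 (term presence)
    presence=False -> each remaining term gets its raw count (term frequency)
    """
    # Naive whitespace tokenization (an assumption), then stop-word removal.
    tokens = [t for t in text.lower().split() if t not in STOP_WORDS]
    counts = Counter(tokens)

    if presence:
        weights = {term: 1.0 for term in counts}
    else:
        weights = {term: float(n) for term, n in counts.items()}

    # Length normalization, assumed here to mean dividing by the L2 norm.
    norm = math.sqrt(sum(w * w for w in weights.values()))
    if norm == 0.0:
        return {}
    return {term: w / norm for term, w in weights.items()}


review = "the movie was great great great"
presence_vec = featurize(review, presence=True)
frequency_vec = featurize(review, presence=False)
```

With presence features, "movie" and "great" end up with equal weight in the example review; with frequency features, the repeated "great" dominates the vector. Either dictionary can then be handed to whatever classifier you are using.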
Archive for 1 October 2008
Presence versus frequency redux
Posted: 1 October 2008 in Uncategorized
Tags: computational linguistics


