Archive for 1 October 2008

Presence versus frequency redux

Posted: 1 October 2008 in Uncategorized
Tags:

Well to answer the question I posed in my last post:  presence is indeed better than frequency!  My previous experiments led me to the opposite conclusion in contradiction of results published by Pang et al (2002).  It seems the voodoo I was missing was removing stop words and then length normalizing each vector.  This combination boosted [...]