normalize weighting

Welcome to the Forums :wave:

Quote:

just because the second file has less words doesn't make it a better match for foo

Yes it does. A higher proportion of the words in that file are "foo", and hence it has more relevance. Unless by "better match" you mean that a 200,000 word file with 2 instances of "foo" would be a better match than a 1 word file consisting solely of "foo", in which case you have to go by the absolute number.
If you want to normalise something, then you have to know what you're normalising it to. If you want to make it more relevant if the first word is "foo" than the second, then there are ways of doing such things, but you need to decide how you're determining what makes one file a "better match" than another before you can proceed.

zaza