Results 1 to 2 of 2

Thread: normalize weighting

Thread Tools
- Show Printable Version
Display
- Switch to Linear Mode
- Switch to Hybrid Mode
- Threaded Mode

Threaded View

Previous Post

Next Post

Mar 20th, 2006, 08:59 PM #1
bimmerfan325

View Profile

View Forum Posts
Thread Starter
New Member

Join Date

Mar 2006

Posts

1
normalize weighting

i'm trying to figure out a fair weighting system for words in a file. this is what i mean. lets say i have a file that has 200 unique words and the word 'foo' is there 5 times. then i have another file that has 150 unique words with 'foo' also listed 5 times. it works out to 5/200=.025 and 5/150=.033. just because the second file has less words doesn't make it a better match for foo. they should be equal. is there anyway i can normalize the weights. and i can't just use the number of occurances because there are other metrics involved. i hope this makes sense. any help would be appreciated.
Reply With Quote

Quick Navigation Maths Forum Top

« Previous Thread | Next Thread »

Posting Permissions

You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
[VIDEO] code is On
HTML code is Off

Click Here to Expand Forum to Full Width

Terms and Conditions | About Us | Privacy Notice | Contact Us | Advertise | Sitemap| California - Do Not Sell My Info

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.

All times are GMT -5. The time now is 09:42 AM.