|
-
Jan 14th, 2010, 05:42 PM
#1
Thread Starter
Hyperactive Member
screen scraping problem
HI guys,
I'm working on building a website that contains a database of companies in the U.S. What I want to be able to do is have a database that contains all the locations of a given company's offices, and a link to thier career search site.
What I would like to happen is to enter a company's website and then have the program parse through the website looking for the locations and the career website search.
My problem is creating a program that can find this information. The problem is because not all companies have a search for jobs on thier site, they just list the positions. Some won't link a search careers webpage on thier main website it will be on a webpage that you have go into 2-5 clicks before you can search.
Plus figuring out how to determine if it's an office location or not is difficult because some don't list them, others put them in a little app like flash where you can hover over the areas. Some use third party apps like telo, those would probably be the easier of them because everyone uses that.
But any good ideas on how to go about pulling this information off a company's website without having to manually look it up?
Posting Permissions
- You may not post new threads
- You may not post replies
- You may not post attachments
- You may not edit your posts
-
Forum Rules
|
Click Here to Expand Forum to Full Width
|