|
-
Mar 8th, 2006, 07:35 AM
#1
Thread Starter
Member
Getting Values from HTML page
Hello!
I'm making program that should analyze data from HTML pages and put them into one XML file. Pages are similar and displaying info about companies. I tried to work with InnerText of Body (htmldocument.body.innertext) but i need something better because i can't locate correct information in InnerText string because of content movment (location of data that should be extracted is different on every page as consequence of changings in header of page, for example, company name is not same in width as on first page.... you know what I mean.)
I have some idea now, to solve it with OuterText with counting HTML tags, but is there some way to search htmldocument.body.children collection with children value as search key and to get some identificational string. ?
there must be some way to search HTML document on structured way...
here i need identification of required child that will point on child like they are all in same layer, childs of same parent node.
Posting Permissions
- You may not post new threads
- You may not post replies
- You may not post attachments
- You may not edit your posts
-
Forum Rules
|
Click Here to Expand Forum to Full Width
|