Well, I thought I had this all done and dusted, but there is always one more problem!
I am reading an entire web page into a string so I can do stuff with it.
Now, obviously the page contains all the standard HTML tags. However, this page also contains other HTML tags, which are displayed as examples (like what you see on htmlgoodies.com).
Is there a way I can strip out only the "outer" HTML tags? I have a regex that strips out ALL tags, but I just want to strip the outer ones, if possible, so I can perform operations on the main inner page.
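To make the "outer" versus "inner" distinction concrete, here is a minimal sketch in Python (chosen only for illustration; the post does not show its own code). It assumes the example tags are stored in the page source as escaped entities such as `&lt;b&gt;`, which may not be true for every page. The `page` string is hypothetical.

```python
import html
import re

# Hypothetical page source: real markup wraps an escaped example tag.
page = "<html><body><p>Use &lt;b&gt;bold&lt;/b&gt; for emphasis.</p></body></html>"

# Remove the real ("outer") tags. This is a simple pattern, not a full
# HTML parser, so it can misfire on comments, scripts, or odd attributes.
without_outer = re.sub(r"<[^>]+>", "", page)

# Turn the escaped example tags back into literal text.
inner = html.unescape(without_outer)
print(inner)
```

Under that assumption, the existing strip-everything regex already leaves the example tags alone, and only the unescape step is new.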
Not only this, I also want to search for HTML tags entered as a search string. When I do this using regex.match, it finds the tag once, but after that it doesn't! I am using a loop to make regex keep matching, but it only ever finds the one tag in the string.
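One possible cause of the "finds it once" symptom, sketched in Python as an assumption rather than a diagnosis of the actual code: if each pass of the loop searches from the start of the string, it returns the same first match every time. Resuming from the end of the previous match finds the rest. The `text` and `search` values are hypothetical.

```python
import re

text = "<p>one</p><p>two</p><p>three</p>"
search = "<p>"  # hypothetical search string typed by the user

# Escape the input so characters like < > / are matched literally.
pattern = re.compile(re.escape(search))

# Searching from position 0 twice returns the same first match.
first = pattern.search(text)
again = pattern.search(text)
same_match = first.start() == again.start()

# Resuming after the previous match finds every occurrence.
positions = []
pos = 0
while True:
    m = pattern.search(text, pos)
    if m is None:
        break
    positions.append(m.start())
    pos = m.end()

print(positions)
```

If the code is .NET, which `regex.match` hints at, `Regex.Matches` and `Match.NextMatch` play a similar role to the resume-from-position loop here.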
Any ideas?
