Thread: Extracting main body text from webpage

Sep 10th, 2013, 11:42 PM #1
JXDOS

Hi all,

I am trying to download article text from a number of different news sources in a systematic way. I know I can use GetElementById / GetElementsByTagName, but it would be quite tedious to write a separate extractor for each of the 100+ sources. Is there a way to use WebClient or WebBrowser to extract just the main content, without the HTML markup?

Something like a (hypothetical) wclient.DownloadMainBodyText, similar to what Safari's Reader function shows on an iPad/iPhone.

Thanks in advance.
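
For what it's worth, as far as I know neither WebClient nor WebBrowser has a built-in "main body text" method, so this kind of extraction is usually done with a heuristic on the downloaded HTML. Below is a rough sketch of one such heuristic: group the text of `<p>` elements by their nearest enclosing container, skip obvious page chrome (nav, footer, scripts), and keep the container with the most paragraph text. It is written in Python only to keep it short and self-contained; the same idea should port to VB.NET with any HTML parser. The names `MainTextParser` and `extract_main_text` and the `SKIP` / `CONTAINERS` tag lists are my own assumptions for illustration, not part of any library, and real news sites will likely need more tuning than this.

```python
from html.parser import HTMLParser

# Assumed tag lists for this sketch; adjust for the sites you care about.
SKIP = {"script", "style", "nav", "header", "footer", "aside", "form"}
CONTAINERS = {"article", "section", "main", "div", "td", "body"}


class MainTextParser(HTMLParser):
    """Collects <p> text, grouped by the nearest enclosing container."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.container_stack = [0]  # ids of open containers; 0 = document root
        self.next_id = 1
        self.skip_depth = 0         # > 0 while inside a SKIP element
        self.in_p = False
        self.buf = []
        self.paragraphs = {}        # container id -> list of paragraph strings

    def handle_starttag(self, tag, attrs):
        if tag in SKIP:
            self.skip_depth += 1
        elif tag in CONTAINERS:
            self._flush()  # close any paragraph left open before this container
            self.container_stack.append(self.next_id)
            self.next_id += 1
        elif tag == "p" and self.skip_depth == 0:
            self._flush()  # tolerate an unclosed previous <p>
            self.in_p = True

    def handle_endtag(self, tag):
        if tag in SKIP:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif tag in CONTAINERS:
            self._flush()
            if len(self.container_stack) > 1:
                self.container_stack.pop()
        elif tag == "p":
            self._flush()

    def handle_data(self, data):
        if self.in_p and self.skip_depth == 0:
            self.buf.append(data)

    def _flush(self):
        # Normalise whitespace and store the paragraph under its container.
        text = " ".join("".join(self.buf).split())
        if self.in_p and text:
            self.paragraphs.setdefault(self.container_stack[-1], []).append(text)
        self.buf = []
        self.in_p = False


def extract_main_text(html):
    """Return the paragraphs of the container with the most <p> text."""
    parser = MainTextParser()
    parser.feed(html)
    parser.close()
    parser._flush()
    if not parser.paragraphs:
        return ""
    best = max(parser.paragraphs.values(),
               key=lambda ps: sum(len(p) for p in ps))
    return "\n\n".join(best)
```

You would download the page as a string first (e.g. with WebClient.DownloadString on the .NET side) and pass that string to `extract_main_text`. Pages that put their article text in something other than `<p>` tags will come back empty with this sketch, so treat it as a starting point rather than a finished Reader clone.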

