Thread: Web crawler

    I am trying to build a web crawler and am not sure where to start. I already have a function that returns a List(Of String) of all the links on a page.


    vb Code:
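    ' These Imports belong at the top of the file.
    Imports System.Collections.Generic
    Imports System.Text.RegularExpressions

    ' Returns every email address found in the given HTML.
    Private Function getemailsfromhtml(ByVal html As String) As List(Of String)
        Dim output As New List(Of String)
        ' Nothing or an empty string cannot contain a match, so return the empty list.
        If String.IsNullOrEmpty(html) Then
            Return output
        End If
        ' Local part, "@", then a bracketed IP prefix or dotted domain labels,
        ' then a 2-4 letter TLD or a final 1-3 digit octet.
        Dim pattern As String = "([a-zA-Z0-9_\-\.]+)@((\[[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}\.)|(([a-zA-Z0-9\-]+\.)+))([a-zA-Z]{2,4}|[0-9]{1,3})"
        For Each m As Match In Regex.Matches(html, pattern)
            output.Add(m.Value)
        Next
        Return output
    End Function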
    Last edited by giodamelio; Jun 17th, 2010 at 05:05 PM.
