
Thread: SE simplification request

  1. #1

    Thread Starter
    Banned
    Join Date
    Nov 2024
    Location
    lake Titikaka
    Posts
    3

    SE simplification request

    I wish to understand how search engine crawlers traverse websites, in terms of the pathways they take.

  2. #2

  3. #3

    Thread Starter
    Banned
    Join Date
    Nov 2024
    Location
    lake Titikaka
    Posts
    3

    Re: SE simplification request

    That's not helping me understand it.
    I know it scrapes a webpage for links, OK, but what happens next, and after that?

    I would also assume it starts with some sort of seed site list, but if so, wouldn't that leave some web addresses unreachable, in a void?

    And if I were to assume it goes through all scraped links, wouldn't that inflation jam up the crawler?

  4. #4
    PowerPoster Arnoutdv's Avatar
    Join Date
    Oct 2013
    Posts
    6,738

    Re: SE simplification request

    Then you first need to collect a list of all domains.
    https://www.quora.com/How-do-I-scrap...p-level-domain
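
    To make the "what next, and after that what" part concrete, here is a minimal sketch of one common way to do it: a breadth-first crawl driven by a queue of pending URLs plus a set of URLs already seen. It is not how any particular search engine works. FAKE_WEB, extract_links, crawl and max_pages are all made-up names for illustration, and the "web" is just a dictionary so the example runs without any network access.

```python
from collections import deque

# Hypothetical in-memory "web": each URL maps to the links found on that page.
# A real crawler would download the page and parse the links out of the HTML.
FAKE_WEB = {
    "a.example/": ["a.example/about", "b.example/"],
    "a.example/about": ["a.example/"],
    "b.example/": ["a.example/", "c.example/"],
    "c.example/": [],
    "island.example/": ["island.example/page"],  # no other page links here
    "island.example/page": [],
}


def extract_links(url):
    """Stand-in for 'fetch the page and scrape its links' (assumption)."""
    return FAKE_WEB.get(url, [])


def crawl(seeds, max_pages=1000):
    """Breadth-first crawl starting from the seed list.

    Returns the URLs in the order they were visited.
    """
    unique_seeds = list(dict.fromkeys(seeds))  # drop duplicate seeds, keep order
    frontier = deque(unique_seeds)  # URLs waiting to be visited
    seen = set(unique_seeds)        # URLs already queued or visited
    visited = []

    while frontier and len(visited) < max_pages:
        url = frontier.popleft()
        visited.append(url)
        for link in extract_links(url):
            # Only queue links we have never seen, so loops between
            # pages cannot make the queue grow without bound.
            if link not in seen:
                seen.add(link)
                frontier.append(link)
    return visited


if __name__ == "__main__":
    print(crawl(["a.example/"]))
```

    Two things to notice. Starting from only "a.example/", the crawl never reaches "island.example/", because no visited page links to it. So yes, anything the seeds cannot reach through links stays undiscovered, which is why a broad list of starting domains matters. And the seen set together with the max_pages cap is what keeps the pile of scraped links from jamming the crawler: each URL is queued at most once, and the crawl stops after a fixed budget.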
