|
-
Nov 19th, 2024, 05:23 AM
#1
Thread Starter
Banned
SE simplification request
I wish to understand how search engine crawlers travers websites, in terms of the pathways they take.
-
Nov 19th, 2024, 05:34 AM
#2
Re: SE simplification request
-
Nov 19th, 2024, 05:42 AM
#3
Thread Starter
Banned
Re: SE simplification request
that's not helping me understand it.
I know it scrapes a webpage for links, ok, but next what, and after that what?
I would also assume it starts with some sort of seed site list, but if so wouldn't that leave web addresses unreachable in a void?
and if I were to assume it goes threw all scraped links, wouldn't that inflation jam up the crawler?
-
Nov 19th, 2024, 07:19 AM
#4
Re: SE simplification request
Then you first need to collect a list of all domains.
https://www.quora.com/How-do-I-scrap...p-level-domain
Posting Permissions
- You may not post new threads
- You may not post replies
- You may not post attachments
- You may not edit your posts
-
Forum Rules
|
Click Here to Expand Forum to Full Width
|