-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Security issues in crawling #12
Comments
|
|
Took me some time to find this out, but Scrapy has a safety valve around the downloader. If the aggregated size of Response in progress is larger than 5 MB it stops the flow of further Request into the downloader. |
@justinccdev |
Whilst we may very probably want to use fairsharing.org information and/or Bioschemas live deploys in the future as sources for default sites to crawl, we don't want to restrict the user as to what they can crawl, I think. |
Thoughts -
The text was updated successfully, but these errors were encountered: