I have to crawl suppose abc.com domain ,in visiting URLs it redirect to lots of third parties URLs like facebook.com,google.com etc.
Is there any rules for go colly to restriction of domain like scrapy linkextractor rules?
colly.Collector has a field AllowedDomains. Try setting this.
And also RedirectHandler in Collector can be used
This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.