By JRun
Hi,
I need to scrape many hundreds of thousands of URLs. Should I go for 10 separate droplets for 10 parallel url scrapers, or should I go with 10 parallel scarpers within the same droplet? I suspect that IP addresses need to be different for the first case (10 different droplets). Is that correct?
Thanks.
This textbox defaults to using Markdown to format your answer.
You can type !ref in this text area to quickly search our full set of tutorials, documentation & marketplace offerings and insert the link!
This comment has been deleted
If you can afford the cost, and the system won’t fall under “abuse”, I’d say go with the 10 droplets plan. This will give you more CPU than a single droplet.
As for a single IP, you can set up a SSH tunnel from 9 of the servers to the 10th. This will give you the same IP across all requests, even though they are coming from different servers.
Get paid to write technical tutorials and select a tech-focused charity to receive a matching donation.
Full documentation for every DigitalOcean product.
The Wave has everything you need to know about building a business, from raising funding to marketing your product.
Stay up to date by signing up for DigitalOcean’s Infrastructure as a Newsletter.
New accounts only. By submitting your email you agree to our Privacy Policy
Scale up as you grow — whether you're running one virtual machine or ten thousand.
Sign up and get $200 in credit for your first 60 days with DigitalOcean.*
*This promotional offer applies to new accounts only.