james.... you see, this is why i tell people that they need to take everything they read in a mail list with a large ass grain of salt!!!!! where did you find your information that lead you to think this is an end run around amazon? you have misread the AWS license/information. amazon will allow us to do what we want, provided the restrictions as outlined in my initial posting are followed. if you have any questions, let us know. but i sure as hell wish people (old or young) wouldn't post things as possible fact, that they haven't really checked out!!! just my $.02 worth. -bruce -----Original Message----- From: fedora-list-bounces@xxxxxxxxxx [mailto:fedora-list-bounces@xxxxxxxxxx]On Behalf Of James Wilkinson Sent: Thursday, August 18, 2005 9:50 AM To: fedora-list@xxxxxxxxxx Subject: Re: waaaaaay offf topic!! grid computing project bruce wrote: > i know this is way off topic, but i'm considering creating a seti@home like > grid project for testing purposes. the goal of the project would be to > extract book information from the amazon.com site/servers using the amazon > AWS services. > > to get to the scale in a fast enough timeframe, i might have to create some > sort of distributed/grid application. the key issue is that while Amazon > permits the > extraction/use of the book data from their site/servers, Amazon restricts > how fast you can hit their servers with a given machine/IP. amazon allows a > server to hit their site oncer/second. the obvious solution is to create a > distributed app that would be used to parse/extract the information, > building the database. > > while the initial app would be to test, to make sure everything would work > correctly, the obvious end result would be to use the database to support a > possible business venture. > > the client app for this project would consist of a perl/python app used to > hit the amazon.com server, and then to return the data to the test server. > the goal is to extract information for ~2-3 million books. i estimate that > i'd have to have a network of 200-500 machines to accomplish this over 2-3 > days.... with each client machine hitting the amazon server once every 5 > seconds... And keeping this up to date... As far as I can tell, this looks like an end-run around the AWS License conditions: they might well view the distributed application as a single "Application". It's also one that they might well notice: you think they're *that* unlikely to notice which IP addresses are hammering at their servers? There's a chance that they'll just track down the IP addresses in question, find out who owns them, and sue you. There's a rather larger chance that they'll just block you. Now you might be prepared to take this risk if you're just experimenting. But I seriously wouldn't want to build a business that is solely dependent on the goodwill of Amazon. I don't know exactly what you're planning, but it sounds a bad idea. I swear the sigmonster came up with this by itself! James. -- E-mail address: james | DON'T be put off by "horror stories" spread by @westexe.demon.co.uk | others. People who talk about death and serious | injury are very rarely the ones who have actually | suffered such things. -- Adrian Plass -- fedora-list mailing list fedora-list@xxxxxxxxxx To unsubscribe: http://www.redhat.com/mailman/listinfo/fedora-list