bruce wrote:
i know wget isn't as robust as nutch, but can someone tell me if wget keeps track of the URLs it's been through so it doesn't repeat/get stuck in a never-ending process...
I don't know about the implementation details, but if I create two pages that link to each other and tell wget to download them recursively, it does not loop. Maybe it could loop if there are references that can't be detected by examining the "stack" of links leading back to the first page.
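
For what it's worth, here's a minimal way to reproduce that test yourself (the host name and file names are just placeholders; the two pages need to be served over HTTP for wget to recurse):

  a.html:  <html><body><a href="b.html">to b</a></body></html>
  b.html:  <html><body><a href="a.html">to a</a></body></html>

  $ wget -r http://example.com/a.html

In my test wget fetched each page exactly once and then stopped, so it evidently remembers which URLs it has already visited within a run.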
You may want to look at the section of the man page detailing the "-nc" option. I use the options "-r -nc" when downloading a complex set of pages.
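
For example, a re-runnable download might look like this (the URL is a placeholder):

  $ wget -r -nc http://example.com/manual/index.html

As I understand it, "-nc" ("no clobber") tells wget to skip any file that already exists locally, so if the download is interrupted you can re-run the same command and it picks up roughly where it left off instead of re-fetching everything.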