Alan Cox wrote:
>> Unfortunately, in this case, the directory structure is what it is. I
>> can't change it without a massive amount of pain.
>
> Then you won't get sane performance. The ext3 directory hash does what it
> can to improve performance in this case but you'll probably find most of
> your extra overhead (compared to users with sane directory structures) is
> simply down to filename lookup and directory scanning overhead.
I can see why a filename lookup would be slow in a large directory, and
why keeping it locked between the initial check for a name's existence
and the write that creates the new name becomes problematic, but doesn't
du just do a linear walk anyway? I don't see why making the tree deeper
and less wide would make much difference there. What might make a
difference is sorting the list and doing the stat()s in inode order.
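Roughly what I mean, as a minimal sketch ("bigdir" is just a stand-in
path, and error handling is mostly omitted):

/*
 * Collect the directory entries first, sort them by inode number,
 * then stat() in that order so the disk seeks roughly ascend
 * instead of jumping around.
 */
#include <dirent.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <unistd.h>

struct ent {
	ino_t ino;
	char name[256];
};

static int by_ino(const void *a, const void *b)
{
	ino_t ia = ((const struct ent *)a)->ino;
	ino_t ib = ((const struct ent *)b)->ino;
	return (ia > ib) - (ia < ib);
}

int main(void)
{
	DIR *d;
	struct dirent *de;
	struct ent *list = NULL;
	size_t n = 0, cap = 0;
	struct stat st;
	long long total = 0;

	if (!(d = opendir("bigdir")))
		return 1;
	while ((de = readdir(d))) {
		if (!strcmp(de->d_name, ".") || !strcmp(de->d_name, ".."))
			continue;
		if (n == cap) {
			cap = cap ? cap * 2 : 1024;
			list = realloc(list, cap * sizeof(*list));
			if (!list)
				return 1;
		}
		/* the inode number comes from the dirent itself,
		 * so no stat() is needed during the scan */
		list[n].ino = de->d_ino;
		strncpy(list[n].name, de->d_name, sizeof(list[n].name) - 1);
		list[n].name[sizeof(list[n].name) - 1] = '\0';
		n++;
	}
	closedir(d);

	qsort(list, n, sizeof(*list), by_ino);	/* ascending inode order */

	if (chdir("bigdir") != 0)
		return 1;
	for (size_t i = 0; i < n; i++)
		if (lstat(list[i].name, &st) == 0)	/* now mostly sequential on disk */
			total += (long long)st.st_blocks * 512;

	printf("%lld bytes in %zu entries\n", total, n);
	free(list);
	return 0;
}

Since a hardlinked entry carries the inode number of the file it points
to, sorting on d_ino should keep the stat()s roughly sequential even
when most of the entries are links, which is exactly the case below.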
A variation of this question just came up on the backuppc list. It does
hash files into a directory tree, but since it keeps an online backup
containing the history of many other machines, you end up with millions
of files, with all duplicates hardlinked. The issue there is that
because most of the directory entries are hardlinks to existing files,
the inodes are wildly out of order, and you end up waiting on a lot of
seeks if you stat them in directory-scan order. I've been using reiserfs
for a long time for my backuppc partition, but I'd be interested to know
whether any of the other filesystems have an advantage in handling huge
numbers of directory entries (overall, not in a single directory).
--
Les Mikesell
lesmikesell@xxxxxxxxx