Dave Pifke <[email protected]> wrote:
>
> I'm doing some performance tweaking on a web server and am curious about
> a memory usage pattern I'm seeing. After doing some Googling I'm still
> not sure whether what I'm seeing is normal or is indicative of
> suboptimal VM or VFS behavior.
>
> The system has 2GB of RAM. At the moment, ~117MB is free, ~1.5GB is
> being used for buffers(!), and ~180MB is cached. I would expect cache
> to be much larger than buffers.
Maybe not, with lots of small files. All the metadata associated with
those files (directory blocks, inode blocks, allocation maps, etc) is
cached in the blockdev pagecache (aka buffer cache).
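You can watch which side is growing straight from /proc/meminfo; a quick
sketch (standard meminfo fields, nothing JFS-specific):

    grep -E '^(MemFree|Buffers|Cached):' /proc/meminfo

Run that before and after hammering the thumbnail directories and you should
see Buffers rather than Cached moving, if the metadata theory is right.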
> According to Documentation/filesystems/proc.txt:
>
> Buffers: Relatively temporary storage for raw disk blocks
> shouldn't get tremendously large (20MB or so)
>
> 75% of total RAM is seeming tremendously large to me. :)
The document is optimistic ;)
> I'm running Debian stable on a dual-Opteron system:
>
> Linux file003b 2.6.8-11-amd64-k8-smp #1 SMP Wed Jun 1 00:01:27 CEST 2005
> x86_64 GNU/Linux
>
> (I can upgrade the kernel if this is a known issue with 2.6.8.)
I wouldn't expect later kernels to change this much.
> Perhaps of interest is that I have a 2TB JFS filesystem on a 3ware SATA
> RAID controller.
It could be a JFS quirk - I don't know much about JFS. It'd be interesting
to know if other filesystems behave in a similar manner.
> The machine is running Apache (mpm_worker) and that's
> about it. It's all image files - a third thumbnails (<1kB), a third
> resized to the 50-100kB range, and a third originals (max 1MB).
> Thumbnails should be getting served much more frequently than the
> others, so the disk access pattern is lots of small files. Directories
> are hashed year/month/day/hour with maybe 4000 files in each.
>
> slabtop shows ~161MB total size (I think this counts against buffers?).
> Far and away the biggest users are buffer_head at ~50MB and
> radix_tree_node at ~99MB.
I'd have expected more dentry_cache.
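If you want the same numbers without slabtop, a rough pass over
/proc/slabinfo works (as root if needed); this assumes the 2.6 column
layout (name, active_objs, num_objs, objsize, ...):

    awk 'NR > 2 { printf "%-24s %8.1f MB\n", $1, $3 * $4 / 1048576 }' \
        /proc/slabinfo | sort -k2 -rn | head

That should show buffer_head, radix_tree_node and dentry_cache side by side,
sized by total objects rather than just active ones.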
> The machine is fairly responsive, but I'm seeing a load average of
> between 2 and 3 with about 250 concurrent Apache requests - I'm
> obviously hoping for better performance.
>
> Ideas? Or is this about what one would expect given the system
> configuration and workload?
One thing you could do is to (re)mount the filesystems with `-o noatime'.
When userspace reads a file the kernel will update the atime. This
involves dirtying the in-core inode and later writing it to an on-disk
inode. That write requires that the on-disk inode's block be in blockdev
pagecache.
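You can see the effect with a plain stat/cat/stat on any of the image files
(filename below is just a placeholder):

    stat -c '%x' some-thumbnail.jpg     # atime before
    cat some-thumbnail.jpg > /dev/null  # read it
    stat -c '%x' some-thumbnail.jpg     # atime has been bumped

Every one of those bumps is a dirty inode that eventually has to go back to
disk.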
With `-o noatime', the kernel will no longer need to write the inode atimes
and will satisfy all inode read requests from the separate inode cache.
Hence the blockdev pagecache page which backs the inode will remain
untouched and will be reclaimed.
That should release _some_ of the blockdev pagecache, but not a lot, I
expect. Maybe JFS is just metadata-intensive..
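FWIW the remount can be done on the fly; mount point and device below are
made up for the example:

    mount -o remount,noatime /srv/images

    # and in /etc/fstab, so it sticks across reboots:
    # /dev/sda1   /srv/images   jfs   defaults,noatime   0   2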