Re: large files unnecessary trashing filesystem cache?

On Tue, 18 Oct 2005, Badari Pulavarty wrote:

> On Tue, 2005-10-18 at 23:58 +0200, Bodo Eggert wrote:
>> Badari Pulavarty <pbadari@gmail.com> wrote:
>>> On Tue, 2005-10-18 at 22:01 +0200, Guido Fiala wrote:
>>>> [large files trash cache]
>>>
>>> Is there a reason why those applications couldn't use O_DIRECT ?
>>
>> The cache trashing will affect all programs handling large files:
>>
>> mkisofs * > iso
>> dd < /dev/hdx42 | gzip > imagefile
>> perl -pe's/filenamea/filenameb/' < iso | cdrecord - # <- never tried
>
> Are these examples that demonstrate the thrashing problem?
> A few product (database) groups here are trying to get me to
> work on a solution before demonstrating the problem. They
> also claim exactly what you are saying. They want a control
> on how many pages (per process or per file or per filesystem
> or system wide) you can have in the filesystem cache.
>
> That's why I am pressing to find out the real issue behind this.
> If you have a demonstrable testcase, please let me know.
> I will be happy to take a look.
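
For reference, O_DIRECT needs block-aligned buffers and bypasses the page
cache completely; a gentler option for a purely streaming reader like the
dd example above is to drop the pages it has already consumed with
posix_fadvise(POSIX_FADV_DONTNEED). A rough sketch, with details such as
the chunk size picked arbitrarily:

#define _POSIX_C_SOURCE 200112L
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

int main(int argc, char **argv)
{
    static char buf[1 << 20];   /* 1 MB chunks, size is arbitrary */
    off_t done = 0;
    ssize_t n;
    int fd;

    if (argc != 2) {
        fprintf(stderr, "usage: %s <file>\n", argv[0]);
        return 1;
    }
    fd = open(argv[1], O_RDONLY);
    if (fd < 0) {
        perror("open");
        return 1;
    }
    while ((n = read(fd, buf, sizeof(buf))) > 0) {
        /* ... consume buf here ... */
        done += n;
        /* tell the kernel these pages won't be needed again, so
           reclaiming them is preferred over evicting other data */
        posix_fadvise(fd, 0, done, POSIX_FADV_DONTNEED);
    }
    close(fd);
    return 0;
}

If it is the output file doing the trashing, the same call works on the
write side too, once the data has actually been written back (e.g. after
fsync()).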


>> Changing a few programs will only partly cover the problems.
>>
>> I guess the solution would be using random cache eviction rather than
>> a FIFO. I never took a look at the cache mechanism, so I may very well
>> be wrong here.
>
> Read-only pages should be re-cycled really easily & quickly. I can't
> believe read-only pages are causing you all the trouble.
The problem is that there are many sources of read-only pages (how many
shared library pages are not read-only, for example?), and not all of
them are of equal value to the system.

The ideal situation would probably be something like the adaptive
read-ahead approach, where the system balances the saved pages between
processes/files rather than just benefiting the process that uses pages
the fastest.

I don't have any idea how to implement this sanely without a horrible
performance hit due to recordkeeping, but someone else may have a better
idea.

Thinking out loud here: how bad would it be to split the LRU list based
on the number of things that have a page mapped? Even if it only split
it into a small number of lists (say even just 0 and 1+) and then
evicted pages from the 0 list in preference to the 1+ list (or at least
added a fixed value to the age of the 0 pages to have them age faster),
this would limit how badly library and code pages get evicted by a
large file access.
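
To make that concrete, here is a toy userspace model of the two-list
split; everything in it (the names, the numbers, the lists themselves)
is made up for illustration and has nothing to do with the real
struct page or the kernel's LRU code, it just shows the eviction
preference:

/* toy model of splitting the LRU by map count: pages nobody has
   mapped go on one list, pages mapped by at least one process go
   on another, and we only evict from the second list when the
   first one is empty */
#include <stdio.h>
#include <stdlib.h>

struct page {
    int id;
    int mapcount;           /* how many things have this page mapped */
    struct page *next;
};

/* the "0" and "1+" lists; a real LRU would also keep ordering, here
   we only care about which list gets evicted from first */
static struct page *lru_unmapped;
static struct page *lru_mapped;

static void lru_add(struct page *pg)
{
    struct page **list = pg->mapcount ? &lru_mapped : &lru_unmapped;
    pg->next = *list;
    *list = pg;
}

static struct page *evict_one(void)
{
    struct page **list = lru_unmapped ? &lru_unmapped : &lru_mapped;
    struct page *victim = *list;
    if (victim)
        *list = victim->next;
    return victim;
}

int main(void)
{
    int i;

    /* a burst of unmapped page-cache pages (a big streaming read)
       mixed with a few mapped pages (shared libraries, program text) */
    for (i = 0; i < 8; i++) {
        struct page *pg = malloc(sizeof(*pg));
        pg->id = i;
        pg->mapcount = (i % 4 == 0);    /* every 4th page is mapped */
        lru_add(pg);
    }

    /* memory pressure: reclaim five pages; the mapped ones survive */
    for (i = 0; i < 5; i++) {
        struct page *victim = evict_one();
        if (!victim)
            break;
        printf("evicting page %d (mapcount %d)\n",
               victim->id, victim->mapcount);
        free(victim);
    }
    return 0;
}

Strictly preferring the 0 list is the simplest version; the "add a
fixed value to the age" variant would instead keep one list and just
bias the age comparison.
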
David Lang

--
There are two ways of constructing a software design. One way is to make it so simple that there are obviously no deficiencies. And the other way is to make it so complicated that there are no obvious deficiencies.
 -- C.A.R. Hoare
