[RFC] Fine-grained memory priorities and PI

On Dec 15, 2005, at 03:21, David S. Miller wrote:
> Not when we run out, but rather when we reach some low water mark: the "critical sockets" would still use GFP_ATOMIC memory, but only "critical sockets" would be allowed to do so.
>
> But even this has faults. Consider the IPSEC scenario I mentioned; this applies to any kind of encapsulation, actually, and even simple tunneling examples can be concocted which make the "critical socket" idea fail.
>
> The knee-jerk reaction is "mark IPSEC's sockets critical, and mark the tunneling allocations critical, and... and..." Well, then you have GFP_ATOMIC, my friend.
>
> In short, these "separate page pool" and "critical socket" ideas do not work, and we need a different solution. I'm sorry folks spent so much time on them, but they are heavily flawed.
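
For anyone who hasn't followed that thread, here is a rough userspace model of the scheme being shot down above, just to make it concrete. Every name in it is invented for illustration; none of this is a real kernel interface. Below a low water mark only sockets flagged "critical" may keep drawing from the emergency reserve, and DaveM's point is that once IPSEC, tunnels, and every other encapsulation layer also need the flag, this degenerates into plain GFP_ATOMIC.

/*
 * Toy model of the "critical socket" check (illustrative names only,
 * not a real kernel API): below a low water mark, only sockets
 * flagged critical may keep taking pages from the emergency reserve.
 */
#include <stdbool.h>
#include <stdio.h>

struct sock_model {
	bool critical;			/* marked as a "critical socket" */
};

static long free_pages = 64;		/* pages left in the reserve */
static const long low_watermark = 16;	/* threshold for the restriction */

/* Return true if this socket may take a page from the reserve. */
static bool may_alloc_page(const struct sock_model *sk)
{
	if (free_pages > low_watermark)
		return true;		/* plenty left: anyone may allocate */
	return sk->critical;		/* scarce: critical sockets only */
}

int main(void)
{
	struct sock_model nfs_sock = { .critical = true };
	struct sock_model web_sock = { .critical = false };

	free_pages = 10;		/* simulate memory pressure */
	printf("nfs may alloc: %d, web may alloc: %d\n",
	       may_alloc_page(&nfs_sock), may_alloc_page(&web_sock));
	return 0;
}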

What we really need in the kernel is a more fine-grained memory priority system with PI, similar in concept to what's being done to the scheduler in some of the RT patchsets. Currently we have a very black-and-white memory subsystem: when we go OOM, we just start killing processes until we are no longer OOM. Perhaps we should have some way to pass memory allocation priorities throughout the kernel, including "this request has X priority", "this request will help free up X pages of RAM", and "under certain OOM conditions, this memory may be dropped (even while dirty) to free X pages using this method".
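
To make that a little less hand-wavy, here is the sort of annotation I have in mind, as a standalone userspace sketch; the struct and field names are purely illustrative, not a proposed kernel API.

#include <stdio.h>

/* Illustrative annotation that would travel with an allocation request. */
struct mem_request {
	int	prio;				/* "this request has X priority" */
	long	pages_freed_if_granted;		/* "will help free up X pages of RAM" */
	long	(*drop_under_oom)(void *obj);	/* "drop ... to free X memory using this method" */
	void	*obj;				/* object the drop callback acts on */
};

/*
 * Toy allocator policy: under pressure, grant a request only if it has
 * high enough priority or promises to free more memory than it costs.
 */
static int worth_granting(const struct mem_request *req,
			  int pressure_prio, long pages_wanted)
{
	if (req->prio >= pressure_prio)
		return 1;
	return req->pages_freed_if_granted > pages_wanted;
}

int main(void)
{
	/* A low-priority request that frees 256 pages is still granted. */
	struct mem_request flush_buf = { .prio = 1, .pages_freed_if_granted = 256 };

	printf("grant flush_buf: %d\n", worth_granting(&flush_buf, 5, 32));
	return 0;
}

Whether the priority ends up as a single integer or something richer doesn't matter much here; the point is only that the information travels with the request all the way to the allocator, where PI-style boosting could also be applied.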

The initial benefit would be that OOM handling would become more reliable and less of a special case. When we start to run low on free pages, it might be OK to kill the SETI@home process long before we actually go OOM, if doing so might prevent the OOM entirely. Likewise, you might be able to flag certain file pages as "less critical", such that the kernel can kill a process and drop its dirty pages for files in /tmp. Or the kernel might do a variety of other things, simply by failing new low-priority allocations and forcing existing low-priority allocations to go away using preregistered handlers.
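
The "preregistered handlers" part might look something like this toy model (all names are made up; the /tmp example matches the paragraph above):

/*
 * Sketch of preregistered reclaim handlers (illustrative only): before
 * invoking the OOM killer, walk handlers registered by low-priority
 * users and ask them to give memory back, lowest priority first.
 */
#include <stdio.h>

#define MAX_HANDLERS 8

struct reclaim_handler {
	int	prio;			/* lower value = reclaimed first */
	long	(*release)(void);	/* returns number of pages freed */
};

static struct reclaim_handler handlers[MAX_HANDLERS];
static int nr_handlers;

static void register_handler(int prio, long (*release)(void))
{
	if (nr_handlers < MAX_HANDLERS)
		handlers[nr_handlers++] = (struct reclaim_handler){ prio, release };
}

/* Try to free 'needed' pages from registered handlers before OOM. */
static long reclaim_before_oom(long needed)
{
	long freed = 0;

	/* Handlers are assumed to be kept sorted by ascending priority. */
	for (int i = 0; i < nr_handlers && freed < needed; i++)
		freed += handlers[i].release();
	return freed;
}

/* Example handler: drop dirty pages cached for files in /tmp. */
static long drop_tmp_pages(void)
{
	puts("dropping dirty /tmp pages");
	return 128;
}

int main(void)
{
	register_handler(/* prio */ 1, drop_tmp_pages);
	printf("freed %ld pages\n", reclaim_before_oom(64));
	return 0;
}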

When processes request memory through any subsystem, their memory priority would be passed through the kernel layers to the allocator, along with any associated information about how to free the memory in a low-memory condition. As a result, I could configure my database to have a much higher priority than SETI@home (or BOINC or whatever), so that when the database server wants to fill memory with clean DB cache pages, the kernel will kill SETI@home for its memory, even if we could just leave some DB cache pages unfaulted.
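
And the victim selection that falls out of that, again as an entirely made-up userspace example:

/*
 * Illustrative victim selection driven by an explicit per-task memory
 * priority: when a higher-priority user needs memory, pick the task
 * with the lowest memory priority rather than the usual OOM heuristics.
 */
#include <stddef.h>
#include <stdio.h>

struct task_model {
	const char *comm;	/* process name */
	int	    mem_prio;	/* higher = keep longer */
};

/* Pick the lowest-priority task whose priority is below the requester's. */
static const struct task_model *
pick_victim(const struct task_model *tasks, size_t n, int requester_prio)
{
	const struct task_model *victim = NULL;

	for (size_t i = 0; i < n; i++) {
		if (tasks[i].mem_prio >= requester_prio)
			continue;	/* never kill an equal or higher priority task */
		if (!victim || tasks[i].mem_prio < victim->mem_prio)
			victim = &tasks[i];
	}
	return victim;
}

int main(void)
{
	struct task_model tasks[] = {
		{ "postgres",   10 },	/* database: configured high */
		{ "setiathome",  1 },	/* background compute: lowest */
	};
	const struct task_model *v = pick_victim(tasks, 2, /* requester */ 10);

	printf("victim: %s\n", v ? v->comm : "none");
	return 0;
}

The names and numbers are obviously arbitrary; the interesting part is that the decision is driven by an explicit per-task memory priority instead of the current OOM-killer heuristics.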

Questions? Comments? "This is a terrible idea that should never have seen the light of day"? Both constructive and destructive criticism welcomed! (Just please keep the language clean! :-D)

Cheers,
Kyle Moffett

--
Q: Why do programmers confuse Halloween and Christmas?
A: Because OCT 31 == DEC 25.



