Here is set of patches that implements a *simple minded* RSS controller
for containers. It would be nice to split up the memory controller design
and implementation in phases
1. RSS control
2. Page Cache control (with split clean and dirty accounting/control)
3. mlock() control
4. Kernel accounting and control
The beancounter implementation follows a very similar approach. The split
up makes the design of the controller easier. RSS for example, can be tracked
per mm_struct. Page Cache could be tracked per inode, per thread
or per mm_struct (depending on what form is most suitable).
The definition of RSS was debated on lkml, please see
http://lkml.org/lkml/2006/10/10/130
This patchset is a proof of concept implementation and the accounting can
be easily adapted to meet the definition of RSS as and when it is re-defined
or revisited. The changes required should be small.
The reclamation logic has been borrowed from Dave Hansen's challenged
memory controller and from shrink_all_memory(). The accounting was inspired
from Rohit Seth's container patches.
The good
--------
No additional pointers required in struct page.
There is also a lot of scope for code reuse in tracking the rss of a process
(this reuse is yet to be exploited).
The not so good
---------------
The patches contain a lot of debugging code.
Applying the patches
--------------------
This patchset has been developed on top of 2.6.19-rc2 with the latest
containers patch applied.
To run and test this patch, additional fixes are required.
Please see
http://lkml.org/lkml/2006/11/6/10
http://lkml.org/lkml/2006/11/6/245
Series
------
container-res-groups-fix-parsing.patch
container-memctlr-setup.patch
container-memctlr-callbacks.patch
container-memctlr-acct.patch
container-memctlr-task-migration.patch
container-memctlr-shares.patch
container-memctlr-reclaim.patch
Setup
-----
To test the series, here's what you need to do
0. Get the latest containers patches against 2.6.19-rc2
1. Apply all the fixes
2. Apply these patches
3. Build the kernel and mount the container filesystem
mount -t container container /container
4. Disable cpuset's (to simply assignment of tasks to resource groups)
cd /container
echo 0 > cpuset_enabled
5. Add the current task to a new group
mkdir /container/a
echo $$ > tasks
cat memctlr_stats
6. Set limits
echo "res=memctlr,max_shares=10" > memctlr_shares
7. Spin the system, hang it, revolve it, crash it!!
8. Please provide feedback, both code review and any thing else that
can be useful for further development
Testing
-------
Kernbench was run on these patches and it did not show any significant
overhead in the tests.
--
Balbir Singh,
Linux Technology Center,
IBM Software Labs
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
[Index of Archives]
[Kernel Newbies]
[Netfilter]
[Bugtraq]
[Photo]
[Stuff]
[Gimp]
[Yosemite News]
[MIPS Linux]
[ARM Linux]
[Linux Security]
[Linux RAID]
[Video 4 Linux]
[Linux for the blind]
[Linux Resources]