Re: Which of the virtualization approaches is more suitable for kernel?

Kirill Korotaev wrote:

- fine grained namespaces are actually an obfuscation, since kernel
subsystems are tightly interconnected. e.g. network -> sysctl -> proc,
mqueues -> netlink, ipc -> fs and most often can be used only as a
whole container.

I think a lot of _strange_ interconnects there could
use some cleanup, and after that the interconnections
would be very small

Why do you think they are strange!? Is it strange that networking
exports its sysctls and statistics via proc?

Is it strange for you that IPC uses fs?
It is by _design_.

Great, and this kind of simple design also worked well for the first few
iterations of Linux-VServer. However, some people need more flexibility,
as we are seeing by the wide range of virtualisation schemes being
proposed. In the 2.1.x VServer patch the network and (process & IPC)
isolation and virtualisation have been kept separate, and can be managed
with separate utilities. There is also a syscall and utility to manage
the existing kernel filesystem namespaces.

Eric's pspace work keeps the PID aspect separate too, which I never
envisioned possible.

I think that if we can keep as much separation between systems as
possible, then we will have a cleaner design. It will also make life
easier for the core team, as we can more easily divide up the patches
for consideration by the relevant subsystem maintainer.

- you need to track dependencies between namespaces (e.g. NAT requires
conntracks, IPC requires FS etc.). this should be handled, otherwise one
container being able to create a nested container will be able to cause
an oops.

This is just normal refcounting. Yes, IPC requires filesystem code, but
it doesn't care about the VFS, which is what filesystem namespaces
abstract.
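To make the refcounting point concrete, here is a rough userspace
sketch, not taken from any of the patches. It uses the NAT/conntrack
dependency from the quoted text; ct_state, nat_state and all the helper
names are made up for illustration. The dependent object takes a
reference on what it needs, so the dependency cannot be freed underneath
it whatever order the containers are torn down in.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical per-container conntrack state (illustrative only). */
struct ct_state {
    int refcount;
};

/* Hypothetical per-container NAT state; it depends on conntrack. */
struct nat_state {
    int refcount;
    struct ct_state *ct;    /* dependency, pinned by a reference */
};

static struct ct_state *ct_create(void)
{
    struct ct_state *ct = malloc(sizeof(*ct));

    if (ct)
        ct->refcount = 1;   /* the creator holds the first reference */
    return ct;
}

static struct ct_state *ct_get(struct ct_state *ct)
{
    ct->refcount++;
    return ct;
}

static void ct_put(struct ct_state *ct)
{
    assert(ct->refcount > 0);
    if (--ct->refcount == 0)
        free(ct);
}

static struct nat_state *nat_create(struct ct_state *ct)
{
    struct nat_state *nat = malloc(sizeof(*nat));

    if (!nat)
        return NULL;
    nat->refcount = 1;
    nat->ct = ct_get(ct);   /* NAT keeps conntrack alive */
    return nat;
}

static void nat_put(struct nat_state *nat)
{
    assert(nat->refcount > 0);
    if (--nat->refcount == 0) {
        ct_put(nat->ct);    /* drop the dependency last */
        free(nat);
    }
}
```

With this shape, the creator can drop its own conntrack reference right
after nat_create() and the conntrack state still survives until the
last NAT user is gone. Real kernel code would use atomic counters and
locking; this sketch ignores concurrency entirely.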

do you have support for it in tools?

> i.e. do you support namespaces somehow? can you create half
> virtualized container?

See the util-vserver package; it comes with chbind and vnamespace, which
allow creation of 'half-virtualized' containers, though most of the rest
of the functionality, such as per-vserver ulimits, disklimits, etc., has
been shoehorned into the general vx_info structure. As we merge into
the mainstream we can review each of these decisions and decide whether
it is an inherently per-process decision, or more XX_info structures are
warranted.

this doesn't look very cool to me, as IRQs should
be handled in the host context and TCP/IP in the
proper network space ...

this is exactly what it does.
on IRQ context is switched to host.
In TCP/IP to context of socket or network device.

That sounds like an interesting innovation, and we can compare our
patches in this space once we have some common terms of reference and
starting points.

the question here is, do we really want to turn it
off at all? IMHO the design and implementation
should be sufficiently good so that it does neither
impose unnecessary overhead nor change the default
behaviour ...
this is the question I want to get from Linus/Andrew.
I don't believe in low overhead. It starts from virtualization, then
goes resource management etc. These features _definitely_ introduce
overhead and increase resource consumption. Not big, but why not
configurable?

Obviously, our projects have different goals; Linux-VServer has very
little performance overhead. Special provisions are made to achieve
scalability on SMP and to avoid unnecessary cacheline issues. Once that
is sorted out, it's very hard to measure any performance overhead of it,
especially when the task_struct->vx_info pointer is null.
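As a rough illustration of why a null vx_info pointer is cheap, here is
a userspace sketch. Only the vx_info field name comes from the above;
the struct layouts, the xid field and the may_see() helper are
assumptions made for the example, not the actual Linux-VServer code,
and the rule that a host task sees everything is likewise assumed.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical context structure; the real vx_info holds much more. */
struct vx_info {
    unsigned int xid;       /* assumed context id field */
};

/* Stand-in for task_struct; only the pointer matters here. */
struct task {
    struct vx_info *vx_info;    /* NULL for tasks on the host */
};

/* Context id of a task; host tasks are treated as context 0. */
static unsigned int task_xid(const struct task *t)
{
    return t->vx_info ? t->vx_info->xid : 0;
}

/*
 * May 'viewer' see 'target'? For a host task the whole check is a
 * single pointer test, so unvirtualised workloads pay almost nothing.
 */
static int may_see(const struct task *viewer, const struct task *target)
{
    if (!viewer->vx_info)
        return 1;           /* assumed: host sees everything */
    return task_xid(viewer) == task_xid(target);
}
```

The point of the shape is that the common case, no context attached,
exits on the first branch without touching any other cacheline.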

However I see nothing wrong with making all code disappear without the
kernel config option enabled. I expect that as time goes on, you'd just
as soon disable it as you would disable the open() system call. I think
that's what Herbert was getting at with his comment.
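For what it's worth, the usual way to make code disappear under a config
option looks something like the sketch below. CONFIG_VSERVER_SKETCH,
vx_check() and deliver() are names invented for this example and do not
come from any posted patch. With the option unset, the helper becomes an
inline stub that always succeeds, so the compiler can drop the check and
the callers need no #ifdefs of their own.

```c
#include <assert.h>

/* Stand-in for task_struct with an assumed context id. */
struct task {
    int xid;
};

#ifdef CONFIG_VSERVER_SKETCH
/* Option enabled: perform the real context comparison. */
static inline int vx_check(const struct task *t, int xid)
{
    return t->xid == xid;
}
#else
/* Option disabled: constant stub, the check compiles away. */
static inline int vx_check(const struct task *t, int xid)
{
    (void)t;
    (void)xid;
    return 1;
}
#endif

/* Caller is identical in both configurations. */
static int deliver(const struct task *t, int xid)
{
    if (!vx_check(t, xid))
        return -1;          /* wrong context: refuse */
    return 0;
}
```

Built without -DCONFIG_VSERVER_SKETCH, deliver() succeeds for any task;
built with it, only a task whose xid matches gets through.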

Seems, you are just trying to move from the topic. Great.

I always did want to be a Lumberjack!

Sam.

References:
- Which of the virtualization approaches is more suitable for kernel?
  - From: Kirill Korotaev <dev@sw.ru>
- Re: Which of the virtualization approaches is more suitable for kernel?
  - From: Herbert Poetzl <herbert@13thfloor.at>
- Re: Which of the virtualization approaches is more suitable for kernel?
  - From: Kirill Korotaev <dev@sw.ru>
