Re: Thinking outside the box on file systems

Kyle Moffett wrote:

Let me repeat myself here: Algorithmically you fundamentally CANNOT implement inheritance-based ACLs without one of the following (although if you have some other algorithm in mind, I'm listening):
(A) Some kind of recursive operation *every* time you change an inheritable permission
(B) A unified "starting point" from which you begin *every* access-control lookup (or one "starting point" per useful semantic grouping, like a namespace).
The "(A)" is presently done in userspace and that's what you want to avoid. As to (B), I will attempt to prove below that you cannot implement "(B)" without breaking existing assumptions and restricting a very nice VFS model.

No recursion is needed because only one acl exists, so that is the only one you need to update. At least on disk. Any cached acls in memory of descendant objects would need to be updated, but the number of those should be relatively small. The starting point would be the directory you start the lookup from. That may be the root, or it may be some other directory that you have a handle to, and thus, already has its effective acl computed.
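The scheme described in this reply can be sketched as a userspace toy model. This is only an illustration under assumptions: the names Node, effective_acl and set_acl are hypothetical, ACLs are reduced to a mapping from user to a set of permissions, and every directory is assumed to be in memory. Note that set_acl still walks the cached descendants, which is the cost the thread is arguing about.

```python
# Toy model, not kernel code. All names here are hypothetical.

class Node:
    def __init__(self, name, parent=None, acl=None):
        self.name = name
        self.parent = parent
        self.children = []
        self.acl = dict(acl or {})   # stored ("on disk") ACL: {user: set of perms}
        self.cached = None           # in-memory effective ACL, or None if not computed
        if parent is not None:
            parent.children.append(self)


def effective_acl(node):
    """Return the effective ACL, walking up only until a cached ancestor is found."""
    if node.cached is None:
        inherited = effective_acl(node.parent) if node.parent else {}
        merged = {user: set(perms) for user, perms in inherited.items()}
        for user, perms in node.acl.items():
            merged.setdefault(user, set()).update(perms)
        node.cached = merged
    return node.cached


def set_acl(node, acl):
    """Change one stored ACL, then invalidate the cached ACLs of in-memory descendants."""
    node.acl = dict(acl)
    stack = [node]
    while stack:
        current = stack.pop()
        current.cached = None
        stack.extend(current.children)
```

In this sketch a lookup that starts from a directory with a computed ACL does no extra walking, while a change to an inheritable ACL touches one stored ACL plus however many descendants happen to be cached.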

What ACL would "task->cwd" use?

Options:
(1.a) Use the one calculated during the original chdir() call.
(1.b) Navigate "up" task->cwd building an ACL backwards.
(1.c) $CAN_YOU_THINK_OF_SOMETHING_ELSE_HERE

1.a

Unsolvable problems with each option:

(1.a.I)
You just broke all sorts of chrooted daemons. When I start bind in its chroot jail, it does the following:
  chdir("/private/bind9");
  chroot(".");
  setgid(...);
  setuid(...);
The "/private" directory is readable only by root, since root is the only one who will be navigating you into these chroots for any reason. You only switch UID/GID after the chroot() call, at which point you are inside of a sub-context and your cwd is fully accessible. If you stick an inheritable ACL on "/private", then the "cwd" ACL will not allow access by anybody but root and my bind won't be able to read any config files.

If you want the directory to be root accessible but the files inside to have wider access then you set the acl on the directory to have one ace granting root access to the directory, and one ace that is inheritable granting access to bind. This latter ace does not apply to the directory itself, only to its children.
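A minimal sketch of that two-ace arrangement, assuming a made-up Ace record with an "applies to self" flag and an "inheritable" flag (these field names, and the rule that inherited aces stay inheritable, are assumptions for illustration and do not describe any existing ACL implementation):

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Ace:
    user: str
    perms: frozenset
    applies_to_self: bool = True   # checked when accessing the directory itself
    inheritable: bool = False      # handed down to children


def allowed(aces, user, perm):
    """True if some ace that applies to this object grants perm to user."""
    return any(a.applies_to_self and a.user == user and perm in a.perms
               for a in aces)


def inherited_by_children(aces):
    """Children receive inheritable aces as ordinary aces that apply to them."""
    return [Ace(a.user, a.perms, True, True) for a in aces if a.inheritable]


# Hypothetical acl for "/private": root may use the directory itself,
# bind only gains access to what lies beneath it.
private_acl = [
    Ace("root", frozenset({"r", "x"})),
    Ace("bind", frozenset({"r", "x"}), applies_to_self=False, inheritable=True),
]
child_acl = inherited_by_children(private_acl)
```

With these definitions, bind is refused on the directory carrying private_acl but granted on anything carrying child_acl.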

You also break relative paths and directory-moving. Say a process does chdir("/foo/bar"). Now the ACL data in "cwd" is appropriate for /foo/bar. If you later chdir("../quux"), how do you unapply the changes made when you switched into that directory? For inheritable ACLs, you can't "unapply" such an ACL state change unless you save state for all the parent directories, except... What happens when you are in "/foo/bar" and another process does "mv /foo/bar /foobar/quux"? Suddenly any "cwd" ACL data you have is completely invalid and you have to rebuild your ACLs from scratch. Moreover, if the directory you are in was moved to a portion of the filesystem not accessible from your current namespace then how do you deal with it?

Yes, if /foo/quux is not already cached in memory, you would have to walk the tree to build its acl. /foo should already be cached in memory so this work is minimal. Is this so horrible of a problem?

As for moving, it is handled the same way as any other event that makes cwd go away, such as deleting it or revoking your access; cwd is now invalid.

For example:
NS1 has the / root dir of /dev/sdb1 mounted on /mnt
NS2 has the /bar subdir of /dev/sdb1 mounted on /mnt
Your process is in NS2 and does chdir("/mnt/quux"). A user in NS1 does: "mv /mnt/bar/quux /mnt/quux". Now your "cwd" is in a directory on a filesystem you have mounted, but it does not correspond *AT ALL* to any path available from your namespace.

Which would be no different than if they just deleted the entire thing. Your cwd no longer exists.

Another example:
Your process has done dirfd=open("/media/cdrom/somestuff") when the admin does "umount -l /media/cdrom". You still have the CD-ROM open and accessible but IT HAS NO PATH. It isn't even mounted in *any* namespace, it's just kind of dangling waiting for its last users to go away. You can still do fchdir(dirfd), openat(dirfd, "foo/bar", ...), open("./foo"), etc.

What's this got to do with acls? If you are asking what effect the umount has on the acls of the cdrom, the answer is none. The acls are on the disc and nothing on the disc has changed.
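The underlying fact in the example above, that an open directory handle stays usable regardless of what happens to its path, can be observed from userspace without root. The sketch below is an analogue only: it renames the directory rather than doing a lazy unmount, runs in a temporary directory, and assumes a Linux-like system where os.open accepts dir_fd. The function name is made up for illustration.

```python
import os
import tempfile


def read_via_dirfd_after_rename():
    """Open a directory, rename it, then open a file relative to the old handle."""
    with tempfile.TemporaryDirectory() as base:
        old_path = os.path.join(base, "bar")
        os.mkdir(old_path)
        with open(os.path.join(old_path, "foo"), "w") as f:
            f.write("hello")

        dirfd = os.open(old_path, os.O_RDONLY | os.O_DIRECTORY)
        try:
            # The path the handle was opened through stops existing here.
            os.rename(old_path, os.path.join(base, "quux"))
            # Equivalent of openat(dirfd, "foo", O_RDONLY).
            fd = os.open("foo", os.O_RDONLY, dir_fd=dirfd)
            try:
                data = os.read(fd, 16).decode()
            finally:
                os.close(fd)
            return data, os.path.exists(old_path)
        finally:
            os.close(dirfd)
```

The file is still readable through the handle even though the original path is gone, so any per-handle ACL state derived from that path would be describing a location that no longer exists.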

No, this is correct because in the root directory "/", the ".." entry is just another link to the root directory. So the absolute path "/../../../../../.." is just a fancy name for the root directory. The above jail-escape-as-root exploit is possible because it is impossible to determine whether a directory is or is not a subentry of another directory without an exhaustive search. So when your "cwd" points to a path outside of the chroot, the one special case in the code for the "root" directory does not ever match and you can "chdir" all the way up to the real root. You can even do an fstat() after every iteration to figure out whether you're there or not!
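The ".." behaviour described here can be modelled in a few lines. This is a toy simulation with hypothetical names (Dir, go_up), not the kernel's path-walking code and not the code being discussed; it only assumes what the paragraph states: the real root's ".." points to itself, and the only special case is "cwd is the process root".

```python
class Dir:
    def __init__(self, name, parent=None):
        self.name = name
        # In the real root, ".." is just another link to the root itself.
        self.parent = parent if parent is not None else self


def go_up(cwd, process_root, steps):
    """Follow ".." `steps` times, stopping only when cwd IS the process root."""
    for _ in range(steps):
        if cwd is process_root:
            continue          # the one special case: ".." at the root stays put
        cwd = cwd.parent
    return cwd


real_root = Dir("/")
jail = Dir("jail", real_root)       # the directory used as the chroot
inside = Dir("etc", jail)           # a cwd inside the jail
outside = Dir("home", real_root)    # a cwd that was left outside the jail
```

Starting from inside, the walk stops at jail; starting from outside, the comparison against jail never matches and the walk ends at real_root.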

Ohh, I see... yes... that is a very clever way for root to misuse chroot(). What does it have to do with this discussion?

And yes, this has been exploited before, although not often as chroot()-ed uid=0 daemons aren't all that common.
So, pray tell, when this code runs and you do the "chroot" call, what ACL do you think should get stuck on "cwd"? It doesn't reference anything available relative to the chroot.

Same root abuse, same result. The acl on the cwd would still be exactly what it was before the chroot.

With this you just got into the big-ugly-nasty-recursive-behavior again. Say I untar 20 kernel source trees and then have my program open all 1000 available FDs to various directories in the kernel source tree. Now I run 20 copies of this program, one for each tree, still well within my ulimits even on a conservative box. Now run "mv dir_full_of_kernel_sources some/new/dir". The only thing you can do to find all of the FDs is to iterate down the entire subdirectory tree looking for open files and updating their contexts one-by-one. Except you have 20,000 directory FDs to update. Ouch.

Ok, so you found a pedantic corner case that is slow. So? And it is still going to be faster than chmod -R.

To sum up, when doing access control the only values you can safely and efficiently get at are:
(A)  The dentry/inode
(B)  The superblock
(C)  *Maybe* the vfsmount if those patches get accepted
Any access control model which tries to poke other values is just going to have a shitload of corner cases where it just falls over.


If by falls over you mean takes some time, then yes.... so what?

