On Dec 5, 2007, at 12:11 PM, David Howells wrote:
> Okay... I'm getting to the point where I want to release my local
> caching patches again and have NFS work with them. This means making
> NFS mounts share or not share appropriately - something that's
> engendered a fair bit of argument. So I'd like to solicit advice on
> how best to deal with this problem.
>
> Let me explain the problem in more detail.
> ================
> CURRENT PRACTICE
> ================
>
> As the kernel currently stands, coherency is ignored for mounts that
> have slightly different combinations of parameters, even if these
> parameters just affect the properties of the network "connection"
> used or just mark a superblock as being read-only.
>
> Consider the case of a file remotely available by NFS. Imagine the
> client sees three different views of this file (they could be by
> three overlapping mounts, or by three hardlinks or some combination
> thereof).
> This is how NFS currently operates without any superblock sharing:
>
>                                   +---------+
>           Object on server -----> |         |
>                                   |  inode  |
>                                   |         |
>                                   +---------+
>                                      / | \
>                                     /  |  \
>                                    /   |   \
>                                   /    |    \
>                                  /     |     \
>                                 /      |      \
>                                /       |       \
>                               /        |        \
>                              /         |         \
>                             /          |          \
>                            /           |           \
>                           |            |            |
> :::::::::::::NFS::::::::::|::::::::::::|::::::::::::|::::::::::::::::::
>                           |            |            |
>                           |            |            |
>    +---------+       +---------+       |            |
>    |         |       |         |       |            |
>    | mount 1 |------>| super 1 |       |            |
>    |         |       |         |       |            |
>    +---------+       +---------+       |            |
>                                        |            |
>    +---------+                    +---------+       |
>    |         |                    |         |       |
>    | mount 2 |------------------->| super 2 |       |
>    |         |                    |         |       |
>    +---------+                    +---------+       |
>                                                     |
>    +---------+                                 +---------+
>    |         |                                 |         |
>    | mount 3 |-------------------------------->| super 3 |
>    |         |                                 |         |
>    +---------+                                 +---------+
> Each view of the file on the client winds up with a separate inode in
> a separate superblock and with a separate pagecache. As far as the
> client kernel is concerned, they *are* three different files. Any
> incoherency effects are ignored by the kernel and if they cause a
> userspace application a problem, that's just too bad.
>
> Generally, however, this is not a problem because:
>
>  (a) an application is unlikely to be attempting to manipulate
>      multiple views of a file simultaneously and
>
>  (b) cross-view hard links haven't been and aren't used that much.
> =============================
> POSSIBLE FS-CACHE SCENARIO #1
> =============================
>
> However, now we're introducing persistent local caching into the mix.
> That means we can no longer ignore such remote possibilities - they
> are possible, therefore we have to deal with them, whether we like it
> or not.
I don't see how persistent local caching means we can no longer
ignore (a) and (b) above. Can you amplify this a bit? Nothing you
say in the rest of your proposal convinces me that having multiple
caches for the same export is really more than a theoretical issue.
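
(As a point of reference, the per-mount separation in the diagram
above is easy to observe from userspace. A minimal sketch, where
/mnt/a and /mnt/b are hypothetical mount points for the same export:)

    /* Minimal sketch: stat the same remote file through two NFS
     * mounts of the same export.  When the superblocks are not
     * shared, the two views report the same st_ino but different
     * st_dev values, i.e. two superblocks and two pagecaches. */
    #include <stdio.h>
    #include <sys/stat.h>

    int main(void)
    {
            struct stat st1, st2;

            /* Hypothetical paths: two mounts of the same export. */
            if (stat("/mnt/a/file", &st1) || stat("/mnt/b/file", &st2)) {
                    perror("stat");
                    return 1;
            }

            printf("view 1: dev=%lu ino=%lu\n",
                   (unsigned long)st1.st_dev, (unsigned long)st1.st_ino);
            printf("view 2: dev=%lu ino=%lu\n",
                   (unsigned long)st2.st_dev, (unsigned long)st2.st_ino);
            return 0;
    }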
Frankly, the reason why admins mount exports multiple times is
precisely because they want different applications to access the
files in different ways. Admins *want* one mount point to be
available ro, and another rw. They *want* one mount point to use
'noac' and another not to. They *want* multiple sockets, more RPC
slots, and unique caches for different applications. No one would go
to the trouble of mounting an export again, using different options,
unless that's precisely the behavior that they wanted.
This is actually a feature of NFS. It's used as a standard part of
production environments, for example, when running Oracle databases
on NFS. One mount point is rw and is used by the database engine.
Another mount point is ro and is used for back-up utilities, like RMAN.
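
To make that concrete, here's a rough sketch of such a dual mount
expressed via mount(2); the server name, export path, addr= value,
and mount points are all hypothetical, and this assumes the kernel's
text-based NFS mount option parsing:

    /* Sketch only: mount the same export twice with different
     * options.  Names and the addr= value are hypothetical. */
    #include <stdio.h>
    #include <sys/mount.h>

    int main(void)
    {
            /* rw mount used by the database engine */
            if (mount("server:/oradata", "/mnt/db", "nfs", 0,
                      "addr=192.0.2.1,vers=3") == -1)
                    perror("mount rw");

            /* ro, noac mount used by backup tools such as RMAN */
            if (mount("server:/oradata", "/mnt/backup", "nfs",
                      MS_RDONLY, "addr=192.0.2.1,vers=3,noac") == -1)
                    perror("mount ro");

            return 0;
    }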
Another example is local software distribution. One mount point is
ro, and is accessed by normal users. Another mount point accesses
the same export rw, and is used by administrators who provide updates
for the software.
As useful as the feature is, one can also argue that mounting the
same export multiple times is infrequent in most normal use cases.
Practically speaking, why do we really need to worry about it?
The real problem here is that the NFS protocol itself does not
support strong cache coherence. I don't see why the Linux kernel
must fix that problem.
The only real problem with the first scenario is that you may have
more than one copy of a file in the persistent cache. How often will
that be the case? Since the local persistent cache is probably
disk-based, and thus large relative to memory, what's the problem
with using a little extra space?
The problems you ascribe to your second and third caching scenarios
(deadlocking and reconnection) are, however, real and substantial.
You don't have these issues when caching each mount point separately,
right?
It seems to me that implementing the first scenario (a) is
straightforward, (b) has fewer runtime risks (i.e. deadlocks), (c)
doesn't take away features that some people still use, and (d) solves
more than 80% of the issues here (the 80/20 rule of thumb).
Lastly, there's already a mount option that allows admins to control
whether the page and attribute caches are shared -- "sharecache". Is
this mount option not adequate for persistent caching?
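
(By way of illustration, the same knob driven from mount(2); the
server name, export path, and addr= value are hypothetical:)

    /* Sketch only: "nosharecache" asks for a private superblock
     * even when a compatible mount of the same export exists. */
    #include <stdio.h>
    #include <sys/mount.h>

    int main(void)
    {
            /* Default behaviour: may share a superblock (and hence a
             * pagecache) with a compatible mount of the same export. */
            if (mount("server:/export", "/mnt/shared", "nfs", 0,
                      "addr=192.0.2.1,sharecache") == -1)
                    perror("mount sharecache");

            /* Explicitly request a private superblock and pagecache. */
            if (mount("server:/export", "/mnt/private", "nfs", 0,
                      "addr=192.0.2.1,nosharecache") == -1)
                    perror("mount nosharecache");

            return 0;
    }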
--
Chuck Lever
chuck[dot]lever[at]oracle[dot]com