Christoph Lameter <[email protected]> wrote:
> Maybe I do not understand properly. I thought page_mkwrite is called
> before a page is made writable not before it is dirtied. If its only
> called before the page is dirtied then a better name maybe before_dirty or
page_mkwrite() is called before any of the PTEs referring to a page are made
writable. This must precede a page being dirtied by writing to it directly
through an mmap'd section. It does not catch write() and co. dirtying pages,
but then there's no need since prepare_write() is available for that.
> What do you mean by "written to the cache"? It cannot be written back
> since the page has been dirtied yet. So "written to the cache" means
> that the FS does some reservation, right?
See the FS-Cache patches posted to LKML on the 19th of May, in particular the
documentation included in the patch with the subject:
[PATCH 10/14] FS-Cache: Generic filesystem caching facility [try #10]
These patches permit data retrieved from network filesystems (NFS and AFS for
now) to be cached locally on disk.
The page is fetched from the server and then written to the cache. We don't
let the clean page be modified or released until the write to the cache is
complete. This permits us to keep track of what state the cache is in.
> If set_page_dirty cannot reserve the page then we know that some severe
> action is required. The FS method set_page_dirty() could:
But by the time set_page_dirty() is called, it's too late as the code
currently stands. We've already marked the PTE writable and dirty. The
page_mkwrite() op is called _first_.
> 1. Determine the ENOSPC condition before it sets the page dirty.
> That leaves the potential that some writes to the page have occurred
> by other processes.
You have to detect ENOSPC before you modify the PTE, otherwise someone else
can jump through your hoop and dirty the page before you can stop them.
> 2. Track down all processes that use the mapping (or maybe less
That's bad, even if you restrict it to those that have MAP_SHARED and
PROT_WRITE set. They should not be terminated if they haven't attempted to
write to the mapping.
> severe: processes that have set the dirty bit in the pte) and
> terminate them with SIGBUS.
What's wrong with my suggestion anyway?
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
[Index of Archives]
[Video 4 Linux]
[Linux for the blind]