The page tables for hugetlb mappings are handled differently than page tables
for normal pages. Rather than integrating multiple page size support into the
main VM (which would tremendously complicate the code) some hooks were created.
This allows hugetlb special cases to be handled "out of line" by a separate
interface.
Hugetlbfs was the huge page interface chosen. At the time, large database
users were the only big users of huge pages and the hugetlbfs design meets
their needs pretty well. Over time, hugetlbfs has been expanded to enable new
uses of huge page memory with varied results. As features are added, the
semantics become a permanent part of the Linux API. This makes maintenance of
hugetlbfs an increasingly difficult task and inhibits the addition of features
and functionality in support of ever-changing hardware.
To remedy the situation, I propose an API (currently called
pagetable_operations). All of the current hugetlbfs-specific hooks are moved
into an operations struct that is attached to VMAs. The end result is a more
explicit and IMO a cleaner interface between hugetlbfs and the core VM. We are
then free to add other hugetlb interfaces (such as a /dev/zero-styled character
device) that can operate either in concert with or independent of hugetlbfs.
There should be no measurable performance impact for normal page users (we're
checking if pagetable_ops != NULL instead of checking for vm_flags &
VM_HUGETLB). Of course we do increase the VMA size by one pointer. For huge
pages, there is an added indirection for pt_op() calls. This patch series does
not change the logic of the the hugetlbfs operations, just moves them into the
pagetable_operations struct.
Comments? Do you think it's as good of an idea as I do?
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
[Index of Archives]
[Kernel Newbies]
[Netfilter]
[Bugtraq]
[Photo]
[Stuff]
[Gimp]
[Yosemite News]
[MIPS Linux]
[ARM Linux]
[Linux Security]
[Linux RAID]
[Video 4 Linux]
[Linux for the blind]
[Linux Resources]