Re: [PATCH] Add I/O hypercalls for i386 paravirt

Andi Kleen wrote:

On Tue, Aug 21, 2007 at 10:23:14PM -0700, Zachary Amsden wrote:
In general, I/O in a virtual guest is subject to performance problems.The I/O can not be completed physically, but must be virtualized. Thismeans trapping and decoding port I/O instructions from the guest OS.Not only is the trap for a #GP heavyweight, both in the processor andthe hypervisor (which usually has a complex #GP path), but this forcesthe hypervisor to decode the individual instruction which has faulted.
Is that really that expensive? Hard to imagine.

You have an expensive (16x cost of hypercall on some processors)privilege transition, you have to decode the instruction, then you haveto verify protection in the page tables mapping the page allowsexecution (P, !NX, and U/S check). This is a lot more expensive than ahypercall.

e.g. you could always have a fast check for inb/outb at the beginning
of the #GP handler. And is your initial #GP entry really more expensive

than a hypercall?

The number of reasons for #GP is enormous, and there are too many pathsto optimize with fast checks. We do have a fast check for inb/outb;it's just not fast enough.

On P4, hypercall entry is 120 cycles. #GP is about 2000. Modernprocessors are better, but a hypercall is always faster than a fault.Many times, the hypercall can be handled and ready to return before a#GP would even complete.

On workloads that sit there and hammer network cards, these costs becomesignificant, and latency sensitive network benchmarks suffer.

Worse, even with hardware assist such as VT, the exit reason alone isnot sufficient to determine the true nature of the faulting instruction,requiring a complex and costly instruction decode and simulation.
It's unclear to me why that should be that costly.

Worst case it's a switch()

There are 24 different possible I/O operations; sometimes with a portencoded in the instruction, sometimes with input in the DX register,sometimes with a rep prefix, and for 3 different operand sizes.

Combine that with the MMU checks required, and it's complex and branchyenough to justify short-circuiting the whole thing with a simple hypercall.

Zach
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Follow-Ups:
- Re: [PATCH] Add I/O hypercalls for i386 paravirt
  - From: Andi Kleen <ak@suse.de>

References:
- [PATCH] Add I/O hypercalls for i386 paravirt
  - From: Zachary Amsden <zach@vmware.com>
- Re: [PATCH] Add I/O hypercalls for i386 paravirt
  - From: Andi Kleen <ak@suse.de>

Prev by Date: Re: [PATCH] Memory controller Add Documentation
Next by Date: Re: drivers/scsi/advansys.c - ld error ( Re: 2.6.23-rc3-mm1 )
Previous by thread: Re: [PATCH] Add I/O hypercalls for i386 paravirt
Next by thread: Re: [PATCH] Add I/O hypercalls for i386 paravirt
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]