Re: [kvm-devel] [PATCH 3/3] virtio PCI device

Avi Kivity wrote:

rx and tx are closely related. You rarely have one without the other.
In fact, a turned implementation should have zero kicks or interruptsfor bulk transfers. The rx interrupt on the host will process new txdescriptors and fill the guest's rx queue; the guest's transmitfunction can also check the receive queue. I don't know if that'sachievable for Linuz guests currently, but we should aim to make itpossible.

ATM, the net driver does a pretty good job of disabling kicks/interruptsunless they are needed. Checking for rx on tx and vice versa is a goodidea and could further help there. I'll give it a try this week.

Another point is that virtio still has a lot of leading zeros in itsmileage counter. We need to keep things flexible and learn from othersas much as possible, especially when talking about the ABI.

Yes, after thinking about it over holiday, I agree that we should atleast introduce a virtio-pci feature bitmask. I'm not inclined toattempt to define a hypercall ABI or anything like that right now buthaving the feature bitmask will at least make it possible to do such athing in the future.

I'm wary of introducing the notion of hypercalls to this devicebecause it makes the device VMM specific. Maybe we could have thedevice provide an option ROM that was treated as the device "BIOS"that we could use for kicking and interrupt acking? Any idea of howthat would map to Windows? Are there real PCI devices that use theoption ROM space to provide what's essentially firmware?Unfortunately, I don't think an option ROM BIOS would map well toother architectures.
The BIOS wouldn't work even on x86 because it isn't mapped to theguest address space (at least not consistently), and doesn't know theguest's programming model (16, 32, or 64-bits? segmented or flat?)
Xen uses a hypercall page to abstract these details out. However, I'mnot proposing that. Simply indicate that we support hypercalls, anduse some layer below to actually send them. It is the responsibilityof this layer to detect if hypercalls are present and how to call them.
Hey, I think the best place for it is in paravirt_ops. We can evenpatch the hypercall instruction inline, and the driver doesn't need toknow about it.

Yes, paravirt_ops is attractive for abstracting the hypercall callingmechanism but it's still necessary to figure out how hypercalls would beidentified. I think it would be necessary to define a virtio specifichypercall space and use the virtio device ID to claim subspaces.

For instance, the hypercall number could be (virtio_devid << 16) | (callnumber). How that translates into a hypercall would then be part of theparavirt_ops abstraction. In KVM, we may have a single virtio hypercallwhere we pass the virtio hypercall number as one of the arguments orsomething like that.

Not much of an argument, I know.
wrt. number of queues, 8 queues will consume 32 bytes of pci spaceif all you store is the ring pfn.
You also at least need a num argument which takes you to 48 or 64depending on whether you care about strange formatting. 8 queuesmay not be enough either. Eric and I have discussed whether the 9pvirtio device should support multiple mounts per-virtio device andif so, whether each one should have it's own queue. Any devicesthat supports this sort of multiplexing will very quickly startusing a lot of queues.
Make it appear as a pci function? (though my feeling is thatmultiple mounts should be different devices; we can then hotplugmountpoints).
We may run out of PCI slots though :-/
Then we can start selling virtio extension chassis.

:-) Do you know if there is a hard limit on the number of devices on aPCI bus? My concern was that it was limited by something stupid like an8-bit identifier.


Regards,

Anthony Liguori

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Follow-Ups:
- Re: [kvm-devel] [PATCH 3/3] virtio PCI device
  - From: Avi Kivity <[email protected]>

References:
- [PATCH 0/3] virtio PCI driver
  - From: Anthony Liguori <[email protected]>
- [PATCH 1/3] Export vring functions for modules to use
  - From: Anthony Liguori <[email protected]>
- [PATCH 2/3] Put the virtio under the virtualization menu
  - From: Anthony Liguori <[email protected]>
- [PATCH 3/3] virtio PCI device
  - From: Anthony Liguori <[email protected]>
- Re: [kvm-devel] [PATCH 3/3] virtio PCI device
  - From: Avi Kivity <[email protected]>
- Re: [kvm-devel] [PATCH 3/3] virtio PCI device
  - From: Anthony Liguori <[email protected]>
- Re: [kvm-devel] [PATCH 3/3] virtio PCI device
  - From: Avi Kivity <[email protected]>
- Re: [kvm-devel] [PATCH 3/3] virtio PCI device
  - From: Anthony Liguori <[email protected]>
- Re: [kvm-devel] [PATCH 3/3] virtio PCI device
  - From: Avi Kivity <[email protected]>
- Re: [kvm-devel] [PATCH 3/3] virtio PCI device
  - From: Anthony Liguori <[email protected]>
- Re: [kvm-devel] [PATCH 3/3] virtio PCI device
  - From: Avi Kivity <[email protected]>

Prev by Date: Re: [PATCH 1/3] fix setsid() for sub-namespace /sbin/init
Next by Date: Re: [PATCHv4 5/6] Allow setting O_NONBLOCK flag for new sockets
Previous by thread: Re: [kvm-devel] [PATCH 3/3] virtio PCI device
Next by thread: Re: [kvm-devel] [PATCH 3/3] virtio PCI device
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]