Re: [RFC PATCH 0/6] Convert all tasklets to workqueues

Ingo Molnar wrote:

But it was not me who claimed that 'workqueues are slow'.


The claim was:  slower than tasklets.

choice. I am just wondering out loud whether this particular tool, inits current usage pattern, makes much technological sense. My claim is:it could very well be that it doesnt make _much_ sense, and in that casewe should provide a non-intrusive migration path away in terms of acompatible API wrapper to a saner (albeit by virtue of trying to emulatean existing API, slower) mechanism. The examples cited so far had thetasklet as an intermediary towards a softirq - what's the technologicalpoint in such a splitup?

I already answered that in detail. In sum, a driver cannot define itsown softirq. Softirqs are not modular.


Tasklets are the closest thing to softirqs for a driver.

The most scalable workloads dont involve any (or many) softirq middlemenat all: you queue work straight from the hardirq context to the targetprocess context. And that's what you want to do _anyway_, because youwant to create as little locally cached data for the hardirq context, asthe target task could easily be on another CPU. (this is generally truefor things like block IO, but it's also true for things like networkIO.)
the most scalable solution would be _for the network adapter to figureout the target CPU for the packet_.

I agree completely. Wanna implement this? I will kiss your feet, andmulti-core CPU vendors will worship you as a demi-god.

Until such time, we must deal with the network stack as it exists today,and the network drivers as they exist and work today.

Not many (if any) such adaptersexist at the moment. (as it would involve allocating NR_CPUs irqs tothat adapter alone.)


Good news:  this is becoming the norm for modern NICs, especially 10Gbps.

Plenty of NICs already exist that support multiple RX rings (persumablyone per CPU), and newer NICs will raise individual MSI[-X] interruptsbased on the RX ring into which a packet was received.


In this area, NIC vendors are way ahead of the Linux net stack.

The Linux net stack is unfortunately not threaded enough to sanely dealwith dividing /flows/ up across multiple CPUs, even if the NIC doessupport multiple transmit and receive queues. [side note: initialmulti-queue TX is being worked on, on netdev]

Tasklet is single thread by definition and purpose. Those a few placeswhere people used tasklets to do per-cpu jobs (RCU f.e.) exist justbecause they had troubles with allocating new softirq. [...]
no. The following tale is the true and only history of the RCU tasklet;-) The RCU guys first used a tasklet, then noticed its bad scalability(a particular VFS-intense benchmark regressed because only a single CPUwould do RCU completion on an 8-way box) so they switched it to aper-cpu tasklet - without realizing that a per-cpu tasklet is in essencea softirq. I pointed it out to them (years down the road ...) then the"convert rcu-tasklet to softirq" patch was born.

You focused on the example rather than the key phrase: tasklet issingle thread by definition and purpose.

Wanting to change that without analysis of the impact illustrates theapples-to-oranges change being proposed.

outlined above: if you want good scalability, dont use middlemen :-)Figure out the target task as early as possible and let it do as much ofthe remaining work as possible. _Increasing_ the amount of cachedcontext (by doing delayed processing in tasklets or even softirqs on thesame CPU where the hardirq arrived) only increases the cross-CPU cost.Keeping stuff in a softirq only makes (some) sense as long as you haveno target task at all (routing, filtering, etc.).


I do not disagree with these theoretical musings :)

I care the most about the "who will do all this work?" question. Innetwork driver land, these changes impact hot paths. I am lazy, anddon't care to revisit each network driver hot path and carefully re-tuneeach based on this proposal. Who is volunteering?


	Jeff


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

References:
- [RFC PATCH 0/6] Convert all tasklets to workqueues
  - From: Steven Rostedt <[email protected]>
- Re: [RFC PATCH 0/6] Convert all tasklets to workqueues
  - From: Linus Torvalds <[email protected]>
- Re: [RFC PATCH 0/6] Convert all tasklets to workqueues
  - From: Ingo Molnar <[email protected]>
- Re: [RFC PATCH 0/6] Convert all tasklets to workqueues
  - From: Linus Torvalds <[email protected]>
- Re: [RFC PATCH 0/6] Convert all tasklets to workqueues
  - From: Ingo Molnar <[email protected]>
- Re: [RFC PATCH 0/6] Convert all tasklets to workqueues
  - From: Jeff Garzik <[email protected]>
- Re: [RFC PATCH 0/6] Convert all tasklets to workqueues
  - From: Ingo Molnar <[email protected]>
- Re: [RFC PATCH 0/6] Convert all tasklets to workqueues
  - From: Alexey Kuznetsov <[email protected]>
- Re: [RFC PATCH 0/6] Convert all tasklets to workqueues
  - From: Ingo Molnar <[email protected]>

Prev by Date: Re: how about mutual compatibility between Linux's GPLv2 and GPLv3?
Next by Date: Re: 2.6.22-rc6 spurious hangs
Previous by thread: Re: [RFC PATCH 0/6] Convert all tasklets to workqueues
Next by thread: Re: [RFC PATCH 0/6] Convert all tasklets to workqueues
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]