Re: RFC: issues concerning the next NAPI interface

Jan-Bernd Themann wrote:

On Monday 27 August 2007 17:51, James Chapman wrote:
In the second half of my previous reply (which seems to have beendeleted), I suggest a way to avoid this problem without using hardwareinterrupt mitigation / coalescing. Original text is quoted below.
 >> I've seen the same and I'm suggesting that the NAPI driver keeps
 >> itself in polled mode for N polls or M jiffies after it sees
 >> workdone=0. This has always worked for me in packet forwarding
 >> scenarios to maximize packets/sec and minimize latency.
To implement this, there's no need for timers, hrtimers or generic NAPIsupport that others have suggested. A driver's poll() would set aninternal flag and record the current jiffies value when findingworkdone=0 rather than doing an immediate napi_complete(). Early inpoll() it would test this flag and if set, do a low-cost test to see ifit had any work to do. If no work, it would check the saved jiffiesvalue and do the napi_complete() only if no work has been done for aconfigurable number of jiffies. This keeps interrupts disabled longer atthe expense of many more calls to poll() where no work is done. Socritical to this scheme is modifying the driver's poll() to fastpath thecase of having no work to do while waiting for its local jiffy count toexpire.
The problem I see with this approach is that the time that passes between
two jiffies might be too long for 10G ethernet adapters.

Why would staying in polled mode for 2 jiffies be too long in the 10Gcase? I don't see why 10G makes any difference. Your poll() would becalled as fast as your CPU allows during those 2 jiffies (it wouldactually be between 1 and 2 jiffies in practice). It is thereforecritical that the driver's poll() implementation is as efficient aspossible for the "no work" case to minimize the overhead of the extrapoll() calls. Your poll might be called thousands of times in 1-2jiffies with nothing to do...

(I tried to implement
a timer based approach with usual timers and the result was a disaster).
HW interrupts / or HP timer avoid the jiffy problem as they activate softIRQs

as soon as you call netif_rx_schedule.

My scheme doesn't use timers to do netif_rx_schedule() because thedevice stays in polled mode for 1-2 jiffies _after_ it detects it has nomore work. So the device remains scheduled, processing packets as usual.The device deschedules itself and re-enables its interrupts only when ithas a period of 1-2 jiffies of doing no work.

BTW, I chose 2 jiffies in the example patch just to keep the patchsimple. It might be more for systems with large HZ or those that want tobe even more aggressive at staying in polled mode. I envisage it beinganother parameter that can be tweaked using ethtool if people see abenefit of this scheme.

--
James Chapman
Katalix Systems Ltd
http://www.katalix.com
Catalysts for your Embedded Linux software development

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

References:
- Re: RFC: issues concerning the next NAPI interface
  - From: James Chapman <jchapman@katalix.com>
- Re: RFC: issues concerning the next NAPI interface
  - From: David Miller <davem@davemloft.net>
- Re: RFC: issues concerning the next NAPI interface
  - From: James Chapman <jchapman@katalix.com>
- Re: RFC: issues concerning the next NAPI interface
  - From: Jan-Bernd Themann <ossthema@de.ibm.com>

Prev by Date: Re: Who wants to maintain KR list for stable releases? (was Re: nmi_watchdog=2 regression in 2.6.21)
Next by Date: [PATCH] fix bogus hotplug cpu warning
Previous by thread: Re: RFC: issues concerning the next NAPI interface
Next by thread: Re: RFC: issues concerning the next NAPI interface
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]