Re: [patch 5/6] statistics infrastructure

..oops, I should have made sure that my mailer breaks lines
appropriately. Getting it right this time, sorry ...




Andi Kleen wrote:

> Locks and indirect function calls?
> It seems very wrong to me to make such heavy weight statistic
> functions. Most likely you will disturb the performance whatever is
> being counted badly.


Well, that's a tradeoff between flexibility/function and performance.

I don't have any reliable numbers at hand. At least, doing I/O to my
SCSI devices with statistics enabled didn't feel slow.

Here is the rationale for both the indirect function call and the lock:

If a statistic is disabled (that's the default), neither is the lock
taken nor is a function called indirectly. So far so good, I would say.

If a statistic is enabled, the lock for this entity is grabbed and one
indirect function call is made. Everything else is inlined. I use
granular per-interface locking (per-entity, for example per-LUN or
per-HBA).
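
A minimal userspace sketch of the update path described above. All names
(struct stat_interface, struct statistic, stat_add, counter_add) are
hypothetical and not taken from the patch, and a C11 atomic_flag spin
loop stands in for the kernel spinlock:

```c
#include <stdatomic.h>
#include <stdint.h>

/* Hypothetical per-entity container, e.g. one per LUN or per HBA. */
struct stat_interface {
    atomic_flag lock;               /* stand-in for a kernel spinlock */
};

struct statistic;
typedef void (*stat_add_fn)(struct statistic *s, int64_t value,
                            uint64_t incr);

struct statistic {
    struct stat_interface *iface;
    int enabled;                    /* 0 (disabled) by default */
    stat_add_fn add;                /* selected data processing mode */
    uint64_t total;                 /* used by counter_add() below */
};

static inline void iface_lock(struct stat_interface *i)
{
    while (atomic_flag_test_and_set_explicit(&i->lock,
                                             memory_order_acquire))
        ;                           /* spin until the flag was clear */
}

static inline void iface_unlock(struct stat_interface *i)
{
    atomic_flag_clear_explicit(&i->lock, memory_order_release);
}

/* Simplest processing mode: a plain counter that ignores the value. */
static void counter_add(struct statistic *s, int64_t value, uint64_t incr)
{
    (void)value;
    s->total += incr;
}

/* Variant for callers that already hold s->iface->lock. */
static inline void stat_add_nolock(struct statistic *s, int64_t value,
                                   uint64_t incr)
{
    if (s->enabled)
        s->add(s, value, incr);     /* the one indirect call */
}

static inline void stat_add(struct statistic *s, int64_t value,
                            uint64_t incr)
{
    /* Unlocked check (sketch only): a disabled statistic costs
     * neither the lock nor the indirect call. */
    if (!s->enabled)
        return;
    iface_lock(s->iface);
    s->add(s, value, incr);
    iface_unlock(s->iface);
}
```

With a disabled statistic, stat_add() returns after a single test; once
enabled, each update costs one lock round trip plus one indirect call.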

The indirect function call allows customization of the way data
processing is done for particular statistics.

For example, one could deflate a histogram of latencies into a counter
providing the total of latency measurements, that is, the total of
requests observed; or inflate a statistic the other way around if
required. Another example: one can make a statistic gather data for
recurring periods of time, like megabytes per second instead of just
the total amount of bytes transferred, or like queue utilization per
whatever-unit-of-time instead of just an overall utilization.
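
To illustrate the "recurring periods of time" idea, here is a rough
sketch of a history kept in a small ring of per-period counters. The
names (struct history, history_add), the slot count, and the timestamp
handling are assumptions for illustration, not the patch's code:

```c
#include <stdint.h>

#define HISTORY_SLOTS 4             /* hypothetical ring size */

struct history {
    uint64_t period;                /* length of one period, e.g. in ms */
    uint64_t last;                  /* index of the last period updated */
    uint64_t slot[HISTORY_SLOTS];   /* one counter per recent period */
};

/* Add incr (e.g. bytes transferred) to the period containing 'now'.
 * Assumes 'now' never goes backwards. */
static void history_add(struct history *h, uint64_t now, uint64_t incr)
{
    uint64_t cur = now / h->period;
    uint64_t gap = cur - h->last;
    uint64_t i;

    /* Zero the slots of periods that started since the last update,
     * so stale data from an older lap of the ring is not reused. */
    if (gap > HISTORY_SLOTS)
        gap = HISTORY_SLOTS;
    for (i = 1; i <= gap; i++)
        h->slot[(h->last + i) % HISTORY_SLOTS] = 0;

    h->last = cur;
    h->slot[cur % HISTORY_SLOTS] += incr;
}
```

Dividing a slot by the period length gives a rate such as bytes per
second for that period, while the sum of all slots approximates the
plain total over the window covered by the ring.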

A statistic that feeds on request sizes can be set up to provide the
following "views":

- number of requests observed (counter)
- number of requests per unit of time (history based on a counter)
- number of bytes transferred (counter)
- number of bytes transferred per unit of time = transfer rate (history
  based on a counter)
- traffic pattern (histogram for discrete request sizes or for ranges of
  request sizes)
- raw measurement data gathered
- etc.
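
As a rough illustration of how several of these views derive from the
same input, the sketch below feeds one request size into a request
counter, a byte counter, and a histogram over size ranges. The names
and the power-of-two ranges are assumptions; in the patch one view is
selected per statistic, whereas the sketch updates three at once only
to show that they share a data source:

```c
#include <stddef.h>
#include <stdint.h>

#define HIST_BUCKETS 5              /* hypothetical number of ranges */

struct size_views {
    uint64_t requests;              /* number of requests observed */
    uint64_t bytes;                 /* number of bytes transferred */
    /* ranges: <=512, <=1024, <=2048, <=4096, larger */
    uint64_t hist[HIST_BUCKETS];
};

/* Map a request size to the index of its histogram range. */
static size_t size_bucket(uint64_t size)
{
    uint64_t limit = 512;
    size_t i;

    for (i = 0; i < HIST_BUCKETS - 1; i++, limit <<= 1)
        if (size <= limit)
            return i;
    return HIST_BUCKETS - 1;        /* everything above the last limit */
}

/* One measurement (a request size) updates every view. */
static void size_views_update(struct size_views *v, uint64_t size)
{
    v->requests++;
    v->bytes += size;
    v->hist[size_bucket(size)]++;
}
```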

As a device driver programmer I have to guess which "view" a user is
interested in. My pick might miss by a mile. I simply can't know for
sure beforehand.

The indirect function call could be replaced by a switch statement. I am
not sure whether this is less critical and more acceptable than an
indirect function call. It might be architecture dependent.
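
A sketch of what the switch-based alternative might look like, with a
hypothetical type tag in place of the function pointer (the enum,
struct, and function names are made up for illustration):

```c
#include <stdint.h>

/* Hypothetical type tag replacing the function pointer. */
enum sw_stat_type {
    SW_STAT_COUNTER,                /* count events, ignore the value */
    SW_STAT_SUM                     /* accumulate the values themselves */
};

struct sw_statistic {
    enum sw_stat_type type;         /* can still be changed at run time */
    uint64_t data;
};

static void sw_stat_update(struct sw_statistic *s, uint64_t value)
{
    switch (s->type) {
    case SW_STAT_COUNTER:
        s->data++;
        break;
    case SW_STAT_SUM:
        s->data += value;
        break;
    }
}
```

The dispatch becomes a branch on a tag instead of a call through a
pointer; the set of processing modes is then fixed at compile time,
while the choice among them stays adjustable at run time.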

We can get rid of the indirect function call (or an alternative switch
statement) if the vote is against this level of flexibility. Then it
would be solely up to the exploiter to define once and for all whether
a particular sort of data is shown as a simple counter, a histogram, a
fill level indicator, this history-type statistic thing in a ringbuffer
etc. This might be fine for a considerable number of cases.

The lock is there to avoid trouble with concurrent updates to a
statistic. If per-CPU data were used, concurrent updates would be fine
as long as they are done on different CPUs. Precautions against
concurrent updates to the same per-CPU data would still be needed,
though.
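
For comparison, a userspace sketch of the per-CPU idea: one slot per
CPU, padded to an assumed 64-byte cache line, with readers summing all
slots. The names, the CPU count, and passing the CPU number as a
parameter are assumptions; kernel code would use the per-CPU
infrastructure instead:

```c
#include <stdint.h>

#define NR_CPUS_SKETCH 4            /* hypothetical CPU count */
#define CACHELINE 64                /* assumed cache line size */

struct percpu_counter_sketch {
    struct {
        uint64_t count;
        /* padding keeps each CPU's slot in its own cache line */
        char pad[CACHELINE - sizeof(uint64_t)];
    } slot[NR_CPUS_SKETCH];
};

/* Update only the caller's own slot. No lock is shared between CPUs,
 * but the increment must not be interrupted by another update on the
 * same CPU (in the kernel: an interrupt handler or preemption). */
static void pc_inc(struct percpu_counter_sketch *c, unsigned int cpu)
{
    c->slot[cpu].count++;
}

/* Readers pay the price: they have to sum over all CPUs. */
static uint64_t pc_read(const struct percpu_counter_sketch *c)
{
    uint64_t sum = 0;
    unsigned int cpu;

    for (cpu = 0; cpu < NR_CPUS_SKETCH; cpu++)
        sum += c->slot[cpu].count;
    return sum;
}
```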

The current interface allows the lock to be used this way:

lock(stat_x->interface);
statistics_inc_nolock(stat_x, y);
statistics_inc_nolock(stat_m, n);
statistics_add_nolock(stat_a, e, f);
unlock(stat_x->interface);

Because we hold the same lock when creating output for users, coherency
of several statistics of a single entity can be achieved if statistic
updates are done within one critical section as shown above.

The lock is also used to make sure that updates to a statistic don't
happen while the setup of a statistic is changed by users. If we get rid
of the indirect function call, some of these setup changes go away
anyway. Other cases, like statistic resets or inflating a 5-counter
histogram to a 25-counter histogram, don't go away. If only I could
figure out how to reallocate, say, an array of counters for a histogram
without holding a lock while updates happen... Maybe I could temporarily
turn off a statistic.



> Take a look at many other subsystems - they do per CPU counters etc.
> to make this all fast.

I am looking into per-CPU data.

But is this really required for _all_ statistics? I see that it makes
sense to have per-CPU optimizations for very critical components, like
parts of VM. But there are still a lot of do-it-yourself type statistics
around that use an atomic_t, for example, without implementing it per
CPU.

Then, I am not sure yet whether per-CPU data is feasible for histograms
and other more complex statistics. I have got to find out.

I tried to write the code in a way that allows adding other statistic
types, like counters, histograms and so on, with moderate effort. Maybe
I can use the internal interface to plug in some discipline based on
per-CPU counters...



> But it's still unclear why it would need such an heavyweight
> infrastructure. Normally it's not that bad to reimplemented on the
> fly. Maybe some common code can be refactored out of that, but
> probably not too much.
>
> [... lots of other code snipped ... ]
>
> Looks all very very overdesigned to me. How about you just start
> with a minimum specification and describe what you want to do?

As a device driver programmer I don't want to reinvent the wheel when
coding statistics. I would prefer to use a few easy-to-use library
functions. I don't want to worry about getting my personal wheel to
work. I'd rather use my time to worry about which kind of data is
really needed and which is not.

I'd like to provide a tool that can be customized at run time to a
certain degree because it might not be acceptable for customers to
install private kernels in order to get tuned statistics. As a device
driver programmer I can make an educated guess, at best, about certain
parameters that impact the processing of statistic data. Users might
know better whether they need to focus on latencies from 2 ms up to
64 ms or from 100 ms up to 500 ms, because this kind of decision
depends on the environment to be measured, e.g. the devices attached.
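
To make the run-time tuning argument concrete, here is a sketch of a
latency histogram whose range is a parameter rather than a compile-time
constant. The names, the linear bucket layout, and the maximum of 25
buckets are assumptions for illustration, not the patch's actual
layout:

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define LAT_HIST_MAX 25             /* hypothetical upper bound */

struct lat_hist {
    uint64_t lo;                    /* values below lo go to bucket 0 */
    uint64_t hi;                    /* values >= hi go to the last bucket */
    size_t n;                       /* buckets in use, 3..LAT_HIST_MAX */
    uint64_t bucket[LAT_HIST_MAX];
};

/* (Re)configure the range at run time; this also resets the counters. */
static int lat_hist_setup(struct lat_hist *h, uint64_t lo, uint64_t hi,
                          size_t n)
{
    if (n < 3 || n > LAT_HIST_MAX || hi <= lo)
        return -1;
    memset(h, 0, sizeof(*h));
    h->lo = lo;
    h->hi = hi;
    h->n = n;
    return 0;
}

/* The n - 2 inner buckets split [lo, hi) into equal-width ranges. */
static size_t lat_hist_index(const struct lat_hist *h, uint64_t ms)
{
    if (ms < h->lo)
        return 0;
    if (ms >= h->hi)
        return h->n - 1;
    return 1 + (size_t)((ms - h->lo) * (h->n - 2) / (h->hi - h->lo));
}

static void lat_hist_add(struct lat_hist *h, uint64_t ms)
{
    h->bucket[lat_hist_index(h, ms)]++;
}
```

A user measuring fast devices might call lat_hist_setup(&h, 2, 64, 6),
while one with slow devices might prefer lat_hist_setup(&h, 100, 500, 6);
the driver code calling lat_hist_add() stays the same in both cases.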

In a device driver, I don't want to spend much thought on the
statistic's user interface, particularly not if the statistic is a
little bit more complex than a simple counter. It would be really nice
to have a user interface that looks the same for all exploiters, i.e.
to have common output formats for counters, histograms, fill level
indicators etc.

Martin

