Re: [Patch 7/7] Generic netlink interface (delay accounting)

Jamal,

Pls keep lkml and lse-tech on cc since some of this affects the usage
of delay accounting.


jamal wrote:

Hi Shailabh,
Apologies for taking a week to respond ..
On Mon, 2006-27-02 at 15:26 -0500, Shailabh Nagar wrote:
jamal wrote:
Yes, the current intent is to allow multiple listeners to receive the
responses sent by the kernel.
Responses or events? There is a difference:
Response implies the program in user space requested (ex a GET) for that
information and is receiving such info.
Event implies the program in user space asked to be informed of changes
in the kernel. Example an exit would be considered an event.
Events are received by virtue of registering to a multicast group.
[..]
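
For illustration, a user-space event listener in the sense described above
might look like the sketch below. It is not taken from the patch: the
function names are hypothetical, and the protocol and group number passed in
are whatever the kernel side actually uses. The calls themselves (socket,
bind, and setsockopt with NETLINK_ADD_MEMBERSHIP) are the standard netlink
ones.

```c
#include <string.h>
#include <unistd.h>
#include <sys/socket.h>
#include <linux/netlink.h>

#ifndef SOL_NETLINK
#define SOL_NETLINK 270 /* kernel's value; some libc headers do not define it */
#endif

/* Address a listener binds to. nl_pid = 0 lets the kernel pick the port id;
 * nl_groups is left empty because the group is joined via setsockopt below. */
struct sockaddr_nl listener_addr(void)
{
    struct sockaddr_nl sa;

    memset(&sa, 0, sizeof(sa));
    sa.nl_family = AF_NETLINK;
    return sa;
}

/* Open a netlink socket and subscribe it to one multicast group.
 * Returns the socket fd, or -1 if any step fails. */
int open_event_listener(int protocol, unsigned int group)
{
    struct sockaddr_nl sa = listener_addr();
    int fd = socket(AF_NETLINK, SOCK_RAW, protocol);

    if (fd < 0)
        return -1;
    if (bind(fd, (struct sockaddr *)&sa, sizeof(sa)) < 0 ||
        setsockopt(fd, SOL_NETLINK, NETLINK_ADD_MEMBERSHIP,
                   &group, sizeof(group)) < 0) {
        close(fd);
        return -1;
    }
    return fd;
}
```

A sender of GETs would use a second, separate socket that does not join the
group, which gives the "two connections" mentioned in this thread.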

My design was to have the listener get both responses (what I call
replies in the code) as well as events (data sent on exit of pid)

Since this interface (taskstats) is currently designed for that
possibility, having multiple listeners, one for
each "component" such as delay accounting, is the model we're using.
We expect each component to have a pair of userspace programs, one for
sending commands and the other to "listen" to all replies + data
generated on task exits.
You need to have a sender of GETs essentially and a listener of events.
Those are two connections. The replies of a get from user1 will not be
sent to user2 as well - unless ... that's what you are trying to achieve;
the question is why?

Yes, I was trying to have an asymmetric model where the userspace sender
of GETs doesn't receive the reply as a unicast. Rather the reply is sent
by multicast (along with all the event data).

Reason for this unintuitive design was to make it easier to process the
returned data.

The expected usage of delay accounting is to periodically "sample" the
delays for all tasks (or tgids) in the system. Also, get the delays from
exiting pids (let's forget how tgid exit is handled for now...irrelevant
to this discussion).

Using the above two pieces of data, userspace can aggregate the "delays"
seen by any grouping of tasks that it chooses to implement.

In this usage scenario, it's more efficient to have one receiver get both
response and event data and process in a loop.
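
A rough sketch of such a single receive loop follows. It only shows the
dispatch step over a buffer that has already been read from the socket. The
command values DEMO_CMD_REPLY and DEMO_CMD_EXIT_DATA and all demo_* names are
made up for this example and are not the identifiers used by the patch; the
NLMSG_* macros and struct genlmsghdr are the standard ones from the netlink
headers.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>
#include <linux/netlink.h>
#include <linux/genetlink.h>

/* Hypothetical command values, chosen for this sketch only. */
enum { DEMO_CMD_REPLY = 1, DEMO_CMD_EXIT_DATA = 2 };

struct demo_counts {
    unsigned int replies; /* data sent in answer to a GET */
    unsigned int exits;   /* data sent unrequested on task exit */
    unsigned int other;   /* anything else, including short messages */
};

/* Walk every netlink message in buf and classify it by genetlink cmd. */
static void demo_process(void *buf, int len, struct demo_counts *c)
{
    struct nlmsghdr *nlh;

    for (nlh = buf; NLMSG_OK(nlh, len); nlh = NLMSG_NEXT(nlh, len)) {
        struct genlmsghdr *g;

        if (nlh->nlmsg_len < NLMSG_LENGTH(GENL_HDRLEN)) {
            c->other++; /* too short to hold a genetlink header */
            continue;
        }
        g = NLMSG_DATA(nlh);
        if (g->cmd == DEMO_CMD_REPLY)
            c->replies++;
        else if (g->cmd == DEMO_CMD_EXIT_DATA)
            c->exits++;
        else
            c->other++;
    }
}

/* Append one header-only fake message at offset off; returns the new offset. */
static size_t demo_put(char *buf, size_t cap, size_t off, unsigned char cmd)
{
    struct nlmsghdr *nlh;
    struct genlmsghdr *g;

    assert(off + NLMSG_SPACE(GENL_HDRLEN) <= cap);
    nlh = (struct nlmsghdr *)(buf + off);
    memset(nlh, 0, NLMSG_SPACE(GENL_HDRLEN));
    nlh->nlmsg_len = NLMSG_LENGTH(GENL_HDRLEN);
    g = NLMSG_DATA(nlh);
    g->cmd = cmd;
    return off + NLMSG_SPACE(GENL_HDRLEN);
}

/* Usage sketch: one reply and two exit records arriving on the same socket. */
static struct demo_counts demo_run(void)
{
    struct demo_counts c = { 0, 0, 0 };
    size_t cap = 256, n = 0;
    char *buf = calloc(1, cap);

    if (!buf)
        return c;
    n = demo_put(buf, cap, n, DEMO_CMD_REPLY);
    n = demo_put(buf, cap, n, DEMO_CMD_EXIT_DATA);
    n = demo_put(buf, cap, n, DEMO_CMD_EXIT_DATA);
    demo_process(buf, (int)n, &c);
    free(buf);
    return c;
}
```

In a real utility the buffer would come from recv() on the listening socket,
and the two branches would feed the same per-group aggregation.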

However, we could switch to the model you suggest and use a
multithreaded send/receive userspace utility.

The listener is expected to register/deregister interest through
TASKSTATS_CMD_LISTEN and IGNORE.
It is not necessary if you follow the model i described.
How does this correlate to TASKSTATS_CMD_LISTEN/IGNORE?
See above. It's mainly an optimization so that if no listener is present,
there's no need to generate the data.
Also not necessary - There is a recent netlink addition to make sure
that events dont get sent if no listeners exist.
genetlink needs to be extended. For now assume such a thing exists.

Ok. Will this addition work for both unicast and multicast modes ?

Good point. Should check for users sending it as a cmd and treat it as a
noop.
More like return an -EINVAL

Will this be necessary ? Isn't genl_rcv_msg() going to return a -EOPNOTSUPP
automatically for us since we've not registered the command ?
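
The behaviour being asked about can be modelled in plain user-space C as
below. This is only a toy model of "dispatch by registered command", not the
kernel's genl_rcv_msg(); every name in it is hypothetical, and whether the
real dispatcher already rejects unregistered commands this way is exactly the
open question here.

```c
#include <errno.h>
#include <stddef.h>

/* Hypothetical commands: GET is registered, REPLY is kernel-to-user only. */
enum { DEMO_CMD_GET = 1, DEMO_CMD_REPLY = 2 };

struct demo_op {
    unsigned char cmd;
    int (*doit)(void);
};

static int demo_get(void)
{
    return 0; /* a real handler would build and send the reply */
}

/* Only commands that user space may send are listed here. */
static const struct demo_op demo_ops[] = {
    { DEMO_CMD_GET, demo_get },
};

/* Look the command up; anything not registered is refused by the dispatcher,
 * so the family itself never sees it and needs no explicit check. */
static int demo_rcv(unsigned char cmd)
{
    size_t i;

    for (i = 0; i < sizeof(demo_ops) / sizeof(demo_ops[0]); i++)
        if (demo_ops[i].cmd == cmd)
            return demo_ops[i].doit();
    return -EOPNOTSUPP;
}
```

If the dispatcher works like this, a user sending the reply-only command gets
an error without any extra code in the family; returning -EINVAL instead
would require registering the command with a handler that rejects it.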

I'm just using this as a placeholder for data that's returned without
being requested.

So it is unconditional?

Yes.

Come to think of it, there's no real reason to have a genlmsghdr for
returned data, is there?
All messages should be consistent whether they are sent from user
or kernel.

Ok. will retain genetlink header.

Other than to copy the genlmsghdr that was sent so user can identify
which command was sent
(and I'm doing that through the reply type, perhaps redundantly).
yes, that is a useful trick. Just make sure they are reflected
correctly.
Actually, the next iteration of the code will move to dynamically
generated ID. But yes, will need to check for that.
Also if you can provide feedback whether the doc i sent was any use
and what wasn't clear etc.

Will do.

Thanks for the review.
Couple of questions about general netlink:
is it intended to remain a size that will always be aligned to the
NLMSG_ALIGNTO so that (NLMSG_DATA(nlhdr) + GENL_HDRLEN) can always
be used as a pointer to the genlmsghdr ?
I am not sure i followed.
The whole message (nlhdr, genlhdr, optionalhdr, TLVs) has to be in
the end 32 bit aligned.

Ok, so separate padding isn't needed to make the genlhdr, optionalhdr
and TLV parts aligned too.

Adding some macros like genlmsg_data(nlh) would be handy (currently I
just define and use it locally).
Send a patch.

will do.
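
As a sketch of what such locally defined helpers could look like in user
space (the demo_* names are hypothetical and this is not the patch that was
asked for): NLMSG_DATA(nlh) points at the genlmsghdr itself, and adding
GENL_HDRLEN to it gives the start of whatever follows the genetlink header.
Because GENL_HDRLEN is already rounded up with NLMSG_ALIGN, no extra padding
is added by hand.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>
#include <linux/netlink.h>
#include <linux/genetlink.h>

/* Hypothetical local helper: the genetlink header sits right after nlmsghdr. */
static struct genlmsghdr *demo_genlmsg_hdr(struct nlmsghdr *nlh)
{
    return (struct genlmsghdr *)NLMSG_DATA(nlh);
}

/* Hypothetical local helper: the optional header or TLVs follow it. */
static void *demo_genlmsg_data(struct nlmsghdr *nlh)
{
    return (char *)NLMSG_DATA(nlh) + GENL_HDRLEN;
}

/* Byte offset of the genetlink payload from the start of the message. */
static size_t demo_payload_offset(void)
{
    struct nlmsghdr *nlh = calloc(1, NLMSG_SPACE(GENL_HDRLEN + 8));
    size_t off;

    if (!nlh)
        return 0;
    /* Both headers are multiples of NLMSG_ALIGNTO, so the payload is too. */
    assert(NLMSG_HDRLEN % NLMSG_ALIGNTO == 0);
    assert(GENL_HDRLEN % NLMSG_ALIGNTO == 0);
    assert((char *)demo_genlmsg_hdr(nlh) == (char *)nlh + NLMSG_HDRLEN);
    off = (size_t)((char *)demo_genlmsg_data(nlh) - (char *)nlh);
    free(nlh);
    return off;
}
```

With the current 16-byte nlmsghdr and 4-byte genlmsghdr the payload starts at
offset 20, which is a multiple of NLMSG_ALIGNTO.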


Thanks,
Shailabh

cheers,
jamal


Follow-Ups:
- Re: [Lse-tech] Re: [Patch 7/7] Generic netlink interface (delay accounting)
  - From: jamal <hadi@cyberus.ca>

References:
- [Patch 0/7] Per-task delay accounting
  - From: Shailabh Nagar <nagar@watson.ibm.com>
- [Patch 7/7] Generic netlink interface (delay accounting)
  - From: Shailabh Nagar <nagar@watson.ibm.com>
