Re: [linux-pm] PowerOP 0/3: System power operating point management API

Patrick Mochel wrote:

On Mon, 8 Aug 2005, Todd Poynor wrote:

(apologies for use of obsolete cpufreq mailing list address in my initial message.)

...

PowerOP is intended to leave all power
policy decisions to higher layers.

What do those higher layers look like? Do you have a userspace component
that uses this interface?

cpufreq is one example: it manages an abstraction of system power/performance levels based on cpu speed, which maps onto the PowerOP-level hardware capabilities in some fashion, and has both kernel and userspace components to manage the desired policy associated with this. Regardless of whether this notion of configurable operating points would remain a separate layer from cpufreq or be more tightly integrated, the code to set these operating points can handle things such as setting validated voltage levels to match cpu speeds, etc.

For embedded systems, I am aware only of the Dynamic Power Management project, which you also mention and which does indeed manage power policy based on the notions of power parameters and operating points. The settings of these are configured entirely from userspace via sysfs, using shell scripts or convenience libraries that access the sysfs attributes. A system designer chooses the operating points to be employed in the system based on the information from the processor or board vendor that describes validated, supported operating points and based on the characteristics of the system (how fast it needs to run while in use for different purposes and how much battery power can be spent for those purposes).

For example, a designer implementing a system based on an Intel XScale PXA27x processor can choose from among about 16 validated operating points listed in the most recent specification update. Those operating points consist of register settings with inscrutable names such as CCCR[L], CCCR[2N], CLKCFG[T], CCCR[A], and two or three others. A few of those operating points run the CPU at identical frequencies, but differ in memory clocking, system bus clocking, and the ability to quickly switch between certain cpu frequencies based on other properties of the platform (so-called "Turbo-mode" frequency scaling). A DPM- or PowerOP-based system can be configured with the subset of desired operating points and a particular operating point activated as needed. The policy decision as to what operating point is appropriate to activate is a matter for custom code provided by the designer, tailored to their system. It is also possible to write automated operating point selection algorithms based on such criteria as system busyness.
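
As a rough illustration of the idea (a sketch only, written in Python for brevity rather than as kernel code): a designer configures a small table of operating points and a selection routine picks one based on system busyness. Every name and number below is a made-up placeholder; none of it is taken from the PXA27x specification update or from the DPM or PowerOP code, and a real operating point would carry the register settings mentioned above rather than these simplified fields.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OperatingPoint:
    """One configured operating point (hypothetical, simplified fields)."""
    name: str
    cpu_mhz: int    # cpu clock
    core_mv: int    # core voltage in millivolts
    bus_mhz: int    # system bus clock
    turbo: bool     # whether quick "turbo" switching is available


# Hypothetical subset chosen by the system designer. The values are
# placeholders for illustration, NOT validated settings for any processor.
CONFIGURED_POINTS = (
    OperatingPoint("low", 100, 900, 100, False),
    OperatingPoint("mid", 200, 1000, 100, False),
    OperatingPoint("mid-fastbus", 200, 1100, 200, False),  # same cpu speed, faster bus
    OperatingPoint("high", 400, 1300, 200, True),
)


def select_point(points, busy_fraction):
    """Pick the slowest configured point whose cpu speed covers the load.

    busy_fraction is the observed busyness (0.0 to 1.0) measured relative
    to the fastest configured point. Ties on cpu speed are broken in
    favour of the lower core voltage.
    """
    if not 0.0 <= busy_fraction <= 1.0:
        raise ValueError("busy_fraction must be between 0.0 and 1.0")
    fastest = max(p.cpu_mhz for p in points)
    needed_mhz = busy_fraction * fastest
    candidates = [p for p in points if p.cpu_mhz >= needed_mhz]
    return min(candidates, key=lambda p: (p.cpu_mhz, p.core_mv))
```

Busyness is only one possible criterion; a designer's own policy code could just as well choose among points with identical cpu speeds according to which workload needs the faster bus.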

Who is using this code? Are there vendors that are already shipping
systems with this enabled?

Is this part of the DPM project? If so, what other components are left in
DPM?

The concepts and general Linux implementation of power parameters and operating points stem from the power-aware computing work done by Bishop Brock and Karthick Rajamani of IBM Research, and a somewhat different implementation is a part of the DPM project, which MontaVista (and reportedly others in the near future) does ship. So far as I understand there are or soon will be mobile phones that use that code as the low- to mid-layers of the power management stack (the high-layer policy management is performed by a custom application of which I have no knowledge).

I mentioned in a previous email the next step of creating and activating operating points from userspace. If that were in place, DPM would additionally consist primarily of:

1. Machine-specific backends to set operating points for the systems that DPM has been ported to. If something like PowerOP is accepted into a broader community then that code would come along for the ride. XScale PXA27x and various ARM OMAPs are among the systems supported, as well as potentially others not yet making an appearance in open source.

2. DPM has further concepts of "operating state" (generally, whether the system is idle, processing interrupts, running a normal-power-usage task, running a background task without deadlines that can be assigned a low power/performance level, etc.) and the unfortunately-named "policy" that maps each operating state to an operating point, along with the code to switch in different operating points as the system switches operating states. The "policy" is a bit of a misnomer; a system designer must create the desired operating points and decide upon the appropriate state -> point mappings, as well as make decisions on when to update the mappings based on external events, changing workloads, etc. There are a few extra ramifications of modifying operating points in this fashion, including the need to handle such transitions while in interrupt context or in the idle loop, as well as a general concern for low overhead since switching may occur very frequently (such as at every entry and exit from idle).

3. Kernel-to-userspace power event notification is temporarily based on executing hotplug scripts. This is outside the true domain of DPM, but in the absence of an acpid-like de facto standard for communicating power events it seemed best to provide some sort of mechanism. kobject uevents are now the proper choice, and I'd propose use of that, as a separate matter from what I'm hoping to accomplish with PowerOP or the rest of DPM.

All of these are GPL software available on the project site.
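
To make the state -> point mapping in item 2 a little more concrete, here is a minimal sketch, again in Python for brevity and not taken from the DPM sources. The state names, the StateMapper class, and the set_point callback are all hypothetical; the callback stands in for a machine-specific backend that would actually program the hardware.

```python
from enum import Enum


class OperatingState(Enum):
    """Hypothetical operating states, loosely following the list in item 2."""
    IDLE = "idle"
    INTERRUPT = "interrupt"
    TASK = "task"
    BACKGROUND = "background"


class StateMapper:
    """Maps each operating state to an operating point (the DPM "policy")."""

    def __init__(self, mapping, set_point):
        missing = set(OperatingState) - set(mapping)
        if missing:
            names = sorted(s.name for s in missing)
            raise ValueError(f"no operating point for states: {names}")
        self._mapping = dict(mapping)
        self._set_point = set_point  # stand-in for a machine-specific backend
        self._active = None

    def enter_state(self, state):
        """Switch in the point mapped to the new state, if it differs."""
        point = self._mapping[state]
        # Skip redundant switches: state changes may be very frequent
        # (for example at every entry to and exit from idle).
        if point != self._active:
            self._set_point(point)
            self._active = point
        return point

    def remap(self, state, point):
        """Update one mapping, e.g. on an external event or workload change."""
        self._mapping[state] = point
```

Mapping two states to the same point (say, INTERRUPT and TASK) means moving between them costs no hardware transition at all, which is one way a designer might keep switching overhead down.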

What are your plans to integrate this more with the cpufreq code?

At this point it's a proposed layer that does not disturb existing cpufreq code much, but if the cpufreq folks are receptive to these ideas I'd be all for a tighter integration. Others have already asked for the ability to manage voltages along with cpu speed, so in one way or another it seems likely that an expanded set of power parameters may be provided in the future. But I don't have any insight into the wishes or goals of the project.

Thanks,

--
Todd

