On Wed, 18 Aug 2010 08:44:16 +0300 Gilboa Davara <gilboad@xxxxxxxxx> wrote: > On Tue, 2010-08-17 at 13:08 -0400, Steve Blackwell wrote: > > On Wed, 18 Aug 2010 02:12:16 +0930 > > Tim <ignored_mailbox@xxxxxxxxxxxx> wrote: > > > > > On Tue, 2010-08-17 at 12:05 -0400, Steve Blackwell wrote: > > > > I've been looking at my logs some more. I don't understand these > > > > messages: > > > > > > > > Aug 17 10:30:50 steve kernel: CPU0: Temperature above > > > > threshold, cpu clock throttled (total events = 455) > > > > Aug 17 10:30:50 steve kernel: CPU1: Temperature above > > > > threshold, cpu clock throttled (total events = 455) > > > > Aug 17 10:30:50 steve kernel: CPU1: Temperature/speed normal > > > > Aug 17 10:30:50 steve kernel: CPU0: Temperature/speed normal > > > > > > And the CPU overheating as well as your hard drive? > > > > > > Is the computer in a hot room? Are the fans working? Is the > > > ventilation blocked? Is the computer wedged in between things > > > that restrict airflow? Are things full of fluff and dust? > > > > > > > > Well it would seems so but I don't trust the messages. It doesn't > > seem reasonable that the CPUs go overtemp and then immediately cool > > down enough to be OK. > > Actually it is possible. > Your CPU has auto-throttle support. Read: When the CPU passes a > certain temperature threshold, it automatically clocks down (or > inserts NOPs) in-order to prevent is from burning out. Never the > less, if your machine's cooling is sufficient you shouldn't see this > message. > > If you CPU's high and low water mark are the same (E.g. 90C), the CPU > will reach 90C, throttle, and drop to 89C - all in one second. > I'd suggest you configure lm_sensros and monitor the CPU and board > temperature. > $ sensors-detect > $ /etc/init.d/lm_sensors restart > $ sensors -s > $ sensors > > - Gilboa > P.S. can you post your hardware configuration? > Running sensors-detect produced the same /etc/sysconfig/lm_sensors file that I already had. Running sensors shows this: # sensors atk0110-acpi-0 Adapter: ACPI interface Vcore Voltage: +1.42 V (min = +1.45 V, max = +1.75 V) +3.3 Voltage: +1.68 V (min = +3.00 V, max = +3.60 V) +5.0 Voltage: +1.62 V (min = +4.50 V, max = +5.50 V) +12.0 Voltage: +11.98 V (min = +11.20 V, max = +13.20 V) CPU FAN Speed: 56250 RPM (min = 0 RPM) CHASSIS FAN Speed: 0 RPM (min = 0 RPM) POWER FAN Speed: 0 RPM (min = 0 RPM) CPU Temperature: +62.0°C (high = +90.0°C, crit = +125.0°C) MB Temperature: +49.0°C (high = +70.0°C, crit = +125.0°C) Power Temperature: +24.0°C (high = +80.0°C, crit = +125.0°C) The first thing that jumps out at me is that I think I need a new PSU! How is this machine even running if 3 of the 4 voltages are low? The second thing is that the temps are just fine. So why do I keep getting these messages in the logs? Perhaps because the power rails are low? The chassis and power fans are 2 wire so no data. lshw dumps a lot of information. Anything in particular you are looking for? I think I have solved my lockup problem. I'll write a separate post about that. Thanks, Steve -- Changing lives one card at a time http://www.send1cardnow.com
Attachment:
signature.asc
Description: PGP signature
-- users mailing list users@xxxxxxxxxxxxxxxxxxxxxxx To unsubscribe or change subscription options: https://admin.fedoraproject.org/mailman/listinfo/users Guidelines: http://fedoraproject.org/wiki/Mailing_list_guidelines