freeipmi-users
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Freeipmi-users] PS Status showing Unrecognized Event after upgradin


From: Ingard Mevåg
Subject: Re: [Freeipmi-users] PS Status showing Unrecognized Event after upgrading a supermicro X8 mobo ipmimodule to fw 3.12
Date: Mon, 17 Feb 2014 13:51:56 +0100

Also if relevant, here is the output from ipmicfg -pminfo. I noticed that
the Slave Address is different here from the output of the -psfruinfo cmd.

address@hidden:~/ipmicfg_1.14.3_20130725/linux/64bit#
./ipmicfg-linux.x86_64.static -pminfo
 [SlaveAddress = 78h] [Module 1]
 Item                           |                Value
 ----                           |                -----
 Status                         | [NONE OF THE ABOVE](01h)
 AC Input Voltage               |              230.5 V
 AC Input Current               |               0.81 A
 DC 12V Output Voltage          |              12.18 V
 DC 12V Output Current          |               9.50 A
 Temperature 1                  |              29C/84F
 Temperature 2                  |               0C/32F
 Fan 1                          |             7296 RPM
 Fan 2                          |            10080 RPM
 DC 12V Output Power            |                114 W
 AC Input Power                 |                178 W
 PMBus Revision                 |
 PWS Serial Number              |
 PWS Module Number              |         PWS-1K41P-1R
 PWS Revision                   |               REV1.1

 [SlaveAddress = 7Ah] [Module 2]
 Item                           |                Value
 ----                           |                -----
 Status                         | [NONE OF THE ABOVE](01h)
 AC Input Voltage               |              229.5 V
 AC Input Current               |               0.64 A
 DC 12V Output Voltage          |              12.18 V
 DC 12V Output Current          |               9.50 A
 Temperature 1                  |              28C/82F
 Temperature 2                  |               0C/32F
 Fan 1                          |             7232 RPM
 Fan 2                          |             9632 RPM
 DC 12V Output Power            |                114 W
 AC Input Power                 |                138 W
 PMBus Revision                 |
 PWS Serial Number              |      ```````````````
 PWS Module Number              |         PWS-1K41P-1R
 PWS Revision                   |               REV1.1



2014-02-17 13:02 GMT+01:00 Ingard Mevåg <address@hidden>:

> Hi again Al and thanks for your help so far!
>
> After a lot of googling I've found this post which might have some of the
> information:
> http://www.tummy.com/articles/supermicro-ipmi-nagios-check/
>
> Below is the output of the cmds you suggested:
>
> address@hidden:~# ipmi-fru
> FRU Inventory Device: BMC FRU (ID 00h)
>
>   FRU Error: board info area checksum invalid
>
>   FRU Error: product info area checksum invalid
> address@hidden:~# ipmi-fru --bridge-fru
> FRU Inventory Device: BMC FRU (ID 00h)
>
>   FRU Error: board info area checksum invalid
>
>   FRU Error: product info area checksum invalid
> address@hidden:~# ipmi-sensors --bridge-sensors
> ID   | Name          | Type              | Reading    | Units | Event
> 4    | System Temp   | Temperature       | 30.00      | C     | 'OK'
> 71   | CPU Temp      | OEM Reserved      | N/A        | N/A   | 'OEM Event
> = 0000h'
> 138  | FAN 1         | Fan               | N/A        | RPM   | N/A
> 205  | FAN 2         | Fan               | 1695.00    | RPM   | 'OK'
> 272  | FAN 3         | Fan               | 4470.00    | RPM   | 'OK'
> 339  | FAN 4         | Fan               | N/A        | RPM   | N/A
> 406  | FAN 5         | Fan               | N/A        | RPM   | N/A
> 473  | CPU Vcore     | Voltage           | 0.86       | V     | 'OK'
> 540  | +3.3VCC       | Voltage           | 3.31       | V     | 'OK'
> 607  | +12 V         | Voltage           | 12.19      | V     | 'OK'
> 674  | CPU DIMM      | Voltage           | 1.54       | V     | 'OK'
> 741  | +5 V          | Voltage           | 5.15       | V     | 'OK'
> 808  | -12 V         | Voltage           | -12.48     | V     | 'OK'
> 875  | VBAT          | Voltage           | 3.25       | V     | 'OK'
> 942  | +3.3VSB       | Voltage           | 3.30       | V     | 'OK'
> 1009 | AVCC          | Voltage           | 3.31       | V     | 'OK'
> 1076 | Chassis Intru | Physical Security | N/A        | N/A   | 'OK'
> 1143 | PS Status     | Power Supply      | N/A        | N/A   |
> 'Unrecognized Event = 0100h' 'Unrecognized Event = 0200h' 'Unrecognized
> Event = 0400h' 'Unrecognized Event = 0800h' 'Unrecognized Event = 1000h'
> 'Unrecognized Event = 2000h' 'Unrecognized Event = 4000h'
> address@hidden:~# ipmi-sensors --shared-sensors
> ID   | Name          | Type              | Reading    | Units | Event
> 4    | System Temp   | Temperature       | 30.00      | C     | 'OK'
> 71   | CPU Temp      | OEM Reserved      | N/A        | N/A   | 'OEM Event
> = 0000h'
> 138  | FAN 1         | Fan               | N/A        | RPM   | N/A
> 205  | FAN 2         | Fan               | 1695.00    | RPM   | 'OK'
> 272  | FAN 3         | Fan               | 4655.00    | RPM   | 'OK'
> 339  | FAN 4         | Fan               | N/A        | RPM   | N/A
> 406  | FAN 5         | Fan               | N/A        | RPM   | N/A
> 473  | CPU Vcore     | Voltage           | 0.86       | V     | 'OK'
> 540  | +3.3VCC       | Voltage           | 3.31       | V     | 'OK'
> 607  | +12 V         | Voltage           | 12.19      | V     | 'OK'
> 674  | CPU DIMM      | Voltage           | 1.54       | V     | 'OK'
> 741  | +5 V          | Voltage           | 5.15       | V     | 'OK'
> 808  | -12 V         | Voltage           | -12.29     | V     | 'OK'
> 875  | VBAT          | Voltage           | 3.25       | V     | 'OK'
> 942  | +3.3VSB       | Voltage           | 3.30       | V     | 'OK'
> 1009 | AVCC          | Voltage           | 3.31       | V     | 'OK'
> 1076 | Chassis Intru | Physical Security | N/A        | N/A   | 'OK'
> 1143 | PS Status     | Power Supply      | N/A        | N/A   |
> 'Unrecognized Event = 0100h' 'Unrecognized Event = 0200h' 'Unrecognized
> Event = 0400h' 'Unrecognized Event = 0800h' 'Unrecognized Event = 1000h'
> 'Unrecognized Event = 2000h' 'Unrecognized Event = 4000h'
>
>
> Kind regards
> Ingard
>
>
>
> 2014-02-15 19:22 GMT+01:00 Al Chu <address@hidden>:
>
> Hi Ingard,
>>
>> I realized another possibility.  The sensor could be "shared".  So you
>> could try the --shared-sensors option.
>>
>> Al
>>
>> On Fri, 2014-02-14 at 10:26 -0800, Albert Chu wrote:
>> > Hi Ingard,
>> >
>> > Searching through the NEWS file it was released in FreeIPMI 1.0.2.
>> >
>> > Is it possible you're looking at FRU info?  (b/c the option is called
>> > -psfruinfo). You can try ipmi-fru with the --bridge-fru option and see
>> > if that works.
>> >
>> > Also try running ipmi-sensors with --bridge-sensors.  If that doesn't
>> > work, then this might be an OEM extension from Supermicro.  The Slave
>> > Addresses they list (0x70 and 0x72) are not the defaults.  If
>> > ipmi-sensors cannot find this sensor in the SDR (sensor data
>> > repository), then Supermicro is getting this through some other means
>> > that isn't standard.
>> >
>> > If you ask Supermicro for the OEM extension information, then it's
>> > possible it could be added into FreeIPMI.
>> >
>> > Al
>> >
>> > It appears in the ipmicfg example below that they are going through the
>> FRU to get information.
>> >
>> > On Fri, 2014-02-14 at 08:14 +0100, ingard Mevåg wrote:
>> > > Hi Al
>> > >
>> > > Thanks for the information. I had already tried to reset the device,
>> but running with --ignore-unrecognized-events did the trick :)
>> > > At least my monitoring is happy now for this one node running latest
>> beta. I was wondering if you knew when this feature got introduced and/or
>> if there are deb packages for the ubuntu LTS releases somewhere? I've got
>> quite a few Lucid and Precise servers running version 0.7.15 and 0.8.12.
>> > >
>> > > Also, is it possible to get information per PSU somehow? Supermicro's
>> ipmicfg gives the following for instance:
>> > >
>> > > address@hidden:~/ipmicfg_1.14.3_20130725/linux/64bit#
>> ./ipmicfg-linux.x86_64 -psfruinfo
>> > >  [SlaveAddress = 70h] [Module 1]
>> > >  Item                           |                Value
>> > >  ----                           |                -----
>> > >  Status                         |                   On
>> > >  Temperature                    |              29C/84F
>> > >  Fan 1                          |             7213 RPM
>> > >  Fan 2                          |            10076 RPM
>> > >
>> > >  [SlaveAddress = 72h] [Module 2]
>> > >  Item                           |                Value
>> > >  ----                           |                -----
>> > >  Status                         |                   On
>> > >  Temperature                    |              28C/82F
>> > >  Fan 1                          |             7213 RPM
>> > >  Fan 2                          |             9732 RPM
>> > >
>> > > Regards
>> > > Ingard
>> > >
>> > > On 13 Feb 2014, at 17:12, Al Chu <address@hidden> wrote:
>> > >
>> > > > Hi Ingard,
>> > > >
>> > > > This sounds familiar, although I cannot recall how to fix it
>> through the
>> > > > firmware. It's possible a cold reset of the BMC could to it.   You
>> can
>> > > > do a cold reset via
>> > > >
>> > > >> bmc-device --cold-reset
>> > > >
>> > > > If that doesn't work, you can tell ipmimonitoring to ignore
>> unrecognized
>> > > > events via the --ignore-unrecognized-events option.  After that
>> > > > everything should work.
>> > > >
>> > > > Al
>> > > >
>> > > > On Thu, 2014-02-13 at 16:17 +0100, Ingard Mevåg wrote:
>> > > >> Hi guys
>> > > >>
>> > > >> I've been upgrading the firmware on the ipmi module on some
>> > > >> supermicro X8SIU recently and I'm now experiencing problems with
>> the PS
>> > > >> Status sensor. Is there anything I can do to make the sensor
>> detect the
>> > > >> PSUs properly?
>> > > >>
>> > > >> Link to board:
>> > > >>
>> http://www.supermicro.com/products/motherboard/Xeon3000/3400/X8SIU.cfm?IPMI=Y
>> > > >> The latest firmware as of now is version 3.12.
>> > > >> Output from ipmimonitoring:
>> > > >> address@hidden:/usr/local# ./sbin/ipmimonitoring -V
>> > > >> ipmi-sensors - 1.4.0.beta0
>> > > >>
>> > > >> address@hidden:/usr/local# ./sbin/ipmimonitoring
>> > > >> ID   | Name          | Type              | State    | Reading    |
>> Units |
>> > > >> Event
>> > > >> 4    | System Temp   | Temperature       | Nominal  | 32.00      |
>> C     |
>> > > >> 'OK'
>> > > >> 71   | CPU Temp      | OEM Reserved      | N/A      | N/A        |
>> N/A   |
>> > > >> 'OEM Event = 0000h'
>> > > >> 205  | FAN 2         | Fan               | Nominal  | 1695.00    |
>> RPM   |
>> > > >> 'OK'
>> > > >> 272  | FAN 3         | Fan               | Nominal  | 4655.00    |
>> RPM   |
>> > > >> 'OK'
>> > > >> 473  | CPU Vcore     | Voltage           | Nominal  | 0.85       |
>> V     |
>> > > >> 'OK'
>> > > >> 540  | +3.3VCC       | Voltage           | Nominal  | 3.31       |
>> V     |
>> > > >> 'OK'
>> > > >> 607  | +12 V         | Voltage           | Nominal  | 12.19      |
>> V     |
>> > > >> 'OK'
>> > > >> 674  | CPU DIMM      | Voltage           | Nominal  | 1.54       |
>> V     |
>> > > >> 'OK'
>> > > >> 741  | +5 V          | Voltage           | Nominal  | 5.15       |
>> V     |
>> > > >> 'OK'
>> > > >> 808  | -12 V         | Voltage           | Nominal  | -12.48     |
>> V     |
>> > > >> 'OK'
>> > > >> 875  | VBAT          | Voltage           | Nominal  | 3.25       |
>> V     |
>> > > >> 'OK'
>> > > >> 942  | +3.3VSB       | Voltage           | Nominal  | 3.30       |
>> V     |
>> > > >> 'OK'
>> > > >> 1009 | AVCC          | Voltage           | Nominal  | 3.31       |
>> V     |
>> > > >> 'OK'
>> > > >> 1076 | Chassis Intru | Physical Security | Nominal  | N/A        |
>> N/A   |
>> > > >> 'OK'
>> > > >> 1143 | PS Status     | Power Supply      | N/A      | N/A        |
>> N/A   |
>> > > >> 'Unrecognized Event = 0100h' 'Unrecognized Event = 0200h'
>> 'Unrecognized
>> > > >> Event = 0400h' 'Unrecognized Event = 0800h' 'Unrecognized Event =
>> 1000h'
>> > > >> 'Unrecognized Event = 2000h' 'Unrecognized Event = 4000h'
>> > > >>
>> > > >> Kind Regards
>> > > >> Ingard
>> > > >> _______________________________________________
>> > > >> Freeipmi-users mailing list
>> > > >> address@hidden
>> > > >> https://lists.gnu.org/mailman/listinfo/freeipmi-users
>> > > > --
>> > > > Albert Chu
>> > > > address@hidden
>> > > > Computer Scientist
>> > > > High Performance Systems Division
>> > > > Lawrence Livermore National Laboratory
>> > > >
>> > >
>> --
>> Albert Chu
>> address@hidden
>> Computer Scientist
>> High Performance Systems Division
>> Lawrence Livermore National Laboratory
>>
>>
>
>
> --
> Ingard Mevåg
> Driftssjef
> JottaCloud
>
> Mobil: +47 450 22 834
> E-post: address@hidden
> Webside: www.jottacloud.com
>



-- 
Ingard Mevåg
Driftssjef
JottaCloud

Mobil: +47 450 22 834
E-post: address@hidden
Webside: www.jottacloud.com


reply via email to

[Prev in Thread] Current Thread [Next in Thread]