Oracle FAQ Your Portal to the Oracle Knowledge Grid
HOME | ASK QUESTION | ADD INFO | SEARCH | E-MAIL US
 

Home -> Community -> Mailing Lists -> Oracle-L -> Re:RE: Re[2]: Sun Boxes Crashing

Re:RE: Re[2]: Sun Boxes Crashing

From: Tommy Pham <TPham_at_specialized.com>
Date: Fri, 08 Sep 2000 15:52:33 -0700
Message-Id: <10613.116564@fatcity.com>


--MIME MULTIPART BOUNDARY=.968453578:+'1

Content-Type: text/plain; charset=US-ASCII
Content-Id: <1606142899-2_at_specialized.com>
Content-Transfer-Encoding: 7bit

The command is prtdiag -v, requires root priviledge. Here's a partial output, pay attention to the current column.

System Temperatures (Celsius):



Brd State Current Min Max Trend
--- ------- ------- --- --- -----
 0      OK       29     26   34  stable
 1     OK      35    34  38  stable
 2      OK       31     27   35  stable
 4      OK       30     26   34  stable
 5      OK       34     32   36  stable
 6      OK       31     27   35  stable
 8      OK       30     26   35  stable
10      OK       33     30   37  stable
12      OK       31     28   36  stable
14      OK       33     29   37  stable
CLK     OK       28     26   29  stable

The I/O board (1) always/most of the time has higher temperature.

TP

        ,--o         ,--o         ,--o          ,--o
      _-\_<_        _-\_<_       _-\_<_       _-\_<_  let's bike!!! 
     (*)/-'(*) ___ (*)/-'(*) __ (*)/-'(*) __ (*)/-(*)        



____________________Reply Separator____________________
Subject: RE: Re[2]: Sun Boxes Crashing Author: ORACLE-L_at_fatcity.com
Date: 9/8/00 9:05 AM

Please explain how to get the temp. prtdiad -v does not do it for me. Could you ost your output?

> -----Original Message-----

> From: root_at_fatcity.com [mailto:root_at_fatcity.com]On Behalf Of Molina,
> Gerardo
> Sent: Thursday, September 07, 2000 8:11 PM
> To: Multiple recipients of list ORACLE-L
> Subject: RE: Re[2]: Sun Boxes Crashing
>
>
> You mean low 30's degrees Celsius right?
>
> One of my CPU boards on an E4500 shows a max temp of 39 degrees C.
>
> Has Sun recommended an acceptable max temp to avoid this problem?
>
> TIA,
> Gerardo
>

> -----Original Message-----
> Sent: Thursday, September 07, 2000 11:56 AM
> To: Multiple recipients of list ORACLE-L
>
>
> We had the parameters set since day one.   It didn't correct the problem.
> One
> thing that has helped/kept our system (E6500) up and running by
> cooling down
> our
> computer room.   Prtdiag -v shows the memory/cpu boards are in
> the low 30's
> F.
>  We still don't an answer from Sun yet.
>
> TP
> Sr. Apps DBA / Solaris Systems Admin.
>
>
>
>
>         ,--o         ,--o         ,--o          ,--o
>       _-\_<_        _-\_<_       _-\_<_       _-\_<_  let's bike!!!
>      (*)/-'(*) ___ (*)/-'(*) __ (*)/-'(*) __ (*)/-(*)
>
>
>
> ____________________Reply Separator____________________
> Author: ORACLE-L_at_fatcity.com
> Date:       9/6/00 10:20 PM
>
>
>
> Sounds exactly like the type of problem we experienced recently (same
> environment)...here is a response direct from Sun techo's:
>
> I've had a look at the crash dump and the problem appears to be
> that a kernel thread's stack overflowed.
>
> This, unfortunately, happens occasionally if you run with too many levels
> of different filesystems/device layers. (In your case vxfs/vxio/sd)
>
> The only real way to fix/workaround the problem is to increase the size of
> the
> kernel stacks.
>
> i'd suggest you add
>
> set rpcmod:svc_run_stksize=0x4000
> set lwp_default_stksize=0x4000
>
> to /etc/system then reboot the domain to take effect.
>
> the only reason it happened is because the kernel thread was
> interrupted to
> service a disk request. This pushed the stack over the normal
> 0x2000 limit.
>
> The crash was sufficient to corrupt one of our largest datafiles and
> appeared to
> occur after running a large insert batch job in parallel.
>
> We only made these changes yesterday afternoon so obviously it's too early
> to
> qualify!
>
> Regards
> Grant
>
>
>
>
>
> "Rama Malladi" <rmalladi_at_inteliant.com> on 07/09/2000 08:40:38
>
> Please respond to ORACLE-L_at_fatcity.com
>
>
> To:   Multiple recipients of list ORACLE-L <ORACLE-L_at_fatcity.com>
> cc:    (bcc: GRANT G HOLYOAKE/NSO/CSDA)
>
>
>
> We have several Sun boxes (Solaris 2.6) running Oracle 8, 8i. One of the
> boxes (description given below) Kept rebooting and this machine happens to
> run one of the most critical billing systems (Murphy's law!).
>
> Overall, this machine rebooted some 40 times, in a period of 2 months and
> some nights, it rebooted as many as 10 times! Our SysAdmin contacted Sun
> Engineers and they never told us what exactly was the problem, and kept
> replacing CPUs, Memory boards, SCSI cards etc ... This happened several
> times and last week there was an article in Computer Weekly
> magazine saying
> several customers were having this kind of problem on Sun boxes and Sun
> tried to hush up the matter ...!!
>
> Has anybody else faced this kind of situation?
>
> Just curious ...
> Rama
>
> =================================
> System Configuration:  Sun Microsystems  sun4u 8-slot Sun Enterprise
> E4500/E5500
> SunOS uscaelmux06 5.6 Generic_105181-21 sun4u sparc SUNW,Ultra-Enterprise
>
> --
> Author: Rama Malladi
>   INET: rmalladi_at_inteliant.com
>
> Fat City Network Services    -- (858) 538-5051  FAX: (858) 538-5051
> San Diego, California        -- Public Internet access / Mailing Lists

> --------------------------------------------------------------------
> To REMOVE yourself from this mailing list, send an E-Mail message > to: ListGuru_at_fatcity.com (note EXACT spelling of 'ListGuru') and in > the message BODY, include a line containing: UNSUB ORACLE-L > (or the name of mailing list you want to be removed from). You may > also send the HELP command for other information (like subscribing). > > > > > > > -- > Author: > INET: grant.g.holyoake_at_centrelink.gov.au > > Fat City Network Services -- (858) 538-5051 FAX: (858) 538-5051 > San Diego, California -- Public Internet access / Mailing Lists
> --------------------------------------------------------------------
> To REMOVE yourself from this mailing list, send an E-Mail message > to: ListGuru_at_fatcity.com (note EXACT spelling of 'ListGuru') and in > the message BODY, include a line containing: UNSUB ORACLE-L > (or the name of mailing list you want to be removed from). You may > also send the HELP command for other information (like subscribing). > > > > -- > Author: Molina, Gerardo > INET: Gerardo.Molina_at_schwab.com > > Fat City Network Services -- (858) 538-5051 FAX: (858) 538-5051 > San Diego, California -- Public Internet access / Mailing Lists
> --------------------------------------------------------------------
> To REMOVE yourself from this mailing list, send an E-Mail message > to: ListGuru_at_fatcity.com (note EXACT spelling of 'ListGuru') and in > the message BODY, include a line containing: UNSUB ORACLE-L > (or the name of mailing list you want to be removed from). You may > also send the HELP command for other information (like subscribing). > -- Author: Glenn Travis

  INET: Glenn.Travis_at_wcom.com
Fat City Network Services    -- (858) 538-5051  FAX: (858) 538-5051
San Diego, California        -- Public Internet access / Mailing Lists
--------------------------------------------------------------------
To REMOVE yourself from this mailing list, send an E-Mail message to: ListGuru_at_fatcity.com (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).  

--MIME MULTIPART BOUNDARY=.968453578:+'1

Content-Type: application/octet-stream
Content-Id: <1606142899-3_at_specialized.com>
Content-Transfer-Encoding: base64
Content-Disposition: attachment; filename="RFC822.txt"

UmVjZWl2ZWQ6IGZyb20gc210cC5zcGVjaWFsaXplZC5jb20gWzIwOC4xOTUuMTgyLjQ4XSBieSBz cGVjaWFsaXplZC5jb20gKGNjTWFpbCBMaW5rIHRvIFNNVFAgUjguNTIuMDIuMSkNCgk7IEZyaSwg MDggU2VwIDIwMDAgMTQ6MzE6NDkgLTA3MDANClJldHVybi1QYXRoOiByb290QGZhdGNpdHkuY3Rz LmNvbQ0KUmVjZWl2ZWQ6IGZyb20gbmV3c2ZlZWQuY3RzLmNvbSAoWzIwOS42OC4xOTIuMTk5XSkg YnkgMjA4LjE5NS4xODIuNDgNCiAgKE5vcnRvbiBBbnRpVmlydXMgZm9yIEludGVybmV0IEVtYWls IEdhdGV3YXlzIDEuMCkgOw0KICBGcmksIDA4IFNlcCAyMDAwIDIxOjMzOjM1IDAwMDAgKEdNVCkN ClJlY2VpdmVkOiBmcm9tIGZhdGNpdHkuVVVDUCAodXVjcEBsb2NhbGhvc3QpDQoJYnkgbmV3c2Zl ZWQuY3RzLmNvbSAoOC45LjMvOC45LjMpIHdpdGggVVVDUCBpZCBPQUEzMjczNjsNCglGcmksIDgg U2VwIDIwMDAgMTQ6Mjk6NDggLTA3MDAgKFBEVCkNClJlY2VpdmVkOiBieSBmYXRjaXR5LmNvbSAo MDQtTWF5LTIwMDAvdjEuMGYtYjY5L2JhYikgdmlhIFVVQ1AgaWQgMDAxRjc2OTY7IEZyaSwgMDgg U2VwIDIwMDAgMDg6MDU6MDggLTA4MDANCk1lc3NhZ2UtSUQ6IDxGMDAxLjAwMUY3Njk2LjIwMDAw OTA4MDgwNTA4QGZhdGNpdHkuY29tPg0KRGF0ZTogRnJpLCAwOCBTZXAgMjAwMCAwODowNTowOCAt MDgwMA0KVG86IE11bHRpcGxlIHJlY2lwaWVudHMgb2YgbGlzdCBPUkFDTEUtTCA8T1JBQ0xFLUxA ZmF0Y2l0eS5jb20+DQpYLUNvbW1lbnQ6IE9yYWNsZSBSREJNUyBDb21tdW5pdHkgRm9ydW0NClgt U2VuZGVyOiBHbGVubiBUcmF2aXMgPEdsZW5uLlRyYXZpc0B3Y29tLmNvbT4NClNlbmRlcjogcm9v dEBmYXRjaXR5LmNvbQ0KUmVwbHktVG86IE9SQUNMRS1MQGZhdGNpdHkuY29tDQpFcnJvcnMtVG86 IE1MLUVSUk9SU0BmYXRjaXR5LmNvbQ0KRnJvbTogR2xlbm4gVHJhdmlzIDxHbGVubi5UcmF2aXNA d2NvbS5jb20+DQpTdWJqZWN0OiBSRTogUmVbMl06IFN1biBCb3hlcyBDcmFzaGluZw0KT3JnYW5p emF0aW9uOiBGYXQgQ2l0eSBOZXR3b3JrIFNlcnZpY2VzLCBTYW4gRGllZ28sIENhbGlmb3JuaWEN ClgtTGlzdFNlcnZlcjogdjEuMGYsIGJ1aWxkIDY5OyBMaXN0R3VydSAoYykgMTk5Ni0yMDAwIEJy Received on Fri Sep 08 2000 - 17:52:33 CDT

Original text of this message

HOME | ASK QUESTION | ADD INFO | SEARCH | E-MAIL US