Re: Oracle 10g hangs intermittently waiting for I/O

From: Rajeev Prabhakar <rprabha01_at_gmail.com>
Date: Fri, 15 May 2009 18:08:45 -0400
Message-ID: <2ba656800905151508y1cfd10e0nb8e3fa505802ca20_at_mail.gmail.com>



Pawel,

So that I can understand your environment better, could you please answer these questions :

  1. Does the database server's /tmp filesystem have adequate free space ?
  2. Is the oracle installation configured with adequate swap space ( I know no sysadmin would like to have additional swap allocated), however, going with typical oracle specified swap settings have saved me a lots of headaches (while load/stress testing) (including node freeze/reboots).
  3. What is the size of your temp tablespace ? (Allocated and Free space during typical application usage window) ?
  4. Are you (or your application) utilizing GTTs (global temporary tables) ?
  5. Are you utilizing huge pages on your database server ?
  6. Is this dedicated OR shared server configuration ?
  7. Have you looked at the prospect of any soft block corruption for any table ?
  8. Any i/o bound concurrent processes (database specific or otherwise) running during the window when database freeze is being observed ? say - backups (database OR o.s.)
  9. Are there any specific kinds of processes that are first reported to be getting hung / experiencing slowness (e.g. reports/html output etc..) ?

-Rajeev

2009/5/15 Paweł Kotlarz <pkotla_at_go2.pl>:
> Rajeev,
>
> Oracle shows many sessions waiting for direct path read (temp). Tanel's
> waitprof reports single events taking many seconds though most of
> them are below 15ms.
>
> On the OS level vmstat shows normal reading for some time and then
> sessions in an uninterruptible sleep with no I/O taking place. iostat -x
> and asmiostat (ML 437996.1) show specific volumes. Just after the
> performance returns to normal these volumes show much greater queue
> length (iostat) or much greater average read time (asmiostat).
>
> I ran strace on a process servicing the session on which I used waitprof
> earlier. It stops on a read call.
>
> Currently I only know that the sysadmins found nothing in Linux logs and on
> a 'system management page'. Unfortunately it is difficult to obtain
> more information from them unless I tell what exactly to check...
>
> Thanks,
> Pawel

--
http://www.freelists.org/webpage/oracle-l
Received on Fri May 15 2009 - 17:08:45 CDT

Original text of this message