Re: Direct read stopped suddenly

From: Pap <oracle.developer35_at_gmail.com>
Date: Wed, 25 May 2022 01:43:25 +0530
Message-ID: <CAEjw_fg5RbmX5d4K1gMcxKmG0rkkv+bN8CcRypp=W=bpV8T+=A_at_mail.gmail.com>

Thank you Sayan. This query has been running fine since the last 5-6months and always using cell smart scan , as i can see from 'offload returned bytes' column value >0 in dba_hist_sqlstat for this query and thus the elapsed time was alway <1second and never encountered such behaviour. Do you mean to say on DB node-1 the object was fully cached whereas on node-2 and 3 and 4 it wasn't, and thus when the query was executed from node-2/3/4 it was going for smart scan whereas when it was executed from node-1 it was going for buffered read? And should we then flush it from the buffer cache but it can then happen again , so how should we handle this odd behaviour?

On Wed, May 25, 2022 at 1:28 AM Sayan Malakshinov <xt.and.r_at_gmail.com> wrote:

> Hi Pap,
>
> ntcache is a number of blocks of that object in the buffer cache. Since
> there are almost all blocks in the cache, oracle decides to do not use
> direct reads.
>
>
> Best regards,
> Sayan Malakshinov
> Oracle performance tuning expert
> Oracle Database Developer Choice Award winner
> Oracle ACE
> http://orasql.org
>
> вт, 24 мая 2022 г., 22:29 Pap <oracle.developer35_at_gmail.com>:
>
>> Hello Listers, We have one legacy system on version 11.2.0.4 and it is
>> planned to be migrated to 19C. It's a four node database. We have a select
>> query suddenly stopped doing smartscan from one node and the response time
>> goes from <1 seconds to >5seconds. And the surprising thing is , when we
>> manually trigger the sql , it's doing a smart scan when triggered from
>> other nodes but from one node only it's not doing that and the response
>> time going high for that specific node. It seems that one query showing
>> this behaviour and other running fine.Ideally my understanding was that it
>> should not vary based on DB node as because the storage nodes are common
>> across all the four DB nodes. So I'm wondering , why is it happening?
>>
>> Collected the 10358 trace and below is the trace. It does show a
>> difference in the value of "ntcache" and "ntdist" for both the nodes and
>> that is possibly driving the direct read/smartscan decision. So I wanted to
>> understand what is that and why that value is different in both DB nodes
>> and how to fix this odd behaviour?
>>
>> query:-
>> SELECT c1, c8, c2, c3, c4, c5, c6, c7 from TAB1 where c1 = :1
>>
>> Plan hash value: 2108848690
>>
>> --------------------------------------------------------------------------------------------------------
>> | Id | Operation | Name | Rows |
>> Bytes | Cost (%CPU)| Time |
>>
>> --------------------------------------------------------------------------------------------------------
>> | 0 | SELECT STATEMENT | | |
>> | 1583 (100)| |
>> | 1 | TABLE ACCESS STORAGE FULL| TAB1 | 9 | 711
>> | 1583 (1)| 00:00:19 |
>>
>> --------------------------------------------------------------------------------------------------------
>> Query Block Name / Object Alias (identified by operation id):
>> -------------------------------------------------------------
>> 1 - SEL$1 / TAB1_at_SEL$1
>>
>>
>> Trace from DB node which is doing smartscan:-
>>
>> NSMTIO: qertbFetch:[MTT < OBJECT_SIZE < VLOT]: Checking cost to read from
>> caches(local/remote) and checking storage reduction factors (OLTP/EHCC Comp)
>> NSMTIO: kcbdpc:DirectRead: tsn: 29, objd: 1607047, objn: 1607047
>> ckpt: 1, nblks: 757301, ntcache: 10, ntdist:30
>> Direct Path for tsn 29 objd 1607047 objn 1607047
>> Direct Path 1 ckpt 1, nblks 757301 ntcache 10 ntdist 30
>> Direct Path mndb 0 tdiob 12 txiob 0 tciob 66
>> Direct path diomrc 128 dios 2
>> NSMTIO: Additional Info: VLOT=65089150
>> Object# = 1607047, Object_Size = 757301 blocks
>> SqlId = 64d0vkmtjgq2x, plan_hash_value = 2108848690, Partition# = 0
>> KCBO: prepared ksr msg for cid=131228, tsn=29, obj=1607047, 0
>>
>>
>> Trace from DB node which is not doing smartscan:-
>>
>> NSMTIO: qertbFetch:[MTT < OBJECT_SIZE < VLOT]: Checking cost to read from
>> caches(local/remote) and checking storage reduction factors (OLTP/EHCC Comp)
>> NSMTIO: kcbdpc:NoDirectRead:[CACHE_READ]: tsn: 29, objd: 1607047, objn:
>> 1607047
>> ckpt: 0, nblks: 757301, ntcache: 756116, ntdist:1185
>> Direct Path for tsn 29 objd 1607047 objn 1607047
>> Direct Path 0 ckpt 0, nblks 757301 ntcache 756116 ntdist 1185
>> Direct Path mndb 18 tdiob 10 txiob 0 tciob 70
>> Direct path diomrc 128 dios 2
>> NSMTIO: Additional Info: VLOT=65089150
>> Object# = 1607047, Object_Size = 757301 blocks
>> SqlId = 64d0vkmtjgq2x, plan_hash_value = 2108848690, Partition# = 0
>>
>

--
http://www.freelists.org/webpage/oracle-l

Received on Tue May 24 2022 - 22:13:25 CEST

This message: [ Message body ]
Next message: Mladen Gogala: "Re: RMAN Restore Error"
In reply to Sayan Malakshinov: "Re: Direct read stopped suddenly"
Next in thread: Sayan Malakshinov: "Re: Direct read stopped suddenly"
Reply: Sayan Malakshinov: "Re: Direct read stopped suddenly"
Reply: Pap: "Re: Direct read stopped suddenly"
Reply: Pap: "Re: Direct read stopped suddenly"

Contemporary messages sorted: [ by date ] [ by thread ] [ by subject ] [ by author ]

Original text of this message