Oracle Redo log并行机制详解

当前位置：首页 - 数据库 - 正文

君子好学，自强不息！

2022-11-27 | 数据库 | 1818ip | 628°c

A⁺ A^-

Oracle的数据库日志称为Redo log，所有数据改变都记录Redo log，可以用于修复受损的数据库。Redo log 是用于恢复和一个高级特性的重要数据，一个redo条目包含了相应操作导致的数据库变化的所有信息，所有redo条目最终都要被写入redo文件中去。

Redo log buffer是为了避免Redo文件IO导致性能瓶颈而在sga中分配出的一块内存。一个redo条目首先在用户内存(PGA)中产生，然后由oracle服务进程拷贝到log buffer中，当满足一定条件时，再由LGWR进程写入redo文件。由于log buffer是一块“共享”内存，为了避免冲突，它是受到redo allocation latch保护的，每个服务进程需要先获取到该latch才能分配redo buffer。因此在高并发且数据修改频繁的oltp系统中，我们通常可以观察到redo allocation latch的等待。Redo写入redo buffer的整个过程如下：

在PGA中生产Redo Enrey -> 服务进程获取Redo Copy latch(存在多个—CPU_COUNT*2) -> 服务进程获取redo allocation latch(仅1个) -> 分配log buffer -> 释放redo allocation latch -> 将Redo Entry写入Log Buffer -> 释放Redo Copy latch;

shared strand

为了减少redo allocation latch等待，在oracle 9.2中，引入了log buffer的并行机制。其基本原理就是，将log buffer划分为多个小的buffer，这些小的buffer被成为strand(为了和之后出现的private strand区别，它们被称之为shared strand)。每一个strand受到一个单独redo allocation latch的保护。多个shared strand的出现，使原来序列化的redo buffer分配变成了并行的过程，从而减少了redo allocation latch等待。

shared strand的初始数据量是由参数log_parallelism控制的;在10g中，该参数成为隐含参数，并新增参数_log_parallelism_max控制shared strand的最大数量;_log_parallelism_dynamic则控制是否允许shared strand数量在_log_parallelism和_log_parallelism_max之间动态变化。

HELLODBA.COM>selectnam.ksppinm,val.KSPPSTVL,nam.ksppdesc 2fromsys.x$ksppinam, 3sys.x$ksppsvval 
4wherenam.indx=val.indx 5--ANDnam.ksppinmLIKE'_%' 
6ANDupper(nam.ksppinm)LIKE'%LOG_PARALLE%';  
KSPPINMKSPPSTVLKSPPDESC ------------------------------------------------------------------------------ 
_log_parallelism1Numberoflogbufferstrands _log_parallelism_max2Maximumnumberoflogbufferstrands 
_log_parallelism_dynamicTRUEEnabledynamicstrands

每一个shared strand的大小 = log_buffer/(shared strand数量)。strand信息可以由表x$kcrfstrand查到(包含shared strand和后面介绍的private strand，10g以后存在)。　

HELLODBA.COM>selectindx,strand_size_kcrfafromx$kcrfstrandwherelast_buf_kcrfa!='00';  INDXSTRAND_SIZE_KCRFA 
--------------------------- 03514368 
13514368  
HELLODBA.COM>showparameterlog_buffer  
NAMETYPEVALUE ----------------------------------------------------------------------------- 
log_bufferinteger7028736

关于shared strand的数量设置，16个cpu之内最大默认为2，当系统中存在redo allocation latch等待时，每增加16个cpu可以考虑增加1个strand，最大不应该超过8。并且_log_parallelism_max不允许大于cpu_count。

注意：在11g中，参数_log_parallelism被取消，shared strand数量由_log_parallelism_max、_log_parallelism_dynamic和cpu_count控制。

Private strand

为了进一步降低redo buffer冲突，在10g中引入了新的strand机制——Private strand。Private strand不是从log buffer中划分的，而是在shared pool中分配的一块内存空间。

HELLODBA.COM>select*fromV$sgastatwherenamelike'%strand%'; 　　POOLNAMEBYTES 
　　------------------------------------------------ 
　　sharedpoolprivatestrands2684928 
　　HELLODBA.COM>selectindx,strand_size_kcrfafromx$kcrfstrandwherelast_buf_kcrfa='00'; 
　　INDXSTRAND_SIZE_KCRFA 
　　--------------------------- 
　　266560 
　　366560 
　　466560 
　　566560 
　　666560 
　　766560 
　　866560 
　　...

Private strand的引入为Oracle的Redo/Undo机制带来很大的变化。每一个Private strand受到一个单独的redo allocation latch保护，每个Private strand作为“私有的”strand只会服务于一个活动事务。获取到了Private strand的用户事务不是在PGA中而是在Private strand生成Redo，当flush private strand或者commit时，Private strand被批量写入log文件中。如果新事务申请不到Private strand的redo allocation latch，则会继续遵循旧的redo buffer机制，申请写入shared strand中。事务是否使用Private strand，可以由x$ktcxb的字段ktcxbflg的新增的第13位鉴定：

HELLODBA.COM>selectdecode(bitand(ktcxbflg,4096),0,1,0)used_private_strand,count(*) 　　2fromx$ktcxb 
　　3wherebitand(ksspaflg,1)!=0 
　　4andbitand(ktcxbflg,2)!=0 
　　5groupbybitand(ktcxbflg,4096); 
　　USED_PRIVATE_STRANDCOUNT(*) 
　　----------------------------- 
　　110 
　　01

对于使用Private strand的事务，无需先申请Redo Copy Latch，也无需申请Shared Strand的redo allocation latch，而是flush或commit是批量写入磁盘，因此减少了Redo Copy Latch和redo allocation latch申请/释放次数、也减少了这些latch的等待，从而降低了CPU的负荷。过程如下：

事务开始 -> 申请Private strand的redo allocation latch (申请失败则申请Shared Strand的redo allocation latch) -> 在Private strand中生产Redo Enrey -> Flush/Commit -> 申请Redo Copy Latch -> 服务进程将Redo Entry批量写入Log File -> 释放Redo Copy Latch -> 释放Private strand的redo allocation latch。#p#

注意：对于未能获取到Private strand的redo allocation latch的事务，在事务结束前，即使已经有其它事务释放了Private strand，也不会再申请Private strand了。

每个Private strand的大小为65K。10g中，shared pool中的Private strands的大小就是活跃会话数乘以65K，而11g中，在shared pool中需要为每个Private strand额外分配4k的管理空间，即：数量*69k。

--10g: 　　SQL>select*fromV$sgastatwherenamelike'%strand%'; 
　　POOLNAMEBYTES 
　　------------------------------------------------ 
　　sharedpoolprivatestrands1198080 
　　HELLODBA.COM>selecttrunc(value*KSPPSTVL/100)*65*1024 
　　2from(selectvaluefromv$parameterwherename='transactions')a, 
　　3(selectval.KSPPSTVL 
　　4fromsys.x$ksppinam,sys.x$ksppsvval 
　　5wherenam.indx=val.indx 
　　6ANDnam.ksppinm='_log_private_parallelism_mul')b; 
　　TRUNC(VALUE*KSPPSTVL/100)*65*1024 
　　------------------------------------- 
　　1198080 
　　--11g: 
　　HELLODBA.COM>select*fromV$sgastatwherenamelike'%strand%'; 
　　POOLNAMEBYTES 
　　------------------------------------------------ 
　　sharedpoolprivatestrands706560 
　　HELLODBA.COM>selecttrunc(value*KSPPSTVL/100)*(65+4)*1024 
　　2from(selectvaluefromv$parameterwherename='transactions')a, 
　　3(selectval.KSPPSTVL 
　　4fromsys.x$ksppinam,sys.x$ksppsvval 
　　5wherenam.indx=val.indx 
　　6ANDnam.ksppinm='_log_private_parallelism_mul')b; 
　　TRUNC(VALUE*KSPPSTVL/100)*(65+4)*1024 
　　------------------------------------- 
　　706560

Private strand的数量受到2个方面的影响：logfile的大小和活跃事务数量。

参数_log_private_mul指定了使用多少logfile空间预分配给Private strand，默认为5。我们可以根据当前logfile的大小(要除去预分配给log buffer的空间)计算出这一约束条件下能够预分配多少个Private strand：

HELLODBA.COM>selectbytesfromv$logwherestatus='CURRENT'; 　　BYTES 
　　---------- 
　　52428800 
　　HELLODBA.COM>selecttrunc(((selectbytesfromv$logwherestatus='CURRENT')-(selectto_number(value)fromv$parameterwherename='log_buffer'))* 
　　2(selectto_number(val.KSPPSTVL) 
　　3fromsys.x$ksppinam,sys.x$ksppsvval 
　　4wherenam.indx=val.indx 
　　5ANDnam.ksppinm='_log_private_mul')/100/66560) 
　　6as"calculatedprivatestrands"
　　7fromdual; 
　　calculatedprivatestrands 
　　-------------------------- 
　　5 
　　HELLODBA.COM>selectcount(1)"actualprivatestrands"fromx$kcrfstrandwherelast_buf_kcrfa='00'; 
　　actualprivatestrands 
　　---------------------- 
　　5

当logfile切换后(和checkpoint一样，切换之前必须要将所有Private strand的内容flush到logfile中，因此我们在alert log中可能会发现日志切换信息之前会有这样的信息：”Private strand flush not complete”，这是可以被忽略的)，会重新根据切换后的logfile的大小计算对Private strand的限制：

HELLODBA.COM>altersystemswitchlogfile; 　　Systemaltered. 
　　HELLODBA.COM>selectbytesfromv$logwherestatus='CURRENT'; 
　　BYTES 
　　---------- 
　　104857600 
　　HELLODBA.COM>selecttrunc(((selectbytesfromv$logwherestatus='CURRENT')-(selectto_number(value)fromv$parameterwherename='log_buffer'))* 
　　2(selectto_number(val.KSPPSTVL) 
　　3fromsys.x$ksppinam,sys.x$ksppsvval 
　　4wherenam.indx=val.indx 
　　5ANDnam.ksppinm='_log_private_mul')/100/66560) 
　　6as"calculatedprivatestrands"
　　7fromdual; 
　　calculatedprivatestrands 
　　-------------------------- 
　　13 
　　HELLODBA.COM>selectcount(1)"actualprivatestrands"fromx$kcrfstrandwherelast_buf_kcrfa='00'; 
　　actualprivatestrands 
　　---------------------- 
　　13

参数_log_private_parallelism_mul用于推算活跃事务数量在最大事务数量中的百分比，默认为10。Private strand的数量不能大于活跃事务的数量。

HELLODBA.COM>showparametertransactions 　　NAMETYPEVALUE 
　　----------------------------------------------------------------------------- 
　　transactionsinteger222 
　　transactions_per_rollback_segmentinteger5 
　　HELLODBA.COM>selecttrunc((selectto_number(value)fromv$parameterwherename='transactions')* 
　　2(selectto_number(val.KSPPSTVL) 
　　3fromsys.x$ksppinam,sys.x$ksppsvval 
　　4wherenam.indx=val.indx 
　　5ANDnam.ksppinm='_log_private_parallelism_mul')/100) 
　　6as"calculatedprivatestrands"
　　7fromdual; 
　　calculatedprivatestrands 
　　-------------------------- 
　　22 
　　HELLODBA.COM>selectcount(1)"actualprivatestrands"fromx$kcrfstrandwherelast_buf_kcrfa='00'; 
　　actualprivatestrands 
　　---------------------- 
　　22