ASM Translation Series, Part 32: A Homemade Data Extraction Tool
Original author: Bane Radulovic
Translator: 邱大龙
Reviewer: 魏兴华
Jointly produced by the DBGeeK community
Find block in ASM
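In the article "Where is my data" from this series, I showed how to locate and extract an Oracle block from an ASM disk. To make that less complicated, I wrote a Perl script, find_block.pl, that simplifies the whole operation: given the name of the data file and the block to extract, the script prints the command(s) that extract the block from the ASM disk group.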
find_block.pl
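find_block.pl is a Perl script that builds the dd or kfed commands needed to extract a block from an ASM disk. It works with ASM on Linux and Unix, in both single instance and RAC environments (but not with Flex ASM).
The script must be run as the Grid Infrastructure software owner, using the perl binary from the Oracle software home. In a cluster, the script can be run on any node. Before running it, check the ASM environment and make sure that ORACLE_SID, ORACLE_HOME and LD_LIBRARY_PATH are set correctly. For ASM versions 10g and 11gR1, the PERL5LIB environment variable must be set as well: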
export PERL5LIB=$ORACLE_HOME/perl/lib/5.8.3:$ORACLE_HOME/perl/lib/site_perl
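The environment check described above can be sketched as a small shell function. This is only an illustration: check_asm_env is a hypothetical helper, not part of find_block.pl, and it assumes the ASM SID begins with +ASM (the script itself only checks that the SID contains +ASM).

```shell
# Hypothetical helper: verify the variables the article says must be set.
check_asm_env() {
  for var in ORACLE_SID ORACLE_HOME LD_LIBRARY_PATH; do
    # Indirect lookup of the variable named in $var (POSIX sh compatible)
    eval "value=\${$var:-}"
    if [ -z "$value" ]; then
      echo "Error: $var is not set" >&2
      return 1
    fi
  done
  # Assumption: an ASM SID starts with +ASM (e.g. +ASM, +ASM1)
  case "$ORACLE_SID" in
    +ASM*) return 0 ;;
    *) echo "Error: ORACLE_SID is not an ASM SID" >&2; return 1 ;;
  esac
}

check_asm_env || echo "Fix the ASM environment before running find_block.pl"
```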
Run the script as follows:
$ORACLE_HOME/perl/bin/perl find_block.pl filename block
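Here, filename is the name of the file that contains the block to extract. For a data file, the name can be taken from the NAME column of V$DATAFILE. The block argument is the number of the block to extract from ASM; this is the database block number, not the ASM block number.
The output of the script looks like this: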
dd if=[ASM disk path] ... of=block_N.dd
In Exadata, it looks like this:
kfed read dev=[ASM disk path] ... > block_N.txt
For a data file with external redundancy, the script produces a single command. For a normal redundancy file it produces two commands, and for a high redundancy file it produces three.
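As a side note on the filename argument described above, the script in the appendix splits it into a disk group name and an ASM file name with substr, index and rindex. A rough shell equivalent, using hypothetical helper names diskgroup_of and asmfile_of, might look like this:

```shell
# Hypothetical helpers mirroring the filename handling in find_block.pl.

# Disk group name: the text between the leading '+' and the first '/'
diskgroup_of() {
  name=${1#+}
  printf '%s\n' "${name%%/*}"
}

# ASM file name: everything after the last '/'
asmfile_of() {
  printf '%s\n' "${1##*/}"
}

diskgroup_of "+DATA/cat/datafile/users.259.783204313"
asmfile_of "+DATA/cat/datafile/users.259.783204313"
```

With the file name used in the first example below, these print DATA and users.259.783204313 respectively.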
Example with ASM version 10.2.0.1
The first example is with a single instance ASM version 10.2.0.1. First I create a table in the database and insert some data.
[oracle@cat10g ~]$ sqlplus / as sysdba
SQL*Plus: Release 10.2.0.1.0 - Production on [date]
SQL> create table TAB1 (name varchar2(16)) tablespace USERS;
Table created.
SQL> insert into TAB1 values ('CAT');
1 row created.
SQL> insert into TAB1 values ('DOG');
1 row created.
SQL> commit;
Commit complete.
SQL> select ROWID, NAME from TAB1;
ROWID NAME
------------------ --------------------------------
AAANE+AAEAAAAGHAAA CAT
AAANE+AAEAAAAGHAAB DOG
SQL> select DBMS_ROWID.ROWID_BLOCK_NUMBER('AAANE+AAEAAAAGHAAA') "Block" from dual;
Block
---------
391
SQL> select t.name "Tablespace", f.name "Datafile"
from v$tablespace t, v$datafile f
where t.ts#=f.ts# and t.name='USERS';
Tablespace Datafile
------------ --------------------------------------
USERS +DATA/cat/datafile/users.259.783204313
SQL>
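We have now created two rows and identified the data file and the block number that hold them. Switch to the ASM environment, make sure the PERL5LIB environment variable is set correctly, and run the script: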
$ export PERL5LIB=$ORACLE_HOME/perl/lib/5.8.3:$ORACLE_HOME/perl/lib/site_perl
$ $ORACLE_HOME/perl/bin/perl find_block.pl +DATA/cat/datafile/users.259.783204313 391
dd if=/dev/oracleasm/disks/ASMDISK01 bs=8192 count=1 skip=100359 of=block_391.dd
$
find_block.pl produced the expected output. Because this is an external redundancy disk group, the script printed a single dd command. We copy that command and run it:
$ dd if=/dev/oracleasm/disks/ASMDISK01 bs=8192 count=1 skip=100359 of=block_391.dd
$
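The command writes the contents of the block to the file block_391.dd. Using the operating system od utility, we can see the data that was inserted into the table: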
$ od -c block_391.dd | tail -3
0017740 , 001
0017760 001 003 D O G , 001 001 003 C A T 001 006 u G
0020000
$
Very good, that is exactly the data we inserted!
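The skip value in the dd command above can be cross-checked against the formula the appendix script uses: skip = AU number * blocks per AU + block number mod blocks per AU. The sketch below makes two assumptions that the article does not state: an AU size of 1 MB (1048576 bytes), and AU number 784, which is simply the value that follows from skip=100359 under that AU size. The function names dd_skip and extent_number are hypothetical.

```shell
# Recompute the dd skip value the way find_block.pl does.
# Arguments: AU size, block size, AU number, database block number
dd_skip() {
  blocks_per_au=$(( $1 / $2 ))
  echo $(( $3 * blocks_per_au + $4 % blocks_per_au ))
}

# Virtual extent number that holds a given database block.
# Arguments: AU size, block size, database block number
extent_number() {
  echo $(( $3 / ($1 / $2) ))
}

# Assumed values: 1 MB AU, AU number 784 (not shown in the article)
dd_skip 1048576 8192 784 391
extent_number 1048576 8192 391
```

Under these assumptions there are 128 blocks per AU, block 391 sits in virtual extent 3 at offset 7 within its AU, and the computed skip is 784 * 128 + 7 = 100359, which matches the command printed by the script.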
Example with ASM version 12.1.0.1 in Exadata
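The space a file takes up in ASM depends on two factors: the actual size of the file and the redundancy of the disk group.
In an external redundancy disk group, the space used is: the file size + 1 AU (file header) + 1 extra AU (if the file is larger than 60 AUs).
In a normal redundancy disk group, the space used is: twice the file size + 2 AUs (file header) + 3 extra AUs (if the file is larger than 60 AUs).
In a high redundancy disk group, the space used is: three times the file size + 3 AUs (file header) + 3 extra AUs (if the file is larger than 60 AUs).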
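The three rules above can be written down as a small calculation. This is only a sketch of the arithmetic as stated in the text, with sizes expressed in allocation units; asm_space_aus is a hypothetical function name.

```shell
# Space used by a file, in AUs, following the three rules above.
# Arguments: redundancy (external|normal|high), file size in AUs
asm_space_aus() {
  case "$1" in
    external) copies=1; extra=1 ;;
    normal)   copies=2; extra=3 ;;
    high)     copies=3; extra=3 ;;
    *) echo "unknown redundancy: $1" >&2; return 1 ;;
  esac
  # Mirrored file data plus one file header AU per copy
  total=$(( copies * $2 + copies ))
  # Extra AUs apply only when the file is larger than 60 AUs
  if [ "$2" -gt 60 ]; then
    total=$(( total + extra ))
  fi
  echo "$total"
}

asm_space_aus external 100
asm_space_aus normal 100
asm_space_aus high 10
```

For example, a 100 AU file would take 102 AUs with external redundancy and 205 AUs with normal redundancy, while a 10 AU file with high redundancy would take 33 AUs.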
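In Exadata we cannot use the dd command to extract a block, because the ASM disks are not visible to the database server. To get the block we can use the kfed tool instead, so find_block.pl adapts accordingly: in an Exadata environment it builds kfed commands to extract the block from the ASM disks.
Let's look at an example with ASM version 12.1.0.1, in a two node RAC on Exadata, where the data file belongs to a pluggable database (PDB).
As in the previous example, I first create a table and insert some data: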
$ sqlplus / as sysdba
SQL*Plus: Release 12.1.0.1.0 Production on [date]
SQL> alter pluggable database BR_PDB open;
Pluggable database altered.
SQL> show pdbs
CON_ID CON_NAME OPEN MODE RESTRICTED
------ -------- ----------- ----------
2 PDB$SEED READ ONLY NO
...
5 BR_PDB READ WRITE NO
SQL>
$ sqlplus bane/welcome1@BR_PDB
SQL*Plus: Release 12.1.0.1.0 Production on [date]
SQL> create table TAB1 (n number, name varchar2(16)) tablespace USERS;
Table created.
SQL> insert into TAB1 values (1, 'CAT');
1 row created.
SQL> insert into TAB1 values (2, 'DOG');
1 row created.
SQL> commit;
Commit complete.
SQL> select t.name "Tablespace", f.name "Datafile"
from v$tablespace t, v$datafile f
where t.ts#=f.ts# and t.name='USERS';
Tablespace Datafile
---------- ---------------------------------------------
USERS +DATA/CDB/054.../DATAFILE/users.588.860861901
SQL> select ROWID, NAME from TAB1;
ROWID NAME
------------------ ----
AAAWYEABfAAAACDAAA CAT
AAAWYEABfAAAACDAAB DOG
SQL> select DBMS_ROWID.ROWID_BLOCK_NUMBER('AAAWYEABfAAAACDAAA') "Block number" from dual;
Block number
------------
131
SQL>
Again we have the file name and the block number for the inserted data. Switch to the ASM environment and run the Perl script:
$ $ORACLE_HOME/perl/bin/perl find_block.pl +DATA/CDB/0548068A10AB14DEE053E273BB0A46D1/DATAFILE/users.588.860861901 131
kfed read dev=o/192.168.1.9/DATA_CD_03_exacelmel05 ausz=4194304 aunum=16212 blksz=8192 blknum=131 | grep -iv ^kf > block_131.txt
kfed read dev=o/192.168.1.11/DATA_CD_09_exacelmel07 ausz=4194304 aunum=16267 blksz=8192 blknum=131 | grep -iv ^kf > block_131.txt
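Notice that this time find_block.pl produced two commands, which tells us that the file is in a normal redundancy disk group. Both commands redirect to the same file, block_131.txt, so we run just one of them: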
$ kfed read dev=o/192.168.1.9/DATA_CD_03_exacelmel05 ausz=4194304 aunum=16212 blksz=8192 blknum=131 | grep -iv ^kf > block_131.txt
$
The contents of the block are now in the text file block_131.txt, where we can see the DOG and CAT rows inserted above:
$ more block_131.txt
...
FD5106080 00000000 00000000 ... [................]
Repeat 501 times
FD5107FE0 00000000 00000000 ... [........,......D]
FD5107FF0 012C474F 02C10202 ... [OG,......CAT..,-]
$
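A note on reading the dump above: the hex column and the character column do not line up byte for byte. Judging by this output, kfed prints each 32-bit word as a number on a little-endian host, while the character column shows the bytes in their on-disk order. The sketch below, with a hypothetical helper decode_word, reverses the bytes of one hex word and prints the printable characters, using a dot for everything else.

```shell
# Decode one 8-digit hex word from a kfed dump into its 4 characters,
# assuming the word was printed in little-endian order.
decode_word() {
  word=$1
  out=""
  # Walk the byte pairs from the last to the first
  for start in 7 5 3 1; do
    byte=$(printf '%s' "$word" | cut -c"$start"-"$((start + 1))")
    dec=$(printf '%d' "0x$byte")
    if [ "$dec" -ge 32 ] && [ "$dec" -le 126 ]; then
      # Printable ASCII: emit the character via an octal escape
      out="$out$(printf '%b' "\\0$(printf '%o' "$dec")")"
    else
      out="$out."
    fi
  done
  printf '%s\n' "$out"
}

decode_word 012C474F
decode_word 02C10202
```

The first word decodes to "OG,." and the second to "....", which agrees with the first eight characters of the character column on the last dump line. The leading D of DOG is the final byte of the previous line.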
Find any block
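find_block.pl can extract a block from any file in an ASM disk group, not just from data files. Just for fun, I ran the script against the control file and an arbitrary block in it: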
$ $ORACLE_HOME/perl/bin/perl find_block.pl +DATA/CDB/CONTROLFILE/current.289.843047837 5
kfed read dev=o/192.168.1.9/DATA_CD_10_exacelmel05 ausz=4194304 aunum=73 blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt
kfed read dev=o/192.168.1.11/DATA_CD_01_exacelmel07 ausz=4194304 aunum=66 blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt
kfed read dev=o/192.168.1.10/DATA_CD_04_exacelmel06 ausz=4194304 aunum=78 blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt
$
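Notice that the script correctly worked out the control file block size (16K, unlike the 8K data block size) and that it produced three different commands. Although the disk group DATA is normal redundancy, the control file is high redundancy, that is, triple mirrored. In this respect control files behave like the ASM metadata files.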
Conclusion
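find_block.pl generates dd or kfed commands that extract a block from a file in an ASM disk group. In most cases we want a block from a data file, but the script is not limited to data files: it can also extract blocks from control files, redo logs and any other ASM file.
If the file is external redundancy, the script prints a single command, which extracts the block directly from the ASM disk.
If the file is normal redundancy, the script prints two commands, which extract the two copies of the block from different disks. This can be useful, for example, when the alert log reports a corrupt data block that ASM cannot repair; the block can then be repaired from its mirror copy.
If the file is high redundancy, the script prints three commands.
Finally, to use the script you do not need to know the file redundancy, the block size or any other file attribute. All you need is the file name and the block number.
Appendix: the script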
#!$ORACLE_HOME/perl/bin/perl -w
#
# The find_block.pl constructs the command(s) to extract a block from ASM.
# For a complete info about this script see ASM Support Guy blog post:
# http://asmsupportguy.blogspot.com/2014/10/find-block-in-asm.html
#
# Copyright (C) 2014 Bane Radulovic
#
# This program is free software: you can redistribute it and/or modify it under
# the terms of the GNU General Public License as published by the Free Software
# Foundation, either version 3 of the License, or any later version.
# This program is distributed in the hope that it will be useful, but WITHOUT
# ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
# FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details
# at http://www.gnu.org/licenses/.
#
# Version 1.00, Oct 2014
# The initial release.
#
# Version 1.01, Oct 2014
# Minor improvements.
#
# Version 1.02, Oct 2014
# Added support for AFD disks.
#
# Version 1.03, Nov 2014
# Added sanity checks, e.g. if the requested block is reasonable,
# if the specified filename is valid, etc.
#
# Version 1.04, Nov 2014
# Improved the check for Exadata storage cell based disk.
#
use strict;
use DBI;
use DBD::Oracle qw(:ora_session_modes);
use POSIX;
# Handle the version query
die "find_block.pl version 1.04\n"
    if ( @ARGV && $ARGV[0] =~ /^-v/i );
# Check the number of input arguments
die "Usage: \$ORACLE_HOME/perl/bin/perl find_block.pl filename block\n"
    unless ( @ARGV == 2 );
# Get the filename from the first input argument
my $filename = shift @ARGV;
# Check if the filename makes sense.
# The 'minimum' filename is +DGNAME/filename,
# i.e. it has to begin with the '+' followed by a disk group name,
# followed by at least one '/', followed by directory or file name...
die "Error: The $filename is not a valid file name.\n"
    unless ( $filename =~ /^\+\w/ && $filename =~ /\/\w/ );
# Get the disk group name out of the user specified filename
my $diskgroup_name = substr($filename, 1, index($filename, "/") -1 );
# Get the ASM file name out of the user specified filename
my $asmfile = substr($filename, rindex($filename, "/") +1 );
# Get the block number from the second input argument
my $block_number = shift @ARGV;
# Check if the block number is an integer
die "Usage: \$ORACLE_HOME/perl/bin/perl find_block.pl filename block\n"
    unless ( $block_number =~ /^\d+$/ );
# Check if the ASM SID is set
die "Error: ASM SID not set.\n"
    unless ( $ENV{ORACLE_SID} =~ /\+ASM/ );
# Connect to the (local) ASM instance
my $dbh = DBI->connect('dbi:Oracle:', "", "", { ora_session_mode => ORA_SYSDBA })
    or die "$DBI::errstr\n";
# Check if the disk group exists and if it is mounted
my $group_number = &asm_diskgroup("group_number", $diskgroup_name);
die "Error: Disk group $diskgroup_name not mounted or does not exist.\n"
    unless ( $group_number );
# Check if the user specified file exists in the disk group
my $file_number = &asm_alias("file_number", $asmfile, $group_number);
die "Error: File $asmfile does not exist in disk group $diskgroup_name.\n"
    unless ( $file_number );
# Get the block size for the file
my $block_size = &asm_file("block_size", $group_number, $file_number);
# Get the number of blocks in the file
my $file_blocks = &asm_file("blocks", $group_number, $file_number);
# Check if the user specified block number makes sense
die "Error: Block range for file $asmfile is: 0 - $file_blocks.\n"
    unless ( $block_number >= 0 && $block_number <= $file_blocks );
# Get the disk group AU size
my $au_size = &asm_diskgroup("allocation_unit_size", $diskgroup_name);
# Work out the blocks per AU and the virtual extent number
my $blocks_per_au = $au_size/$block_size;
my $xnum_kffxp = floor($block_number/$blocks_per_au);
# Get the disk and AU numbers into the @disk_au array
my @disk_au = &asm_kffxp($file_number, $group_number, $xnum_kffxp);
die "Could not get any disk and AU numbers for file $asmfile.\n"
    unless ( @disk_au );
# Get the disk path(s) and generate the block extract command(s)
while ( @disk_au ) {
    # Do not assume anything
    my $storage_cell = "FALSE";
    # Get the disk number from @disk_au
    my $disk_number = shift @disk_au;
    # Get the AU number from @disk_au
    my $au_number = shift @disk_au;
    # Get the path for that disk number
    my $path = &asm_disk("path", $group_number, $disk_number);
    # If there is no path move to the next disk
    if ( ! $path ) {
        next;
    }
    # If ASMLIB is in use, the path will return ORCL:DISKNAME.
    # Set the path to /dev/oracleasm/disks/DISKNAME
    elsif ( $path =~ /ORCL:(.*)/ ) {
        $path = "/dev/oracleasm/disks/".$1;
    }
    # If ASM Filter Driver (AFD) is in use, the path will return AFD:DISKNAME.
    # Get the actual path from /dev/oracleafd/disks/DISKNAME
    elsif ( $path =~ /AFD:(.*)/ ) {
        if ( ! open AFDDISK, "/dev/oracleafd/disks/".$1 ) { next }
        else { chomp($path = <AFDDISK>) }
    }
    # For Exadata storage cell based disk, the path will start with o/IP address
    elsif ( $path =~ /^o\/\d{1,3}\./ ) {
        $storage_cell = "TRUE";
    }
    if ( $storage_cell eq "TRUE" ) {
        # Construct the kfed command for Exadata storage cell based disk
        # dev=$path ausz=$au_size aunum=$au_number blksz=$block_size blknum=$block_number
        # The grep filters out the kfed stuff
        print "kfed read dev=$path ausz=$au_size aunum=$au_number blksz=$block_size blknum=$block_number | grep -iv ^kf > block_$block_number.txt\n";
    }
    else {
        # Construct the dd command
        # if=$path bs=$block_size count=1 skip=$skip of=block_$block_number.dd
        my $skip=$au_number*$blocks_per_au + $block_number%$blocks_per_au;
        print "dd if=$path bs=$block_size count=1 skip=$skip of=block_$block_number.dd\n";
    }
}
# We are done. Disconnect from the (local) ASM instance
$dbh->disconnect;
# Subs
# Get a column from v$asm_file for a given group number and file number
sub asm_file {
    my $col = shift @_;
    my $group_number = shift @_;
    my $file_number = shift @_;
    my $sql = $dbh->prepare("select $col from v\$asm_file where group_number=$group_number and file_number=$file_number");
    $sql->execute;
    my $col_value = $sql->fetchrow_array;
    $sql->finish;
    return $col_value;
}
# Get a column from v$asm_alias for a given (file) name and group number
sub asm_alias {
    my $col = shift @_;
    my $name = shift @_;
    my $group_number = shift @_;
    my $sql = $dbh->prepare("select $col from v\$asm_alias where lower(name)=lower('$name') and group_number=$group_number");
    $sql->execute;
    my $col_value = $sql->fetchrow_array;
    $sql->finish;
    return $col_value;
}
# Get a column from v$asm_diskgroup for a given disk group name
sub asm_diskgroup {
    my $col = shift @_;
    my $name = shift @_;
    my $sql = $dbh->prepare("select $col from v\$asm_diskgroup where name=upper('$name')");
    $sql->execute;
    my $col_value = $sql->fetchrow_array;
    $sql->finish;
    return $col_value;
}
# Get a column from v$asm_disk for a given group number and disk number
sub asm_disk {
    my $col = shift @_;
    my $group_number = shift @_;
    my $disk_number = shift @_;
    my $sql = $dbh->prepare("select $col from v\$asm_disk where group_number=$group_number and disk_number=$disk_number");
    $sql->execute;
    my $col_value = $sql->fetchrow_array;
    $sql->finish;
    return $col_value;
}
# Get the disk and AU numbers from x$kffxp for a given virtual extent number.
# This will return one row for an external redundancy file,
# two rows for a normal redundancy and three rows for a high redundancy.
# Well, it will return an array with disk and AU pairs, not rows.
sub asm_kffxp {
    my $file_number = shift @_;
    my $group_number = shift @_;
    my $xnum = shift @_;
    # The @disk_au array to hold the disk number, AU number rows
    my @disk_au;
    my $sql = $dbh->prepare("select disk_kffxp, au_kffxp from x\$kffxp where number_kffxp=$file_number and group_kffxp=$group_number and xnum_kffxp=$xnum");
    $sql->execute;
    # Expecting one disk number and one AU number per row
    while ( my @row = $sql->fetchrow_array) {
        # Add each (element of the) row to @disk_au array
        foreach ( @row ) { push @disk_au, $_ }
    }
    $sql->finish;
    return @disk_au;
}