HBase shell的使用记录
阅读原文时间:2023年07月09日阅读:2

该命令列出hbase中所有的表

hbase(main):007:0* list
TABLE
SYSTEM:CATALOG
SYSTEM:FUNCTION
SYSTEM:MUTEX
SYSTEM:SEQUENCE
SYSTEM:STATS
iot_flow_cdr_201805
iot_msg_cdr_201805
iot_voice_cdr_201805
8 row(s) in 0.0200 seconds
=> ["SYSTEM:CATALOG", "SYSTEM:FUNCTION", "SYSTEM:MUTEX", "SYSTEM:SEQUENCE", "SYSTEM:STATS", "iot_flow_cdr_201805", "iot_msg_cdr_201805", "iot_voice_cdr_201805"]

查看表的详细信息

hbase(main):012:0* desc 'iot_flow_cdr_201805'
Table iot_flow_cdr_201805 is ENABLED
iot_flow_cdr_201805, {TABLE_ATTRIBUTES => {coprocessor$1 => '|org.apache.phoenix.coprocessor.ScanRegionObserver|805306366|', coprocessor
$2 => '|org.apache.phoenix.coprocessor.UngroupedAggregateRegionObserver|805306366|', coprocessor$3 => '|org.apache.phoenix.coprocessor.G
roupedAggregateRegionObserver|805306366|', coprocessor$4 => '|org.apache.phoenix.coprocessor.ServerCachingEndpointImpl|805306366|'}
COLUMN FAMILIES DESCRIPTION
{NAME => 'info', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0', VERSIONS => '1', COMPRESSION => 'SNAPPY'
, TTL => 'FOREVER', MIN_VERSIONS => '0', KEEP_DELETED_CELLS => 'FALSE', BLOCKSIZE => '65536', IN_MEMORY => 'false', BLOCKCACHE => 'true'
}
1 row(s) in 0.0780 seconds

其中:NANE指列簇名,VERSIONS指版本号,COMPRESSION指压缩算法,TTL指生存时间(过期时间)

create创建表,help 'create'查看该命令的用法

hbase(main):015:0* help 'create'
Creates a table. Pass a table name, and a set of column family
specifications (at least one), and, optionally, table configuration.
Column specification can be a simple string (name), or a dictionary
(dictionaries are described below in main help output), necessarily
including NAME attribute.
Examples:

Create a table with namespace=ns1 and table qualifier=t1
  hbase> create 'ns1:t1', {NAME => 'f1', VERSIONS => 5}

Create a table with namespace=default and table qualifier=t1
  hbase> create 't1', {NAME => 'f1'}, {NAME => 'f2'}, {NAME => 'f3'}
  hbase> # The above in shorthand would be the following:
  hbase> create 't1', 'f1', 'f2', 'f3'
  hbase> create 't1', {NAME => 'f1', VERSIONS => 1, TTL => 2592000, BLOCKCACHE => true}
  hbase> create 't1', {NAME => 'f1', CONFIGURATION => {'hbase.hstore.blockingStoreFiles' => '10'}}

部分参数说明:

NAME:指定列簇名

VERSION:指定版本号(默认为1)

TTL:指定表超时时间,超过该时间的row会被删除

COMPRESSION:指定压缩方式,hbase支持的压缩算法有gzip、lzo、snappy/Zippy

这里我创建一个test表,包含一个列簇info,压缩方式为snappy

hbase(main):020:0* create 'test',{NAME=>'info','COMPRESSION'=>'snappy'}
0 row(s) in 2.3540 seconds

=> Hbase::Table - test
hbase(main):021:0> desc 'test'
Table test is ENABLED
test
COLUMN FAMILIES DESCRIPTION
{NAME => 'info', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0', VERSIONS => '1', COMPRESSION => 'SNAPPY'
, MIN_VERSIONS => '0', TTL => 'FOREVER', KEEP_DELETED_CELLS => 'FALSE', BLOCKSIZE => '65536', IN_MEMORY => 'false', BLOCKCACHE => 'true'
}
1 row(s) in 0.0480 seconds

使用help ‘put’命令查看帮助

hbase(main):025:0* help 'put'
Put a cell 'value' at specified table/row/column and optionally
timestamp coordinates.  To put a cell value into table 'ns1:t1' or 't1'
at row 'r1' under column 'c1' marked with the time 'ts1', do:

  hbase> put 'ns1:t1', 'r1', 'c1', 'value'
  hbase> put 't1', 'r1', 'c1', 'value'
  hbase> put 't1', 'r1', 'c1', 'value', ts1
  hbase> put 't1', 'r1', 'c1', 'value', {ATTRIBUTES=>{'mykey'=>'myvalue'}}
  hbase> put 't1', 'r1', 'c1', 'value', ts1, {ATTRIBUTES=>{'mykey'=>'myvalue'}}
  hbase> put 't1', 'r1', 'c1', 'value', ts1, {VISIBILITY=>'PRIVATE|SECRET'}

参数说明:

t1:指定表名

r1:指定rowkey值

c1:指定列名,注意指定列名时,要带上列簇,格式为 列簇名:列名

value:指定列值

现在,我往test表中插入几条数据

hbase(main):028:0* put 'test','1','info:name','tom'
0 row(s) in 0.4460 seconds

hbase(main):029:0> put 'test','1','info:age','18'
0 row(s) in 0.0140 seconds

help 'get'查看该命令的用法

hbase(main):031:0* help 'get'
Get row or cell contents; pass table name, row, and optionally
a dictionary of column(s), timestamp, timerange and versions. Examples:

  hbase> get 'ns1:t1', 'r1'
  hbase> get 't1', 'r1'
  hbase> get 't1', 'r1', {TIMERANGE => [ts1, ts2]}
  hbase> get 't1', 'r1', {COLUMN => 'c1'}
  hbase> get 't1', 'r1', {COLUMN => ['c1', 'c2', 'c3']}
  hbase> get 't1', 'r1', {COLUMN => 'c1', TIMESTAMP => ts1}
  hbase> get 't1', 'r1', {COLUMN => 'c1', TIMERANGE => [ts1, ts2], VERSIONS => 4}
  hbase> get 't1', 'r1', {COLUMN => 'c1', TIMESTAMP => ts1, VERSIONS => 4}
  hbase> get 't1', 'r1', {FILTER => "ValueFilter(=, 'binary:abc')"}
  hbase> get 't1', 'r1', 'c1'
  hbase> get 't1', 'r1', 'c1', 'c2'
  hbase> get 't1', 'r1', ['c1', 'c2']
  hbase> get 't1', 'r1', {COLUMN => 'c1', ATTRIBUTES => {'mykey'=>'myvalue'}}
  hbase> get 't1', 'r1', {COLUMN => 'c1', AUTHORIZATIONS => ['PRIVATE','SECRET']}
  hbase> get 't1', 'r1', {CONSISTENCY => 'TIMELINE'}
  hbase> get 't1', 'r1', {CONSISTENCY => 'TIMELINE', REGION_REPLICA_ID => 1}

参数说明:

t1:指定表名

r1:指定rowkey查询

c1:指定列名查询

现在我们查询刚才test表中的数据

#指定rowkey查询
hbase(main):035:0* get 'test','1'
COLUMN                              CELL
 info:age                           timestamp=1526396011498, value=18
 info:name                          timestamp=1526396001809, value=tom
2 row(s) in 0.0580 seconds

#指定rowkey和列名查询,注意指定列名时要以列簇名为前缀
hbase(main):037:0* get 'test','1',{COLUMN=> 'info:name'}
COLUMN                              CELL
 info:name                          timestamp=1526396001809, value=tom
1 row(s) in 0.0280 seconds

#指定rowkey和列簇查询
hbase(main):042:0* get 'test','1','info'
COLUMN                              CELL
 info:age                           timestamp=1526396011498, value=18
 info:name                          timestamp=1526396001809, value=tom
2 row(s) in 0.0170 seconds

scan命令是用来查询hbase表数据,可以全表扫描列出所有数据,也可以指定扫描条件过滤数据,该命令的用法很丰富,过滤条件很多。这里简单列举两个个

#全表扫描
hbase(main):044:0* scan 'test'
ROW                                 COLUMN+CELL
 1                                  column=info:age, timestamp=1526396011498, value=18
 1                                  column=info:name, timestamp=1526396001809, value=tom
1 row(s) in 0.0610 seconds

#先插入两条数据
hbase(main):045:0> put 'test','2','info:name','jerry'
0 row(s) in 0.0190 seconds

hbase(main):046:0> put 'test','11','info:name','mary'
0 row(s) in 0.0160 seconds

#根据rowkey前缀,过滤扫描
hbase(main):047:0> scan 'test',{ROWPREFIXFILTER => '1'}
ROW                                 COLUMN+CELL
 1                                  column=info:age, timestamp=1526396011498, value=18
 1                                  column=info:name, timestamp=1526396001809, value=tom
 11                                 column=info:name, timestamp=1526396516873, value=mary
2 row(s) in 0.0310 seconds

该命令只能删除某个rowkey的某一列数据,不能整行删除

#比如我删除rowkey为1的info:age这列数据
hbase(main):049:0* delete 'test','1','info:age'
0 row(s) in 0.0590 seconds

hbase(main):050:0> get 'test','1'
COLUMN                              CELL
 info:name                          timestamp=1526396001809, value=tom
1 row(s) in 0.0150 seconds

这个命令和delete相比,它可以直接删除删除一行数据,而delete指定删除指定列数据,

比如删除test表中rowkey为2的整行数据

hbase(main):054:0* deleteall 'test','2'
0 row(s) in 0.0140 seconds

hbase(main):055:0> scan 'test'
ROW                                 COLUMN+CELL
 1                                  column=info:name, timestamp=1526396001809, value=tom
 11                                 column=info:name, timestamp=1526396516873, value=mary
2 row(s) in 0.0190 seconds

disable使某张表处于不可用状态,enable则是恢复表为正常状态,is_enabled是判断表的状态,exists是判断表是否存在

hbase(main):058:0* disable 'test'
0 row(s) in 2.3280 seconds

hbase(main):059:0> is_enabled 'test'
false
0 row(s) in 0.0630 seconds

hbase(main):061:0> enable 'test'
0 row(s) in 1.2760 seconds

hbase(main):063:0> exists 'test'
Table test does exist
0 row(s) in 0.0200 seconds

truncate命令清空表数据,drop删除表,注意删除表之前,需要disable表

hbase(main):066:0* truncate 'test'
Truncating 'test' table (it may take a while):
 - Disabling table...
 - Truncating table...
0 row(s) in 3.6610 seconds

hbase(main):067:0> scan 'test'
ROW                                 COLUMN+CELL
0 row(s) in 0.3420 seconds

#删除表前,先disable该表
hbase(main):068:0> disable 'test'
0 row(s) in 2.2660 seconds

hbase(main):069:0> drop 'test'
0 row(s) in 1.2660 seconds

该命令用于region的转移,当你发现多个regionserver间的region分配不均时,可以使用该命令转移region来进行平衡

该命令用于开启或关闭regionserver的自动平衡功能,balance_switch true开启,balance_switch false关闭,开启后,regionserver之间的region会自动进行平衡。balance_switch status可查看当前balance状态。

手机扫一扫

移动阅读更方便

阿里云服务器
腾讯云服务器
七牛云服务器

你可能感兴趣的文章