29回勉強会資料「PostgreSQLのリカバリ超入門」
See also http://www.interdb.jp/pgsql (Coming soon!)
初心者向け。PostgreSQLのWAL、CHECKPOINT、 オンラインバックアップの仕組み解説。
これを見たら、次は→ http://www.slideshare.net/satock/29shikumi-backup
29回勉強会資料「PostgreSQLのリカバリ超入門」
See also http://www.interdb.jp/pgsql (Coming soon!)
初心者向け。PostgreSQLのWAL、CHECKPOINT、 オンラインバックアップの仕組み解説。
これを見たら、次は→ http://www.slideshare.net/satock/29shikumi-backup
This document discusses the application of PostgreSQL in a large social infrastructure project involving smart meter management. It describes three main missions: (1) loading 10 million datasets within 10 minutes, (2) saving data for 24 months, and (3) stabilizing performance for large scale SELECT statements. Various optimizations are discussed to achieve these missions, including data modeling, performance tuning, reducing data size, and controlling execution plans. The results showed that all three missions were successfully completed by applying PostgreSQL expertise and customizing it for the large-scale requirements of the project.
16. ? 2018 NTT DATA Corporation 16
FileSystem on Microsoft Azure
? NativeAzureFileSystem
– HADOOP-9629:
Support Windows Azure Storage - Blob file system in Hadoop
– wasb://...
? AdlFileSystem
– HADOOP-12666:
Support Microsoft Azure Data Lake - as a file system in Hadoop
– adl://...
? AzureBlobFileSystem
– HADOOP-15407:
Support Windows Azure Storage - Blob file system in Hadoop
– adfs://...
17. ? 2018 NTT DATA Corporation 17
FileSystem specification
? HADOOP-9361: Strictly define the expected behavior of
filesystem APIs and write tests to verify compliance
? FileSystemの仕様をちゃんと文書化
– https://hadoop.apache.org/docs/r3.1.0/hadoop-project-
dist/hadoop-common/filesystem/index.html
? 仕様の満足度のテストシステムを追加
@Test
public void testGetFileStatusRoot() throws Throwable {
ContractTestUtils.assertIsDirectory(
getFileSystem().getFileStatus(new Path("/")));
}
24. ? 2018 NTT DATA Corporation 24
HBaseのWAL書き込み
? HDFSのhflush/hsyncを利用
– HADOOP-1700: Append to files in HDFS
– HADOOP-2657: Enhancements to DFSClient to support flushing data at any point in time
? fsyncに相当: ファイルをcloseせずwriteをdurableに
? WALエントリのwrite確定後にクライアントにレスポンス
DataNode
NameNode
DataNode
Client
DataNode
write
ack
write
IBR
addBlock
LocatedBlock
write
ack
ackwrite済みのデータの
ackが戻るまで待つ
hsyncだとackを返
す前にfsync
25. ? 2018 NTT DATA Corporation 25
HBase on Amazon S3
? HBASE-17437: Support specifying a WAL directory
outside of the root directory
– contributed by AWS
– https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-hbase-s3.html
<property>
<name>hbase.rootdir</name>
<value>s3a://mybucket/hbase-root</value>
</property>
<property>
<name>hbase.wal.dir</name>
<value>hdfs://mycluster/hbase-wal</value>
</property>
26. ? 2018 NTT DATA Corporation 26
HBase 1.0 API
? APIをクリーンアップ
– HBASE-10479: HConnection interface is public but is used internally, and contains a bunch of methods
– HBASE-12127: Move the core Connection creation functionality into ConnectionFactory
– ..........
? Googleのエンジニアが協力
? HBaseのアプリがそのままCloud Bigtableに接続可能
– ConnectionFactgory#createConnectionが返すオブジェクトを切り替え
– 利用できないAPIもある
? https://cloud.google.com/bigtable/docs/hbase-differences
<property>
<name>hbase.client.connection.impl</name
<value>com.google.cloud.bigtable.hbase1_x.BigtableConnection</value>
</property>