In the first half, we give an introduction to modern serialization systems, Protocol Buffers, Apache Thrift and Apache Avro. Which one does meet your needs?
In the second half, we show an example of data ingestion system architecture using Apache Avro.
AWS Black Belt Online Seminarの最新コンテンツ: https://aws.amazon.com/jp/aws-jp-introduction/#new
過去に開催されたオンラインセミナーのコンテンツ一覧: https://aws.amazon.com/jp/aws-jp-introduction/aws-jp-webinar-service-cut/
The document discusses two Rust-based full-text search engines: Tantivy and Bayard. It describes trying out Tantivy using its CLI tool to index 1000 Wikipedia articles and perform searches. It then discusses Bayard's features like supporting Japanese text, REST API, and clustering. The author shares their experience setting up Bayard with Docker to index and search Japanese sample data.
In the first half, we give an introduction to modern serialization systems, Protocol Buffers, Apache Thrift and Apache Avro. Which one does meet your needs?
In the second half, we show an example of data ingestion system architecture using Apache Avro.
AWS Black Belt Online Seminarの最新コンテンツ: https://aws.amazon.com/jp/aws-jp-introduction/#new
過去に開催されたオンラインセミナーのコンテンツ一覧: https://aws.amazon.com/jp/aws-jp-introduction/aws-jp-webinar-service-cut/
The document discusses two Rust-based full-text search engines: Tantivy and Bayard. It describes trying out Tantivy using its CLI tool to index 1000 Wikipedia articles and perform searches. It then discusses Bayard's features like supporting Japanese text, REST API, and clustering. The author shares their experience setting up Bayard with Docker to index and search Japanese sample data.
33. !! INSERT breaks bin-log !!
use testdb;
INSERT INTO a_data (id, create_date) VALUES ('1', NOW());
34. !! INSERT breaks bin-log !!
ERROR: Error in Log_event::read_log_event(): 'read error',
data_len: 66899, event_type: 41
Could not read entry at offset 881:Error in log format or
read error
# at 575
#140620 3:59:32 server id 1 end_log_pos 642 Query
thread_id=575 exec_time=0 error_code=0
!
BEGIN
/*!*/;
--
use SYSTEMtestdb/*!*/;
etstdbINSERT INTO a_data (id, create_date) VALUES ('1',
/*!*/;
35. !! INSERT breaks bin-log !!
ERROR: Error in Log_event::read_log_event(): 'read error',
data_len: 66899, event_type: 41
Could not read entry at offset 881:Error in log format or
read error
# at 575
#140620 3:59:32 server id 1 end_log_pos 642 Query
thread_id=575 exec_time=0 error_code=0
!
BEGIN
/*!*/;
--
use SYSTEMtestdb/*!*/;
etstdbINSERT INTO a_data (id, create_date) VALUES ('1',
/*!*/;
1. 途中挟まっている「SYSTEM」という文字列はタイムゾーン
2. 「use <database>」の間にタイムゾーンの文字が割り込む
3. INSERT文行頭に「1文字削れたDB名」がくっつき,VALUES以降が消失してしまう
4. flushすれば再開するが,結局INSERT毎にbin-logが破損してしまう