狠狠撸

狠狠撸Share a Scribd company logo
Multilingual Translation Laboratory
National Institute of Information and Communications Technology
{chenchen.ding, mutiyama, eiichiro.sumita}@nict.go.jp
NICT at WAT 2015
Chenchen Ding, Masao Utiyama, Sumita Eiichiro
Even simple approaches work, as long as the properties of source and target languages considered
? REV-REO for JE SMT may be taken as a new baseline because of its simplicity (and generality)
? Character-based KJ translation seems enough for SMT, while specific post-processing needed
Conclusion
JE trans. BLEU RIBES HUM.
org. base. 18.45 .6451 -
our base. 18.09 .6397 -
REV-REO 18.96 .6845 +16.50
DEP-REO 18.98 .6599 +16.00
KJ trans. BLEU RIBES HUM.
org. base. 69.73 .9408 -
our base. 70.92 .9427 +18.25
+post-proc. 71.11 .9429 +10.50
Japanese-to-English (JE) SMT
Two pre-reordering approaches
? REV-REO (morpheme-level)
? DEP-REO (chunk- / morpheme-level)
また 提案 psrf は 変調 型 psrf より構造 が 簡単 で ある
また は 提案 psrf より 変調 型 psrfが 構造 で ある 簡単
また 提案 psrf は 変調 型 psrf より 構造 が 簡単 で ある
morpheme-level reordering
chunk-level reordering
また 提案 psrf は 変調 型 psrf より 構造 が 簡単 で ある
ある で 簡単 が 構造 より psrf 型 変調は psrf 提案 また
reverse reversetopic marker
REV-REO: J. Katz-Brown and M. Collins. 2008.
Syntactic reordering in preprocessing for Japanese → English translation:
MIT system description for NTCIR-7 patent translation task.
DEP-REO: C. Ding, K. Sakanushi, H. Touji, and M. Yamamoto. 2015.
Inter-, Intra-, and Extra-chunk Pre-reordering for Statistical Japanese-to-English MT
(丁, 酒主, 通事, 山本. 2014. 統計的日英翻訳における依存構造に基づく事前並べ替えルール)
Korean-to-Japanese (KJ) SMT
Purely character-based
? Unicode characters taken as basic units
? Nothing else
?? ?? ( 5 0 ) ?? ???? ?? ?? ??? ???? ??
製造 装置 ( 5 0 ) ?? 図示?? ?? 冷却 機構? 設置?? ??
製 造 装 置 5 0 に は 、 図 示 し な い 冷 却 機 構 も 設 け ら れ て い て
JE translation
? MeCab
? Cabocha
? mkn 5-gram
KJ translation
? max-ph-len=9
? mkn 9-gram
Evaluation
SMT:
LM:
Moses PB
SRILM

More Related Content

More from Association for Computational Linguistics (20)

PDF
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Association for Computational Linguistics
?
PDF
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Association for Computational Linguistics
?
PDF
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Association for Computational Linguistics
?
PDF
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Association for Computational Linguistics
?
PDF
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Association for Computational Linguistics
?
PDF
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Association for Computational Linguistics
?
PDF
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Association for Computational Linguistics
?
PDF
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
Association for Computational Linguistics
?
PDF
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
Association for Computational Linguistics
?
PDF
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Association for Computational Linguistics
?
PDF
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Association for Computational Linguistics
?
PDF
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Association for Computational Linguistics
?
PDF
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Association for Computational Linguistics
?
PDF
Chenchen Ding - 2015 - NICT at WAT 2015
Association for Computational Linguistics
?
PDF
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Association for Computational Linguistics
?
PDF
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Association for Computational Linguistics
?
PDF
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Association for Computational Linguistics
?
PDF
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Association for Computational Linguistics
?
PDF
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian Translation
Association for Computational Linguistics
?
PDF
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Association for Computational Linguistics
?
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Association for Computational Linguistics
?
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Association for Computational Linguistics
?
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Association for Computational Linguistics
?
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Association for Computational Linguistics
?
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Association for Computational Linguistics
?
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Association for Computational Linguistics
?
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Association for Computational Linguistics
?
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
Association for Computational Linguistics
?
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
Association for Computational Linguistics
?
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Association for Computational Linguistics
?
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Association for Computational Linguistics
?
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Association for Computational Linguistics
?
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Association for Computational Linguistics
?
Chenchen Ding - 2015 - NICT at WAT 2015
Association for Computational Linguistics
?
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Association for Computational Linguistics
?
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Association for Computational Linguistics
?
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Association for Computational Linguistics
?
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Association for Computational Linguistics
?
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian Translation
Association for Computational Linguistics
?
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Association for Computational Linguistics
?

Chenchen Ding - 2015 - NICT at WAT 2015

  • 1. Multilingual Translation Laboratory National Institute of Information and Communications Technology {chenchen.ding, mutiyama, eiichiro.sumita}@nict.go.jp NICT at WAT 2015 Chenchen Ding, Masao Utiyama, Sumita Eiichiro Even simple approaches work, as long as the properties of source and target languages considered ? REV-REO for JE SMT may be taken as a new baseline because of its simplicity (and generality) ? Character-based KJ translation seems enough for SMT, while specific post-processing needed Conclusion JE trans. BLEU RIBES HUM. org. base. 18.45 .6451 - our base. 18.09 .6397 - REV-REO 18.96 .6845 +16.50 DEP-REO 18.98 .6599 +16.00 KJ trans. BLEU RIBES HUM. org. base. 69.73 .9408 - our base. 70.92 .9427 +18.25 +post-proc. 71.11 .9429 +10.50 Japanese-to-English (JE) SMT Two pre-reordering approaches ? REV-REO (morpheme-level) ? DEP-REO (chunk- / morpheme-level) また 提案 psrf は 変調 型 psrf より構造 が 簡単 で ある また は 提案 psrf より 変調 型 psrfが 構造 で ある 簡単 また 提案 psrf は 変調 型 psrf より 構造 が 簡単 で ある morpheme-level reordering chunk-level reordering また 提案 psrf は 変調 型 psrf より 構造 が 簡単 で ある ある で 簡単 が 構造 より psrf 型 変調は psrf 提案 また reverse reversetopic marker REV-REO: J. Katz-Brown and M. Collins. 2008. Syntactic reordering in preprocessing for Japanese → English translation: MIT system description for NTCIR-7 patent translation task. DEP-REO: C. Ding, K. Sakanushi, H. Touji, and M. Yamamoto. 2015. Inter-, Intra-, and Extra-chunk Pre-reordering for Statistical Japanese-to-English MT (丁, 酒主, 通事, 山本. 2014. 統計的日英翻訳における依存構造に基づく事前並べ替えルール) Korean-to-Japanese (KJ) SMT Purely character-based ? Unicode characters taken as basic units ? Nothing else ?? ?? ( 5 0 ) ?? ???? ?? ?? ??? ???? ?? 製造 装置 ( 5 0 ) ?? 図示?? ?? 冷却 機構? 設置?? ?? 製 造 装 置 5 0 に は 、 図 示 し な い 冷 却 機 構 も 設 け ら れ て い て JE translation ? MeCab ? Cabocha ? mkn 5-gram KJ translation ? max-ph-len=9 ? mkn 9-gram Evaluation SMT: LM: Moses PB SRILM