Neural Reranking Helps Subjective Quality of Machine Translation: NAIST at WAT 2015
Graham Neubig, Makoto Morishita, Satoshi Nakamura
Nara Institute of Science and Technology (NAIST), Japan
Neural Reranking
Quantitative Results
Experimental Setup
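Figure: neural reranking pipeline. The input "he has a cold" is translated by the tree-to-string (T2S) system into an n-best list with MT features. A neural model adds an nmt feature to each candidate, and the list is then rescored and reranked. (Japanese candidate sentences omitted; candidates are numbered as in the original n-best list.)

N-best w/ MT features:
  1. [candidate 1] t=-0.5 l=-5.6 | -6.1
  2. [candidate 2] t=-0.9 l=-5.8 | -6.7
  3. [candidate 3] t=-1.5 l=-5.3 | -6.8
  4. [candidate 4] t=-1.9 l=-5.4 | -7.3

Neural features (from the neural model):
  [candidate 1] nmt=-5.8
  [candidate 2] nmt=-5.5
  [candidate 3] nmt=-3.4
  [candidate 4] nmt=-5.2

Rescored/reranked n-best (leading number is the new rank):
  2. [candidate 1] t=-0.5 l=-5.6 nmt=-5.8 | -10.9
  3. [candidate 2] t=-0.9 l=-5.8 nmt=-5.5 | -11.2
  1. [candidate 3] t=-1.5 l=-5.3 nmt=-3.4 | -9.2
  4. [candidate 4] t=-1.9 l=-5.4 nmt=-5.2 | -12.5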
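The rescoring step in the figure can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the final score is a uniformly weighted sum of the three features, whereas a real system would normally tune the weights. The names `Candidate` and `rerank` are hypothetical. The combined totals printed on the poster do not all equal the plain sum, so the sketch reproduces only the ordering.

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    label: str   # stand-in for the Japanese candidate text
    t: float     # feature labeled "t" on the poster
    l: float     # feature labeled "l" on the poster
    nmt: float   # feature added by the neural model

def rerank(candidates, weights=None):
    """Sort candidates by weighted feature sum, highest score first."""
    # Assumption: uniform weights; the poster does not state the weights used.
    w = weights or {"t": 1.0, "l": 1.0, "nmt": 1.0}

    def score(c):
        return w["t"] * c.t + w["l"] * c.l + w["nmt"] * c.nmt

    return sorted(candidates, key=score, reverse=True)

# Feature values copied from the poster; labels follow the original n-best order.
nbest = [
    Candidate("cand1", -0.5, -5.6, -5.8),
    Candidate("cand2", -0.9, -5.8, -5.5),
    Candidate("cand3", -1.5, -5.3, -3.4),
    Candidate("cand4", -1.9, -5.4, -5.2),
]

order = [c.label for c in rerank(nbest)]
print(order)  # ['cand3', 'cand1', 'cand2', 'cand4']
```

Under these assumptions the third candidate moves to the top, which matches the reranked order shown on the poster.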
• Data: ASPEC Scientific Abstracts
  - Japanese ↔ English, Chinese
• Baseline: NAIST WAT2014 Tree-to-String System
  - Strong baseline achieving high scores
  - Implemented using Travatar (http://phontron.com/travatar)
• Neural MT Model: Attentional model
  - Trained on ~500k sentences, 256 hidden nodes, 2-model ensemble
  - Uses words occurring 3+ times (vocab 50,000~80,000)
  - Trained w/ lamtram (http://github.com/neubig/lamtram)
• Automatic Evaluation: BLEU, RIBES
• Manual Evaluation: WAT 2015 HUMAN Score
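The vocabulary rule in the setup (keep words occurring 3+ times) could look roughly like the sketch below. It is not lamtram's code. The `<unk>` symbol, the whitespace tokenization, and the function names are assumptions made for illustration, and the toy corpus is invented.

```python
from collections import Counter

UNK = "<unk>"  # hypothetical symbol for out-of-vocabulary words

def build_vocab(sentences, min_count=3):
    """Return the set of words seen at least min_count times."""
    counts = Counter(word for sent in sentences for word in sent.split())
    return {word for word, count in counts.items() if count >= min_count}

def replace_rare(sentence, vocab):
    """Map every word outside the vocabulary to UNK."""
    return " ".join(w if w in vocab else UNK for w in sentence.split())

# Toy corpus (invented, not from ASPEC).
corpus = ["he has a cold", "he has a cat", "he has a cold drink"]
vocab = build_vocab(corpus)
print(sorted(vocab))                             # ['a', 'has', 'he']
print(replace_rare("he has a cold cat", vocab))  # he has a <unk> <unk>
```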
Research Questions
1. Does reranking with neural MT models improve
subjective impressions of translation results?
2. If so, what are the qualitative differences
between reranked and non-reranked output?
3. How big of an n-best list do we need?
Figure: BLEU (axis 0-50) and RIBES (axis 70-86) for Base vs. Rerank on en-ja, ja-en, zh-ja, and ja-zh. Difference labels shown on the plots: +1.6, +2.8, +2.5, +1.5, +1.8, +2.7, +1.4, +1.8.
Confirm what we know: Neural reranking helps BLEU.
Figure: WAT 2015 HUMAN score (axis 0-70) for Base vs. Rerank on en-ja, ja-en, zh-ja, and ja-zh. Difference labels shown on the plot: +12.5, +23.7, +10.0, +4.2.
Show what we didn't know: Also helps manual eval.
Reranking and N-best Size
• All data sets improve approximately log-linearly with n-best size
• Little saturation even at 100-best
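One way to read "approximately log-linear" is that the score grows roughly as a + b·ln(n) in the n-best size n. The sketch below fits that form by ordinary least squares. The function name `fit_log_linear` is hypothetical, and the data points are synthetic values generated from the formula itself, not results from the poster.

```python
import math

def fit_log_linear(ns, scores):
    """Least-squares fit of score = a + b * ln(n); returns (a, b)."""
    xs = [math.log(n) for n in ns]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(scores) / len(scores)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, scores))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b

# Synthetic points lying exactly on 20 + 0.5 * ln(n); illustrative only.
ns = [1, 10, 100]
synthetic = [20.0 + 0.5 * math.log(n) for n in ns]
a, b = fit_log_linear(ns, synthetic)
print(round(a, 6), round(b, 6))
```

With real scores, a positive slope b that stays stable as larger n are added would correspond to the "little saturation" observation.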
Detailed Analysis
#1 Improvement: Phrasal Reordering (+26, -4)
Source: [Japanese text not recoverable]
Ref: In case 2, reddening, induration, and skin ulcer appeared during chemical therapy for liver metastasis of rectal cancer.
Base: In case 2, occurred during chemotherapy for liver metastasis of rectal cancer, flare, induration, skin ulcer.
Rerank: In case 2, the flare, induration, skin ulcer was produced during the chemotherapy for hepatic metastasis of rectal cancer.
#2 Improvement: Auxiliary Verb Ins./Del. (+15, -0)
Source: [Japanese text not recoverable]
Ref: Governing equation derived by this method is useful for turbulent shear flow like turbulent flow near wall.
Base: The governing equation is obtained by this is also useful for such as wall turbulence shear flow.
Rerank: The governing equation obtained by this is also useful for shear flow such as wall turbulence.
#3 Improvement: Coordinate Structures (+13, -2)
Source: [Japanese text not recoverable]
Ref: Laser work is done by local heating and ablation with high density light flux.
Base: The laser processing is carried out by local heating by high-density luminous flux and ablation.
Rerank: The laser processing is carried out by local heating and ablation by high-density flux.
#4 Improvement: Verb Agreement (+6, -0)
Source: [Japanese text not recoverable]
Ref: Langmuir-Blodgett method and inclusion compounds are mentioned.
Base: Langmuir-Blodgett method and inclusion is also discussed.
Rerank: Langmuir-Blodgett method and inclusion are also mentioned.
Type          Improved  Degraded  % Impr.
Reordering          55         9      86%
Deletion            20        10      67%
Insertion           19         2      90%
Substitution        15        11      58%
Conjugation          8         1      89%
Total              117        33      78%
Overall improvements re-confirmed
Particularly reordering, insertion, and conjugation errors
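The "% Impr." column appears to be improved / (improved + degraded), rounded to a whole percent. That reading is an assumption, but it reproduces every row. A short check using the counts from the table:

```python
# (improved, degraded) counts copied from the table.
rows = {
    "Reordering": (55, 9),
    "Deletion": (20, 10),
    "Insertion": (19, 2),
    "Substitution": (15, 11),
    "Conjugation": (8, 1),
}

def pct_improved(improved, degraded):
    """Share of changed sentences that improved, in percent."""
    return 100.0 * improved / (improved + degraded)

total_improved = sum(i for i, _ in rows.values())
total_degraded = sum(d for _, d in rows.values())

for name, (i, d) in rows.items():
    print(f"{name:<13}{pct_improved(i, d):.0f}%")
print(f"{'Total':<13}{pct_improved(total_improved, total_degraded):.0f}%")
```

The totals come out to 117 improved and 33 degraded, or 78% improved, as in the last row of the table.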
Qualitative Analysis
What Didn't Work: Terminology (+2, -4)
Source: [Japanese text not recoverable]
Ref: Infrared ray applied measurement using radiant heat is useful for stress analysis.
Base: The infrared application measurement using radiant heat is useful in the stress analysis.
Rerank: Infrared ray application measurement using radiation heat is useful for stress analysis.
Conclusion
• Reranking with neural MT models leads to subjective improvements in MT quality
• Future work includes comparisons with neural language models or neural MT w/o reranking

Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machine Translation: NAIST at WAT2015
