5. Unknown words
Example
Word house completely independently of the
word houses.
Training data do not add any knowledge
about the translation of houses.
6. What is Factored model
Redefining a word from a single symbol to a
vector of factors
Traditional Factored
Word
8. Components of Factored translation models
Language model
Translation model
Reordering model
Translation steps
Generation steps
Each component defines one or more feature
functions that are combined in a log-linear model:
Factored Translation
11. Methodology
Translation model
Prepare on training- Run POS tagger on corpus to tagged
the data
Establish word alignment and POS tagged alignment
using GIZA++
I Went To shop
牀牀鉦牆
牀牀牆牀牆牀牆
牀 牀鉦牆牀牆
PRP V PREP NN
PRP
NN
V
12. Methodology
According to the alignment of word and tag
source sentence will be reordered
Extract phrase pairs that are consistent with
the word alignment
Estimate scoring functions (conditional
phrase translation probabilities or lexical
translation probabilities)