Derivational morphology is a fundamental and complex characteristic of language. In this paper we propose the new task of predicting the derivational form of a given base-form lemma that is appropriate for a given context. We present an encoder--decoder style neural network to produce a derived form character-by-character, based on its corresponding character-level representation of the base form and the context. We demonstrate that our model is able to generate valid context-sensitive derivations from known base forms, but is less accurate under a lexicon agnostic setting.
9. MOTIVATION
How well we can predict derivations directly from the
context?
9
Ekaterina Vylomova, evylomova@gmail.com
. . the ergometer ’s inability to properly SIMULATE
the larger rowers drag on a boat . . .
. . . this SIMULATE package is based on Simula 's object
oriented features and its coroutine concept . . .
. . . Bay pilots trained for the visit on a SIMULATE
at the California Maritime Academy . . .
10. MOTIVATION
How well we can predict derivations directly from the
context?
10
Ekaterina Vylomova, evylomova@gmail.com
. . the ergometer ’s inability to properly SIMULATE
the larger rowers drag on a boat . . .
. . . this SIMULATE package is based on Simula 's object
oriented features and its coroutine concept . . .
. . . Bay pilots trained for the visit on a SIMULATE
at the California Maritime Academy . . .
11. MOTIVATION
How well we can predict derivations directly from the
context?
11
Ekaterina Vylomova, evylomova@gmail.com
. . the ergometer ’s inability to properly SIMULATE
the larger rowers drag on a boat . . .
. . . this SIMULATION package is based on Simula 's object
oriented features and its coroutine concept . . .
. . . Bay pilots trained for the visit on a SIMULATE
at the California Maritime Academy . . .
12. MOTIVATION
How well we can predict derivations directly from the
context?
12
Ekaterina Vylomova, evylomova@gmail.com
. . the ergometer ’s inability to properly SIMULATE
the larger rowers drag on a boat . . .
. . . this SIMULATION package is based on Simula 's object
oriented features and its coroutine concept . . .
. . . Bay pilots trained for the visit on a SIMULATOR
at the California Maritime Academy . . .
13. BASELINE : 3-gram Modified KN smoothing
13
Ekaterina Vylomova, evylomova@gmail.com
This SIMULATE package is based on Simula 's object oriented features ... -47.9
This SIMULATES package is based on Simula 's object oriented features ... -50.0
This SIMULATED package is based on Simula 's object oriented features ... -49.0
This SIMULATING package is based on Simula 's object oriented features ... -49.5
This SIMULATION package is based on Simula 's object oriented features ... -46.1
This SIMULATOR package is based on Simula 's object oriented features ... -48.9
This SIMULATORS package is based on Simula 's object oriented features ... -50.7
log p
14. BASELINE : 3-gram Modified KN smoothing
14
Ekaterina Vylomova, evylomova@gmail.com
This SIMULATE package is based on Simula 's object oriented features ... -47.9
This SIMULATES package is based on Simula 's object oriented features ... -50.0
This SIMULATED package is based on Simula 's object oriented features ... -49.0
This SIMULATING package is based on Simula 's object oriented features ... -49.5
This SIMULATION package is based on Simula 's object oriented features ... -46.1
This SIMULATOR package is based on Simula 's object oriented features ... -48.9
This SIMULATORS package is based on Simula 's object oriented features ... -50.7
log p
15. DATASET
? English Verb Nominalizations only
? CELEX: <accusation, accuse+ation>
24 suffix classes / 1,456 base lemmas / 3,079 unique lemma
pairs
? Contexts: 107,041 contextual instances from English Wikipedia
? Pre-trained word embeddings: word2vec trained on Google News
15
Ekaterina Vylomova, evylomova@gmail.com
26. ERROR ANALYSIS
26
Ekaterina Vylomova, evylomova@gmail.com
Correct (Context of ...) Predicted
Extra Variety of Forms
student studint, studion, studyant,
student
Especially in Split Lexicon Setting
trailer trailer, trailation, trailment
27. ERROR ANALYSIS
27
Ekaterina Vylomova, evylomova@gmail.com
Correct (Context of ...) Predicted
Lack of Forms
government and governance government
Bias Towards Productive Suffixes
stoppage stoption