Discriminative Modeling of Extraction Sets for Machine Translation

About This Presentation

Slides: 28

Provided by: Justin246

Learn more at: http://www.cs.cmu.edu

Transcript and Presenter's Notes

Title: Discriminative Modeling of Extraction Sets for Machine Translation

1
Discriminative Modeling of Extraction Sets for
Machine Translation

2
Contribution

Extraction set
Nested collection of all the overlapping phrase
pairs consistent with an underlying
word alignment
Advantages over word-factored alignment models
Can incorporate features on phrase pairs, not
just on word links
Optimizes an extraction-based loss function that
relates directly to generating translations
Performs better than both supervised and
unsupervised baselines
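
The definition on this slide can be made concrete with a small sketch. It assumes the standard phrase-extraction consistency criterion (a phrase pair must contain at least one alignment link, and no link may cross its boundary); the slides may treat null and possible links differently. The names `extraction_set`, `Link`, `Bispan` and the `max_len` cap are hypothetical, chosen for illustration only.

```python
from typing import Set, Tuple

Link = Tuple[int, int]              # (source index, target index)
Bispan = Tuple[int, int, int, int]  # (src_start, src_end, tgt_start, tgt_end), ends exclusive


def extraction_set(links: Set[Link], src_len: int, tgt_len: int,
                   max_len: int = 3) -> Set[Bispan]:
    """Enumerate all phrase pairs (bispans) consistent with a word alignment.

    Assumption: a bispan is consistent if it contains at least one link and
    no link has exactly one endpoint inside it.
    """
    result: Set[Bispan] = set()
    for i in range(src_len):
        for j in range(i + 1, min(src_len, i + max_len) + 1):
            for k in range(tgt_len):
                for l in range(k + 1, min(tgt_len, k + max_len) + 1):
                    inside = 0
                    consistent = True
                    for s, t in links:
                        s_in = i <= s < j
                        t_in = k <= t < l
                        if s_in and t_in:
                            inside += 1
                        elif s_in or t_in:
                            # Link crosses the bispan boundary.
                            consistent = False
                            break
                    if consistent and inside > 0:
                        result.add((i, j, k, l))
    return result


# Toy alignment: source word 0 -> target 0, and source words 1, 2 swap order.
example_links = {(0, 0), (1, 2), (2, 1)}
example_set = extraction_set(example_links, src_len=3, tgt_len=3)
```

For this toy alignment the result contains three single-word pairs, the two-word pair covering the swapped words, and the whole sentence pair, which illustrates how the phrase pairs nest and overlap.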

3
Progress of Statistical MT

4
Outline

5
Extraction Set Models

6
Extraction Set Models

7
Extraction Sets from Word Alignments

8
Extraction Sets from Word Alignments

9
Extraction Sets from Word Alignments

10
Possible and Null Alignment Links

11
Interpreting Possible and Null Alignment Links

12
Interpreting Possible and Null Alignment Links

13
Linear Model for Extraction Set

14
Scoring Extraction Sets

15
Model Estimation

16
MIRA (Margin Infused Relaxed Algorithm)
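
The transcript preserves only the name of the algorithm on this slide. As a reminder of what a MIRA step looks like, here is a minimal sketch of the generic 1-best update over sparse feature vectors. It is not taken from the slides: the function name `mira_update`, the feature names, and the step-size cap `C` are assumptions made for illustration.

```python
from typing import Dict

Features = Dict[str, float]


def mira_update(weights: Features, gold_feats: Features, pred_feats: Features,
                loss: float, C: float = 1.0) -> Features:
    """Return new weights after one 1-best MIRA step.

    The step moves the weights toward the gold features just far enough that
    gold outscores the prediction by a margin of `loss`, capped at C.
    """
    # Feature difference between the gold and the predicted structure.
    diff: Features = {}
    for name in set(gold_feats) | set(pred_feats):
        d = gold_feats.get(name, 0.0) - pred_feats.get(name, 0.0)
        if d != 0.0:
            diff[name] = d

    norm_sq = sum(v * v for v in diff.values())
    if norm_sq == 0.0:
        return dict(weights)  # identical features: nothing to update

    # Current score margin of gold over the prediction.
    margin = sum(weights.get(n, 0.0) * v for n, v in diff.items())
    tau = min(C, max(0.0, (loss - margin) / norm_sq))

    new_weights = dict(weights)
    for n, v in diff.items():
        new_weights[n] = new_weights.get(n, 0.0) + tau * v
    return new_weights
```

In a training loop, `pred_feats` would come from the highest-scoring structure under the current weights and `loss` from comparing that prediction with the gold one.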

17
Extraction Set Loss Function

18
Model Inference

19
Possible Decompositions

20
DP for Extraction Sets

21
DP for Extraction Sets

22
Finding Pseudo-Gold ITG Alignment

23
Experiments

24
Five systems for comparison

25
Data

Discriminative training and alignment evaluation
Trained baseline HMM on 11.3 million words of
FBIS newswire data
Hand-aligned portion of the NIST MT02 test set
150 training and 191 test sentences
End-to-end translation experiments
Trained on a 22.1-million-word parallel corpus
consisting of sentences of up to length 40 of newswire
data from the GALE program
NIST MT04/MT05 test sets