README.md
2.3 KB · 104 lines · markdown Raw
1 ---
2 language:
3 - en
4 - es
5
6 tags:
7 - translation
8
9 license: apache-2.0
10 ---
11
12 ### eng-spa
13
14 * source group: English
15 * target group: Spanish
16 * OPUS readme: [eng-spa](https://github.com/Helsinki-NLP/Tatoeba-Challenge/tree/master/models/eng-spa/README.md)
17
18 * model: transformer
19 * source language(s): eng
20 * target language(s): spa
21 * model: transformer
22 * pre-processing: normalization + SentencePiece (spm32k,spm32k)
23 * download original weights: [opus-2020-08-18.zip](https://object.pouta.csc.fi/Tatoeba-MT-models/eng-spa/opus-2020-08-18.zip)
24 * test set translations: [opus-2020-08-18.test.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/eng-spa/opus-2020-08-18.test.txt)
25 * test set scores: [opus-2020-08-18.eval.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/eng-spa/opus-2020-08-18.eval.txt)
26
27 ## Benchmarks
28
29 | testset | BLEU | chr-F |
30 |-----------------------|-------|-------|
31 | newssyscomb2009-engspa.eng.spa | 31.0 | 0.583 |
32 | news-test2008-engspa.eng.spa | 29.7 | 0.564 |
33 | newstest2009-engspa.eng.spa | 30.2 | 0.578 |
34 | newstest2010-engspa.eng.spa | 36.9 | 0.620 |
35 | newstest2011-engspa.eng.spa | 38.2 | 0.619 |
36 | newstest2012-engspa.eng.spa | 39.0 | 0.625 |
37 | newstest2013-engspa.eng.spa | 35.0 | 0.598 |
38 | Tatoeba-test.eng.spa | 54.9 | 0.721 |
39
40
41 ### System Info:
42 - hf_name: eng-spa
43
44 - source_languages: eng
45
46 - target_languages: spa
47
48 - opus_readme_url: https://github.com/Helsinki-NLP/Tatoeba-Challenge/tree/master/models/eng-spa/README.md
49
50 - original_repo: Tatoeba-Challenge
51
52 - tags: ['translation']
53
54 - languages: ['en', 'es']
55
56 - src_constituents: {'eng'}
57
58 - tgt_constituents: {'spa'}
59
60 - src_multilingual: False
61
62 - tgt_multilingual: False
63
64 - prepro: normalization + SentencePiece (spm32k,spm32k)
65
66 - url_model: https://object.pouta.csc.fi/Tatoeba-MT-models/eng-spa/opus-2020-08-18.zip
67
68 - url_test_set: https://object.pouta.csc.fi/Tatoeba-MT-models/eng-spa/opus-2020-08-18.test.txt
69
70 - src_alpha3: eng
71
72 - tgt_alpha3: spa
73
74 - short_pair: en-es
75
76 - chrF2_score: 0.721
77
78 - bleu: 54.9
79
80 - brevity_penalty: 0.978
81
82 - ref_len: 77311.0
83
84 - src_name: English
85
86 - tgt_name: Spanish
87
88 - train_date: 2020-08-18 00:00:00
89
90 - src_alpha2: en
91
92 - tgt_alpha2: es
93
94 - prefer_old: False
95
96 - long_pair: eng-spa
97
98 - helsinki_git_sha: d2f0910c89026c34a44e331e785dec1e0faa7b82
99
100 - transformers_git_sha: f7af09b4524b784d67ae8526f0e2fcc6f5ed0de9
101
102 - port_machine: brutasse
103
104 - port_time: 2020-08-24-18:20