| ## EvaLatin 2020 Models #evalatin20_models | |
| EvaLatin 2020 Models are distributed under the | |
| [CC BY-NC-SA](https://creativecommons.org/licenses/by-nc-sa/4.0/) licence. | |
| The models are based solely on [EvaLatin 2020](https://github.com/CIRCSE/LT4HALA) | |
| treebanks, and additionally use [multilingual BERT](https://github.com/google-research/bert/blob/master/multilingual.md). | |
| The models require [UDPipe 2](https://ufal.mff.cuni.cz/udpipe/2). | |
| ### Download | |
| The latest version 200831 of the EvaLatin 2020 models | |
| can be downloaded from [LINDAT/CLARIN repository](https://hdl.handle.net/11234/1-4803). | |
| The models are also available in the [REST service](https://lindat.mff.cuni.cz/services/udpipe/). | |
| ### Acknowledgements #evalatin20_models_acknowledgements | |
| This work was supported by the grant no. GX20-16819X of the Grant Agency of the | |
| Czech Republic, and has been using language resources stored and distributed by | |
| the LINDAT/CLARIAH-CZ project of the Ministry of Education, Youth and Sports of | |
| the Czech Republic (project LM2018101). | |
| The models were trained on [EvaLatin 2020](https://github.com/CIRCSE/LT4HALA) treebanks. | |
| Finally, [multilingual BERT](https://github.com/google-research/bert/blob/master/multilingual.md) | |
| is used to provide contextualized word embeddings. | |
| ### Publications | |
| - Milan Straka, Jana Straková (2020): [UDPipe at EvaLatin 2020: Contextualized Embeddings | |
| and Treebank Embeddings](https://arxiv.org/abs/2006.03687). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, 2006.03687 | |
| ### Model Performance | |
| | Model | Dataset | UPOS | Lemma | | |
| |:------|:------------------|------:|-------:| | |
| | latin-evalatin20-200830 | test classical | 96.73 | 96.39 | | |
| | latin-evalatin20-200830 | test cross-genre | 90.47 | 86.89 | | |
| | latin-evalatin20-200830 | test cross-time | 87.58 | 90.59 | | |