Project Information
This project contains mappings from language- and treebank-specific part-of-speech (POS) tagsets to a set of 12 universal POS tags, as described in "A Universal Part-of-Speech Tagset" by Slav Petrov, Dipanjan Das and Ryan McDonald (link). Mappings are currently available for 25 treebanks covering the following 22 languages: Arabic, Basque, Bulgarian, Catalan, Chinese, Czech, Danish, Dutch, English, French, German, Greek, Hungarian, Italian, Japanese, Korean, Portuguese, Russian, Slovene, Spanish, Swedish, Turkish.
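As a rough illustration of how such a mapping might be applied, the sketch below converts treebank-specific tags to universal tags. It assumes a mapping is given as plain text with one whitespace-separated pair per line (fine-grained tag, then universal tag); that layout, the function names `parse_mapping` and `to_universal`, and the handful of Penn Treebank-style sample entries are illustrative assumptions, not taken from the project's files.

```python
# The 12 universal POS tags described by Petrov, Das and McDonald.
UNIVERSAL_TAGS = {
    "NOUN", "VERB", "ADJ", "ADV", "PRON", "DET",
    "ADP", "NUM", "CONJ", "PRT", ".", "X",
}

# Hypothetical sample mapping text (assumed format: "<fine tag> <universal tag>").
SAMPLE_MAP_TEXT = "DT\tDET\nNN\tNOUN\nVBD\tVERB\nIN\tADP\nJJ\tADJ\n.\t.\n"


def parse_mapping(text):
    """Parse mapping text into a dict from fine-grained tag to universal tag."""
    mapping = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        fine, universal = line.split()
        mapping[fine] = universal
    return mapping


def to_universal(tagged_tokens, mapping, fallback="X"):
    """Replace each token's fine-grained tag; unknown tags fall back to X."""
    return [(word, mapping.get(tag, fallback)) for word, tag in tagged_tokens]


if __name__ == "__main__":
    mapping = parse_mapping(SAMPLE_MAP_TEXT)
    sentence = [("The", "DT"), ("cat", "NN"), ("sat", "VBD"), (".", ".")]
    print(to_universal(sentence, mapping))
```

Falling back to `X` for unmapped tags is a design choice made for this sketch; a stricter pipeline might raise an error instead so that gaps in a mapping are noticed.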