My favorites | Sign in
Project Home Downloads Wiki Issues Source
Project Information
Members

This project contains mappings from language and treebank specific part-of-speech (POS) tagsets to a set of 12 universal POS tags, as described in

"A Universal Part-of-Speech Tagset" by Slav Petrov, Dipanjan Das and Ryan McDonald (link).

Currently, mappings for 25 treebanks covering the following 22 languages are available:

Arabic, Basque, Bulgarian, Catalan, Chinese, Czech, Danish, Dutch, English, French, German, Greek, Hungarian, Italian, Japanese, Korean, Portuguese, Russian, Slovene, Spanish, Swedish, Turkish.

Powered by Google Project Hosting