My favorites | Sign in
Project Home Downloads Wiki Issues Source
Search
for
PossiblyUsefulCorpora  
some corpora for under-resourced languages that we're investigating
Updated Oct 26, 2011 by alex.rudnick

Introduction

L3XDG will, in the near future, use some machine learning techniques to try to offset the ambiguity that comes with using a rule-based system, exploring the most probable translations first.

In order to facilitate this, we'll make use of the small amount of bitext that's available for our languages of interest.

Quechua corpora

bitext

dictionaries

Quechua dictionaries are available...

relevant information

Dialect codes for the regional dialects of Quechua are listed here: http://en.wikipedia.org/wiki/ISO_639:q

Guaraní / Jopará corpora


Sign in to add a comment
Powered by Google Project Hosting