READ-ONLY: This project has been archived. For more information see this post.

The analysis of Turkish texts is important for Turkish language and literature studies and for a wide range of other fields. Counting language structures in texts manually is a laborious task, so a computer application that processes and analyzes Turkish text documents or document sets (corpora) is beneficial. In this study, a text processing and analysis tool was developed that computes the frequencies of various language elements, such as phonemes, syllables, affixes, stems, words, and sentences, and analyzes Turkish texts using the resulting frequency distributions. The tool is written in Java and implemented according to the PCMEF architecture. It provides facilities for adding new languages, so it should not be difficult to extend it to some Turkic dialects, such as Turkmen and Azerbaijani.
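
To make the idea of a frequency distribution concrete, the following is a minimal sketch in Java of counting letters and words in a Turkish text. It is not the tool's actual code: the class name `FrequencySketch` and the methods `letterFrequencies` and `wordFrequencies` are hypothetical, and syllable, affix, and stem analysis are left out because they require language-specific rules. The sketch lowercases with the Turkish locale so that `I` maps to dotless i (U+0131) and dotted capital I (U+0130) maps to `i`, and it treats any run of letters as a word, which is a simplification (an apostrophe inside a word splits it).

```java
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only; not part of the project's source.
class FrequencySketch {

    // Turkish locale, so that case mapping follows Turkish rules
    // ('I' lowercases to dotless i, U+0131).
    private static final Locale TURKISH = Locale.forLanguageTag("tr");

    // Counts how often each letter occurs, ignoring case, digits,
    // whitespace, and punctuation.
    static Map<String, Integer> letterFrequencies(String text) {
        Map<String, Integer> counts = new TreeMap<>();
        text.toLowerCase(TURKISH)
            .codePoints()
            .filter(Character::isLetter)
            .forEach(cp -> counts.merge(
                new String(Character.toChars(cp)), 1, Integer::sum));
        return counts;
    }

    // Counts how often each word occurs, ignoring case.
    // Assumption: a word is any maximal run of Unicode letters.
    static Map<String, Integer> wordFrequencies(String text) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String word : text.toLowerCase(TURKISH).split("[^\\p{L}]+")) {
            if (!word.isEmpty()) {
                counts.merge(word, 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        String sample = "Kitap okudum. Kitap guzeldi.";
        System.out.println(wordFrequencies(sample));
        System.out.println(letterFrequencies(sample));
    }
}
```

Using the Turkish locale for lowercasing matters here because the default locale would fold `I` to `i` and merge two distinct Turkish letters into one count. Supporting another language would, under this sketch, mainly mean swapping the locale and the word-splitting rule.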
