
Turkish Deasciifier for Java

Turkish Deasciifier converts Turkish text written with ASCII-only characters into proper Turkish text that uses the Turkish-specific letters.

(With Turkish Deasciifier, you can correct text that was written without the Turkish characters ş, ı, ö, ç, ğ, ü by restoring the right Turkish letters.)

For instance,

Hadi bir masal uyduralim, icinde mutlu, doygun, telassiz durdugumuz.
will be converted to:
Hadi bir masal uyduralım, içinde mutlu, doygun, telaşsız durduğumuz.
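
To see why the conversion needs more than a lookup table, it helps to look at the easy opposite direction first. The following is a minimal sketch, not part of this library: the class name AsciifySketch and the method asciify are made up for illustration. It strips the Turkish-specific letters down to their ASCII look-alikes. Because both "ı" and "i" collapse to "i" (and likewise for the other pairs), the mapping cannot simply be inverted, so a deasciifier has to decide from the surrounding text which letter was meant.

```java
import java.util.Map;

// Illustrative sketch only; this class is hypothetical and not part of the library.
public class AsciifySketch {
    // Turkish-specific letters (written as Unicode escapes) and their ASCII look-alikes.
    private static final Map<Character, Character> TO_ASCII = Map.ofEntries(
        Map.entry('\u015F', 's'), // s with cedilla
        Map.entry('\u0131', 'i'), // dotless i
        Map.entry('\u00F6', 'o'), // o with diaeresis
        Map.entry('\u00E7', 'c'), // c with cedilla
        Map.entry('\u011F', 'g'), // g with breve
        Map.entry('\u00FC', 'u'), // u with diaeresis
        Map.entry('\u015E', 'S'), // capital S with cedilla
        Map.entry('\u0130', 'I'), // capital dotted I
        Map.entry('\u00D6', 'O'), // capital O with diaeresis
        Map.entry('\u00C7', 'C'), // capital C with cedilla
        Map.entry('\u011E', 'G'), // capital G with breve
        Map.entry('\u00DC', 'U')  // capital U with diaeresis
    );

    // Replaces every Turkish-specific letter with its ASCII counterpart;
    // all other characters are copied through unchanged.
    static String asciify(String turkish) {
        StringBuilder out = new StringBuilder(turkish.length());
        for (char ch : turkish.toCharArray()) {
            char mapped = TO_ASCII.getOrDefault(ch, ch);
            out.append(mapped);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // The proper Turkish sentence from the example above, with escapes for the special letters.
        String proper = "Hadi bir masal uydural\u0131m, i\u00E7inde mutlu, doygun, "
            + "tela\u015Fs\u0131z durdu\u011Fumuz.";
        System.out.println(asciify(proper));
    }
}
```

Running main prints the ASCII-only sentence from the example above. Going the other way is the hard part that the deasciifier handles.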

It is adapted from Emre Sevinç's Turkish Deasciifier for Python, which was influenced by Deniz Yüret's Emacs Turkish Mode implementation, which in turn was inspired by Gökhan Tür's Turkish Text Deasciifier.

The Zemberek library offers similar functionality; however, this library is more compact, almost 2000 times faster, and easier to use than Zemberek.

Example Usage

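// Create a converter and hand it the ASCII-only input.
Deasciifier d = new Deasciifier();
d.setAsciiString("Hadi bir masal uyduralim, icinde mutlu, doygun, telassiz durdugumuz.");
// Prints: Hadi bir masal uyduralım, içinde mutlu, doygun, telaşsız durduğumuz.
System.out.println(d.convertToTurkish());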

Source Code

You can check out the source code of this project anonymously with Subversion (SVN) using the following command:

svn checkout https://turkish-deasciifier.googlecode.com/svn/trunk/ turkish-deasciifier

The checked-out source is an Eclipse project, so you can import it directly into the Eclipse IDE or NetBeans.

Authors

Current maintainer: Ahmet Alp Balkan <ahmet at ahmetalpbalkan.com>

(Feel free to get in touch with any questions or contributions.)
