Description
This project is a collection of MySQL User-Defined Functions (UDF) for Thai language processing. This project is usable now but still under heavy development.
This library currently has only one function:
- THAI_SPACE_SPLIT() If you feed it with a unicode string, it separates the string into words with spaces. The result string can be used to make full-text indexes with a full-text search system such as Sphinx and Zebra (for Koha). This function is ready to use. It employs ICU. HTML codes are also stripped from the output strings.
Installation
Makefile.local is provided. Read chapter 21 in MySQL Reference Manual on how to install a UDF.
Detailed instructions will be written as time permitted.