ClearTK

ClearTK provides a framework for developing statistical natural language processing (NLP) components in Java and is built on top of Apache UIMA. It is developed by the Center for Computational Language and Education Research (CLEAR) at the University of Colorado at Boulder. Please see the conceptual overview for a broad introduction to ClearTK.

Features
Getting Started

For the latest released version, see the user setup. For the latest code from the repository, see the developer setup. See the tutorial for an example of how to build a simple statistical machine learning component with ClearTK, and the module listing for an overview of the modules contained in ClearTK.

License

Most of ClearTK is distributed under the BSD license. However, a few sub-projects are licensed under the GPL because they depend on GPL-licensed third-party libraries. To comply with the GPL, we have isolated the code that has GPL dependencies into sub-projects that are themselves licensed under the GPL, and these sub-projects are excluded from the main release on our downloads page. For additional details related to software licensing, please see the following resources:
Questions?

If you have questions about ClearTK, please post them to cleartk-users@googlegroups.com.

Cite ClearTK

If you use ClearTK to support academic research, please cite the following paper as appropriate:

@inproceedings{ogren_cleartk:uima_2008,
  title = {{ClearTK}: A {UIMA} toolkit for statistical natural language processing},
  booktitle = {Towards Enhanced Interoperability for Large {HLT} Systems: {UIMA} for {NLP} workshop at Language Resources and Evaluation Conference ({LREC})},
  author = {Philip V. Ogren and Philipp G. Wetzler and Steven Bethard},
  year = {2008}
}