Export to GitHub

cleartk - issue #311

implement BILOU chunking scheme

Posted on Jun 25, 2012 by Grumpy Rabbit

The BILOU chunking scheme (Beginning, Inside and Last tokens of multi-token chunks, Unit-length chunks and Outside) outperform BIO:

Lev Ratinov and Dan Roth. 2009. Design challenges and misconceptions in named entity recognition. In Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL '09). Association for Computational Linguistics, Stroudsburg, PA, USA, 147-155.

We should implement BILOU in the new org.cleartk.classifier.chunking package.

Comment #1

Posted on Jul 24, 2012 by Grumpy Rabbit

(No comment was entered for this change.)

Comment #2

Posted on Feb 17, 2013 by Swift Monkey

(No comment was entered for this change.)

Comment #3

Posted on Apr 1, 2013 by Swift Hippo

Since I wanted to experiment with it, here is a patch.

Attachments

bilou.patch 9KB

Comment #4

Posted on May 3, 2013 by Grumpy Rabbit

(No comment was entered for this change.)

Comment #5

Posted on Oct 24, 2013 by Grumpy Rabbit

(No comment was entered for this change.)

Comment #6

Posted on Oct 30, 2013 by Grumpy Rabbit

This issue was closed by revision 8a46b95bbe0f.

Status: Fixed

Labels:
Type-Enhancement Priority-Medium Component-ml Milestone-2.0

Code

Archive

cleartk - issue #311

Comment #1

Comment #2

Comment #3

Comment #4

Comment #5

Comment #6