Project owners:
  benoit.favre, dgillick

This summarization system, built for the Text Analysis Conference (TAC) evaluation campaign, generates summaries by extracting the sentences that contain the most frequent word bigrams (called concepts) in the input documents. It uses Integer Linear Programming (ILP) to determine that set of sentences under a length constraint. With this system, we obtained very good scores for the update task at the TAC'08 evaluation and among the best scores at TAC'09.
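The selection objective can be illustrated with a small sketch. This is not the released code: it replaces the ILP solver with exhaustive search over sentence subsets, which reaches the same optimum on tiny inputs but does not scale. The function names (`bigrams`, `summarize`), the whitespace tokenization, and the choice to weight each concept by the number of input sentences containing it are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def bigrams(sentence):
    """Return the set of lowercase word bigrams (concepts) in a sentence."""
    words = sentence.lower().split()
    return set(zip(words, words[1:]))


def summarize(sentences, max_words):
    """Select sentences maximizing the total weight of covered concepts.

    Each concept counts once no matter how many selected sentences
    contain it, so redundant sentences add little to the score.
    Exhaustive search stands in for the ILP solver (assumption).
    """
    concepts = [bigrams(s) for s in sentences]
    lengths = [len(s.split()) for s in sentences]
    # Assumed weighting: number of input sentences containing the concept.
    weight = Counter(c for cs in concepts for c in cs)

    best_score, best_subset = 0, ()
    for size in range(1, len(sentences) + 1):
        for subset in combinations(range(len(sentences)), size):
            # Enforce the length constraint on the candidate summary.
            if sum(lengths[i] for i in subset) > max_words:
                continue
            covered = set().union(*(concepts[i] for i in subset))
            score = sum(weight[c] for c in covered)
            if score > best_score:
                best_score, best_subset = score, subset
    return [sentences[i] for i in best_subset]
```

Calling `summarize(sentences, max_words)` returns the chosen sentences in input order. An ILP formulation expresses the same objective with binary variables for concepts and sentences, which lets a solver handle realistic document sets.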

So far, we have only released the raw code, which contains many dependencies on internal ICSI tools, but we plan to add a cleaned-up, standalone version for public use.

DEPENDENCIES:

Note that the SRL system is only needed if you want to use the sentence compression from our TAC'09 system.
