|
Project Information
|
One of five Student Work Projects on the Indian-summer school on Linked Data (http://lod2.eu/Article/ISSLOD2011). We implemented a simple entity disambiguation approach and made the results visible on a Web UI. Introduction- Given:
- Reference knowledge base(s) K
- Text fragment T
- Set E of Named Entities
- Task:
- Find URI for each of the Named Entities
URI lockup- Input: String I (label of an entity)
- Get all entities with rdfs:label l
- SELECT DISTINCT ?uri WHERE { ?uri rdfs:label “Paris”@en. }
- or each entity ?e, merge all labels
- SELECT ?label WHERE { ?e rdfs:label ?label. }
Disambiguation Approach- Remove stopwords in T
- Stem each word in T
- For each c in C
- Remove stopwords in c
- Stem each word in c
- Calculate Jaccard coef. between c and T
- Return ranked list of entities
Web UI
|