GitHub - fujimizu/stupa: an Associative Search Engine

Overview

Stupa is an associative search engine: given a document (or a set of feature strings), it finds related documents quickly and with high precision. Because document data and inverted indexes are kept in memory, Stupa reflects document updates in search results in real time.
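To make the idea concrete, here is a minimal Python sketch of an in-memory inverted index whose updates are visible to the very next search. It is an illustration only, not Stupa's implementation: the class `TinyAssociativeIndex`, its method names, and the scoring rule (count of shared features) are all assumptions made for this example.

```python
from collections import defaultdict


class TinyAssociativeIndex:
    """Toy in-memory index (hypothetical, not Stupa's actual data structure)."""

    def __init__(self):
        self.docs = {}                    # document id -> set of features
        self.inverted = defaultdict(set)  # feature -> set of document ids

    def add(self, doc_id, features):
        """Add or replace a document; the next search sees the change."""
        self.remove(doc_id)
        self.docs[doc_id] = set(features)
        for feature in self.docs[doc_id]:
            self.inverted[feature].add(doc_id)

    def remove(self, doc_id):
        """Drop a document and its entries in the inverted index."""
        for feature in self.docs.pop(doc_id, set()):
            self.inverted[feature].discard(doc_id)

    def related(self, doc_id, limit=10):
        """Rank other documents by how many features they share with doc_id.

        Assumed scoring for illustration; Stupa's real ranking may differ.
        """
        scores = defaultdict(int)
        for feature in self.docs.get(doc_id, set()):
            for other in self.inverted[feature]:
                if other != doc_id:
                    scores[other] += 1
        # Highest score first; ties broken by document id for stable output.
        ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
        return ranked[:limit]


index = TinyAssociativeIndex()
index.add("doc1", ["search", "index", "memory"])
index.add("doc2", ["search", "index", "thrift"])
index.add("doc3", ["memory", "cache"])
before = index.related("doc1")

# Updating doc3 changes the ranking immediately, with no rebuild step.
index.add("doc3", ["search", "index", "memory"])
after = index.related("doc1")
print(before)
print(after)
```

Keeping both the forward map and the inverted map in memory is what lets an update take effect without a separate indexing pass, at the cost of holding the whole collection in RAM.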

Stupa can also be run as a server by using Thrift.

Install

% ./configure
% make
% make check  (googletest required)
% sudo make install

Usage

Search related documents interactively

% stpctl search [-b][-f] file [invsize]

Convert an input tsv file to a binary format file

% stpctl save [-b] infile outfile [invsize]

Options

 -b        read a binary format file
 -f        search by feature strings
           (default: search by document identifier strings)
 invsize   maximum size of inverted indexes (default: 100)
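
For scripting, the two synopses above can be assembled programmatically. The following Python sketch only builds argument lists in the order shown in the usage lines; the helper names `search_command` and `save_command` and the file names are hypothetical, and nothing here is part of Stupa itself.

```python
import shlex


def search_command(path, binary=False, by_feature=False, invsize=None):
    """Build argv for: stpctl search [-b][-f] file [invsize]"""
    argv = ["stpctl", "search"]
    if binary:
        argv.append("-b")          # read a binary format file
    if by_feature:
        argv.append("-f")          # search by feature strings
    argv.append(path)
    if invsize is not None:
        argv.append(str(invsize))  # optional trailing positional argument
    return argv


def save_command(infile, outfile, binary=False, invsize=None):
    """Build argv for: stpctl save [-b] infile outfile [invsize]"""
    argv = ["stpctl", "save"]
    if binary:
        argv.append("-b")
    argv.extend([infile, outfile])
    if invsize is not None:
        argv.append(str(invsize))
    return argv


# Hypothetical file names, used only for illustration.
print(shlex.join(save_command("documents.tsv", "documents.bin")))
print(shlex.join(search_command("documents.bin", binary=True, invsize=50)))
```

The resulting lists can be passed to `subprocess.run` once `stpctl` is installed; note that `search` is interactive, so it expects queries on standard input.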

Format of Input Data

List of input documents

document_id1 \t key1-1 \t key1-2 \t key1-3 \t ...\n
document_id2 \t key2-1 \t key2-2 \t key2-3 \t ...\n
...

document_id : string
key : string
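
As an illustration of the format above, this Python sketch writes a small document collection to a TSV file. It assumes that the spaces around `\t` in the description are only for readability (fields are separated by single tab characters) and that UTF-8 is an acceptable encoding; both are assumptions, and the function names and sample data are hypothetical.

```python
import os
import tempfile


def format_document(doc_id, keys):
    """Return one input line: document id, then its keys, tab-separated."""
    fields = [doc_id, *keys]
    for field in fields:
        # Tabs and newlines are the delimiters, so they cannot appear in a field.
        if "\t" in field or "\n" in field:
            raise ValueError(f"field contains a tab or newline: {field!r}")
    return "\t".join(fields) + "\n"


def write_documents(path, documents):
    """Write a mapping of document id -> list of keys, one document per line."""
    with open(path, "w", encoding="utf-8", newline="") as out:
        for doc_id, keys in documents.items():
            out.write(format_document(doc_id, keys))


# Sample data, invented for this example.
documents = {
    "doc1": ["search", "index", "memory"],
    "doc2": ["search", "thrift"],
}
path = os.path.join(tempfile.mkdtemp(), "documents.tsv")
write_documents(path, documents)
with open(path, encoding="utf-8", newline="") as handle:
    written = handle.read()
print(written, end="")
```

A file produced this way is in the shape expected as the `file` argument of `stpctl search` or the `infile` argument of `stpctl save`.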

Requirement

C++ compiler with STL (Standard Template Library)

License

GPL2 (GNU General Public License Version 2)

Author

Mizuki Fujisawa <fujisawa@bayon.cc>
