| Projects on Google Code | Results 1 - 10 of 22 |
This software allows the automated collection of large numbers of full text articles.
You can use the GUI to perform a PubMed search and then download the PDFs from the search results, or you can work with the API and just give it a set of starting point URLs or PubMed IDs.
= Getting, Instal...
= This is a generic Full Text Search engine for Django projects =
Currently implements three backends: dummy, simple and pgsql.
* *dummy* - just uses ILIKE to do the search (no indexes, very slow)
* *simple* - implements the search using two helper tables for the indexes
* *pgsql* - us...
This behaviour allows Cake developers to make a model *fulltext searchable*, meaning that the user will be able to perform mysql fulltext search (which powers Wikimedia for example) on the models.
For example, a fulltext search allows end users to use boolean expressions (eg. '-' to exclude a ter...
http://www.djapian.com/media/images/text-djapian.png
*If you are using Djapian please tell us about your project in reply to this [http://groups.google.com/group/djapian-users/browse_thread/thread/a3005b17f199d2f9 post]*
Use this package to allow full-text search in your Django project.
Ver...
*<font color="red">Documentation in progress</font>*
=== Lib Dependencies ===
# [http://commons.apache.org/lang/ commons-lang-2.4.jar]
# [http://code.google.com/p/google-collections/ google-collect-1.0-rc2.jar]
# [http://commons.apache.org/logging/ commons-logging.jar]
=== Sample ===
...
Gisgraphy is a free and open source framework. Its goal is to provide tools to use free GIS Data on the Web. Actually it manage Geonames and OpenStreetMap (34 million entries). it provides an importer to inject the data into a strongly typed Postgres / Postgis database and use them via webservices :...
gisgraphy,
geonames,
fulltext,
geolocalisation,
webservices,
postgis,
toponyms,
lucene,
solr,
masclet,
gazeteers,
openstreetmap
http://marjory.wolff-hamburg.de/img/MarjoryLogo.png
Marjory is a webservice for indexing and searching for documents, utilizing a full-text search engine.
It is somewhat similar to [http://lucene.apache.org/solr/ Solr], but is written in PHP and the underlying architecture allows for using sea...
= Summary =
The boilerpipe library provides algorithms to detect and remove the surplus "clutter" (boilerplate, templates) around the main textual content of a web page.
The library already provides specific strategies for common tasks (for example: news article extraction) and may also be eas...
boilerplate,
removal,
template,
removal,
full,
text,
extraction,
fulltext,
cleaning,
java,
library,
HTML,
Web,
webpage,
content
Search tools,you can visit http://pinot.berlios.de/ for more details.
uffts (ultra-fast fulltext search) will first index all text files (and eventually doc, opendocument, etc.), and allow you to perform near-instant searches within all of them. A special tree designed around ascii text, which results in compression for large amounts of data, should make it possible ...