InstallationInstructions
Installation guide for the Semantic Vectors package.
Summary
This page contains brief instructions for installing and running the Semantic Vectors package. These outline instructions assume that you are reasonably familiar with Java, Ant, CLASSPATHs, etc. If not, you might struggle a bit. Better documentation will hopefully follow soon.
Prerequisites (all Installations)
Binary Installation from Jar Distribution
This is the simplest approach for just getting SemanticVectors working.
You won't be able to alter the programs beyond the configuration that's possible with command line flags. If you want to do that, you need to build from source.
Compiling from Source - Package Installation
Prerequisites
As well as the Lucene and Java prerequisites, you will also need to install Apache Ant.
Installation
Compiling from Source - Most Recent Development Installation
If you might be making changes to the code, or want to try out new features that are checked in but not yet in the numbered releases, please consider checking out the most recent version from the svn repository. If you make changes that turn out to be useful, please write to us and tell us about them, and we'll probably urge you to submit them to the repository.
To check out the source, see the checkout instructions. You will need to use SVN to check out the code (for Eclipse users, try Subclipse).
To build the project, use ant as described above, and make sure that either the resulting build/classes directory or the build/*.jar file is in your CLASSPATH.
If this fails or is too daunting (which it may be: there's a lot going on, and getting everything to work together can become cumbersome), please don't hesitate to contact the project developers, and we'll try to help out or make a new numbered release with the development features you need.
To Build and Search a Model
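Before building a model, it can help to confirm that the CLASSPATH really does contain the SemanticVectors classes. The following is a minimal sketch and not part of the package: the helper class ClasspathCheck is hypothetical, and the default class name pitt.search.semanticvectors.BuildIndex is an assumption about the package layout, so pass a different fully qualified name as the first argument if it does not match your release.

```java
import java.io.File;

class ClasspathCheck {
    // Returns true if the named class can be located on the current classpath.
    static boolean isOnClasspath(String className) {
        try {
            // initialize=false: only locate the class, do not run its static initializers.
            Class.forName(className, false, ClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Assumed class name; override it with the first command line argument.
        String className = args.length > 0 ? args[0] : "pitt.search.semanticvectors.BuildIndex";

        // Print each classpath entry so a missing jar or build/classes directory is easy to spot.
        System.out.println("java.class.path entries:");
        for (String entry : System.getProperty("java.class.path").split(File.pathSeparator)) {
            System.out.println("  " + entry);
        }

        System.out.println(className + (isOnClasspath(className) ? " found" : " NOT found"));
    }
}
```

Compile it with javac ClasspathCheck.java and run it with java ClasspathCheck in the same shell where CLASSPATH is set. If the class is reported as not found, the build/classes directory or the jar file is probably missing from the listed entries.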
Training Cycles
Models can be built in several phases by passing the document vectors back to rebuild new term vectors. See TrainingCycles.
Bilingual Models
For instructions on building a bilingual model from a parallel corpus, see BilingualModels.
Positional Indexes
For instructions on building an index based on term positions, see PositionalIndexes.
Permutation Search
Indexes can now be built that encode directional relationships between words. See PermutationSearch.
Clustering and Visualization
Have some fun building clusters and pictures! Instructions are on the ClusteringAndVisualization page.
Vector Store Formats
The SemanticVectors package currently (as of version 1.6) supports two different vector store formats: a plain text format and an optimized format created by the Lucene I/O packages. For more information, including format translation utilities, see VectorStoreFormats.
Developer API Docs
See http://semanticvectors.googlecode.com/svn/javadoc/latest-stable/index.html
Some useful information may be found in the ReleaseLog.