My favorites | Sign in
Logo
                
Details: Show all Hide all

Older

  • Oct 30, 2009
    r421 (Removed PPRINT forms that were erroneously part of the last ...) committed by leslie.polzer   -   Removed PPRINT forms that were erroneously part of the last commit.
    Removed PPRINT forms that were erroneously part of the last commit.
  • Oct 30, 2009
    r420 (Fix read-term-vector to correctly resize the buffer. Also ma...) committed by leslie.polzer   -   Fix read-term-vector to correctly resize the buffer. Also make the conscious discussion to require read-chars to be called with a buffer of sufficient size. Discussion: http://groups.google.com/group/montezuma-dev/browse_thread/thread/914a1af34d36497a
    Fix read-term-vector to correctly resize the buffer. Also make the conscious discussion to require read-chars to be called with a buffer of sufficient size. Discussion: http://groups.google.com/group/montezuma-dev/browse_thread/thread/914a1af34d36497a
  • Oct 21, 2009
    issue 5 (:store-term-vector isn't char-safe) changed by leslie.polzer   -   This issue was closed by revision r419.
    Status: Fixed
    This issue was closed by revision r419.
    Status: Fixed
  • Oct 21, 2009
    r419 (Yoni Rabkin: proper string handling for :store-term-vector c...) committed by leslie.polzer   -   Yoni Rabkin: proper string handling for :store-term-vector code paths in term vectors I/O. Fixes issue #5 .
    Yoni Rabkin: proper string handling for :store-term-vector code paths in term vectors I/O. Fixes issue #5 .
  • Sep 05, 2009
    issue 5 (:store-term-vector isn't char-safe) reported by yonirabkin   -   What steps will reproduce the problem? 1. Add :store-term-vector :with-positions-offsets to `make-field' 2. index a corpus Depending on your input, you'll get: "Illegal :UTF-8 character starting at byte position ..."
    What steps will reproduce the problem? 1. Add :store-term-vector :with-positions-offsets to `make-field' 2. index a corpus Depending on your input, you'll get: "Illegal :UTF-8 character starting at byte position ..."
  • Aug 18, 2009
    r418 (Reuters corpus: index not only .txt but also .sgm files. Pat...) committed by leslie.polzer   -   Reuters corpus: index not only .txt but also .sgm files. Patch by Luís Oliveira.
    Reuters corpus: index not only .txt but also .sgm files. Patch by Luís Oliveira.
  • Aug 18, 2009
    r417 (Fixed Reuters indexer's BUILD-INDEX function. ) committed by leslie.polzer   -   Fixed Reuters indexer's BUILD-INDEX function.
    Fixed Reuters indexer's BUILD-INDEX function.
  • Aug 18, 2009
    r416 (Added DOCUMENT-COUNT specialized on INDEX. ) committed by leslie.polzer   -   Added DOCUMENT-COUNT specialized on INDEX.
    Added DOCUMENT-COUNT specialized on INDEX.
  • Aug 18, 2009
    r415 (Updated BUGS.txt. ) committed by leslie.polzer   -   Updated BUGS.txt.
    Updated BUGS.txt.
  • Jul 23, 2009
    issue 3 (standard tokenizer hangs on some input) Status changed by leslie.polzer   -   Fixed by Plato Wu and merged as r414.
    Status: Verified
    Fixed by Plato Wu and merged as r414.
    Status: Verified
  • Jul 23, 2009
    r414 (Fix for pathological regexp tokenizer input (issue #3). Patc...) committed by leslie.polzer   -   Fix for pathological regexp tokenizer input ( issue #3 ). Patch by Plato Wu <netawater@gmail.com>.
    Fix for pathological regexp tokenizer input ( issue #3 ). Patch by Plato Wu <netawater@gmail.com>.
  • Jul 11, 2009
    issue 1 (broken :must-not-occur or phrase query) Status changed by leslie.polzer   -  
    Status: Verified
    Status: Verified
  • Jul 11, 2009
    r413 (Also compute weights for prohibited clauses (patch by Plato ...) committed by leslie.polzer   -   Also compute weights for prohibited clauses (patch by Plato Wu <netawater@gmail.com>).
    Also compute weights for prohibited clauses (patch by Plato Wu <netawater@gmail.com>).
  • Jul 11, 2009
    r412 (Fixed duplicate parts in Boolean Subscorer test file. ) committed by leslie.polzer   -   Fixed duplicate parts in Boolean Subscorer test file.
    Fixed duplicate parts in Boolean Subscorer test file.
  • Jul 08, 2009
    issue 1 (broken :must-not-occur or phrase query) Status changed by leslie.polzer   -  
    Status: Started
    Status: Started
  • Jul 05, 2009
    issue 3 (standard tokenizer hangs on some input) Status changed by leslie.polzer   -  
    Status: Started
    Status: Started
  • Jul 05, 2009
    issue 1 (broken :must-not-occur or phrase query) Status changed by leslie.polzer   -  
    Status: Verified
    Status: Verified
  • Jul 05, 2009
    r411 (Correctly handle disjunction scoring; fixes #1. Patch by Pla...) committed by leslie.polzer   -   Correctly handle disjunction scoring; fixes #1. Patch by Plato Wu <netawater@gmail.com>
    Correctly handle disjunction scoring; fixes #1. Patch by Plato Wu <netawater@gmail.com>
  • May 26, 2009
    issue 2 (Index broken after 2000 documents?) changed by leslie.polzer   -   Fixed in 0.1.3; a comprehensive test case was added by Yoni Rabkin.
    Status: Verified
    Owner: yonirabkin
    Fixed in 0.1.3; a comprehensive test case was added by Yoni Rabkin.
    Status: Verified
    Owner: yonirabkin
  • May 26, 2009
    montezuma-0.1.3.tar.gz (Montezuma 0.1.3) file uploaded by leslie.polzer   -  
    Labels: Featured OpSys-All Type-Source
    Labels: Featured OpSys-All Type-Source
  • May 26, 2009
    r410 (Tagged 0.1.3. ) committed by leslie.polzer   -   Tagged 0.1.3.
    Tagged 0.1.3.
  • May 26, 2009
    r409 (Added regression unit test directory. Added Yoni Rabkin's tc...) committed by leslie.polzer   -   Added regression unit test directory. Added Yoni Rabkin's tc-m2k test. Bumped version to 0.1.3.
    Added regression unit test directory. Added Yoni Rabkin's tc-m2k test. Bumped version to 0.1.3.
  • Apr 08, 2009
    montezuma-0.1.3b.tar.gz (Montezuma 0.1.3b (with UTF8 support) -- second release candi...) file uploaded by leslie.polzer   -  
    Labels: Featured Type-Source
    Labels: Featured Type-Source
  • Apr 08, 2009
    r408 (tagged release-candidate-0.1.3b) committed by leslie.polzer   -   tagged release-candidate-0.1.3b
    tagged release-candidate-0.1.3b
  • Apr 08, 2009
    r407 (added term merging test case) committed by leslie.polzer   -   added term merging test case
    added term merging test case
  • Apr 08, 2009
    r406 (added simple test cases for STRING-TO-BYTES) committed by leslie.polzer   -   added simple test cases for STRING-TO-BYTES
    added simple test cases for STRING-TO-BYTES
  • Apr 08, 2009
    r405 (fixed buglet in src/util/strings.lisp) committed by leslie.polzer   -   fixed buglet in src/util/strings.lisp
    fixed buglet in src/util/strings.lisp
  • Apr 07, 2009
    r404 (Fixed START/END bug in STRING-TO-BYTES. See http://groups.g...) committed by leslie.polzer   -   Fixed START/END bug in STRING-TO-BYTES. See http://groups.google.com/group/montezuma-dev/browse_thread/thread/17fd29d862d22a61
  • Feb 24, 2009
    r403 (added montezuma-indexfiles; restructured contrib) committed by leslie.polzer   -   added montezuma-indexfiles; restructured contrib
    added montezuma-indexfiles; restructured contrib
  • Feb 20, 2009
    r402 (added contrib/ieslick) committed by leslie.polzer   -   added contrib/ieslick
    added contrib/ieslick
  • Feb 19, 2009
    r401 (Tweaked make-release.sh) committed by leslie.polzer   -   Tweaked make-release.sh
    Tweaked make-release.sh
  • Feb 19, 2009
    montezuma-0.1.3a.tar.gz (Montezuma 0.1.3a (with UTF8 support) release candidate) file uploaded by leslie.polzer   -  
    Labels: Featured Type-Source
    Labels: Featured Type-Source
  • Feb 19, 2009
    r400 (Corrected tag version: 1.2.0a -> 0.1.3a) committed by leslie.polzer   -   Corrected tag version: 1.2.0a -> 0.1.3a
    Corrected tag version: 1.2.0a -> 0.1.3a
  • Feb 19, 2009
    r399 (Tagged 1.2.0a) committed by leslie.polzer   -   Tagged 1.2.0a
    Tagged 1.2.0a
  • Feb 18, 2009
    r398 (Added UTF8 support (beta)) committed by leslie.polzer   -   Added UTF8 support (beta)
    Added UTF8 support (beta)
  • Feb 18, 2009
    issue 4 (*word-file-path* isn't set correctly) Status changed by leslie.polzer   -   Fixed in r397.
    Status: Fixed
    Fixed in r397.
    Status: Fixed
  • Feb 18, 2009
    r397 (Make word file path ASD-relative in tc-stop-filter test file) committed by leslie.polzer   -   Make word file path ASD-relative in tc-stop-filter test file
    Make word file path ASD-relative in tc-stop-filter test file
  • Jan 12, 2009
    issue 4 (*word-file-path* isn't set correctly) reported by leslie.polzer   -   Using ASDF-BINARY-LOCATIONS will mess the current logic up by setting it to the FASL directory. Attached patch fixes it at the cost of hardcoding a bit more path.
    Using ASDF-BINARY-LOCATIONS will mess the current logic up by setting it to the FASL directory. Attached patch fixes it at the cost of hardcoding a bit more path.
  • Oct 08, 2008
    issue 2 (Index broken after 2000 documents?) commented on by yonirabkin   -   You can also change after how many documents this will happen by changing *index-writer-default-merge-factor* and *index-writer-default-min-merge-docs*. For instance, I changed them both to 100 and got up to 6000 documents without the crash.
    You can also change after how many documents this will happen by changing *index-writer-default-merge-factor* and *index-writer-default-min-merge-docs*. For instance, I changed them both to 100 and got up to 6000 documents without the crash.
 
Hosted by Google Code