84:80df3e7605a4
|
2011-01-21 |
Paul Boddie |
|
For large numbers of positions, sorting afterwards is likely to be much quicker. |
|
|
iixr/phrases.py
|
|
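The changeset message above contrasts two ways of producing ordered positions. The code itself is not shown here, so the following is only a minimal sketch of the general trade-off, assuming the alternative was to keep the collection ordered on every insertion. The function names are hypothetical and are not taken from iixr/phrases.py.

```python
import bisect

def collect_sorted_incrementally(positions):
    """Keep the result ordered after every insertion.

    Each insort call may shift many list elements, so the total cost
    can grow quadratically with the number of positions.
    """
    result = []
    for position in positions:
        bisect.insort(result, position)
    return result

def collect_then_sort(positions):
    """Append everything first, then sort once at the end (O(n log n))."""
    result = list(positions)
    result.sort()
    return result
```

Both functions return the same ordered list; only the cost of getting there differs as the number of positions grows.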
83:3ddb93334c95
|
2011-01-11 |
Paul Boddie |
|
Permit fields for documents to be spread across partitions, potentially because
documents have been added more than once to an index. |
|
|
iixr/merging.py
|
|
82:9867931a9269
|
2010-12-17 |
Paul Boddie |
|
Avoid identical adjacent tokens being matched to the same document token. |
|
|
iixr/phrases.py
|
|
81:ea2944f51430
|
2010-11-26 |
Paul Boddie |
|
Introduced support for higher-level sequential access to indexes. |
|
|
iixr/index.py iixr/terms.py
|
|
80:e0bd00412dbc
|
2010-11-26 |
Paul Boddie |
|
Introduced parameterisation of phrase discovery, allowing phrase filters other
than the one provided to be used. |
|
|
iixr/phrases.py
|
|
79:2f94fb23bcff
|
2010-11-26 |
Paul Boddie |
|
Updated the copyright and licensing information. |
|
|
iixr/__init__.py
|
|
78:489129c7f225
|
2010-11-26 |
Paul Boddie |
|
Changed the from_document method to remember the current document and positions,
although the positions iterator will not be reset upon repeated invocations
involving the same document number. |
|
|
iixr/positions.py
|
|
77:7e79dd580a62
|
2010-11-23 |
Paul Boddie |
|
Added support for phrase searching where document positions are specified using
sequences of values, with the first value in each sequence being the token
index/position.
Added more tests of document numbers and position values being specified using
sequences. |
|
|
iixr/phrases.py test.py
|
|
76:f1cbbf5ef885
|
2010-11-22 |
Paul Boddie |
|
Made partition discovery more widely available, adding code to find the next
partition number to use, thus avoiding overwriting index data when opening a
writer on an existing index.
Made sure that term and field dictionaries are always written out: this might
not occur if the underlying writers have been obtained from an index writer and
then used to write data directly. |
|
|
iixr/filesystem.py iixr/index.py
|
|
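The idea of finding the next partition number, so that a new writer does not overwrite existing index data, might be sketched as follows. This is an illustration only: the file naming scheme (names such as "terms-0") and both function names are assumptions, not the actual conventions of iixr/filesystem.py.

```python
import re

# Hypothetical naming scheme: a lowercase label, a hyphen, then the
# partition number, for example "terms-0" or "fields-3".
PARTITION_PATTERN = re.compile(r"^[a-z_]+-(\d+)$")

def existing_partitions(filenames):
    """Return the set of partition numbers found in the given filenames."""
    partitions = set()
    for name in filenames:
        match = PARTITION_PATTERN.match(name)
        if match:
            partitions.add(int(match.group(1)))
    return partitions

def next_partition(filenames):
    """Return a partition number that is not yet used by any file."""
    partitions = existing_partitions(filenames)
    return max(partitions) + 1 if partitions else 0
```

A writer opened on an existing index could pass the directory listing (for instance from `os.listdir`) to `next_partition` and write its data under the returned number.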
75:8d35240236b2
|
2010-11-22 |
Paul Boddie |
|
Added integrity checks for appropriate term and position ordering. |
|
|
iixr/terms.py
|
|
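An ordering check of the kind described in the changeset message above could look like the sketch below. It assumes that terms or positions arrive as a stream and must never go backwards. The function name, its parameters, and the choice of exception are hypothetical, and the real checks in iixr/terms.py may differ.

```python
def check_ordered(values, description="value", strict=False):
    """Yield values unchanged, raising ValueError if they are out of order.

    With strict=True, equal neighbouring values are also rejected.
    """
    previous = None
    first = True
    for value in values:
        if not first:
            out_of_order = value <= previous if strict else value < previous
            if out_of_order:
                raise ValueError(
                    "%s %r does not follow %r in order"
                    % (description, value, previous)
                )
        yield value
        previous = value
        first = False
```

Wrapping a writer's input, as in `for term in check_ordered(terms, "term", strict=True): ...`, would surface an ordering problem at the point where it is introduced rather than later, when the data is read back.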