keyword | position | short | action |
Text | first, required | T=QUERY | string containing the words to be analyzed. |
IDentification | middle
optional |
ID=number
ID=string |
only document number
only documents with IDs containing string (Regex expressions possible)
|
Option | middle
optional |
opt = 128 | 128 for a RegEx QUERY
32 for an "almost" QUERY 1 for a case sensitive QUERY (only if InvIdx was built with this option) |
SorTSequence | middle
optional |
sts=23 | (23 means: first sort column2, then subsort column3). Default is 32. No-sort is 0. |
EXTRA | middle
optional |
extra=xtra |
|
InvertedIndex | last, required | ii=invidx | the inverted index string to be used for this QUERY. |
Find = "Question: to be to" in Shakespeare Complete Works | RESULT | ||||||||||
scalar = EDIT(T=find, II=InvIdx) | n = 156 occurences of least frequent word.
This is the upper limit of find-string occurences. | ||||||||||
array =
EDIT(T=find, II=InvIdx)
string = EDIT(T=find, II=InvIdx) |
fills first 145 rows or maximum allocated length
| ||||||||||
vector = EDIT(T=find, II=InvIdx) | word counts and locations in POSs
⇾ see details.
Element 1 = ID1-QUERY-word1 count position Element 2 = ID1-QUERY-word2 count position and so on for all IDs and QUERY words |
QUERY string | QUERY for xx, yy, zz, etc full word(s) |
xx yy zz | xx AND yy AND zz must be present |
xx | yy | zz | xx OR yy OR zz |
xx | yy zz | (xx OR yy) AND zz |
xx |yy 1|2|3 | (xx OR yy) AND (1 OR 2 OR 3) |
<10 xx yy zz | 10 is the maximum spread of RESULT |
xx <1L zz | maximum 0 characters Left (no back steps). Use "-", "b", "B", "l" instead of "L" if you like. |
xx <5R zz" | maximum 4 characters to next word (limit forward QUERY). Use "+", "f", "F", "r" instead of "R" if you like. |
<100 xx <4- <12+ yy|zz pp|qq | RESULT max 100 chars wide, not more than 3 chars back, not more than 11 chars foreward find all occurences of xx, then yy or zz, then pp or qq. |
QUERY string | sample RESULTs found in Shakespeare Complete Works |
m?other | mother,other |
b.a.e | blade,blame,blaze,brace,brake,brave |
gr.*t | great,greatest,greet,grievest,groat,grumblest,grunt |
b.a.e gr.*t | great part of blame
brave Master Shootie the great |
column 1
hit position in ALL (only 2 rows shown) |
column 2
spread of result |
column 3
hit nr in ID |
column 4
start of ID-string in InvIdx |
1124245 | 41 | 30 | 193660 |
156590 | 83 | 6 | 193660 etc. |
Only the first 2 rows of found (out of 145) are shown with SorTSequence=2: smallest spread first |
Ham. To be, or not to be- that is the question: |
COUNTESS. To be young again, if we could, I will be a fool in question, hoping to be the |
If the marks used ("[]" ) are unique in ORIGINAL, they can be further expanded to generate a variety of secondary information like sophisticated table of contents etc. |
Ham. [to] [be], or not [to] [be]- that is [The] [question]: |
COUNTESS. [to] [be] young again, if we could, I will [be] a fool in [question], hoping [to] [be] [The] |