Texis and Metamorph use set logic for text queries. Set logic is easier to use and provides more abilities than boolean. The examples below make reference to single keywords, but keep in mind that each keyword can represent an entire list of things or any of the special pattern matchers.
Sets (or lists) of things are specified by placing the elements within parenthesis, separated by commas. Example: (bob,joe,sam,sue) . In the examples below, you could replace any of the keywords with a list like this.
The default behavior of the search is to locate an intersection (or 'AND') of every element within a query. This means that the query: "microsoft bob interface" is the equivalent to the boolean query: "microsoft AND bob AND interface" .
- - (without)
- The
- (minus) is the most commonly used logic symbol. It means the answer should EXCLUDE references to that item.
- + (mandatory)
- The
+ (plus) symbol in front of a search item means that the answer MUST INCLUDE that item. This is generally used in conjunction with the permutation operation.
- @N (permute)
-
The
@ followed by a number indicates how many intersections to locate of the terms in your query. This may be confusing at first, but it is very powerful.
|
Query |
Finds |
|
bob sam joe |
Bob with Sam and Joe |
bob sam -joe |
Bob with Sam without Joe |
bob sam joe @1 |
Bob with Sam, or Bob with Joe, or Joe with Sam |
A B C D @1 |
AB or AC or AD or BC or BD or CD |
+A B C D @1 |
ABC or ABD or ACD |
A B C -D @1 |
( AB or AC or BC ) without D |
|
Table 6.2: Search Logic Examples
The plus(+ ) and minus(- ) operators must be attached to the term to which they apply. There must be a space between the operator and any preceding term.
Correct |
Incorrect |
|
bob +sam -joe |
bob + sam - joe |
|
bob+sam-joe |
Go Back to Top of Page
Keywords Phrases and Wild-cards
To locate words, just type them in as you would in a word processor. Letter cases will be ignored.
The wild-card character * (asterisk) may be used to match just the prefix of a word or to ignore the middle of something.
If the item you wish to locate is more complicated than the simple * wild-card can accomplish, try using the regular expression matcher (https://docs.thunderstone.com/texis/site/pages/regexp.html).
To locate a number of adjacent words in a specific order, surround them with " (double quotation) characters. Putting a - (hyphen) between words will also force order and one word proximity.
|
Query |
Locates |
|
john |
john, John |
"john public" |
John Public |
web-browser |
Web browser, web-browser |
John*Public |
John Q. Public, John Public |
456*a*def |
1-456-789-ABCDEF |
activate |
activate, activation, activated, ... * |
|
Go Back to Top of Page
Invoking Thesaurus Expansion
The Search Appliance has a vocabulary of over 250,000 word and phrase associations. Each entry is generally classifiable by either its meaning or part of speech.
Depending on the administrator's Synonyms setting for this profile, synonyms may already be included for each term in your query. If not, synonyms may be included for individual terms within your query by preceding them with a ~ (tilde) character.
Go Back to Top of Page
Using the Special Pattern Matchers
These pattern matchers are used to locate hard-to-find items within text:
If improperly used these pattern matchers can slow queries. Therefore they require other keyword(s) in the query and are disabled entirely under Page proximity. For more details see the Vortex manual on Query Protection (https://docs.thunderstone.com/site/vortexman/link_qprot.html ).
Go Back to Top of Page
Using Word Forms
The Word forms options give you control over how many variations of your query terms will be sought in your search.
- Exact match:
- Only exact matches will be allowed. (the default)
- Plurals & possessives:
- Plural and possessive forms will be found. (s, es, 's)
- Any word forms:
- As many word forms as can be derived will be located.
- Custom:
-
Uses the
Custom Suffix List , Custom Suffix Default Removal , and Custom Suffix Min Length settings to create your own custom behavior.
|
Word |
president |
|
EXACT |
president |
PLURAL |
(above) + presidents president's |
ANY |
(above) + presidential presidency preside presides presiding presided |
|
Word |
tight |
|
EXACT |
tight |
PLURAL |
(above) + tights |
ANY |
(above) + tightly tightening tightened tighter tightest |
|
Word |
program |
|
EXACT |
program |
PLURAL |
(above) + programs program's |
ANY |
(above) + programming programmatic programmed programmer programmable |
|
Table 6.4: Word Form Examples
We call this morpheme processing, and it is generally smarter than a traditional "stemming" algorithm. It doesn't just rip the end off a word, it actually checks to see if it could be a valid form of the search term. ore information is available at https://docs.thunderstone.com/site/texisman/link_ling.html .
Notes: Thesaurus terms are also treated in the same manner. Words smaller than 4-5 characters will not be morpheme processed.
Go Back to Top of Page
Controlling Proximity
These options give you control over the region in which a match must be found.
- line:
- match terms must be located within the same line.
- sentence:
- all terms within the same sentence.
- paragraph:
- match terms must be located within the same paragraph.
- page:
-
(default) all terms within the same document.
In all cases the best possible matches for your query are located and ordered by decreasing quality. A bar graph is produced to indicate the quality of each answer.
Go Back to Top of Page
|