Open Search Server Documentation
PRELIMINARY DRAFT
Emmanuel Keller, Author
Emmanuel Gosse, Author
Sebastien Andrivet, Translator and proofreader
InfoPro Digital
12-14, rue Mederic Paris France
http://www.open-search-server.com
© InfoPro Digital 2009
2 | Open Search Server | Introduction
Quick start
Use your favorite tool to uncompress the archive and extract the open-search-server folder.
• Windows / Mac: double-clicking the archive will usually decompress it and extract the folder.
• ZIP archive on a Unix system: you can use the unzip command-line utility, for example: unzip open-search-server-XXX.zip
• TAR.GZ archive on a Unix system: you can use the tar command-line utility, for example: tar -zxvf open-search-server-XXX.tar.gz
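The same extraction can also be scripted. The following is a minimal sketch in Python, assuming the archive has already been downloaded. The helper name extract_archive and the destination folder are illustrative and not part of Open Search Server. It relies on shutil.unpack_archive from the Python standard library, which infers the format (ZIP or TAR.GZ) from the file extension.

```python
import shutil
from pathlib import Path


def extract_archive(archive_path, destination="."):
    """Extract a .zip or .tar.gz archive into `destination`.

    The archive format is inferred from the file extension.
    Returns the destination folder as a Path.
    """
    destination = Path(destination)
    # Create the target folder if it does not exist yet.
    destination.mkdir(parents=True, exist_ok=True)
    shutil.unpack_archive(str(archive_path), str(destination))
    return destination
```

For example, extract_archive("open-search-server-XXX.zip", "/opt") would place the open-search-server folder under /opt, where XXX stands for the version in the file name you downloaded and /opt is only an example location.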
XML result
Note: The response is in XML format, encoded in UTF-8.
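Because the result is UTF-8 encoded XML, a client can hand the raw bytes directly to an XML parser instead of decoding them by hand. The sketch below uses Python's standard xml.etree.ElementTree module. The sample payload and its element names (response, doc, field) are hypothetical placeholders chosen for illustration. They are not the actual Open Search Server response schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical payload: element and attribute names are illustrative only.
SAMPLE_RESPONSE = (
    '<?xml version="1.0" encoding="UTF-8"?>'
    '<response><doc><field name="title">Café</field></doc></response>'
).encode("utf-8")


def field_values(raw_bytes, field_name):
    """Return the text of every <field> element whose name attribute matches."""
    # Parsing bytes lets the parser apply the encoding declared in the XML header.
    root = ET.fromstring(raw_bytes)
    return [el.text for el in root.iter("field") if el.get("name") == field_name]


print(field_values(SAMPLE_RESPONSE, "title"))
```

Passing bytes rather than an already decoded string keeps accented characters intact, whatever the default encoding of the client machine.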
War deployment guide
This first version of the installation guide shows that it takes only a few minutes to get an OSS server running and ready to use.
1. Install Apache Tomcat or another Java server: This installation guide assumes that it is already installed. Please refer to the standard installation procedure on the corresponding website: http://tomcat.apache.org/index.html. Use version 5 or newer.
2. Deploy the OSS WAR file: Put oss.war in the 'tomcat/webapps' directory of Tomcat. You can rename it as you wish (but keep the 'war' extension!). Example: oss.war
3. Configure the WAR in Tomcat: In the 'tomcat/conf/Catalina/localhost/' directory, create an XML file with the same name as the one you gave your WAR file in step 2 (but with the 'xml' extension!).
Example: oss.xml
4. Configure the physical index: Choose any folder to hold it (there are no special requirements; this guide uses '/mnt/all_oss/'), and inside it create the directory for your physical index, for instance oss1 (to match the previous steps).
a) Put the file config.xml in this directory (do not change its name!). Note that oss.xml refers to it.
b) Create a single folder named 'index' in oss1. At server start, empty index files will automatically be added inside it.
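The naming rule of step 3 and the directory layout of step 4 can be sketched in Python as follows. This is only an illustration under stated assumptions: the helper names context_file_name and prepare_index_dir are hypothetical, and the sketch deliberately does not write the Tomcat context file itself, since its content is not described here.

```python
import shutil
from pathlib import Path


def context_file_name(war_name):
    """Step 3: same name as the WAR file, with an 'xml' extension."""
    return Path(war_name).with_suffix(".xml").name


def prepare_index_dir(base_dir, instance_name, config_source=None):
    """Step 4: create <base_dir>/<instance_name>/index.

    If `config_source` is given, it is copied into the instance
    directory under the fixed name config.xml (step 4a).
    Existing directories are left untouched.
    """
    instance_dir = Path(base_dir) / instance_name
    # Step 4b: a single empty 'index' folder; the server fills it at start.
    (instance_dir / "index").mkdir(parents=True, exist_ok=True)
    if config_source is not None:
        shutil.copyfile(config_source, instance_dir / "config.xml")
    return instance_dir
```

With the values used in this guide, context_file_name("oss.war") gives "oss.xml", and prepare_index_dir("/mnt/all_oss", "oss1", "config.xml") creates /mnt/all_oss/oss1/index and copies the configuration file next to it.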
Example of a basic config.xml:
<configuration>
  <indices>
    <index name="index" searchCache="100" filterCache="100" fieldCache="500" />
  </indices>
  <schema>
    <analyzers>
      <analyzer name="StandardAnalyzer" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
        <filter class="ISOLatin1AccentFilter" />
      </analyzer>
      <analyzer name="TextAnalyzer" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
      </analyzer>
      <analyzer name="TextAnalyzer" lang="en" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
        <filter class="SnowballEnglishFilter" />
      </analyzer>
      <analyzer name="TextAnalyzer" lang="fr" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
        <filter class="ISOLatin1AccentFilter" />
        <filter class="FrenchStemFilter" />
      </analyzer>
      <analyzer name="TextAnalyzer" lang="de" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
        <filter class="ISOLatin1AccentFilter" />
        <filter class="SnowballGermanFilter" />
      </analyzer>
      <analyzer name="TextAnalyzer" lang="nl" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
        <filter class="ISOLatin1AccentFilter" />
        <filter class="DutchStemFilter" />
      </analyzer>
      <analyzer name="TextAnalyzer" lang="es" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
        <filter class="ISOLatin1AccentFilter" />
        <filter class="SnowballSpanishFilter" />
      </analyzer>
      <analyzer name="TextAnalyzer" lang="it" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
        <filter class="ISOLatin1AccentFilter" />
        <filter class="SnowballItalianFilter" />
      </analyzer>
      <analyzer name="TextAnalyzer" lang="pt" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
        <filter class="ISOLatin1AccentFilter" />
        <filter class="SnowballPortugueseFilter" />
      </analyzer>
      <analyzer name="TextAnalyzer" lang="no" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
        <filter class="ISOLatin1AccentFilter" />
        <filter class="SnowballNorwegianFilter" />
      </analyzer>
      <analyzer name="TextAnalyzer" lang="se" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
        <filter class="ISOLatin1AccentFilter" />
        <filter class="SnowballSwedishFilter" />
      </analyzer>
      <analyzer name="TextAnalyzer" lang="fi" tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
        <filter class="ISOLatin1AccentFilter" />
        <filter class="SnowballFinnishFilter" />
      </analyzer>
    </analyzers>
    <fields default="content" unique="url">
      <field name="lang" indexed="yes" stored="yes" />
      <field name="title" analyzer="TextAnalyzer" indexed="yes"
             stored="compress" termVector="positions_offsets" />
      <field name="titleExact" analyzer="StandardAnalyzer" indexed="yes"
             stored="compress" termVector="positions_offsets" />
      <field name="content" analyzer="TextAnalyzer" indexed="yes"
             stored="compress" termVector="positions_offsets" />
      <field name="contentExact" analyzer="StandardAnalyzer" indexed="yes"
             stored="compress" termVector="positions_offsets" />
      <field name="contentBaseType" indexed="yes" stored="yes" />
      <field name="url" indexed="yes" stored="yes" />
      <field name="urlSplit" indexed="yes" stored="no" analyzer="TextAnalyzer"
             termVector="positions_offsets" />
      <field name="urlExact" indexed="yes" stored="no" analyzer="StandardAnalyzer"
             termVector="positions_offsets" />
      <field name="metaDescription" indexed="no" stored="compress" />
      <field name="metaKeywords" indexed="no" stored="compress" />
      <field name="host" indexed="yes" stored="yes" />
    </fields>
  </schema>
  <parsers>
    <parser class="com.jaeksoft.searchlib.parser.HtmlParser" sizeLimit="8388608">
      <contentType>text/html</contentType>
    </parser>
    <parser class="com.jaeksoft.searchlib.parser.PdfParser" sizeLimit="8388608">
      <contentType>application/pdf</contentType>
    </parser>
    <parser class="com.jaeksoft.searchlib.parser.DocParser" sizeLimit="8388608">
      <contentType>application/msword</contentType>
    </parser>
    <parser class="com.jaeksoft.searchlib.parser.PptParser" sizeLimit="8388608">
      <contentType>application/vnd.ms-powerpoint</contentType>
    </parser>
  </parsers>
</configuration>
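Before starting the server, it can help to confirm that config.xml is well-formed and that the default and unique attributes of the fields element name fields that are actually declared. The following is a minimal sketch, assuming Python is available on the machine. The function check_config and the trimmed SAMPLE document are illustrative and not part of Open Search Server, and whether the server itself enforces these constraints is not stated in this guide.

```python
import xml.etree.ElementTree as ET

# Trimmed, illustrative version of the configuration shown above.
SAMPLE = """
<configuration>
  <schema>
    <analyzers>
      <analyzer name="TextAnalyzer" lang="en"
                tokenizer="LetterOrDigitTokenizerFactory">
        <filter class="LowerCaseFilter" />
      </analyzer>
    </analyzers>
    <fields default="content" unique="url">
      <field name="content" analyzer="TextAnalyzer" indexed="yes" stored="compress" />
      <field name="url" indexed="yes" stored="yes" />
    </fields>
  </schema>
</configuration>
"""


def check_config(xml_text):
    """Parse a config.xml document and return a short summary.

    Raises ET.ParseError if the document is not well-formed, and
    ValueError if the default or unique field is not declared.
    """
    root = ET.fromstring(xml_text)
    fields_el = root.find("schema/fields")
    if fields_el is None:
        raise ValueError("missing <schema>/<fields> element")
    declared = {f.get("name") for f in fields_el.findall("field")}
    for role in ("default", "unique"):
        name = fields_el.get(role)
        if name is not None and name not in declared:
            raise ValueError(f"{role} field {name!r} is not declared")
    # Collect the language codes of language-specific analyzers.
    languages = sorted(
        a.get("lang")
        for a in root.findall("schema/analyzers/analyzer")
        if a.get("lang")
    )
    return {"fields": sorted(declared), "languages": languages}


print(check_config(SAMPLE))
```

To check a real file, read it first, for example check_config(open("/mnt/all_oss/oss1/config.xml", encoding="utf-8").read()), adjusting the path to wherever you created the index directory.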