Upgrading Solr 1.4.1 to 3.4.0

Published by dnovikov on Wed, 09/28/2011 - 10:10

You'll surely find a lot of deprecation warnings in your output when you start Solr. But the main issue I faced after upgrading was the change in schema.xml properties. For example, a few HTML-stripping classes were removed. You can read about this here. This is an excerpt:

* The deprecated HTMLStripReader, HTMLStripWhitespaceTokenizerFactory and
  HTMLStripStandardTokenizerFactory were removed. To strip HTML tags,
  HTMLStripCharFilter should be used instead, and it works with any
  Tokenizer of your choice.  (SOLR-1657)

So you need to make some changes in your schema.xml file. What I had:

<charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>

<tokenizer class="solr.HTMLStripWhitespaceTokenizerFactory"/>

What I have now:

<charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>

<charFilter class="solr.HTMLStripCharFilterFactory"/>

<tokenizer class="solr.WhitespaceTokenizerFactory"/>
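For context, here is a rough sketch of how those three lines might sit inside a complete field type. The field type name text_html and the positionIncrementGap value are assumptions made up for illustration, not taken from my actual schema; only the charFilter and tokenizer lines come from the change above. The char filters are declared before the tokenizer, and Solr applies them in the order listed.

```xml
<!-- Hypothetical field type; the name "text_html" is illustrative only -->
<fieldType name="text_html" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Char filters run on the raw text, in order, before tokenization -->
    <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
    <!-- Replaces the removed HTMLStrip*TokenizerFactory classes -->
    <charFilter class="solr.HTMLStripCharFilterFactory"/>
    <!-- Any tokenizer can follow; whitespace matches the old behaviour -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
  </analyzer>
</fieldType>
```

Since the analysis chain changes, it is probably safest to reindex the affected fields after the upgrade.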
