[solr-user] KeywordTokenizer for faceting gives too many results

Subject: KeywordTokenizer for faceting gives too many results
From: Michael Kuhlmann <michael.kuhlmann@...>
Date: 1969-12-31 19:00

Hi,

I have some fields that are used only for faceting, so they are
queried only with values taken from facet results. The values need no
modification: no lowercasing, nothing. So the KeywordTokenizerFactory
seems to be perfect for them.

Alas, when the value contains spaces, I'm still getting too many
results. I have a field defined like this:

    <fieldType name="text_unchanged" class="solr.StrField"
               positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.KeywordTokenizerFactory"/>
      </analyzer>
    </fieldType>

(Using solr.TextField didn't change anything)

When querying with:
....&fq=label:Aces+of+London

I get the result:
.... "facet_fields":{
	"label":[
	 "Aces of London",31,
	 "Feud London",2,
	 "Fly London",2],
....},

I get the same result when taking "Feud London" as the facet value.

When inspecting the index with the schema browser, I can see that all
labels are indexed as complete tokens, i.e. there's no token "London",
but there is a token "Aces of London". So the KeywordTokenizer seems to
work as expected, at least for indexing. It's only that the facet query
is not narrow enough.
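
For illustration, here is a minimal sketch of what the filter parameter
above decodes to once the URL is parsed, next to a phrase-quoted variant
for comparison. Python is used only to show the encoding; the helper name
build_fq is hypothetical, and the comment about clause splitting assumes
the standard Lucene query parser, which this message does not confirm is
the one in use.

```python
from urllib.parse import parse_qs, urlencode


def build_fq(field: str, value: str, quoted: bool) -> str:
    """Return a URL-encoded fq parameter (hypothetical helper, illustration only)."""
    # Escape backslashes and double quotes so the value can sit inside a phrase.
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    term = f'"{escaped}"' if quoted else value
    return urlencode({"fq": f"{field}:{term}"})


# The parameter as sent in the request above: each '+' decodes to a space.
decoded = parse_qs("fq=label:Aces+of+London")["fq"][0]
print(decoded)  # label:Aces of London

# Assumption: with the standard Lucene query parser, whitespace separates
# clauses, so only "Aces" is tied to the label field in the string above.
# Quoting keeps the whole value together as one phrase for that field.
quoted_param = build_fq("label", "Aces of London", quoted=True)
print(quoted_param)  # fq=label%3A%22Aces+of+London%22
```

The quoted form is only a candidate to compare against the original
request; nothing in this message confirms that it changes the facet counts.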

Even the superb Solr book didn't help me. Does anybody have a clue
what I'm doing wrong?

Greetings,
Michael
