Friday, May 16, 2014

Installing fast indexed search for IMAP e-mail

With Dovecot and apache SOLR it's possible to install very fast mail search. You can search through the entire message contents within fractions of a second.

I've used Ubuntu and Debian to set this up.

Tomcat SOLR


Install the tomcat solr server with the following command:

$ apt-get install solr-tomcat

Find the SOLR schema on page http://hg.dovecot.org/dovecot-2.2/file/tip/doc/solr-schema.xml

Make sure your dovecot version matches:

Dovecot 2.2: (Ubuntu 14.04)
http://hg.dovecot.org/dovecot-2.2/raw-file/e99cd21e1f92/doc/solr-schema.xml

Dovecot 2.1 (Debian Wheezy)
http://hg.dovecot.org/dovecot-2.1/raw-file/300a3a81c2cb/doc/solr-schema.xml

Download it like this:

$ wget http://hg.dovecot.org/dovecot-2.2/raw-file/e99cd21e1f92/doc/solr-schema.xml

Move the file to the right location:

$ mv /etc/solr/conf/schema.xml /etc/solr/conf/schema.xml.bak
$ mv solr-schema.xml /etc/solr/conf/schema.xml

$ service tomcat6 restart

Check if tomcat works by browsing to:

http://localhost:8080

Note: I use Proxmox with openvz and it was necessary to assign 2 cpu's to the openvz container to get tomcat to run properly!

Security note
The admin page is publicly accessible by default! So I made sure only local connections are allowed by adding this line between the <Host></Host> tag in /etc/tomcat6/server.xml

<Valve className="org.apache.catalina.valves.RemoteAddrValve" allow="127.0.0.1"/>

Dovecot

Now setup Dovecot.

Install the plugin package:

$ apt-get install dovecot-solr

Then modify:

/etc/dovecot/conf.d/10-mail.conf:

...
# Space separated list of plugins to load for all services. Plugins specific to
# IMAP, LDA, etc. are added to this list in their own .conf files.
#mail_plugins =
mail_plugins = $mail_plugins fts fts_solr
...


/etc/dovecot/conf.d/90-plugin.conf:

...
plugin {
  fts = solr
  fts_solr = break-imap-search url=http://127.0.0.1:8080/solr/
}
...

The "break-imap-search" option will use Solr also for indexing TEXT and BODY searches. This makes your server non-IMAP-compliant, but it's what people want ;). This is always enabled in v2.1+.

Now when an IMAP client does a "SEARCH TEXT keyword command" you should see these log entries in /var/log/mail.log:

May 15 14:19:29 mail.example.com dovecot: indexer-worker(admin@intermesh.dev): Indexed 294 messages in INBOX

For more info read: http://wiki2.dovecot.org/Plugins/FTS/Solr

Enjoy the fast searches!

18 comments:

  1. Is it also possible to search with more than just one keyword?

    ReplyDelete
  2. This comment has been removed by the author.

    ReplyDelete
  3. Your article is what I've been looking for for a long time. I'm happy to find you like this. Could you visit my website if you have time? I'm sure you'll find a post of interest that you'll find interesting. 카지노사이트

    ReplyDelete
  4. I love the efforts you have put in this, thanks for all the great blog posts. You should take part in a contest for probably the greatest blogs on the web. I will suggest this web site! 스포츠토토

    ReplyDelete
  5. Some genuinely interesting points you have written. Helped me a lot, just what I was looking for Very clean and excellent user friendly style and design, as well as the content. You’re an expert in this topic! 바카라사이트

    ReplyDelete
  6. I really appreciate this post. I’ve been looking all over for this! Thank goodness I found it on Bing. You’ve made my day! Thank you again 파워볼

    ReplyDelete
  7. I appreciate, result in I found exactly what I used to be taking a look for. You’ve ended my four day lengthy hunt! God Bless you man. Have a nice day. Bye 카지노사이트

    ReplyDelete
  8. https://gameeffect.xyz Lots of people have cherished the game of baseball for a long time. You can find fans worldwide, from specialized very little-leaguers to die-difficult spectators. This article has suggestions to show how enjoyable baseball actually is.

    ReplyDelete
  9. https://gamezoom.xyz Getting a exercise spouse can significantly increase your muscle mass-building final results. Your companion might be a valuable method to obtain determination for sticking to your training session session, and pushing anyone to optimize your efforts when you workout. Possessing a dependable companion to work through with will also help help you stay safe as you will invariably have got a spotter.

    ReplyDelete
  10. Your content is nothing short of brilliant in many ways. I think this is engaging and eye-opening material. Thank you so much for caring about your content and your readers. 바카라사이트

    ReplyDelete
  11. https://gameboot.xyz You can see them on mags and also on Television, women and men who appear like their hands and thighs and legs will explode since their muscles are extremely large! There is absolutely no need to have that you should take your system to this stage should you don't prefer to, as the basic methods on this page will enable you to develop muscles within a healthy approach.

    ReplyDelete
  12. I wanted to thank you for this excellent read!! I definitely loved every little bit of it. I have you bookmarked your site to check out the new stuff you post. thefate.org site

    ReplyDelete
  13. If we are going to talk about murals, it is a harsh reality that there are people who do not appreciate artworks like this. I fell bad, but there is less that I can do; I want them to realize that murals are beautiful and worth appreciating. But I simply don't know where to start. Sometimes, I think of writing as my way to influence people to appraiser things that should be appreciated. I don't know if that is i need an assignment going to be effective, but I want to give it a try.

    ReplyDelete
  14. Indexing is the process of looking at files, email messages, and other content on your PC and cataloging their information.

    ReplyDelete
  15. They say, "All good boys go to heaven" bismarck cleaning services

    ReplyDelete