June 17, 2009

DotLucene and Accented Characters

While working on projects with Lucene, I had to index data from a database and make it searchable. One of the issues I came across was that first and last names containing accents did not play nicely with search. Once again, a Java filter existed for this, but there was nothing in C#. You can find here my conversion of ISOLatin1AccentFilter, a filter that replaces accented characters in the ISO Latin 1 character set with their unaccented equivalents (case is not altered).
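
To give a rough idea of what this kind of filter does, here is a minimal Java sketch of accent folding. It is not the actual ISOLatin1AccentFilter code: the class name AccentFolding, the method fold, and the mapping table are made up for illustration, and the table is only a partial, assumed set of replacements. It works on a plain string rather than on a Lucene token stream.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative sketch only: AccentFolding and fold are hypothetical names,
// and the table below is a partial mapping, not the real filter's table.
public class AccentFolding {
    private static final Map<Character, String> FOLD = new HashMap<>();

    // Map every character in the inclusive range [from, to] to 'plain'.
    private static void put(char from, char to, String plain) {
        for (char c = from; c <= to; c++) {
            FOLD.put(c, plain);
        }
    }

    static {
        // Upper-case letters keep their case.
        put('\u00C0', '\u00C5', "A");   // A with grave .. A with ring
        put('\u00C7', '\u00C7', "C");   // C with cedilla
        put('\u00C8', '\u00CB', "E");   // E with grave .. E with diaeresis
        put('\u00CC', '\u00CF', "I");
        put('\u00D1', '\u00D1', "N");   // N with tilde
        put('\u00D2', '\u00D6', "O");
        put('\u00D9', '\u00DC', "U");
        // Lower-case letters keep their case too.
        put('\u00E0', '\u00E5', "a");
        put('\u00E7', '\u00E7', "c");
        put('\u00E8', '\u00EB', "e");
        put('\u00EC', '\u00EF', "i");
        put('\u00F1', '\u00F1', "n");
        put('\u00F2', '\u00F6', "o");
        put('\u00F9', '\u00FC', "u");
        // Some characters expand to more than one letter.
        put('\u00DF', '\u00DF', "ss");  // sharp s
    }

    // Replace each mapped character; leave everything else untouched.
    public static String fold(String input) {
        StringBuilder out = new StringBuilder(input.length());
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            String replacement = FOLD.get(c);
            if (replacement != null) {
                out.append(replacement);
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // "Jos\u00E9" is "Jose" with an acute accent on the final e.
        System.out.println(fold("Jos\u00E9"));
    }
}
```

For this to help with searching, the same folding has to be applied both when indexing and when parsing the query, so that a name stored with an accent and a query typed without one end up as the same term.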
