Make ICU tokenizer the default #2707

Merged
merged 6 commits into from
May 11, 2022

lonvia
Member

@lonvia lonvia commented May 10, 2022

The ICU tokenizer has been used in production on osm.org for half a year without major issues. Time to make it the default tokenizer. With the legacy tokenizer no longer the default, there is also no need to build the PostgreSQL module automatically any longer. The code will remain and is still tested on the CI, but compilation needs to be explicitly activated. Existing installations should already have the module around, so there will be very little need to compile it in the future.

@mtmail I'm aware that this conflicts rather strongly with the ongoing work in #2691. I'm happy to wait for it to be merged and resolve the conflicts here afterwards. But there are a couple of simplifications in the CI scripts that might make the other PR simpler as well. Among other things, I have reduced the former 'icu-test', which has now become 'legacy-test', to run on Ubuntu 20.04 only. Given that the legacy tokenizer is the past, that will be enough.

@mtmail
Collaborator

mtmail commented May 10, 2022

I'm happy to get Ubuntu-22 merge-ready again later; it's not pressing. It's just something I know users will eventually ask for. We've been using the ICU tokenizer fine for months, too, with no issues.

@lonvia lonvia merged commit 5ff35d9 into osm-search:master May 11, 2022
@lonvia lonvia deleted the make-icu-tokenizer-the-default branch May 11, 2022 06:52