Yay.
redaktor got that too. I am interested in “Alphabet doesn't matter” - can you give a pointer to code?
What I did a while ago is in
https://github.com/redaktor/widgets-preview/tree/master/src/framework/String/search
and worked on phonetic search …
PS, just in case anyone needs to detect languages, this covers 800+ … https://github.com/redaktor/languages
@sl007 https://github.com/grishka/Smithereen/blob/d1433c5531be7c513860c9a369ec138afd180e92/src/main/java/smithereen/storage/SearchStorage.java
I use https://github.com/jirutka/unidecode to transliterate everything into the Latin alphabet and store the resulting strings in a separate table (qsearch_index). So, basically, search only ever operates on Latin strings.
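
For anyone who wants the idea without reading the linked file, here is a rough sketch of "transliterate at index time, transliterate the query, compare Latin to Latin". It is not the Smithereen code: the class `LatinSearchSketch`, the method `toLatin`, and the tiny mapping table are all made up for illustration, and the in-memory map only stands in for a table like qsearch_index. A real setup would presumably call a transliteration library such as unidecode instead of the toy table.

```java
import java.text.Normalizer;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class LatinSearchSketch {
    // Hypothetical, deliberately tiny transliteration table (a few Cyrillic letters only).
    // A real library covers far more scripts; this is just enough for the demo.
    private static final Map<Character, String> TABLE = new HashMap<>();
    static {
        String[][] pairs = {
            {"п", "p"}, {"р", "r"}, {"и", "i"}, {"в", "v"},
            {"е", "e"}, {"т", "t"}, {"м", "m"}, {"а", "a"}
        };
        for (String[] p : pairs) {
            TABLE.put(p[0].charAt(0), p[1]);
        }
    }

    // Lowercase, strip combining marks ("é" -> "e"), then map known letters to Latin.
    static String toLatin(String input) {
        String s = Normalizer
                .normalize(input.toLowerCase(Locale.ROOT), Normalizer.Form.NFD)
                .replaceAll("\\p{M}+", "");
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            out.append(TABLE.getOrDefault(c, String.valueOf(c)));
        }
        return out.toString();
    }

    // Stand-in for the separate search table: document id -> Latin-only text.
    private final Map<Integer, String> index = new LinkedHashMap<>();

    void add(int id, String text) {
        index.put(id, toLatin(text));
    }

    // The query goes through the same transliteration, so only Latin strings are compared.
    List<Integer> search(String query) {
        String needle = toLatin(query);
        List<Integer> hits = new ArrayList<>();
        for (Map.Entry<Integer, String> e : index.entrySet()) {
            if (e.getValue().contains(needle)) {
                hits.add(e.getKey());
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        LatinSearchSketch idx = new LatinSearchSketch();
        idx.add(1, "Привет мир");
        idx.add(2, "Café");
        System.out.println(idx.search("privet"));
        System.out.println(idx.search("cafe"));
    }
}
```

With this shape a Latin query and a Cyrillic query for the same word hit the same row, which is how I read "search only ever operates on Latin strings". The substring match here is just the simplest thing that works in memory; a database would more likely use a full-text or prefix index on the Latin column.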