Indexing and searching with sphinx for words with signs like (++, #, .) -
hi have build index , need search words "c++", ".net" or "c#", there not result coming. here config:
source = xxxx path = /usr/local/etc/sphinx/var/data/xxxx docinfo = extern charset_type = utf-8 min_word_len = 1 min_infix_len = 7 stopwords = /usr/local/etc/sphinx/var/stopwords/stop_words_en.txt
i have try search sph_match_phrase , sph_match_all, there nothing useful coming.
what can allow this?
thanks nik
you have configure charset_table include symbols +, #, .,
ie
index your_index { charset_table = 0..9, a..z->a..z, a..z, +, #, u+002e,
you check how words tokenized via call keywords mysql call
with config provided i'd such output
mysql> call keywords ('c++ .net c# end_of_a_sentence.', 'your_index')
tokenized normalized c c net net c c end end of of sentence sentence
with addition config output tokenized normalized c++ c++ .net .net c# c# end end of of sentence. sentence.
the downside of dot (.) in charset_table word @ end of sentence tokenized , indexed dot
'the example sentence.'
the 'sentence' word tokenized 'sentence.' , searching 'sentence' gives nothing.
Comments
Post a Comment