As a reminder, for non-English text, especially in languages that do not separate words with spaces, try the trigram or gse tokenization methods, which were added in Weaviate v1.24 for such cases.
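To see why this matters, here is a minimal pure-Python sketch comparing whitespace splitting with overlapping character trigrams on a Japanese sentence. It is only an illustration of the general idea, not Weaviate's actual implementation, and the helper names `whitespace_tokens` and `char_trigrams` are hypothetical.

```python
def whitespace_tokens(text: str) -> list[str]:
    """Split on whitespace only (illustrative; not Weaviate's code)."""
    return text.lower().split()


def char_trigrams(text: str) -> list[str]:
    """Return overlapping 3-character windows, ignoring whitespace.

    Hypothetical helper that sketches the idea behind trigram
    tokenization; Weaviate's own rules may differ in detail.
    """
    chars = [c for c in text.lower() if not c.isspace()]
    if len(chars) < 3:
        # Too short for a full trigram: keep whatever is there.
        return ["".join(chars)] if chars else []
    return ["".join(chars[i:i + 3]) for i in range(len(chars) - 2)]


sentence = "東京都に住んでいます"  # no spaces between words

# Whitespace splitting yields one giant token, so a query for a
# single word inside the sentence has nothing to match against.
print(whitespace_tokens(sentence))

# Trigrams yield many small overlapping tokens that a query can match.
print(char_trigrams(sentence))
```

With whitespace splitting, the whole sentence becomes a single token, so a keyword search for one word inside it would not match. Overlapping trigrams give the index smaller units to match against, at the cost of a larger index.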

