]> git.openstreetmap.org Git - nominatim.git/commit
introduce mutation variants to generic token analyser
authorSarah Hoffmann <lonvia@denofr.de>
Wed, 12 Jan 2022 15:25:47 +0000 (16:25 +0100)
committerSarah Hoffmann <lonvia@denofr.de>
Tue, 18 Jan 2022 10:09:21 +0000 (11:09 +0100)
commitb453b0ea95e7b1244912b7bc9fc26f58acb8ec80
tree142d09803238af855dc0ab8d13944c5f2bbdf025
parent0192a7af96d32faf5dd319469d376bf4140dfcbb
introduce mutation variants to generic token analyser

Mutations are regular-expression-based replacements that are applied
after variants have been computed. They are meant to be used for
variations on character level.

Add spelling variations for German umlauts.
nominatim/tokenizer/token_analysis/generic.py
nominatim/tokenizer/token_analysis/generic_mutation.py [new file with mode: 0644]
settings/icu_tokenizer.yaml
test/bdd/db/import/naming.feature
test/python/tokenizer/token_analysis/test_generic_mutation.py [new file with mode: 0644]