]> git.openstreetmap.org Git - nominatim.git/history - nominatim/tokenizer/icu_tokenizer.py
Merge pull request #3011 from lonvia/fix-flex-scripts
[nominatim.git] / nominatim / tokenizer / icu_tokenizer.py
2022-08-12 Tareq Al-AhdalMerge remote-tracking branch 'upstream/master' into...
2022-08-09 Sarah HoffmannMerge pull request #2792 from lonvia/new-type-annotations
2022-08-09 Sarah Hoffmannadapt to new type annotations from typeshed
2022-07-31 Sarah HoffmannMerge pull request #2784 from lonvia/doscs-customizing...
2022-07-29 Sarah Hoffmannoverhaul the token analysis interface
2022-07-29 Sarah Hoffmannmove PlaceName into the generic data module
2022-07-20 Sarah HoffmannMerge pull request #2772 from kianmeng/fix-typos
2022-07-20 Kian-Meng Angdocs: fix typos
2022-07-19 Sarah HoffmannMerge pull request #2770 from lonvia/typed-python
2022-07-18 Sarah Hoffmannadd type annotations to special phrase importer
2022-07-18 Sarah Hoffmannadd type annotations for ICU tokenizer
2022-07-07 Sarah HoffmannMerge pull request #2760 from lonvia/reorganize-data...
2022-07-06 Sarah Hoffmannmove PlaceInfo into data submodule
2022-06-24 Sarah HoffmannMerge pull request #2757 from lonvia/filter-postcodes
2022-06-23 Sarah Hoffmannhandle postcodes properly on word table updates
2022-06-23 Sarah Hoffmannfix linting issue
2022-06-23 Sarah Hoffmannfix up BDD tests for postcode changes
2022-06-23 Sarah Hoffmannmove postcode matcher in a separate file
2022-06-23 Sarah Hoffmannicu: switch postcodes to using the pre-formatted one
2022-06-23 Sarah Hoffmannintroduce and use analyzer for postcodes
2022-05-31 Sarah HoffmannMerge pull request #2732 from lonvia/fix-ordering-addre...
2022-05-31 Sarah HoffmannMerge pull request #2731 from lonvia/cleanup-special...
2022-05-31 Sarah HoffmannMerge pull request #2730 from lonvia/exclude-inclusion-tag
2022-05-31 Sarah Hoffmannexclude addr:inclusion from search
2022-05-11 Sarah HoffmannMerge pull request #2709 from lonvia/less-strict-countr...
2022-05-11 Sarah HoffmannMerge pull request #2708 from lonvia/use-format-literals
2022-05-11 Sarah Hoffmannpylint: disable no-self-use check
2022-05-11 Sarah HoffmannMerge pull request #2707 from lonvia/make-icu-tokenizer...
2022-05-10 Sarah Hoffmannalways state encoding when opening files in text mode
2022-04-04 Sarah HoffmannMerge pull request #2629 from tareqpi/country-names...
2022-03-20 Sarah HoffmannMerge pull request #2641 from lonvia/reinit-tokenizer-dir
2022-03-20 Sarah Hoffmannrestore the tokenizer directory when missing
2022-03-01 Sarah HoffmannMerge pull request #2621 from lonvia/housenumber-analyzer
2022-03-01 Sarah Hoffmanndo not expand records in select list
2022-03-01 Sarah Hoffmannfix linting issue
2022-03-01 Sarah Hoffmannadapt housenumber cleanup to new word table structure
2022-03-01 Sarah Hoffmannadd framework for analysing housenumbers
2022-03-01 Sarah Hoffmannicu: move token deduplication into TokenInfo
2022-03-01 Sarah Hoffmannicu: move housenumber token computation out of TokenInfo
2022-03-01 Sarah Hoffmannhandle unknown analyzer
2022-03-01 Sarah Hoffmannmove generation of normalized token form to analyzer
2022-02-25 Sarah HoffmannMerge pull request #2614 from lonvia/reorganise-country...
2022-02-24 Sarah Hoffmannbdd: run full import on tests
2022-02-23 Sarah Hoffmanndelete unused country name tokens
2022-01-21 Sarah HoffmannMerge pull request #2589 from lonvia/clean-housenumbers
2022-01-20 Sarah Hoffmanndo not clean housenumbers in reverse-only mode
2022-01-20 Sarah Hoffmannadd actual removal of housenumber tokens
2022-01-20 Sarah Hoffmannadd new command for cleaning word tokens
2022-01-20 Sarah HoffmannMerge pull request #2588 from lonvia/housenumber-sanitizer
2022-01-19 Sarah Hoffmannfactor out housenumber splitting into sanitizer
2022-01-04 Sarah HoffmannMerge pull request #2562 from lonvia/copyright-headers
2022-01-03 Sarah Hoffmannadd consistent SPDX copyright headers
2022-01-03 Sarah HoffmannMerge pull request #2559 from lonvia/disable-jit-in...
2021-12-14 Sarah HoffmannMerge pull request #2553 from lonvia/revert-street...
2021-12-08 Sarah Hoffmanncorrectly match abbreviated addr:street
2021-12-06 Sarah Hoffmannskip most addr: tags with suffixes
2021-12-06 Sarah Hoffmannrevert to using full names for street name matching
2021-10-27 Sarah HoffmannMerge pull request #2495 from lonvia/fix-normalization...
2021-10-27 Sarah HoffmannICU: use normalization from config in PHP
2021-10-26 Sarah HoffmannMerge pull request #2493 from lonvia/handle-frequent...
2021-10-26 Sarah Hoffmanndo not count words when in reverse-only mode
2021-10-25 Sarah HoffmannMerge pull request #2486 from lonvia/fix-special-phrases
2021-10-25 Sarah HoffmannICU: add an index over word_ids
2021-10-19 Sarah HoffmannMerge pull request #2472 from lonvia/word-count-computation
2021-10-19 Sarah Hoffmannicu: no longer precompute terms
2021-10-19 Sarah Hoffmannmake word recount a tokenizer-specific function
2021-10-11 Sarah HoffmannMerge pull request #2450 from mtmail/tiger-data-2021
2021-10-09 Sarah HoffmannMerge pull request #2460 from lonvia/multiple-analyzers
2021-10-05 Sarah Hoffmannuse analyser provided in the 'analyzer' property
2021-10-01 Sarah HoffmannMerge pull request #2458 from lonvia/add-tokenizer...
2021-10-01 Sarah Hoffmannintroduce sanitizer step before token analysis
2021-10-01 Sarah Hoffmannunify ICUNameProcessorRules and ICURuleLoader
2021-09-29 Sarah Hoffmannexport more data for the tokenizer name preparation
2021-09-29 Sarah Hoffmannadd wrapper class for place data passed to tokenizer
2021-09-28 Sarah HoffmannMerge pull request #2455 from lonvia/adjust-address...
2021-09-28 Sarah HoffmannMerge pull request #2454 from lonvia/sort-out-token...
2021-09-27 Sarah Hoffmannremove unused parameter
2021-09-27 Sarah Hoffmannicu tokenizer: switch to matching against partial names
2021-09-04 Sarah HoffmannMerge pull request #2440 from lonvia/generic-config...
2021-09-04 Sarah Hoffmannfix indent
2021-09-03 Sarah Hoffmannintroduce generic YAML config loader
2021-08-18 Sarah HoffmannMerge pull request #2428 from lonvia/rename-icu-tokenizer
2021-08-17 Sarah Hoffmannrename legacy_icu tokenizer to icu tokenizer