]> git.openstreetmap.org Git - nominatim.git/history - nominatim/tokenizer
Vagrant and CI tests for Ubuntu 22.04
[nominatim.git] / nominatim / tokenizer /
2022-07-07 Sarah HoffmannMerge pull request #2760 from lonvia/reorganize-data...
2022-07-06 Sarah Hoffmannmove country_info into data submodule
2022-07-06 Sarah Hoffmannmove PlaceInfo into data submodule
2022-06-24 Sarah HoffmannMerge pull request #2757 from lonvia/filter-postcodes
2022-06-23 Sarah Hoffmannhandle postcodes properly on word table updates
2022-06-23 Sarah Hoffmannadd documentation for postcode customization
2022-06-23 Sarah Hoffmannfix linting issue
2022-06-23 Sarah Hoffmannfix up BDD tests for postcode changes
2022-06-23 Sarah Hoffmannport legacy tokenizer to new postcode handling
2022-06-23 Sarah Hoffmannfix postcode pattern for Mozambique
2022-06-23 Sarah Hoffmannmove postcode matcher in a separate file
2022-06-23 Sarah Hoffmannicu: switch postcodes to using the pre-formatted one
2022-06-23 Sarah Hoffmannintroduce and use analyzer for postcodes
2022-06-23 Sarah Hoffmannpostcodes: introduce a default pattern for countries...
2022-06-23 Sarah Hoffmannpostcode: generate a generic form
2022-06-23 Sarah Hoffmannpostcodes: add support for optional spaces
2022-06-23 Sarah Hoffmannpostcodes: strip leading country codes
2022-06-23 Sarah Hoffmanninitial postcode cleaner for simple patterns
2022-06-23 Sarah Hoffmannremove postcodes from countries that don't have them
2022-05-31 Sarah HoffmannMerge pull request #2732 from lonvia/fix-ordering-addre...
2022-05-31 Sarah HoffmannMerge pull request #2731 from lonvia/cleanup-special...
2022-05-31 Sarah HoffmannMerge pull request #2730 from lonvia/exclude-inclusion-tag
2022-05-31 Sarah Hoffmannexclude addr:inclusion from search
2022-05-11 Sarah HoffmannMerge pull request #2709 from lonvia/less-strict-countr...
2022-05-11 Sarah HoffmannMerge pull request #2708 from lonvia/use-format-literals
2022-05-11 Sarah Hoffmannpylint: disable no-self-use check
2022-05-11 Sarah Hoffmannpylint: avoid explicit use of format() function
2022-05-11 Sarah HoffmannMerge pull request #2707 from lonvia/make-icu-tokenizer...
2022-05-10 Sarah Hoffmannalways state encoding when opening files in text mode
2022-04-04 Sarah HoffmannMerge pull request #2629 from tareqpi/country-names...
2022-03-20 Sarah HoffmannMerge pull request #2641 from lonvia/reinit-tokenizer-dir
2022-03-20 Sarah Hoffmannrestore the tokenizer directory when missing
2022-03-01 Sarah HoffmannMerge pull request #2621 from lonvia/housenumber-analyzer
2022-03-01 Sarah Hoffmanndo not expand records in select list
2022-03-01 Sarah Hoffmannfix linting issue
2022-03-01 Sarah Hoffmannadapt housenumber cleanup to new word table structure
2022-03-01 Sarah Hoffmannhousenumber analyzer: avoid creating too many variants
2022-03-01 Sarah Hoffmannadd new analyser for houenumbers
2022-03-01 Sarah Hoffmannadd framework for analysing housenumbers
2022-03-01 Sarah Hoffmannicu: move token deduplication into TokenInfo
2022-03-01 Sarah Hoffmannicu: move housenumber token computation out of TokenInfo
2022-03-01 Sarah Hoffmannhandle unknown analyzer
2022-03-01 Sarah Hoffmannmove generation of normalized token form to analyzer
2022-02-25 Sarah HoffmannMerge pull request #2614 from lonvia/reorganise-country...
2022-02-24 Sarah Hoffmannbdd: run full import on tests
2022-02-23 Sarah Hoffmanndelete unused country name tokens
2022-02-07 Sarah HoffmannMerge pull request #2602 from lonvia/filter-bad-housenu...
2022-02-07 Sarah Hoffmannadd tests for get_string_list()
2022-02-07 Sarah Hoffmannsanitizer: move helpers into a configuration class
2022-02-07 Sarah Hoffmannimplement is-a-name option for housenumbers
2022-01-21 Sarah HoffmannMerge pull request #2589 from lonvia/clean-housenumbers
2022-01-20 Sarah Hoffmanndo not clean housenumbers in reverse-only mode
2022-01-20 Sarah Hoffmannadd actual removal of housenumber tokens
2022-01-20 Sarah Hoffmannadd new command for cleaning word tokens
2022-01-20 Sarah HoffmannMerge pull request #2588 from lonvia/housenumber-sanitizer
2022-01-20 Sarah Hoffmannfix linting issues
2022-01-20 Sarah Hoffmanncomplete documentation for new clean-houseunubmers...
2022-01-20 Sarah Hoffmanngeneralize filter-kind parameter for sanatizers
2022-01-20 Sarah Hoffmannclean_housenumbers: make kinds and delimiters configurable
2022-01-19 Sarah Hoffmannfactor out housenumber splitting into sanitizer
2022-01-19 Sarah HoffmannMerge pull request #2585 from lonvia/name-mutations
2022-01-18 Sarah Hoffmannfix linting error
2022-01-18 Sarah Hoffmannmove parsing of mutation config to setup phase
2022-01-18 Sarah Hoffmannintroduce mutation variants to generic token analyser
2022-01-18 Sarah Hoffmannmove variant configuration reading in separate file
2022-01-18 Sarah Hoffmannrefactor variant production to use generators
2022-01-04 Sarah HoffmannMerge pull request #2562 from lonvia/copyright-headers
2022-01-03 Sarah Hoffmannadd consistent SPDX copyright headers
2022-01-03 Sarah HoffmannMerge pull request #2559 from lonvia/disable-jit-in...
2021-12-14 Sarah HoffmannMerge pull request #2553 from lonvia/revert-street...
2021-12-08 Sarah Hoffmanncorrectly match abbreviated addr:street
2021-12-06 Sarah Hoffmannskip most addr: tags with suffixes
2021-12-06 Sarah Hoffmannrevert to using full names for street name matching
2021-12-03 Sarah HoffmannMerge pull request #2539 from lonvia/clean-up-python...
2021-12-02 Sarah Hoffmannremove unnecessary pass statements
2021-12-02 Sarah Hoffmannmore unit tests for tokenizers
2021-10-27 Sarah HoffmannMerge pull request #2495 from lonvia/fix-normalization...
2021-10-27 Sarah HoffmannICU: use normalization from config in PHP
2021-10-26 Sarah HoffmannMerge pull request #2493 from lonvia/handle-frequent...
2021-10-26 Sarah Hoffmanndo not count words when in reverse-only mode
2021-10-25 Sarah HoffmannMerge pull request #2486 from lonvia/fix-special-phrases
2021-10-25 Sarah HoffmannICU: add an index over word_ids
2021-10-19 Sarah HoffmannMerge pull request #2472 from lonvia/word-count-computation
2021-10-19 Sarah Hoffmannicu: no longer precompute terms
2021-10-19 Sarah Hoffmannmake word recount a tokenizer-specific function
2021-10-11 Sarah HoffmannMerge pull request #2450 from mtmail/tiger-data-2021
2021-10-09 Sarah HoffmannMerge pull request #2460 from lonvia/multiple-analyzers
2021-10-07 Sarah Hoffmannadd documentation for new configuration of ICU tokenizer
2021-10-07 Sarah Hoffmannfix argument description for check_database
2021-10-06 Sarah Hoffmannreorganize and complete tests around generic token...
2021-10-06 Sarah Hoffmannadd tests for sanitizer tagging language
2021-10-06 Sarah Hoffmannapply variants by languages
2021-10-05 Sarah Hoffmannuse analyser provided in the 'analyzer' property
2021-10-05 Sarah Hoffmannremove support for properties on variants
2021-10-05 Sarah Hoffmannprecompute replacements while loading configuration
2021-10-04 Sarah Hoffmannmove parsing of token analysis config to analyzer
2021-10-04 Sarah Hoffmannmake token analyzers configurable modules
2021-10-04 Sarah Hoffmannextend ICU config to accomodate multiple analysers
2021-10-04 Sarah Hoffmannmove flatten_config_list into config module
2021-10-01 Sarah HoffmannMerge pull request #2458 from lonvia/add-tokenizer...
next