]> git.openstreetmap.org Git - nominatim.git/history - nominatim/tokenizer
icu: move token deduplication into TokenInfo
[nominatim.git] / nominatim / tokenizer /
2022-03-01 Sarah Hoffmannicu: move token deduplication into TokenInfo
2022-03-01 Sarah Hoffmannicu: move housenumber token computation out of TokenInfo
2022-03-01 Sarah Hoffmannhandle unknown analyzer
2022-03-01 Sarah Hoffmannmove generation of normalized token form to analyzer
2022-02-25 Sarah HoffmannMerge pull request #2614 from lonvia/reorganise-country...
2022-02-24 Sarah Hoffmannbdd: run full import on tests
2022-02-23 Sarah Hoffmanndelete unused country name tokens
2022-02-07 Sarah HoffmannMerge pull request #2602 from lonvia/filter-bad-housenu...
2022-02-07 Sarah Hoffmannadd tests for get_string_list()
2022-02-07 Sarah Hoffmannsanitizer: move helpers into a configuration class
2022-02-07 Sarah Hoffmannimplement is-a-name option for housenumbers
2022-01-21 Sarah HoffmannMerge pull request #2589 from lonvia/clean-housenumbers
2022-01-20 Sarah Hoffmanndo not clean housenumbers in reverse-only mode
2022-01-20 Sarah Hoffmannadd actual removal of housenumber tokens
2022-01-20 Sarah Hoffmannadd new command for cleaning word tokens
2022-01-20 Sarah HoffmannMerge pull request #2588 from lonvia/housenumber-sanitizer
2022-01-20 Sarah Hoffmannfix linting issues
2022-01-20 Sarah Hoffmanncomplete documentation for new clean-houseunubmers...
2022-01-20 Sarah Hoffmanngeneralize filter-kind parameter for sanatizers
2022-01-20 Sarah Hoffmannclean_housenumbers: make kinds and delimiters configurable
2022-01-19 Sarah Hoffmannfactor out housenumber splitting into sanitizer
2022-01-19 Sarah HoffmannMerge pull request #2585 from lonvia/name-mutations
2022-01-18 Sarah Hoffmannfix linting error
2022-01-18 Sarah Hoffmannmove parsing of mutation config to setup phase
2022-01-18 Sarah Hoffmannintroduce mutation variants to generic token analyser
2022-01-18 Sarah Hoffmannmove variant configuration reading in separate file
2022-01-18 Sarah Hoffmannrefactor variant production to use generators
2022-01-04 Sarah HoffmannMerge pull request #2562 from lonvia/copyright-headers
2022-01-03 Sarah Hoffmannadd consistent SPDX copyright headers
2022-01-03 Sarah HoffmannMerge pull request #2559 from lonvia/disable-jit-in...
2021-12-14 Sarah HoffmannMerge pull request #2553 from lonvia/revert-street...
2021-12-08 Sarah Hoffmanncorrectly match abbreviated addr:street
2021-12-06 Sarah Hoffmannskip most addr: tags with suffixes
2021-12-06 Sarah Hoffmannrevert to using full names for street name matching
2021-12-03 Sarah HoffmannMerge pull request #2539 from lonvia/clean-up-python...
2021-12-02 Sarah Hoffmannremove unnecessary pass statements
2021-12-02 Sarah Hoffmannmore unit tests for tokenizers
2021-10-27 Sarah HoffmannMerge pull request #2495 from lonvia/fix-normalization...
2021-10-27 Sarah HoffmannICU: use normalization from config in PHP
2021-10-26 Sarah HoffmannMerge pull request #2493 from lonvia/handle-frequent...
2021-10-26 Sarah Hoffmanndo not count words when in reverse-only mode
2021-10-25 Sarah HoffmannMerge pull request #2486 from lonvia/fix-special-phrases
2021-10-25 Sarah HoffmannICU: add an index over word_ids
2021-10-19 Sarah HoffmannMerge pull request #2472 from lonvia/word-count-computation
2021-10-19 Sarah Hoffmannicu: no longer precompute terms
2021-10-19 Sarah Hoffmannmake word recount a tokenizer-specific function
2021-10-11 Sarah HoffmannMerge pull request #2450 from mtmail/tiger-data-2021
2021-10-09 Sarah HoffmannMerge pull request #2460 from lonvia/multiple-analyzers
2021-10-07 Sarah Hoffmannadd documentation for new configuration of ICU tokenizer
2021-10-07 Sarah Hoffmannfix argument description for check_database
2021-10-06 Sarah Hoffmannreorganize and complete tests around generic token...
2021-10-06 Sarah Hoffmannadd tests for sanitizer tagging language
2021-10-06 Sarah Hoffmannapply variants by languages
2021-10-05 Sarah Hoffmannuse analyser provided in the 'analyzer' property
2021-10-05 Sarah Hoffmannremove support for properties on variants
2021-10-05 Sarah Hoffmannprecompute replacements while loading configuration
2021-10-04 Sarah Hoffmannmove parsing of token analysis config to analyzer
2021-10-04 Sarah Hoffmannmake token analyzers configurable modules
2021-10-04 Sarah Hoffmannextend ICU config to accomodate multiple analysers
2021-10-04 Sarah Hoffmannmove flatten_config_list into config module
2021-10-01 Sarah HoffmannMerge pull request #2458 from lonvia/add-tokenizer...
2021-10-01 Sarah Hoffmannadd unit tests for new sanatizer functions
2021-10-01 Sarah Hoffmannintroduce sanitizer step before token analysis
2021-10-01 Sarah Hoffmannunify ICUNameProcessorRules and ICURuleLoader
2021-09-29 Sarah Hoffmannfix typo
2021-09-29 Sarah Hoffmannexport more data for the tokenizer name preparation
2021-09-29 Sarah Hoffmannadd wrapper class for place data passed to tokenizer
2021-09-28 Sarah HoffmannMerge pull request #2455 from lonvia/adjust-address...
2021-09-28 Sarah HoffmannMerge pull request #2454 from lonvia/sort-out-token...
2021-09-27 Sarah Hoffmannremove unused parameter
2021-09-27 Sarah Hoffmannicu tokenizer: switch to matching against partial names
2021-09-04 Sarah HoffmannMerge pull request #2440 from lonvia/generic-config...
2021-09-04 Sarah Hoffmannfix indent
2021-09-03 Sarah Hoffmannintroduce generic YAML config loader
2021-09-03 Sarah HoffmannMerge pull request #2437 from lonvia/tweak-ranking...
2021-09-03 Sarah HoffmannMerge pull request #2436 from lonvia/country-configuration
2021-09-02 Sarah Hoffmannremove language and partition from name import
2021-08-18 Sarah HoffmannMerge pull request #2428 from lonvia/rename-icu-tokenizer
2021-08-17 Sarah Hoffmannrename legacy_icu tokenizer to icu tokenizer
2021-08-17 Sarah HoffmannMerge pull request #2425 from lonvia/tokenizer-document...
2021-08-16 Sarah Hoffmanndefine formal public Python interface for tokenizer
2021-07-28 Sarah HoffmannMerge pull request #2408 from lonvia/icu-change-word...
2021-07-28 Sarah Hoffmannfix Python linitin errors
2021-07-28 Sarah Hoffmannreinstate word column in icu word table
2021-07-28 Sarah Hoffmannadapt unit test for new word table
2021-07-28 Sarah Hoffmannconvert word info column to json before copying
2021-07-28 Sarah Hoffmannswitch word tokens to new word table layout
2021-07-28 Sarah Hoffmannswitch special phrases to new word table format
2021-07-28 Sarah Hoffmannswitch postcode tokens to new word table layout
2021-07-28 Sarah Hoffmannswitch housenumber tokens to new word table layout
2021-07-28 Sarah Hoffmannswitch country name tokens to new word table layout
2021-07-28 Sarah Hoffmannnew word table layout for icu tokenizer
2021-07-13 Sarah HoffmannMerge pull request #2393 from lonvia/fix-flake8-issues
2021-07-12 Sarah Hoffmannuse psycopg's SQL quoting where possible
2021-07-12 Sarah Hoffmannadd helper function for execute_values
2021-07-12 Sarah Hoffmannmore formatting fixes
2021-07-12 Sarah HoffmannMerge pull request #2391 from lonvia/fix-sonar-issues
2021-07-12 Sarah Hoffmannsplit up variant computation for better readability
2021-07-12 Sarah Hoffmannreorganise process_place function
2021-07-07 Sarah HoffmannMerge pull request #2384 from lonvia/actions-add-icu...
next