]> git.openstreetmap.org Git - nominatim.git/history - nominatim/tokenizer
bdd: recreate project directory for every run
[nominatim.git] / nominatim / tokenizer /
2021-12-08 Sarah Hoffmanncorrectly match abbreviated addr:street
2021-12-06 Sarah Hoffmannskip most addr: tags with suffixes
2021-12-06 Sarah Hoffmannrevert to using full names for street name matching
2021-12-03 Sarah HoffmannMerge pull request #2539 from lonvia/clean-up-python...
2021-12-02 Sarah Hoffmannremove unnecessary pass statements
2021-12-02 Sarah Hoffmannmore unit tests for tokenizers
2021-10-27 Sarah HoffmannMerge pull request #2495 from lonvia/fix-normalization...
2021-10-27 Sarah HoffmannICU: use normalization from config in PHP
2021-10-26 Sarah HoffmannMerge pull request #2493 from lonvia/handle-frequent...
2021-10-26 Sarah Hoffmanndo not count words when in reverse-only mode
2021-10-25 Sarah HoffmannMerge pull request #2486 from lonvia/fix-special-phrases
2021-10-25 Sarah HoffmannICU: add an index over word_ids
2021-10-19 Sarah HoffmannMerge pull request #2472 from lonvia/word-count-computation
2021-10-19 Sarah Hoffmannicu: no longer precompute terms
2021-10-19 Sarah Hoffmannmake word recount a tokenizer-specific function
2021-10-11 Sarah HoffmannMerge pull request #2450 from mtmail/tiger-data-2021
2021-10-09 Sarah HoffmannMerge pull request #2460 from lonvia/multiple-analyzers
2021-10-07 Sarah Hoffmannadd documentation for new configuration of ICU tokenizer
2021-10-07 Sarah Hoffmannfix argument description for check_database
2021-10-06 Sarah Hoffmannreorganize and complete tests around generic token...
2021-10-06 Sarah Hoffmannadd tests for sanitizer tagging language
2021-10-06 Sarah Hoffmannapply variants by languages
2021-10-05 Sarah Hoffmannuse analyser provided in the 'analyzer' property
2021-10-05 Sarah Hoffmannremove support for properties on variants
2021-10-05 Sarah Hoffmannprecompute replacements while loading configuration
2021-10-04 Sarah Hoffmannmove parsing of token analysis config to analyzer
2021-10-04 Sarah Hoffmannmake token analyzers configurable modules
2021-10-04 Sarah Hoffmannextend ICU config to accomodate multiple analysers
2021-10-04 Sarah Hoffmannmove flatten_config_list into config module
2021-10-01 Sarah HoffmannMerge pull request #2458 from lonvia/add-tokenizer...
2021-10-01 Sarah Hoffmannadd unit tests for new sanatizer functions
2021-10-01 Sarah Hoffmannintroduce sanitizer step before token analysis
2021-10-01 Sarah Hoffmannunify ICUNameProcessorRules and ICURuleLoader
2021-09-29 Sarah Hoffmannfix typo
2021-09-29 Sarah Hoffmannexport more data for the tokenizer name preparation
2021-09-29 Sarah Hoffmannadd wrapper class for place data passed to tokenizer
2021-09-28 Sarah HoffmannMerge pull request #2455 from lonvia/adjust-address...
2021-09-28 Sarah HoffmannMerge pull request #2454 from lonvia/sort-out-token...
2021-09-27 Sarah Hoffmannremove unused parameter
2021-09-27 Sarah Hoffmannicu tokenizer: switch to matching against partial names
2021-09-04 Sarah HoffmannMerge pull request #2440 from lonvia/generic-config...
2021-09-04 Sarah Hoffmannfix indent
2021-09-03 Sarah Hoffmannintroduce generic YAML config loader
2021-09-03 Sarah HoffmannMerge pull request #2437 from lonvia/tweak-ranking...
2021-09-03 Sarah HoffmannMerge pull request #2436 from lonvia/country-configuration
2021-09-02 Sarah Hoffmannremove language and partition from name import
2021-08-18 Sarah HoffmannMerge pull request #2428 from lonvia/rename-icu-tokenizer
2021-08-17 Sarah Hoffmannrename legacy_icu tokenizer to icu tokenizer
2021-08-17 Sarah HoffmannMerge pull request #2425 from lonvia/tokenizer-document...
2021-08-16 Sarah Hoffmanndefine formal public Python interface for tokenizer
2021-07-28 Sarah HoffmannMerge pull request #2408 from lonvia/icu-change-word...
2021-07-28 Sarah Hoffmannfix Python linitin errors
2021-07-28 Sarah Hoffmannreinstate word column in icu word table
2021-07-28 Sarah Hoffmannadapt unit test for new word table
2021-07-28 Sarah Hoffmannconvert word info column to json before copying
2021-07-28 Sarah Hoffmannswitch word tokens to new word table layout
2021-07-28 Sarah Hoffmannswitch special phrases to new word table format
2021-07-28 Sarah Hoffmannswitch postcode tokens to new word table layout
2021-07-28 Sarah Hoffmannswitch housenumber tokens to new word table layout
2021-07-28 Sarah Hoffmannswitch country name tokens to new word table layout
2021-07-28 Sarah Hoffmannnew word table layout for icu tokenizer
2021-07-13 Sarah HoffmannMerge pull request #2393 from lonvia/fix-flake8-issues
2021-07-12 Sarah Hoffmannuse psycopg's SQL quoting where possible
2021-07-12 Sarah Hoffmannadd helper function for execute_values
2021-07-12 Sarah Hoffmannmore formatting fixes
2021-07-12 Sarah HoffmannMerge pull request #2391 from lonvia/fix-sonar-issues
2021-07-12 Sarah Hoffmannsplit up variant computation for better readability
2021-07-12 Sarah Hoffmannreorganise process_place function
2021-07-07 Sarah HoffmannMerge pull request #2384 from lonvia/actions-add-icu...
2021-07-06 Sarah Hoffmannremove default parameter for namedtuple
2021-07-05 Sarah HoffmannMerge pull request #2371 from lonvia/increase-python...
2021-07-05 Sarah HoffmannMerge pull request #2381 from lonvia/reorganise-abbrevi...
2021-07-04 Sarah Hoffmannlimit the number of variants that can be produced
2021-07-04 Sarah Hoffmannrestrict partial word counting to names of reasoanble...
2021-07-04 Sarah Hoffmannfix subsequent replacements
2021-07-04 Sarah Hoffmannleave ICU variant properties empty for now
2021-07-04 Sarah Hoffmannonly consider partials in multi-words for initial count
2021-07-04 Sarah Hoffmannswitch to a more flexible variant description format
2021-07-04 Sarah Hoffmannuse yaml tag syntax to mark include files
2021-07-04 Sarah Hoffmannmake compund decomposition pure import feature
2021-07-04 Sarah Hoffmanncomplete tests for icu tokenizer
2021-07-04 Sarah Hoffmannfix full term token in special phrases
2021-07-04 Sarah Hoffmanncomplete tests for rule loader
2021-07-04 Sarah Hoffmanncorrectly quote strings when copying in data
2021-07-04 Sarah Hoffmannupdate unit tests for adapted abbreviation code
2021-07-04 Sarah Hoffmannadapt tests for ICU tokenizer
2021-07-04 Sarah Hoffmannmove abbreviation computation into import phase
2021-07-04 Sarah Hoffmannicu tokenizer: move transliteration rules in separate...
2021-06-04 Sarah HoffmannMerge pull request #2358 from AntoJvlt/documentation...
2021-06-02 Sarah HoffmannMerge pull request #2357 from lonvia/legacy-tokenizer...
2021-06-02 Sarah Hoffmannfix insertion of special terms and countries into word...
2021-05-24 Sarah HoffmannMerge pull request #2346 from lonvia/words-vs-tokens
2021-05-24 Sarah Hoffmannadd tests for new full name computation with ICU
2021-05-24 Sarah Hoffmannreorganize keyword creation for legacy tokenizer
2021-05-23 Sarah Hoffmannuse make_keywords for place search terms also
2021-05-18 Sarah HoffmannMerge pull request #2336 from lonvia/do-not-mask-error...
2021-05-18 Sarah HoffmannMerge pull request #2321 from AntoJvlt/csv-import-speci...
2021-05-18 Sarah Hoffmanndo not hide errors when importing tokenizer
2021-05-17 AntoJvltResolve conflicts
2021-05-17 AntoJvltAdded --no-replace command for special phrases importat...
next