X-Git-Url: https://git.openstreetmap.org./nominatim.git/blobdiff_plain/2e5c8b5cd33cd2647414a87d8fd8bcae8b4698ec..baee6f3de09226c3dc41cb2314a0ac348e865561:/docs/admin/Migration.md

diff --git a/docs/admin/Migration.md b/docs/admin/Migration.md
index 9b6395e6..950f7e19 100644
--- a/docs/admin/Migration.md
+++ b/docs/admin/Migration.md
@@ -1,17 +1,111 @@
 # Database Migrations
 
-This page describes database migrations necessary to update existing databases
-to newer versions of Nominatim.
+Since version 3.7.0 Nominatim offers automatic migrations. Please follow
+the following steps:
 
-SQL statements should be executed from the PostgreSQL commandline. Execute
-`psql nominatim` to enter command line mode.
+* stop any updates that are potentially running
+* update Nominatim to the newer version
+* go to your project directory and run `nominatim admin --migrate`
+* (optionally) restart updates
 
-## 3.5.0 -> master
+Below you find additional migrations and hints about other structural and
+breaking changes. **Please read them before running the migration.**
+
+!!! note
+    If you are migrating from a version <3.6, then you still have to follow
+    the manual migration steps up to 3.6.
+
+## 4.0.0 -> master
+
+### ICU tokenizer is the new default
+
+Nominatim now installs the [ICU tokenizer](../customize/Tokenizers.md#icu-tokenizer)
+by default. This only has an effect on newly installed databases. When
+updating older databases, it keeps its installed tokenizer. If you still
+run with the legacy tokenizer, make sure to compile Nominatim with the
+PostgreSQL module, see [Installation](Installation.md#building-nominatim).
+
+### geocodejson output changed
+
+The `type` field of the geocodejson output has changed. It now contains
+the address class of the object instead of the value of the OSM tag. If
+your client has used the `type` field, switch them to read `osm_value`
+instead.
+
+## 3.7.0 -> 4.0.0
+
+### NOMINATIM_PHRASE_CONFIG removed
+
+Custom blacklist configurations for special phrases now need to be handed
+with the `--config` parameter to `nominatim special-phrases`. Alternatively
+you can put your custom configuration in the project directory in a file
+named `phrase-settings.json`.
+
+Version 3.8 also removes the automatic converter for the php format of
+the configuration in older versions. If you are updating from Nominatim < 3.7
+and still work with a custom `phrase-settings.php`, you need to manually
+convert it into a json format.
+
+### PHP utils removed
+
+The old PHP utils have now been removed completely. You need to switch to
+the appropriate functions of the nominatim command line tool. See
+[Introducing `nominatim` command line tool](#introducing-nominatim-command-line-tool)
+below.
+
+## 3.6.0 -> 3.7.0
+
+### New format and name of configuration file
+
+The configuration for an import is now saved in a `.env` file in the project
+directory. This file follows the dotenv format. For more information, see
+the [installation chapter](Import.md#configuration-setup-in-env).
+
+To migrate to the new system, create a new project directory, add the `.env`
+file and port your custom configuration from `settings/local.php`. Most
+settings are named similar and only have received a `NOMINATIM_` prefix.
+Use the default settings in `settings/env.defaults` as a reference.
+
+### New location for data files
+
+External data files for Wikipedia importance, postcodes etc. are no longer
+expected to reside in the source tree by default. Instead they will be searched
+in the project directory. If you have an automated setup script you must
+either adapt the download location or explicitly set the location of the
+files to the old place in your `.env`.
+
+### Introducing `nominatim` command line tool
+
+The various php utilities have been replaced with a single `nominatim`
+command line tool. Make sure to adapt any scripts. There is no direct 1:1
+matching between the old utilities and the commands of nominatim CLI. The
+following list gives you a list of nominatim sub-commands that contain
+functionality of each script:
+
+* ./utils/setup.php: `import`, `freeze`, `refresh`
+* ./utils/update.php: `replication`, `add-data`, `index`, `refresh`
+* ./utils/specialphrases.php: `special-phrases`
+* ./utils/check_import_finished.php: `admin`
+* ./utils/warm.php: `admin`
+* ./utils/export.php: `export`
+
+Try `nominatim --help` for more information about each subcommand.
+
+`./utils/query.php` no longer exists in its old form. `nominatim search`
+provides a replacement but returns different output.
+
+### Switch to normalized house numbers
+
+The housenumber column in the placex table uses now normalized version.
+The automatic migration step will convert the column but this may take a
+very long time. It is advisable to take the machine offline while doing that.
+
+## 3.5.0 -> 3.6.0
 
 ### Change of layout of search_name_* tables
 
 The table need a different index for nearest place lookup. Recreate the
-indexs suing the following shell script:
+indexes using the following shell script:
 
 ```bash
 for table in `psql -d nominatim -c "SELECT tablename FROM pg_tables WHERE tablename LIKE 'search_name_%'" -tA | grep -v search_name_blank`;
@@ -24,7 +118,7 @@ done
 
 The debugging UI is no longer directly provided with Nominatim. Instead we
 now provide a simple Javascript application. Please refer to
-[Setting up the Nominatim UI](../Setup-Nominatim-UI) for details on how to
+[Setting up the Nominatim UI](Setup-Nominatim-UI.md) for details on how to
 set up the UI.
 
 The icons served together with the API responses have been moved to the
@@ -39,13 +133,15 @@ which needs a different database index. Create it with the following SQL command
 
 ```sql
 CREATE INDEX idx_placex_pendingsector_rank_address
-  ON placex USING BTREE (rank_address, geometry_sector) where indexed_status > 0;
+  ON placex
+  USING BTREE (rank_address, geometry_sector)
+  WHERE indexed_status > 0;
 ```
 
 You can then drop the old index with:
 
 ```sql
-DROP INDEX idx_placex_pendingsector
+DROP INDEX idx_placex_pendingsector;
 ```
 
 ### Unused index
@@ -53,7 +149,7 @@ DROP INDEX idx_placex_pendingsector
 This index has been unused ever since the query using it was changed two
 years ago. Saves about 12GB on a planet installation.
 ```sql
-DROP INDEX idx_placex_geometry_reverse_lookupPoint
+DROP INDEX idx_placex_geometry_reverse_lookupPoint;
 ```
 
 ### Switching to dotenv
@@ -66,6 +162,14 @@ configuration file, run the following command after updating:
 ./utils/setup.php --setup-website
 ```
 
+### Update SQL code
+
+To update the SQL code to the leatest version run:
+
+```
+./utils/setup.php --create-functions --enable-diff-updates --create-partition-functions
+```
+
 ## 3.4.0 -> 3.5.0
 
 ### New Wikipedia/Wikidata importance tables
@@ -78,10 +182,14 @@ follows:
 * reimport the tables: `./utils/setup.php --import-wikipedia-articles`
 * update the functions: `./utils/setup.php --create-functions --enable-diff-updates`
 * create a new lookup index:
-```
-CREATE INDEX idx_placex_wikidata on placex
-USING BTREE ((extratags -> 'wikidata'))
-WHERE extratags ? 'wikidata' and class = 'place' and osm_type = 'N' and rank_search < 26
+```sql
+CREATE INDEX idx_placex_wikidata
+  ON placex
+  USING BTREE ((extratags -> 'wikidata'))
+  WHERE extratags ? 'wikidata'
+    AND class = 'place'
+    AND osm_type = 'N'
+    AND rank_search < 26;
 ```
 * compute importance: `./utils/update.php --recompute-importance`
 
@@ -138,7 +246,7 @@ The new format is
 
 ### Natural Earth country boundaries no longer needed as fallback
 
-```
+```sql
 DROP TABLE country_naturalearthdata;
 ```
 
@@ -164,27 +272,37 @@ following command:
 The reverse algorithm has changed and requires new indexes. Run the following
 SQL statements to create the indexes:
 
-```
+```sql
 CREATE INDEX idx_placex_geometry_reverse_lookupPoint
-  ON placex USING gist (geometry)
-  WHERE (name is not null or housenumber is not null or rank_address between 26 and 27)
-    AND class not in ('railway','tunnel','bridge','man_made')
-    AND rank_address >= 26 AND indexed_status = 0 AND linked_place_id is null;
+  ON placex
+  USING gist (geometry)
+  WHERE (name IS NOT null or housenumber IS NOT null or rank_address BETWEEN 26 AND 27)
+    AND class NOT IN ('railway','tunnel','bridge','man_made')
+    AND rank_address >= 26
+    AND indexed_status = 0
+    AND linked_place_id IS null;
 CREATE INDEX idx_placex_geometry_reverse_lookupPolygon
   ON placex USING gist (geometry)
   WHERE St_GeometryType(geometry) in ('ST_Polygon', 'ST_MultiPolygon')
-    AND rank_address between 4 and 25 AND type != 'postcode'
-    AND name is not null AND indexed_status = 0 AND linked_place_id is null;
+    AND rank_address between 4 and 25
+    AND type != 'postcode'
+    AND name is not null
+    AND indexed_status = 0
+    AND linked_place_id is null;
 CREATE INDEX idx_placex_geometry_reverse_placeNode
   ON placex USING gist (geometry)
-  WHERE osm_type = 'N' AND rank_search between 5 and 25
-    AND class = 'place' AND type != 'postcode'
-    AND name is not null AND indexed_status = 0 AND linked_place_id is null;
+  WHERE osm_type = 'N'
+    AND rank_search between 5 and 25
+    AND class = 'place'
+    AND type != 'postcode'
+    AND name is not null
+    AND indexed_status = 0
+    AND linked_place_id is null;
 ```
 
 You also need to grant the website user access to the `country_osm_grid` table:
 
-```
+```sql
 GRANT SELECT ON table country_osm_grid to "www-user";
 ```
 
@@ -192,7 +310,7 @@ Replace the `www-user` with the user name of your website server if necessary.
 
 You can now drop the unused indexes:
 
-```
+```sql
 DROP INDEX idx_placex_reverse_geometry;
 ```
 
@@ -221,8 +339,8 @@ CREATE INDEX idx_postcode_geometry ON location_postcode USING GIST (geometry);
 CREATE UNIQUE INDEX idx_postcode_id ON location_postcode USING BTREE (place_id);
 CREATE INDEX idx_postcode_postcode ON location_postcode USING BTREE (postcode);
 GRANT SELECT ON location_postcode TO "www-data";
-drop type if exists nearfeaturecentr cascade;
-create type nearfeaturecentr as (
+DROP TYPE IF EXISTS nearfeaturecentr CASCADE;
+CREATE TYPE nearfeaturecentr AS (
   place_id BIGINT,
   keywords int[],
   rank_address smallint,