Return to GEMINI 2.3 home page
The UK Location Discovery Metadata Service (DMS) lies at the heart of UK Location and the delivery of the UK Location Strategy and INSPIRE - 'to know what data we have'.
The DMS underpins the coordinated and regulated publishing of public sector location information to the INSPIRE and UK Location specified standards. It provides the discovery component for a set of on-line services that will allow data users to evaluate and use public sector location information, that is to view, download and invoke as part of an end business application.
This guide sets out detailed guidance on the creation, publishing and maintenance of UK Location discovery metadata.
It covers both location information being published under INSPIRE and all other location information published through UK Location. The DMS is being implemented by extending the functionality of the UK public data publishing platform data.gov.uk.
The guide is a living document and will be continuously revised in the light of operational experience and feedback.
Please note that whilst the Devolved Administrations of Scotland, Wales and Northern Ireland are part of UK Location, they may have different publishing mechanisms to those described here. Therefore in the first instance, please use the appropriate contact information listed under “Where to obtain more information” below.
The primary audience for the Guide are business managers and information officers, located within data provider organisations, and their technical partners.
It is chiefly written for those data providers who have obligations under the INSPIRE Themes, but is also relevant to those who wish to publish location information into UK Location on a voluntary basis.
The guide will also be of interest to anyone who requires a general understanding of the UK Location Discovery Metadata Service and how it is intended to operate.
This guide assumes that the reader is familiar with the creation and management of metadata and has read the UK Location “Getting Started' series of guides.
Readers requiring an introduction to discovery metadata for geospatial data resources are referred to the UK GEMINI guide “Metadata Guidelines for Geospatial Data Resources, Introduction – Part 1” .
Throughout this guide we refer to “Data Providers” and “Data Publishers”. Within UK Location, the definition and distinction being made between these two roles is very important. So what is the difference?:
The organisation that creates the data and supplies the data for web publication, along with its metadata
The organisation that publishes the data on the web and supplies data services to data users
If you as a Data Provider publish directly, then you will perform both the roles of Data Provider and Data Publisher.
The latest versions of all the UK Location resources referred to in this guide can be found via the UK Location Resource Centre:
The latest information, and additional resources, can be obtained by visiting the UK Location web site.
If you would like to contact the UK Location Coordination Unit, please use the contact form at: http://location.defra.gov.uk/resources/contact-us/
If you are looking to publish location information specific to Scotland, Wales or Northern Ireland, please contact:
Alex Ramage, Spatial Information Policy, Scottish Government
Geography & Technology, Welsh Assembly Government
Data Providers publish their data and on-line services into UK Location by creating and publishing discovery metadata. These resources are registered with UK Location and made discoverable through its central metadata catalogue, which forms part of the public data access portal, data.gov.uk.
Deciding to publish data into UK Location is a major commitment. INSPIRE and UK Location is not a simple cataloguing service of what data exists – it is a network of on-line services for that data and an operational framework for future harmonisation and interoperability, such that it can be more easily combined within a data user application, or used directly through a range of on-line data services.
If you choose to publish, the following operational commitments apply:
The following service requirements apply:
Accountability for publishing data and associated resources rests with you the Data Provider. If you are a Data Provider and are publishing through a third party, they are responsible for providing a service to you as the Data Provider, but they are not accountable for the data and services published. Accountability at all times remains with you as Data Provider, e.g. in relation to INSPIRE compliance and conformance to INSPIRE and UK Location technical guidance.
The process for publishing metadata records for data is shown as a process model below.
This is highly generalised and you will need to view it in the context of your own internal data management processes and integrate it into this existing fabric.
The creation and publishing of discovery metadata for a dataset or series is triggered by the business decision (event) to publish the data.
The Data Provider creates the discovery metadata for the dataset or series, to the metadata requirements specified in this guide. The Data Provider should then apply internal quality controls on this metadata, before releasing it for publication.
Ideally, the metadata should be created by the part of the organisation responsible for the data itself and based on metadata created at the time the data was created, or updated.
The next stage of the process passes to the Data Publisher. This role may be within the same organisation as the Data Provider, or a third party acting on behalf of the Data Provider. Even if within the same organisation, it is quite likely to be a different part of the organisation, e.g. the data & knowledge management department, or within the IT department.
The Data Publisher needs to update the metadata, creating the links to any on-line resources associated with the data, e.g. data specifications, licences and services. If these do not already exist, then they may need to be created before the data can be published.
Once updated, the metadata needs to be passed through the Data Publisher’s own internal quality controls. At this point the metadata can be released for publication and registered with UK Location.
The final stage of the process passes to UK Location. At the point of registration, the discovery metadata is collected automatically from the Data Publisher, and subjected to a number of quality controls. Once complete, the Data Publisher is notified of the results. If no errors are recorded, the resource is added to the UK Location Catalogue and becomes discoverable through the UK Location discovery service. If errors are found, then the Data Publisher will need to correct these and re-submit the resource.
The process for publishing metadata records for on-line services is shown as a process model below.
This is highly generalised and you will need to view it in the context of your own internal data publishing processes and integrate it into this existing fabric.
The process is triggered by the creation of an on-line service for a published dataset or series. This may be a new service, or the use of an existing service, which has been extended to support the additional data resource. If the former, a service metadata record will need to be created. If the latter, than the existing service metadata record will need to be revised to couple it to the new dataset or series.
Having created or revised the service metadata, then as with data metadata records, it needs to be subject to internal quality controls and release management.
Collection of the service metadata record is as for data, unless it is an existing service, in which case it will be an update of the existing registration.
UK Location requires that Data Providers and their Data Publishers maintain their published metadata records in line with the INSPIRE Metadata Implementing Rule. This requires that the metadata elements and the record itself are kept up to date. For example, if you decide to change your published dataset, or move the location of a resource e.g. view service, then you are responsible for ensuring that these changes are updated in your metadata.
Data sharing, including charging and licensing, 3rd party intellectual property, and e-commerce issues are covered under separate UK Location data sharing operational guidance. This is in the process of being published. Please visit the UK Location Resource Centre for more details.
UK Location discovery metadata will be used by potential data users to discover and evaluate published location information for use in a range of applications. To enable the data user to do this, they require comprehensive information about the data and services. The better the discovery metadata, the more likely the data will be shared and re-used in an appropriate way.
GEMINI 2.3 has been updated so that UK Location discovery no longer needs an application profile.
Considerable care needs to be taken over the creation of discovery metadata. It is strongly recommended that you use a metadata editor tool, that fully supports the encoding requirements defined in  and uses the UK Location XSD and Schematron schemas [28,30], to ensure that your metadata is fully compliant with UKL and INSPIRE requirements.
For a detailed explanation of each element, please refer to the UK GEMINI2 standard and associated guidance. For metadata describing data (datasets and services), click here. For metadata describing services, click here.
Some metadata elements are particularly important to the operation of the DMS, rather than to the users of the DMS.
To support the operation of UK Location and INSPIRE, discovery metadata records must include a File Identifier for the resource. The file identifier is the unique persistant identifier of the metadata record; it must not change when the record is updated.
For more information please refer to the metadata element description here.
This must be updated when ever the metadata record is edited. Data.gov.uk harvesting will only collect the new record if the Metadata date is more recent than the old one.
For more information please refer to the metadata element description for Metadata date.
UK Location and INSPIRE is a network of distributed on-line services that provide access to published location information. To provide this access, the discovery metadata about the data needs to tell the data user what services are available and how to access them, in a machine-readable way. This is achieved using the discovery metadata element “Resource Locator”.
For the network to operate, it is critical that these links are correct:
The Resource Locator must define the Uniform Resource Locator (URL) for the on-line services that provide access to it.
As well as the URL, it is also necessary to provide distribution format information in all cases.
If the format and version are known, then they should be given.
If no on-line services are available for access, then you should enter the URL for further information about access.
Depending on the nature of the on-line service, machine-readable or human readable, one of two URLs should be provided - one giving the URL of an OGC web service ‘GetCapabilities request’, e.g.service=WMS&request=GetCapabilities&, or the location of further information about a service.
The following Data Provider licensing information must be provided:
Upon collection, UK Location will record the Use Constraints text and URL in the 'licence' field displayed on data.gov.uk. The UK Government encourages public bodies to release data explicitly under an open licence, which on data.gov.uk currently includes the "Open Government Licence" and the "OS OpenData Licence". For data.gov.uk to correctly categorise your data as open, ensure these licence titles are stated in the Use Constraints text field exactly as quoted here, for example “Open Government Licence”.
Published discovery metadata XML document resources will be validated in a three stage process, using:
Data Publishers are required to validate their metadata records using these before resource registration. UK Location has published guidance on using its Schematron rules .
Within a given organisation or thematic community, there may be the need to record additional items of metadata to meet local requirements. Existing examples include Marine Environmental Data and Information Network (MEDIN) and Academic Geospatial Metadata Application Profile (AGMAP).
UK Location will accept additional metadata elements within a metadata record XML document. These elements must be taken from ISO 19115 and be encoded in conformance with ISO 19139.
UK GEMINI2 includes a number of code lists taken from ISO 19115. Within a given organisation or thematic community, you may wish to extend these code lists. These extensions need to be incorporated into the INSPIRE and/or UK Location specified code lists before they can be used. If you wish to extend a code list for an INSPIRE metadata element, please contact the UK Location Coordination Unit.
The type of data being published will have a strong bearing on the content of the metadata created. It is strongly recommended that data providers create local guidance, based on UK GEMINI2 and this operational guide.
Publishing discovery metadata for UK Location for data and services is a three stage process:
This involves transferring your completed discovery metadata records to an on-line location, which can be machine accessed through the Internet. The records must be accessible using one of two mechanisms:
These mechanisms will be used by the Discovery Metadata Service, to collect registered discovery metadata records. For more information about this interface, please refer to the DMS Collection Interface Specification .
This can take a number of forms. It may be local to your business unit, or organisation, or a community resource, shared across a number of organisations.
The most important aspect of the harvest location is that it holds the published MASTER of the resources being published, i.e. it is your control point for their publication to UK Location and that it is the endpoint for services and their Coupled Resources.
If publishing elsewhere, e.g. to thematic or regional catalogues and information portals, you should consider using this same control point – applying the ‘publish only once principle’.
How you organise your records will be subject to your own local needs and practices. Possible catalogue/file directory structures include:
You can reflect this structure when you register harvest locations with UK Location, i.e. you can register all of the resources under a particular business unit or theme, and apply a filter against the rest of the contents.
The only mandatory structure to apply is if the location contains resources associated with more than one Data Provider. These need to be registered separately, so your harvest location must be structured such that these can be identified as a group.
Please note that initially the above will only be supported if using WAF as your publishing mechanism. If using CSW, then the specified URL must contain only files for publication to UK Location, published under a single registration, relating to a single Data Provider.
Subject to demand, CSW filtering will be introduced as part of a later release.
If you have an immediate requirement for UK Location to support CSW Filtering, by category or element, please contact the UK Location Coordination Unit, using the contact form at: http://location.defra.gov.uk/resources/contact-us/.
UK Location discovery metadata records are registered using the UK Government data publishing and discovery portal data.gov.uk.
Data.gov.uk is a single point of registration, discovery and access for a range of data, not purely location information. It provides two channels for registering discovery metadata.
Data and services published through UK Location must be registered using the dedicated data.gov.uk channel – Collection (discovery metadata).
The end-to-end process for publishing UK Location discovery metadata records for data and services, using data.gov.uk, is shown below.
Data and services are published against a registered organisation, e.g. government department or agency. This is the organisation legally accountable for the data being published - the UK Location “Data Provider”.
Please note that within data.gov.uk, the Data Provider is referred to as the “Publisher” for metadata and data resources, irrespective of whether they are publishing directly, or indirectly through a third party.
The registration of data.gov.uk “Publishers” is through a process of ‘request and approval’. The process is explained within the “Data” tab of data.gov.uk
If publishing directly, you will need to determine if a data.gov.uk Publisher account already exists for your organisation (this is clear within the online application process).
You can then either apply for editor access (which lets you register and administer published data resources) or administrator access, which gives you the same powers as editor access but also lets you edit publisher’s details (including name, description, and contact information), and authorize access for other users to be editor or administrator for the organisation.
If you are a third party Data Publisher, i.e. publishing on behalf of a Data Provider, you will need to adopt a different approach for data and services. For data, discovery metadata records must be published under the Data Provider’s name. For Services, these must be published under your Data Publisher’s name.
You will need to ensure that you have formal approval to publish from your Data Provider. This will be checked by The National Archives before approval is given.
Before you can publish data and services on data.gov.uk, you will need to sign up and associate yourself with one or more Publishers.
For further information, see the data.gov.uk User Guide 
If there is not an existing data.gov.uk Publisher for the UK Location Data Provider, you will need to contact the Cabinet Office data.gov.uk team to request the addition of your publisher: http://data.gov.uk/publisher/apply/cabinet-office
Publishers are associated with two tiers of user:
Typically, a Publisher will only have one Administrator, although additional Administrators may be added, e.g. to provide cover, or temporarily to facilitate a hand-over.
But, a Publisher may have a number of Editors, i.e. users with responsibility for publishing different datasets associated with different parts of the organisation.
Please note that Administrators and Editors can see and edit ALL resources associated with a Publisher, not just their own.
The addition of Administrators and Editors can only be done by the account Administrators; you can request this via the data.gov.uk site, or by contacting the account administrator directly.
Having created an account on data.gov.uk and gained editor rights to the relevant Publisher, you are now in a position to register metadata harvesting resources, for that Publisher.
To register a metadata harvesting resource, select [Dataset Harvesting] from the Publisher tools sub-menu, which appears on the “Data” tab.
Select [Add a harvesting resource]. Using the displayed form, enter:
The registration will establish a collection task, which will collect the discovery metadata records located from the specified URL.
It should be noted that in time, this same mechanism will be used to publish other resources, associated with a given Publisher, e.g. machine-readable Licences. It is also the intention to extend the functionality to support:
The collection task created as described above is only run when requested, that is, when you click the ‘Refresh’ button for that harvest source, which is on the list of harvesting sources visible to the administrators of the publisher.
Once collected, the individual data and service discovery metadata records will be subject to the following validation checks:
If the validation is successful, the metadata record will be visible within data.gov.uk, and will also be passed on to the Catalogue Publishing Service (data.gov.uk’s OGC CSW interface), from which it will be collected by the European INSPIRE GeoPortal.
In the event of an error being found, an error message will be returned to the Discovery Metadata status panel. Those that pass validation, will be added to the data.gov.uk metadata catalogue and the XML documents stored for subsequent use. Resources will take approximately 30 minutes to appear on data.gov.uk once successfully collected.
If, having harvested from a metadata resource, the collection task fails, errors will be returned to the Discovery Metadata status panel. Potential errors include:
INSPIRE carries out its own validation. At present, it does not reject any records. A searchable collection of validation reports is available at http://inspire-geoportal.ec.europa.eu/proxybrowser/ProxyBrowser/; this includes validation reports on metadata records, services, and layers within services.
To maintain previous discovery metadata registrations, first select [Data] then [Dataset Harvesting] from the Publisher tools sub-menu.
From this list of harvesting sources, you can
Once a discovery metadata record has been registered, the record should not be removed from the source location. If, in exceptional cases, you need to withdraw a metadata record:
Remember to ensure that you fix any links between metadata records. That is, if you withdraw a service record, check that no dataset records still point to it, e.g. by Resource locator. If you withdraw a dataset record, check that no series or service records still point to it, e.g. by Coupled resource,
This will then be promulgated to the Catalogue Publishing Service (data.gov.uk CSW interface) within a few hours, and from there should result in the record being removed from the European INSPIRE GeoPortal.
This should be an even rarer exception. If your harvest source contains a metadata record with the same file identifier as has been used before, but with a more recent metadata date, then when that record is harvested, it will be re-instated at data.gov.uk, and from there will appear in the Catalogue Publishing Service and the INSPIRE GeoPortal.
There are a number of technical solution options that will help you create UK Location discovery metadata records and publish them to UK Location. These can cover some or all of the following elements of a metadata record creation and publishing environment:
These options are outlined below. Using the information provided here, it is recommended that you then discuss your specific requirements with your IT department and/or system supplier.
UK Location includes its own Metadata Editor that can be used by Data Providers and Publishers to create compliant discovery metadata records.
The Editor is a web browser application, based on the open source product Geonetwork. It is available both as an on-line service and as a downloadable configuration package for use as part of a local server-based installation of Geonetwork.
For more information, to register for the on-line service or to download the configuration package, please visit the UK Location Resource Centre.
The use of Geonetwork as the basis for the UK Location Metadata Editor provides the option for Data Providers and Publishers to extend a local instance of the editor to provide cataloguing, change and release management and publishing (using CSW) services. This is all out-of-the-box Geonetwork functionality. In addition, there are other open source products that enable you to integrate Geonetwork with your existing GIS and build a full data publishing environment, supporting the delivery of on-line services, e.g. View (WMS) and Download (WFS) services.
For more information on Geonetwork and links to other open source Geoweb products, please visit: http://geonetwork-opensource.org
Most of the providers of Geospatial Information Systems provide metadata creation and publishing tools as part of their solution packages.
UK Location have created a number of resources to help solution providers integrate the support for UK Location discovery metadata creation and publishing into these existing packages:
All these resources are available through the UK Location Resource Centre. We would recommend that you speak to your system supplier to determine the support they can provide and to discuss how this can be best integrated into your existing systems.
Last updated: February 2018
This work is licensed under a Creative Commons Attribution 4.0 International License