Welcome to this lesson that will focus on metadata which are the data on the data and which play an extremely important role to keep track of data acquisition that we do for geographic information systems. The objectives of this lesson consist in emphasizing the importance of metadata, describing their content in the context of geo-information and show when and how these metadata are used. At the end of the lesson, You will be able to document a dataset and consult the metadata of a dataset in order to know if they match your needs or not. In this lesson, we will therefore address successively the general principles of documentation of a data set, we will then see the standardization efforts that have been carried out in the metadata field and we will finish with a few examples illustrating the use of metadata. With the development of information systems and databases from the 60s, 70s, we began to gather masses of information in computer systems, only to realize after a few months or years that all these data had become completely unusable for lack of saving the memory of the conditions in which these data had been acquired, of the type of phenomena they represented, of the accuracy of data, of a whole bunch of parameters of this type which in fact prevented the use of these data in the end. And we started talking of this data cemetery syndrome since all these large databases were finally buried, hence the following two axioms: first of all, the idea that an information system and especially a geographical information system is alive if it is used, and that for such a system to be used, it must contain reliable data, up to date and of which we know what they represent. These two axioms have as a first corollary to emphasize on the maintenance and the updating of data which can represent a significant effort, and as a second corollary, the fact that the volume and the nature of the data stored in an information system must be adapted to the needs of the users and to the available resources to organize the maintenance and the updating. This is why the concept of Big Data, which is very fashionable lately, is encountering a certain scepticism from many people whom I am. The metadata, as data on the data, aim at documenting spatial information so that it is understood in the same way by all its users. This implies that each data layer, that each object, each attribute, is described and characterized from its scale and its nature point of view. If we look now a little closer at what metadata is all about, the first aspect that we address is that of the source of information, where the institution can be documented who produced the data, possibly the institution that published it, that distributes these data, who is the contact person, who assumes the responsibility of updating for example, Etc. Etc. A second important element is the content. Here, in the cadastral data example, first the name and the description of these data. They are said to come from official measurements, that they include realty and soil cover, that they are extracted from an official cadastral database. Etc. Etc. The type of data, geometric, temporal data, numerical values ​​of integer or real type, character chains, vectors, object structures, etc. In our cadastral data example, the plot itself is represented... is of the geometry type represented by a polygon, whilst the attributes, council number, plot number, are integers, the designation and the link, character chains. An information on the reference scale and the accuracy of data, we see here that the scale of the cadastral plan was determined at the time of data entry, usually the 500th scale in urban areas, of the 1,000th in agricultural zones and the 5,000th in mountain areas. The data entry date or the availability date of these data on the infrastructure data, here the 31st December 1983. The quantitative elements constitute another aspect of the metadata including the spatial overlap, in the case of our cadastral data, the whole of the canton of Vaud and the bounding box, that is to say the framework which includes all the data concerned, volume values like the number of objects involved in the data set or possibly the disk space occupied by these data. The access modalities to the data are obviously a very important element, in particular the format in which the data is distributed, the associated rights, the tariffs which can, as seen here, be consulted online, the eventual restrictions which apply to the use of data, the service address or the person in charge of the distribution, here, the State of Vaud. For several years, many efforts to standardize metadata have been undertaken, efforts aimed at enabling a better understanding of geo-data at different levels, whether it is that of the producer, that of the administrator or the end user, to facilitate the trades between various partners and systems, to improve the capacity to search for geographic data and finally to create a reference framework for the development of methods and metadata management tools. At the international level the body in charge of standardization in the ​​geographic information and geomatic field is known as ISO / TC 211. This organization produced the ISO 19100 series of norms some of which are specific to metadata, in particular the 19115 and the 19115-2, which are about the description of geo-data, the 19119 on the description of geo-services and the 19139 on the exchange of metadata. At the European Union level, the standardization takes place through the INSPIRE Directive, in the U.S. it is the Federal Geographic Data Committee and in Switzerland, it's the GM03 standard. All these norms are ultimately very similar since they group the same basic data type with some specific variants. We will now see some examples of uses and consultation of metadata beginning with the geoportal of the IGN-France. In this geoportal, we can activate the data catalogue directly, as well as a display area of the data selection. We choose to display the layer of the buildings that appear on the map and in the selection area, we have the different layers with an information button which provides access to metadata, in this case, the name, a description, the data producer and the legend which we see is different when the scale is smaller than the 30,000th or greater. So, if you zoom in a bit in this map to get closer a little, to see these things on a smaller scale, we see that indeed, the richness of description is a little larger with a greater variety of building types. The data catalogue is also directly accessible with the list of different data available. Here, we choose the cadastral plots to display them. We see that these plots occupy the entire country by default. So we zooms for a more precise zone to have some detailed information. And again, the access to metadata is done by the "information" button, the name, a description, the input scale, the legend, etc. In the data catalog, we can add for the example a transport-related layer, in this case, the road network, this road network on a larger scale appears more clearly, and again the "information" button gives access to some metadata, particularly the producer which is the IGN and the legend of the road network. We see that in the case of IGN-France, the richness in metadata is relatively low. Second example, the geoportal of the Walloon region. We see that on the homepage, we have a direct access to the geo-catalog with several themes and in each theme, a set of applications which are all GIS web systems accessible online and for each of these applications, a fact sheet with in particular the use access conditions, the quality of data, the contact person etc. The same for geo-data with for example here for the land use map of Wallonia all the relevant metadata. The data can be accessed directly in an online map and in the online interface we find the data catalogue with its various themes. Here the regional road network. And for each theme, as earlier an information button which gives access to the same fact sheet containing keywords, the updating dates, the use access conditions, the quality of data, the contact person etc. Here we see in the quality the scale and the way with which the dataset has been constituted, especially here on the basis of digitalization, and the contact details of the contact person if necessary. In an additional button offering access to more information, we have access to the complete model of metadata, with the possibility of choosing the type of presentation, according to the European norm, the American norm or others and different actions, including the possibility of exporting these data in different formats. These complete metadata include general information, of elements describing the origin of the resource, the geometrical aspects, the contact details of the contact person for the metadata, the technical information elements such as the use access conditions etc. The geoportal of the Swiss Confederation which, in its interface, also includes the data catalogue, with data classified by thematic and here we will search in the land cover / use a map of the forests diversity. Also a small button of information which gives access to the metadata, so the name and description of this layer which was built on a Landsat image, the legend and additional information, particularly a link to the website of the data geocatalogue which includes the entire metadata of the different layers concerned, the possibility to export them in different formats. These metadata contain a set of information, the reference information, a descriptive summary, the data of the contact person for the resource, the same for the update and the scheduled update dates, the constraints of use, particularly the existence of a license, the numerical, quantitative aspects, so the covered area, the method of distribution, in particular the formats, the reference system used so the projection system, the quality of data, the metadata managers and the information on the legislative framework which in fact includes the use of these data. Another example, with the hydrographic network, we see here that the metadata also includes the detailed legend of the different types of watercourse sections and again links to different pieces of information, in this case also the geocatalogue where we find the same data structure as earlier, with this time having perhaps the scale of 25,000th as a particularity which was used to digitalize the data, the formats in which this data is available, so the ESRI shapefile, but also the different Internet links, the hyperlinks, which enable to access the Swisstopo store, the Swisstopo online store which allows to buy these data, the system of reference projection, EPSG: 21781, which we have already talked about in a previous lesson and the contact information for the management of metadata, to conlude again by the legislative elements, in particular the order on geo-information, which is the legal framework which establishes the use of geographical data in Switzerland. And finally the geoportal of the Canton of Vaud, in which we displayed the layers related to land property and we zoom on the small village of Arnex-sur-Orbe to find the cadastral data that we used in our previous examples, the parcel with its attributes and we see that the link points to the land register data with the type of land use and the owner of the plot. We see that the basic data, the realty, the metadata can be consulted directly online and there is an analog system to what we have already seen with the name, the description of the layer, and then a series of tabs which can open or close to access various other aspects of the metadata, in this case a link to the pricing of data, a link to the data structure, towards the data model which was used, a concept that we will discuss in the second module of the course. We also see that other types of information can be displayed, here the information relating to the water protection field, with in particular the hydrographic network and in this case in the region, a particular watercourse which is the Talent if I am not mistaken, yes the Talent, with the attributes related to this watercourse, so its identifier, its number etc. and again the access to the metadata of the hydrographic network layer and we see that this layer comes from a system called Gesreau, which is in fact the watercourse management GIS of the Canton of Vaud, the data are accessible to the public, the pricing rules are known, there is a basic rate of 25 francs to recover and use these data, the principles of diffusion, some technical complement elements, the reference system so it is the same projection system, it is a Swiss projection that has been used, the date on which the data was created, the date of the last update, and finally those responsible for the diffusion. To conclude we will see how to document a data set in QGIS, the GIS software that we will use throughout this course. We will use for this a project containing data from the Seychelles, a project whose preparation is the subject of a lesson which will be presented as part of the second module of this course. we see that by right clicking on the layer of hotels, these properties can be accessed, amongst which a heading contains the metadata with a descriptive part, a title, hotels, a short summary to describe in fact what it is about, possibly a list of keywords, a data format, so text or HTML, possibly an Internet link, a URL address, and then a set of properties which have already been defined, in particular the data format, the ESRI shapefile, the type of geometry of points, the number of objects and the hold in the units of the projection system used, and the coordinates of this projection system, so UTM zone 40 South. Another example with the district layer where we see again that by default, the data type is an SQL Spatial Lite database, with the address of the file containing this data, the type of data, the polygons, the number of objects, 25 and again the total hold of the layer and then the projection system used, so again UTM 40 South. We see that in the standard version of QGIS, the possibilities of documenting a data set are relatively limited. There are several extensions which are under development which will, gradually with time, correct this existing situation and offer more complete instruments for the metadata capture. So we saw during this lesson that metadata play a very important role in describing the datasets that we use in geographical information systems. We have seen the type of information which were stored, name, description, creation date type of content etc. We have seen that these elements had been the subject of several standardization efforts in order to have a a somewhat unified description of the data sets and we also saw that these elements were used particularly in national data infrastructures, the national geoportals in France, Belgium, Switzerland, to describe the available data sets.