CanWIN Curated Terms

Using curated terms or controlled vocabularies is an important step in data curation. It is a means of standardizing data so that it can be easily understood and reused, as well as shared between different systems. Controlled vocabularies are applied to your data before it is published. 

Keywords

Keywords are words that highlight the main ideas in your research. They also allow for your data to be easily found when searching the Data Catalogue or any other online repository. CanWIN has a list of standardized keywords used for Arctic and freshwater data, with the primary source being the Polar Data Catalogue (PDC). If there are keywords you would like to add to your data that are not on this list, please contact a data curator. 

CanWIN's Curated Keywords

 

Describing variables

There are certain variable descriptors that can be added to a tabular data, that gives a user more information about the variables in your data file. These descriptors are added as columns in your data file beside the variable they describe, or they are added to your dataset's data dictionary

For example, Result Value Qualifiers (RVQs) are a list of codes that explain erroneous or missing data. Consider the case where your dataset contains erroneous values because your instrument failed. By adding a result_value_qualifier column to your data, and adding the code FEF (Field Equipment failed) where the values are erroneous, this allows users to understand right away that there is an issue with those values because of instrument failure.

CanWIN curates a specific controlled vocabulary list for describing variables in your dataset, that we recommend you include where applicable.

CanWIN's Variable Descriptors

 

Variable names

Controlled vocabularies for variables are curated naming conventions that allow us to use a standard variable name that is associated with a specific definition. Much like Digital Object Identifiers (DOI's), these naming conventions and definitions have a permanent location, meaning they can always be referenced. This benefits the researcher since it allows their data variables to be described unambiguously, facilitating reuse by experts and non-experts alike.

CanWIN recommends that you use standardized variable names as the column headers in your tabular datasets. If your variable is not yet on our list, a data curator can work with you to find an appropriate standardized variable name. 

Click the button below to be taken to CanWIN's standardized vocabulary web application, which will help you to find a standardized name for your variable. Simply choose a variable category, and click on each standardized term under that category for their definitions and source information. 

CanWIN's Standardized Vocabulary Web App

Back Next