Occurrence Data

介紹

Resources which present evidence of the occurrence of a species at a particular place and normally on a specified date. These datasets expand on most Checklist Data because they contribute to mapping the historical or current distribution of a species. At the most basic, such datasets may provide only general locality information (even limited to a country identifier). Ideally they also include coordinates and a coordinate precision to support fine scale mapping. In many cases, these datasets may separately record multiple individuals of the same species. Examples of such datasets include databases of specimens in natural history collections, citizen science observations, data from species atlas projects, etc. If sufficient information exists in the source dataset (or applies consistently to all occurrences in the dataset), it is recommended that these datasets are presented as Sampling Event Data. These datasets include the same basic descriptive information included under Resource metadata.

如何將您的資料轉換為物種出現紀錄

最終，您的資料需要使用達爾文核心標準（DwC）的術語名稱作為欄位名稱以轉換為表格結構。

您可以嘗試將資料放入 Excel 模板中，該模板包含所有必填的達爾文核心集欄位和建議填寫的達爾文核心集欄位。

或者，如果您的資料儲存在支援的資料庫中，您可以使用 DwC 欄位名稱撰寫 SQL 表格（視圖）。請務必包含所有必填的達爾文核心集欄位，並盡可能多地新增建議填寫的達爾文核心集欄位。

For extra guidance, you can look at the exemplar datasets.

You can augment your table with extra DwC columns, but only DwC terms from this list.

模板

Populate it and upload it to the IPT. Try to augment it with as many GBIF-supported Occurrence Core terms and extensions as you can. You can augment your core table with extra DwC columns, but only DwC terms from this list.

Required and recommended DwC fields

You can review the required and recommended fields in the Data quality requirements for occurrences.

示範資料集

CUMV Amphibian Collection (Arctos) (also registered on GBIF here: https://doi.org/10.15468/emivh3)

問答集

問：如何表示某地沒有這個物種（指出物種的缺如狀態）？

A. Set occurrenceStatus="absent". In addition, individualCount and organismQuantity should be equal to 0.

問：如何模糊化敏感物種的出現紀錄？

A. How you generalize sensitive species data (e.g. restrict the resolution of the data) depends on the species' category of sensitivity. Where there is low risk of perverse outcomes, unrestricted publication of sensitive species data remains appropriate. Note it is the responsibility of the publisher to protect sensitive species occurrence data. For guidance, please refer to this best-practice guide. You could refer to this recent essay in Science, which presents a simplified assessment scheme that can be used to help assess the risks from publishing sensitive species data.

When generalizing data you should try not to reduce the value of the data for analysis, and make users aware how and why the original record was modified using the Darwin Core term informationWithheld.

As indicated in the best-practice guide, you should also publish a checklist of the sensitive species being generalized. For each species you should explain:

the rationale for inclusion in the list
the geographic coverage of sensitivity
敏感度類別
the date to review its sensitivity

This will help alert other data custodians that these species are regarded as potentially sensitive in a certain area and that they should take the sensitivity into account when publishing the results of their analyses, etc.

Helpful formulas for generalizing point location

The following formula obscures a latitude/longitude point by a factor of 5000m. Note pointX and pointY must be provided in 'length in meters' and TRUNC truncates the number to an integer by removing the decimal part:

pointX = TRUNC(pointX / 5000) * 5000
pointY = TRUNC(pointY / 5000) * 5000