This page describes how to use the Duplicate Matching tool in the occurrence editor to accelerate data entry.
The Duplicate Matching tool allows the user to search the entire database (i.e., all collections in the Symbiota portal) for records that share the same collector name, collector number, and date and link them to the record they are entering/editing.
You can turn an automatic duplicate search on or off using the “Auto search” checkbox. This will conduct a search for duplicates automatically after you enter the collector name, number, and date.
There are three possible results of a duplicate search using this tool.
This result occurs when a record with a matching collector last name, collector number, and date is found in a collection other than the one you are currently working in. This may represent a true duplicate specimen.
This result occurs when a record with a matching collector last name, collector number, and date is found within the collection you are currently working in. This may represent a true duplicate specimen, if you have them in your collection, or it could represent unintentional duplicate data entry (i.e., an error).
This result occurs when a record with a matching collector last name and date was found, but the exact collector number was not found. In this case, the results presented will include collector numbers slightly above and slightly below the number that you entered. These records may still be useful because they may share, e.g., locality data with the record that you are currently entering/editing.
You will have several options for using the results of the duplicate matching search. These will be listed at the bottom of the duplicate result that has been identified (see example below). You may:
Transfer the data from all the fields in the identified duplicate specimen or event into the data entry page that you are currently working on. This will replace any data that you have already entered (e.g., if you entered “Luis Gonzalez” in the collector/observer name, and the duplicate had was “Gonzalez, Luis” in this field, “Gonzalez, Luis” would overwrite your previous entry). Clicking this option will transfer the data and close the duplicate matching window.
Transfer the data from all fields in the identified duplicate specimen or event into the data entry page that you are currently working on, unless that field already has data in it. Only empty fields will be populated. Your previous entries will remain the same. Clicking this option will transfer the data and close the duplicate matching window.
Checking this box and then clicking one of the additional options will create a duplicate linkage between the record you are editing and the duplicate record. For more information about duplicate linkages, visit the Duplicate Clustering documentation.
This option is only available if the identified duplicate belongs to the same collection that you are currently editing. Clicking this option will take you to the occurrence editor page for the duplicate record so you can view it (e.g., if you want to verify the data or view the image). Once you have looked at the image, if you want to transfer any of the data into the record you are working on, you will need to click the “Duplicates” button on the record that you were editing again.
This option is only available if the identified duplicate belongs to the same collection that you are currently editing. If you click this option, the two records will be merged together. The record in the pop-up window will be prioritized over data in your occurrence editor. For example, if the record listed in the pop-up window has “USA” in the country field, and the record in the occurrence editor has “United States” in the country field, the merged record will have “USA” in the country field. Fields that are empty in one of the two records will be filled in with the data from the record that has data in that field. All images and linked resources will be included in the merged record. All unique determinations will be included in the merged record with the exception of any “current determination” belonging to the record in the occurrence editor, which will be discarded. As a result, you will have one current determination (belonging to the record in the pop-up window) and all other previous determinations associated with the record. For this reason, it is a good idea to check the Determinations tab of the merged record before moving on.
Katie Pearson. Duplicate Matching. In: Symbiota Support Hub (2024). Symbiota Documentation. https://biokic.github.io/symbiota-docs/editor/edit/duplicates/. Created on 30 Nov 2022, last edited on 27 Sep 2024.