Symbiota Data Fields

The Symbiota data schema is strongly aligned to the Darwin Core data exchange standard. For more details, links to the Darwin Core definitions are supplied for each term. Learn more about Darwin Core terms in the following TDWG pages:

Fields listed here differ from the fields visible in the data uploading tools. For field information specific to the data upload tools, see the Data Import fields page.

Table of Contents

Since portals have the ability to customize the field names found on their data entry form, field names may differ from the core field definition and how it is mapped to Darwin Core export tools.

Download full content as a CSV See full content as a CSV

Standard Fields

Catalog Number: The unique identifier (primary key) for the specimen record. This field should be used to store the barcode or the accession number (herbaria only). This field is enforced to be unique per collection
Ex: WIS-L-0123456, ASU0012345, 12345
See Darwin Core's catalogNumber

Additional Identifier Values: Any other identifier for a specimen record that is not the central catalog number. This field is typically used to store the old catalog numbers, accession numbers, National Park identifiers, etc. Identifiers can be assigned a tag name to distinguish it from other identifiers (e.g. old accession #, NPS#, etc). These identifiers map best to dwc:otherCatalogNumbers definition, and thus included in the exports under this field. More information about this system can be found on the Catalog Numbers documentation page.
Ex: 12345, TUZI 3082, NPS Acc #: GUIS-M-00126.
See Darwin Core’s otherCatalogNumbers

Collector: The name of the person who collected the specimen or made the observation.
Ex: C.G. Pringle, Goodding, L.N.
See Darwin Core's recordedBy

Associated Collectors: Other collectors that were present at the time of collection.
Ex: John R. Reeder, A. Nelson
This field is not defined by the Darwin Core standard, which places primary and secondary collectors concatenated the recordedBy field.

Number: The collection number assigned to the specimen by the collector.
Ex: 1294, 12490b, 94-132
See Darwin Core's recordNumber

Date: The date the specimen was collected. While dates can be entered using your preferred format, the value will be converted and stored as an ISO-8601 numeric format (YYYY-MM-DD). Note that unknown month and days can be entered as "00". For example, a collection with a date of "March 1956" can be entered as "1956-03-00".
Ex: 1983-09-15, 1983-07-00, 1934-00-00
See Darwin Core's eventDate

Verbatim Date: Can be used to record date exactly as entered on label. Particularly useful for non-standard date formats or date ranges.
Ex: Spring 1901, March-April 1952, late Sept. 1909
See Darwin Core's verbatimEventDate

Year: The numeric value of the year at the time of collection. This field (along with month and day) is automatically populated when the date is entered.
Ex: 1959
See Darwin Core's year

Month: The numeric value of the month at the time of collection. This field (along with year and day) is automatically populated when the date is entered.
Ex: 10
See Darwin Core's month

Day: The numeric value of the day at the time of collection. This field (along with year and month) is automatically populated when the date is entered.
Ex: 28
See Darwin Core's day

Day of year range: A range of collection dates can be represented here as numeric day of year values. These values will be automatically calculated if you enter a date range in the verbatim date field (e.g. 12 Sept 1968 to 19 Sept 1968, 1968-09-12 to 1968-09-19)
See Darwin Core’s startDayOfYear, endDayOfYear.

Scientific Name: The Latin name of the specimen without the author. Could be anything from kingdom down to subspecies or variety, depending on the level of the identification.
Ex: Pinaceae, Pinus, Pinus edulis, Pinus edulis var. fallax
See Darwin Core's scientificName

Author: The name of the person who first named the taxa. This field autofills after entering the scientific name.
Ex: L., Asa A. Gray
See Darwin Core's scientificNameAuthorship

Identification Qualifier: The determiner's expression of uncertainty in their identification. This will be listed on the label along with the scientific name.
Ex: cf., aff.
See Darwin Core's identificationQualifier

Family: The family to which the taxa belongs. This field autofills after entering the scientific name.
Ex: Pinaceae
See Darwin Core's family

Identified By: The name of the person who identified the specimen. Also called a determiner.
Ex: L. R. Landrum
See Darwin Core's identifiedBy

Date Identified: The date the identification was made. Date can be entered as free form text and do not need to be in a standard date format.
Ex: 1992, May 1992, 2 May 1992
See Darwin Core's dateIdentified

ID References: The reference source used to make the identification.
Ex: Nesom, Guy L. 2006. Flora of North America - Asteraceae. vol. 20
See Darwin Core's identificationReferences

ID Remarks: Any additional notes regarding the identification of the specimen.
See Darwin Core's identificationRemarks

Taxon Remarks: Any additional notes regarding the taxonomic name to which the specimen was identify.
See Darwin Core's taxonRemarks

Country: The name of the country in which the specimen was collected. To aid data entry, a drop down menu will appear as one types, though names outside of the list can still be entered.
Ex: USA, Canada, Mexico
See Darwin Core's country

State/Province: The name of the state or province in which the specimen was collected. As one types, a selection list will appear for the given country.
Ex: New York, Arizona, Sonora
See Darwin Core's stateProvince

County: The name of the county in which the specimen was collected. Choose one from the drop down menu. For specimens collected outside of the United States, enter the next geographic region below state/province.
Ex: Maricopa
See Darwin Core's county

Municipality: The name of the municipality in which the specimen was collected. For specimens collected outside of the United States, enter the 4th level geographic designation.
Ex: Paradise Valley
See Darwin Core's municipality

Locality: The detailed location in which the specimen was collected.
Ex: 9.5 miles NW of Sedona along Boynton Pass Rd.
See Darwin Core's locality

Location ID: An identifier for the set of location information (data associated with dcterms:Location). May be a global unique identifier or an identifier specific to the dataset.
Ex: https://opencontext.org/subjects/768A875F-E205-4D0B-DE55-BAB7598D0FD1
See Darwin Core's locationID

Locality Security: Checking the Locality Security checkbox will hide locality details below the level of county from unauthorized users. This is typically done because the species is rare or threatened. Images are also hidden to protect locality details that might be viewable from the label. Users that are logged into the system and have the necessary permission to view locality details (e.g. collection managers) will continue to have access to all data. This box will automatically be checked if the species name is on any of the rare species lists  (see sitemap). If one wishes to lock protection (on or off), click the Lock Security checkbox and/or enter a reason for security override in the text field. Leaving the locality security unlocked will allow default settings to be applied as determined by the sensitive species administrators, which is the recommendation for most records.
For more information on sensitive species protection, see the page on Sensitive Species Protection.
This field is not defined by the Darwin Core standard.

Latitude and Longitude (decimal format): The geographic latitude and longitude in decimal degrees. Latitudes from the southern hemisphere and longitudes in the western hemisphere (e.g. USA) should be entered as negative values. Click on the "Tools" button to enter the coordinates in the degree, minute, seconds (DMS) or the UTM formats. Decimal degrees are the preferred coordinate standard as defined by Darwin Core. See below for more information on using this tool.
Ex: 34.874022, -111.75774
See Darwin Core's decimalLatitude

Uncertainty (meters): The accuracy of the georeference coordinates in meters (numeric value only). This is measured as the radius of a circle where the true point would be found if known. If coordinates are collected using a GPS, than the accuracy would be the error found within the GPS unit (usually around 10m). While previously collected specimens that have coordinates on the label recorded by the collector typically do not state the source of the coordinates (GPS, map, etc), it is typically a good assumption that the coordinates are accurate within one to two hundred meters. If the locality details are vague such as just "Grand Canyon", then the coordinates should be the centroid within the uncertainty encompassing the greater area where the specimen may have been collected. If the locality is "Boynton Canyon, Sedona", the uncertainty would be about 1500 m. This field autofills when using GeoLocate for georeferencing.
Ex: 42000 for Phoenix, 20000 for Salt Lake City
See Darwin Core's coordinateUncertaintyInMeters

Datum: The geographic system that was used to get the coordinates. This field autofills when using [http://www.museum.tulane.edu/geolocate/|GeoLocate] or the Google Maps tool for georeferencing.
Ex: NAD27, NAD83, WGS84
See Darwin Core's geodeticDatum

Verbatim Coordinates: If the coordinates recorded on the specimen label are in a format other than decimal degrees, enter them here. When decimal lat/long fields are blank and one enters UTM or DMS using one of the formats displayed in the example below, decimal lat/long values will be automatically generated. Click the “<<” symbol to replace existing decimal values. This field autofills when using the DMS, UTM, and TRS georeferencing tools.
Ex: 34° 13.940' N 112° 2.370' W, 12 420944E 4064025N, TRS: T40N R32E S29
See Darwin Core’s verbatimCoordinates.

Elevation in Meters: The elevation in meters at which the specimen was collected. Also called altitude. Use only the left field with the right field blank when a single elevation exists.
Ex: 1400, 2000-2200
See Darwin Core's minimumElevationInMeters

Verbatim Elevation: The verbatim elevation at which the specimen was collected. This is typically used to record an elevation measurement that was recorded in feet or an uncertainty designation. When the elevation in meters field is left blank, the value will automatically be converted to meters. Click the “<<” symbol to replace the previously entered meters values.
Ex: 4500ft, 4500 feet, ca 4500', ca 2000m, 4500' +-300'
See Darwin Core’s verbatimElevation.

Depth in Meters: The range of depth below the local surface, in meters.
Ex: 100, 150-200
See Darwin Core’s minimumDepthInMeters, maximumDepthInMeters.

Verbatim Depth: The original verbatim description of the depth below the local surface at which the specimen was collected.
Ex: 100ft, 100 feet, ca 100', ca 30m, 100' +-10'
See Darwin Core's verbatimDepth

Georeferenced By: The name of the person who georeferenced the specimen record. This field autofills when using GeoLocate for georeferencing.
Ex: A. Gonzales, emakings, acbarber
See Darwin Core's georeferencedBy

Georeference Protocol: The source of the standards used to georeference.
Ex: Georeferencing Quick Guide. Zermoglio et al. 2020
See Darwin Core's georeferenceProtocol

Georeference Sources: The tool or tools used to georeference.
Ex: GeoLocate, Google Earth, USGS map
See Darwin Core's georeferenceSources

Georef Verificiation Status: Says whether or not the georeference has been reviewed or verified.
Ex: reviewed, not reviewed
See Darwin Core's georeferenceVerificationStatus

Georeference Remarks: Any additional notes regarding the georeferencing of the specimen.
See Darwin Core's georeferenceRemarks

Habitat: The description of the habitat in which the specimen was collected.
Ex: Wet areas along a small stream in chaparral.
See Darwin Core's habitat

Substrate: The substrate on which the specimen was collected. Mostly used for lichen and bryophyte specimens.
Ex: On basalt, trunk of oak
Darwin Core lumps this information into habitat.

Associated Taxa: A list of the names of other species occurring with the collected specimen.
Ex: Quercus, Arctostaphylos, Ceanothus, Rhus, Eriogonum, Salvia
See Darwin Core's associatedTaxa

Description: A physical description of the specimen at the time of collection. This often includes information that can be lost or difficult to observe after the collection and preservation process.
Ex: Shrub 3 m tall, corolla yellow

Notes: Any additional notes regarding the specimen.
See Darwin Core's occurrenceRemarks

Dynamic Properties: A list of additional measurements, facts, characteristics, or assertions about the specimen in a format that allows programmatic parsing of the data. See the Darwin Core link below for further details.
Ex: awnLengthInMeters=0.014, heightInMeters=1.5, relativeHumidity=28, airTemperatureInC=22
See Darwin Core's dynamicProperties

Life Stage: The age or stage of the organism at the time of collection/observation. Typically used for zoological collections.
Ex: larva, juvenile
See Darwin Core's lifeStage

Sex: The biological sex of the occurrence.
Ex: female, male
See Darwin Core's sex

Individual Count: The number of individuals represented by the occurrence
Ex: 2, 15
See Darwin Core's individualCount

Sampling Protocol: The names and references to methods used to collect or sample an occurrence
Ex: UV light trap, mist net, Takats et al. 2001. Guidelines for Nocturnal Owl Monitoring in North America. Beaverhill Bird Observatory and Bird Studies Canada, Edmonton, Alberta. 32 pp., http://www.bsc-eoc.org/download/Owl.pdf
See Darwin Core's samplingProtocol

Preparations: Preparation or preservation method for a specimen
Ex: in ethanol, skeleton
See Darwin Core's preparations

Phenology (Reproductive Condition): The reproductive stage the specimen is in. Typically used for plant and fungal collections.
Ex: flower, fruit, sterile
See Darwin Core's reproductiveCondition

Establishment Means: The state of establishment at the time of collection.
Ex: cultivated, invasive, native
See Darwin Core's establishmentMeans

Cultivated Checkbox: Check when the organism was established with the aid of humans and would not be able to exist on their own. This true/false field enables the ability to filter non-native or naturalized species.
Not currently exported in DwC format.

Type Status: The type designation of a specimen, if it is a type specimen
Ex: HOLOTYPE, ISOTYPE, PARATYPE
See Darwin Core's typeStatus

Disposition: The location or status of the physical specimen.
Ex: missing, on loan, in collection
See Darwin Core's disposition

Occurrence Id: This is the Global Unique Identification (GUID) for the specimen. This identification code should be stable and uniquely identify the specimen relative to all other specimens within the world.
Ex: 000866d2-c177-4648-a200-ead4007051b9, urn:catalog:UWBM:Bird:89776
See Darwin Core's occurrenceID

Field Number:
See Darwin Core's fieldNumber

Owner Code: The acronym of the owning institution. Only enter a value if the owning institution is different than what was entered when the metadata for the collection institution was added to the portal.
Ex: NPS, Forest Service
See Darwin Core's ownerInstitutionCode

Basis of Record: The type of record the specimen is classified as. For physical collections, this field defaults to “PreservedSpecimen” (aka herbarium specimen) and for observation projects, the default is “Observation”.
Ex: PreservedSpecimen, LivingSpecimen, Observation
See Darwin Core's basisOfRecord

Language: The language of the label information
Ex: English, Spanish, Portuguese
See Darwin Core's language

Processing Status: The status of the digital record. This field is used for internal data management and review. The values used are dictated by the specific workflow of each institution.

Label Project: Used for printing labels. You can create a label project and print that set of labels after you've entered the data.
Ex: Plants of Sedona 2012

Duplicate Quantity: The number of duplicate specimens created. This will dictate the number of labels printed for specimen.
Ex: 10

Material Sample Fields

Material Sample Module
Material Sample tab

Controlled vocabularies for Material Sample data fields are managed per portal, and the suggested examples provided below are derived from vocabularies used for the NEON Biorepository. These vocabularies vary by portal, and modifications may require community input. Contact your Portal Administrator for more information.

Sample Type: Controlled vocabulary defining the sample type, which is often anatomical in nature.
Ex: skull, liver, gastrointestinal tract, ectoparasite

Catalog Number / Barcode: A unique identifier for the material sample, analagous to _catalogNumber_ for specimen occurrences.
Ex: WIS-L-0123456, ASU0012345
See Darwin Core's catalogNumber

Material Sample ID (GUID): A globally unique identifier for the material sample. In the absence of a persistent global unique identifier, construct one from a combination of identifiers in the record that will make this identifier globally unique.
Ex: 06809dc5-f143-459a-be1a-6f03e63fc083
See Darwin Core's materialSampleID

Condition: Free text field to describe the physical condition of the sample. Use of a controlled vocabulary is recommended but not forced.
Ex: very poor, poor, fair, good, unknown

Disposition: Controlled vocabulary describing the current state of a sample with respect to its collection.
Ex: in collection, being processed, consumed, on loan
See Darwin Core's disposition

Preservation type: Controlled vocabulary defining the physical storage/preservation method of a sample.
Ex: dry, ethanol, frozen, pinned

Preparation date: The date of a sample's physical preparation. Dates in this field visually conform to MM/DD/YYYY formatting. Manual data entry into this field is validated using a calendar form.
Ex: 08/01/2022

Prepared by: Name of the individual who prepared a sample. The individual must have an user account in the portal to be recorded in this field.
Ex: Liao, Rosie; Johnston, Andrew

Preparation details: Free text field to record notes providing more context about the physical preparation and condition of the sample.
Ex: upper and lower GI tract; kidney, left, whole; prepared with borax

Individual count: The number of loanable objects associated with the sample, i.e. all pieces of the sample assigned to the same unique materialSampleID (see above).
Ex: 0, 1, 100
See Darwin Core's individualCount

Sample Size: Free text field to quantify the sample beyond counted number of objects, e.g. dry weight.
Ex: 200 uL

Storage Location: Free text field to describe a sample's permanent physical storage location.
Ex: Freezer 3; Oversize Storage; Cab011, Dwr002

Remarks: Free text field to provide additional notes, comments, and context unique to a sample that cannot be captured by other existing data fields. Limited to 250 characters.
Ex: genotype sampling; left jaw consumed in research; with post-cranial skeleton

Paleontology Fields

Paleo Module
Paleo Module on the Occurrence Data tab

Controlled vocabularies for the following data fields are managed per portal. Modifications to these values may require community discussion. Contact your Portal Administrator for more information.

Eon: The longest geologic time intervals. Ex: Archean, Proterozoic, Phanerozoic See Darwin Core’s earliestEonOrLowestEonothem, latestEonOrHighestEonothem

Era: A subdivision of an eon that is a shorter interval of geologic time.
Ex: Paleozoic, Mesozoic, Cenozoic.
See Darwin Core’s earliestEraOrLowestErathem, latestEraOrHighestErathem

Period: A subdivision of an era that is a shorter interval of geologic time.
Ex: Ordovician, Silurian, Devonian, Carboniferous, Mississippian, Pennsylvanian, Permian, Triassic, Jurassic, Cretaceous, Paleogene, Neogene, Quaternary.
See Darwin Core’s earliestPeriodOrLowestSystem, latestPeriodOrHighestSystem

Epoch: A subdivision of a period that is a shorter interval of geologic time.
Ex: Lower, Middle, Upper Ordovician; Wenlock; Pridoli; Lower, Middle, Upper Devonian; Lower, Middle, Upper Mississippian; Lower, Middle, Upper Pennsylvanian; Cisuralian; Lower, Middle, Upper Jurassic; Lower, Upper Cretaceous; Paleocene; Eocene; Oligocene; Miocene; Pliocene; Pleistocene; Holocene.
The Epoch field is currently being recorded using Series, the chronostratigraphic units.
See Darwin Core’s earliestEpochOrLowestSeries, latestEpochOrHighestSeries

Stage: The chronostratigraphic term given to the succession of rock strata laid down in a single geochronologic age.
Ex: Lochkovian, Emsian, Eifelian, Givetian, Frasnian, Tournaisian, Serpukhovian, Moscovian, Changhsingian, Norian, Oxfordian, Hauterivian, Albian, Maastrichtian, Thanetian, Messinian, etc.
See Darwin Core’s earliestAgeOrLowestStage, latestAgeOrHighestStage

Local Stage: A local name for a stage that was applied to this specimen.
Ex: Ulatsian, Helvetian.

Early Interval: Name of the earliest possible geochronologic eon, era, period, epoch or age, or the lowest chronostratigraphic eonothem, erathem, system, series, or stage attributable to the stratigraphic horizon from which the cataloged specimen was collected.
Ex: Aalenian, Aeronian, Albian, Anisian, Aptian, Aquitanian, Archean, Artinskian, Asselian, Bajocian, Barremian, Bartonian, etc.

Late Interval: Name of the latest possible geochronologic eon, era, period, epoch or age, or the highest chronostratigraphic eonothem, erathem, system, series or stage attributable to the stratigraphic horizon from which the cataloged specimen was collected.
Ex: Aalenian, Aeronian, Albian, Anisian, Aptian, Aquitanian, Archean, Artinskian, Asselian, Bajocian, Barremian, Bartonian, etc.

Absolute Age: Field to record the age of specimen/rock in years determined using radioactive decay of isotopes (Carbon-14, argon-argon, potassium-argon, uranium-lead, etc.) and other quantitative dating methods.
Ex: 20 Ma, 75 ka, 10.5 – 12.7 +/- 0.5 Ma, etc.

Storage Age: Field for institutions that arrange collections by geologic time or biostratigraphy. The physical location of a specimen within the collection space.
Ex: Miocene, Wasatchian, Paleocene, Bridgerian, etc.

Biota (Flora/Fauna): Name given to collections of fossils of the same age from a single locality or multiple localities within a specific geographic area.
Ex: Chalk Bluffs, Stewart Valley, Bridge Creek, Mazon Creek, etc.

Biostratigraphy (Biozone): The name of the lowest possible geological biostratigraphic zone of the stratigraphic horizon from which the cataloged item was collected.
Ex: “Wa0”, “Uvigerinella sparsicostata Zone”, “Ogygiocaris”
See Darwin Core’s lowestBiostratigraphicZone, highestBiostratigraphicZone

Group: The name of the lithostratigraphic group from which the cataloged specimen was collected. The National Geologic Map Database Geolex Search is a great resource for the named lithostratigraphic units accepted by the USGS.
Ex: Bathurst, Lower Wealden, Monte Cristo, Contra Costa, Panoche, etc.
See Darwin Core’s group

Formation: The name of the lithostratigraphic formation from which the cataloged specimen was collected. The National Geologic Map Database Geolex Search is a great resource for the named lithostratigraphic units accepted by the USGS.
Ex: Notch Peak, House Limestone, Fillmore, Chinle, etc.
See Darwin Core’s formation

Member: The name of the lithostratigraphic member from which the cataloged item was collected. The National Geologic Map Database Geolex Search is a great resource for the named lithostratigraphic units accepted by the USGS.
Ex: Lava Dam, Hellnmaria, Brown Mountain Sandstone
See Darwin Core’s member

Bed: The name of the lithostratigraphic bed from which the cataloged item was collected. The National Geologic Map Database Geolex Search is a great resource for the named lithostratigraphic units accepted by the USGS.
Ex: Harlem coal
See Darwin Core’s bed

Taxon Environment: The depositional environment of the rock unit from which cataloged specimen was collected.
Ex: marine, lacustrine, non-marine, marine-non-marine

Lithology: Field for terms to describe the types of rock/sediment from which the cataloged specimen was collected.
Ex: sandstone, mudstone, siltstone, shale, etc.
See Darwin Core's lithostratigraphicTerms

Strat Remarks: Field to record additional details about the geology, stratigraphy, more detailed lithology description, palynological sampling info, core data, etc.

Element: Field to record type of plant organ cataloged specimen represents.
Ex: stem, strobilus, sterile leaf, fertile leaf, pinnule(s), rooting organ, rootlet, megasporangium, sporangium, spore, sterile axis, fertile axis, root, etc.

Slide Properties: Field to record types of prepared slides of specimens, noting the type of preparation and mounting medium, and to provide England Finder coordinates for palynomorph slides.
Ex: strewn, petrographic thin-section, mounted peel

Geological Context ID: An identifier for the set of information associated with a GeologicalContext (the location within a geological context, such as stratigraphy). May be a global unique identifier or an identifier specific to the data set.
Ex: https://opencontext.org/subjects/e54377f7-4452-4315-b676-40679b10c4d9
See Darwin Core's geologicalContextID

Cite this page:

Ed Gilbert, ed: Laura Rocha Prado, Katie Pearson, Lindsay Walker. Symbiota Data Fields. In: Symbiota Support Hub (2021). Symbiota Documentation. https://biokic.github.io/symbiota-docs/editor/edit/fields/. Created on 21 Jul 2014, last edited on 19 Dec 2023.