Skip to content

TableInfo: biosample.tsv

abradyIGS edited this page Sep 3, 2021 · 20 revisions

The biosample.tsv table will contain one row per biosample in your program.

Field Field Description Required? Field Value Type Extra Info
id_namespace A CFDE-cleared identifier representing the top-level data space containing this biosample [part 1 of 2-component composite primary key] Required string id_namespace is a unique URI prefix pre-registered with CFDE and attached to your program (or a subset of your program) that identifies anything labeled with it as belonging to you. Please see the technical documentation for a full discussion of how this information is built and used.
local_id An identifier representing this biosample, unique within this id_namespace [part 2 of 2-component composite primary key] Required string The string formed by concatenating the id_namespace and local_id field values must be unique for each row in this table. Please see the technical documentation for a full discussion of how this information is to be used.
project_id_namespace The id_namespace of the primary project within which this biosample was created [part 1 of 2-component composite foreign key] Required string This will be the value of id_namespace in the row in project.tsv corresponding to the primary project that generated this biosample. If your program has not registered multiple CFDE identifier namespaces, this will be exactly the same value for all rows.
project_local_id The local_id of the primary project within which this biosample was created [part 2 of 2-component composite foreign key] Required string This will be the value of local_id in the row in project.tsv corresponding to the primary project that generated this biosample.
persistent_id A persistent, resolvable (not necessarily retrievable) URI or compact ID permanently attached to this biosample Non-required: Any number of rows after the header can be filled The value in each row must be different

Value type is string
Meant to serve as a permanent address to which landing pages (which summarize metadata associated with this file) and other relevant annotations and functions can optionally be attached, including information enabling resolution to a network location from which the file can be downloaded.

Actual network locations must not be embedded directly within this identifier: one level of indirection is required in order to protect persistent_id values from changes in network location over time as files are moved around.
creation_time An ISO 8601 -- RFC 3339 (subset)-compliant timestamp documenting this biosample's creation time: YYYY-MM-DDTHH:MM:SS±NN:NN Non-required: Any number of rows after the header can be filled Value must be datetime Example valid dates:
2021-01-08
2021-01-08T00:45:40Z
2021-01-08T00:45:40+00:00
anatomy An UBERON CV term ID used to locate the origin of this biosample within the physiology of its source or host organism Non-required: Any number of rows after the header can be filled Value must be a valid UBERON ID UBERON lookup service
Example valid UBERON IDs:
UBERON:0001988
UBERON:0001052
UBERON:0006956
Clone this wiki locally