Contact: Zinaida Perova, Project Lead, CancerModels.Org
Variability in creation, validation and description of patient-derived cancer models is a hurdle to reproducibility and clinical adoption of these models. CancerModels.Org team aims to collect feedback from as many people working in this field as possible to assemble and publish a Minimal Information standard to describe these models, similar to what has been done for the patient-derived xenograft models (PDX-MI).
Below is a collection of attributes to describe 3D patient-derived cancer models assembled in collaboration with the representatives from the following institutes: Baylor College of Medicine, Cornell University, DepMap - Broad Institute of MIT and Harvard, Dana-Farber Cancer Institute, Harvard, Huntsman Cancer Institute, MD Anderson Cancer Center, Sage Bionetworks, The Jackson Laboratory, UConn Health - University of Connecticut, University of California San Francisco, University Health Network - Princess Margaret Cancer Centre, University of Verona, University of Pittsburgh Medical Center, VHIO Vall d’Hebron Instituto de Oncología, Wellcome Sanger Institute
We are grateful for the time and effort you put into this work. If you have any feedback, or further suggestions, you can comment on the document here (please use the “Add Comment” feature): https://tinyurl.com/cancermodelsMIdraft
Attribute | Description | Example | Format | Required? |
---|---|---|---|---|
patient_id | Unique anonymous/de-identified ID for the patient from the provider | A0088 | free alphanumerical and -_~. | essential |
sex | Sex of the patient | female | male,female,other,not collected, or not provided | essential |
history | Cancer-relevant comorbidity or environmental exposure | lynch syndrome | free alphanumerical | desirable |
ethnicity | Patient Ethnic group. Can be derived from self-assessment or genetic analysis. | Eastern European | free text | desirable |
ethnicity_assessment_method | Patient Ethnic group assessment method | self-assessed/genetic | free alphanumerical. Leave blank if ethnicity was not collected/provided | desirable |
initial_diagnosis | Diagnosis of the patient when first diagnosed at age_at_initial_diagnosis - this can be different than the diagnosis at the time of collection which is collected in the sample section. No acronyms. | colorectal carcinoma | Use NCIT term when possible. Ctrl-Click here to see terms | desirable |
age_at_initial_diagnosis | The age at first diagnosis. | 70 | Numerical. Can be exact age or binned in 10 year groups (e.g. 10-19) | desirable |
age_category | Age category at time of sampling | Adult | Adult, Pediatric, Fetus, Not collected, Not provided | desirable |
HLA_type | HLA type | HLA-A | free text | desirable |
smoking_status | Patient's smoking history | current smoker, 1 pack per day | Current smoker, ex-smoker, non-smoker, never smoked; free text | desirable |
alcohol_status | Alcohol intake of the patient, self-reported | yes | yes/no/not provided | desirable |
alcohol_frequency | The average number of days per week on which the patient consumes alcohol, self-reported | 3 days/week | free alphanumerical | desirable |
family_history_of_cancer | If a first-degree relative of the patient has been diagnosed with a cancer of the same or different type | yes | free text | desirable |
sample_id | Unique ID of the patient tumour sample used to generate the model | CRC0228PRH0000000000D01000 | unique, free alphanumerical and -_~. | essential |
collection_date | Date of collections - if this is not available please indicate in column E, F collection events and the elapsed times between the collection. Important for understanding the time relationship between models generated for the same patient. | 42278 | MM YYYY or YYYY (depending on the regulations) | if several collections this is essential otherwise desirable |
collection_event | IF DATE ENTERED IN collection_date DO NOT FILL order collection 1 correspond to the 1st time a patient was sampled to generate a PDX, subsequent collection events are incremented by 1 | collection 1 | "collection " + event number | if several collections this is essential otherwise desirable |
months_since_collection_1 | IF DATE ENTERED IN collection_date DO NOT FILL, provide the time difference between the 1st collection event and the current one (in months) - collection event 1 should be 0, collection event 2 should be 6 if 6 months have elapsed between collection 1 and collection 2 and collection event 3 should be 9 if 9 months have elapsed between collection 1 and collection 3 | 0 | numerical | if several collections this is essential otherwise desirable |
age_in_years_at_collection | Patient age at tissue collection. Can be exact age or binned in 10 y group (1-9, 10-19, 20-29, ...) | 70 | numerical | essential |
diagnosis | diagnosis at time of collection of the patient tumour used to generate the model | colorectal carcinoma | Use NCIT ontology terms. Ctrl-Click here to see terms | essential |
molecular_subtype | Molecular subtype of cancer | ER-positive | free alphanumerical | desirable |
tumour_type | Collected tumour type | primary | primary, metastatic, recurrent, refractory, precancerous | essential |
primary_site | Site of the primary tumor where primary cancer is originating from (may not correspond to the site of the current tissue sample). | colon | Use NCIT ontology term where possible. Ctrl-click here to see terms. | essential |
collection_site | Site of collection of the tissue sample (can be different than the primary site if tumour type is metastatic) | colon | Use NCIT ontology term where possible. Ctrl-click here to see terms | essential |
collection_method | Method of collection of the tissue sample | biopsy | Biopsy, surgical resection, blood draw, free text | desirable |
stage | Stage of the patient cancer at the time of collection. Should be in the format of the stage classification | T2,N1,M0 | free alphanumerical | desirable |
staging_system | Stage classification system used to describe the stage. Add the version if available. | TNM staging system | free alphanumerical | desirable |
grade | Tumour grade value | Grade 1 | free alphanumerical | desirable |
grading_system | Grade classification corresponding used to describe the stage, or Not Specified Add the version if available. | Nottingham grading system | free alphanumerical | desirable |
virology_status | Positive virology status at the time of collection. Any relevant virology information which can influence cancer like EBV, HIV,HPV status. | HIV | free text | desirable |
gene_mutation_status | Outcome of mutational status tests for the following genes: BRAF, PIK3CA, PTEN, KRAS | KRAS:G12C | free text | desirable |
shareable | Is patient treatment information available and shareable? If yes fill out following treatment columns: treatment_naive_at_collection, treated_at_collection, treated_prior_to_collection. | yes | yes/no/not provided | essential |
treatment_naive_at_collection | Was the patient treatment naive at the time of collection? This includes the patient being treated at the time of tumour sample collection and if the patient was treated prior to the tumour sample collection. The value will be "yes" if either treated_at_collection or treated_prior_to_collection are "yes". | Yes | yes/no/not collected/not provided | essential |
treated_at_collection | Was the patient being treated for cancer at the time of tumour sample collection. | radiotherapy | radiotherapy, chemotherapy, targeted therapy, homorno-therapy, immunotherapy | desirable |
treated_prior_to_collection | Was the patient treated prior to the tumour sample collection. | radiotherapy | radiotherapy, chemotherapy, targeted therapy, homorno-therapy, immunotherapy | desirable |
response_to_treatment | Patient’s response to treatment. | Progressive disease | RECIST criteria: CR, PR, PD, SD | desirable |
model_id | Unique identifier for models derived from the same tissue sample | CRC0014LM | free alphanumerical and and -_~. | essential |
model_name | Most common name associate with model. Please use the CCLE name if available | LAMA84_HAEMATOPOIETIC_AND_LYMPHOID_TISSUE | free alphanumerical | essential |
model_name_aliases | Additional model names, if known | A 549 | free alphanumerical | desirable |
type | Type of model | PDO | PDO, PDXO, CDO, cell line, other |
essential |
parent_id | Model Id of the original model used to generate this model. If the model was not in this set, please refer to it by external id | CRC022 | free alphanumerical and and -_~. | essential if applicable |
origin_patient_sample_id | Unique ID of the patient tumour sample used to generate the model | CRC0228PRH0000000000D01000 | unique, free alphanumerical and -_~. | essential |
growth_properties | Observed growth properties of the model | adherent | Adherent, Semi-adherent, suspension, Matrigel/BME domes | essential |
media_id | Unique identifier for each media formulation (Catalog number) | MF-002-001 | free alphanumerical | essential if applicable |
growth_media | Base media formulation the model was grown in | DMEM | free alphanumerical | essential |
plate_coating | Coating on plate model was grown in | None | Laminin, Matrigel, Collagen, None | essential |
other_plate_coating | Other coating on plate model was grown in (not mentioned above) | None | Peg-based hydrogel, none | essential |
passage_number | Passage number at time of sequencing/screening | 20 | Freetext | essential |
contamination_testing | Has the model been tested for contamination? | yes | Y/N | essential |
contamination_details | What are the details of the contamination | mycoplasma | free alphanumerical | desirable |
supplements | Additional supplements the model was grown with | 10% R-spondin, 10% Noggin, 10% FBS | free alphanumerical | desirable |
drug | Additional drug/compounds the model was grown with | EGF | free alphanumerical | desirable |
drug_concentration | Concentration of Additional drug/compounds the model was grown with | 50ng/ml | free alphanumerical | desirable |
publications | The original (or earliest available) publication outlining how the model was established | PMID: 27771609, PMID: 27771610 | PMID: 8 digit format | desirable |
supplier | Model supplier name | ATTC | ROR term (if available), otherwise free text, not collected or not provided | essential |
supplier_type | Model supplier type - commercial, academic, other | Commercial | Commerical, Academic, Other | essential |
catalog_number | Catalogue number of cell model, if commercial | CCL-185 | free alphanumerical | essential |
vendor_link | Link to purchasable cell model, if commercial | https://www.atcc.org/products/ccl-185 | URL | essential |
rrid | Cellosaurus ID | CVCL_0023 | free alphanumerical | essential if applicable |
related_models | Model_ids of any related models linked to the same patient. | A088a | free alphanumerical and _~. Separated by commas in case of multiple models | essential if applicable |
external_ids | Depmap accession, Cellusaurus accession or other id. Please place in comma separated list | CVCL_0023 | free alphanumerical | desirable |
msi_status | MSI status of the model | MSS | MSI: Miscosatellite Instable, MSS: Miscrosatellite stable | desirable |
mutational_burden | Mutational rate per Mb | 4.76 | numerical | desirable |
ploidy | Ploidy status of the model | 2.03 | numerical | desirable |
morphological_features | Morphological features of the model | elongated | free text | essential |
histological_validation | Immunofluorescent or immunohistochemical validation of the model | High concordance | Free text | essential |
SNP_analysis | Was SNP analysis done on the model? | yes | Yes, no | essential |
STR_analysis | Was STR analysis done on the model? | yes | Yes, no | essential |
tumour_status | Gene expression validation of established model | Free text | essential | |
model_purity | Presence of tumour vs stroma or normal cells | Free text | essential | |
comments | Comments about the model that cannot be expressed by other fields | free text , not collected or not provided | desirable | |
accessibility | Any limitation of access of the model per type of users, such as academia only, industry and academia, or national limitation (e.g. no specific consent for sequencing) | academia only | academia, industry and academia or free text | essential |
Contact email for any requests from users about models. If multiple, include as comma separated list. | [email protected] | email format | essential | |
name | Contact person (should match that included in email column). | Jane Doe | free text | essential |
form_url | Link to a contact form, if available | http://tumor.informatics.jax.org/mtbwi/pdxDetails.do?modelID=J000106527 | URL | essential if applicable |
database_url | Link to the database/biobank/resource, if available | http://tumor.informatics.jax.org/mtbwi/pdxDetails.do?modelID=TM01220 | URL | essential if applicable |