Data Dictionary
Demographic and clinical data are provided by the Kentucky Cancer Registry, which is part of the National Cancer Institute’s Surveillance, Epidemiology, and End Results (SEER) Program and the Centers for Disease Control and Prevention’s National Program of Cancer Registries (NPCR).
Field | Description | Reference |
---|---|---|
Patient ID | Unique identifier for referring to a patient and diagnosis record within cBioPortal (see additional details below). | |
Sample ID | Unique identifier for the sample. | |
Sex | Patient sex | KCR Abstractors Manual |
Ethnicity | Patient ethnicity | KCR Abstractors Manual |
Race | Patient race | KCR Abstractors Manual |
Diagnosis Year | Year of diagnosis extracted from the date of diagnosis | KCR Abstractors Manual |
Diagnosis Age | Patient age at diagnosis | KCR Abstractors Manual |
Topography Code | ICD-O topography code of anatomical site of primary diagnosis | KCR Abstractors Manual |
Topography Description | ICD-O topography description of code for the anatomical site of primary diagnosis | KCR Abstractors Manual |
Histology Code | ICD-O histological code of classification of tumor | KCR Abstractors Manual |
Histology Description | ICD-O histological description for the histology code | KCR Abstractors Manual |
Best Stage Group | Calculated value for stage. It is calculated from the CS derived stage or the pathologic and clinical TNM Stage Groups recorded for this case. | KCR Abstractors Manual |
Behavior | Behavior of the tumor being reported. The fifth digit of the morphology code is the behavior code. | KCR Abstractors Manual |
Laterality | Describes the involvement of one or both sides of paired organs in a primary diagnosis of cancer | KCR Abstractors Manual |
Recurrence Status | Identifies the type of first recurrence after a period of documented disease-free intermission or remission | KCR Abstractors Manual |
Marital Status at Diagnosis | Patient’s marital status at the time of diagnosis for this tumor, if known. | KCR Abstractors Manual |
Nodes Examined | Number of regional lymph nodes examined. | KCR Abstractors Manual |
Nodes Positive | Number of regional lymph nodes positive. | KCR Abstractors Manual |
Overall Survival Status | Patient overall survival status derived from vital status from the registry. | KCR Abstractors Manual; cBioPortal Official OS_STATUS |
Primary Payer | Patient’s primary payer or insurance carrier at the time of initial admission. | KCR Abstractors Manual |
SEER Site 1 | Diagnosis site calculated from the Topography and Histology Codes, using the site groupings defined by SEER. | KCR Abstractors Manual; SEER Recode |
SEER Super Site | Similar to SEER Site 1, except this is a larger site group assignment. For example, the Lip and Tongue SEER Site 1 values fall into the Oral Cavity and Pharynx SEER Super Site. | SEER Recode |
Site Group | Another cancer site designation used by the Kentucky Cancer Registry, calculated from the Topography and Histology Codes. | KCR Abstractors Manual |
Survival Status | Describes the patient and tumor status at last contact. | KCR Abstractors Manual |
Tobacco Status | Describes the patient’s tobacco use. | KCR Abstractors Manual |
Treatment Composite (All) | A summary of all therapy a patient received for a primary incidence of cancer, including first and subsequent courses, calculated from all therapy records in the cancer registry. | KCR Abstractors Manual |
Treatment Composite (First) | A summary of all first course therapy a patient received for a primary incidence of cancer, calculated from all first course therapy records in the cancer registry. | KCR Abstractors Manual |
Tumor Size | Size measured on the surgical resection specimen | KCR Abstractors Manual |
Urban Rural | A code indicating the urban or rural status of the patient’s address at diagnosis: 1-3 = Urban, 4-9 = Rural | KCR Abstractors Manual |
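
The Urban Rural coding in the table above (1-3 = Urban, 4-9 = Rural) can be decoded with a small helper. The following is a minimal sketch in Python, not part of cBioPortal or the registry: the function name `urban_rural_label` is hypothetical, and returning "Unknown" for missing or out-of-range values is an assumption made here for illustration.

```python
def urban_rural_label(code):
    """Map an Urban Rural code to a label: 1-3 = Urban, 4-9 = Rural.

    Assumption: the code arrives as an int or a numeric string.
    Anything else (blank, None, out of range) is labeled "Unknown".
    """
    try:
        value = int(str(code).strip())
    except ValueError:
        return "Unknown"
    if 1 <= value <= 3:
        return "Urban"
    if 4 <= value <= 9:
        return "Rural"
    return "Unknown"


if __name__ == "__main__":
    for sample in (1, "3", 4, "9", "", 0):
        print(repr(sample), "->", urban_rural_label(sample))
```
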
The data in cBioPortal has been de-identified, but CRI can facilitate access to a rich set of additional data variables for cBioPortal cohorts from either the cancer registry or other linked data sets. Please contact us for more information: CRISRF Data Request
Notes on Patient ID and Sample ID
For many of the fields in cBioPortal we are able to control the display field name. However, we cannot control the field names “Patient ID” and “Sample ID” due to limitations of the cBioPortal software. The record-level data in this cBioPortal instance is based on a patient’s primary diagnosis of cancer (Patient ID) linked to a molecular variant report from a tissue sample related to that cancer diagnosis (Sample ID). A patient in the cancer registry can have multiple primary cancer cases, and a single diagnosis record can have multiple tissue samples. To retain this relationship, we use the Patient ID field to hold the unique identifier for the case or diagnosis. For each case, there is a field named “Registry Patient Identifier”, which is a unique patient identifier and allows users to determine whether a patient has multiple cases or samples in cBioPortal. While using cBioPortal, if you encounter records where the Registry Patient Identifier repeats, or if a cohort reports more than one occurrence of a Registry Patient Identifier, this indicates that the patient has more than one diagnosis or sample.
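
The check described above, finding patients whose Registry Patient Identifier repeats, can be sketched in a few lines of Python. This is an illustration only: the function name, the dictionary keys (`registry_patient_identifier`, `patient_id`, `sample_id`), and the sample rows are all hypothetical and do not reflect the actual column names of a cBioPortal export.

```python
from collections import defaultdict


def patients_with_multiple_records(rows):
    """Group records by Registry Patient Identifier and keep the repeats.

    Each row is a dict with hypothetical keys:
      registry_patient_identifier - unique per patient
      patient_id                  - unique per case/diagnosis
      sample_id                   - unique per tissue sample
    Returns {registry id: [(patient_id, sample_id), ...]} for every
    registry id that appears on more than one record.
    """
    grouped = defaultdict(list)
    for row in rows:
        key = row["registry_patient_identifier"]
        grouped[key].append((row["patient_id"], row["sample_id"]))
    return {key: recs for key, recs in grouped.items() if len(recs) > 1}


if __name__ == "__main__":
    # Made-up rows: R1 has two diagnoses, R2 has a single record.
    rows = [
        {"registry_patient_identifier": "R1", "patient_id": "C1", "sample_id": "S1"},
        {"registry_patient_identifier": "R1", "patient_id": "C2", "sample_id": "S2"},
        {"registry_patient_identifier": "R2", "patient_id": "C3", "sample_id": "S3"},
    ]
    print(patients_with_multiple_records(rows))
```

Comparing the Patient ID values within each group shows whether the repeats come from separate diagnoses (different Patient IDs) or from several samples of one diagnosis (the same Patient ID).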