Glossary
Quick overview of all terms used throughout MIDOM
Protocol
Deidentification Protocol. A full, abstract description of a deidentification procedure. Describes what to do with each DICOM Element, voxel data, private elements. Also describes which datasets to reject completely. It consists of four elements Tags, Filters, Pixel, Private. See the objects page as well.
Deidentifier
The concrete implementation of a Protocol. Takes a DICOM Dataset and then either aplies a transformation to it, or rejects it. Different deidentifiers can implement the same protocol.
DICOM Tag
The name for a single type of data in a DICOM Dataset. Like
Modality,PatientAge, etc. All 4000+ official dicom tags are listed here. There are also private tags which can be created by any DICOM producer.Each dicom tag has an associated Value Representation (VR).
Value Representation (VR)
The ‘data type’ of a DICOM element’s value. For example an int, a string, a date. Also contains more exotic datatypes like ‘Unlimited Text’. All DICOM VR’s are listed in the DICOM standard
DICOM Element
A DICOM Tag - Value pair. Like
PatientName: A_Smith
DICOM Dataset
A set of DICOM elements. For extended description, see the objects page
ImageData element
A DICOM Element that contains image data. In most DICOM files this element is several times bigger than all other elements combined. The most common image data element is
PixelData(7FE0,0010) PixelDataBut there are several others:
(7FE0,0008) FloatPixelData (7FE0,0009) DoubleFloatPixelData (0070,0022) GraphicData (5400,0110) WaveformDataThis list might not be complete. The criteria for what constitutes ‘image data’ are not completely set.
ImageData elements can contain two types of PHI: Burnt-in image PHI and Dynamic image PHI
Burnt-in image PHI
Burnt-in / Static image PHI is always in the same place in an image. Many DICOM-producing modalities, especially in Ultrasound, write PHI like patient name and date of birth into the image. For a specific vendor, model and dataset type, this information can always be found at the same X-Y coordinates.
Dynamic image PHI
Dynamic image PHI has no pre-determined place. It is not added to the image on purpose. Faces and implant serial numbers fall into this category.
Action
An intended change to a single DICOM Element. The change is expressed as an Action code.
Delta
An observed change to a single DICOM Element. The change is expressed as a delta code.
Delta set
A set of Deltas for a set of distinct DICOM elements. See the objects page.
PHI
Personal Health Information. Also called PII (Personally Identifiable information) or `PI` (Personal Information). MIDOM prefers the term PHI as this is more specifically defined in the health domain.
For medical imaging data, personal health information is found in many of the standard DICOM elements. Obvious ones are patient name and date of birth. Next to that there are many more. The DICOM standard lists more than 400 elements for which some kind of processing is required to deidentify.
In addition to elements, PHI can also be present in image data directly as Burnt-in image PHI or Dynamic image PHI
Pixel data
A special DICOM Element that contains the bytes for the image component of a DICOM dataset. This element often take up many times more data than all other elements combined. Its processing is done by the specialized pixel module
Private tag
DICOM private tags are custom data elements that aren’t part of the standard specification, allowing healthcare organizations to store proprietary or specialized information. Private tags enhance flexibility, but are a well known PII leak risk. They are handled by the specialized private module.
More information on private tag structure can be found in the DICOM standard (very tough read), or in the pydicom docs (more understandable).