The Digital Brain Tumour Atlas, an open histopathology resource (2024)

  • Journal List
  • Sci Data
  • v.9; 2022
  • PMC8847577

As a library, NLM provides access to scientific literature. Inclusion in an NLM database does not imply endorsem*nt of, or agreement with, the contents by NLM or the National Institutes of Health.
Learn more: PMC Disclaimer | PMC Copyright Notice

The Digital Brain Tumour Atlas, an open histopathology resource (1)

Link to Publisher's site

Sci Data. 2022; 9: 55.

Published online 2022 Feb 15. doi:10.1038/s41597-022-01157-0

PMCID: PMC8847577

PMID: 35169150

Thomas Roetzer-Pejrimovsky,The Digital Brain Tumour Atlas, an open histopathology resource (2)1 Anna-Christina Moser,1 Baran Atli,1 Clemens Christian Vogel,1 Petra A. Mercea,1,2 Romana Prihoda,2,3 Ellen Gelpi,1 Christine Haberler,1 Romana Höftberger,1 Johannes A. Hainfellner,1 Bernhard Baumann,4 Georg Langs,5 and Adelheid Woehrer1

Author information Article notes Copyright and License information PMC Disclaimer

The data described in this article are referenced by "The DNA methylation landscape of glioblastoma disease progression shows extensive heterogeneity in time and space" in Nat Med, volume 24 onpage1611.

Associated Data

Data Citations
Data Availability Statement

Abstract

Currently, approximately 150 different brain tumour types are defined by the WHO. Recent endeavours to exploit machine learning and deep learning methods for supporting more precise diagnostics based on the histological tumour appearance have been hampered by the relative paucity of accessible digital histopathological datasets. While freely available datasets are relatively common in many medical specialties such as radiology and genomic medicine, there is still an unmet need regarding histopathological data. Thus, we digitized a significant portion of a large dedicated brain tumour bank based at the Division of Neuropathology and Neurochemistry of the Medical University of Vienna, covering brain tumour cases from 1995–2019. A total of 3,115 slides of 126 brain tumour types (including 47 control tissue slides) have been scanned. Additionally, complementary clinical annotations have been collected for each case. In the present manuscript, we thoroughly discuss this unique dataset and make it publicly available for potential use cases in machine learning and digital image analysis, teaching and as a reference for external validation.

Subject terms: CNS cancer, Cancer in the nervous system, Cancer microenvironment, CNS cancer

Measurement(s)Cancer Histology • Cellularity Measurement • Total Sample Tissue Area • brain neoplasm
Technology Type(s)Hematoxylin and Eosin Staining Method • digital curation • Histology and Immunohistochemistry Shared Resource
Factor Type(s)age • sex • location
Sample Characteristic - Organismhom*o sapiens

Open in a separate window

Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.16652272

Background & Summary

Brain tumours account for a large fraction of years of potential life lost as compared with tumours from other sites1, and have a significant negative impact on patients’ quality of life2. Overall, they are relatively uncommon neoplasms with an incidence of approximately 24 per 100.000 person-years3. Current diagnostic guidelines published by the WHO define approximately 150 distinct brain tumour types and assign grades I to IV, based on malignancy and potential to malignant transformation or progression. They are mainly differentiated by their histopathological phenotypes and molecular alterations4. While the majority of tumours is diagnosed solely based on histopathology, an integrated approach is mandatory for 19 tumour types.

Still, more accurate diagnostic distinctions are needed in order to i) better assess individual patients’ prognoses and ii) support more robust therapeutic decisions4,5. Recently, diagnostic algorithms trained on DNA methylation data have been shown to significantly increase diagnostic accuracy6. Similar advances focusing on histopathological data have been hampered, so far, by the lack of freely available histopathology datasets7. Most available histopathology data such as those available through TCGA8, IvyGAP9,10 or TCIA11 focus on only a few diagnostic entities. They mostly consist of digitized fresh frozen tissue sections, which feature relatively poor tissue morphology as compared to formalin-fixed and paraffin-embedded tissues. Still, even with these limited data, computational algorithms have been successfully trained - amongst others - for survival prediction12, detection of tumour-infiltrating lymphocytes13, and assessments of tumour microvessels14. However, larger datasets encompassing an even wider range of brain tumours and featuring improved cellular and morphological characteristics are necessary to further develop these algorithms and extend their applicability to the entire spectrum of brain tumour types.

Thus, we set out to compile a comprehensive resource of digitized Haematoxylin-eosin(H&E)-stained brain tumour whole slide images (WSIs) with clinical annotations (Fig.1). We aimed to capture the complete spectrum of brain tumours as encountered in day-to-day medical diagnostic practice. Importantly, we managed to specifically digitize slides of exceedingly rare pathologies, which are usually, if ever, seen only a few times in a pathologist’s lifetime. By performing a manual review of each slide, we ensure high scan quality and actuality of provided diagnoses. We envisage this dataset to be used for advancing digital pathology-based machine learning and for teaching purposes. Importantly, this dataset can be used for (1) inter-tumour comparisons thanks to the wide inclusion of distinct brain tumour types as well as (2) within-tumour-type investigations thanks to the inclusion of a large number of samples for the common tumour types.

Open in a separate window

Fig. 1

Overview of the data acquisition and publication process. First, histological slides and clinical records of brain tumour patients were retrieved from the biobank of the Division of Neuropathology and Neurochemistry, Medical University of Vienna. Then, slides were digitized using a Hamamatsu slidescanner. Clinical data were translated into standardized annotations. At least two experienced neuropathologists checked each slide scan to ensure conformity of the diagnosis with the current revised 4th edition of the “WHO Classification of Tumours of the Central Nervous System” and sufficient scan quality. Ambiguous cases were excluded and WSIs of inferior quality were re-scanned. Finally, data were made available via EBRAINS to the international research community. (Brain illustration adapted from Meaghan Hendricks from the Noun Project).

Methods

Sample acquisition

H&E stained tumour slides from FFPE tissues, which were collected for routine diagnostics in the time interval of 1995–2019 have been obtained from the biobank of the Division of Neuropathology & Neurochemistry, Medical University of Vienna. We digitized each slide in high magnification (40x objective, 228 nm/pixel) using a Hamamatsu NanoZoomer 2.0 HT slide-scanner. Each slide was manually reviewed to ensure high scan quality and sufficient diagnostic tumour tissue. Samples with equivocal diagnoses or missing molecular work-up otherwise needed to assign an integrated WHO 2016 diagnosis were excluded. A subset of glioblastoma scans (n = 381) has been published previously as part of the GBMatch study15.

Basic clinical annotations consisting of patient age and sex as well as tumour location and recurrence were acquired from local electronic records where available. Tumour locations have been assigned to the following 19 categories: frontal; parietal; insular; occipital; temporal; cerebellar; brain stem; spinal; lateral ventricle; diencephalon; third ventricle; fourth ventricle; sellar region; cranial nerves; basal ganglia; cerebral, NOS (not otherwise specified); posterior fossa, NOS; cranial, NOS; and other.

This study complies with the relevant ethical, legal and institutional regulations and the study protocol has been approved by the Ethics Committee of the Medical University of Vienna (EK1691–2017). Participant informed consent has been obtained as by institutional guidelines, necessitating restrictions on commercial use of the obtained data.

Estimation of cell density and scanned tissue area

Additionally, the total tissue area and the average cellularities were estimated for each scan using a custom MATLAB script (MATLAB R2017b, MathWorks) with a similar approach as previously published15,16. In summary, H&E stained WSIs were first colour-deconvoluted into separate Haematoxylin and Eosin channels17. Then, global, Phansalkar and Otsu thresholding were applied to the Haematoxylin channel to identify nuclei18,19. Watershedding was used to separate densely clustered cells20. Only cells with a minimum size of 4 pixels were kept. The total tissue area was determined by averaging all colour channels, thresholding at a threshold of 220, followed by binary close and open operations.

Data Records

Data are provided via EBRAINS21 as one ndpi-file per sample, sorted by diagnostic tumour type (in alphabetical order) for easier access. It is possible to download single files directly or all files of a specific tumour type or the whole dataset using a download manager (such as the Chrono Download Manager for the Google Chrome browser). Furthermore, supplementary clinical information, estimated cell densities and scanned tissue area is provided in a csv-spreadsheet with one row per tumour sample. An overview of all spreadsheet variables and descriptions is given in Table1.

Table 1

Recorded clinical variables and corresponding descriptions.

VariableDescription
uuidunique sample identifier
pat_idunique patient identifier
diagnosisprimary diagnosis according to the WHO Classification of Tumours of the Central Nervous System (2016)
gradeWHO grade according to the WHO Classification of Tumours of the Central Nervous System (2016)
subtypefurther specification of the histopathological subtype which is not a distinct entity as defined by the WHO, if applicable
secondary_diagnosissecondary diagnosis in cases where two distinct diagnosis according to the WHO are applicable
control1 if sample is a control sample without tumour tissue
agepatient age at the time of surgery
sexbiological patient sex
locationlist (in square brackets) of all applicable tumour locations; empty if location is unknown
lateralitylaterality of the tumour (left or right)
cellularityestimated cell density of the tissue (given in 1/mm2)
tissue_areaestimated scanned tissue area (in mm2)
recurrence0 if the entry corresponds to a primary tumour resection; if the entry corresponds to a tumour recurrence, the number of the recurrence is given (e.g., 2 corresponds to the second recurrence)
commentnotable findings that do not fit in other columns (e.g., important mutations not yet integrated in the WHO classification; other non-tumour pathologies in the control samples)

Open in a separate window

A total of 3,115 histological slides of 2,880 patients have been scanned. A total of 126 distinct diagnostic tumour types could be included. There are 1,395 female and 1,462 male patients in the dataset. The mean patient age at brain tumour surgery was 45 years, ranging from 9 days to 92 years. 2,530 of the scanned slides originated from primary operations and 538 from re-operations. See online-only Table1 for descriptive properties broken down by tumour type. Descriptive visualizations of patient age, sex, tumour location, cellularity, and scanned tissue area are given in Fig.2. Of note, we also scanned exceptionally rare tumour types such as melanotic schwannomas or liponeurocytomas (Fig.3). A total of 47 non-tumour slides from different non-tumour CNS regions and with different pathologies were included as controls.

Open in a separate window

Fig. 2

Descriptive statistics of the ‘Digital Brain Tumour Atlas’ patient cohort (not including control patients). (a) The age distribution by sex shows a bimodal distribution with most patients belonging to the higher-age categories. Since some uncommon tumour types like medulloblastoma occur mainly in children and have been strategically over-sampled, there is also a peak in younger patients. (b) The distribution of the different WHO grades shows a slight predominance of grade I and grade IV tumours. Of note, some tumour entities are not assigned WHO grades (‘NA’) and very few tumour types are assigned intermediate grades II-III (a total of five cases, not shown in the figure). (c) Tumour distribution with colour-coded locations and ratio-specific circle sizes. (Brain illustration adapted from Patrick J. Lynch, wikimedia) (d) Distribution of the cell densities of all included tumour samples by tumour grade. Note that lower-grade tumours are not necessarily less cell dense (e.g., in the case of cellular schwannoma). (e) The distribution of the scanned tissue areas (per slide).

Open in a separate window

Fig. 3

Exemplary images from exceedingly rare brain tumours, which are included in the DBTA. (a) Perineurioma component of a hybrid nerve sheath tumour. (b) Angiosarcoma. (c) Lymphoplasmacyte-rich meningioma. (d) Crystal-storing histiocytosis. (e) Embryonal tumour with multilayered rosettes. (f) Melanotic schwannoma. (g) Angiocentric glioma. (h) Cerebellar liponeurocytoma. (i) Pituicytoma.

Online-only Table 1

Overview of the frequencies and descriptive statistics of all brain tumour types included in the DBTA.

# of scansMedian ageAge rangeCellularityTissue AreaF:M ratioMost frequent location
Adamantinomatous craniopharyngioma8530 years2–79 years5219 ± 254/mm281 ± 7 mm21.36sellar region
Anaplastic astrocytoma, IDH-mutant4743 years24–65 years3765 ± 282/mm2162 ± 17 mm20.74frontal
Anaplastic astrocytoma, IDH-wildtype4754 years12–79 years4633 ± 361/mm285 ± 14 mm20.57temporal
Anaplastic ependymoma507 years0.3–77 years5571 ± 298/mm2138 ± 16 mm20.92fourth ventricle
Anaplastic ganglioglioma536 years14–70 years3129 ± 801/mm296 ± 44 mm2NAtemporal
Anaplastic meningioma4663 years30–84 years6291 ± 380/mm2223 ± 17 mm21.19frontal
Anaplastic oligodendroglioma, IDH-mutant and 1p/19q codeleted9149 years30–78 years4899 ± 217/mm2162 ± 11 mm20.72frontal
Anaplastic pleomorphic xanthoastrocytoma148 yearsNA2933/mm2245 mm2NAcerebral, NOS
Angiocentric glioma38 years3–35 years3519/mm2138 mm22temporal
Angiomatous meningioma3260 years43–79 years4185 ± 330/mm2201 ± 14 mm21.13frontal
Angiosarcoma266 years53–79 years2440/mm2112 mm2NAcerebral, NOS
Astroblastoma740 years12–83 years5417 ± 440/mm2120 ± 24 mm21.33frontal
Atypical choroid plexus papilloma426 years0.3–56 years6555/mm298 mm21lateral ventricle
Atypical meningioma8358 years13–92 years6901 ± 222/mm2225 ± 11 mm21.13frontal
Atypical teratoid/rhabdoid tumour173 years0.3–10 years6846 ± 791/mm261 ± 21 mm21.29temporal
CNS ganglioneuroblastoma10 yearsNA7247/mm2306 mm2NAfrontal
Cellular schwannoma2554 years27–79 years7459 ± 526/mm2128 ± 20 mm21.08spinal
Central neurocytoma2028 years6–41 years8053 ± 650/mm2110 ± 22 mm20.67lateral ventricle
Cerebellar liponeurocytoma450 years43–57 years6924/mm2125 mm2NAcranial nerves
Chondrosarcoma2137 years6–73 years5642 ± 621/mm2132 ± 28 mm21.33cranial, NOS
Chordoid glioma of the third ventricle434 years26–42 years5074/mm236 mm20.33third ventricle
Chordoid meningioma1247 years35–73 years4408 ± 776/mm2210 ± 17 mm23cranial, NOS
Chordoma2861 years4–85 years2808 ± 342/mm2114 ± 16 mm20.56cranial, NOS
Choriocarcinoma176 yearsNA8954/mm2131 mm2NAsellar region
Choroid plexus carcinoma73 years0.5–46 years6778 ± 691/mm2100 ± 35 mm2NAcerebral, NOS
Choroid plexus papilloma2129 years0.2–78 years5454 ± 467/mm2156 ± 26 mm20.91fourth ventricle
Clear cell meningioma1339 years8–74 years5027 ± 648/mm2182 ± 32 mm21.17sellar region
Crystal-storing histiocytosis162 yearsNA1435/mm2211 mm2NANA
Desmoplastic infantile astrocytoma and ganglioglioma111 years0.5–23 years5206 ± 642/mm2180 ± 37 mm21.5parietal
Diffuse astrocytoma, IDH-mutant7037 years18–60 years3013 ± 171/mm2105 ± 12 mm20.64frontal
Diffuse astrocytoma, IDH-wildtype1958 years20–77 years2730 ± 315/mm290 ± 23 mm20.36frontal
Diffuse large B-cell lymphoma of the CNS5968 years9–84 years6021 ± 450/mm290 ± 13 mm21.46frontal
Diffuse leptomeningeal glioneuronal tumour12 yearsNA8070/mm28 mm2NAfrontal
Diffuse midline glioma, H3 K27M-mutant2119 years3–64 years4258 ± 460/mm226 ± 7 mm21.1brain stem
Dysembryoplastic neuroepithelial tumour2531 years8–57 years2410 ± 196/mm276 ± 16 mm20.79temporal
Dysplastic cerebellar gangliocytoma138 yearsNA2345/mm2196 mm2NAcerebellar
EBV-positive diffuse large B-cell lymphoma, NOS134 yearsNA6595/mm25 mm2NAfrontal
Embryonal carcinoma139 yearsNA4888/mm2291 mm2NAparietal
Embryonal tumour with multilayered rosettes, C19MC-altered32 years2–3 years6087/mm2231 mm22parietal
Ependymoma4649 years2–78 years4813 ± 347/mm294 ± 12 mm20.96spinal
Ependymoma, RELA fusion-positive612 years4–55 years5814 ± 1401/mm2138 ± 40 mm20.5lateral ventricle
Epitheloid MPNST150 yearsNA3003/mm270 mm2NAother
Erdheim-Chester disease157 yearsNA2194/mm2239 mm2NANA
Ewing sarcoma46 years0.8–28 years8370/mm291 mm21spinal
Extraventricular neurocytoma136 yearsNA1193/mm2107 mm2NAspinal
Fibrosarcoma227 years20–34 years4639/mm2199 mm21cerebral, NOS
Fibrous meningioma5758 years12–84 years6103 ± 237/mm2228 ± 13 mm26.12cranial, NOS
Follicular lymphoma362 years62–64 years8741/mm2276 mm22occipital
Gangliocytoma136 yearsNA1127/mm210 mm2NAoccipital
Ganglioglioma8821 years2–65 years2932 ± 153/mm2110 ± 9 mm20.6temporal
Ganglioneuroma233 years27–39 years4228/mm2212 mm21other
Gemistocytic astrocytoma, IDH-mutant638 years29–56 years2036 ± 228/mm2121 ± 25 mm20.5temporal
Germinoma2016 years9–33 years7091 ± 686/mm221 ± 6 mm20.11diencephalon
Giant cell glioblastoma2143 years11–86 years3170 ± 301/mm2181 ± 20 mm20.62temporal
Glioblastoma, IDH-mutant3438 years25–73 years4867 ± 296/mm2172 ± 19 mm21frontal
Glioblastoma, IDH-wildtype47462 years17–87 years4481 ± 96/mm2151 ± 5 mm20.66temporal
Gliosarcoma5957 years9–86 years4794 ± 276/mm2221 ± 14 mm20.44temporal
Granular cell tumour of the sellar region146 yearsNA2172/mm297 mm2NAsellar region
Haemangioblastoma8850 years16–81 years5119 ± 185/mm2109 ± 10 mm21cerebellar
Haemangioma3051 years0.2–76 years2796 ± 292/mm2133 ± 15 mm22cranial, NOS
Haemangiopericytoma3439 years25–83 years9064 ± 489/mm2186 ± 18 mm20.48cranial, NOS
Hybrid nerve sheath tumours358 years32–72 years3342/mm2227 mm20.5spinal
Immature teratoma715 years0.0–56 years7927 ± 913/mm2107 ± 28 mm20.4third ventricle
Immunodeficiency-associated CNS lymphoma553 years31–73 years5209 ± 1227/mm265 ± 34 mm20.67cerebral, NOS
Inflammatory myofibroblastic tumour126 yearsNA5226/mm2298 mm2NAfrontal
Intravascular large B-cell lymphoma270 years62–78 years1242/mm2427 mm2NANA
Juvenile xanthogranuloma123 yearsNA12519/mm275 mm2NANA
Langerhans cell histiocytosis3213 years1–53 years6848 ± 565/mm2104 ± 19 mm20.78parietal
Leiomyoma150 yearsNA1864/mm228 mm2NAcranial, NOS
Leiomyosarcoma458 years50–77 years6955/mm2213 mm21occipital
Lipoma3810 years0.3–76 years757 ± 66/mm2120 ± 16 mm20.9spinal
Liposarcoma152 yearsNA2697/mm2107 mm2NAspinal
Low-grade B-cell lymphomas of the CNS1367 years50–83 years8087 ± 1088/mm286 ± 25 mm21.17spinal
Lymphoplasmacyte-rich meningioma246 years37–55 years9817/mm237 mm21cranial, NOS
MALT lymphoma of the dura568 years39–79 years9843 ± 1672/mm239 ± 23 mm21.5cranial, NOS
Malignant peripheral nerve sheath tumour1561 years17–81 years5886 ± 794/mm2136 ± 28 mm20.88spinal
Mature teratoma610 years0.2–49 years3259 ± 760/mm2135 ± 45 mm20.2spinal
Medulloblastoma, SHH-activated and TP53-mutant316 years0.5–52 years9539/mm2117 mm22cerebellar
Medulloblastoma, SHH-activated and TP53-wildtype930 years1–75 years10544 ± 581/mm2185 ± 38 mm20.5cerebellar
Medulloblastoma, WNT-activated713 years6–65 years7641 ± 954/mm279 ± 20 mm20.4fourth ventricle
Medulloblastoma, non-WNT/non-SHH328 years3–34 years8799 ± 412/mm2113 ± 13 mm20.33fourth ventricle
Melanotic schwannoma364 years51–69 years3110/mm255 mm20.5cranial nerves
Meningeal melanocytoma551 years35–54 years8152 ± 1898/mm2172 ± 60 mm20.67spinal
Meningeal melanoma261 years51–71 years6763/mm2160 mm2NAspinal
Meningothelial meningioma10455 years25–88 years5951 ± 212/mm2162 ± 10 mm24.47cranial, NOS
Metaplastic meningioma475 years56–85 years4613/mm2226 mm2NAfrontal
Metastatic tumours4758 years38–78 years5092 ± 399/mm2159 ± 14 mm20.88spinal
Microcystic meningioma2348 years33–75 years4475 ± 382/mm2213 ± 23 mm22.83frontal
Mixed germ cell tumour520 years12–44 years4379 ± 1084/mm2142 ± 45 mm20.25spinal
Myxopapillary ependymoma2335 years11–71 years3188 ± 360/mm2154 ± 16 mm20.64spinal
Neurofibroma1644 years0.7–65 years3640 ± 446/mm2151 ± 29 mm20.6spinal
Olfactory neuroblastoma1058 years27–69 years5053 ± 780/mm2213 ± 34 mm20.25cranial, NOS
Oligodendroglioma, IDH-mutant and 1p/19q codeleted8546 years12–73 years3587 ± 174/mm2136 ± 12 mm20.7frontal
Osteochondroma114 yearsNA2388/mm239 mm2NAspinal
Osteoma948 years40–69 years1570 ± 638/mm2107 ± 33 mm21.25frontal
Osteosarcoma830 years17–54 years4328 ± 498/mm2176 ± 20 mm23cerebral, NOS
Papillary craniopharyngioma1361 years44–82 years4941 ± 427/mm265 ± 13 mm20.86sellar region
Papillary ependymoma235 years35–35 years5573/mm2146 mm2NAspinal
Papillary glioneuronal tumour212 years12–13 years5877/mm2117 mm2NAcerebral, NOS
Papillary meningioma338 years20–61 years7270/mm2198 mm2NAtemporal
Papillary tumour of the pineal region1117 years4–48 years6094 ± 807/mm282 ± 28 mm21.75third ventricle
Paraganglioma1754 years25–69 years6734 ± 468/mm2165 ± 24 mm20.89spinal
Perineurioma123 yearsNA3580/mm226 mm2NAother
Pilocytic astrocytoma17311 years0.6–51 years3327 ± 117/mm2105 ± 8 mm20.66cerebellar
Pilomyxoid astrocytoma247 years0.4–56 years4073 ± 362/mm245 ± 11 mm21cranial nerves
Pineal parenchymal tumour of intermediate differentiation644 years9–55 years9287 ± 619/mm286 ± 44 mm20.2diencephalon
Pineoblastoma523 years1–67 years7436 ± 1352/mm287 ± 43 mm24third ventricle
Pineocytoma619 years1–40 years4086 ± 1245/mm260 ± 38 mm20.5diencephalon
Pituicytoma347 years27–67 years5231/mm233 mm22sellar region
Pituitary adenoma9954 years16–80 years7842 ± 225/mm256 ± 6 mm20.98sellar region
Pleomorphic xanthoastrocytoma2129 years5–72 years4215 ± 368/mm2145 ± 23 mm21.38temporal
Plexiform neurofibroma112 yearsNA12310/mm266 mm2NANA
Psammomatous meningioma2866 years29–83 years5201 ± 372/mm2125 ± 15 mm28.33spinal
Rhabdoid meningioma563 years52–89 years5016 ± 475/mm2235 ± 62 mm20.67occipital
Rhabdomyosarcoma351 years49–62 years2474/mm2299 mm22cranial, NOS
Rosette-forming glioneuronal tumour1125 years13–47 years3997 ± 822/mm256 ± 21 mm21.75cerebellar
Schwannoma8153 years14–78 years5715 ± 229/mm2124 ± 10 mm20.93cranial nerves
Secretory meningioma4158 years40–80 years6112 ± 313/mm2154 ± 16 mm212.67cranial, NOS
Spindle cell oncocytoma147 yearsNA4958/mm213 mm2NANA
Subependymal giant cell astrocytoma1415 years10–33 years3336 ± 447/mm2136 ± 31 mm20.56lateral ventricle
Subependymoma2454 years8–81 years1829 ± 151/mm2125 ± 20 mm21.18lateral ventricle
T-cell and NK/T-cell lymphomas of the CNS154 yearsNA7400/mm218 mm2NAcerebral, NOS
Tanycytic ependymoma141 yearsNA5440/mm2232 mm2NAspinal
Teratoma with malignant transformation140 yearsNA7202/mm2122 mm2NAfrontal
Transitional meningioma6858 years29–82 years7221 ± 213/mm2203 ± 13 mm22.09frontal
Undifferentiated pleomorphic sarcoma162 yearsNA5773/mm2401 mm2NAcranial, NOS

Open in a separate window

Cellularity and Tissue area are given as [mean ± SEM].

Technical Validation

All cases were initially selected based on the given diagnosis in the diagnostic electronic records. To ensure conformity with the WHO 2016 diagnosis, all slides have been independently reviewed by two neuropathologists experienced in neuro-oncology. In disputed cases, a third senior neuropathologist was consulted. Older cases with missing necessary molecular analyses were not included in the dataset.

Inter- and intraobserver variability is one factor that contributes to misdiagnoses or discrepant diagnoses. We mitigated the risk by including only cases that had already undergone thorough routine diagnostic work-up and were additionally reviewed independently by at least two neuropathologists as described above. In this way, we also ensured excellent image quality and the presence of sufficient diagnostic tumour tissue on each WSI. Scans with suboptimal image quality were either re-scanned (if possible) or excluded.

Usage Notes

Data access

The data can be accessed via EBRAINS21. In order to download the data set, users have to register with EBRAINS and agree to the general terms of use, access policy as well as the data use agreement for pseudonymised human data (https://ebrains.eu/terms). The data are distributed under the conditions that users cite the respective DOI, adhere to EBRAINS’ Data Use Agreement and do not use the data for commercial purposes.

WSI processing

The ndp.view2 (© Hamamatsu) software can be freely used to view and annotate slide scans saved in the ndpi format22. Alternatively, most other WSI programs such as the open-source OMERO software platform23 and the open-source QuPath software24 can work directly on ndpi-files. However, most programming languages and non-specialized image processing software cannot handle ndpi-files out of the box. Thus, we also provide a toolbox of MATLAB scripts that depend on the openslide library25 and can be used to

  1. Automatically tile large slide scans and export multiple smaller image patches in a given magnification.

  2. Convert annotation-files (.ndpa) to overlays, which can be used to extract specific regions of interest.

  3. Estimate the total tissue area on a WSI.

  4. Estimate the cell density on a WSI.

Of note, slide thickness and staining intensity vary to some degree, resulting in a slightly different histological appearance of each slide. Thus, for machine learning applications, we recommend astain normalization step such as WSICS26, more recent methods employing generative adversarial networks27 or style transfer learning28. Moreover, heavy stain colour augmentation should be performed29. Of note, the stain normalization step can be omitted with only a negligible drop in performance as has been shown by Tellez et al.29.

Acknowledgements

T.R. is a recipient of a DOC Fellowship (25262) of the Austrian Academy of Sciences at the Division of Neuropathology and Neurochemistry, Department of Neurology, Medical University of Vienna. The present work has been further supported by the Austrian Science Fund 1000 ideas project TAI98-B to A.W.

Online-only Table

Author contributions

T.R. and A.W. conceived and designed the project. T.R., A.C.M., C.C.V., P.M., B.A. and R.P. collected the data. T.R., E.G., R.H., C.H., J.A.H. and A.W. reviewed the data. T.R., B.B. and G.L. performed the image analysis. T.R. and A.W. wrote the paper with contributions from all authors.

Code availability

The custom-made MATLAB toolbox for loading, viewing and processing of ndpi & ndpa files and for estimating the total tissue area and average cell density of a WSI can be accessed at: https://github.com/tovaroe/WSI_histology.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1. Rouse C, Gittleman H, Ostrom QT, Kruchko C, Barnholtz-Sloan JS. Years of potential life lost for brain and CNS tumors relative to other cancers in adults in the United States, 2010. Neuro. Oncol. 2015;18:70–77. doi:10.1093/neuonc/nov249. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

2. Liu R, Page M, Solheim K, Fox S, Chang SM. Quality of life in adults with brain tumors: Current knowledge and future directions. Neuro. Oncol. 2009;11:330–339. doi:10.1215/15228517-2008-093. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

3. Ostrom QT, et al. CBTRUS Statistical Report: Primary Brain and Other Central Nervous System Tumors Diagnosed in the United States in 2013–2017. Neuro. Oncol. 2020;22:iv1–iv96. doi:10.1093/neuonc/noaa200. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

4. International Agency for Research on Cancer & Wiestler, O. D. WHO Classification of Tumours of the Central Nervous System. (International Agency for Research on Cancer, 2016).

5. van den Bent MJ, et al. A clinical perspective on the 2016 WHO brain tumor classification and routine molecular diagnostics. Neuro. Oncol. 2017;19:614–624. doi:10.1093/neuonc/now277. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

6. Capper D, et al. DNA methylation-based classification of central nervous system tumours. Nature. 2018;555:469–474. doi:10.1038/nature26000. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

7. Djuric U, Zadeh G, Aldape K, Diamandis P. Precision histology: how deep learning is poised to revitalize histomorphology for personalized cancer care. npj Precision Oncology. 2017;1:1–5. doi:10.1038/s41698-017-0022-1. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

9. Puchalski RB, et al. An anatomic transcriptional atlas of human glioblastoma. Science. 2018;360:660–663. doi:10.1126/science.aaf2666. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

10. Ivy Glioblastoma Atlas Project. https://glioblastoma.alleninstitute.org/ (2018).

11. National Cancer Institute Clinical Proteomic Tumor Analysis Consortium (CPTAC). Radiology Data from the Clinical Proteomic Tumor Analysis Consortium Glioblastoma Multiforme [CPTAC-GBM] collection. 10.7937/K9/TCIA.2018.3RJE41Q1 (2018).

12. Mobadersany P, et al. Predicting cancer outcomes from histology and genomics using convolutional networks. Proc. Natl. Acad. Sci. USA. 2018;115:E2970–E2979. doi:10.1073/pnas.1717139115. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

13. Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images. Cell Rep. 23, 181–193.e7 (2018). [PMC free article] [PubMed]

14. Li X, Tang Q, Yu J, Wang Y, Shi Z. Microvascularity detection and quantification in glioma: a novel deep-learning-based framework. Lab. Invest. 2019;99:1515–1526. doi:10.1038/s41374-019-0272-3. [PubMed] [CrossRef] [Google Scholar]

15. Klughammer J, et al. The DNA methylation landscape of glioblastoma disease progression shows extensive heterogeneity in time and space. Nat. Med. 2018;24:1611–1624. doi:10.1038/s41591-018-0156-x. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

16. Roetzer T, et al. Evaluating cellularity and structural connectivity on whole brain slides using a custom-made digital pathology pipeline. J. Neurosci. Methods. 2019;311:215–221. doi:10.1016/j.jneumeth.2018.10.029. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

17. Ruifrok AC, Johnston DA. Quantification of histochemical staining by color deconvolution. Anal. Quant. Cytol. Histol. 2001;23:291–299. [PubMed] [Google Scholar]

18. A Threshold Selection Method from Gray-Level Histograms. https://ieeexplore.ieee.org/document/4310076.

19. Adaptive local thresholding for detection of nuclei in diversity stained cytology images. https://ieeexplore.ieee.org/abstract/document/5739305.

20. Topographic distance and watershed lines. Signal Processing38, 113–125 (1994).

21. Roetzer-Pejrimovsky T, 2021. The Digital Brain Tumour Atlas, an open histopathology resource. EBRAINS. [CrossRef]

23. Allan C, et al. OMERO: flexible, model-driven data management for experimental biology. Nat. Methods. 2012;9:245–253. doi:10.1038/nmeth.1896. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

24. Bankhead P, et al. QuPath: Open source software for digital pathology image analysis. Sci. Rep. 2017;7:1–7. doi:10.1038/s41598-017-17204-5. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

25. Goode A, Gilbert B, Harkes J, Jukic D, Satyanarayanan M. OpenSlide: A vendor-neutral software foundation for digital pathology. J. Pathol. Inform. 2013;4:27. doi:10.4103/2153-3539.119005. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

26. Stain Specific Standardization of Whole-Slide Histopathological Images. https://ieeexplore.ieee.org/abstract/document/7243333. [PubMed]

27. Zanjani, F. G., Zinger, S., Bejnordi, B. E., van der Laak, J. A. W. M. & de With, P. H. N. Stain normalization of histopathology images using generative adversarial networks. 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018)10.1109/isbi.2018.8363641 (2018).

28. Bug, D. et al. Context-Based Normalization of Histological Stains Using Deep Convolutional Features. in Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support 135–142 (Springer, Cham, 2017).

29. Quantifying the effects of data augmentation and stain color normalization in convolutional neural networks for computational pathology. Med. Image Anal. 58, 101544 (2019). [PubMed]

Articles from Scientific Data are provided here courtesy of Nature Publishing Group

The Digital Brain Tumour Atlas, an open histopathology resource (2024)
Top Articles
Latest Posts
Article information

Author: Lakeisha Bayer VM

Last Updated:

Views: 5788

Rating: 4.9 / 5 (49 voted)

Reviews: 80% of readers found this page helpful

Author information

Name: Lakeisha Bayer VM

Birthday: 1997-10-17

Address: Suite 835 34136 Adrian Mountains, Floydton, UT 81036

Phone: +3571527672278

Job: Manufacturing Agent

Hobby: Skimboarding, Photography, Roller skating, Knife making, Paintball, Embroidery, Gunsmithing

Introduction: My name is Lakeisha Bayer VM, I am a brainy, kind, enchanting, healthy, lovely, clean, witty person who loves writing and wants to share my knowledge and understanding with you.