Skip Navigation

How to Download

FRDR offers multiple ways to download datasets. Learn more in our documentation.

LUMID: Large-scale Unlabled Medical Imaging Dataset for Unsupervised and Self-supervised Learning

Description: LUMID is a large-scale, unlabeled collection of over 2 million medical images spanning multiple imaging modalities, including CT scans, X-rays, MRIs, and more. This dataset has been meticulously curated from publicly available medical imaging repositories, addressing the critical challenge of limited scale in existing public datasets and the inaccessibility of high-quality private datasets. The primary motivation behind creating this dataset is to empower the medical imaging community with a resource suited for developing and training advanced deep learning models. By enabling the use of unsupervised and self-supervised learning approaches, this dataset facilitates the learning of rich, transferable representations that can significantly enhance performance across various medical imaging tasks, including classification, segmentation, and anomaly detection.

Key Features:
1) Diversity: Comprising images from multiple modalities and a wide range of medical imaging scenarios.
2) Scalability: A dataset of unprecedented size, providing a robust foundation for training deep neural networks.
3) Versatility: Specifically designed for unsupervised and self-supervised learning methods, fostering innovation in representation learning for medical imaging.
4) Open Access: Built entirely from public datasets, ensuring transparency and reproducibility.

This dataset is intended to serve as a cornerstone for advancing research in medical AI, fostering the development of models capable of generalizing across diverse imaging types and clinical conditions.
Authors: Osman, Islam; University of British Columbia; ORCID iD 0000-0001-9935-2417
Gupta, Anubhav; University of British Columbia
Shehata, Mohamed S.; University of British Columbia
Braun, John W.; University of British Columbia
Keywords: Medical imaging
CT
MRI
X-rays
Field of Research: 
Computer and information sciences
>
Artificial intelligence (AI)
>
Computer vision in artificial intelligence
Publication Date: 2024-12-04
Publisher: Federated Research Data Repository / dépôt fédéré de données de recherche
URI: https://doi.org/10.20383/103.01017
Related Identifiers: 
This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

This dataset is derived from

Files in Dataset 
No files uploaded
Download entire dataset using Globus Transfer. This method requires a Globus account and installing software. Watch Video: Get Started with FRDR: Download a Dataset
Download with Globus
Files for this dataset are currently being backed up so it cannot be approved at this time. Please try later.

Access to this dataset is subject to the following terms:
Creative Commons Attribution 4.0 International (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
Citation
Osman, I., Gupta, A., Shehata, M., Braun, J. (2024). LUMID: Large-scale Unlabled Medical Imaging Dataset for Unsupervised and Self-supervised Learning. Federated Research Data Repository. https://doi.org/10.20383/103.01017