Datathon | Datasets

ELECTRONIC MEDICAL
RECORD DATASETS
MIMIC-IV Dataset

Introduction and and Access: https://mimic-iv.mit.edu/

When using this resource, please cite:

Johnson, A., Bulgarelli, L., Pollard, T., Horng, S., Celi, L. A., & Mark, R. (2020). MIMIC-IV (version 0.4). PhysioNet https://doi.org/10.13026/a3wn-hg05.

Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., .. & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215-e220.

eICU-CRD Dataset

Introduction and Documentation: https://eicu-crd.mit.edu/about/eicu/

When using this resource, please cite:

Pollard, T ,johnson, A., Raffia,]., Celi, L A., Badawi, 0., & Mark, R. (2019). eICU Collaborative Research Database (version 2.0). PhysioNet. https://doi.org/10 13026/C2WM1R. The elCU Collaborative Research Database, a freely available multi-center database for critical care research. Pollard TJ, _Johnson AEW, Raffa Celi LA, Mark RG and Badawi 0. Scientific Data (2018). DOI: http.//dx.doi.org/10.1038/sdata.2018.178.

MIMIC-III

Introduction and Documentation: https://physionet.org/content/mimiciii/1.4/

When using this resource, please cite:

Johnson, A., Pollard, T., & Mark, R. (2016). MIMIC-III Clinical Database (version 1.4). PhysioNethttps://doi.org/10.13026/C2XW26.

Additionally, please cite the original publication:

Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035.

Please include the standard citation for PhysioNet: 
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., … & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

 

MEDICAL IMAGING
DATASETS
MIMIC CXR Dataset
NIH Chest X-ray dataset
The Diffusion-Simulated Connectivity Dataset (DISCO)

When using this resource, please cite:

Rafael-Patino, Jonathan; Girard, Gabriel; Truffet, Raphael; Pizzolato, Marco; Caruyer, Emmanuel; Thiran, Jean-Philippe (2022), “The Diffusion-Simulated Connectivity Dataset”, Mendeley Data, V2, doi: 10.17632/fgf86jdfg6.2

Standford CheXpet dataset
PAIP 2020: MSI Prediction Colorectal Cancer

*Individual Data Access Request Required

Data and Documentation: https://paip2020.grand-challenge.org/

When using this resource, please cite:
https://paip2020.grand-challenge.org/
PAIP 2021: Perineural Invasion in Multiple Organ Cancer

*Individual Data Access Request Required

When using this resource, please cite: https://paip2021.grand-challenge.org/Home/

LDPolypVideo

When using this resource, please cite:

Yiting. Ma, Xuejin. Chen, Kai. Cheng, Yang. Li and Bin. Sun. “LDPolypVideo Benchmark: A Large-scale Colonoscopy Video Dataset of Diverse Polyps”, Medical Image Computing and Computer Assisted Intervention Society, 2021

MELA: Mediastinal Lesion Analysis

*Individual Data Access Request Required

Data and Documentation: https://mela.grand-challenge.org/

When using this resource, please cite:

https://mela.grand-challenge.org/

3D Medical Image Dataset from Medical Segmentation Decatholon

Data and Documentation: http://medicaldecathlon.com/

When using this resource, please cite:

https://arxiv.org/abs/1902.09063

VerSe2020

Data and Documentation: https://osf.io/t98fz/

When using this resource, please cite:

https://arxiv.org/pdf/2001.09193.pdf

Multi-Centre, Multi-Vendor & Multi-Disease Cardiac Image Segmentation Challenge (M&Ms)

Data and Documentation: https://www.ub.edu/mnms/

When using this resource, please cite:

https://ieeexplore.ieee.org/document/9458279

SARAS-MESAD
DFUC RBC-12-Dataset

*Individual Data Access Request Required

FU Seg

When using this resource, please cite: https://doi.org/10.5281/zenodo.4575314

PAIP 2019: Liver Cancer Segmentation

*Individual Data Access Request Required

Data and Documentation: https://paip2019.grand-challenge.org/

When using this resource, please cite: 

Kim, Y. J., Jang, H., Lee, K., Park, S., Min, S. G., Hong, C., Park, J. H., Lee, K., Kim, J., Hong, W., Jung, H., Liu, Y., Rajkumar, H., Khened, M., Krishnamurthi, G., Yang, S., Wang, X., Han, C. H., Kwak, J. T., Ma, J., … Choi, J. (2021). PAIP 2019: Liver cancer segmentation challenge. Medical image analysis67, 101854. https://doi.org/10.1016/j.media.2020.101854

AGGC2: Automated Gleason Grading Challenge 2022

*Please reach out to the organizers individually if you are interested in using this dataset.

Data and Documentation: https://aggc22.grand-challenge.org/

When using this resource, please cite:
https://aggc22.grand-challenge.org/