AIM-AHEAD Datasets and Analysis Platforms

DATASET

DATA

DATA TYPES

SIZE

EHR data from underserved communities

HIPAA limited dataset, individual-patient level data with dates and geographic indicators if needed for research

Subset of ~6M EHR

ANALYSIS PLATFORM

DATASET

DATA

DATA TYPES

SIZE

Multiple curated dataset options (further detail on website ) pre-curated or custom curated de-identified EHR, Limited Dataset, Full PHI EHR dataset, Imaging, Select clinical notes, select genomics data, synthetic data

Pre-curated datasets and custom-curated datasets of varying sizes. Curated from the EHR with over 5 million patient records.

ANALYSIS PLATFORM

DATASET

DATA

DATA TYPES

SIZE

Selected large-scale cohorts related to heart, lung, blood and sleep disorders. Includes both prospective clinical studies and associated genomic TOPMED data.

De-identified dataset. Including individual level genomic (TOPMED full genomes) and clinical datasets

ANALYSIS PLATFORM

NHLBI BioData Catalyst PIC-SURE and Seven Bridges Platforms

DATASET

DATA

DATA TYPES

SIZE

A variety of datasets available including clinical and genomic data

Public data, and controlled access data (depends on dataset)

ANALYSIS PLATFORM

DATASET

DATA

DATA TYPES

SIZE

The All of Us Research Program is building one of the largest biomedical data resources of its kind.

The All of Us Research Hub stores health data from a diverse group of participants from across the United States

616,000+ Participants; 360,000+ Electronic Health Records; 444,000+ Biosamples Received

ANALYSIS PLATFORM

DATASET

DATA

DATA TYPES

SIZE

ScHARe is a cloud-based research collaboration platform developed by the National Institute on Minority Health and Health Disparities and the National Institute of Nursing Research.

Google-hosted Public Datasets; ScHARe-hosted Public Datasets; ScHARe-hosted Project Datasets

ANALYSIS PLATFORM