08.1 Healthcare data

 

Top 27 Free Healthcare Datasets for Machine Learning

Machine Learning is revolutionizing the world of healthcare. ML models can help predict patient deterioration, optimize logistics, assist with real-time surgery and even determine drug dosage. As a result, medical personnel are able to work more efficiently, serve patients better and provide higher quality healthcare. When developing and training machine learning models for healthcare, open and free datasets are an essential starting point for data scientists and engineers, and they can be hard to come by. Here are 22 excellent open datasets for healthcare machine learning.


Best Open Source Medical Datasets for Machine Learning Projects

The global healthcare system produces vast amounts of medical data on a daily basis, which has the potential to be utilized for machine learning applications. Across all industries, data is regarded as a precious asset that enables companies to gain a competitive edge, and the healthcare sector is no different.


Medical Datasets for Machine Learning

Every day the global healthcare system generates tons of medical data that — at least, theoretically — could be used for machine learning purposes. Regardless of industry, data is considered a valuable resource that helps companies outperform their rivals, and healthcare is not an exception. This site briefly discusses challenges you face when working with medical data and make an overview of publicly available healthcare datasets, along with practical tasks they help complete.


National Center for Health Statistics

The National Center for Health Statistics (NCHS) is a rich source of data for researchers, teachers, and students who want to perform data analysis. This site compiles key sources of information found on the NCHS website for those who are interested in analysis of NCHS data as well as documentation and methodology of NCHS data systems.


Analyze Boston

Analyze Boston is the City of Boston’s open data hub to find facts, figures, and maps related to lives within the city. Analyze Boston is working to make this the default technology platform to support the publication of the City’s public information, in the form of data, and to make this information easy to find, access, and use by a broad audience.


Centers for Disease Control and Prevention

CDC is the leading United States science-based, data-driven, service organization that protects the public’s health.


Data.gov

Data.gov is the United States government’s open data website. It provides access to datasets published by agencies across the federal government. Data.gov is intended to provide access to government open data to the public, achieve agency missions, drive innovation, fuel economic activity, and uphold the ideals of an open and transparent government.


Human-related biological databases

A collection of human-related biological databases and a mini-review classifying them into different categories according to their data types. See Table 1 in the linked article.