- Office of the National Coordinator for Health Information Technology (ONC)
- 6/16/2015
- Linking of Clinical and Other Data for Research
- Use of Clinical Data for Research
STATUS: Completed Project
BACKGROUND
Linking and aggregating patient data from disparate sources cannot be done without “matching” these data in a secure manner that protects patent privacy. Effective patient matching allows users of the data (whether patients, providers, or researchers) to draw correct inferences using the data that have been linked together. Without accurate patient matching, stakeholders may inadvertently draw inaccurate conclusions that could significantly impair patient safety and privacy.
PROJECT PURPOSE & GOALS
Under the Office of the National Coordinator for Health Information Technology (ONC), this project identified the best patient attributes to address the challenge of linking patients’ data across research, clinical, and claims data sets in order to support the patient-centered outcomes research (PCOR) data infrastructure that enables standardization and sharing of patient data across organizations.
Project Objectives:
-
Improve data quality by standardizing patient attributes and algorithms that can be used to reliably perform patient matching across clinical and claims data sets to improve algorithm match rates.
-
Create an open source visual tool for patient matching and aggregation.
-
Create a privacy and security application programming interface (API) or PCOR infrastructure security “layer”.
-
Include clinical data research networks in the piloting and testing of the proposed standards and services.
-
Integrate the National Plan and Provider Enumeration System (NPPES) provider identification as an additional attribute to improve patient matching across sources.
PROJECT ACHIEVEMENTS & HIGHLIGHTS
-
To advance improvements to patient matching algorithms, the team launched the Patient Matching Algorithm Challenge and developed open-source, patient matching test harness software.
-
The project team developed a gold standard data set and used it to test a widely adopted patient matching algorithm in the Gold Standard & Algorithm Testing (GSAT) for Patient Matching Pilot, a collaboration with Oregon Community Health Information Network (OCHIN) and the Kaiser Permanente Center for Health Research’s (CHR) Data Coordinating Center.
-
To advance the improvement of data quality, the project team pilot tested an implementation of the Patient Data Demographic Quality (PDDQ) Framework in collaboration with OCHIN and the Kaiser Permanente CHR.
-
The team developed a trusted security layer to enable patient control regarding how their data is shared. They also developed tools and reference implementations to facilitate data sharing and assembly of a complete longitudinal patient record from multiple sources.
-
The team worked collaboratively with the Centers for Medicare & Medicaid Services (CMS) to implement the API for NPPES on the CMS website and develop a plan to promote its adoption and use by stakeholders needing to obtain information on health care providers and health plans.
PUBLICATIONS, PRESENTATIONS, AND OTHER PUBLICALLY AVAILABLE RESOURCES
Resources:
-
ONC published a final report, “Patient Matching, Aggregation, and Linking (PMAL) Project Final Report”, in 2019. The report is available here: https://aspe.hhs.gov/sites/default/files/private/pdf/259016/PMAL-Final-Report-08162019v2.pdf
-
ONC published a 2019 report on the pilot conducted to test the Data Quality Framework for Patient Matching Framework. The report is available here: https://aspe.hhs.gov/sites/default/files/private/pdf/259016/PMAL_PDDQ_Paper_08292019.pdf
-
The Patient Match Test Harness is publicly available for download from GitHub here: http://mitre.github.io/test-harness-interface/
-
The Open Source Toolkit for Patient Matching and Aggregation is available on GitHub. Key resources from this toolkit include:
- Test Harness interface: http://mitre.github.io/test-harness-interface/
- Description of Patient Match Services: https://github.com/mitre/ptmatch/blob/master/api/swagger.yaml
- Description of eCQM Services: https://github.com/mitre/ecqm/blob/master/api/swagger.yaml
-
Tool Specifications Document: Design & Component Development here: https://github.com/mitreid-connect/OpenID-Connect-Java-Spring-Server/
-
API profile specifications are available here: https://openid.bitbucket.io/HEART/
RELATED PROJECTS
Below is a list of ASPE-funded PCORTF projects that are related to this project
Expanding Data Collection for the National Program of Cancer Registries (NPCR) for Comparative Effectiveness Research (CER) - Enhanced data collection and data linkages between central cancer registries and other health related datasets can provide researchers with essential real-world and population-based data for CER. This project built upon previous funding provided to central cancer registries operated by 10 states—part of the National Program of Cancer Registries (NPCR)—to collect detailed treatment and outcomes data from electronic data sources. The PCORTF funding extended longitudinal data collection for a subset of cancer patients in these 10 states, enabling patient-level CER of cancer outcomes, and to improve the tools and timeliness of cancer reporting to the Centers for Disease Control.
PCOR: Privacy and Security Blueprint, Legal Analysis and Ethics Framework for Data Use, & Use of Technology for Privacy - Patient level data are essential to understanding and improving health outcomes. These data must be made available to researchers in a way that ensures the protection of patient privacy while providing sufficient granularity to allow meaningful research questions to be assessed. However, current laws and policies around the use of patient level data are nuanced and sometimes conflicting, creating confusion for researchers, providers, and patients. This is a collaborative effort between the ONC and Centers for Disease Control (CDC) to conduct research and create resources to improve the privacy of patients and their data.