How to use synthetic patient data with Fasten
- Choose a dataset that matches the workflow you want to mimic (FHIR Bulk exports, claims-heavy data, pediatrics, etc.).
- Convert it to NDJSON if necessary - Fasten’s
/ehi_exportendpoint provides FHIR resources in newline-delimited JSON (ndjson) format, with one resource per line. The datasets shared below are in a variety of formats, and may require some manipulation. - Load it into your pipeline tooling - e.g., drop ndjson files into the same object storage bucket that you store Fasten exports in, or POST bundles against your local test endpoint.
- Run the exact conversions/validations you expect to run for real patients: schema validation, de-identification, transformations, downstream notifications.
- Swap the input source with Fasten’s
/ehi_exportpayloads when you are ready for end-to-end testing; no code changes should be required if your fixtures match the contract.
Keep the original archive of each dataset in version control or object storage so test runs are reproducible and easy to diff across releases.
Recommended datasets
- https://synthea.mitre.org/downloads
- https://build.fhir.org/ig/HL7/standard-patient-health-record-ig/branches/master/personas-index.html
- https://synthetichealth.github.io/synthea/
- https://github.com/hapifhir/fhir-tutorial/blob/master/Transactions/lesson.md
- https://physionet.org/content/mimic-iv-fhir-demo/2.0/
- https://physionet.org/content/mimiciii/1.4/
- https://github.com/smart-on-fhir/sample-bulk-fhir-datasets
- https://www.mdpi.com/2079-9292/11/8/1199
- https://mimic.mit.edu/fhir/downloads.html
CCDA
- https://github.com/jmandel/sample_ccdas
- https://github.com/chunli866/CCDAtoFHIRSamples
- https://github.com/microsoft/FHIR-Converter/tree/main/data/SampleData/Ccda

