Castor Catalyst reads your clinical source data and structures it. Upload any data, Catalyst extracts it, a human reviewer accepts it, and you get clean records.
Upload PDF visit summaries, scanned paper PROs, lab results, EMR screenshots, or even handwritten worksheets. Catalyst handles the de-identification, extraction, and mapping. Your team reviews and approves before anything reaches the EDC.
So let them keep paper. Catalyst does the rest.
“Sites are better off with paper source… Editable in seconds for builds and amendments.”
Typed, scanned, or handwritten. If the study collects it, Catalyst reads it. Pick one below and watch it become structured data.
Illustrative examples. The confidence percentages shown are illustrative, not measured performance. Catalyst handles the document types your study produces, with the same source-to-EDC pipeline running underneath.
From one live, paper-based Site Upload study, with human review on every value. A single study, not a platform average, but real production data.
Site Upload moves sensitive source through Catalyst. The trust pillars cover what regulators, sites, and data managers all ask about first.
Site Upload works in any country, any language. Useful for studies that span US and non-US sites, or studies running entirely outside the US.
Studies where the bulk of source data is structured lab values and reference ranges, uploaded as PDFs by sites or participants, or wired in directly from a central lab.
Studies that capture lots of source data beyond standard CDISC fields: surgical variables, device data, complex endpoints. Catalyst trains a pipeline on your own example documents instead of hand-keying every field.
Surgical, oncology, and rare disease registries with chart review. Sites stay in their existing EMR workflow and upload source per the registry’s protocol.
Castor Catalyst’s Site Upload workflow lets sites upload any document the study produces directly to Catalyst. Common examples include PDF visit summaries (typed or scanned), scanned paper patient-reported outcomes, and lab results, either PDF panels uploaded by sites or participants, or central-lab feeds wired directly into Catalyst. Sites can also upload handwritten worksheets, operative reports, discharge letters, and other free-text clinical notes.
Once a source is uploaded, Catalyst de-identifies any remaining PHI by drawing redaction boxes over identifiers, then runs the document through an AI extraction pipeline. The extracted values are mapped to CDISC CDASH standards (or to pipelines tailored to your study when the source goes beyond CDISC, like surgical variables), passed through Human-in-the-Loop review by medically trained Castor staff, and committed to your Castor EDC with a Visual Audit Trail back to every source.
Site Upload works globally. Sites in any country, any language, can upload source documents directly. For US studies where participants can retrieve their own EHR records, see the Direct-to-Patient workflow.
PDFs (typed or scanned), photos, paper CRFs, paper PROs, lab panels (PDF from sites or participants, or central-lab feeds direct to Catalyst), surgical reports, discharge letters, and other free-text clinical notes. Handwriting is supported. Mixed multi-document scans are supported. If your source falls outside these types, the team confirms fit before the study starts.
Catalyst draws redaction boxes over participant identifiers before the document enters the extraction pipeline. The original source is stored in a logically segregated, encrypted Source Vault. Only pseudonymized structured data flows into the EDC. Source documents are purged 30 days after study close with a written destruction certificate.
Catalyst is one product with two entry points. If Direct-to-Patient isn’t quite the right fit for your study, here’s where to go next.
AI source-data extraction for clinical trials. The technology overview, both workflows, and the story in one place.
Participants retrieve their own records through HIPAA release or FHIR. No site upload. US studies.
Bring a real document from your study. An operative report, a paper PRO, a lab panel, an EMR screenshot. We’ll process it live and show you the audit trail field by field.