Key Takeaways
- 80%ย of clinical trials experience enrolment delays andย massive sunk costsย with underperformingย sites.
- Only 4% of US physiciansย participateย in clinicalย research, leaving 96% of patients inaccessibleย under otherย HCPsโย care.
- Tokenization links Citelineโs proprietary data;ย specifically,ย 300+ย millionย claims lives, 245+ million lab lives & 55 million+ EMRย patient livesย withย 1.7+ย millionย HCPย national provider identifiers (NPIs).
- Direct-to-patientย traditional advertising recruitmentย campaigns failย withย complexย protocol criteria, most commonly inย oncology and rare diseaseย protocols.
- Last-mile operationalization requiresย fair market valueย compensation and patient concierge services.
By Matt Holms, Vice President, Commercial Patient Engagement & Recruitment
Clinical trial siteย selectionย remainsย one of the pharma industry’s most expensive guessing games. The cost to set up a site can range from anywhere from $40,000 to $60,000.
Yet onlyย about 4% of physicians in the United Statesย participate inย clinical research.ย The chart below outlines the significant headwinds facing sponsors and sites for patient recruitment in the clinical trial industry.
Traditional patient recruitment approachesย can’t solve a scarcity problemย ย
Theย firstย generation of patient recruitment was the assumption that the clinical research sitesย that sponsorsย selectedย wouldย enrollย 100% of the trial with patients from their own database alone. Sponsors invest substantial resources intoย site selection, partnering with CROsย thatย claim superior investigator networks,ย that all tooย oftenย face rescue scenarios into Phase II and Phase III studies.
Theย secondย generationย of patient recruitmentย approachย attemptedย to circumvent siteย databaseย limitations through direct-to-patientย advertisingย campaignsย oftenย usingย a variety of outreach tactics to target patients not known to the study sites.ย While traditional patient recruitment campaigns haveย beenย effective for chronic indications, complex protocolย designsย expose this approachโsย fundamental weakness.
Manyย researchย sites are understaffed, and this is where the process often falls down. Rescue campaignsย are putย in place asย a last-ditchย effort, but if theย protocol is complex, patients are not able to self-report answers to critical I/E criteria in theย initial prescreener. This leads to high volumes ofย referrals that are not qualified against the key I/E criteria,ย overwhelming sites and further increasing burden.
Tokenization as infrastructure: Linkingย RWD & proprietaryย data for visibility at the patient, disease,ย and providerย Levelย
Citelineโsย visionย withย theย thirdย generationย of patientย recruitmentย replaces assumption-based planning with data-linked patient identification at population scale. This involvesย aggregating both proprietary and real-world data across multiple dimensions, creating visibility into patient populations that traditional site databases cannot access.
Tokenization technologyย aggregates disparateย data sources and eliminates silos by assigning unique patient tokens that link patient longitudinal data from sources such asย EMR, lab, and claims information into unifiedย patientย profiles. These patient tokensย canย beย linkedย to National Provider Identifier (NPI) numbers forย HCPs, as wellย asย facilityย NPIs, creating visibility into who treated patients,ย when theyย were treated,ย how theyย were treated, where treatment occurred, and what therapiesย were administered.
The IRT data redaction implications to results-based pricing
Privacy protection measures created unintended operational consequences thatย haveย destabilized recruitment vendor accountability models. Historically, recruitment companies verified thatย prescreenedย referred patientsย eitherย consented or randomized by matching Interactive Response Technologyย (IRT)ย data using unique identifiers like first and last initials combined with full date of birth. Sites updated vendor portalsย inconsistently, making IRT data theย gold standardย reconciliation mechanism for results-based pricing contractsย for recruitment vendors meeting patient delivery milestones.
There has been a big shift in the industry to redact that dataย whereย sponsorsโ IRT systems now typically onlyย collect yearย of birthย and/or genderย now.ย In order to corroborate that they actuallyย deliveredย aย consented orย randomizedย patient โ as this is the only mechanism for remuneration โ many vendors have resorted to bombarding sites with communicationsย toย validate if referrals signed anย informed consent formย (ICF) and ultimately randomized.
The accountability dilemma extends to referring physicians whoย operateย outside investigator networksย too. Non-investigator physicians face three simultaneous barriers: timeย requiredย for chart review and coordination, revenue loss from reimbursement structures, and patient loss from clinical continuity. Revenue and continuity concerns, such as violating anti-kickbackย statutes, require broader solutionsย toย facilitateย patient access to clinical trials.
The Last Mile: Why data identification isย only a partย of the solutionย ย
Citeline launched its tokenized patient match recruitment solution nearly twoย years ago, generating critical learnings about the gap between patient identification and patient randomization. The biggest learningย relates toย services and support infrastructure that facilitates getting a patient to the site, which the industry has recently coined โthe Last Mile.โ
Cross-vendor collaboration definesย ourย strategic approach. The company partners with organizations that have also tokenized data through Datavant, enabling token-sharing for direct-to-patient outreach whereย appropriate. Other partnerships focus onย providing โlastย mileโ patient concierge services that canย operateย on theย protected health information (PHI)ย sideย toย follow up with identified patients
Measurement challenges persistย whereย tracking conversion from identification to randomization requires coordination across MSLs, contracting teams, patient services, and sitesย โย a cross-functional orchestration that extends beyond traditional recruitment vendor scope.
From proprietary advantage to collaborative ecosystemย ย
Third-generation recruitmentย representsย a paradigm shiftย fromย simplyย hiring the best sites to identifying patientsย both in andย around sitesย at a specific moment in time based on their clinical pathway. It is important to frame patient access asย both aย data and service delivery challenge rather thanย simplyย a site selection problem.
Sponsors increasingly use tokenized RWD for protocol design and feasibility analysis before finalizingย inclusion-exclusion criteria, preventingย enrollmentย challenges through evidence-based protocol optimization. This trend still has a long way to go, with significant upside for the industry.
Therapeutic area segmentation matters:ย Direct-to-patient campaignsย retainย value in chronic disease indications where patients canย appropriatelyย answer prescreening questions, while oncology,ย rare diseaseย and anyย complex indicationsโย protocols require data-driven identification of protocol-matched patients and their treating physicians.
Third-generation recruitmentย willย succeedย not throughย justย proprietary advantage but through orchestrated collaboration that combines RWD/proprietary data, tokenization, artificial intelligence (AI), and complex services collaboration into randomized patients.



















