By clicking “Accept All Cookies”, you agree to the storing of cookies on your device to enhance site navigation, analyze site usage, and assist in our marketing efforts. View our Privacy Policy for more information.
Case Study
June 18, 2026

How an RCM Consulting Firm Cut Engagement Ramp Time and Scaled Their Data Operations

Andrew Luo
Andrew is the co-founder & CEO at OneSchema.

Overview

Revenue cycle management engagements start with a data problem. Every hospital client runs a different EHR — Epic, Cerner, Meditech — and exports patient, claim, and financial data in formats that don't match each other or the consulting platform's target schema. For an RCM consulting firm, multi-week data ramp at the start of each engagement was eroding margin and limiting the scope of strategic analysis they could deliver. By deploying OneSchema's File Agents, the firm achieved a 6x improvement in pipeline delivery speed — compressing data onboarding from weeks to days and enabling consultants to begin extracting insights from clean, normalized data on day one of every engagement.

Before OneSchema

RCM consulting requires deep analytical work: identifying denial patterns, modeling revenue recovery opportunities, analyzing charge capture gaps, evaluating authorization workflows. None of that work can begin until the underlying data is clean, normalized, and structurally coherent. And in most hospital environments, that data arrives in a state that requires significant preparation before any analysis can run on it.

Epic exports look different from Cerner exports. Cerner looks different from Meditech. Clearinghouse files from Optum don't align with files from Waystar or Availity. ERP data from Workday uses different field structures than Oracle or SAP. The claims data that makes up the core of an RCM engagement contains ICD-10 and CPT codes that need to be validated. Patient account records need referential integrity across visit and claim records — a patient ID in the charge data has to resolve to the same patient in the encounter data and the payer data. Denial records have to be matchable back to the original claims they reference. The data arrives in pieces, from systems that were never designed to interoperate, and has to be assembled into a coherent whole before any revenue cycle analysis can meaningfully proceed.

For consulting teams, this means the first weeks — sometimes longer — of every hospital engagement go toward data engineering rather than the strategic analysis the engagement was scoped to deliver. Custom transformation scripts are written. Brittle SQL pipelines are built. Excel is used to reconcile what can't be automated. When the data is finally clean enough to use, the scope of the opportunity assessment is already constrained by how much engagement time remains and by whatever the cleanup process revealed about what data could and couldn't be made usable.

The problem compounds at the organizational level. Because each engagement requires bespoke data engineering, the firm's capacity for new engagements is partly gated by its data engineering resources. Analysts with RCM domain expertise are stuck in spreadsheets. Data engineers build pipelines they'll never reuse. When the next hospital engagement starts, the process begins again — another custom ingestion layer, another few weeks of ramp, another set of one-off scripts that won't transfer to the client after.

The quality risks compound too. Data assembled manually from disparate systems carries the inherent risk that referential integrity wasn't fully verified, that fields were mapped incorrectly under time pressure, or that edge cases in the source system's export format weren't accounted for. Recommendations built on imperfect data carry the downstream risk of being wrong — or, worse in a healthcare context, of introducing regulatory exposure for the hospital client.

Why OneSchema

The firm needed to break the pattern of one-off data engineering per engagement. Their requirement was a platform that could deploy AI Agents to autonomously handle the entire data lifecycle — intake, prep, and reconciliation — across any EHR, clearinghouse, or ERP system, without custom scripting for each new client's stack. And critically, those agents had to be trustworthy with healthcare data. A generic AI tool isn't. RCM data involves ICD-10 codes, CPT codes, patient identifiers, authorization records, and claim-level financial data — the kind of domain-specific complexity that a general-purpose automation layer will get wrong in ways that aren't immediately visible. Recommendations built on bad data don't just waste engagement time; they create regulatory exposure for hospital clients. OneSchema builds AI Agents specifically designed for data operations use cases, with the domain knowledge of each use case built in. That's what makes the output reliable enough to run engagements on.

OneSchema's AI Agents addressed the full source system landscape that hospital engagements involve. Patient account details, transactions, charges, denials, claims, contracts, authorizations, discharge records, encounter visits, and survey data from Epic, Cerner, Meditech, Allscripts, payer clearinghouses, and ERP systems are autonomously ingested, mapped, and normalized to a consistent target schema. The agents handle the entire prep and reconciliation process: built-in validation covers medical coding fields — ICD-10, CPT codes — financial data, authorization records, and the referential integrity checks that ensure patient, visit, and claim records align before any analysis runs on them. No spreadsheet work required.

The AI Agents also handle the file types that don't arrive in structured formats. Contracts and policy documents that arrive as PDFs are extracted and parsed into structured, queryable data by the same agents processing the structured files. Source files arriving via SFTP, S3, or API flow through the same autonomous pipeline as files uploaded manually. The format of the incoming data — CSV, Excel, PDF, or EDI — doesn't determine whether the agents can process it.

For the firm, the evaluation came down to a practical question: could they start a new hospital engagement with a new EHR environment and have clean, analysis-ready data flowing into their analytics layer on day one, without a custom data engineering sprint? OneSchema's answer was yes. Rather than building one-off pipelines per client, the team deploys AI Agents configured for a new engagement's data sources. The same agents that worked for one Epic implementation work for the next Cerner implementation, with configuration adjustments rather than new code.

Impact

The impact on engagement economics was direct and measurable. Data ramp that had previously consumed multiple weeks of capacity compressed to days. Consultants moved from data cleanup to insights faster because the AI Agents were handling the intake, prep, and reconciliation work autonomously — the same multi-week process that had always preceded the analysis ran in the background without consuming engagement capacity. More billable hours went toward identifying revenue recovery opportunities, modeling denial patterns, and analyzing charge capture. The manual spreadsheet work that had defined the start of every hospital engagement was eliminated.

A 6x improvement in pipeline delivery speed changed what was feasible within an engagement's scope. Opportunity assessments that previously had to be scoped conservatively to account for the cleanup timeline could be scoped to the actual complexity of the client's revenue cycle. Analysis that would have been cut for time became possible. The recommendations the firm delivered were grounded in more complete data, processed more rigorously, and produced faster — all because the agents were doing the data lifecycle work autonomously from day one.

The scalability improvement compounded over multiple engagements. Because OneSchema's AI Agents are configurable rather than custom-coded, bringing on a new hospital client no longer required rebuilding the data engineering layer from scratch. The approach that worked for one Epic environment worked for the next Cerner environment. Teams that had been bottlenecked by data engineering capacity could take on more engagements. Analysts with RCM expertise spent their time on RCM analysis rather than spreadsheet reconciliation — which is what deploying AI Agents for operations is supposed to produce.

The quality improvement had its own downstream significance. Because every file passes through agents with consistent, documented logic — referential integrity checks, medical coding validation, financial field verification — the data foundation for each engagement is more reliable than what manual processes produced. Recommendations are more defensible. The regulatory risk that comes from building analysis on misaligned or incomplete data is reduced systematically, not managed case by case, and every file that flows through is audit-ready by default.

Overview

Revenue cycle management engagements start with a data problem. Every hospital client runs a different EHR — Epic, Cerner, Meditech — and exports patient, claim, and financial data in formats that don't match each other or the consulting platform's target schema. For an RCM consulting firm, multi-week data ramp at the start of each engagement was eroding margin and limiting the scope of strategic analysis they could deliver. By deploying OneSchema's File Agents, the firm achieved a 6x improvement in pipeline delivery speed — compressing data onboarding from weeks to days and enabling consultants to begin extracting insights from clean, normalized data on day one of every engagement.

Before OneSchema

RCM consulting requires deep analytical work: identifying denial patterns, modeling revenue recovery opportunities, analyzing charge capture gaps, evaluating authorization workflows. None of that work can begin until the underlying data is clean, normalized, and structurally coherent. And in most hospital environments, that data arrives in a state that requires significant preparation before any analysis can run on it.

Epic exports look different from Cerner exports. Cerner looks different from Meditech. Clearinghouse files from Optum don't align with files from Waystar or Availity. ERP data from Workday uses different field structures than Oracle or SAP. The claims data that makes up the core of an RCM engagement contains ICD-10 and CPT codes that need to be validated. Patient account records need referential integrity across visit and claim records — a patient ID in the charge data has to resolve to the same patient in the encounter data and the payer data. Denial records have to be matchable back to the original claims they reference. The data arrives in pieces, from systems that were never designed to interoperate, and has to be assembled into a coherent whole before any revenue cycle analysis can meaningfully proceed.

For consulting teams, this means the first weeks — sometimes longer — of every hospital engagement go toward data engineering rather than the strategic analysis the engagement was scoped to deliver. Custom transformation scripts are written. Brittle SQL pipelines are built. Excel is used to reconcile what can't be automated. When the data is finally clean enough to use, the scope of the opportunity assessment is already constrained by how much engagement time remains and by whatever the cleanup process revealed about what data could and couldn't be made usable.

The problem compounds at the organizational level. Because each engagement requires bespoke data engineering, the firm's capacity for new engagements is partly gated by its data engineering resources. Analysts with RCM domain expertise are stuck in spreadsheets. Data engineers build pipelines they'll never reuse. When the next hospital engagement starts, the process begins again — another custom ingestion layer, another few weeks of ramp, another set of one-off scripts that won't transfer to the client after.

The quality risks compound too. Data assembled manually from disparate systems carries the inherent risk that referential integrity wasn't fully verified, that fields were mapped incorrectly under time pressure, or that edge cases in the source system's export format weren't accounted for. Recommendations built on imperfect data carry the downstream risk of being wrong — or, worse in a healthcare context, of introducing regulatory exposure for the hospital client.

Why OneSchema

The firm needed to break the pattern of one-off data engineering per engagement. Their requirement was a platform that could deploy AI Agents to autonomously handle the entire data lifecycle — intake, prep, and reconciliation — across any EHR, clearinghouse, or ERP system, without custom scripting for each new client's stack. And critically, those agents had to be trustworthy with healthcare data. A generic AI tool isn't. RCM data involves ICD-10 codes, CPT codes, patient identifiers, authorization records, and claim-level financial data — the kind of domain-specific complexity that a general-purpose automation layer will get wrong in ways that aren't immediately visible. Recommendations built on bad data don't just waste engagement time; they create regulatory exposure for hospital clients. OneSchema builds AI Agents specifically designed for data operations use cases, with the domain knowledge of each use case built in. That's what makes the output reliable enough to run engagements on.

OneSchema's AI Agents addressed the full source system landscape that hospital engagements involve. Patient account details, transactions, charges, denials, claims, contracts, authorizations, discharge records, encounter visits, and survey data from Epic, Cerner, Meditech, Allscripts, payer clearinghouses, and ERP systems are autonomously ingested, mapped, and normalized to a consistent target schema. The agents handle the entire prep and reconciliation process: built-in validation covers medical coding fields — ICD-10, CPT codes — financial data, authorization records, and the referential integrity checks that ensure patient, visit, and claim records align before any analysis runs on them. No spreadsheet work required.

The AI Agents also handle the file types that don't arrive in structured formats. Contracts and policy documents that arrive as PDFs are extracted and parsed into structured, queryable data by the same agents processing the structured files. Source files arriving via SFTP, S3, or API flow through the same autonomous pipeline as files uploaded manually. The format of the incoming data — CSV, Excel, PDF, or EDI — doesn't determine whether the agents can process it.

For the firm, the evaluation came down to a practical question: could they start a new hospital engagement with a new EHR environment and have clean, analysis-ready data flowing into their analytics layer on day one, without a custom data engineering sprint? OneSchema's answer was yes. Rather than building one-off pipelines per client, the team deploys AI Agents configured for a new engagement's data sources. The same agents that worked for one Epic implementation work for the next Cerner implementation, with configuration adjustments rather than new code.

Impact

The impact on engagement economics was direct and measurable. Data ramp that had previously consumed multiple weeks of capacity compressed to days. Consultants moved from data cleanup to insights faster because the AI Agents were handling the intake, prep, and reconciliation work autonomously — the same multi-week process that had always preceded the analysis ran in the background without consuming engagement capacity. More billable hours went toward identifying revenue recovery opportunities, modeling denial patterns, and analyzing charge capture. The manual spreadsheet work that had defined the start of every hospital engagement was eliminated.

A 6x improvement in pipeline delivery speed changed what was feasible within an engagement's scope. Opportunity assessments that previously had to be scoped conservatively to account for the cleanup timeline could be scoped to the actual complexity of the client's revenue cycle. Analysis that would have been cut for time became possible. The recommendations the firm delivered were grounded in more complete data, processed more rigorously, and produced faster — all because the agents were doing the data lifecycle work autonomously from day one.

The scalability improvement compounded over multiple engagements. Because OneSchema's AI Agents are configurable rather than custom-coded, bringing on a new hospital client no longer required rebuilding the data engineering layer from scratch. The approach that worked for one Epic environment worked for the next Cerner environment. Teams that had been bottlenecked by data engineering capacity could take on more engagements. Analysts with RCM expertise spent their time on RCM analysis rather than spreadsheet reconciliation — which is what deploying AI Agents for operations is supposed to produce.

The quality improvement had its own downstream significance. Because every file passes through agents with consistent, documented logic — referential integrity checks, medical coding validation, financial field verification — the data foundation for each engagement is more reliable than what manual processes produced. Recommendations are more defensible. The regulatory risk that comes from building analysis on misaligned or incomplete data is reduced systematically, not managed case by case, and every file that flows through is audit-ready by default.