How Does Cumulus ETL Use NLP?

One of the big features of Cumulus is the ability to easily run natural language processing (NLP) over your clinical notes.

The general idea is that Cumulus ETL runs NLP over your clinical notes, extracts symptoms of interest, and records those findings alongside the coded FHIR data.

This way you can often surface symptoms that simply aren’t recorded in the traditional FHIR data.

NLP Is Always Specific to a Clinical Purpose

The first thing to understand is that Cumulus ETL always runs NLP in the context of a specific clinical purpose, which we’ll call a “study.”

Each study’s design will have its own needs and its own NLP strategy, so Cumulus ETL supports multiple approaches.

Example: The covid_symptom study uses cTAKES and a negation transformer working together to tag COVID-19 symptoms in clinical notes. But another study might use the Llama2 large language model, with a prompt like “Does this patient have a nosebleed?”

But With Re-Usable Code

While the clinical “business logic” of how to drive NLP is inevitably study-specific, the code structure of Cumulus ETL is generic. That’s what makes it easy to support multiple different NLP strategies.

We’ll go into more depth about what an NLP task does under the covers below. But for now, here’s a basic outline of how Cumulus ETL runs an NLP study task:

Prepare the clinical notes
Hand those notes to a bit of study-specific Python code
Record the structured results in an AWS Athena database

Because Cumulus ETL has a growing internal library of NLP support code (things like automatically caching results, calling known interfaces like Hugging Face’s inference API, or configuring cTAKES with a custom dictionary), the study-specific Python code can focus on the clinical logic.

Example Code

In pseudocode, here’s the Python code for a task that talks to an LLM like Llama2 might look like:

for clinical_note in etl.read_notes():
    prompt = "Does the patient in this clinical note have a nosebleed?"
    yield etl.ask_llama2(prompt, clinical_note)

Those calls to etl.* are calls to the internal NLP support code that the task does not have to re-invent.

And with that relatively low level of complexity (though finding the right prompt can be hard), you’ve got a study task that you can run over all your institution’s clinical notes.

Available NLP Strategies

Large Language Models (LLMs)

Cumulus ETL makes it easy to pass clinical notes to an LLM, which are often difficult to set up yourself.

Some LLMs are freely-distributable like Meta’s Llama2, and thus can be run locally. While others are cloud-based proprietary LLMs like OpenAI’s ChatGPT, which your institution may have a HIPAA Business Associate Agreement (BAA) with.

Cumulus ETL can handle either type.

Local LLMs

With a local LLM, your notes never leave your network and the only cost is GPU time.

Which is great! But they can be complicated to set up. That’s where Cumulus ETL can help by shipping turnkey configurations for these LLMs.

See full details below, but the basic idea is that Cumulus ETL will download the LLM for you, configure it for study needs, and launch it. We’ll also be able to offer recommendations on what sort of hardware you’ll need (for example, Llama2 works well with two NVIDIA A100 GPUs).

Only LLama2 is supported right now. But Cumulus ETL uses the standard Hugging Face inference interface as an abstraction layer, so integrating new local LLMs is a lightweight process.

Cloud LLMs

Your institution may have a BAA to share protected health information (PHI) with a cloud LLM.

Talking to a cloud LLM is very similar to a local LLM. Instead of making an internal network call to a Docker container, Cumulus ETL makes an external network call to the cloud.

The exact API is different, but the concept is the same. And importantly, 99% of the Cumulus ETL workflow is the same. It would just swap out the actual call to the LLM.

One additional challenge with cloud LLMs is reproducibility, but recording metadata like the current time and vendor version in the database along with the results can at least help explain changes over time.

cTAKES

Apache cTAKES is a tried and true method of tagging symptoms in text.

A Cumulus ETL study can pass clinical notes to cTAKES and augment its results by using:

a custom dictionary to focus specifically on terms of interest
a cNLP transformer to improve negation detection (e.g. “does not have a fever” vs “has a fever”)

Others

Or any other new transformers or services could be integrated, as needed. If a new study required a new service, Cumulus ETL can add support for it, and then any study would be able to use it.

Technical Workflow

How does it all really work though? Be warned that this next section will get a little technical.

Docker Integration

Services like cTAKES and Llama2 can be launched with a single command, because we ship Docker definitions for them.

All you have to bring to the table is your own GPU hardware.

Example

As an example, let’s say you want to run the covid_symptom study. The command below will launch all the services that study needs. In this case, that means cTAKES and two different cNLP transformers.

docker compose --profile covid-symptom-gpu up --wait

That command works because Cumulus ETL ships a Docker Compose file with stanzas like:

ctakes-covid:
  image: smartonfhir/ctakes-covid:1.1.1
  environment:
    - ctakes_umlsuser=umls_api_key
    - ctakes_umlspw=${UMLS_API_KEY:-}
  networks:
    - cumulus-etl
  profiles:
    - covid-symptom-gpu

Docker will download the referenced image and launch it with the specified configuration.

Study Task

Once you’ve prepared the services the study will need with Docker Compose, you can actually run Cumulus ETL on your clinical notes.

Run the specific NLP study task you are interested in. For example, docker compose run cumulus-etl-gpu --task covid_symptom__nlp_results …
Cumulus ETL will read your DocumentReference FHIR resources.
It will download the clinical notes mentioned by those DocumentReferences.
It will feed those notes to an NLP service (in this case, to cTAKES).
It will write the results (but not the note!) out to an Athena database, just like it does with basic FHIR resources. In this example, the results might be a list of COVID-19 symptoms that cTAKES found in the note.

Where Does the Note/PHI Live?

After Cumulus ETL downloads the clinical note and runs NLP on it, it no longer needs the note.

The note is never pushed to Athena (only the NLP results are).

Some aspects of the note might be cached. For example cTAKES results are cached, so that we only need to run cTAKES once per note. But that will be in a special PHI-capable folder that you provide Cumulus ETL with. That is separate location from the Athena databases and can be entirely local to your machine.

NLP Results

The NLP responses are written to an Athena database and can be queried using SQL. Usually by study-specific SQL integrated into the Cumulus Library.

In the covid_symptom example we’ve been using, the Athena database row for a fever cTAKES match in a clinical note would look something like (in JSON form):

{
  "id": "<anonymized ID>",
  "docref_id": "<anonymized ID>",
  "encounter_id": "<anonymized ID>",
  "subject_id": "<anonymized ID>",
  "generated_on": "2020-01-20T20:00:00+00:00",
  "task_version": 3,
  "match": {
    "begin": 36,
    "end": 41,
    "text": "fever",
    "polarity": 0,
    "conceptAttributes": [
      {"code": "386661006", "cui": "C0015967", "codingScheme": "SNOMEDCT_US", "tui": "T184"},
      {"code": "50177009", "cui": "C0015967", "codingScheme": "SNOMEDCT_US", "tui": "T184"}
    ],
    "type": "SignSymptomMention"
  }
}

As the Cumulus Library SQL processes all the detected symptoms and cross-references the patients & encounters, it generates counts of patients with fever, headaches, etc. Those counts are then sent to the Cumulus Dashboard and can then finally be displayed as digestible charts.

And that’s the lifecycle of a clinical note! It starts inside your EHR, flows through the ETL & related NLP services, its symptoms end up in Athena, and counts of those symptoms get sent to & displayed in the Dashboard.