Sample-Based Quality Assurance
Quality assurance is often sample-based, meaning only a partial picture of the entire system can be obtained.
The Juniperus project was realized with the support of the University Research Scholarship Programme (EKÖP) of the University of Pécs, within the framework of the 2025/2026 research period.
Professional support and methodological contribution for the use of audio materials and the audit framework were provided by the Gábor Aurél Research Group of the National Ambulance Service.
Juniperus is a research and development project focused on the AI-based, evidence-driven processing and quality auditing of emergency medical dispatch calls. The goal is not to replace human decision-making, but to create a reproducible analytical framework that enables the structured, searchable, and professionally interpretable examination of large volumes of calls.
Focus
Emergency medical dispatch, audit, clinical communication
Approach
LLM + rule-based structuring
Target Audience
Healthcare professionals and researchers
Format
Summary publication overview
Quality assurance is often sample-based, meaning only a partial picture of the entire system can be obtained.
Manual evaluation is difficult to scale, especially with high case volumes or when multiple audit dimensions must be assessed simultaneously.
Structured retrieval of communication and professional elements is often challenging even retrospectively.
An AI-assisted system could help make evaluation more transparent, faster, and more consistent.
The professional quality of emergency medical dispatch calls can directly influence the early stages of patient care. At the same time, retrospective auditing of calls is time-consuming, dependent on human resources, and often limited to only a small portion of the total call volume.

In the first step, raw audio is converted into a standardized processable format, then speech activity detection is used to isolate relevant speech segments. This reduces noise and provides a more suitable input for subsequent transcription and interpretation stages.
The system aims not only to determine what was said, but also who said it. Speaker separation is especially important in emergency dispatch environments, where instructions, clarifying questions, and caller responses carry different professional meanings.
The transcribed text is then transformed into a controlled structured format. The goal is to create a consistent, line-by-line searchable representation that preserves chronology, speaker attribution, and the auditability of textual evidence.
In the final stage, the structured text is analyzed according to a predefined audit framework. The system may only evaluate elements positively if direct evidence exists within the call transcript, thereby reducing the risks of hallucination and retrospective overinterpretation.
Juniperus is not a black-box system built around a single model, but a chain of multiple interconnected processing stages. The emphasis is on gradual, verifiable structuring derived from raw audio material.

Each evaluated element appears as an independent category, such as scene safety, consciousness, breathing, communication, or advice provided.
Positive evaluation can only be assigned if a concrete quotation or clearly identifiable text segment from the call supports the given criterion.
If an evaluation criterion is not applicable within the given clinical context, it must not be treated as an error, but instead categorized separately as non-relevant.
One of the central elements of Juniperus is a structured evaluation framework that examines different dimensions of emergency dispatch communication and professional data collection through separate criteria. This is not a general impression-based scoring system, but a detailed, evidence-linked evaluation process.

Structured processing makes it possible to apply the same audit logic consistently across large numbers of calls, enabling research comparability.
The method could help extend the review of communication processes that were previously examined only through sampling.
The system may make call materials more interpretable, but human review and clinical responsibility remain professionally central.
The project exists at the intersection of emergency communication, healthcare quality assurance, and applied artificial intelligence. Its potential value is not only technological, but also methodological.

Audio quality, background noise, and speech intelligibility may influence the quality of the entire processing pipeline.
Good structuring is not equivalent to complete professional truth; the system can only work from what was actually said.
Anonymization and data handling are especially sensitive areas in emergency call processing.
Validation of the methodology requires comparison with human auditors and prospective studies.
Juniperus is not a clinical decision-making system and does not replace professional auditors. Model-based processing remains sensitive to input quality, context, and the accuracy of the governing rule system. The core principle of the project is that the system must not improve missing professional content through assumptions, but instead separate what is provable from what cannot be verified.

Comparison of human and AI-based audits according to agreement, discrepancy, and uncertainty.
Methodological evaluation on larger and more heterogeneous datasets.
Development of more detailed communication metrics and feedback modules.
Transparent and publishable presentation of research-oriented results within healthcare and AI professional communities.
The current form represents a methodological foundation and the mindset of a working prototype. The next steps point less toward technical fine-tuning and more toward validation, comparability, and professional integration.
