Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (4)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Schwappach, D. L. B.
Right arrow Articles by Hochreutener, M.-A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Schwappach, D. L. B.
Right arrow Articles by Hochreutener, M.-A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

International Journal for Quality in Health Care 15:473-485 (2003)
© 2003 International Society for Quality in Health Care

‘Emerge’: benchmarking of clinical performance and patients’ experiences with emergency care in Switzerland

David L. B. Schwappach1, Annette Blaudszun2, Dieter Conen3, Heinz Ebner4, Klaus Eichler2 and Marc-Anton Hochreutener2

1 Department of Health Policy and Management, Faculty of Medicine, University Witten/Herdecke, Witten, Germany,
2 Verein Outcome, Zurich,
3 Department of Internal Medicine, Aarau, Switzerland,
4 Koeck, Ebner and Partner, Vienna, Austria

Objective. To assess the effects of uniform indicator measurement and group benchmarking followed by hospital-specific activities on clinical performance measures and patients’ experiences with emergency care in Switzerland.

Design. Data were collected in a pre–post design in two measurement cycles, before and after implementation of improvement activities. Trained hospital staff recorded patient characteristics and clinical performance data. Patients completed a questionnaire after discharge/transfer from the emergency unit.

Setting. Emergency departments of 12 community hospitals in Switzerland, participating in the ‘Emerge’ project.

Subjects. Eligible patients were entered into the study (18 544 in total: 9174 and 9370 in the first and second cycles, respectively), and 2916 and 3370 patients returned the questionnaire in the first and second measurement cycles, respectively (response rates 32% and 36%, respectively).

Main outcome measures. Clinical performance measures (concordance of prospective and retrospective assessment of urgency of care needs, and time intervals between sequences of events) and patients’ reports about care provision in emergency departments (EDs), measured by a 22-item, self-administered questionnaire.

Results. Concordance of prospective and retrospective assignments to one of three urgency categories improved significantly by 1%, and both under- and over-prioritization, were reduced. The median duration between ED admission and documentation of post-ED disposition fell from 137 minutes in 2001 to 130 minutes in 2002 (P < 0.001). Significant improvements in the reports provided by patients were achieved in 10 items, and were mainly demonstrated in structures of care provision and perceived humanity.

Conclusion. Undertaken in a real-world setting, small but significant improvements in performance measures and patients’ perceptions of emergency care could be achieved. Hospitals accomplished these improvements mainly by averting strong outliers, and were most successful in preventing series of negative events. Uniform outcomes measurement, group benchmarking, and data-driven hospital-specific strategies for change are suggested as valuable tools for continuous improvement. Several hospitals have already implemented the developed measures in their internal quality systems and subsequent measurements are projected.

Keywords: benchmarking, emergency care, patient satisfaction, quality of health care, safety

Accepted for publication July 31, 2003.


In 2000, the Swiss Federal Social Insurance Office and the ‘Verein Outcome’ (Outcome Association) initiated the nationwide project ‘Emerge’ to assess and improve the quality of care provided by Swiss emergency departments (EDs). The Outcome Association is a non-profit organization that was founded by the health authority of the Canton Zurich as part of a broad initiative for the improvement of hospital quality of care [1]. The primary aim of the project was to establish a nationwide pilot benchmarking project and to evaluate the methodology and feasibility for the initiation of quality improvement activities in hospitals. The focus was set on emergency care departments as these provide highly interdisciplinary care and are among the most sensitive, challenging, and risk-entailing areas of hospital care. In addition, EDs are of central importance for the corporate image and economic success of individual hospitals.

‘Emerge’ comprised seven phases: (1) selection of interested hospitals, participating on a voluntary basis; (2) joint development of a set of clinical performance indicators agreed upon by all parties; (3) establishment of a measurement system, development of measurement tools (manuals, training), and design of data collection instruments; (4) data collection in a first measurement cycle; (5) benchmarking of results and definition of shared, quantitative targets; (5) initialization of hospital-specific improvement activities; (6) data collection in a second measurement cycle; and (7) benchmarking of results. The normative framework of the project included the focusing of results, the orientation on the patient’s perspective, an interdisciplinary, cooperative approach in the development of indicators, and feasibility of measurement in routine care. All parties agreed that interpretation of measurement results and benchmarking should be guided by a culture of organizational learning rather than individual blame. The developed indicator set includes two main components: objective measures that evaluate clinical performance in terms of speed and accuracy of patient assessment, and patients’ experiences with care provided by EDs. In this contribution, we report the results of two measurement cycles, pooled across the 12 participating community hospitals. We analyze whether uniform indicator measurement and benchmarking followed by hospital-specific activities and changes in processes or structures of care resulted in improved clinical performance measures and patients’ reports.


    Patients and methods
 Top
 Patients and methods
 Results
 Discussion
 Contributors
 References
 
The Swiss emergency health care system
Patients in Switzerland have direct, unlimited access to primary care physicians in an ambulatory care setting and free access to outpatient specialist treatment unless they are insured with managed care organizations with gatekeeper systems (currently < 10% of the population). There are no barriers to hospital-based EDs and non-health related factors often affect decisions whether to seek care in an ED rather than in primary care. Especially in urban areas, and during nights or weekends, EDs are often used as substitutes for primary care physicians by patients. EDs in Switzerland are very diverse and have no uniform structure. The level of training of physicians working in EDs varies largely and depends mainly on the size of hospitals and internal policies. In larger hospitals, ED physicians are often specialists, certified in emergency or intensive care. In smaller hospitals, ED care is usually provided by residents without formal, postgraduate emergency care qualifications, closely supervised by higher-qualified physicians (attendings or seniors). Residents have usually attended qualified in-house training.

Indicators and measurement
Clinical performance in terms of speed and accuracy of patient assessment was assessed by seven indicators, of which six measure the time intervals between the sequences of events from admission to established diagnosis (Box 1). One indicator captures the accuracy of the first medical decision to prioritize patients according to urgency of care needs. ED physicians triaged patients in one of three urgency classes (‘emergent’, ‘urgent’, or ‘non-urgent’) in the first medical evaluation based on vital signs (and cardiogram if appropriate), visual assessment of clinical presentation, and suspected diagnosis. Patients were re-assessed retrospectively by more-qualified physicians after completed diagnostic work-up based upon all clinical information available at the end of emergency care, i.e. with documentation of post-ED disposition. The indicator reflects the concordance of initial and subsequent re-assessment, and the frequency of up- and down-grading (Box 1). No protocols or algorithms were used for the classification of urgency of care needs. However, hospital staff had been extensively trained in measurement in a series of workshops preceding both measurement cycles. For the assessment of urgency of care needs (triage), detailed lists of typical conditions and clinical presentations falling in the respective urgency of care need category were prepared and discussed by professionals of the participating hospitals in the workshops. For the assessment of patients’ experiences, a 22-item questionnaire was developed using qualitative methods in samples of former patients from the participating hospitals and hospital staff [2]. This questionnaire uses ‘report’-like questions rather than ratings of satisfaction. As others have noted, responses to such ‘report’-like questions are readily interpretable, and actions to be taken to improve quality can often be derived directly [3].


Box 1 Definitions of performance measures


Indicator Description

1. Over-, under-, and accurate prioritization of urgency of care needs
    Category I: highest priority (acute life-threatening situation, e.g. myocardial infarction, polytrauma) a. Rate of agreement between first and second assessments: proportion of patients classified in the same category at both assessments1
    Category II: high priority (acute severe condition, e.g. acute appendicitis, hip fracture) b. Rate of over-prioritization: proportion of patients downgraded from category I or category II in the initial assessment to categories II and III in the second assessment
    Category III: low priority (moderate to mild condition,e.g. superficial cut, rash) c. Rate of under-prioritization: proportion of patients upgraded from category II or category III in the initial assessment to categories I and II in the second assessment
2. Time interval from ED admission to initial medical assessment (assignation of course of diagnostic and therapeutic interventions)
3. Time interval from ED admission to second medical assessment and diagnosis (end of emergency care and decision about discharge/transfer destination)
4. Time interval from initial to second medical assessment
5. Time interval from ED admission to begin of nursing activities
6. Time interval from ED admission to history taking
7. Time interval from ED admission to provision of information to the patient about the diagnosis and subsequent treatment course
1 The re-assessment (i.e. second assessment) seeks a second retrospective appraisal of the initial situation (‘considering all the information available now, was the situation life-threatening?’) rather than an evaluation of the initial assessment (‘did the situation seem to be life threatening?’).

 

Patients and setting
Patients were recruited among all patients admitted to the emergency units of the study hospitals from 18th April to 15th May 2001 (first cycle) or from 3rd April to 1st May 2002 (second cycle), either by self-selection or referral by any health care facility. All in- and outpatients were eligible in case they were considered to have an ‘emergency care problem’, either from the professional’s or the patient’s perspective. EDs of 12 community hospitals in Switzerland participated, comprising small rural and large urban regional hospitals, and two university hospitals. The mean number of beds in these institutions is 334 (range 106–1060) and the mean number of emergency admissions in 2000 was 11 725 per hospital (range 3000–24 755).

Data collection
Demographic, clinical, and performance data were collected via specific data sheets by hospital staff: attending physicians, responsible for the diagnostic and therapeutic decisions, assigned acuity scores at presentation and at re-assessment. Physicians recorded information on time points relating to acuity assessment, diagnostic work-up, and treatment. Time points relating to administrative procedures, patient information and nursing care were recorded by nurses. Each hospital nominated a person to be in charge of ensuring and supervising correct data collection. The ‘Verein Outcome’ provided comprehensive additional support to ensure data quality. This included recurrent training in data collection for hospital staff before measurements, a manual describing the indicators and the data collection procedure, answering frequently asked questions, hotline support during measurement phases, and data controlling. Data sheets were completed at discharge and transferred to the Verein Outcome office. Patients’ evaluations of care provided by EDs were obtained via self-administered questionnaires dispensed by the attending carers at the end of ED stay (at discharge or during the 3 days following hospital admission). The procedure followed a standardized course that was described and explained in the measurement manuals and exercised in workshops. Patients (and parents of paediatric patients) received the questionnaire with a covering letter and a reply-paid envelope to complete either before discharge or at home, and were asked to return the completed questionnaire in the closed envelope to hospital staff or to post it to the specified neutral postbox address. Hospital staff were encouraged to remind patients of the questionnaire 12 h after hand-out, and were advised on appropriate communication and presentation of the questionnaire. In case patients felt unable to complete the questionnaires alone, they were encouraged to ask their accompanying persons, or, if necessary, hospital staff to help them. Questionnaires were available in German, Italian, and French. Foreign patients were offered support by interpreters. Questionnaires were confidential and quasi-anonymous (coded by code number), and responses were related to clinical data sheets by code number.

Processing and analyzing data
Random samples of transmitted data sheets were subject to systematic data controlling.

The initial 5% of datasets expected to be transferred by each hospital and subsequent weekly 5% random samples of each hospital’s transferred datasets were checked for eligibility, completeness, and a set of pre-defined plausibility tests, consecutively. These included checks for contradicting data (e.g. discharge of males to the department of obstetrics), checks for double information (e.g. more than one post-ED disposition noted), and plausibility of time measurements (e.g. whether the hour noted for admission was before the hour noted for the triage decision). In addition, the number of transferred cases was compared to the expected number of cases on an individual hospital level, based on historical data. Hospitals were informed weekly on the quality of data they provided. A commercial data processing company scanned data sheets and questionnaires, and returned data on file. Data sheets and questionnaires were then merged by code number.

Statistical analyses
Responders to the questionnaire were compared with non-responders in terms of several variables using unpaired t-tests and {chi}2 tests. These procedures were also applied to test for differences in performance data between the years. Differences in questionnaire responses between cycles were analyzed using Mann–Whitney tests. Differences in subsamples were tested using the {chi}2 or Fisher’s exact test in case of at least one frequency < 5. We also calculated a ‘problem score’ for survey data: for each question, responses were dichotomized as 0 (no problem) or 1 (a serious problem, represented by the least favorable response code) [4]. Patients that reported one or more problems, i.e. those who scored ‘1’ on any question, were considered to have experienced an ED admission ‘with a patient-evaluated problem’. We then compared the fraction of patients that reported a patient-evaluated problem between the years. Multiple logistic regression (forward selection) was used to adjust simultaneously for confounding factors. Confidence intervals (CIs) are reported on the 95% level. The level of statistical significance was set at P < 0.05. Data were analyzed with the statistical software Stata 8 [5].


    Results
 Top
 Patients and methods
 Results
 Discussion
 Contributors
 References
 
Performance data were obtained for 9174 and 9370 patients in the first and second measurement cycles, respectively. Characteristics of patients are provided in Table 1. Distributions of patients’ age and sex do not significantly differ between measurements. However, patients in the second measurement were less likely to be self-selected for emergency care, were more often transferred to a non-intensive care unit, and were less likely to be transferred to immediate surgery. Also, the distribution of conditions and affected organ-systems differs slightly between measurements. An analysis of the diagnostic interventions and tests ordered in 2001 and 2002 shows that no general increase in procedures can be observed. Patients in 2002 were slightly more likely to have magnetic resonance imaging compared with 2001 (0.7% and 0.4%, respectively; P = 0.006), and less likely to have echocardiography (1.9% and 2.4%, respectively; P = 0.047). There were no differences in the utilization of X-ray, laboratory, sonography, duplexsonography, endoscopy, computer tomography, interventional radiology, or consultation of specialists. A significant shift in the qualification of staff establishing the first preliminary diagnosis could be observed with fewer interns and residents, and more fellows, attendings, and seniors performing the first assessment in 2002 compared with 2001. This effect was even more pronounced for qualification of professionals in charge of the second assessment, indicating a change in hospitals’ practice styles. Based on the results of the first measurement cycle, subsequent benchmarking, and process analyses, a number of improvement activities were initiated in individual hospitals covering a wide range of targets, from investment in ED structures to professional education and organization of care (Box 2).


View this table:
[in this window]
[in a new window]
 
Table 1 Patient characteristics by measurement cycle

 


Box 2 Examples of initiated improvement activities


Target: professional education
    Systematic advanced training of residents in emergency care
    Systematic introductory training in ED care for newly entering residents
    Installation of regular interdisciplinary training on ED-specific topics, e.g. management of anxiety and pain
    Enhanced clinical supervision of residents
Target: process development
    Implementation of guidelines for the assessment and treatment of certain clinical conditions/clinical situations
    Process re-engineering to increase speed of triage and reduction of waiting times
    Introduction of pain protocols and dolometers
    Installation of a risk management and error detection system
Target: communication
    Training in appropriate communication to improve manner, diction, and timing when communicating with ED patients
    Development of internal conventions for the avoidance of X-ray evaluation and teaching while patients present
    Development of internal conventions regarding explicit nomination of professionals in charge of the information on patients with unclear diagnoses
    Development of internal conventions on when, how, and by whom patients are informed about whether and when they are allowed to eat or drink
    Development of a communication concept regarding waiting times
Target: human resources and organizational development
    Change in practice style: all surgical/trauma patients must be evaluated by the surgical department’s chairperson
    Development of guidelines for time intervals between alerting medical staff and arrival at the ED
    Recruitment of additional staff
Target: buildings and investments
    Investment in new beds, cubicles, and curtains
    Establishment and equipment of dedicated triage rooms
    Acquisition of snack bars and drink dispensers

 

Clinical performance measures
Small, but significant improvements were achieved in all performance measures (Table 2). Concordance of first and second assignment to one of three urgency categories improved by 1%. Both under- and over-prioritization were reduced. After adjusting for confounders (age, sex, admission mode, underlying condition, qualifications of physicians assessing and re-assessing acuity, post-ED disposition) via multiple logistic regression, the odds ratio for any misclassification in 2002 compared with 2001 was 0.82 (P < 0.001; CI 0.73–0.91). The adjusted odds ratio for under-prioritization, i.e. upgrading of urgency of care needs in the second assessment, in 2002 compared with 2001 is 0.62 (P < 0.001; CI 0.49–0.79). The major source of improvement is an increase in appropriate classifications of ‘urgent’ patients (90.6% in 2001 compared with 93.1% in 2002; P < 0.001). Correct identification of ‘emergent’ patients with the first assessment increased from 83.4% in 2001 to 85.1% in 2002 (P = 0.558). Retrospective classification of acuity (second assessment) was strongly associated with post-ED disposition (P < 0.001). The hospitalization rate ranged from 94.4% in emergent, to 79.2% in urgent and 23.9% in non-urgent patients.


View this table:
[in this window]
[in a new window]
 
Table 2 Changes in performance measures between two measurements

 

All time intervals between the sequences of events from admission to established diagnosis decreased significantly, whereas subsequent intervals from diagnosis to documentation of post-ED disposition could not be shortened. In sum, the median duration between ED admission and documentation of post-ED disposition fell from 137 minutes in 2001 to 130 minutes in 2002 (P = 0.002). Again, patients that were evaluated as ‘urgent’ in the preliminary assessment profited most, with a decrease in median total time from ED admission to documentation of post-ED disposition from 190 minutes in 2001 to 178 minutes in 2002 (P < 0.001). An analysis of means, medians, and distribution of time intervals shows that hospitals achieved these improvements mainly by cutting down intervals at the upper extremes (outliers).

Results of questionnaires
Questionnaires were obtained from 2916 patients in the first measurement, and from 3370 patients in the second (response rates 32% and 36%, respectively; P < 0.001). Comparison of responders and non-responders reveals that responders were slightly older (49 versus 47 years; P < 0.001), less likely to be self-selected for ED admission (48.5% versus 52.5%; P < 0.001) or transferred by ambulance or rescue services (11.0% versus 13.2%; P < 0.001), and were more likely to be classified as ‘urgent’ (28.6% versus 26.1%; P < 0.001). Responders were also less likely to receive ambulatory treatment (55.6% versus 60.4%; P < 0.001) and more likely to be transferred to immediate surgery (6.2% versus 4.9%; P < 0.001) or non-intensive care wards (33.9% versus 28.7%; P < 0.001). Relative differences between responders and non-responders did not change between the years. There were no differences in response mode (self-completed versus other options) between questionnaires returned in the first and second cycle. Detailed responses to the questionnaire are provided in Tables 36. Significant improvements in the reports provided by patients could be achieved for 10 items. Improvements were mainly demonstrated in reports relating to structures of care provision and perceived humanity. The rate of patients with a problem score of 1 (i.e. reporting any ED problem) decreased from 41.3% in 2001 to 36.2% after the benchmarking intervention (CI of the difference 3–7%; P < 0.001), a relative decrease of 14%. After adjusting for age, sex, admission mode, final urgency classification, diagnosis, post-ED disposition, and survey response mode, the odds ratio of reporting any problem in 2002 versus 2001 was 0.81 (CI 0.72–0.90; P < 0.001). An analysis of the distribution of problem scores on the individual patient level reveals that not only the fraction of responders that experienced any ED problem, but also the number of problems reported per patient, decreased. While 12% experienced more than three serious ED-related problems in 2001, this fraction decreased to 9% in 2002 (P = 0.012).


View this table:
[in this window]
[in a new window]
 
Table 3 Patient survey responses related to their perceptions of safety and staff technical skills [n (2001) = 2916; n (2002) = 3370]1

 

View this table:
[in this window]
[in a new window]
 
Table 6 Patient survey responses related to care and support [n (2001) = 2916; n (2002) = 3370]1

 


View this table:
[in this window]
[in a new window]
 
Table 4 Patient survey responses related to aspects of the ED care structure [n (2001) = 2916; n (2002) = 3370]1

 


View this table:
[in this window]
[in a new window]
 
Table 5 Patient survey responses related to communication [n (2001) = 2916; n (2002) = 3370]1

 

There are major differences in the experiences with ED care according to initial urgency classification on the content level. Calculated over both measurement cycles, the top three items emergent patients were most likely to report negative experiences with were, in descending order, contradictions between different staff members (item 7), ED rooms (item 1), and information about diagnosis (item 8). Urgent and non-urgent patients were most likely to report negative experiences with preservation of privacy (item 5), opportunities for eating and drinking (item 6), and waiting times (item 19). Improvements in patients’ experiences and perceptions of care were mainly achieved in patients initially evaluated as ‘non-urgent’. The fraction reporting one or more ED problems decreased from 42.3% (2001) to 36.9% (2002) in ‘non-urgent’ patients (P < 0.001), from 40.0% to 35.7% in ‘urgent’ patients (P = 0.043), and from 28.9% to 28.6% in ‘emergent’ patients (P = 0.962).

Although on the aggregate level decreases in average actual waiting times coincide with improvements in subjectively perceived waiting times (item 19) and information about the causes underlying delays (item 21), actual waiting times are only moderately correlated with patients’ perceptions of waiting times on the individual patient level (Spearman’s rho {rho} = 0.26; P < 0.001).


    Discussion
 Top
 Patients and methods
 Results
 Discussion
 Contributors
 References
 
In this study, undertaken in a real-world setting, small but significant improvements in performance measures and patients’ perceptions of emergency care could be achieved by uniform outcomes measurement, group benchmarking, and hospital-specific strategies for change. In contrast to other studies that have evaluated frequency of diagnostic errors or concordance of ED and discharge diagnoses, we assessed identification of patients’ urgency of care needs as an indicator of ED performance before the establishment of diagnosis [68]. In terms of patient safety in fast-track areas, the latter may be more important. Participating hospitals used data for misclassified patients and ultimate clinical outcome in grand rounds and internal supervision as a tool for continuous learning and risk management. However, one has to consider that not all disconcordances between initial and re-assessment are due to ‘errant’ triage decisions. It is instead one of the key characteristics of emergency care to prioritize patients at high risk for life-threatening conditions that have not yet been confirmed. The fact that frequencies of both up- and down-grading decreased is promising. It indicates that hospitals did in sum not increase safety at the cost of efficiency by systematically shifting initial assessments towards higher urgency classes. Results of measurement of time intervals and questionnaires imply that improvements in speed and patients’ reports were mainly achieved by averting strong, negative outliers and series of negative experiences, and by targeting concentrated deficits.

As others, we observed an association between urgency of care needs and patients’ experiences, with non-urgent patients more frequently providing negative reports of care provision [9,10]. While these patients also benefited most in terms of patient satisfaction, improvements in performance measures were primarily achieved in patients initially evaluated as urgent. We found little effects on care provided to emergent patients and the problems reported by these patients. One has to take into account, however, that the problem score does, by definition, only capture changes in responses that affect the least favorable response code, and systematic improvements within the other answer categories are not detected. In addition, equal importance is attached to all questions. It is a normative decision which response codes should be evaluated as being ‘problematic’. Although only responses in the least favorable category were used to calculate problem scores in this study, all three categories indicating ‘non-optimal’ experiences are used for benchmarking. This approach is rooted in the understanding that every non-optimal experience offers the potential for improvement. Conversely, to increase discriminatory power and to identify subpopulations at increased risk for serious ED experiences, we report the problem score as a binary measure contrasting the least favorable response code.

Consistent with the literature, this study also provides further evidence that actual waiting times and patients’ perceptions of waiting times are distinct concepts. As others have noted, perceptions of waiting times and satisfaction with waiting times may be strongly influenced by expected waiting times, expressive quality, and provision of information about the causes underlying delays, rather than being determined by actual waiting times [1114]. Participating hospitals achieved improvements in patients’ subjective evaluation of waiting times with systematic changes in ED information and communication patterns.

The study also has some limitations: the major disadvantage is that the reliability and validity of the clinical performance measures, in particular of triage decisions, were not assessed. A number of studies have shown triage decisions based on three-tier triage scales to suffer from poor interrater reliability [1519]. As we did not study the final outcomes of ED admissions, one may question the clinical significance of triage decisions as well as the accuracy of the retrospective second assessment. Although we cannot provide direct evidence, there are a number of reasons that suggest the reliability and validity of the indicators to be higher than the figures reported elsewhere. The initial triage decision at ED admission determines the prioritization of patients, and the urgency and speed at which further procedures are to be undertaken. The identification of these needs and the avoidance of delays are of vital importance in a number of conditions frequently seen in EDs (e.g. initiation of thrombolysis in acute myocardial infarction, or prevention of perforation in acute appendicitis). The strong association we found between triage status and post-ED disposition can be considered as an indirect indication of its construct validity. The retrospective assessment was performed by more qualified and experienced physicians, based on all relevant clinical information and after the post-ED disposition has been documented. It was therefore well accepted in the project that the second assessment would accurately reflect the patient’s ‘true’ state in the vast majority of cases.

Reliability studies have usually used paper cases (vignettes) to measure triage performance and have commonly excluded ‘typical ED patients’, patients with life-threatening diseases or comorbidities, all of which are included in our study. As opposed to other studies, characteristics of the initial triage process as applied in our study, namely visual assessment by physicians, clinical judgment, and suspected diagnosis based on knowledge of vital signs, have been shown to increase accuracy in terms of post-ED disposition prediction, especially in patients from vulnerable populations (the very young and the elderly) [2023]. Aimed at developing a common understanding of the acuity scale, hospital staff participated in intensive measurement workshops and indicator discussions, were trained in the application of the scale, and consented on detailed lists of typical conditions. Because this stimulated awareness and sensibility, it can itself be regarded as an intervention that increases reliability. Finally, the relative frequency of assigned triage scores per hospital remained stable between the years (data not shown). With poor reliability, one would expect large variability in the fraction of patients allocated to each of the urgency classes by hospitals in the two measurement cycles.

Another limitation of performance measurement relates to the qualification of professionals that conducted the re-assessment of urgency of care needs. The shift towards higher qualifications in the second measurement cycle may have introduced a detection bias, with more misclassifications detected in the second cycle. This in turn would have artificially decreased the magnitude of improvements achieved in accurate initial assessments.

A more general concern is that surveys applied to ED patients directly after the care experience suffer from bias towards less injured patients, as the severely ill are less likely to respond. We also noted some response bias, with patients admitted via external physicians, evaluated as ‘urgent’, and those discharged to in-hospital wards being more likely to respond. Although a number of studies have reported comparatively low or even poorer rates, the response rates are dissatisfying and make the interpretation of results difficult [10,24,25]. Finally, we could not relate improvement activities to performance measures and patients’ experiences on the individual hospital level. Based on project conventions, hospital-specific data are not disclosed to the public. Concerns about league tabling, i.e. the public release of rankings of institutions based on single performance measures with the effect of ‘naming and shaming’, would inhibit motivation and set incentives to develop evading strategies, thereby resulting in poor data quality and endangering any culture of continuous learning and improvement [26,27]. The disadvantage is, however, that failures and successes of specific improvement strategies cannot be presented and their impact on results is averaged.

In summary, some improvements in ED performance and patients’ experiences could be achieved by participating hospitals in a design of data-driven quality improvement activities with recurrent plan–do–check–act cycles. A number of studies have reported on activities to improve quality of ED care and patient satisfaction, and several interventions have been suggested to improve quality of care, mainly in the field of professional education, staffing, and management of patient flow [4,2832]. The key differences in our approach are that hospital staff are actively involved not only in benchmarking of results but also in measurement, and that clinical performance measures as well as patient-reported experiences serve as quality indicators. As noted by the authors of the Harvard Emergency Department Quality Study, which used a 2-year intervention period in a comparable design, improvements achieved per cycle may only be moderate with rapid cycling [4]. However, we chose to provide hospitals with timely feedback on the results of the actions taken to allow for adjustment of effective strategies and to stimulate institutional learning. Current achievements should therefore not be regarded as final. Several participating hospitals have already implemented the developed measures in their continuous quality management systems and subsequent measurements are projected. Future cycles and analyses for trend will elucidate whether this approach also proves to be successful in the long-term.


    Contributors
 Top
 Patients and methods
 Results
 Discussion
 Contributors
 References
 
M.A.H. and H.E. developed the principal design of the ‘Emerge’ project. M.A.H., D.C., K.E., A.B., and H.E. participated in the development of performance measures. A.B. and K.E. developed the measurement system, supervised measurements and data collection, and provided support activities to hospitals. H.E. and D.C. served as methodological consultants during the study period. D.L.B.S. conceived and designed the methodology and conception, analyzed and interpreted the data, and wrote up the paper. A.B., K.E., and M.A.H. contributed to the conception of the study. Each author critically discussed and reviewed the final paper. D.L.B.S. is guarantor.

The authors express their gratitude and respect to the institutions that participated in the ‘Emerge’ project and provided the data for this study. The invaluable commitment of countless individuals who collected the data, managed measurements, and contributed to discussion of processes and results in addition to their regular activities is greatly appreciated. They are represented here by the contributing institutions and the professionals in charge of outcome measurement: Spital Aarberg (Dr T. Ritschard, M. Friedrich, Dr C. Klaiber); Universitätsspital Kantonsspital Basel (Dr B. Martina, R. Herold, J. Wuhrmann); Spital Bülach (Dr M. Osusky, Dr C. Lauber); Kantonsspital Chur (Dr T. Wieland, R. Lorez, Dr A. Frutiger); Klinik Hirslanden (R. Bartczak, Dr F. Horber, M. Rauber); Universitätsspital Inselspital (Dr F. Nohl, M. Jost, Prof. H. Gerber, Dr M. Hess); Spital Interlaken (R. Enezian, Dr M. Studer, Dr H. Schaad); Spital Lachen (G. Kurpjuhn, Dr I. Guldenschuh, Dr A. Pellegrino, Dr L. Stäger); Ospedale Regionale la Carità Locarno (Dr L. Gabutti, Dr S. Schlunke, L. Merlini); Stadtspital Waid (S. Vetter, M. Herzog, E. Seiler, Prof. H. Bühler); Kantonsspital fiinterthur (Dr K. Käch, U. Müller, Dr R. Imoberdorf); Spital Zollikerberg (A. Keller, Dr R. Spalinger, Dr P. Guyer). We also thank patients for taking the time to complete the survey and sharing their experiences of ED care. Finally, we wish to acknowledge the valuable comments of two anonymous referees.


    Footnotes
 
Address reprint requests to David L. B. Schwappach, Department of Health Policy and Management, Faculty of Medicine, University Witten/Herdecke, Alfred-Herrhausen Strasse 50, 58448 Witten, Germany. E-mail: davids{at}uni-wh.de


    References
 Top
 Patients and methods
 Results
 Discussion
 Contributors
 References
 

  1. Schwappach DLB, Hochreutener M-A, Conen D, Eichler K, Blaudszun A, Koeck CM. Outcome measurement in the Canton of Zurich. From theory to practice—progress report on six years of development, implementation and experience with routine inpatient outcome measurement. Muri: SGGP (Swiss Society for Health Policy), 2003.

  2. Rehm J, Blaudszun A, Frick U et al. The development of a questionnaire to assess patient satisfaction with emergency units of Swiss hospitals. Soz Praventivmed 2003, in press.

  3. Sixma HJ, Kerssens JJ, Campen CV, Peters L. Quality of care from the patients’ perspective: from theoretical concept to a new measuring instrument. Health Expect 1998; 1: 82–95.[CrossRef][Medline]

  4. Burstin HR, Conn A, Setnik G et al. Benchmarking and quality improvement: the Harvard Emergency Department Quality Study. Am J Med 1999; 107: 437–449.[CrossRef][Web of Science][Medline]

  5. StataCorp. Stata Statistical Software: Release 8.0. College Station, TX: Stata Corporation, 2003.

  6. Guly HR. Diagnostic errors in an accident and emergency department. Emerg Med J 2001; 18: 263–269.[Abstract/Free Full Text]

  7. Wogan JM. ED follow-up: a comparison of admission and discharge diagnoses. Am J Emerg Med 2001; 19: 249–251.[CrossRef][Medline]

  8. Chellis M, Olson J, Augustine J, Hamilton G. Evaluation of missed diagnoses for patients admitted from the emergency department. Acad Emerg Med 2001; 8: 125–130.[Medline]

  9. Hansagi H, Carlsson B, Brismar B. The urgency of care need and patient satisfaction at a hospital emergency department. Health Care Manage Rev 1992; 17: 71–75.[Medline]

  10. Trout A, Magnusson AR, Hedges JR. Patient satisfaction investigations and the emergency department: what does the literature say? Acad Emerg Med 2000; 7: 695–709.[Web of Science][Medline]

  11. Thompson DA, Yarnold PR. Relating patient satisfaction to waiting time perceptions and expectations: the disconfirmation paradigm. Acad Emerg Med 1995; 2: 1057–1062.[Web of Science][Medline]

  12. Thompson DA, Yarnold PR, Williams DR, Adams SL. Effects of actual waiting time, perceived waiting time, information delivery, and expressive quality on patient satisfaction in the emergency department. Ann Emerg Med 1996; 28: 657–665.[CrossRef][Web of Science][Medline]

  13. Yarnold PR, Michelson EA, Thompson DA, Adams SL. Predicting patient satisfaction: a study of two emergency departments. J Behav Med 1998; 21: 545–563.[CrossRef][Web of Science][Medline]

  14. Naumann S, Miles JA. Managing waiting patients’ perceptions: the role of process control. J Manag Med 2001; 15: 376–386.[CrossRef][Medline]

  15. Gill JM, Reese CL, Diamond JJ. Disagreement among health care professionals about the urgent care needs of emergency department patients. Ann Emerg Med 1996; 28: 474–479.[CrossRef][Web of Science][Medline]

  16. Nakagawa J, Ouk S, Schwartz B, Schriger DL. Interobserver agreement in emergency department triage. Ann Emerg Med 2003; 41: 191–195.[Medline]

  17. Reilly BM, Evans AT, Schaider JJ, Wang Y. Triage of patients with chest pain in the emergency department: a comparative study of physicians’ decisions. Am J Med 2002; 112: 95–103.[CrossRef][Web of Science][Medline]

  18. Wuerz R, Fernandes CM, Alarcon J. Inconsistency of emergency department triage. Emergency Department Operations Research Working Group. Ann Emerg Med 1998; 32: 431–435.[CrossRef][Web of Science][Medline]

  19. Travers DA, Waller AE, Bowling JM, Flowers D, Tintinalli J. Five-level triage system more effective than three-level in tertiary emergency department. J Emerg Nurs 2002; 28: 395–400.[CrossRef][Medline]

  20. Brillman JC, Doezema D, Tandberg D, Sklar DP, Skipper BJ. Does a physician visual assessment change triage? Am J Emerg Med 1997; 15: 29–33.[CrossRef][Medline]

  21. Cooper RJ, Schriger DL, Flaherty HL, Lin EJ, Hubbell KA. Effect of vital signs on triage decisions. Ann Emerg Med 2002; 39: 223–232.[CrossRef][Medline]

  22. Schoenenberger RA, Conzelmann M, Dubach UC, Schwander J. Quality of emergency room triage of medical inpatients to an acute care clinic or chronic health care facilities. J Gen Intern Med 1992; 7: 321–327.[Medline]

  23. Waldrop RD, Harper DE, Mandry C. Prospective assessment of triage in an urban emergency department. South Med J 1997; 90: 1208–1212.[Medline]

  24. Boudreaux ED, Ary RD, Mandry CV, McCabe B. Determinants of patient satisfaction in a large, municipal ED: the role of demographic variables, visit characteristics, and patient perceptions. Am J Emerg Med 2000; 18: 394–400.[CrossRef][Web of Science][Medline]

  25. Sun BC, Adams JG, Burstin HR. Validating a model of patient satisfaction with emergency care. Ann Emerg Med 2001; 38: 527–532.[CrossRef][Medline]

  26. Thomson RG, McElroy H, Kazandjian VA. Maryland Hospital Quality indicator project in the United Kingdom: an approach for promoting continuous quality improvement. Qual Health Care 1997; 6: 49–55.[Free Full Text]

  27. Shojania KG, Showstack J, Wachter RM. Assessing hospital quality: a review for clinicians. Eff Clin Pract 2001; 4: 82–90.[Medline]

  28. Cobelas C, Cooper C, Ell M, Hawthorne G, Kennedy M, Leach D. Quality management and the Emergency Services Enhancement Program. J Qual Clin Pract 2001; 21: 80–85.[Medline]

  29. Fernandes CM, Christenson JM. Use of continuous quality improvement to facilitate patient flow through the triage and fast-track areas of an emergency department. J Emerg Med 1995; 13: 847–855.[CrossRef][Medline]

  30. Hall MF, Press I. Keys to patient satisfaction in the emergency department: results of a multiple facility study. Hosp Health Serv Admin 1996; 41: 515–532.[Web of Science][Medline]

  31. Jelinek GA, Mountain D, O’Brien D et al. Re-engineering an Australian emergency department: can we measure success? J Qual Clin Pract 1999; 19: 133–138.[CrossRef][Medline]

  32. Wallis LA, Guly HR. Improving care in accident and emergency departments. Br Med J 2001; 323: 39–42.[Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (4)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Schwappach, D. L. B.
Right arrow Articles by Hochreutener, M.-A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Schwappach, D. L. B.
Right arrow Articles by Hochreutener, M.-A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?