|
|
||||||||
Eur J Cardiothorac Surg 2004;25:312-319
© 2004 Elsevier Science NL
a Clinic for Cardiothoracic Surgery, Heart Institute Lahr/Baden, Lahr 77933, Germany
b Institute of Neuroinformatics, University of Bielefeld, Bielefeld, Germany
Received 19 October 2003; received in revised form 10 December 2003; accepted 15 December 2003.
* Corresponding author. Tel.: +49-7821-925-367; fax: +49-7821-925-253
e-mail: alexander.albert{at}heart-lahr.com
| Abstract |
|---|
|
|
|---|
CONCLUSIONS: On-line VLADs based on a day-to-day updated database, displaying both internal and external standards, are a helpful visualisation tool for earlier detection of unfavourable trends. They enable the surgeon teams and clinical management to take countermeasures at an early stage.
Key Words: Variable live-adjusted displays Risk adjustment EuroSCORE Cumulative sum method Data warehouse
| 1. Introduction |
|---|
|
|
|---|
| 2. Method |
|---|
|
|
|---|
2.1. Data warehouse/data mart
All heart operations performed at the Heart Institute Lahr/Baden from January 1996 on were analysed continuously. Data for quality assessment were collected prospectively on all patients as part of a national quality assessment trial of cardiothoracic surgery (Quality Assurance Data Review Analysis, Quadra) and cardio-anaesthesia using standardised protocols of the German Society of Thoracic and Cardiovascular Surgery and Intensive Care Medicine [9,10]. A technical assistant for data collection and medical documentation controlled the data collection and tested the inter-rater reliability. Preoperatively, the data is fed into three independently operating commercial hospital information systems (Medwork, Datapec, Clinicom). From there, our data warehouse extracts, transforms, and consolidates the data in order to build up a large, comprehensive database, which appears to the investigator as a single data source. Mortality within 30 days was meticulously recorded with a completeness of more than 99% using a specific procedure based on questionnaires and phone calls to the patients after discharge.
2.2. Calculation of VLADs
The 18 EuroSCORE risk variables could be continuously determined from our data warehouse. In order to determine the estimated percentage of death, the items and additive weights were derived from the simple additive EuroSCORE model, which uses only dichotomous or ordinal data [11]. The second, centre-specific risk score was calibrated to the locally observed mortality by rescaling the data of 9135 patients from 1st January 1996 to 1st April 2002, such that the sum of the expected mortality (EM) of all individuals equals the total number of observed mortalities (OM). For all patients the EM was in the range of 0.0020.1 [12]. VLADs were created by continuous display of operative results. In the case of a successful operation, the curve goes up by the value of EM for this patient; if the patient dies, the curve decreases by the value of 1-EM. If the patient was at a high risk of perioperative death, the surgeon's mortality figures are not unduly penalised but mortality figures are penalised when a low-risk patient dies. The results for every single patient are cumulated and altogether display the performance. The expected cumulative mortality minus the actual cumulative mortality is called net live saved.
2.3. Web-based information portal
In our institution, VLADs are part of a multi-purpose intranet information portal, made available to all physicians via intranet. It provides various statistics on operative results and allows the export of various anonymous patient data for all authorised physicians. VLADs can be selected for several operating types, time intervals and/or individual surgeons. The access to VLADs displaying individual surgeon's performance is limited to the surgeon himself and specially authorised persons. Our VLAD plot simultaneously displays curves, which are based on predictive mortalities derived from the EuroSCORE publications and curves after recalibration. VLADs as well as all statistics presented at the web-based information portal are derived directly from the data warehouse and are updated automatically.
We established weekly meetings of our physicians for a joint analysis of adverse outcomes. VLADs were correlated to clinical events and factors possibly influencing clinical practice and contributing to adverse events. They are interpreted regularly by the staff. From this study, we present examples that should be of general interest.
| 3. Results |
|---|
|
|
|---|
|
|
|
|
|
|
|
| 4. Discussion |
|---|
|
|
|---|
For example, the annually performed risk-adjusted statistics could not demonstrate any difference in 30-day mortality between OPCAB versus on-pump. However, motivated by the results of the VLAD curves, we performed retrospective studies of the individual patients, which revealed an increased rate of perioperative myocardial infarction due to bypass graft thrombosis and incomplete revascularisation. This finding supports the impression given by the VLADs, which attributes the cluster of adverse events after starting with OPCAB not to the risk of the patients but to the learning ability of the surgeons. At that time, various surgeons performed OPCAB surgery primarily in high-risk patients (>80 years, severely decreased pulmonary function, preoperative stroke, etc.), so that complications were interpreted individually rather than systemically as an effect of a learning curve in OPCAB surgery. Later, when OPCAB surgery was limited to three surgeons who specialised in this, performance increased. Probably a continuous monitoring of performance by VLADs at that time would have detected the learning curve earlier. Another observation concerning a learning curve in OPCAB was made using non-risk-adjusted CUSUM analysis [13].
The training of cardiac surgeons showed similar patterns of performance. When considering annual statistics we could not identify any difference in outcome between trainees, registrars, and consultants. This is in line with studies analysing the effect of surgical training on outcome in cardiac surgery [1416], which could not prove any detrimental effect on patient outcome in operations performed by trainees. However, the observed clusters of fatalities at the beginning of the training and later after 200600 operations were identified in several surgeons. This corresponds with the original report studying VLADs in individual surgeons, which found clusters at the beginning of training in two surgeons, and a later cluster after 250350 cases in one surgeon [5]. One report applying CUSUM retrospectively to display the 10-year performance of one surgeon also identified a cluster during the first year of training [17]. Technical error, inexperience and poor judgement are readily reflected in operative mortality in cardiac surgery, conditions which are probably involved in the emergence of these first clusters.
The decline in performance after 300600 operations we observed in our trainees most likely reflects a drop in supervision, greater optimism and overestimation of the trainee's abilities as far as difficult cases were concerned. Our observation led to an increased supervision of trainees as well as of apparently more experienced colleagues.
The third example reveals a decrease in overall performance of the cardiosurgical department in the year 2002. VLADs suggest that the increase in mortality is not only a consequence of the higher risk of the patients but of other contributory factors. In the year 2002, our facility, especially the ICU-team, experienced significant and relevant changes in staff and staff-related management. Measures were taken to restructure the ICU staff, and to improve organisational and management changes, which resulted immediately in regaining good overall performance.
What is strongly recommended for all situations when information is drawn from clinical databases [18] is the participation of the physicians in the process of data acquisition and analysis. This applies in particular to the VLADs. In order to avoid a misuse of incomplete or inaccurate data we built a data warehouse, which allows the inspection and verification of the whole data integration process. Additionally, case-based verification tools were developed and plausibility checks are performed automatically. The power of the VLADs depends on the model of EM and therefore, on the choice of the underlying risk score. In the first instance, this concerns the type of risk model. Studies applying external risk scores to centre-specific populations reported a wide range of areas under the ROC curve (0.67 [19] and 0.831 [20]).
Secondly, this concerns the local adaptation or recalibration of risk scores. The human visual system can detect subtle absolute slope trends around the horizontal line much better than higher slope trends, e.g. -1°/+1° versus 20°/22° elevation angle. Therefore, we recalibrated the EuroSCORE expected mortalities to fit the observed centre-specific mortalities. This allows for easier visual recognition of the periods of performance changes.
Despite these refinements, we still observe inaccuracies of some VLADs, such as an overall decline in performance in valve operations in comparison to bypass operations. Thus, for an additional improvement in VLADs it would appear advantageous to fit the individual weightings of the EuroSCORE to the population specific for our facility or to alter the EuroSCORE parameter mix. For example, we could recently show how renal function can be better assessed by replacing the variable creatinine by the creatinine clearance, which significantly increased the area under the ROC curve [12].
Optimal adjustment for the clinic specific population requires the development of a risk model beyond the EuroSCORE and another time window of the risk score outcome than 30-day mortality. It was demonstrated that the hazard of dying after CABG induced by the preprocedural condition and the procedure itself continues to exist for 2 months beyond the date of the procedure [21]. It was confirmed that the risk incurred by the highest risk patients is likely to be underestimated by 30-day mortality [22]. The suggestion was made to postpone the endpoint of 30-day mortality to 1 year post-operative for a case mix [23]. In this context, it must be pointed out that precise comparisons between therapies and appropriate medical decision making requires follow-up intervals as long as possible as well as more sophisticated data exploration and analysis [24]. The more uniform the patient population is, the more feasible is the application of more complex, time-related analysis for continuous monitoring of surgical results. Furthermore, it has to be considered that higher the costs and the more special parameters included in the scoring model the less suitable it is for interclinic benchmarking [23]. VLADs regain the biggest benefit in a large case mix. The generation of VLADs with both external and internal standards enables us to apply the more complex techniques of risk adjustment to the internal standard and to maintain a simple score like the EuroSCORE as a common language.
In order to specify how much variation in the plot is expected under good surgical performance, and hence a deviation from the expected, should be a cause for concern, prediction intervals were incorporated into the VLADs [7,25]. The normal distribution of mortality rates within a sizeable sequence of operations was used to construct nested prediction intervals [7]. The prediction intervals were colour coded and superimposed onto the VLAD chart to indicate increasing departures from EM. Ref. [25] designed CUSUM graphs which signals if sufficient evidence has accumulated to indicate that the surgical failure rate has substantially changed. The run length until the CUSUM signifies the deterioration in performance and the discriminative ability of the test depends on the choice of the control limits, which can be varied according to the medical context and the heterogeneity of the case mix. Other investigators do without prediction intervals [6,8]. They stress that the interpretation of VLADs requires the consideration of other qualitative trends and obvious changes in patient management is more important. Furthermore, a failure to act on the qualitative trend observed until a significant P value has been achieved could result in the avoidable loss of lives.
| 5. Summary |
|---|
|
|
|---|
On-line VLADs based on a day-to-day updated database are a helpful visualisation tool for earlier detection of unfavourable trends, enabling the surgeon teams and clinical management to take countermeasures at an early stage.
| Footnotes |
|---|
| Appendix A. Conference discussion |
|---|
|
|
|---|
Dr Albert: We used the simple additive EuroSCORE without continuous data, which we considered sufficient as external standard. But we performed recalibration of the EuroSCORE for a more accurate internal standard by fitting the expected mortalities of the EuroSCORE to our observed mortalities. Therefore the internal standard demonstrates better our results. So I suggest to go even further steps beyond the EuroSCORE to develop a clinical score.
Dr Laufer: But what was your mean EuroSCORE, for example, for the year 2002?
Dr Albert: I don't have the data now.
Dr Laufer: Because that reflects a little bit the risk profile of your patients.
Dr Albert: Yes, but it is calculated together with all VLADs and included automatically within them. We did not present these numbers, from which expected mortalities of VLADs are calculated, because we preferred here to show graphics rather than tables.
Dr B. Buxton (Heidelberg, Vic., Australia): We also use the VLAD system for monitoring early changes. I wonder whether you have looked at adapting this to other outcome measures such as wound infection or stroke, these creep up on you sometimes more subtly than you think, and I wonder whether that would also be a good outcome to measure using this technique?
Dr Albert: Excuse me?
Dr Buxton: Have you adapted this system to the assessment of stroke or perhaps, more importantly, wound infections?
Dr Albert: Yes, we looked for some complications, trying to find a correlation with the rise of mortality. The results are not yet clear so I didn't like to present them here. Often they go parallel to the mortality data sometimes they arise earlier. We used as evidence of complications the length of stay in an intensive care unit more than 7 days, we looked for the stroke rate, bleeding and for postoperative CK-MB greater than 50.
Dr G. Rizzoli (Padova, Italy): To me this is just another quality control chart.
Dr Albert: Yes, it is.
Dr Rizzoli: I really don't understand how you can attribute to a single surgeon results that depend from the entire process of patient's care. We know that patients die not only because of the surgeon's performance in the operative room, but also because of the standard of care of the environment. So, did you do a multivariate analysis to isolate if the single surgeon was an incremental risk factor or not?
Dr Albert: As you know, the surgeon has a great impact on the outcome of the patients, and his responsibility goes further beyond the operating room. I showed in the last example that it is not always the surgeon but there were a lot of contributory factors, it is the team and the staff on the ICU which contributes to the outcome, and this becomes clear in our last examples presented here.
Dr T. Carrel (Berne, Switzerland): Can you tell us a little bit what kind of consequences you draw from your examination and how fast are you able to react when you see a cluster coming in, if you have an on-line analysis?
Dr Albert: It is not a method for reglementation or punishment. As I told you every surgeon can view his own VLADs, the overall performance of the clinic and the performance concerning different types of surgery. Mostly the interpretation of the VLADs is really easy and it is easy to get a consensus because they correlate with known changes in patient care. Every surgeon or member of the team has to think about what he can change. I showed you in the last example, we employed more qualified staff on the ICU and we discussed other problems.
Dr Laufer: Do you know if you are allowed or the chief of the institution in Germany is allowed to analyse the data for an individual surgeon in this manner or does he need the permission from the individual surgeon? It is the case in Austria according to the data safety law or daten schutzgesetz as we call it in Austria.
Dr Albert: Yes, from the beginning we discussed this issue and all surgeons agreed that they could view their own VLADs, not the VLADs of other colleagues, but the chief can view all. But I told you, it is not a system for punishment.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. M. Bernal, J. Lorca, D. Prieto-Salceda, I. Pulitani, A. Ponton, I. Garcia, and J. M. Revuelta Performance at 10 years of the CarboMedics 'Top-Hat' valve. Postclamping time is a predictor of mortality Eur. J. Cardiothorac. Surg., February 1, 2006; 29(2): 144 - 149. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. J. Murphy, C. A. Rogers, M. Caputo, and G. D. Angelini Acquiring Proficiency in Off-Pump Surgery: Traversing the Learning Curve, Reproducibility, and Quality Control Ann. Thorac. Surg., November 1, 2005; 80(5): 1965 - 1970. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Keogh, D. Spiegelhalter, A. Bailey, J. Roxburgh, P. Magee, and C. Hilton The legacy of Bristol: public disclosure of individual surgeons' results BMJ, August 21, 2004; 329(7463): 450 - 454. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ANN THORAC SURG | ASIAN CARDIOVASC THORAC ANN | EUR J CARDIOTHORAC SURG |
| J THORAC CARDIOVASC SURG | ICVTS | ALL CTSNet JOURNALS |