• Keine Ergebnisse gefunden

Comparison of Several Regression Procedures for Method Comparison Studies and Determination of Sample Sizes Application of linear regression procedures for method comparison studies in Clinical Chemistry, Part II

N/A
N/A
Protected

Academic year: 2022

Aktie "Comparison of Several Regression Procedures for Method Comparison Studies and Determination of Sample Sizes Application of linear regression procedures for method comparison studies in Clinical Chemistry, Part II"

Copied!
15
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

J. Clin. Chem. Gin. Biochem.

Vol. 22, 1984, pp. 431-445

Comparison of Several Regression Procedures for

Method Comparison Studies and Determination of Sample Sizes

Application of linear regression procedures for method comparison studies in Clinical Chemistry, Part II

By //. Passing

Abt. für Praktische Mathematik, Hoechst AG, Frankfurt am Main 80, and

W. Bablok

>

Allg. Biometrie, Boehringer Mannheim GmbH, Mannheim (Received October 18, 1983/February 27, 1984)

Summary: In part I of this series (H. Passing & W. Bablok (1983), J. Clin. Chem. Clin. Biochem. 27, 709—720) we described a new biometrical procedure for the evaluation of method comparison studies. In part II we now discuss its properties and compare them with thöse of other established procedures by means of a Simulation study, We demonstrate that the reliability of the results not only depends on the sample size but also on the sampling distribution, the precisipn of the methods, and the concentration ränge covered by the samples. Linear regression and principal component procedures are either inadequate or not äs reliable äs our new procedure. The appropriate sample size is discussed and recommendations are given.

Vergleich verschiedener Regressionsverfahren für Methodenvergleichsstudien und Bestimmung von Stichprobenumfängen

Anwendung von linearen Regressionsverfahren bei Methodenvergleichsstudien in der Klinischen Chemie, Teil II

Zusammenfassung: Im ersten Teil unserer Arbeit (//. Passing & W. Bablok (1983), J. Clin. Chem. Clin.

Biochem. 21, 709—720) haben wir ein neues biometrisches Verfahren zur Auswertung von Methodenver- gleichen vorgestellt. Im zweiten Teil mm werden seine Eigenschaften untersucht und im Rahmen einer Simu- lationsstudie mit denen anderer, bereits etablierter Verfahren verglichen. Wir zeigen, daß die Zuverlässigkeit der Auswertungsergebnisse nicht nur vom Stichprobenumfang, sondern auch von der Stichprobenverteilung, der Präzision der Methoden und dem Konzentrationsbereich der Proben abhängt. Lineare Regression und die Hauptkomponenten-Verfahren sind entweder unzulänglich oder nicht so zuverlässig wie das neue Verfahren.

Die Abhängigkeit des Stichprobenumfanges von den Randbedingungen wird diskutiert, und Empfehlungen werden gegeben.

J. Clin. Chem, Clin. Biochem. / VoJ. 22, 1984 / No. 6

(2)

432 Passing and Babiok: Comparison of regression procedures for method comparison and determination of sample sizes

Contents

1. Introduction

2. Evaluation Procedures for Method Comparison 3. The Simulation Model

4. Comparison of Procedures P ι .to P? if β = l 4.1 The probability of a false positive test result if

no extreme values are present

4.2 The bias of the slope estimates if no extreme values are present

4.3 The influence of extreme measurement values 4.4 Very small size of measurement r nge

4.5 The influence of a non-constant CV on the behaviour of the procedures

4.6 The influence of the sample size on the above results

4.7 Recommendations 5. The Sample Size

5.1 The probability of a positive test result 5.2 Determination of the sample size 6. Conclusion

1. Introduction

In part I of our paper (1) we described a new statisti- cal linear regression procedure which can be em- ployed in the vevaluation of method comparison stu- dies. We showed its theoretical advantages com- pared with other statistical procedures and presented an example with real data to support our arguments.

In this paper we present the comparison of our procedure with other established procedures in order to demonstrate its merits. Further we shall ex- pose the strong dependence of the appropriate sam- ple size on certain properties which are inherent in the analytical methods for which a comparison should be performed.

The experimental design consists of drawing n inde- pendent samples from a population and measuring the analyte in question with each of the two meth- ods. The evaluation usually consists of fitting a straight line Υ = α -l· X to the data and in testing the hypotheses β = l and α = 0. The evaluation procedures discussed here differ with regard to the estimators, the meaning of β and a, and the tests of the hypotheses. The statistical models on which an evaluation can be based are discussed in part I. The notation and meaning of X and Υ will be the same s in part I.

In this context we should like to point out that the problem of determining the parameters of a linear equation by which one method is transformed into the other is a different one. It deserves a separate treatment and will be the topic of part III of our pa- per.

2. Evaluation Procedures for Method Comparison The evaluation procedures discussed in this paper are divided in two groups, one which is invariant with regard to the assignment of the methods to X and Y, and one which is not. A procedure is invar^

iant, if after interchanging X and Υ the respective estimators of β and α can be transformed into each other, and if the test of the hypotheses β = l and α = 0 gives the same results under both assignments.

2.1 Invariant procedures

The invariant procedures asume that the Variation of the observed values Xi and yi has two independent sources: one is the Variation within the population of all possible samples, the other one is the measure^

ment error within each sample. This leads to the par- tition

Xi = χ* + ξί and yi = y,* -l· ηί?

where x* and y* are the expected values within the i-th sample and ζι and η, are the rneasurement errors.

The procedures ass me the structural relationship γί = α + βχΓ

between the expected values.

Procedure P/: This is our new procedure s de- scribed in part L The estimators of β and α are both medians. The usual statistics, such s mean, Standard deviation and correlation coefficient are not re- quired. The hypothesis tests do not ask for any spe- cific distributional assumptipns. τ

Procedure Py. Standardized principal component analysis (2). Its estimators are based on means and Standard deviations; the tests require certain distri- butional properties of the data.

Procedure P3: Principal component analysis (2, 3).

The theoretical background is identical to that of ?2 and the test of the hypothesis β = l is the same; the estimators, however, involve in addition the coeffi- . cient of correlation.

2.2 Procedures which are not invariant These procedures ass me that one variable is free of random Variation iinplying that it is fixed.

The underlying statistical models are given by yi = α -l- xj Φ ηί with fixed xj^or

Xi = A + Byi + ξι with fixed yf.

?.2.1 Procedures assuming X to be fixed

Procedure P4: TheU's procedui-e (4): This is similar to PI, in particular the estimator of β is also a medi-

J. Clin. Chem. Clin. Biochem. / Vol. 22, 1984 / No. 6

(3)

an. The test of the hypothesis β = l is distribution- free. Theil does not give an estimator for α; α may be estimated s in PI.

Procedure PS: This is the classical linear regression based on least squares (5). Its estimators are based on means, Standard deviations and the correlation coefficient ( s for ?2 and PS); the tests require cer- tain distributional properties of the underlying data.

2.2.2 Procedures aswning Υ ίο be fixed Procedure PO is identical to procedure ?4 and procedure P7 identical to procedure P5, only the as- signment of the methods to X and Υ is interchanged.

(If the arithmetical evaluation of a method compari- son is carried out by a procedure which is not invar- iant then usually both ?4 and PO, or PS and P? are cal- culated.)

3. The Simulation Model

If the same data set is evaluated by the procedures PI to P7 the estimators of β and α usually yield different results. Moreover, the results of testing the hypo- theses β = l and α = 0 are not necessarily identical.

Therefore it is desirable to know which procedure really gives the correct result.

Each of the 7 procedures is based on certain ma- thematical assumptions which are different or even contrary to each other. If the properties of the real data ineet the assumptions of a particular procedure i t will give a reliable result; otherwise the result may be biased. Therefore, a given data set may satisfy the assumptions of one procedure but not of the other one so that systematic differences between the re- sults can be expected. If we restrict ourselves to the slope β — and β is the inost important parameter in such an evaluation -= then deviations may occur in two respects:

Firstly, an estimator b of β njay be biased in so far that it achieves values which are systematically larger (or smaller) than . Hence b would not estimate β but anything eise resulting possibly in an erroneous judgement of the methods. Consequently an unbi- ased estimation of b in realistic situations is a desira- ble property.

Secondly, testing the hypothesis β = l for the esti- mated slope b may result in a significant difference even though β = l is trae. If the mathematical as- sumptions of the test are met the probability of such a false positive result is restricted to the level γ (e. g.

γ = 5%). Otherwise the actual level - that is the true probability of obtaining a false positive result under the given circumstances — may be much higher than the nominal level of γ on which the test is

J. Clin. Chem. Clin. Biochem. / Vol. 22,1984 / No. 6

performed. Consequently, an evaluation procedure becomes inappropriate if significant differences be- tween the two methods would be found too fre- quently. Therefore, a second desirable property of a procedure is to achieve the level γ of probability in realistic situations.

Itfollows that a procedure gives the correct result ifits actual level is about y and ifits estimator ofthe slope is unbiased.

Obviously both properties cannot be studied by eva- luating real data sets, since the true relation between both methods is not known. They can, however, be judged by the results of Simulation experiments (6).

Here data sets describing a well defined "Situation"

are repeatedly generated to study the behaviour of PI to P? in detail. Our Simulation is based on the structural relationship model which gives a reasona- ble description of reality. Since procedures ?4 to P-j are frequently used in method comparison studies we have incl ded these procedures in the Simulation study. From the structural relationship model, it fol- lows that the CV's of the methods can be defined from the variances σ| and o\ of their respective mea- surement errors. For ease of notation, however, we shall use CVX and CVy when we refer to the coeffi- cient of Variation of method X or method Υ respec- tively.

Before we describe the details of the Simulation model we state the following general assumptions:

- There is a linear relationship between method X and method Y.

— The measurements of X and Y are realisations of independent continuous bivariate variables.

Let [cu, co] be the r nge of measurements for the method assigned to X, and [ cu, c0] the correspond- ing r nge for the method assigned to Y. Let c = ~^- be the common size of both raiiges with Kc< °o. A large size corresponding to c^8 will be represented by c = oo, a medium size corresponding to 4 ^ c < 8 by c = 4. Further, a small size of 2^c<4 will be mo- delled by c = 2 and a very small size of 1.25 ^c<2 by c = 1.25. Since every method has a lower detec- tion limit there is always cu>0.

As a measure of precision which is known before the Start of a method comparison experiment, we use the coefficient of Variation. In the Simulation model we assume CVX and CVy to be constant over their con- centration r nge1). This is much more realistic than

') The CV is chosen here s a famili r measure of precision. In I.e. (7) it is shown that other measures may be more approp- riate if the distribution of the measurement error is skew or has a kurtosis, but this property is without relcvance in this con- text.

(4)

434 Passing and Bablok: Comparison of regrcssion procedures for niethod comparison and determination of sample sizes

the usual assumption of constant Standard deviations (2, 5). For completeness sake, we also studied the influence of non-constant CV's on the evaluation procedures. The magnitude of the CV's is not inde- pendent of the size c of the measurement r nge since methods for constituents with a very small biological r nge ( s for instance electrolytes) require small CV's for the differentiation of measurements.

Therefore, the CV's are varied independently of each other from 2% to 13% for a medium or large size c and from 2% to 10% for a small size c. In the case of a very small size the CV's are varied from 1%

to 2%.

It can be shown that it is sufficient to take the inter- val [—, 1] for X and the interval [A ] for Υ s

c c

common r nge, whereby both CV's remain un- changed. This does not cause any loss of generality, but achieves independence from the real size of the measurement values.

In the Simulation model, n samples are drawn from the interval [—, 1]; the i-th sample has expected values x* and y,* = x*. (Since we study the estima- tion of β we assume a constant α and set it equal to zero).

The samples are generated from

- a uniform sampling distribution over [—, 1], rep- resented by equidistant x*, or

- a skew sampling distribution over [—, 1]: to achieve this the interval is divided into 5 sectionsc of equal length with equidistant x* covering 5%, 50%, 30%, 10% and 5% of n. In this way the measurements are concentrated more in the left part of the r nge. This distribution corresponds to many real situations where usually samples come both from healthy and diseased persons.

The choice of predefined x* from the uniform and skew sampling distribution actually leads to a func- tional relationship model. However, our first investi- gations demonstrated that the results from the Simu- lation on the basis of a true structural relationship model (with random generation of χΓ) did not differ from those of the functional relationship model.

Considering the amount of Computing time needed for the Simulation we decided to use the less de- manding functional relationship model. Besides, this model can be interpreted s a special case of the structural relationship model.

In order to estimate β reliably it would be optimal to have the samples located at the boundaries of the r nge s long s linearity is guaranteed. Obviously,

this sampling distribution will be insufficient for practical reasons. However, the uniform distribution lies between this extreme and the usually skew sam- pling distribution and is attainable.

The expected values are distorted by independe t measurement errors ξί and r\\ giving "measurement values" Xj = χ? + ξ,· and y\ = y* + T)J. These errors correspond to the precision of the methods. Three types of distribution of measurement errors are con- sidered:

— ξ,· and i]i are normally distributed.

— ξί and r|i both have a mixture of two normal distri- butions differing slightly from each other so that the resulting distributions of ξί and T)J look like a normal distribution; in particular they are sym- metric.

- ξ,· and η, both have a skew distribution with a pos- itive kurtosis so that they differ essentially from a normal distribution.

The latter two distributions are chosen since it is well known (8, 9) that in general the measurement errors are not normally distributed. Therefore it is advisa- ble to investigate several distributions.

So far the Simulation does not allow for large differ- ences between Xj and yj that occur frequently in real experiments. We call such a pair (xj, yi) an extreme value. It may be caused by a difference in specificity or by susceptibility to interferences of the methods and should not be removed from the evaluation without any experimental reasbn/Hence it is neces- sary to consider extreme vajues in the Simulation model; they are introduced to the data by changing some values of Y up to ± 50%.

In summing up we define one Simulation step by the following parameters: c = common size of ranges, n

= sample size, CVX, CVy, , sampling distribution, distribution of measurement errors and number and Position of extreme values.

In our basic Simulation model n pairs (xi, yj) are gen- erated for each choice of Simulation parameters. The estimators bi, ,.., b? of the slope are calculated ac- cording to PI, ..., P7, and the hypothesis = l is tested with respect to PI, ..., P7. These Steps are per- formed 500 times for each choice of the Simulation parameters. From the 500 bj's the median med (bi) is calculated. The bias of bi is estimated (med (bi) - ).

The proportion of significant test results given by Pj is an estimator of the actual level of = P\ if = 1.

The 7 procedures are compared with regard to their aetual level and to their bias, if J= 1. The case β Φ l deserves a separate investigation. As the maximum

J. Clin. Chem. Gin. Biochem. / Vol. 22,1984 / No. 6

(5)

sample size we choose n = 90 since otherwise the Simulation expenditure would be too large. The in- fluence of extreme values is not investigated and on- ly normally distributed measurement errors are con- sidered. The probability of a significant test result if β φ l is true, is called the power. It is tabulated for several parameter combinations. The power should be large if there is a relevant difference between β and 1. Therefore the user must define a relevant value, rd, which is adequate to his specific problem.

Then, if β is larger than rei or less than l/ rci the power should be sufficiently large, say ^ 80%. It is shown that this desirable property depends on the suitable choice of the sample size n. A list of such sample sizes is given.

Since it is difficult to assess the properties of an experimental data set with respect to the model as- sumptions of a regression procedure, we find it more advantageous to demonstrate how a procedure be- haves if certain assumptions are not met.

4. Comparison of Procedures PI to P? if β = l 4.1 The p r o b a b i l l i t y of a false positive test

result if no extreme v a l u e s are present We demonstrate the results of the Simulation study for the sample size n = 40. The probabilities ob- tained for a false positive test result are accurate up to ±2%. A summary is given in table l, where CVX

is assumed without loss of generality. The

Tab. 1. Actual probability of a false positive test result for n = 40 (γ = 5%); no extreme values are present.

PI = our new procedure

P2 = standardized principal component analysis

?3 = principal component analysis

?4 = Theil's procedure, X assumed to be fixed

PS = least squares linear regression, X assumed to be fixed PO = Theil's procedure, Υ assumed to be fixed

P? = least squares linear regression, Υ assumed to be fixed Samples: uniform

Errors: normal Invariant

c CVX

(%) o 22 55 77 107 13 4 22 25 57 77 1013 2 22 25 75 107

CVy (%)

25 77 107 1013 1313 25 77 107 1013 1313 25 77 107 1010

Pl

56 76 55 67 76 45 44 85 69 66 57 137 165 48

P2

P3

68 108 87 109 88 58 77 116 129 67 69 179 215 104

Not invariant

P4

46 67 57 77 109 54 48 105 87 2413 56 49 247 3916

PS 78 89 128 118 2016 67 114 159 1110 2034 86 136 3310 2152

?6

56 109 106 1015 1513 48 12U 2510 2739 2928 197 4534 6625 4556

P7

117 1614 1911 2919 2624 157 2218 4L15 5537 4243 308 5844 7935 7156

Samples: skew Errors: normal Invariant

Pi 56 66 95 116 98 58 98 126 148 .87 166 2411 255 155

P.

P3

1717 2019 2121 2419 1917.

1416 1617 1712 1618 1410 178 2616 29 197 8

Not invariant

?4

56 106 77 108 1219 67 105 207 1510 2649 68 245 4714 3076

PS 1718 2315 2318 2422 2632 H14 1218 2617 2120 5331 1211 259 2151 3477

P6

69 109 2012 3016 2519 176 2923 4720 4068 4961 5110 7764 9349 7388

P7

2018 2923 2731 2745 3933 2815 3631 2617 4668 6252 5412 7763 2950 8775

Samples:

Errors: uniform

mixture of normals Invariant Not invariant P,

46 57 46 57 65 65 65 73 78 55 67 96 125 47

P2 P3

56 108 67 118 77 57 85 115 127 76 115 127 166 105

P4

47 69 65 76 116 56 66 57 96 2012 67 105 198 3613

PS 57 117 109 119 2112 57 77 127 139 3017 88 124 2510 4619

Pe 47 69 95 147 1211 67 1010 1119 2918 2221 194 3527 5121 4433

P7

58 1312 159 2714 2018 116 1915 2913 2344 3632 277 4934 6628 4556

Samples: uniform

Errors: skew with kurtosis Invariant Not invariant P,

56 76 68 78 76 47 78 109 99 47 56 146 127 77

P2

P3

166 1710 169 1315 1111 165 179 179 1318 117 125 268 237 98

P4

65 67 89 87 129 55 104 126 109 2414 65 104 2510 40S

PS 107 98 139 127 2411 59 125 187 129 3719 86 144 359 2452

Pe 67 98 1210 1316 1312 115 169 2313 3525 3027 215 4334 6325 4352

P7 237 3018 1613 4125 3322 277 2039 4920 4058 4737 417 6847 7933 6855

J. Clin. Chem. Clin. Bjochem. / Vol. 22,1984 / No. 6

(6)

436 Passing and Bablok: Comparison of regression procedures for method comparison and determination of sample sizes

properties of the individual procedures are discussed in the light of the overall results from the Simulation;

they are available on request. We give the Interpre- tation on the basis of differences in CV's:

I.

II,

III.

Both CV's are identical: P ι achieves its nominal level rather well for all Simulation parameters (see also part I). In many cases, however, P2 and P3 show insufficient results particularly if the size is large and the sampling distribution is skew; then they exceed their nominal level con- siderably. The procedures P4, PS, PO and P7 are far away from their nominal level unless both CV's are small. To illustrate an actual level of about 50% one could say that the outcome of a method comparison can be obtained by tossing a coin.

Both CV's are approximately identical, say l <-^-^< 1.5. Here PI is the best of all proce-CV

uvx .

dures in meeting the nominal level for all Simu- lation parameters, but it does occasionally ex- ceed γ = 5%. If the size c is large and the sam- pling distribution is skew then P2 and P3 exceed their nominal level considerably. If the size c is small the actual levels of PI, P2, PS are essential- ly higher than in case I. The actual levels of PO and PI are higher, and those of P4 and PS are lower when compared with case I.

Both CV's are rather different, say 1.5 <-CVy

cv

x

< 2.5: Then P4 is the procedure with the best result but it can also exceed γ. However, if both CV's are less than 7% and the sampling distri- bution is uniform then PI also meets the level γ rather well in contrast to P2 and P3. PS has a higher level than P4. In comparison with case II the levels of P4 and PS are decreased whereas those of P6 and P7 are further increased.

IV. Both CV's are essentially different, say yCV CVX

> 2.5: Then P4 achieves γ = 5%, whereas the level of P5 is in most of the cases higher. PI, P2, and Ρ3 may also exceed γ = 5% considerably.

The levels of P6 and P7 can go s far s 100%.

These results are plausible: If both CV's are very dif- ferent then X might be considered to be free of ran- dom Variation relative to Y, so that P4 and P5 will be appropriate; in contrast P6 and P7 are completely in- appropriate. P4 is superior to P5 since P4 is distribu- tion-free. If, however, both CV's are identical then no variable can be assumed to be free of random va- riation, and Pj, P2 or P3 will be applicable. PI is su- perior to P2 and P3 since it is not based on any distri- butional assumptions.

11 can be seen that the actual level of P ι is more inde- pendent of the sampling distribution than that of P2

or Ρ3, especially in the case of a large c which is typi- cal for most of the applications. It is obvious from the results that it does not make sense to perform both P4 and PO or PS and P7 since' il is likely that the test results will contradict each other.

4.2 The bias of the slope estimators if no extreme values are present

Again the sample size is restricted to n = 40. The resulting bias is accurate up to ± 1%. Table 2 shows the bias of bi, ..., b7 in per.cent of β = l in corfe- spondence to table 1.

Procedures PI, P2 arid PS show the same degree of bias if no extreme values are present. If any bias oc- curs then bi, b2 and b3 ovefestimate β whereas b4 and bs underestimate . be and b7 underestimate β s well; however, in table 2 the bias of — and — is given in order to demonstrate that b4 and be or bs and b7 are quite different and that they do not corre- spond to each other. b4 and bs have a similar bias, if any, and the same holds true for b^ and b7. The bias incfeases if c becomes smaller or the CV's are in- creased. As before we discuss four cases:

I. CVX = CVy: The slope estimators of the invar- iant procedures PI, P2 and P3 can be judged to be unbiased. The other procedures, however, may produce rather heavily -bi sed fesults.

II. l < < 1.5: bi, bCV 2 and b3 often are nearly unbiased. But they show a bias if the size c is small or if the sampling distribution is skew.

CVV

III. 1.5 < < 2.5: Here b4 and b5 are the least biased estimators. If both CV's are less than 7%

and if the sampling distribution is uniform then the invariant procedures also have only a small bias.

CVy

CVx

ble bias whereas bi, b2 and b3 may be strongly biased; b6 and b7 are highly foiased.

If an estimator is biased the corresponding actual test level is increased. The inverse, however, is not true. The test level may be highly increased even though the estimator is unbiased — caused by a vio- lation of the corresponding distributjon l assump- tions. Therefore PI with no distributional assump- tions is superior to P2 and P3, and P4 is superior to PS even if extreme values do not occur.

J. Clin. Chem. Clin..Biochem. / Vol. 22,1984 / No. 6

(7)

oV) O.

cd8 83

1

o S

Oc o~ '-'

""J QJ

11

1

c

h« 4^ '

5 1

II j?

CO. 7)

^o v*

x$> u .Ξ Ό^ 5 S £

% §

(N

g

i

t;

p

ε ·ξJ.J

£ *c ^

QM ^*

Ιώ

"cdoo Oc

<*-.

Q

0 ^o 2

Jt2 X

§ 'ε

*> :^

tl

<5>u3

"cd

* εω g

•s §

ΪΛ

•Hg

ε ε

co Ui

£ ω

v· cd·

fi ε"c o P C

ΪΛ

•B. Ϊ2

</> tu

ε ε

*>·

1

cdc

***0 Z

.*c

ccd

"**c Z0

ccd

1

c

Scd

*ncd

Z0

^ccd fe>

^c

1 I 1

25

1 s I

cu

cu

CUIf)

cu

^ tf

cu

Cur*.

ef

CU«Λ

Ou 0.ΪΛ

du

*

r«»

v&

CU CU

CU^

CUΓΟ

<N

G*

^

cC

-<C

C CU

^· '

1 CU

£ cC

^ υ >^

-r^

SS

υ

..„„— ...,

^ ^ ^ ^ ^ ^ ^ ^ ^ ^ , ^

a „. saa .. a38aB

1 ^- - ^ ~ - « Ο ί ~ Γ Μ < Ν

O (NJ r^ C5 ·"·* CS '•^ ^5 ΓΊ ^D C5 ^^ *·^ tN ^^ f) fO fi t**· CO »—* ^D ·™^ Νθ Ό fS C^J COr- ^ — . (N 1 I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 O * - « O O O ' ^ » - ' ' — ' ( N ^ · O O O t N C N m c o m v O O ^ H * - < O » O i r > O « - - O O

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 o - . « - m o « < * n - O f s ^ - u . ^ ^ ^ ^ 0 o ^ o o ^2o t - - O ' - « r ^ i - < m o ( N r i - < N O o c M T f ^ i n - ^ f o r ^ m o o ^ t c x j T i · — - o v o —

O O O O » — i O » — « ^ — < «— ' O O » — « » - ' O f S O c N ^ - C N J O O t N v o r ^ ^ O ^ O O

^5 *^* C>J '•^ CO ^^ CS '^O Ό *O G5 CS CO (Ό vO PO ^O <~< *·^ CS ^"* *O *~^ CO CO CO O"^ CO

» - ^ ^ p - ^_ ^ fsj ^ ^ ^

O O — « - * < N ~ - 4 « - ' Cr> m r ' ) O » - « ( N f N » O < N » O O O O ' - « ' « d - O s O O O C N O O O O

o o o — — — < < N ~ f n « o o o o c s c s c n r o m x o ^ r ^ ^ o « n u - ) 0 x o r -

1-H i—* i-^

I l l l l l l I l l l l l l I I 1 1 1 1 1

O O O C N C N C N C O « O O O Ο Ο Ο ^ - ^ Γ ^ Ο Ο ^ η

I l l l l l l I l l l l l l 1 1 1 1 1

*

co co ^^ co «-·* co co es *™^ co co *™^ es '•^ es ^r^ es <<i es co co es 10 es oo ^^ 10 co o o ~ o - o o ~ ~ o 0 - ^ , - ^ o r . ^ r . o o r . ^ ^ , r - - ^ o o o o o o o O ' — ' O O O i — • ' - • o e s o e s e o - ^ o o e s ^ e s r ^ ^ ^ o

O - ^ C M ^ ^ ^ O O V O - ^ = 0 = 0 ^ « > ^ J Q ^ ^ ^ J j q j q ^ J Q J N J N

»— « »—i < - t f s c s e s i— « rs rs Ό es \o 10 I l l l l l l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 co co co ^** I~M es es es ^~ oo co co co ^~ °^" r**· t*** r*·» ro es es es es es co es *~* r^*·

^ es ^ ~ es es co I l l l l l l I l l l l l l 1 1 1 1 1 1 1 1

O ^ ^ H O C M O O ^ C N J ^ O ^ C O C S U O O ^ O x v O O O v O ^ v O O \ O i O O

^-( ·— 1 ' »-^

o ^ r ^ c o e s o o ^ e s — · o ^ c o e s i n o c o o o m o o v o o < n ^ n o ^ o co ^^ Ί"™· co es *~^ »"^ co es ^^ co es co es vo co ^f ON \o co co ^o es vo o\ co "^ co o — f M c a ^ e S ^ - v o s o v o O c N j ^ - T t ^ - o v o » o i o ^ ^ ^ S S ^ c s c s

- c o c o o o c o o o e s e s e s

"""^^ — -ies — eses I l l l l l l 1 I l l l l l l I I 1 1 7 7 1

O Q O O T_4^H r^T i^C sj ^ . o o o e s ^ ^ j - e s c o N o r - » 1-11— " O - ^ - T t o o c x :

"" ~ f-^ r-* i-^

Ί 1 1 1 1 1 I l l l l l l I I 1 1 1 1 1 Ο Ο ^ ^ , ^ , Ο - H C O — O o r - i c s - i ^ - o m v o r o c o o e o r ^ ^ - - | O t ^ T - , . o o - ^ - Q ^ - m ^ o o - e s - ^ o r o m e M c o o e o t - ^ o o > o - f^t ζ^ e^ ^^ «-H o co es ·"* co co co *·^ co co co fS' ^* es es co es Ό co CJN co vo *~*

«-^ ^H r^ f-H «-^ t-^ τ~* r·^ ^^ *~^ *"^ '•^ »~^

i-H r-< »-4 r^t »— '

8 -^ es

•o

i

ccd

<:

CU

'S

Dy>

S

C U

'5b

M

8cd B

<

CJo.

^o

£*^

VlMo 8 J .B.1

υ

1

Im

15

a ί

J.,C!in, Chem. Clin. Biochem. / Vol. 22,1984 / No. 6

(8)

438 Passing and Bablok: Comparjson of regression procedures for method comparison and determination of sample sizes

4.3 The i n f l u e n c e of e x t r e m e m e a s u r e m e n t v a l u e s

We now consider the case that some of the measure- ment values of Υ differ distinctly from the corre- sponding values of X; the sample size is n = 40.

5% of the measurement values are systematically bi- ased in the following manner: If the expected values y\ are sorted to yfi) ^ · · · ^ y(40), then y^o) is decreased either by 50% or by 50% of the amount l - — whichever is the smaller. In the same way y(*40) is increased by 50% or by 50% of the amountc l . Clearly, y/40) lies outside the common r nge

c

of both methods; but since the actual r nge of Υ must be larger when extreme values are present, it is a realistic model. Table 3 shows the probability of a false positive test result, i.e. the actual test level, and table 4 gives the bias of bi, ..., b? in per cent of

- i-

The actual level achieved by PI is equal to or slightly higher than γ = 5%, if both CV's are approximately identical — provided the sampling distribution is uni- form. If the sampling distribution is skew and the size c not large then the actual level of PI may be considerably higher than 5%. Procedure P4 meets its nominal level very well provided that both CV's are rather different. bi tends to be biased particularly if the sampling distribution is skew and c is not large.

The actual level of ?2 and PS exceeds γ = 5% by far*

and is substantially higher than that of PI for all Sim- ulation parameters. If the sampling distribution is skew and the precision of both methods is high it can go up to 100%: here P? and PS would declare both methods to be significantly different even th ugh β = l is true. Both bi and bs are biased particularly if the sampling distribution is skew.

Now we consider the influence of the number of ex- treme values and their location (below or above the

Tab. 3. Probability of a false positive test fesult for n = 40 (γ = 5%); two extreme values are present.

Procedures see legend to table 1.

Samples: uniform Errors: normal Invariant

c CVX

(%)

*> 2 22 5 57 77 1310 4 2 22 55 77 107 13 2 2 22 55 7 107

cvy

(%)

25 77 107 1310 1313 25 77 107 1013 13 13 25 77 107 1010

P.

4 79 6 97 7 87 9 78 87 14 68 12 85 105 2011 18 114 5

P2

P3

2238 4235 4126 3234 2824 2815 3624 3420 27 3322 15 3717 4726 3912 25 10

P4

3 56 5 43 3 55 8 34 4 35 54 59 12 22 45 165 3010

Not invariant P5

0 00 01 1 11 1 2 00 ' 00 01 01 114 00 22 104 327

P6

148 1915 24 1419 25 2525 2013 2727 4223 40 4956 45 15 4066 54 7943 7361

P7

10099 99 9493 90 8987 83 82 100 9897 9094 8689 9487 83 9996 9691 o 8198 9485

Samples: skew Errors: normal Invariant

PI 76 86 14 58 11 86 148 2312 268 2518 169 318 4823 497 26 6

P2

P3

100100 100 9996 97 9187 8169 100100 98 9088 7779 8164 44 10097 9475 4383 6227

P4

65 4 35 46 4 88 34 54 39 66 3315 43 103 368 2360

Not invariant

ps

9174 63 46 4536 31 2816 9 2213 2310 16 66 74 11 117 113 4 97.

3*

P6

1119 27 2240 2133 49 4044 2044 6556 8250 71 9083 77 2981 9590 9980 9791

P7 100100 100 100100 10099 9998 96 100100 .100100 100 10099 10097 95 100100 100100 10097 10097

J. Clin. Chem. Clin. Bioehem. / Vol. 22,1984 / No. 6

(9)

Tab. 4. Bias in % of β for n » 40; two extreme values are prcscnt.

Proccdurcs scc legend to table 1.

Samples: uniform Enrors: normal Invariant

c CV.

χ 22

2 55 7 77 10 13

4 22

52 57 7 107 13

2 22

2 5 57 107

cv

y 25 77 107 U)13 1313 2 5 77 10 7 1310 1313 25 77 107 1010

Pi 01 21 21 23 33 21 33 52 57 64 41 97 124 95

*

10 11 1111 1210 1112 11 10 Π11 13U 1210 1214 12 10 1310 1613 17 10 159

P*

1111 1211 1210 1113 1211 1212 14 1314 11 1317 14 12 1114 1816 2211 19 12

P4

00 01 0i

0 - 1 - 1- 3 0 0 1 - 10 - 2 - 2- 2 - 6 - 9 00 - 41 - 5- 9 - 9-18

Not invariant PS

4 55 4 43 32

~ 11 43 3 - - - - 6 - 9 2 32 - 3- 4 - 9 - 9-18

P*

01 2 35 35 7 88 1 37 117 8 1218 19 19 3 1016 18 3018 3232

P7

1616 1818 1918 2219 2223 2019 2222 2522 2533 3333 19 2332 32 4332 4545

Pi 1 22 42 23 65 4 41 87 135 1016 138 3 2213 16 3210 24 10

Samples: skew Errors: normal Invariant

PJ 24 25 2523 25 2324 26 25 23 2627 2827 30

% 24 27 3127 21 27 3135 29 3722 3019

*

2526 27 2628 2527 2828 27 29 3032 3136 2832 39 3528 3136 4337 5229 4431

P4

01 01 - 10 - 1- 2 - 3- 7 0l 1 - 2- 1 - 6- 6 - 6-13 -21 - 2 00 -11 - 9-21 -36-20

Not invariant P5

16 1616 1516 14 1312 117 1515 1115 127 8 71 - 8 13 1212 2 - 82 - 7-23

P*

1 35 105 7 1013 15 16 39 15 2816 2818 41 4745 7 4125 45 4979 8279

P7

3233 3333 37 3335 4139 41 4139 4545 52 4349 5959 59 43 5261 6185 6182 82

regression line). Table 5 shows the simulated cases, where the extreme values are obtained s described above. Table 6 states the respective probability of a false positive test result and the bias of procedures Pi, P2 and P3 if both CV's are 7% (in this constella- tion procedures P4, PS, PO, P? are completely inade- quate).

Tab. 5. Distribution of the extreme values for n = 40.

Tab. 6. Actual probability of a false positive test result (γ = 5%) and bias in % of β = l for n = 40 and CVX = CVy = 7%, depending on the number and distribution of extreme values.

Procedures see legend to table 1.

Number of extreme values

1 2 3

%of n 2.5 5.0 7.5

Position in the sort sequence of the y*

40 3040 3035 40

Location with respect to regression line above belpw above below above above

Extreme

values Samples: uniform Errors: normal

% Actual

c

oc

4

2

VT..—iNun ber 01 2 3 0 21 3 0 21 3

of n

0 2.55 7.5 2.50 7.55 0 2.5 7.55

Bias

Samples: skew Errors: normal Actual

levei Pi

5 97 11 58 106 5 46 10

?2

P3

577 2677 336 6220 5 1216 34

Pi 02 31 0 22 5 0 43 8

?2

110 10 16 100 1018 0 109 16

p,

110 10 18 100 2111

1011 20

level Bias Pi

55 125 67 158 57 177

?2P3

2197 97 99 12 8179 97 407 4374

P.

21 2 4 0 53 8 04 10 16

?2

0 2322 29 220 24 35 180 2232

P3

0 2425 33 250 4128 240 29 45

J. Clin. Chem. Clin. Bjochem. / Vol. 22, 1984 / No. 6

Referenzen

ÄHNLICHE DOKUMENTE

Quantifying the results of tendency surveys: a comparison among different procedures.

Global gridded climatological (GGC) datasets, including precipitation and temperature, are becoming more and more precise, accessible, and common, but the utility of these datasets

Four different Hausdorff distances are computed: The distance between the initial subspace estimate and the true cointegrating space, the distance between the adapted subspace

The development of adaptive procedures corresponding to the robust procedures of point 2, and again the investi- gation of their asymptotic properties... The development of

The lack of significant differences in outcomes between L and NL at moderate-to-vigorous intensities, together with the adequacy of a simple approximate linear model and the

Local effects in the respiratory tract including sensory irritation as well as respiratory sensitisation are given more attention in the area of OELs derived by workplace

To demonstrate that the increased concentration of plasma histamine was actually free, pharmacologically active histamine, gastric acid secretion, heart or pulse rate

Due to recent advances in conic pro- gramming and due to the availability of new conic optimization solvers, e.g., ecos (Embedded Conic Solver; Domahidi et al. 2013), it is now