Schweitzer Fachinformationen
Wenn es um professionelles Wissen geht, ist Schweitzer Fachinformationen wegweisend. Kunden aus Recht und Beratung sowie Unternehmen, öffentliche Verwaltungen und Bibliotheken erhalten komplette Lösungen zum Beschaffen, Verwalten und Nutzen von digitalen und gedruckten Medien.
1
A simple comparative experiment
1.1 Key concepts
1. Good experimental designs allow for precise estimation of one or more unknown quantities of interest. An example of such a quantity, or parameter, is the difference in the means of two treatments. One parameter estimate is more precise than another if it has a smaller variance.
2. Balanced designs are sometimes optimal, but this is not always the case.
3. If two design problems have different characteristics, they generally require the use of different designs.
4. The best way to allocate a new experimental test is at the treatment combination with the highest prediction variance. This may seem counterintuitive but it is an important principle.
5. The best allocation of experimental resources can depend on the relative cost of runs at one treatment combination versus the cost of runs at a different combination.
Is A different from B? Is A better than B? This chapter shows that doing the same number of tests on A and on B in a simple comparative experiment, while seemingly sensible, is not always the best thing to do. This chapter also defines what we mean by the best or optimal test plan.
1.2 The setup of a comparative experiment
Peter and Brad are drinking Belgian beer in the business lounge of Brussels Airport. They have plenty of time as their flight to the United States is severely delayed due to sudden heavy snowfall. Brad has just launched the idea of writing a textbook on tailor-made design of experiments.
[Brad] I have been playing with the idea for quite a while. My feeling is that design of experiments courses and textbooks overemphasize standard experimental plans such as full factorial designs, regular fractional factorial designs, other orthogonal designs, and central composite designs. More often than not, these designs are not feasible due to all kinds of practical considerations. Also, there are many situations where the standard designs are not the best choice.
[Peter] You don’t need to convince me. What would you do instead of the classical approach?
[Brad] I would like to use a case-study approach. Every chapter could be built around one realistic experimental design problem. A key feature of most of the cases would be that none of the textbook designs yields satisfactory answers and that a flexible approach to design the experiment is required. I would then show that modern, computer-based experimental design techniques can handle real-world problems better than standard designs.
[Peter] So, you would attempt to promote optimal experimental design as a flexible approach that can solve any design of experiments problem.
[Brad] More or less.
[Peter] Do you think there is a market for that?
[Brad] I am convinced there is. It seems strange to me that, even in 2011, there aren’t any books that show how to use optimal or computer-based experimental design to solve realistic problems without too much mathematics. I’d try to focus on how easy it is to generate those designs and on why they are often a better choice than standard designs.
[Peter] Do you have case studies in mind already?
[Brad] The robustness experiment done at Lone Star Snack Foods would be a good candidate. In that experiment, we had three quantitative experimental variables and one categorical. That is a typical example where the textbooks do not give very satisfying answers.
[Peter] Yes, that is an interesting case. Perhaps the pastry dough experiment is a good candidate as well. That was a case where a response surface design was run in blocks, and where it was not obvious how to use a central composite design.
[Brad] Right. I am sure we can find several other interesting case studies when we scan our list of recent consulting jobs.
[Peter] Certainly.
[Brad] Yesterday evening, I tried to come up with a good example for the introductory chapter of the book I have in mind.
[Peter] Did you find something interesting?
[Brad] I think so. My idea is to start with a simple example. An experiment to compare two population means. For example, to compare the average thickness of cables produced on two different machines.
[Peter] So, you’d go back to the simplest possible comparative experiment?
[Brad] Yep. I’d do so because it is a case where virtually everybody has a clear idea of what to do.
[Peter] Sure. The number of observations from the two machines should be equal.
[Brad] Right. But only if you assume that the variance of the thicknesses produced by the two machines is the same. If the variances of the two machines are different, then a 50–50 split of the total number of observations is no longer the best choice.
[Peter] That could do the job. Can you go into more detail about how you would work that example?
[Brad] Sure.
Brad grabs a pen and starts scribbling key words and formulas on his napkin while he lays out his intended approach.
[Brad] Here we go. We want to compare two means, say and , and we have an experimental budget that allows for, say, n = 12 observations, n1 observations from machine 1 and or n2 observations from machine 2. The sample of n1 observations from the first machine allows us to calculate a sample mean for the first machine, with variance . In a similar fashion, we can calculate a sample mean from the n2 observations from the second machine. That second sample mean has variance .
[Peter] You’re assuming that the variance in thickness is for both machines, and that all the observations are statistically independent.
[Brad] Right. We are interested in comparing the two means, and we do so by calculating the difference between the two sample means, . Obviously, we want this estimate of the difference in means to be precise. So, we want its variance
or its standard deviation
to be small.
[Peter] Didn’t you say you would avoid mathematics as much as possible?
[Brad] Yes, I did. But we will have to show a formula here and there anyway. We can talk about this later. Stay with me for the time being.
Brad empties his Leffe, draws the waiter’s attention to order another, and grabs his laptop.
[Brad] Now, we can enumerate all possible experiments and compute the variance and standard deviation of for each of them.
Before the waiter replaces Brad’s empty glass with a full one, Brad has produced Table 1.1. The table shows the 11 possible ways in which the n = 12 observations can be divided over the two machines, and the resulting variances and standard deviations.
Table 1.1 Variance of sample mean difference for different sample sizes n1 and n2 for .
[Brad] Here we go. Note that I used a value of one in my calculations. This exercise shows that taking n1 and n2 equal to six is the best choice, because it results in the smallest variance.
[Peter] That confirms traditional wisdom. It would be useful to point out that the value you use does not change the choice of the design or the relative performance of the different design options.
[Brad] Right. If we change the value of , then the 11 variances will all be multiplied by the value of and, so, their relative magnitudes will not be affected. Note that you don’t lose much if you use a slightly unbalanced design. If one sample size is 5 and the other is 7, then the variance of our sample mean difference, , is only a little bit larger than for the balanced design. In the last column of the table, I computed the efficiency for the 11 designs. The design with sample sizes 5 and 7 has an efficiency of 0.333/0.343 = 97.2%. So, to calculate that efficiency, I divided the variance for the optimal design by the variance of the alternative.
[Peter] OK. I guess the next step is to convince the reader that the balanced design is not always the best choice.
Brad takes a swig of his new Leffe, and starts scribbling on his napkin again.
[Brad] Indeed. What I would do is drop the assumption that both machines have the same variance. If we denote the variances of machines 1 and 2 by and , respectively, then the variances of and become and . The variance of our sample mean difference then is
so that its standard deviation is
[Peter] And now you will again enumerate the 11 design options?
[Brad] Yes, but first I need an a priori guess for the values of and . Let’s see what happens if is nine times .
[Peter] Hm. A variance ratio of nine seems quite large.
[Brad] I know. I know. I just want to make sure that there is a noticeable effect on the design.
Brad pulls his laptop a bit closer and modifies his original table so that the thickness variances are and . Soon, he produces Table 1.2.
Table 1.2 Variance of sample mean difference for different sample sizes n1 and n2 for and .
[Brad] Here we are. This time, a design that requires three observations from machine 1 and nine observations from machine 2 is the optimal choice. The balanced design results in a variance of 1.667, which is 25% higher than the variance of 1.333 produced by the optimal design. The balanced design now is only 1.333/1.667 = 80% efficient.
[Peter] That would be perfect if the variance ratio was really as large as nine. What happens if...
Dateiformat: ePUBKopierschutz: Adobe-DRM (Digital Rights Management)
Systemvoraussetzungen:
Das Dateiformat ePUB ist sehr gut für Romane und Sachbücher geeignet – also für „fließenden” Text ohne komplexes Layout. Bei E-Readern oder Smartphones passt sich der Zeilen- und Seitenumbruch automatisch den kleinen Displays an. Mit Adobe-DRM wird hier ein „harter” Kopierschutz verwendet. Wenn die notwendigen Voraussetzungen nicht vorliegen, können Sie das E-Book leider nicht öffnen. Daher müssen Sie bereits vor dem Download Ihre Lese-Hardware vorbereiten.Bitte beachten Sie: Wir empfehlen Ihnen unbedingt nach Installation der Lese-Software diese mit Ihrer persönlichen Adobe-ID zu autorisieren!
Weitere Informationen finden Sie in unserer E-Book Hilfe.