## Convenience Sampling

Convenience sampling collects only those units from the population that can be obtained easily, such as the top layer of a pallet of boxes or trays with vials, or the first cavity in a multi-cavity molding process. This may produce a biased sample, because it represents only a small part, or a single time window, of the whole processing window for a batch of products. The term bias indicates that we obtain the value of interest with a systematic error; we study bias in more detail later in this chapter. Convenience sampling is often justified with the argument of population homogeneity. ${ }^8$ This argument implies that either the population units do not truly differ from one another or the process produces the population of units in random order. Under these assumptions it is indeed irrelevant which set of units is collected, but the assumptions seem to contradict the need for sampling in the first place and are hardly ever justified.
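The effect of a time window can be illustrated with a small simulation. The sketch below is only an illustration under stated assumptions: the batch, its upward drift in fill weight, and the function names `simulate_batch`, `convenience_sample`, and `simple_random_sample` are all hypothetical and do not come from the text. It compares the mean of the first 50 units produced (a convenience sample) with the mean of 50 units drawn at random from the whole batch.

```python
import random
import statistics


def simulate_batch(n_units=1000, drift=0.01, noise_sd=0.5, seed=1):
    """Hypothetical batch: the measured value drifts upward with production order."""
    rng = random.Random(seed)
    return [100.0 + drift * i + rng.gauss(0.0, noise_sd) for i in range(n_units)]


def convenience_sample(batch, n):
    """Take the n most accessible units; here, simply the first n produced."""
    return batch[:n]


def simple_random_sample(batch, n, seed=2):
    """Draw n units without replacement, each unit being equally likely."""
    rng = random.Random(seed)
    return rng.sample(batch, n)


batch = simulate_batch()
true_mean = statistics.mean(batch)
conv_mean = statistics.mean(convenience_sample(batch, 50))
srs_mean = statistics.mean(simple_random_sample(batch, 50))

print(f"population mean:         {true_mean:.2f}")
print(f"convenience sample mean: {conv_mean:.2f}")
print(f"random sample mean:      {srs_mean:.2f}")
```

Because the simulated process is not homogeneous over time, the convenience sample sees only the start of the run and its mean is systematically too low. If the drift were set to zero, the homogeneity argument would hold and both samples would perform similarly.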

## Haphazard Sampling

Haphazard sampling is often believed to be an excellent way of collecting samples, because it gives the impression that each unit was collected completely at random. ${ }^9$ This way of sampling is best described by an example. If someone stands in front of a bookshelf in a library and is asked to pick an arbitrary book, then "just picking one" would be a haphazard sample. In practice, however, this procedure typically selects books from the center of the bookshelf, and typically books that are larger or thicker. This is usually not what people feel or believe they are doing when they try to take an arbitrary book. Hence, despite the feeling of randomness that accompanies haphazard sampling, the resulting sample is often not truly random. Another example is that human beings tend to choose smaller digits when they are asked to choose digits from 1 to 6 (Towse et al. 2014).

## Purposive Sampling

Purposive sampling, or judgmental sampling, tries to sample units for a specific purpose. The collection of units is focused on one or more particular characteristics, so the sampled units tend to be more alike than the units in the population at large. In epidemiological research ${ }^{10}$ purposive sampling can be very practical, since it may be used to exclude subjects with high risks for unrelated diseases. In clinical trials ${ }^{11}$ inclusion criteria (e.g., participants older than 65 years) and exclusion criteria (e.g., no pregnant women) are applied explicitly to make sure a sample has specific characteristics. This way of sampling is strongly related to the definition of the population, since deliberately excluding units from the sample is analogous to limiting the population of interest. Thus purposive sampling may be useful, but it is limited: in general it does not allow us to make statements about the whole population, but at best about a limited part of it (and even that is not certain). In other words, it most likely produces a biased sample with respect to the complete population.
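The link between an inclusion criterion and the population of interest can be made concrete with a small simulation. This is a minimal sketch under invented assumptions: the population of 20,000 people, the linear relation between age and blood pressure, and all variable names are hypothetical. Only the criterion "older than 65 years" is taken from the example in the text.

```python
import random
import statistics

rng = random.Random(7)

# Hypothetical population: age is uniform between 20 and 90 years and
# systolic blood pressure rises with age (an invented relation).
people = []
for _ in range(20000):
    age = rng.uniform(20, 90)
    sbp = 100 + 0.5 * age + rng.gauss(0, 10)
    people.append((age, sbp))

# Inclusion criterion: participants older than 65 years.
eligible = [person for person in people if person[0] > 65]
sample = rng.sample(eligible, 200)

mean_all = statistics.mean(sbp for _, sbp in people)
mean_eligible = statistics.mean(sbp for _, sbp in eligible)
mean_sample = statistics.mean(sbp for _, sbp in sample)

print(f"mean, whole population:      {mean_all:.1f}")
print(f"mean, eligible subpopulation: {mean_eligible:.1f}")
print(f"mean, purposive sample:       {mean_sample:.1f}")
```

In this setup the sample mean is close to the mean of the eligible subpopulation but clearly above the mean of the whole population, which is the sense in which the sample is biased with respect to the complete population.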

All the sampling methods discussed above carry the risk that some units are much more likely to be included in the sample than others, which can make statistics computed on the sample data poor estimates of the population parameters of interest. Even worse, with non-representative sampling we do not actually know how likely each unit was to be included. Hence, even if we wanted to, we could not correct for these systematic differences between units. When performing representative sampling, we sample units in such a way that we do know how likely each unit is to be included in the sample (even if these probabilities differ from unit to unit).
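Why known inclusion probabilities matter can be sketched as follows. The example is an assumption-laden illustration, not a method described in the text: it uses a hypothetical population of 9,000 "small" and 1,000 "large" units, includes each unit independently with a known probability, and then weights each sampled value by the inverse of its inclusion probability (a ratio form of the Horvitz-Thompson estimator, often called the Hájek estimator). All names and numbers are invented for the illustration.

```python
import random

rng = random.Random(42)

# Hypothetical population of (value, inclusion probability) pairs.
# Large units are ten times more likely to be sampled than small units.
population = (
    [(rng.gauss(10, 2), 0.02) for _ in range(9000)]     # small units
    + [(rng.gauss(50, 5), 0.20) for _ in range(1000)]   # large units
)
true_mean = sum(value for value, _ in population) / len(population)

# Include each unit independently with its own known probability.
sample = [(value, prob) for value, prob in population if rng.random() < prob]

# Naive estimate: ignores that large units are over-represented.
naive_mean = sum(value for value, _ in sample) / len(sample)

# Weighted estimate: each sampled unit stands in for 1 / prob population units.
weighted_mean = (
    sum(value / prob for value, prob in sample)
    / sum(1 / prob for _, prob in sample)
)

print(f"population mean: {true_mean:.2f}")
print(f"naive mean:      {naive_mean:.2f}")
print(f"weighted mean:   {weighted_mean:.2f}")
```

The naive mean is pulled toward the large units, while the weighted mean lands close to the population mean. The correction is possible only because the inclusion probabilities are known; with convenience, haphazard, or purposive sampling they are not, so no such weights can be constructed.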

