Sample size and power calculations for detecting changes in malaria transmission using antibody seroconversion rate
Malaria Journal volume 14, Article number: 529 (2015)
Abstract
Background
Several studies have highlighted the use of serological data in detecting a reduction in malaria transmission intensity. These studies have typically used serology as an adjunct measure and no formal examination of sample size calculations for this approach has been conducted.
Methods
A sample size calculator is proposed for cross-sectional surveys using data simulation from a reverse catalytic model assuming a reduction in seroconversion rate (SCR) at a given change point before sampling. This calculator is based on logistic approximations for the underlying power curves to detect a reduction in SCR in relation to the hypothesis of a stable SCR for the same data. Sample sizes are illustrated for a hypothetical cross-sectional survey from an African population assuming a known or unknown change point.
Results
Overall, data simulation demonstrates that power is strongly affected by assuming a known or unknown change point. Small sample sizes are sufficient to detect strong reductions in SCR, but invariably lead to poor precision of estimates for current SCR. In this situation, sample size is better determined by controlling the precision of SCR estimates. Conversely, larger sample sizes are required for detecting more subtle reductions in malaria transmission, but those invariably increase precision whilst reducing putative estimation bias.
Conclusions
The proposed sample size calculator, although based on data simulation, shows promise of being easily applicable to a range of populations and survey types. Since the change point is a major source of uncertainty, obtaining or assuming prior information about this parameter might reduce both the sample size and the chance of generating biased SCR estimates.
Background
The global decline of malaria burden has brought new challenges to disease control and elimination [1]. These challenges encompass problems related to parasite rate (PR) estimation in detecting low parasitaemia or submicroscopic infections [2–4] and the potentially prohibitively large sample sizes required for PR to be epidemiologically informative. In low transmission settings, alternative malariometrics, such as antimalarial antibody seroprevalence (SP) and seroconversion rate (SCR), have been proposed to overcome some shortcomings of other measures [5]. In practice, SP is statistically defined as the proportion of antibody-positive individuals and reflects antibody responses induced by current and possibly historic infections. Two recent studies highlighted the potential of using SP to discriminate sites with different Plasmodium falciparum endemicity levels that otherwise would appear to be similar in terms of parasite rate [6, 7]. SCR is the frequency per unit of time (e.g., year) by which seronegative individuals become seropositive. This parameter, related to the underlying force-of-infection, is typically assessed via cross-sectional data where SP as a function of the age of the individual is described by a given stochastic model. The reverse catalytic model is the most popular choice for data analysis and is based on the simple notion that individuals randomly transit between seronegativity and seropositivity with specific transition rates over time [8]. The superinfection model extends the latter to the scenario where there are different states (or levels) of seropositivity resulting from recurrent malaria exposure [9]. However, this more complicated model does not have a dramatic impact on SCR estimation [9, 10] and is therefore most likely to be excluded from routine data analysis.
The first step of a seroepidemiological analysis invariably assumes a constant SCR that applies to every individual in the population at all times. This assumption implies a simple and increasing SP curve taken as a function of the age of the sampled individuals. However, there are several studies reporting a qualitative change of the SP at a given age value in relation to what is expected from a constant SCR assumption [7, 11, 12]. This change might result from more complex seroepidemiological scenarios where SCR is assumed to vary over time or among distinct age groups, as reviewed elsewhere [13, 14]. Three main explanations were advanced for such qualitative change in SP, each one implying a different mathematical model for the data. Firstly, age-related behaviour might affect the malaria risk of certain age groups. An example of such risk behaviour was reported in Indonesia where SCR in adults was increased in relation to the SCR for younger individuals, most likely because of work-related activities in the forest and exposure to forest vectors [12, 15]. Secondly, a change in the SP curve could be related to putative founder effects, i.e., an influx of non-exposed migrants to an endemic region. Migrants and individuals born locally would have different infection histories, thus presenting different SP profiles. This situation occurred in Brazil where there was a wave of migration in the 1980s from malaria-free states to mining sites in the heart of the Amazon forest [16]. A similar founder effect was seen in Chagas disease in a Peruvian community [17]. Thirdly, a change in the SP curve might also be attributed to a reduction in malaria transmission after the implementation or intensification of a given malaria control programme [18]. It is expected that this scenario will become increasingly common and it is therefore important that surveys that collect serological information do so with sufficient statistical power to detect impact on malaria transmission and be informative for the control and research communities.
This paper focuses on sample size calculations for detecting an abrupt reduction in disease transmission that occurred at some point in the past. It is worth noting that this statistical exercise is affected by the transmission intensities acting before and after the reduction, and the time between the change point and sample collection. Until now only a few studies have reported a change in transmission using SP data and those referred to dramatic reductions in SCR after intervention (Table 1). A similar observation can be made for seroepidemiological studies of Chagas disease and trachoma [17, 19, 20]. One possible explanation is that a slowly decreasing trend in malaria transmission might not result in a clear qualitative change in the age-adjusted SP, as demonstrated in Western Kenya [21]. Alternatively, it may be that studies were underpowered to detect small reductions in disease transmission.
Previously two sample size calculators were proposed for estimating SCR under stable disease transmission [22]. The present paper extends this work to the setting of detecting a reduction in SCR at a given time point before sampling and attributed to a field intervention. This new calculator is based on logistic approximations for the power using simulated data sets. As in a previous study [22], bias and precision of ensuing parameter estimates were also assessed via data simulation.
Methods
Reverse catalytic models for seropositivity data
Reverse catalytic models have recently been used to analyse seropositivity data [5, 13, 14]. Using a Markov chain formalism, these models describe the dynamics of the serological state of an individual under the assumption that everyone is born seronegative and becomes seropositive upon malaria infection. Reversion to a seronegative state might occur in the absence of sufficiently frequent malaria exposure. The basic notion is that the frequency by which individuals become seropositive (i.e., SCR) reflects the underlying force-of-infection while the rate by which seropositive individuals revert to a seronegative state (i.e., seroreversion rate (SRR)) might result from a variety of host factors (e.g., genetics or age).
The simplest epidemiological setting is to consider a constant disease transmission over time. In this situation, the resulting reverse catalytic model is described by the following probability of an individual aged t being seropositive:
\[ P(t) = \frac{\lambda}{\lambda + \rho}\left(1 - e^{-(\lambda + \rho)t}\right) \qquad (1) \]
where \( \lambda \) and \( \rho \) are SCR and SRR, respectively. Notwithstanding its simplicity, this model would appear to be appropriate for the P. falciparum malaria history of Somalia [6], the Brazilian Amazonian region [7], northeast Tanzania [8], or even when SCR randomly fluctuates around a given mean value over time [5].
A more interesting setting is to assume the occurrence of a sudden reduction in malaria transmission due to an intervention. In this scenario, SRR is commonly assumed to be constant over time, thus precluding any significant changes in factors affecting seroreversion. The probability of an individual aged t being seropositive is now given by
\[ P(t) = \begin{cases} \dfrac{\lambda_2}{\lambda_2 + \rho}\left(1 - e^{-(\lambda_2 + \rho)t}\right), & t \le \tau \\[2ex] \dfrac{\lambda_1}{\lambda_1 + \rho}\left(1 - e^{-(\lambda_1 + \rho)(t - \tau)}\right)e^{-(\lambda_2 + \rho)\tau} + \dfrac{\lambda_2}{\lambda_2 + \rho}\left(1 - e^{-(\lambda_2 + \rho)\tau}\right), & t > \tau \end{cases} \qquad (2) \]
where \( {\lambda_1} \) and \( {\lambda_2} \) are the SCRs before and after the reduction in disease transmission, respectively \( ({\lambda_2} < {\lambda_1}) \), and \( \tau \) is the time point when that reduction actually occurred (in years before sampling). In simple terms, the above equation is divided into two branches according to the time when the individuals were born in relation to the change point \( \tau \). For individuals born before the change point (t > \(\tau\)), the seropositivity probability results from the sum of two probabilities: (1) one referring to the event of an individual being seropositive at the change point and remaining so after that; and (2) another one describing the chance of an individual being seronegative at the change point and seroconverting after that. The individuals born after the intervention (t ≤ \(\tau\)) have only experienced current malaria transmission intensity and, thus, the corresponding seropositivity probability is described by the simple reverse catalytic model, as shown in Eq. (1). For a detailed mathematical derivation of the above equation, see Additional file 1.
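As an illustration, the two seropositivity curves can be sketched in a few lines of Python. This is a minimal sketch that assumes the standard closed-form solution of the reverse catalytic model; the function names `sp_stable` and `sp_change` are hypothetical and are not taken from the authors' R scripts.

```python
import math

def sp_stable(t, lam, rho):
    """Seropositivity probability at age t under a constant SCR (lam) and SRR (rho)."""
    k = lam + rho
    return lam / k * (1.0 - math.exp(-k * t))

def sp_change(t, lam1, lam2, rho, tau):
    """Seropositivity probability at age t when SCR dropped from lam1 to lam2
    tau years before sampling (SRR rho assumed constant)."""
    if t <= tau:
        # Born after the change: exposed to the current SCR only.
        return sp_stable(t, lam2, rho)
    # Born before the change: probability of being seropositive at the change point.
    p_at_change = sp_stable(t - tau, lam1, rho)
    k2 = lam2 + rho
    decay = math.exp(-k2 * tau)
    # After the change, the probability relaxes towards lam2 / (lam2 + rho).
    return p_at_change * decay + lam2 / k2 * (1.0 - decay)

# Example: reduction from 0.0969 to 0.0108 five years before sampling, SRR = 0.017.
curve = [sp_change(age, 0.0969, 0.0108, 0.017, 5) for age in range(1, 61)]
```

Two quick sanity checks on such a sketch are that the curve is continuous at t = \(\tau\) and that it collapses to the stable model when the two SCRs are equal.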
Model parameterization, estimation and comparison
Model parameterization was the same as previously described [22]. Briefly, the relationships between PR, entomological inoculation rate (EIR), SP, and SCR were used to derive realistic values for SCR and SRR. These relationships were derived from two independent data sets from northeast Tanzania where altitude is highly correlated to these different malaria risk measures. EIRs of 10, 1, 0.1, and 0.01 were used as the core values for studying reductions in disease transmission. The corresponding values for SCR associated with P. falciparum MSP1 antigen are 0.0969, 0.0324, 0.0108, 0.0036, respectively. Although some variations on SRR may be found across different studies, this parameter was fixed at 0.017 as the average value of a large study from 24 sites in Tanzania with differing malaria endemicity [5] and genetic background [23, 24].
With respect to parameter estimation, the sampling distribution was assumed to be a binomial-product distribution, one binomial distribution per age value. The simple reverse catalytic model was estimated via the standard maximum likelihood method [25], whereas the parameter estimates of the model assuming a change in transmission were determined using a profile likelihood method. In general, the latter estimation method aims to reduce the dimensionality of the maximization problem associated with the maximum likelihood method. The basic idea is to maximize the log-likelihood function considering one of the parameters fixed at a given value. Maximization is successively carried out using different values of interest for that fixed parameter. The overall estimates are the ones associated with the maximum of all the maximized log-likelihood functions obtained from considering those fixed values. This simple idea has been applied to statistical problems where one aims to estimate a model that includes a parameter defined in the integer space, such as the number of different T cell receptors present in the organism [26]. In this line of thought, the change time point was then considered to be an integer value (i.e., years before sampling) leading to the following profile likelihood algorithm: (1) fix the change point \(\tau\) at 1; (2) determine the respective maximum likelihood estimates for the remaining parameters; (3) calculate the corresponding log-likelihood function; (4) increase the change point by one unit (e.g., one year before sampling) and repeat steps (2)–(3); (5) keep increasing the change point until reaching the maximum expected value for that parameter. The final maximum likelihood estimates are those associated with the change point \(\tau\) with the maximum value of the log-likelihood function. A detailed example on using this algorithm in real data can be found in Cook et al. [18].
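The profile likelihood steps listed above can be sketched as follows. This is only an illustrative Python sketch under stated assumptions: SRR is held fixed, the inner maximization in step (2) is replaced by a crude search over a user-supplied grid of candidate SCR values (a numerical optimizer would be used in practice), and the names `loglik` and `profile_likelihood` are hypothetical.

```python
import math

def sp_change(t, lam1, lam2, rho, tau):
    """Seropositivity probability under a drop in SCR tau years before sampling."""
    k1, k2 = lam1 + rho, lam2 + rho
    if t <= tau:
        return lam2 / k2 * (1.0 - math.exp(-k2 * t))
    p_tau = lam1 / k1 * (1.0 - math.exp(-k1 * (t - tau)))
    decay = math.exp(-k2 * tau)
    return p_tau * decay + lam2 / k2 * (1.0 - decay)

def loglik(data, lam1, lam2, rho, tau):
    """Binomial-product log-likelihood, one binomial per age value.
    data holds (age, n_positive, n_total); constant terms are dropped."""
    total = 0.0
    for age, n_pos, n_tot in data:
        p = min(max(sp_change(age, lam1, lam2, rho, tau), 1e-12), 1.0 - 1e-12)
        total += n_pos * math.log(p) + (n_tot - n_pos) * math.log(1.0 - p)
    return total

def profile_likelihood(data, rho, max_tau, grid):
    """Return (log-likelihood, tau, lam1, lam2) at the profile maximum."""
    best = None
    for tau in range(1, max_tau + 1):        # steps (1), (4) and (5)
        for lam1 in grid:                    # step (2): crude grid search
            for lam2 in grid:
                ll = loglik(data, lam1, lam2, rho, tau)   # step (3)
                if best is None or ll > best[0]:
                    best = (ll, tau, lam1, lam2)
    return best
```

The grid search keeps the sketch dependency-free; it does not enforce \( \lambda_2 < \lambda_1 \), so a fitted pair with equal values simply corresponds to the stable model.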
After performing parameter estimation, the two reverse catalytic models were compared to each other using Akaike’s information criterion (AIC) [27]. This pragmatic approach seems more appropriate than the popular Wilks’ likelihood ratio test because the usual chi-square approximation associated with the latter may be affected by the sample size to be determined. Theoretically, AIC weights the likelihood of a model given the data against its intrinsic dimension (i.e., the total number of parameters). One should then select the model that leads to the smallest AIC estimate. In this scenario, power to detect a reduction in SCR was estimated by the proportion of (simulated) data sets in which the corresponding model was considered better than the one assuming a stable SCR.
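The model comparison and the resulting power estimate can be sketched as below. The AIC formula is the standard one; the default parameter counts (two for the stable model, four for the change-point model with an unknown change point) are assumptions made for illustration, and the function names are hypothetical.

```python
def aic(loglik, n_params):
    """Akaike's information criterion: smaller values indicate a better model."""
    return 2.0 * n_params - 2.0 * loglik

def prefers_change_model(ll_stable, ll_change, k_stable=2, k_change=4):
    """True when the change-point model has the smaller AIC.
    k_stable and k_change are assumed parameter counts."""
    return aic(ll_change, k_change) < aic(ll_stable, k_stable)

def estimated_power(loglik_pairs, k_stable=2, k_change=4):
    """Proportion of simulated data sets in which the change-point model wins.
    loglik_pairs holds one (ll_stable, ll_change) pair per data set."""
    wins = sum(prefers_change_model(s, c, k_stable, k_change) for s, c in loglik_pairs)
    return wins / len(loglik_pairs)

# Two hypothetical data sets: the change-point model wins only in the first.
power = estimated_power([(-100.0, -97.0), (-100.0, -98.5)])
```

With these defaults the change-point model must improve the log-likelihood by more than two units to be selected; with a known change point the penalty would be one parameter smaller.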
Simulation study and sample size calculations
Before conducting the sample size calculation per se, data sets were first simulated from the simple reverse catalytic model assuming stable transmission intensity and tested against the reverse catalytic model assuming a change in transmission. To simulate each data set, the following algorithm was used: randomly select the age of each individual in the sample, and then generate the seropositivity state of each individual at the time of sampling, using a Bernoulli trial with success probability given by the seroprevalence expected under a given model [22]. Assuming SRR = 0.017, four SCR situations were studied, 0.0969, 0.0324, 0.0108 and 0.0036, corresponding to 10, 1, 0.1 and 0.01 EIR units, respectively, as described elsewhere [22]. The total number of simulated data sets per SCR value was 1000, which seemed to provide good precision of power estimates and a feasible total time to perform the simulation study. The results demonstrated that sample sizes of at least 100 individuals ensure a null probability of detecting a spurious change point irrespective of the transmission intensity (Table 2). Therefore, given the typical sample sizes used in seroepidemiological studies (see examples in Table 1), it is very unlikely to report a spurious change point.
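The data-generating algorithm described above can be sketched in Python as follows. This is a minimal sketch: the uniform age distribution over 1 to 60 years is a placeholder assumption (the study used an African age distribution), and `simulate_survey` is a hypothetical name.

```python
import math
import random

def sp_stable(t, lam, rho):
    """Expected seroprevalence at age t under a constant SCR (lam) and SRR (rho)."""
    k = lam + rho
    return lam / k * (1.0 - math.exp(-k * t))

def simulate_survey(n, ages, weights, sp_fn, rng):
    """Simulate one cross-sectional data set of n individuals.
    Ages are drawn from the given age distribution; each serological state is
    a Bernoulli trial with success probability sp_fn(age)."""
    sampled_ages = rng.choices(ages, weights=weights, k=n)
    return [(age, 1 if rng.random() < sp_fn(age) else 0) for age in sampled_ages]

# Example: 500 individuals, SCR = 0.0324 and SRR = 0.017, placeholder uniform ages.
rng = random.Random(2015)
ages = list(range(1, 61))
survey = simulate_survey(500, ages, [1] * len(ages),
                         lambda a: sp_stable(a, 0.0324, 0.017), rng)
```

Passing the seroprevalence function as an argument lets the same routine generate data under either the stable or the change-point model.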
A simulation study under the assumption of a change in transmission was then performed using four different reductions in SCR: 0.0969 to 0.0324 and 0.0108 (10 to 1 and 0.1 in EIR units, respectively), and 0.0324 to 0.0108 and 0.0036 (1 to 0.1 and 0.01 in EIR units, respectively). These four reductions were then combined with three possible change points: three, five and ten years before sampling. In total, there are 12 parameter combinations under study that, in theory, comprise the most interesting situations for using a serological approach in malaria epidemiology. The corresponding age-adjusted SP curves are shown in Fig. 1a–d. At this point the visualization of these curves is key to obtaining some qualitative expectation for the ensuing sample sizes. On the one hand, the reduction of one order of magnitude in EIR units does not show dramatic differences in the corresponding SP curves in relation to a situation of stable SCR (Fig. 1a, c), thus implying larger sample sizes for the corresponding detection. On the other hand, the reduction of two orders of magnitude in EIR units shows a clear biphasic behaviour in the SP curves (Fig. 1b, d); thus relatively small sample sizes may be required, especially when those reductions occur ten years before sampling.
The total number of simulated data sets per parameter combination was 1000 assuming an appropriate balance between the precision of power estimates and the total time to perform the simulation study.
Central to the assumptions of the simulation study is the age distribution of a given population. For that purpose, a typical age distribution of an African population was used (Fig. 1e), as described elsewhere [22]. To gain intuition on the relationship between the change point and the expected sample size, it was convenient to calculate the percentages of the following age groups: one to three, four to five, six to ten, >10 years old (Fig. 1f). These percentages imply that the frequency of individuals born after the reduction in transmission is 9.9, 16.6 and 31.9 % for the change points of three, five and ten years before sampling, respectively. This suggests that reductions in SCR occurring further in the past should be easier to detect than changes which occur closer to the time of sampling. Furthermore, since the frequency of the individuals born after the reduction increases with the change point, the precision of current SCR should also increase.
Before conducting any formal sample size calculation, the simulation results were first assessed for any potential bias of SCR and change point estimators. Although no sampling bias was introduced in the simulation of each data set, statistical theory predicts that the maximum likelihood method would only lead to unbiased estimates in settings of infinitely large samples [28]. The bias of a given estimator was estimated by the difference between the average of the estimates for a given parameter and the true parameter value that generated the data.
Approximate sample sizes were calculated by estimating power over a predefined set of sample sizes (e.g., 250, 500, 1000, 2500). The power was calculated under the assumption of a known and unknown change point. In some cases, simulation was extended to additional sample sizes (e.g., 100 or 5000) in order to increase the resolution of the underlying power curve. To approximate the power functions, separate logistic regression models were fit to the power estimates obtained from each one of the 12 parameter combinations; the package easynls for the R software was used for such purpose. In these models, the sample size was considered as a covariate. Better model fits were obtained using the sample size in log rather than in linear scale. The use of this transformed scale also ensured a null power for a sample size of 0, as predicted by statistical theory. The minimum sample size that would warrant a power \(\beta_{0}\) was determined by the smallest integer greater than the value provided by the following formula
\[ n_{i,\beta_{0}} = \exp\left\{ \frac{\log\left(\dfrac{\beta_{0}}{1 - \beta_{0}}\right) - \hat{a}_{i,\beta_{0}}}{\hat{b}_{i,\beta_{0}}} \right\} \]
where \( n_{{i,\beta_{0} }} \), \( \hat{a}_{{i,\beta_{0} }} \) and \( \hat{b}_{{i,\beta_{0} }} \) are the sample size, the intercept and the slope of the logistic regression estimated from simulated power associated with the ith parameter combination (i = 1,…, 12), respectively. Sample sizes were calculated for \( \beta_{0} \) = 0.80, 0.90 and 0.95.
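To make the calculation concrete, the sketch below inverts a logistic power curve fitted on the log sample size scale, i.e., it assumes logit(power) = a + b log(n). The coefficient values in the example are hypothetical and are not estimates from this study; the function names are likewise illustrative.

```python
import math

def logistic_power(n, a, b):
    """Approximate power at sample size n from a logistic curve on the log(n) scale."""
    return 1.0 / (1.0 + math.exp(-(a + b * math.log(n))))

def min_sample_size(a, b, target_power):
    """Smallest integer greater than the n solving logit(target_power) = a + b*log(n)."""
    logit = math.log(target_power / (1.0 - target_power))
    n_exact = math.exp((logit - a) / b)
    return math.floor(n_exact) + 1

# Hypothetical intercept and slope, for illustration only.
a_hat, b_hat = -12.0, 2.0
n_80 = min_sample_size(a_hat, b_hat, 0.80)
n_95 = min_sample_size(a_hat, b_hat, 0.95)
```

Because the curve is increasing in n whenever the slope is positive, the returned sample size is the first one at which the approximated power reaches the target.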
The final step of this study was to learn the estimation implications of the calculated sample size. In particular, it was of key interest to assess the bias and relative precision associated with the estimates for current SCR. Bias was calculated as described above while estimation of relative precision was conducted as for the situation of stable SCR [22]. In brief, relative precision associated with a given sample size was defined as the difference between the 2.5 and 97.5 % quantiles of the distribution of current-SCR estimates divided by the true parameter value that generated the corresponding data. This difference was calculated for the predefined set of sample sizes and then predicted for a specific one using the following linear regression model
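The bias and relative precision summaries defined above can be computed from a set of simulated estimates as in the following sketch. The quantile interpolation rule (`method="inclusive"` in Python's `statistics.quantiles`) is an assumption, since the text does not state which rule was used, and the function names are hypothetical.

```python
import statistics

def bias(estimates, true_value):
    """Average of the estimates minus the true value that generated the data."""
    return statistics.mean(estimates) - true_value

def relative_precision(estimates, true_value):
    """Width between the 2.5 % and 97.5 % quantiles, divided by the true value."""
    # With n=40 the cut points fall at 2.5 %, 5 %, ..., 97.5 %.
    cuts = statistics.quantiles(estimates, n=40, method="inclusive")
    return (cuts[-1] - cuts[0]) / true_value

# Toy example with made-up "estimates" 0, 1, ..., 40 and a true value of 19.
toy = list(range(41))
toy_bias = bias(toy, 19)
toy_precision = relative_precision(toy, 19)
```

A relative precision above 1.00 means that the central 95 % range of the estimates is wider than the true parameter value itself.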
where \( \hat{\gamma }_{0,i} \), \( \hat{\gamma }_{1,i} \), \( \hat{\gamma }_{2,i} \) and \( \hat{\gamma }_{3,i} \) are coefficients estimated from the corresponding simulated data associated with ith parameter combination. Further details on the use of this model for estimating precision can be found in a previous study [22].
All simulations and estimations were performed in the R software (version 3.2.1) using scripts written for the purpose. In the near future, these scripts together with others for the analysis of stable transmission models will be assembled in a convenient R package. For now they are available from the first author upon request and free to be adapted to different sampling scenarios. It is worth noting that, to speed up the simulation study, parallel computing was carried out manually by running the analysis of each parameter combination in a different node of a computer cluster.
Results
Estimation bias when the true change point is assumed to be known and unknown
The simulated results were first studied in terms of estimation bias in relation to the true parameter values that generated the data (Table 3). When the simulated data sets were analysed assuming a known change point, the resulting SCR estimates showed slight bias for sample sizes of 250 and 500 individuals. Unsurprisingly, the most extreme case was observed for a change point of ten years before sampling and a reduction in SCR from 0.0324 to 0.0108 (from 1 to 0.1 in EIR units, respectively). For sample sizes of 1000 and 2500 individuals, estimation bias was highly reduced and tended to be close to the nominal value of 0 %. Since the change point was considered known and there was no selection bias introduced by a community-based sampling scheme, the estimation bias must be derived from the maximum likelihood method itself. Therefore, SCR estimation based on this method might require a bias-correction adjustment specifically for small sample sizes.
When the simulated samples were analysed under the assumption of an unknown change point, the SCR estimates were highly biased for sample sizes of 250, 500 and 1000 individuals (Table 4). In particular, the estimates of past SCR tended to overestimate the true parameter value whereas the opposite happened for current SCR, where a negative bias was found for the corresponding estimates. Again, the most extreme estimation bias was observed for a change in transmission occurring ten years before sampling from 0.0324 to 0.0108 (1 to 0.1 in EIR units, respectively). As in the case of a known change point, some estimation bias might result from the application of the maximum likelihood method itself. However, the highest contribution to estimation bias in this situation would appear to derive from highly skewed distributions for the change point estimates (Fig. 2 and Additional file 2). This skewness implied a tendency to overestimate the true change point and, because of that, estimation of the historic SCR might only use limited sample information from individuals likely to be in the plateau of age-adjusted SP curves (Fig. 1a–d). In practice, when overestimation of the change point occurs, the simple model assuming a constant SCR was mostly preferred for the data. Finally, it is worth noting the wide confidence intervals for the true change point even for sample sizes of 2500 individuals (Fig. 2, Additional file 2). This result suggests that the antibody data taken as a binary outcome might not have sufficient information to estimate the true change point with high precision, thus demonstrating the necessity of finding alternative approaches for that specific purpose. As an exceptional case, the situation related to a reduction from 0.0969 to 0.0108 (from 10 to 0.1 in EIR units) using a sample size of 2500 individuals implied at least a 60 % chance of generating a data set that would lead to the correct change point estimate and a relatively small confidence interval.
Sample size determination
Sample size calculations were then performed using logistic curves fit to the simulated power (Fig. 3, Additional file 3). The sample size decreased with the true change point for a given value of power and reduction in disease transmission (Table 5). This implied that the detection of a short-term reduction requires larger sample sizes compared to settings where the same reduction occurred further in the past. The exception would appear to be the analysis assuming a known change point and describing a reduction in SCR from 0.0969 to 0.0324 (Fig. 3a). In this case, the simulation results suggested that a reduction in SCR occurring five years before sampling was easier to detect than the same reduction occurring ten years prior to sampling. However, the corresponding power functions were almost indistinguishable from each other and thus these variations in the results might solely be attributed to the randomness associated with a simulation study. Notwithstanding these variations, it is clear that each parameter combination required a different set of sample sizes. On the one extreme, the lowest sample sizes were obtained for a reduction in SCR from 0.0969 to 0.0108 (from 10 to 0.1 in EIR units, respectively; Fig. 3b). This was unsurprising and agreed with the visual inspection of the SP curves shown in Fig. 1. In this case, a sample size of 485 individuals considering the change point known was enough to generate a power of at least 95 % to detect a reduction occurring between three and ten years before sampling. On the other extreme, the reduction in SCR from 0.0324 to 0.0108 (from 1 to 0.1 in EIR units, respectively; Fig. 3c) required the largest set of sample sizes irrespective of whether the change point was considered known. The most extreme case was the sample size of 5675 individuals to detect a reduction occurring ten years before sampling with 95 % power under the assumption of an unknown change point.
Estimation bias and precision associated with a given sample size calculation
Statistically speaking, the final decision to use a given sample size must be taken not only on the basis of the underlying power, but also on the corresponding impact on parameter estimation. The above results suggested a negligible estimation bias for reasonably large sample sizes under the assumption of a known change point. In contrast, unbiased estimates might only be obtained for large sample sizes when the change point is considered unknown. Ideally, one wishes to have enough power to detect a reduction in SCR and high estimation precision.
For the analysis assuming a known change point, all determined sample sizes led to estimates for current SCR with up to 5 % bias (Table 6). The only exception was the setting of a reduction from 0.0969 to 0.0324 occurring three years before sampling where a 13 % bias was found for a power of 80 %. In this case, estimation bias can be avoided by increasing the power to 90 or 95 %. Although estimation bias was negligible for all determined sample sizes, the corresponding relative precision was in most cases greater than 1.00 in relation to the true value for current SCR. The only two settings where relative precision was less than 1.00 were related to reductions in EIR units of one order of magnitude for a change point of ten years before sampling. Therefore, when the change point is assumed to be known, sample size calculation based only on power, although avoiding estimation bias, would result in studies with limited estimation precision for current SCR.
For the analysis assuming an unknown change point, the sample size that would balance power, bias, and precision was not easily determined (Table 6). Most of the determined sample sizes led to underestimation with bias greater than 10 % in absolute terms, especially when it was easy to detect a reduction in transmission (i.e., reduction in SCR of two orders of magnitude measured in EIR units). Having biased estimates would make any subsequent precision-based analysis elusive. Notwithstanding this problem, the corresponding results suggested poor estimation precision (>1.00) for the calculated sample sizes. Therefore, the uncertainty associated with an unknown change point brings problems in terms of balancing power with estimation bias.
Discussion
This paper describes a pragmatic approach to calculate the minimum sample size for detecting a reduction in SCR with a given power. The approach was applied to different epidemiological settings but with a special focus on lower endemicity settings. The analysis was based on antibody responses to P. falciparum MSP1 antigens. A discussion about using antibody data from alternative antigens can be found elsewhere [22].
Sample size calculations were performed assuming demographics for African populations and using community-based surveys. The demographic distribution is different elsewhere and has been shown to impact SCR estimation under stable SCR [22]. That study concluded that, under an unknown SRR, non-African studies using community sampling might require larger sample sizes than their African counterparts in order to obtain the same estimation precision. With respect to the present case of a reduction in SCR, one expects a higher precision of past-SCR estimates in non-African studies for a given sample size, owing to an increased frequency of older individuals that experienced both past and current malaria transmission intensity. In contrast, precision of current SCR would be increased in African studies due to a higher percentage of young individuals who only experienced current disease transmission intensity. Although expected, these estimation implications need to be further investigated. Alternatively, some statistical improvement might be achieved by sampling specific age groups. Malaria transmission due to P. falciparum is typically much lower outside Africa [29], indicating larger sample sizes for detecting putative reductions in SCR and the need for alternative sampling approaches. This is particularly important when transmission risk is behavioural and associated with older ages [12, 15].
Besides describing a framework for sample size calculation, this paper also has important implications in terms of estimation bias and precision, especially when the true change point is assumed to be unknown. In community surveys, the expected 95 % confidence intervals for the true change point tend to be wide, suggesting a high uncertainty in estimating such a parameter. The respective point estimates tend to overestimate the true change point, locating it further in the past than in reality. In agreement with this result is the serological study from Bioko Island in Equatorial Guinea [11] where a cross-sectional survey was conducted four years after the initiation of a comprehensive malaria control programme on the island but the estimated change points suggested a reduction in transmission further in the past (Table 1). Similarly, a serological study from Vanuatu estimated a change point occurring 30 years before sampling that appeared to overestimate by 13 years a putative change point due to a known insecticide-treated net distribution across the islands (Table 1) [18]. Biologically speaking, the most likely explanation for obtaining overestimates of the change point is the putative difference in antibody-decay rates between younger children and older individuals. More precisely, the former would have higher loss rates than the latter, who have more established antibody responses [30, 31]. In practice, it is unlikely that there will be enough information to distinguish between statistical and biological bias, hence the necessity of applying different bias-reduction strategies to data collection and analysis.
To minimize the estimation bias and decrease uncertainty of the estimates, five possible solutions can be envisioned. The first one consists of fixing the change point at an expected value. In that case, the SCR estimates were found to be approximately unbiased for relatively small sample sizes (e.g., 250 or 500 individuals depending on the size of the underlying reduction in SCR). Assuming a given change point would appear to be a reasonable data analysis strategy for post-intervention studies where the start of the intervention is known, such as the above-mentioned study from Bioko where an intensive malaria control programme was launched in 2004 [11]. However, fixing a change point might not be so easily applicable to exploratory (or preliminary) studies. As an example, a recent study from the Brazilian Amazonia region reported a strong reduction in SCR for P. falciparum antigens occurring 30 years before sampling [7]. In this specific example, different malaria control programmes have been operating in the area since the 1980s [32] but are likely to have been scaled up over time, making a known change point difficult to assess. Health system records of changes in malaria case numbers could additionally provide indicators of potential change points for a given study. The second solution for reducing bias is to use alternative estimators, such as the jackknife [33] or the bootstrap estimator [34], which are particularly tailored to solve this statistical problem. However, these estimators are in general computationally intensive due to the use of leave-one-out or resampling techniques. In this scenario, the application of these estimators would appear to be feasible in small samples where estimates might be more affected by bias. For large samples, estimation bias is reduced and, therefore, the decision to use such estimators should be weighed against the real implications of obtaining more reliable estimates.
A third route for bias reduction is to choose a sampling strategy where the chance of detecting the true change point is increased. One might define three age groups according to the kind of epidemiological information each one provides. The first one refers to individuals born after the change point and with age “far” from it (e.g., young children up to three years old for a change point of five years before sampling). This age group is essential to estimate current SCR since the corresponding target population would not have experienced any change in disease transmission. The second group consists of children or adolescents with age in the vicinity of the putative change point (e.g., children aged between four and seven years for a change point of five years before sampling), and thus carries the most information about that parameter. The age range should be defined so as to sample approximately the same number of individuals from each of the two exposure periods. Sampling in this way should jointly increase the power to detect a change in SCR and the accuracy of the corresponding change-point estimates. The third group targets older individuals because of the information they might carry about historical disease exposure. Since this group refers to older individuals, it also embodies important information on seroreversion rate. Given all of these possible sampling options, future research is critical to determine the optimal sampling strategy for controlling power, precision and bias altogether.
A fourth solution is to jointly analyse data from different study sites. The theoretical expectation is that, under the assumption of a shared change point, more accurate information can be borrowed from sites where such a parameter is more easily estimated. This solution was followed in the above-mentioned Brazilian study, where there is evidence for a common change point for P. falciparum malaria that could not be easily detected when the data of each study site were analysed separately, owing to the uncertainty of the change point estimates (Table 1) [7]. A fifth and last solution is to analyse antibody concentration data using appropriate antibody density models, as reviewed elsewhere [13]. The use of a quantitative outcome is expected to be more informative about the underlying phenomenon than its binary-derived counterpart, as suggested by several genetic studies aiming to estimate the location of quantitative trait loci [35, 36]. A similar line of evidence was observed in a seroepidemiological study from Nigeria where the antibody values of the sampled individuals declined over an intervention period but the corresponding age-adjusted seroprevalence curves remained unaltered [37, 38]. However, this solution remains to be tested in real-world data.
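The age-group reasoning behind the sampling strategy can be made concrete with the expected seroprevalence curve of a reverse catalytic model in which SCR changes once, at a known time before sampling. The sketch below is a minimal Python illustration under that assumption, with a constant SRR; the function name, argument names and parameter values are hypothetical and chosen only for illustration.

```python
import math


def seroprevalence(age, scr_past, scr_current, srr, change_point):
    """Expected seroprevalence at a given age when SCR changed `change_point` years ago."""

    def equilibrium(scr):
        # Long-run seroprevalence under a constant SCR.
        return scr / (scr + srr)

    if age <= change_point:
        # Born after the change: exposed to the current SCR only.
        return equilibrium(scr_current) * (1.0 - math.exp(-(scr_current + srr) * age))
    # Born before the change: past SCR first, then current SCR for `change_point` years.
    years_before = age - change_point
    p_at_change = equilibrium(scr_past) * (1.0 - math.exp(-(scr_past + srr) * years_before))
    decay = math.exp(-(scr_current + srr) * change_point)
    return p_at_change * decay + equilibrium(scr_current) * (1.0 - decay)


# Hypothetical values: SCR drops from 0.1 to 0.01 per year five years before sampling.
params = dict(scr_past=0.1, scr_current=0.01, srr=0.01, change_point=5)
for age in (2, 5, 7, 30):
    with_change = seroprevalence(age, **params)
    stable = seroprevalence(age, 0.01, 0.01, 0.01, 5)
    print(f"age {age:2d}: change model {with_change:.3f}, stable model {stable:.3f}")
```

Under these assumptions the two curves coincide for individuals no older than the change point, which is why the youngest group informs current SCR only, whereas the curves separate for ages just above the change point and remain apart in older individuals.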
In general, controlling power via sample size is an ideal strategy to increase the chance of drawing the right conclusion if different explanations exist for the same data. Here, the power was calculated using the stable SCR assumption as the only alternative explanation for the data. However, the effect of an age-dependent risk factor might be yet another competing explanation for the occurrence of biphasic SP curves. This is the case of an Indonesian population where adults working in the forest were more exposed to malaria vectors than younger workers [15]. In theory, the intervention-based and risk-factor models are mathematically distinguishable but very closely related [13]. Therefore, the sample sizes calculated here would also hold for the alternative setting of detecting changes in SCR due to age-dependent risk factors, although this requires further investigation. Alternatively, the power to distinguish these models might require a sample so large that it brings several practical and theoretical challenges, as discussed in detail elsewhere [22]. Current models may be inappropriate in these settings, and approaches that use antibody levels or different antigenic targets with shorter SRR would ultimately be more useful.
The calculated sample sizes suggest potentially opposing conclusions for study design. On the one hand, small sample sizes might be sufficient to detect strong reductions in SCR with high power, but lead to relatively poor estimation precision of current SCR. In this scenario, it is recommended to perform sample size calculations focusing on estimation precision rather than on power. Assuming that the true change point is known improves the precision of the estimates, which might be further improved by fixing the SRR at a reasonable value [22]. On the other hand, subtle reductions in SCR might only be detected by means of large sample sizes. The use of a large sample size brings theoretical and operational challenges but inevitably leads to improved estimation precision and reduced estimation bias. Controlling precision and bias is particularly important in situations where there is no information on the timing of a change in transmission.
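To give a feel for what a precision-driven calculation might look like, the Python sketch below uses a textbook large-sample (Wald) approximation based on Fisher information. It is not the calculator of reference [22] nor the simulation approach of this paper. It assumes, for illustration only, that the SRR is fixed, that the change point is known, and that only individuals born after the change point are considered, so that their seroprevalence follows the stable reverse catalytic curve with the current SCR. All function names and parameter values are hypothetical.

```python
import math


def seroprev_stable(age, scr, srr):
    """Reverse catalytic seroprevalence at a given age under a constant SCR."""
    total = scr + srr
    return scr / total * (1.0 - math.exp(-total * age))


def dseroprev_dscr(age, scr, srr):
    """Analytical derivative of seroprev_stable with respect to scr."""
    total = scr + srr
    decay = math.exp(-total * age)
    return srr / total ** 2 * (1.0 - decay) + scr / total * age * decay


def mean_information(scr, srr, ages):
    """Average Fisher information about scr carried by one sampled individual."""
    total = 0.0
    for age in ages:
        p = seroprev_stable(age, scr, srr)
        total += dseroprev_dscr(age, scr, srr) ** 2 / (p * (1.0 - p))
    return total / len(ages)


def sample_size_for_precision(scr, srr, ages, rel_half_width, z=1.96):
    """Smallest n whose approximate Wald half-width is at most rel_half_width * scr."""
    target_se = rel_half_width * scr / z
    return math.ceil(1.0 / (target_se ** 2 * mean_information(scr, srr, ages)))


# Hypothetical design: ages 1 to 5 sampled evenly, all born after a known change point.
ages = list(range(1, 6))
n_wide = sample_size_for_precision(scr=0.01, srr=0.01, ages=ages, rel_half_width=0.5)
n_tight = sample_size_for_precision(scr=0.01, srr=0.01, ages=ages, rel_half_width=0.25)
print(n_wide, n_tight)
```

Halving the target relative half-width roughly quadruples the required number of individuals in this approximation, which illustrates why precision requirements can dominate the sample size when the reduction in SCR itself is easy to detect.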
Conclusion
In summary, designing a study that aims to detect a reduction in transmission using SCR requires balancing the choice of sampling strategy against the sample size needed to guarantee a given power and estimation precision. Ultimately, the choice of sample size should be made on the basis not only of statistical arguments, as discussed here, but also of possible sampling constraints that might influence data collection, such as ethics, available human and economic resources and/or time constraints. Simply increasing the number of individuals sampled in the age groups around any perceived change point may be the most pragmatic solution. As malaria transmission decreases and multiple malariometrics are required to determine the effect of control programmes, optimizing sample size is crucial to avoid wasting valuable resources. Using optimal study designs is particularly important for countries on the brink of malaria elimination or eradication, such as those on the island of Hispaniola [39] or Sri Lanka [40]. This and other related issues will be investigated in a future study.
Abbreviations
AIC: Akaike’s information criterion
EIR: entomological inoculation rate
SP: seroprevalence
SCR: seroconversion rate
SRR: seroreversion rate
AMA1: apical membrane antigen 1
MSP1: merozoite surface protein 1
PR: parasite rate
References
1. Stresman G, Kobayashi T, Kamanga A, Thuma PE, Mharakurwa S, et al. Malaria research challenges in low prevalence settings. Malar J. 2012;11:353.
2. Harris I, Sharrock WW, Bain LM, Gray KA, Bobogare A, et al. A large proportion of asymptomatic Plasmodium infections with low and sub-microscopic parasite densities in the low transmission setting of Temotu Province, Solomon Islands: challenges for malaria diagnostics in an elimination setting. Malar J. 2010;9:254.
3. Mosha JF, Sturrock HJW, Greenhouse B, Greenwood B, Sutherland CJ, et al. Epidemiology of subpatent Plasmodium falciparum infection: implications for detection of hotspots with imperfect diagnostics. Malar J. 2013;12:221.
4. Vallejo AF, Chaparro PE, Benavides Y, Álvarez A, Quintero JP, et al. High prevalence of sub-microscopic infections in Colombia. Malar J. 2015;14:201.
5. Corran P, Coleman P, Riley E, Drakeley C. Serology: a robust indicator of malaria transmission intensity? Trends Parasitol. 2007;23:575–82.
6. Bousema T, Youssef RM, Cook J, Cox J, Alegana VA, et al. Serologic markers for detecting malaria in areas of low endemicity, Somalia, 2008. Emerg Infect Dis. 2010;16:392–9.
7. Cunha MG, Silva ES, Sepúlveda N, Costa SPT, Saboia TC, et al. Serologically defined variations in malaria endemicity in Pará state, Brazil. PLoS One. 2014;9:e113357.
8. Drakeley CJ, Corran PH, Coleman PG, Tongren JE, McDonald SLR, et al. Estimating medium- and long-term trends in malaria transmission by using serological markers of malaria exposure. Proc Natl Acad Sci USA. 2005;102:5108–13.
9. Bosomprah S. A mathematical model of seropositivity to malaria antigen, allowing seropositivity to be prolonged by exposure. Malar J. 2014;13:12.
10. van den Hoogen LL, Griffin JT, Cook J, Sepúlveda N, Corran P, et al. Serology describes a profile of declining malaria transmission in Farafenni, The Gambia. Malar J. 2015;14:416.
11. Cook J, Kleinschmidt I, Schwabe C, Nseng G, Bousema T, et al. Serological markers suggest heterogeneity of effectiveness of malaria control interventions on Bioko Island, Equatorial Guinea. PLoS One. 2011;6:e25137.
12. Cook J, Speybroeck N, Sochanta T, Somony H, Sokny M, et al. Sero-epidemiological evaluation of changes in Plasmodium falciparum and Plasmodium vivax transmission patterns over the rainy season in Cambodia. Malar J. 2012;11:86.
13. Sepúlveda N, Stresman G, White MT, Drakeley CJ. Current mathematical models for analyzing anti-malarial antibody data with an eye to malaria elimination and eradication. J Immunol Res. 2015;2015:738030.
14. Hens N, Aerts M, Faes C, Shkedy Z, Lejeune O, et al. Seventy-five years of estimating the force of infection from current status data. Epidemiol Infect. 2010;138:802–12.
15. Supargiyono S, Bretscher MT, Wijayanti MA, Sutanto I, Nugraheni D, et al. Seasonal changes in the antibody responses against Plasmodium falciparum merozoite surface antigens in areas of differing malaria endemicity in Indonesia. Malar J. 2013;12:444.
16. Marques AC. Human migration and the spread of malaria in Brazil. Parasitol Today. 1987;3:166–70.
17. Bowman NM, Kawai V, Levy MZ, del Carpio JGC, Cabrera L, et al. Chagas disease transmission in periurban communities of Arequipa, Peru. Clin Infect Dis. 2008;46:1822–8.
18. Cook J, Reid H, Iavro J, Kuwahata M, Taleo G, et al. Using serological measures to monitor changes in malaria transmission in Vanuatu. Malar J. 2010;9:169.
19. Delgado S, Neyra RC, Machaca VRQ, Juarez JA, Chu LC, et al. A history of Chagas disease transmission, control, and re-emergence in peri-rural La Joya, Peru. PLoS Negl Trop Dis. 2011;5:e970.
20. Martin DL, Bid R, Sandi F, Goodhew EB, Massae PA, et al. Serology for trachoma surveillance after cessation of mass drug administration. PLoS Negl Trop Dis. 2015;9:e0003555.
21. Wong J, Hamel MJ, Drakeley CJ, Kariuki S, Shi YP, et al. Serological markers for monitoring historical changes in malaria transmission intensity in a highly endemic region of Western Kenya, 1994–2009. Malar J. 2014;13:451.
22. Sepúlveda N, Drakeley C. Sample size determination for estimating antibody seroconversion rate under stable malaria transmission intensity. Malar J. 2015;14:141.
23. Enevold A, Alifrangis M, Sanchez JJ, Carneiro I, Roper C, et al. Associations between α+-thalassemia and Plasmodium falciparum malarial infection in northeastern Tanzania. J Infect Dis. 2007;196:451–9.
24. Sepúlveda N, Manjurano A, Drakeley C, Clark TG. On the performance of multiple imputation based on chained equations in tackling missing data of the African α3.7-globin deletion in a malaria association study. Ann Hum Genet. 2014;78:277–89.
25. Williams BG, Dye C. Maximum likelihood for parasitologists. Parasitol Today. 1994;10:489–93.
26. Sepúlveda N, Paulino CD, Carneiro J. Estimation of T-cell repertoire diversity and clonal size distribution by Poisson abundance models. J Immunol Methods. 2010;353:124–37.
27. Burnham KP, Anderson DR. Multimodel inference: understanding AIC and BIC in model selection. Sociol Methods Res. 2004;33:261–304.
28. Casella G, Berger RL. Statistical inference. 2nd ed. Pacific Grove: Duxbury; 2002.
29. Gething PW, Patil AP, Smith DL, Guerra CA, Elyazar IRF, et al. A new world malaria map: Plasmodium falciparum endemicity in 2010. Malar J. 2011;10:378.
30. Kinyanjui SM, Conway DJ, Lanar DE, Marsh K. IgG antibody responses to Plasmodium falciparum merozoite antigens in Kenyan children have a short half-life. Malar J. 2007;6:82.
31. Akpogheneta OJ, Duah NO, Tetteh KKA, Dunyo S, Lanar DE, et al. Duration of naturally acquired antibody responses to blood-stage Plasmodium falciparum is age dependent and antigen specific. Infect Immun. 2008;76:1748–55.
32. Oliveira-Ferreira J, Lacerda MVG, Brasil P, Ladislau JLB, Tauil PL, et al. Malaria in Brazil: an overview. Malar J. 2010;9:115.
33. Quenouille MH. Notes on bias in estimation. Biometrika. 1956;43:353–60.
34. Efron B, Tibshirani RJ. An introduction to the bootstrap. 1st ed. New York: Chapman & Hall; 1993.
35. Broman KW. Mapping quantitative trait loci in the case of a spike in the phenotype distribution. Genetics. 2003;163:1169–75.
36. Xu S, Atchley WR. Mapping quantitative trait loci for complex binary diseases using line crosses. Genetics. 1996;143:1417–24.
37. Brögger RC, Mathews HM, Storey J, Ashkar TS, Brögger S, et al. Changing patterns in the humoral immune response to malaria before, during, and after the application of control measures: a longitudinal study in the West African savanna. Bull World Health Organ. 1978;56:579–600.
38. Molineaux L, Gramiccia G. The Garki Project. Research on the epidemiology and control of malaria in the Sudan savanna of West Africa. World Health Organization; 1980. p. 311.
39. Herrera S, Ochoa-Orozco SA, González IJ, Peinado L, Quinones ML, et al. Prospects for malaria elimination in Mesoamerica and Hispaniola. PLoS Negl Trop Dis. 2015;9:e0003700.
40. Karunaweera ND, Galappaththy GN, Wirth DF. On the road to eliminate malaria in Sri Lanka: lessons from history, challenges, gaps in knowledge and research needs. Malar J. 2014;13:59.
Authors’ contributions
NS developed the sample size calculators and wrote the manuscript. CDP provided key insights on the statistical aspects of the study. CD designed the project and discussed the epidemiological implications of this work. All authors read and approved the manuscript.
Acknowledgements
NS and CD acknowledge funding from the Wellcome Trust (Grant number 091924). NS and CDP were partially supported by Fundação para a Ciência e Tecnologia (Portugal) through the project PestOE/MAT/UI0006/2011. The authors would like to thank Jackie Cook for proofreading the paper.
Competing interests
The authors declare that they have no competing interests.
Additional files
12936_2015_1050_MOESM1_ESM.pdf
12936_2015_1050_MOESM2_ESM.pdf
12936_2015_1050_MOESM3_ESM.pdf
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Sepúlveda, N., Paulino, C.D. & Drakeley, C. Sample size and power calculations for detecting changes in malaria transmission using antibody seroconversion rate. Malar J 14, 529 (2015). doi:10.1186/s12936-015-1050-3
Keywords
 Intervention
 Malaria transmission
 Bias
 Precision
 Sample size