Introduction
Anthropometric measurements are crucial for nutritional and chronic disease epidemiology. However, in large-scale population-based studies direct anthropometry is not logistically feasible, and researchers must rely on self-reported anthropometry. Correlations between direct and self-reported height and weight are usually high (over 0.9),1,2,3 however systematic errors of self-reported anthropometry may result in an underestimation of the prevalence of obesity,1,2,3,4,5 residual confounding after adjusting for an imperfect measure,6 and depending on their predictors, bias on the causal associations between anthropometric indicators and disease incidence.7 The magnitude of such errors and their associated factors differ across populations and settings.4 In addition, the validity of waist and hip circumferences have seldom been evaluated, resulting in lower correlations between self-reported and direct measurements (ranging from 0.62 to 0.89).2,5,8,9
The Mexican Teachers’ Cohort (MTC), also known as ESMaestras (Spanish acronym), is an ongoing prospective study aimed at evaluating the relationship of dietary and lifestyle factors and chronic diseases. As the largest cohort study in Latin-America and among Hispanics, the MTC offers a unique sample from which to assess the validity of self-reported anthropometry among Mexican women due to its large sample size, wide age range (25 to 84 years at enrollment) and the economically and culturally diverse origin of participants.
Given the multiple and central causal role of adiposity on the development of many chronic diseases, understanding the validity of self-reported anthropometry and the main factors associated with its measurement error is critical for a comprehensive evaluation of the relationship of risk factors and chronic disease, not only within the MTC but also in other observational studies in similar populations that rely on self-reported anthropometry. Therefore, our primary objective was to compare self-reported anthropometric data with direct measures performed by standardized personnel in a subsample of 3 756 female participants of the MTC.
Materials and methods
Study population and analytical sample
We included participants from the MTC in our cross-sectional study. The MTC is a cohort study of 115 315 female Mexican teachers 25 years and older from twelve economically and culturally diverse states in Mexico. Participant selection, distribution of instruments and data collection procedures for the MTC has been previously described.10 Briefly, between 2006 and 2008, eligible female teachers were identified through an administrative database. A baseline self-administered questionnaire was distributed to assess socio-demographic and reproductive characteristics, diet, lifestyle, risk factors for chronic diseases and medical conditions. Between 2007 and 2010, 3 913 study participants from four study sites (Hidalgo, Jalisco, Mexico City and Veracruz) underwent a clinical evaluation. These women were a random sample of cohort participants living in a 30 km radius from clinical sites stratified by menopausal status.
All women with a baseline questionnaire who participated in a clinical evaluation were eligible for this analysis. We excluded participants for whom all measured (n=80), or self-reported (n=70) anthropometry or both (n=7) were unavailable. The final analytical sample included 3 756 female teachers, who had direct and self-reported anthropometric data on either weight and height (n=3 413) or waist and hip circumferences (n=3 258). The median time from self-reported to direct anthropometric measures was 11 months (inter-quartile range 7-15).
This study was approved by the Research, Biosafety and Ethics Committees at the National Institute of Public Health, Mexico. All participants gave written informed consent.
Anthropometric measures
Self-report
Participants reported their weight (kg), height (cm), waist circumference (cm), and hip circumference (cm) on the baseline questionnaire. They received a plastic measuring tape (commercially available) and instructions to measure their own waist and hip circumferences.
Direct measures
Anthropometry was performed by experienced personnel, previously standardized following Lohman’s guidelines.11 Participants were barefoot and wearing light clothing. Weight was measured to the nearest 0.1 kg with a digital scale (Tanita Corp., Japan), and height to the nearest 0.1 cm with a stadiometer (Seca Corp., Hanover, MD). Waist and hip circumferences were measured standing, to the nearest 0.1 cm, using a plastic measuring tape (commercially available): waist at the midway between the lowest rib and the iliac crest, and hip at the maximum circumference of the buttocks.
Statistical analysis
Body mass index (BMI) was calculated for direct and self-reported measures, and participants were categorized as normal weight (<24.9 kg/m2), overweight (25.0 to 29.9 kg/m2) and obesity (≥30.0 kg/m2).12,13 The linear relationship between direct and self-reported anthropometric measures was assessed by Pearson correlation coefficients and their 95%CI and graphically explored in scatterplots where regression lines and 95% prediction limits were added. To explore non-linear relations between direct and self-reported anthropometry, regression specification error (reset) tests were performed, the gain in R2 from including higher-order terms to the linear models was calculated and locally weighted regression (lowess) curves were fitted.
Errors in self-reported anthropometry were estimated by calculating mean differences of direct minus self-reported measures and their 95%CI. Positive values in that difference indicate underreporting, and negative values, over-reporting. Errors in self-reported anthropometry were further explored using Bland-Altman plots.14 Predicted mean differences and 95% limits of agreement as well as lowess curves were added to the plots. To explore whether the magnitude of the errors in self-reported anthropometry depend on the actual measurement, the null hypothesis that the slope of regression line in the Bland-Altman plot is zero was tested. To identify factors associated with errors in self-reported anthropometry, mean differences of direct minus self-reported measures were adjusted by: age (30 to 39, 40 to 44, 45 to 49 and 50 to 72 years); marital status (single, married or with a partner, divorced or separated, widowed and unknown); indigenous language spoken (yes, no, unknown); rurality (teaching at a rural/urban school); education (less than university degree, bachelor’s degree, graduate and unknown); number of durable assets at home: phone, car, computer, vacuum cleaner, microwave oven, mobile and internet access, categorized as 0 to 4, 5 (median), 6 to 7 assets and unknown; study site; time between direct and self-reported measures; and measured BMI categories. Multiple linear regression models with dummy variables for the unknown categories were used to estimate the multivariate-adjusted mean differences. To calculate Ptrend in ordinal variables with the unknown category, values of that category were imputed to their respective marginal median.
Misclassification that resulted from using self-reported data was explored by comparing the prevalence of overweight or obesity, obesity and abdominal obesity (waist circumference≥88 cm) as derived from direct and self-reported measures.12,13,15 Also, using direct measures as the gold standard, sensitivity and specificity of self-reported anthropometry and their 95%CI were calculated.
In a sensitivity analysis, in order to attenuate the effect of extreme values, all analyses were repeated in a subsample defined as the 90% central of the distribution of the errors of self-reported anthropometry for each anthropometric measure. To explore the potential misspecification of regression models including variables with the unknown category, complete-case analysis was conducted.
All analyses were performed with the statistical software Stata/SE 13.1.* The level of significance was 5%, two-tailed.
Results
Socio-demographic characteristics of participants are shown in table I. The median (interquartile range) age at baseline questionnaire was 45 (41, 50) years. Most participants were married (65.9%). On average, 9.4% spoke an indigenous language and had a university education. With the exception of median age (45 vs. 44) and the percentage of teachers who taught in a rural school (14.7 vs. 25%), socio-demographic characteristics of participants included in this analysis had a similar distribution to that observed for all MTC participants.
a Complete information (self-reported and direct measures) in weight and height or waist and hip circumferences
Table II includes the correlations and mean differences between direct and self-reported measures. Pearson correlation coefficients between direct and self-reported measures were above 0.80 for all anthropometric measures, excepting waist circumference (r=0.78). The linear fit of the relationship between direct and self-reported measures showed slopes ranging from 0.79 to 1.00 and R2 from 61 to 87% for waist circumference and weight, respectively (figure 1). RESET tests indicated that for all anthropometric measures the linear approximation was not the correctly specified. However, only a marginal gain (ranging from 0.0 to 2.2 percent points) in R2 from including quadratic or cubic terms in the linear model was found for all anthropometric measures. Accordingly, in lowess curves, mild deviations from linearity (larger errors of self-report) were observed in the lowest values of waist and hip circumferences and in both extremes of the distribution of height ( figure 1 ).
* Difference was calculated as direct minus self-reported measures. Positive values indicate underreporting and negative values indicate over-reporting
‡ Due to rounding error, some mean differences do not match with the difference of the means shown in the table
Mean differences (95%CI) of direct minus self-reported measures were 1.3 (1.2, 1.5) kg for weight; -2.2 (-2.3, -2.1) cm for height; 1.3 (1.3, 1.4) kg/m2 for BMI; 1.8 (1.5, 2.0) cm for waist circumference; and 1.9 (1.7, 2.1) cm for hip circumference. Bland-Altman analysis further supported those results and showed that slopes of regression lines were close to zero for all anthropometric measures, except BMI (β1=0.12), suggesting that proportional bias would be an important issue only for BMI. Lowess curves in general confirmed the results from linear fit (figure 1).
In table III multivariable-adjusted mean differences of direct minus self-reported measures according to participant’s characteristics are shown. Age was directly associated with over-reporting of height and inversely associated with underreporting of weight, BMI and hip circumference (all Ptrend≤0.01). Mean differences of direct minus self-reported anthropometric measures were smaller in indigenous language speakers than in non-speakers; however differences were statistically significant only for waist circumference. Over-reporting of height was larger in women teaching at a rural school than in those teaching at an urban school. Education and number of durable assets were directly associated with underreporting of weight (Ptrend<0.01) and waist circumference (Ptrend<0.05). Women with increasing number of durable assets showed larger underreporting of hip circumference (Ptrend=0.02). Time between direct and self-reported measures was inversely associated with over-reporting of height (Ptrend<0.001) and BMI (Ptrend=0.02), and directly but not statistically significant with underreporting of waist circumference (Ptrend=0.11). Women who were overweight and obese, according to measured BMI, showed larger differences between direct and self-reported measures (all Ptrend<0.001, except hip circumference, Ptrend=0.02).
* Adjusted means are from linear regression models of the differences between direct and self-reported measures on all the characteristics shown. Difference was calculated as direct minus self-reported measures. Positive values indicate underreporting and negative values indicate over-reporting
‡ P<0.05 for the difference vs. the first category
§ P<0.01 for the difference vs. the first category
# P<0.001 for the difference vs. the first category
AM, adjusted mean. Circ., circumference. SE, standard error of the adjusted mean
Prevalence of overweight and obesity estimated with direct and self-reported measures, as well as sensitivity and specificity of self-reported anthropometry, are shown in table IV. Compared to the prevalence of obesity calculated by direct measures, the prevalence of obesity with self-reported data was nine percentage points lower (33.2 vs. 24.1%). Sensitivity and specificity of self-reported obesity and abdominal obesity were 66.5 and 97.0%, and 77.4 and 81.2%, respectively.
* Categories were defined with the same cut-off points for direct and self-reported measures
‡ Waist circumference>88cm
In sensitivity analyses, although magnitude of differences and statistical significance were attenuated, findings were similar to the main analysis. Noteworthy, in complete-case analysis, the inverse relationship between age and underreporting of BMI turned non-significant.
Discussion
This study showed a strong linear relationship between measured and self-reported anthropometry in female participants of the MTC, which implies that women are ranked similarly with either measure. However, systematic errors in self-reported anthropometry were identified, namely over-reporting of height and underreporting of weight, BMI, and waist and hip circumferences. Several characteristics of the participants such as age, education and socioeconomic status were associated with those errors -the main predictor being the measured BMI, as over-reporting of height and underreporting of the remaining anthropometric measures increased with rising BMI. Systematic errors in self-reported anthropometry resulted also in lower estimates of the prevalence of overweight and obesity, compared to estimates derived from direct measures.
Correlations between direct and self-reported measures were higher for weight, height and BMI (0.93, 0.84 and 0.89, respectively) than for waist and hip circumferences (0.78 and 0.83, respectively). This was similar to other studies which showed correlations ranging from 0.95 to 0.97 for weight, 0.91 to 0.98 for height, 0.92 to 0.97 for BMI, 0.74 to 0.89 for waist circumference and 0.62 to 0.86 for hip circumference.1,2,3,5,8,9 Although we provided participants with a measuring tape and precise instructions for measuring their waist and hip circumferences, difficulties for the participants or lack of compliance by measuring those circumferences might be an explanation for these results.
As found by others, women in this study showed a trend to over-report their height, but to underreport the rest of the anthropometric measures.1,2,3,4,5,8,9,16,17,18 The mean differences of direct minus self-reported measures we found (1.3 kg for weight, -2.2 cm for height and 1.3 kg/m2 for BMI) were within the range for women in most studies included in a systematic review published in 2007: 0.1 to 6.5 kg for weight, -0.1 to -5.0 cm for height, and 0.0 to 2.2 kg/m2 for BMI.4 Our results were also similar to those published more recently, between 2009 and 2014, in Hispanic women. In Hispanic menopausal women, mean differences of direct minus self-reported measures were 1.5 kg for weight, -2.5 cm for height and 1.4 kg/m2 for BMI.19 While the errors in self-reported weight we found were similar than those for Hispanic women living in the United States20 and for Colombian women,21 errors in self-reported height we found were higher than those reported in the aforementioned studies (0.6 to 1.2 cm). In our study, the possibility that the errors in self-reported anthropometry correspond partially to real changes in weight over time cannot be disregarded. In a population-based cohort study, weight gains ranging from 0.96 kg/year in African-American women to 0.55 kg/year in White women were reported.22 However, based on our results of the non-significant trend of over-reporting across the categories of time between self-reported and direct measures, it is unlikely that weight gain between measures explain the systematic differences we found.
Underreporting of weight, BMI, and hip circumference decreased with increasing age, while over-reporting of height increased. These findings are consistent with other studies where, compared to younger women, women aged 60 to 65 reported weights closer to real values.17,23 The association between increasing age and over-reporting of height was also observed in other studies.16,17,18,23,24,25,26 As age increases, time since last height measurement might be longer. Authors argued that the value of self-reported height might reflect height at younger age, before a shrinkage due to osteoporosis.26 However because of the relatively young age of the participants in our study, the contribution of osteoporosis to over-reporting of height would be small, if any. In general, more education and number of assets were directly associated with underreporting of weight and waist and hip circumferences. This could be due to women with higher socioeconomic status having poorer body appearance satisfaction. McLaren and colleagues suggest that women with higher education might have a higher risk of poor body appearance because they have a higher exposure to media, which might emphasize the importance of an ideal physical appearance; have higher exposure to health messages, which might inadvertently cause body dissatisfaction through a focus on diet and ideal weight; and have higher expectations on personal development, including body image.27 Along these lines, we found that self-reported waist circumference was closer to real values in indigenous language speakers against non-speakers.
The most important predictor of the systematic errors in self-reported anthropometry was BMI derived from direct measures. Comparing obese to normal weight women, mean differences of direct minus self-reported measures were 3.9 kg for weight, -1.1 cm for height, 2.3 kg/m2 for BMI, 2.0 cm for waist circumference and 0.6 cm for waist circumference. This is consistent with other findings.1,3,4,5,16,18,23,24,25,28,29,30 However, since the correlation between direct and self-reported measures was high (roughly 0.8 to 0.9), this error might have little impact in analyses that use self-reported data as continuous variables, or as multiple ordinal categories. Standard deviations were similar between direct and self-reported measures, which support the hypothesis that errors in self-reported anthropometry would only cause a shift in the distribution of anthropometric variables. Thus, using self-reported measures in regression models might have little impact in measures of association provided that they are included as continuous or multiple ordinal categories independent variables. Other authors suggest that the role of the real BMI on bias in self-reported anthropometry could be attenuated by the individuals’ perception of their own weight. Individuals who described themselves as “too heavy”, regardless of their measured BMI category, were less likely to underreport weight, over-report height, and be classified in a lower BMI category than their real category.29 However, there could be misclassification if self-reported measures were used as categorical variables with few categories (e.g., normal, overweight and obese), since values would be systematically farther away from the real values for any given cutoff point. In our study, sensitivity of self-reported obesity was 66.5% and the prevalence of obesity estimated by self-reported was nine percentage points lower than that estimated by direct measures (33.2 vs. 24.1%). Small differences between measured and self-reported BMI might translate into larger differences in the prevalence of obesity. The latter is especially true when the cutoff for obesity is under a high-density area of the BMI distribution. Therefore, caution is warranted when interpreting the prevalence of overweight and obesity derived from self-reported data.
The main strengths of this study are a large sample size from both rural and urban populations; the availability of several participants’ characteristics, which allowed for the assessment of the factors related to the errors of self-reported anthropometry; and the standardization of measurement processes across settings. Additionally the simultaneous evaluation of several anthropometric indicators allows comparing directly the validity between indicators.
An important limitation of this study is that direct measures were made eleven months (median time) after self-report. Thus, discrepancies between direct and self-reported measures could be explained by real changes in weight, waist or hip circumferences. Time between direct and self-reported measures was independently associated with errors of self-reported height and BMI. However, contrary to what was expected, errors decreased with increasing time between measures. This might indicate improvements in the processes of collecting information over time. This is supported by the fact that time between direct and self-reported measures was longer in the study areas where clinical evaluations were most recent, which also are the sites where participants received a most recent release of the baseline questionnaire. Women included in this analysis were on average more educated and wealthier than the Mexican average. Although that is likely to impact on the generalizability of our results, we obtained estimates of errors of self-reported anthropometry adjusted for several contextual characteristics, including education, rurality and socioeconomic status. Moreover, the cultural and economic diversity of the sites included in this study may help reflecting population differences across Mexico.
These results suggest that self-reported anthropometry is sufficiently valid for epidemiological purposes, especially for the evaluation of association estimates where anthropometric measures are evaluated as exposure, outcome or adjusting variable in adult Mexican women, and can be extended to populations of similar characteristics. Moreover the measurement error model estimates presented here can be incorporated in future analyses to correct for this type of error. Differences between direct and self-reported measures might cause underestimation of the prevalence of overweight and obesity, and should be considered in analyses that rely on self-reported anthropometry.