Introduction
Non-communicable chronic diseases (NCDs), including cancer, are the leading cause of death globally.1 Among the main risk factors include the lack of physical activity (PA).2 Measurement of physical activity levels is crucial to identify current levels of activity, to assess the effectiveness of intervention and/or prevention programs and to estimate its association with NCDs.3
Assessment of physical activity is possible through a variety of objective and subjective methods. Accelerometers (AC) have the advantage of measuring physical activity levels through acceleration, however, this instrument could underestimate some activities, specifically, aquatic and upper body movements.4,5 Self-reported questionnaires are the most common, low-cost, and easy to apply method used to measure physical activity in population-based studies.3 However, these instruments have the disadvantage of misclassifying physical activity levels and thus compromise the ability to detect true associations.3
Therefore, the purposes of this study were to determine the test-retest reliability and validity of the IPAQ long-form in a subsample of female Mexican teachers from the Mexican Teachers’ Cohort (MTC).
Materials and methods
The MTC is a prospective study of 115 345 female teachers aged 25 years and older in Mexico. This study began in 2006 in two states and expanded to 10 more states in 2008. Teachers are members of the Teachers Incentives Program (TIP) (or carrera magistral in Spanish). This is a well-established voluntary economic incentives program for public education teachers.10,11
Participants. In 2013, a random sample of 161 women aged ≥35 years from Mexico City were invited to participate in a validation study. In total, 99 teachers agreed to participate. Written informed consent was obtained from all participants. There were no statistically significant differences in mean weight, height, body mass index (BMI), and hip circumference between teachers that agreed to participate and those that disagreed. However, a statistically higher mean waist circumference was observed for those teachers that did not want to participate (data not shown).
Procedure. Teachers underwent telephone interviews and trained staff collected anthropometric data during 22 months. In the first month, teachers answered the IPAQ long-form (IPAQ1) by phone and underwent anthropometric measurements. At month three, a second IPAQ long-form (IPAQ2) was administered by phone. At month six, teachers visited the clinic and received an Actical accelerometer (AC1) (Mini Mitter Company, Bend, OR, USA). They were asked to wear this device, using an elastic belt all the time (except when in water) for the following nine days. In addition, teachers received a daily log that contained illustrated instructions on how to wear the accelerometer, removals, frequently asked questions and support contacts. After nine days, teachers returned the accelerometer, and a third IPAQ long-form (IPAQ3) was administered. At month nine, a fourth IPAQ long-form (IPAQ4) was self-reported by phone. During month 12, teachers answered a fifth IPAQ long-form (IPAQ5) by phone. And at month 22, a subsample of 69 teachers received and wore an Actical accelerometer (AC2) during seven consecutive days. The study was reviewed and approved by the Instituto Nacional de Salud Pública institutional review board (No. 1130). Written informed consent was obtained from all participants.
IPAQ long-form. The IPAQ was developed by a group of experts to understand and compare physical activity patterns around the world.6 This is a 31-item questionnaire that estimates physical activity levels related to household and yard work activities, work-related activity, self-powered transport, walking, leisure time and sedentary activities during the last seven days in bouts of at least 10 minutes.6 It can be applied by either face-to-face or telephone interview in 15 minutes.6 The Spanish version of the questionnaire is available within the IPAQ webpage.12
Data cleaning of the IPAQ long-form was performed based on the IPAQ protocol according to the following criteria:13 1) data collected in hours were converted into minutes, 2) results reported as a weekly frequency were converted into an average daily time, and 3) “do not know”, “refused” or “missing data” for duration or frequency were removed from the analysis. Truncation was performed for all daily duration values exceeding 180 minutes.13
Days and minutes per day of vigorous, moderate and walking were multiply to obtain minutes per week of vigorous (VPA), moderate (MPA) and walking (WPA) physical activity for each domain. Then, moderate and walking minutes per week were summed to create (MWPA) during household and yard activities, work-related, transportation, walking and leisure time. In addition, minutes per week of MWPA and VPA were added to create total moderate to vigorous physical activity (MVPA) minutes per week.
Actical accelerometer. The Actical is a waistband-mounted omnidirectional accelerometer (dimensions: 2.8 x 2.7 x 1.0 centimeters, weight: 17 grams), which has been previously used in adults to validate and measure physical activity.14 The Actical accelerometer output is expressed as total counts per registered time (counts-min-1).
Data management and cleaning were performed using IBM SPSS software; version 24 (SPSS Inc., an IBM company Chicago Illinois, United States) and the Personal Activity and Location Measurement System (University of California, San Diego, California, United States). Compliance criteria for wearing accelerometers were defined as a minimum of four weekdays of wearing the accelerometer for at least 10 hours per day. All periods of 60 or more consecutives minutes with zero epochs were removed prior to calculating wear time for a given day.15,16 To determine physical activity levels for participants, we used the following cut-points for each set of epochs (minutes of physical activity data):14 sedentary (≤ 1.5 Mets, <100 accelerometer counts) or engaged in light (1.5 to 2.9 Mets, 100 to <1 500 accelerometer counts), moderate (3.0 to 5.9 Mets, 1 500 to <6 500 accelerometer counts), or vigorous (≥ 6 Mets, ≥6 500 accelerometer counts) intensity activity. Finally, MVPA that occurred in bouts of at least 10 consecutive minutes (with allowance of two minutes per each 10 minutes below the 1 530 epoch cut-point) was summed for each valid day and averaged to determine weekly moderate, vigorous, and MVPA.
Physical activity classification (average IPAQs and average AC). Teachers with complete IPAQs and AC data (n= 58) were classified as inactive if they accumulated <150 minutes of MVPA, moderately active if they achieved 150-299 minutes of MVPA, and active if they accumulated 300 or more minutes of MVPA based on WHO physical activity recommendations.17
Anthropometry. Trained personal measured weight and height to the nearest 0.1 kg and 0.1 cm, and the BMI was calculated as kg/m2.18 Waist and hip circumferences were measured to the nearest 0.1 cm.
Statistical analysis. Descriptive analyses (means and standard deviation) were used to characterize the study population. Kolmogorov-Smirnov test was used to test normality in all physical activity variables. Variables that did not meet the normality criteria were logarithmically transformed. Log-transformed mean minutes per week per instrument were compared using student t-test.
Test-retest reliability for the log-transformed work-related, transportation, household, leisure-time, MWPA, VPA and MVPA per week of two accelerometers and five IPAQs long-form were tested using Pearson correlation coefficients. Linear Regression was used to estimate 95% confidence intervals (CI) for the correlation between instruments.
Pearson correlation coefficients and Intra-class correlation (two-way mixed, average measurements and absolute agreement) were used to determine the relationship between log-transformed minutes per week of MWPA, VPA, MVPA for the log-transformed minutes per week of five IPAQ long-form and accelerometers.
A linear regression was used to assess the association between MVPA minutes per week of IPAQ3 and AC1. The intercept and slopes of the regression line and their associated 95%CI was examined to observe if values were different from 0 and 1, respectively. Level of significance was set at p<0.05.
Results
Descriptive characteristics
Characteristics of participants at baseline are described in table I. In total, 86 women answered the IPAQs and on average 61 had two valid accelerometer values. The mean age was 48.2 ± 6.8 years. The mean BMI was 28.2 ± 4.8 kg/m2.
Mean |
SD* |
|
Age (years) |
48.2 |
6.8 |
Weight (kg) |
68.1 |
12.3 |
Height (m) |
155.4 |
6.2 |
Body mass index (kg/m2) |
28.2 |
4.8 |
Waist circumference (cm) |
88.2 |
10.9 |
Hip circumference (cm) |
104.4 |
9.9 |
* Standard deviation
Physical activity means and differences
With respect to intensities of physical activity, higher MVPA and MWPA were observed on IPAQ2 (504 ± 421 and 463 ± 393 min/week, respectively) than IPAQ3 (387 ± 289 and 341 ± 251 min/week, respectively), IPAQ4 (328 ± 254 and 289 ± 223 min/week, respectively), and IPAQ5 (333 ± 306 and 300 ± 295 min/week, respectively). There were no differences between IPAQ1 and IPAQ2 for all intensities. MWPA minutes per week were significantly higher based on IPAQ1 and IPAQ3 compared to IPAQ4 and IPAQ5 (451 ± 350 and 341 ± 251 vs. 289 ± 223 and 300 ±295, respectively). Based on domains of physical activity, teachers reported higher minutes per week of household physical activities for all IPAQs compared to other domains (p<0.01) (figure 1).
MWPA: moderate and walking activities
VPA: vigorous physical activity
MVPA: moderate-to-vigorous physical activity
Line: standard deviation
Number: mean
* Statistically significant differences between different methods (p<0.05)
a,b,c,d,e: Statistically significant differences between IPAQs (a- IPAQ1, b- IPAQ2, c- IPAQ3, d- IPAQ4, e- IPAQ5) (p<0.05)
Related to household physical activity, statistically significant differences were observed between IPAQ1 and IPAQ3, IPAQ4 and IPAQ5 (200 ± 198 and 162 ± 106, 138 ± 133, 126 ± 174 minutes per week, respectively) (p<0.05). The same pattern was observed for active transportation (IPAQ1: 100 ± 114 vs. IPAQ3: 72 ± 92, IPAQ4: 63 ± 100, IPAQ5: 66 ± 93 minutes per week, p<0.05). Leisure time physical activity was significantly different between IPAQ2 and IPAQ5, and IPAQ3 and IPAQ5 (111 ± 142 and 83 ± 130 minutes per week) and (106 ± 162 and 83 ± 130 minutes per week) respectively (p<0.05). Work-related physical activity was statistically significant different between IPAQ2 and IPAQ3 (66 ± 187 vs. 46 ± 115 minutes per week) (figure 1).
Based on accelerometer measures, there were no statistically significant differences between minutes per week in AC1 and AC2 at any intensity. Statistically significant differences were found between average accelerometers and average IPAQs long-form for all intensities (p<0.001) (figure 1).
Reliability
The test-retest correlation between five IPAQs long-form is presented in table II. IPAQ1 minutes per week of MWPA, VPA, MVPA, work-related, transportation, household and leisure-time physical activity were significantly correlated to those minutes of IPAQ2, IPAQ3, IPAQ4 and IPAQ5 with r values ranging from 0.29 to 0.59. The same result was observed for the correlation between minutes per week of MWPA, VPA and MVPA, work-related, transportation, household and leisure-time physical activity IPAQ2 vs. IPAQ3, IPAQ4, IPAQ5, IPAQ3 vs. IPAQ4 and IPAQ5, and IPAQ4 vs. IPAQ5 (range: 0.26 to 0.65), expect for the correlation between minutes per week of VPA IPAQ2 and IPAQ4 and minutes per week of transportation IPAQ2 and IPAQ5.
(Min/week) r (CI95%) |
Accelerometer (n= 61) |
IPAQ long-form (n= 86) |
|||||||||
1,2 |
1,2 |
1,3 |
1,4 |
1,5 |
2,3 |
2,4 |
2,5 |
3,4 |
3,5 |
4,5 |
|
Domains of PA | |||||||||||
Work-related |
NA |
0.40 (0.20,0.60) |
0.39 (0.19,0.61) |
0.46 (0.28,0.68) |
0.36 (0.14,0.52) |
0.45 (0.26,0.66) |
0.37 (0.17,0.59) |
0.51 (0.31,0.66) |
0.38 (0.18,0.58) |
0.34 (0.13,0.50) |
0.38 (0.16,0.52) |
Transportation |
NA |
0.35 (0.15,0.58) |
0.36 (0.14,0.51) |
0.34 (0.13,0.56) |
0.36 (0.16,0.56) |
0.48 (0.25,0.58) |
0.40 (0.19,0.59) |
0.20 (-0.01,0.39) |
0.51 (0.40,0.79) |
0.42 (0.24,0.67) |
0.24 (0.02,0.43) |
Household |
NA |
0.53 (0.29,0.60) |
0.44 (0.19,0.50) |
0.51 (0.25,0.53) |
0.57 (0.30,0.57) |
0.38 (0.16,0.54) |
0.32 (0.10,0.47) |
0.42 (0.25,0.60) |
0.36 (0.15,0.54) |
0.31 (0.09,0.50) |
0.42 (0.22,0.62) |
Leisure-time |
NA |
0.49 (0.31,0.60) |
0.59 (0.42,0.78) |
0.56 (0.39,0.76) |
0.45 (0.25,0.64) |
0.50 (0.30,0.67) |
0.49 (0.29,0.67) |
0.53 (0.33,0.69) |
0.59 (0.42,0.77) |
0.56 (0.37,0.73) |
0.64 (0.47,0.80) |
Intensity of PA | |||||||||||
MWPA |
0.60 (0.39,0.80) |
0.35 (0.19,0.70) |
0.47 (0.38,0.91) |
0.51 (0.32,0.68) |
0.48 (0.29,0.67) |
0.45 (0.28,0.70) |
0.61 (0.33,0.60) |
0.34 (0.10,0.42) |
0.60 (0.30,0.55) |
0.43 (0.17,0.45) |
0.62 (0.45,0.80) |
VPA |
0.61 (0.43,0.81) |
0.33 (0.13,0.54) |
0.43 (0.24,0.64) |
0.29 (0.09,0.52) |
0.32 (0.12,0.57) |
0.26 (0.05,0.48) |
0.18 (-0.03,0.41) |
0.28 (0.08,0.52) |
0.38 (0.19,0.60) |
0.45 (0.27,0.68) |
0.48 (0.29,0.69) |
MVPA |
0.65 (0.44,0.83) |
0.35 (0.18,0.69) |
0.49 (0.40,0.89) |
0.52 (0.31,0.66) |
0.52 (0.35,0.74) |
0.54 (0.38,0.77) |
0.64 (0.35,0.60) |
0.38 (0.15,0.49) |
0.64 (0.34,0.57) |
0.49 (0.24,0.54) |
0.65 (0.54,0.91) |
NA: Not applicable
PA: physical activity
MWPA: moderate and walking activities
VPA: vigorous physical activity
MVPA: moderate-to-vigorous physical activity
Bold type: statistically significant correlations
CI: confidence interval
Based on accelerometer estimates, a high correlation (r≥0.60) was observed between AC1 and AC2 minutes per week of MWPA, VPA and MVPA (table II).
Validity
Compared to the log-transformed minutes per week AC1 values, VPA and MVPA were significantly correlated to average log-transformed minutes per week IPAQ long-form values (Pearson correlation: 0.31, 95%CI: 0.11, 0.47 and 0.25, 95%CI: 0.04, 0.55, respectively). Long-transformed minutes per week AC1 VPA and MVPA measures were significantly correlated (p<0.05) to IPAQ3 values r= 0.44 and r= 0.24, respectively. Higher correlations were observed using Intra-class correlation for all intensities (table III).
Variables (r (95%CI)) |
||
Pearson |
ICC |
|
Average IPAQ vs. AC1 (n= 82) |
||
MWPA |
0.17 (-0.06, 0.44) |
0.17 (-0.06, 0.26) |
VPA |
0.31 (0.11, 0.47) |
0.42 (0.06, 0.63) |
MVPA |
0.25 (0.04, 0.55) |
0.25 (-0.13, 0.51) |
IPAQ3 vs. AC1 (n= 87) |
||
MWPA |
0.16 (-0.07, 0.42) |
0.21 (-0.12, 0.45) |
VPA |
0.44 (0.33, 0.83) |
0.60 (0.39, 0.74) |
MVPA |
0.24 (0.04, 0.50) |
0.31 (-0.04, 0.54) |
ICC: intra-class correlation
IPAQ: International Physical Activity Questionnaire - long form
AC: accelerometer; MWPA: moderate and walking activities
VPA: vigorous physical activity; MVPA: moderate-to-vigorous physical activity
Bold type: statistically significant correlations (p<0.05)
CI: confidence interval
MVPA: moderate-to-vigorous physical activity
MWPA: moderate and walking activities
MVPA: moderate-to-vigorous physical activity
As shown in figure 2, when IPAQ3 MVPA measures were used to predict accelerometer 1, the intercept was 1.57 (p<0.001) and the slope was 0.22 (p<0.05). This interception suggests that those teachers with lower MVPA measured by accelerometers over-reported MVPA levels, however, this over-report decreased as objectively measured MVPA increased. The interception between the line of identity and the Pearson correlation line was at 126 minutes per week of MVPA.
The distribution of teachers across physical activity categories was significantly different between average IPAQs and average AC. Based on average AC, 56.9% were classified as inactive, 34.5% as moderately active and 8.6% as active. Whereas, 12.1% were classified as inactive, 27.6% as moderately active and 60.3% as active based on average IPAQs (data not shown).
Discussion
The main purposes of this study were to determine the test-retest reliability and validity of the IPAQ long-form in a sub sample of Mexican teachers. Our findings indicated that minutes per week of three different intensities of physical activity (VPA, MWPA and MVPA) between IPAQs long-form were correlated (range: r= 0.26 to 0.65). Minutes per week of four domains of physical activity (work-related, transportation, housework and leisure-time) had a correlation between 0.24 and 0.64. Furthermore, minutes per week of MVPA and VPA AC1 were correlated to average MVPA and VPA IPAQs (r= 0.25 and r= 0.31), and MVPA and VPA estimates of AC1 were correlated with IPAQ3-long form measures answered one-week apart (r = 0.24 and r=0.44).
Teachers reported higher average mean values for all intensities based on IPAQs long-form compared to the Actical accelerometer estimates. This over-report has been observed in other self-reports19 and in the IPAQ short-form for Mexican population.7 The overestimation of physical activity levels by the questionnaire could be due to social desirability and/or social approval bias. However, the underestimation of the Actical accelerometer could be related to the lack of record of water activities and strength exercises such as weight lifting.4,5,7
Data from this study showed that in three out of five self-reported IPAQs teachers accumulated more than 150 minutes per week on household physical activities. In other words, teachers achieved physical activity recommendations only by performing activities at home. In addition, compared to other domains, higher number of minutes per week of household physical activity were reported. The same pattern has been observed in previous Latin American studies.20 There are many possible reasons of this observation; first, there could be a misclassification of physical activity intensities during household activities,21 second, questionnaire structure problems (ex.: since IPAQ is conformed by four different domains, it could be difficult for respondents to understand, differentiate and report minutes per week of each domain),20 and third, there is good evidence that women truly perform more household activities than in any other domain.22
Mean differences were observed in almost all minutes per week for transportation, household, MWPA and MVPA between IPAQs. Similar results have been observed in previous repeatability studies worldwide23,24 and in Mexico.7 These differences could be related to the fact that teachers accumulated different minutes of physical activity throughout the year, over or underestimation of minutes per week of physical activity and the recognition of activities performed throughout the year.7,23,24
The correlation between accelerometers 17 months apart was good for MVPA (r=0.65) and was slightly lower for VPA than for MWPA (r= 0.61 vs. r= 0.60, respectively). Some studies have found different correlation values between accelerometers (r ranged from 0.47 to 0.96).25,26,27,28 The main differences in the correlation values could be a shorter interval range between measurements,25 the use of laboratory settings and controlled activities,25 seasonal physical activity variability,27 differences between weekdays and weekends,28 intra-individual variance,28 types of accelerometer,25,26,27,28,29 changes in physical activity participation,30 and accelerometer hip position (left or right).29
The test-retest reliability for IPAQ long-form MVPA was lower compared to previous studies.6,31,32,33,34 Some explanations of the lower MVPA correlations could be related to the fact that most of the studies that reported higher correlation values had a shorter time interval between surveys.6,31,32,33,34,35 Lower correlation values have been observed for VPA than for MWPA.31,32 This could be attributed to the low frequency of VPA. In addition, in almost all IPAQ test-retests, lower repeatability correlation values were observed for transportation physical activity (ranging from 0.20 to 0.51) compared to other domains. In contrast, higher correlation values were observed for the leisure-time domain (ranging from 0.45 to 0.64). These results could be explained by the fact that transportation activities could vary from day to day and leisure-time activities are the most regular physical activities.31,32
Minutes per week of MVPA and VPA for average IPAQ vs. AC1 and IPAQ3 vs. AC1 were significantly correlated. However, minutes per week of MWPA were not correlated. Similar results have been observed in previous studies.36 This could be explained by the fact that vigorous physical activities are easier to rememer and report than moderate physical activities.36 In addition, underreport of minutes of moderate physical activity.34,36
The correlation of the average minutes MVPA between the IPAQ long-form vs. AC1 was r= 0.25. Compared to other self-report instruments, slightly higher correlation values have been observed in different settings and countries for IPAQ long-form ranged from 0.30 to 0.38.6,19,32,35 One explanation for the low correlation observed in this type of analysis for all intensities could be the fact that subjects wore an Actical accelerometer one week, and this week does not represent the physical activity performed during almost 12 months (IPAQ assessment). The difficulty for participants to estimate the physical activity intensity in which they are engaged in,35 and social desirability bias might also produce an overestimation of physical activity levels.35 However, a slightly low correlation was observed between AC1 and IPAQ3 answered one week apart. This result is similar to what has been observed in previous studies using IPAQ-long form (r= 0.31 Mexico vs. r= 0.33 global)6 and similar compared to IPAQ-short form (r= 0.31 long form vs. r= 0.31 short form).7
The prevalence of physical inactivity was higher based on average AC (56.9%) than for average IPAQs (12.1%). This result is similar to previous studies in Mexican population.7 Caution should be taken when using self-report physical activity questionnaires, because misleading prevalence could conduct to underreported associations (physical activity vs. determinants, and vs. health outcomes).7
Strengths and limitations
This is one of the few studies that validated the IPAQ long-form in a Mexican sample. Although participants used the Actical accelerometer day and night during seven days, we asked them to remove their device every time they had contact with water (ex.: swimming, showers). This could result in an underestimation of the physical activity levels. It is well known that accelerometers and questionnaires measure different constructs of physical activity, accelerometers capture body movements, whereas questionnaires measures behaviors, making it difficult to compare.4,5 The 10-minute cut off point of the IPAQ could lead to overestimation, in other words, respondents could have rounded the minutes of physical activity less than 10 minutes. In addition, self-report questionnaires based on seven previous days are limited to last week, this might not represent necessarily the usual physical activity. We could not test the validity of domain-specific physical activity because accelerometers could not differentiate between these domains. In addition, it is important to remind that this study was limited to female Mexican teachers from Mexico City, which increases its internal validity, but results may not represent women with other professions, residents from other cities, nor men.
Conclusion
Test-retest reliability of the IPAQ long-form was modest for MVPA and the validity against accelerometer was fair for MVPA and VPA in a sub sample of Mexican female teachers. Although these correlation values were similar to those reported previously, future studies should consider the use of objective measures to estimate physical activity levels.