Introduction
Sexually transmitted infections (STI) and HIV/AIDS continue to have a high prevalence and incidence worldwide ( United Nations Programme on HIV and AIDS [UNAIDS], 2014) and in Mexico ( Centro Nacional para la prevención y el control del VIH/SIDA [CENSIDA], 2016 ). Condoms are the prevention method that has proved to provide the best protection against the spread of STDs and HIV/AIDS, regardless of the partner’s serological status ( Centers for Disease Control and Prevention [CDC], 2014 ; United Nations Programme on HIV and AIDS [UNAIDS], 2015 ).
Accordingly, several theoretical models have been developed to explain condom use, with the Theory of Planned Behavior (TPB) being one of the most popular ( Albarracín, Johnson, Fishbein & Muellerleile, 2001 ; Andrew et al., 2016 ; Armitage & Conner, 2001 ; Bennett & Bozionelos, 2000 ; Tyson, Covey & Rosenthal, 2014 ). TPB provides a conceptual framework with cognitive basis of behavior, in other words, the value an individual places on beliefs at the moment of engaging in a particular behavior ( Yzer, 2012 ). The model suggests that behavior is determined by intention, and that the intention of using a condom is influenced by attitudes, in other words, a positive or negative evaluation of this behavior ( Fishbein & Ajzen, 1975 ). Subjective norms refer to the opinion of significant persons for the individual on condom use ( Ajzen & Fishbein, 1980 ) while self-efficacy is the perceived ability to use condoms ( Ajzen, 1991 ), which also has direct effects on behavior.
There has been an increase in the research to measure TPB in condom use by using psychometric instruments such as the Sexual Behavior Scale ( Malcolm et al., 2013 ), and questionnaires based on TPB ( Fishbein & Ajzen, 2010 ; Francis et al., 2004 ; Prati, Mazzoni & Zani, 2014 ; Rich, Mullan, Sainsbury & Kuczmierczyk, 2014 ). However, none of these instruments was specifically designed for the Spanish-speaking population.
In Mexico, very few studies have focused on developing measurement instruments that take the characteristics of the Mexican population into account other than the research by Robles-Montijo and Díaz-Loving (2011) . However, this questionnaire, developed for the teenage middle- and high-school population, with a mean age of 15.38 years, found that there is a difference in the sexuality of middle-school and high school students ( Vasilenko & Lanza, 2014 ).
Previous studies have regarded substance use as a predictor of the determinants of the TPB in condom use, in other words, when people are intoxicated, attitudes, subjective norms and self-efficacy regarding condom use decline ( Conner, Sutherland, Kennedy, Grearly & Berry, 2008 ; Davis et al., 2015 ). However, it is not known whether the effect of alcohol use is a “true” effect or an artifact of measurement caused by the bias derived from substance use, which can be identified through the examination of Differential Item Functioning (DIF) ( Starosta, Berghoff & Earleywine, 2015 ).
DIF refers to the likelihood of individuals choosing a particular response on an item despite having the same degree of response to a latent variable (e.g. perceived norms of condom use). This difference in probability may be due to belonging to a particular group (for example, man/woman) or to another non-categorical variable (e.g. days of substance use) ( Starosta et al., 2015 ).
The lack of consistency in condom use in young adults is regarded as a public health problem ( Encuesta Nacional de Salud y Nutricion [ENSANUT], 2012 ; Instituto Nacional de las Mujeres [INMUJERES], 2012 ), since 58.6% of men and 75.5% of women did not use condoms ( Instituto Nacional de las Mujeres [INMUJERES], 2012 ) during their last sexual intercourse, and reported having various sexual partners and not using condoms when they were intoxicated.
Accordingly, the valid measurement of the TPB model in condom use is a substantial need for the development and evaluation of HIV prevention programs ( Noar, 2006 ). Given this context, this manuscript has two objectives: to develop a scale to measure the TPB model in condom use in a sample of young adults (Study 1), and to evaluate the DIF related to substance use in the questionnaire (Study 2).
MetHod
Study 1
Participants
A convenience sample consisting of 50 participants from a public university, 29 men and 21 women, with an average age of 23, who met the inclusion criteria: a) being enrolled in an academic program, and b) having over 18 and under 25 years.
Measures
The preliminary questionnaire was applied using the open question technique, and the instructions used were those proposed by Fishbein (1980) : “List all the advantages/disadvantages you can think of that condom use might have” and “Mention a person(s) or group(s) you know that you think would approve/disapprove of your using a condom”.
Procedure
Participants who met the inclusion criteria were invited to participate voluntarily and spend a few minutes answering a questionnaire on condoms. If they agreed to participate, they were given the open-ended questionnaire and were asked to include all the words or short sentences that came to mind and to take their time.
Subsequently, using content analysis ( Krippendorff, 1990 ) the information in the questionnaires was organized and the registration units (adjectives) were selected. Once selected, the repetition of the registration units was determined in the recount in order to categorize them and reduce the number of classes or categories ( Krippendorff, 1990 ).
Once the criteria for categorizing the registration units had been obtained, the categories were coded as attitudes, norms, and self-efficacy. Adjectives evaluating condoms were placed in the attitudes category, the important people mentioned were put into the standards category and adjectives related to problems with condom use were included in the category of self-efficacy. Lastly, the items for the preliminary version of the questionnaire were drafted.
Study 2
Participants
A cross-sectional study was conducted at three universities in Mexico City using convenience sampling. Data were collected from May to August 2015. Participants were included according to the criteria in Study 1. Sample size was calculated through a previous Monte Carlo simulation study, requiring approximately 600 participants to detect a DIF with a medium effect size. The simulation was based on recommendations previously established in the literature ( Muthén & Muthén, 2002 ).
Measures
TPB Determinants. A 34-item instrument was used, which resulted from Study 1 (Appendix 1).
Substance use. Following the recommendation in the literature, substance use was measured through a self-report question evaluating the number of days of substance use in the previous month (“On how many of the past 30 days did you use alcohol or drugs?”) ( Donovan et al., 2012 ).
Risky sexual behavior. A self-report was used to evaluate the number of partners in the past 30 days (more than two partners in answer to the question “With how many people have you had sexual relations with anal or vaginal penetration?”), condom use during the last sexual intercourse (“Did you use a condom during your last sexual intercourse?”), knowledge of HIV sero-status of partner in the past 12 months (“How often have you known whether your sexual partner(s) was/were infected with HIV?”). These questions have been used in other studies to assess risky sexual behavior ( Donenber, Emerson, Bryant, Wilson & Weber-Shifrin, 2001 ).
Demographic variables. In order to describe the sample, respondents were asked about their age, sex, years of education, income and marital status.
Procedure
Five evaluators were trained to measure substance use, sexual behavior and study procedures, and were certified by a supervisor as having complied with the data collection procedures. Since most cases of HIV infection are due to sexual contact, only penile-vaginal contact was evaluated; oral sex was not evaluated. According to the recommendations of previous studies on condom use, an operational definition of a sexual intercourse was established ( Crosby, Charnigo, Weathers, Caliendo & Shrier, 2012 ), which was read to the participants before asking them about their sexual behavior. For the purposes of this study, a sexual intercourse was defined as “A process that begins with the insertion of the penis into the vagina or anus and ends with the male orgasm”.
Participants were recruited in their respective universities by trained evaluators. Respondents who agreed to participate were evaluated to determine whether they met the inclusion criteria, and subsequently signed an informed consent form. For the purposes of analysis, cases with patterns of missing data (no response to the questionnaire regarding condom use) or who had not initiated their sexual lives were removed. All the participants included in this study gave their verbal and written consent.
Statistical analysis
Descriptive statistics are provided for demographic variables and risky sexual behavior. Differences in these variables between participants who reported at least one day of use and no day of use were estimated through the Student t test for numerical variables and chi square (χ2) for categorical variables.
Since the first step in undertaking a MIMIC analysis is to find a model with adequate goodness of fit, items from the version derived from Study 1 were selected. Then, a frequency analysis was conducted, the Student t test was applied for independent samples, a bias analysis was performed, and the correlation of each item ( Rodríguez-Pérez, Valencia-Flores, Reyes-Lagunes & Lara-Muñoz, 2013 ) with each determinant of TPB was obtained. The Exploratory Factor Analysis was replaced by the Confirmatory Factor Analysis (CFA). An unifactorial model was evaluated for each of the TPB determinants. Weighted least squares adjusted to the variance and mean was used as the estimator, since this is regarded as the most accurate estimator in CFA with non-normal Likert variables ( Beauducel & Herzberg, 2006 ; Muthén & Kaplan, 1985 ). The fit measures that were taken into account, in keeping with the recommendations of the literature ( Hu & Bentler, 1999 ; Satorra & Bentler, 2001 ), were the Satorra-Bentler Scaled Chi-square (χ2S-B), the Satorra-Bentler Scaled Chi-square between degrees of freedom (χ2S-B/df), the scaled comparative fit index (CFIS), and the Scaled Root Mean Square Error of Approximation (RMSEAs). Regression coefficients, standard errors and statistical significance are presented for each of the parameters estimated in the model. In order to handle missing data, the full information matrix method was used, because it has provided accurate estimates of parameters in Monte Carlo studies ( Enders & Bandalos, 2001 ). The next step for MIMIC was to establish similar mediation models to those established in a previous study ( Miles, Marshall & Schell, 2008 ), where the variable for the number of days of substance use had direct effects on each of the items and the effect was mediated by the latent variables (attitudes, subjective norms and self-efficacy) (figure 1). In this procedure, the null hypothesis is that the latent variables are complete mediators for the link, and therefore the direct effect of the variable is not significant. If the null hypothesis is rejected, DIF is assumed to exist. The MIMIC analysis was performed using the Mplus 6.12 statistical package.
Following Cribbie’s recommendation ( Cribbie, 2007 ), the deflated p value associated with multiple tests was corrected through the False Discovery Rate (FDR) ( Benjamini & Hochberg, 1995 ) through the R 3.3.0 statistical package. A value of p < .05 was considered significant for all the analyses.
Results
Study 1
In the analysis conducted through the questionnaires with open-ended questions on condom use, the repetition of the registration units was counted for each instruction. Those with the highest frequency were chosen and the terms were categorized in the TPB dimensions (table 1).
The instrument obtained from the registration units consisted of 34 questions distributed throughout the three theoretical dimensions-attitudes, subjective norms and self-efficacy-comprising the TPB. The first dimension consisted of 10 items assessing attitudes to condom use, through a seven-point semantic differential evaluation (Pleasant-Unpleasant); the second dimension assessed subjective norms, consisting of eight items with five response choices (unimportant to very important), while the third dimension, assessing self-efficacy, comprised 16 items with five response options (totally unsafe to totally safe).
Study 2
A total of 886 university students were recruited, 697 of which initiated the application. After the eligibility evaluation, the sample was reduced to 646 participants (figure 2), 363 women and 279 men with an average age of 23.16 years (SD = 3.19).
Participants had an average of 14.58 years of schooling (SD = 1.16), 89.7% had never married, 6.9% were married or living together; 44% reported an approximate monthly income of between $0 and $2000; approximately 25.1% had a monthly income of $2001 to $4000, while 15.7% had an income of $4001 to $6000 (Mexican currency).
In relation to substance use in the previous 30 days, 75.2% reported using alcohol, 45.8% of which reported excess drinking; 13.8% used marijuana and 1.7% cocaine, while the average number of days of use was: 5.04 (SD = 4.86) days of alcohol use, .72 (SD = 3.51) of drug use and 5.62 (SD = 5.93) days using any substance.
As regards risky sexual behaviors, 51.8% reported not using a condom during their last sexual intercourse, 17.1% reported several partners in the previous month, while 68.0% reported lack of knowledge of the HIV status of their partners in the past 12 months. Those reporting substance use had a lower relative probability of being married, a greater number of partners, a lower frequency of condom use and less knowledge of the HIV status of their partners (table 2).
Confirmatory Factor Analysis
Items with a correlation of less than .30 between them were removed. After choosing the items for each model, the following was obtained: attitudes consisted of 4 items (5, 7, 8, 9), with an adjustment of χ2S-B(2) = 3.902, p < .001; CFIs = .999; TLIs = .996; RMSEAs = .037, 90% CI ≤ .001-.095; for subjective norms model, a 6-item model was used, with a goodness of fit of χ2S-B(7) = 9.103, p < .245; CFIs = .999; TLIs = .998; RMSEAs = .022, 90% CI ≤ .001-.056; while for the self-efficacy model, the model was adjusted by χ2S-B(25) = 65.115, p < .001; CFIs = .982; TLIs = .974; RMSEAs = .050, 90% CI = .036-.066, with a total of 9 items (table 3 shows the coefficients and standard errors of each model).
Differential Item Functioning (DIF)
Figure 1 shows the DIF model for each TPB determinant. After correcting the p values through the FDR values, it was found that item 8 (“Excellent-Terrible”) had a value of p < .05 for the attitudes model, whereas in the subjective norms model, items 4 (“the opinion of the Church/priest if I decide to use a condom”) and 8 (“the opinion of my friends who do not use condoms”) had statistically significant values (p = .009 and p = .006, respectively). No significant DIF were found in the self-efficacy model (table 3).
Discussion and conclusion
The main aim of this study was to develop an instrument for the Mexican population to measure the determinants of TPB (attitudes, subjective norms and self-efficacy) through a content analysis of the answers to open-ended questions and an item selection procedure and the selection of items from the questionnaires on attitudes (4 items), subjective norms (6 items) and self-efficacy (9 items).
A second objective of the study was to evaluate the DIF caused by substance use. Specific items with this effect were found in the attitudes and subjective norms models. This implies that these items have a specific bias. In people with substance use, this bias increases significantly as substance use rises.
Previous studies have shown that there is a positive relationship between attitudes toward condoms and their use ( Álvarez, Villarruel, Zhou & Gallegos, 2010 ). However, it has been reported that substance use causes a negative evaluation ( Conner et al., 2008 ; Davis et al., 2015 ). For example, when people are under the effects of any substance use, they avoid using condoms, because they do not feel the same and it delays ejaculation, causing them to rate condoms as terrible ( Maher et al., 2011 ).
It has also been documented that people are influenced by the opinions or behaviors of their peer group, and are more pressured to have unprotected sex when there are people who use substances in the group (item 8 “The opinion of my friends who do not use condoms”) ( Sacolo et al., 2013 ; Selikow, Ahmed, Flisher, Mathews & Mukoma, 2009 ).
It has been found that religion is a protective factor for both substance and condom use ( Inmaculada, Bermudez, Ramiro & Buela-Casal, 2014 ; Valle, Canizales & Potter, 2010 ). However, the inverse effect that explains the increase in substance use is due to low religiosity (item 4, “The opinion of the Church/priest if I decide to use a condom”) ( Ellickson, Collins, Hambarsoomians & McCaffrey, 2005 ) has not been studied. This is an area for future research to explain this link in more detail.
Item 11 (“Using a condom even if you have used alcohol or other drugs”) did not obtain a significant DIF, contrary to expectations, since a negative link has been found between substance use and self-efficacy in condom use ( Bennett & Bozionelos, 2000 ; Davis et al., 2015 ). A probable explanation is that when people who have experimented with condom use are intoxicated, they are more likely to use it correctly and consistently even when they are under the effects of a particular substance ( Calsyn, Baldwin, Niu, Crits-Christoph & Hatch-Maillette, 2011 ).
There is little evidence of the detection of DIF due to substance use in TPB indicators for condom use, since this is the first study to undertake this type of analysis. There is evidence at the construct level that is not sensitive to the DIF effect, meaning that possibly spurious links may be obtained.
DIF was handled differently in our study from previous research ( Conrad, Conrad, Passetti, Funk & Dennis, 2015 ) that detects items with significant DIF and subsequently eliminates them. This is because the items did not fit the measurement model, as a result of which only the free DIF indicators were used. This has the advantage that the comparison between groups is easier to interpret. However, the disadvantage of this method is the change in the measurement of the construct, since the items deleted provided key information on the phenomenon studied.
Therefore, as is the case, with various studies ( Lewis, Yang, Jacobs & Fitchett, 2012 ; Starosta et al., 2015 ), a DIF analysis was conducted to identify the items that operated differently between peer groups to determine the reason why the item is being modified by an observable variable (e.g. substance use). According to the authors, items with DIF open the door to future research, due to the importance of evaluating how indicators behave with a population with a particular trait.
It is important to mention that during the evaluation of the measurement models, detection and handling of items with DIF is substantial. There are alternatives ranging from the deletion of the item from the scale to the inclusion of the variable that causes DIF in the model. In both cases, the objective is to balance the scores in order to prevent the indicators from providing inflated results in the group. This in turn leads to a loss of accuracy in the evaluation of the construct and therefore unreliable results ( Starosta et al., 2015 ). Accordingly, if an item with DIF is detected, the researcher must decide whether to delete the item or to handle it with caution in order to prevent results from being altered by a bias in the measurement of the indicators.
Limitations considered in the study include: marital status assessment, since asking whether a person has “Ever been married” fails to provide information on the current relationship or its duration. The link between substance use and sexual behavior in states of intoxication, measured by the number of days of use of any substance, fails to discriminate between the days when substances were used and when they were not. This is particularly important because the study was conducted on a non-clinical population (college students), which is why it is essential to conduct studies where the amounts of the substance used are measured in order to observe a direct relationship between the two variables. Another limitation was the fact that sexual orientation was not included in the analysis of the DIF. This has been a concern in previous studies ( Starosta et al., 2015 ) since people with a predominantly homosexual or bisexual orientation may respond differently from heterosexual men and women. This issue should be considered in future studies evaluating the DIF caused by sexual preferences. Another constraint was the sample size, which made it impossible to perform more complex analyses of DIF. Interactions were observed between substance use and sex in the indicators of TCP determinants in condom use. A larger sample size would be required to improve the accuracy of this analysis.
In conclusion, the detection of DIF in the evaluation of measurement models is a substantial need in the accurate assessment of psychosocial constructs. This is the case of substance use in TCP indicators, where bias was detected in the measurement of indicators. These could be areas of opportunity where prevention programs could have an impact on increasing condom use and controlling HIV/AIDS.