Introduction
Cervical cancer (CC) is the fourth most prevalent malignant tumor presented in women worldwide, with an estimated nearly 570 thousand new cases in 2018.1 The International Agency for Research on Cancer estimates that the global CC incidence was 13.1 per 100 000 women of all ages.1,2 According to Global Burden of Disease data, the age-standardized incidence rate of CC was 12 cases per 100 000 people in Mexico.3 CC was the primary cause of death by neoplasia in Mexican women up to the year 2005. After that year, breast cancer replaced it as the primary cause of death and CC became the second most frequent cause of death from neoplastic diseases.4 However, at a sub-national level, CC remains the primary cause of cancer death in women in twelve of thirty-two states of Mexico.5 CC continues as the most important cancer in a number of low income countries.6
The Census of Population and Housing estimates a reduction in the mortality rate of cervical cancer in Mexico from 1990 to 2010 (10.2 a 9.7 per 100 000).7 This reduction in mortality is mainly attributable to the decrease in the birth rate and the increase in cervical cytology coverage.8,9 Rural zones still show the highest prevalence and mortality risks, even though the cervical cancer screening coverage has increased from 33.3% in the year 2000 to 49% in the year 2012.9
The National Cancer Screening Program of Mexico (NCSP) uses secondary prevention strategies to benefit the most vulnerable population sectors. Since 1974, NCSP has adopted conventional cervical cytology for all women under 65 years old into routine screening in Mexico. In 2008, the hrHPV test was incorporated as a screening procedure for all women older than 35 years. Conventional cervical cytology continued as the strategy for woman younger than 35 years. The positive cases detected by hrHPV-based screening are evaluated with cytology as a triage test. Given the high sensitivity (96%) but low specificity (76%) of the hrHPV screening, a reflex triage testing with cytology is necessary before deciding which women need a colposcopy examination.10 The current clinical guidelines with this approach have demonstrated an increase in negative predictive value of the hrHPV test.11 The hrHPV with cytology triage strategy is an excellent combination for women older than 30 years who may experience greater security when reassured by both tests that they are free of the disease.12,13 All information about NCSP activities are documented in Women Cancer Information System (SICAM, for its Spanish acronym) database in operation since 2000, which holds around 28 million records registers between 2008 and 2018.
Cervical screening has contributed significantly to the decrease in CC, mainly in high income countries. In Europe, large-scale clinical trials reported that hrHPV-based screening detects 70% more cervical lesions than conventional cytology.14,15 In other populations, studies reported 90% and more detected high-grade cancer precursors using this strategy.16 In Mexico, evidence from controlled studies indicated that hrHPV improves the detection of precursor cervical cancer lesions.12 Chile has had similar results with hrHPV screening.2 The benefit provided by the introduction of hrHPV testing in the existing screening programs is enormous. When carried out together, hrHPV screening with cytology triage test are capable of timely detection of a greater number of precursor lesions. In fact, screening intervals can be extended when tests are carried out in parallel or sequentially, and operational costs are reduced.16,17
The efficacy of hrHPV-based screening in NCSP has not yet been evaluated. In addition, there are still inequalities in cervical cancer regarding diverse sociodemographic factors, which are not fully understood. A performance comparison between hrHPV with cytology triage and conventional cytology in the NCSP will provide evidence about the benefits of introducing hrHPV testing in uncontrolled settings. An evaluation of how these variations are present in Mexico and how they influence the heterogeneity of the performance of NCSP is warranted. This work would also contribute to the ongoing efforts towards the creation of a public policy for the management of CC.18
Therefore, we describe the methods of nationwide database study. Our work is aimed to evaluate hrHPV screening with cervical cytology as triage, compared to conventional cervical cytology as the primary screening for detection of grade 2+ cervical intraepithelial neoplasia (CIN2+), using information extracted from the SICAM database. Additionally, our study will evaluate potential regional differences in the prevalence of hrHPV infection and CIN +2 among the states of Mexico within the NCSP and generate information about trends of hrHPV infection by age group and geographical zone.
Materials and methods
National Cancer Screening Program
Figure 1, in which NCSP actions are summarized, is modified from the Specific Action Program and shows the current care model for CC prevention and control.19 This algorithm is used in the health services provided by the Ministry of Health (MoH) in Mexico. Medical care that is provided by the health services of MoH is mainly directed towards Seguro Popular users (population without access to a social security benefits). However, medical care related to prevention of CC can be provided to anyone who requests it, regardless of his or her health insurance affiliation.
Source: Modified from the Specific Action Program (PAE), Women’s Cancer Prevention and Control for 2013-201819
* Complementary cytology on women with a positive result in hrHPV testing
Over time, the federal government has tried to unify sanitary service strategies to guarantee effective and timely detection of CC. This has been accomplished by means of the creation of guidelines called the Official Mexican Regulations which are nested in the NCSP, with particular objectives, strategies and lines of action for cancer prevention and control in women.20 The current government proposes indicators which favor increased coverage of prevention programs and a reduction in morbidity and mortality due to cancer in women. Goal compliance will be verified by means of the National Development Plan from 2013 to 2018 and the Sectorial Health Plan (PROSESA) from 2013 to 2018.21
SICAM database
The procedure for information integration in SICAM consists of manual processes involving interactions between health providers (users) and the system (SICAM managers). Users enter records of all procedures related with CC screening and follow-up of cases (hrHPV, cytology, colposcopy and biopsy). The information connects online and updates the system in real time from the site where the cervical samples are collected. The SICAM managers implement information validations continuously and are responsible for the final information to users.
Sometimes manual processes result in certain inconsistencies attributable to human error, such as from the user entering data and the patient providing information. Such errors occur (naturally) in large volume information system. Therefore, data cleaning will be a challenge which we will address by applying appropriate algorithms to minimize record duplicity.
Information and Communication Technologies (TIC) in SICAM represents another important challenge due to the more than 300 GB of available information. This large volume requires a high-performance technological infrastructure. The use of TIC in SICAM is necessary due to its various automatized processes that help guarantee information reliability.
In this context, the SICAM information collection and integration framework consists in properly clarifying data input and output based on the information management model in accordance with the current health information systems regulation (NOM-035-SSA3-2012 and NOM-024-SSA3-2012). Information technologies (IT) have been implemented as well to allow access, production, safety, treatment and communication of the information itself so as to guarantee informed decision-making. Figure 2 shows both physical and logical models, which uses state-of-the-art technology to ensure that information remains secure online. The value of this information increases as long it is available and accessible to the authorities responsible for decisions related to health programs.
We will process SICAM information in a virtual space with a basic execution unit of one processing core. Each calculation standard node has 24 cores (2 Intel Xeon E5-2680 v3 sockets at 2.5 Ghz with 12 cores per socket) and 128 GB of shared RAM.
Study design and population
We will analyze the information related to cervical cancer in SICAM during the period from 2008 to 2018 in users of Seguro Popular. There are 28 million records related to cervical cancer in the SICAM database; 21 million correspond to cytology-based screening, and 7 million correspond to hrHPV-based screening. The SICAM administrators will provide us only with variables directly related to the study objectives; thus, preventing access to confidential data. We will establish permanent data cleaning algorithms to avoid duplicates to the extent possible. We will conduct a quasi-experimental design to compare and contrast both strategies of CC screening in real life conditions where the CC screening randomization is impossible to carry out in population level.22,23,24,25
After applying eligibility criteria, we expect to carry out the statistical analyses in a sample of approximately 15 million records within the database. Such registries include results for hrHPV, cervical cytology, colposcopy and biopsy, as well as a referral of confirmed cases to oncological centers. We will use complete records of patients who underwent screening and evaluation for diagnosis confirmation. Specifically, we will use complete results of hrHPV as well as data available on cytology, colposcopy and biopsy to meet our objectives. We will not include records with missing or pending test results. We will analyze separately records identified as repeated measures. Figures 3 and 4 provide details of the study design.
In the first phase, we will generate information about the prevalence of HPV infection and trends over time by age group and geographical zone. To estimate hrHPV infection prevalence, we will analyze all hrHPV test records so as to carry out descriptive analysis over the study period. The analysis will be adjusted for covariates. We also expect to determine the hrHPV frequency trend over time and its variations in accordance with the degree of municipal marginalization, depending on sociodemographic and obstetric-gynecologic factors; we provide details in figure 3.
In the second phase, we will compare the performance of hrHPV with cytology triage and conventional cytology. To estimate the relative sensitivity analysis, two groups of women will be analyzed: 1) hrHPV with cytology triage, and 2) conventional cytology. The purpose will be to compare the amount of positive cases undergoing colposcopy for diagnostic confirmation by each strategy. Patients will be reclassified after this assessment and in accordance with the biopsy results; only those results confirmed by histopathology will be considered cases in each screening strategy. Positive results to CIN 2+, which were referred for treatment according to local clinical guidelines.
Given that two different groups will be analyzed, we plan to guarantee the comparability between hrHPV with cytology triage and conventional cytology by propensity score matching, using age, collection date, degree of municipal marginalization and geographical zone as matching variables. We will adjust for the remaining (unbalanced) characteristics in statistical models.
In addition, we expect to estimate variations in relative sensitivity by stratifying for age, geographical zone and degree of municipal marginalization. Identifying these variations may contribute to understanding the performance of NCSP in specific groups. Figure 4 shows details of the approach for this analysis.
These results could demonstrate the benefit of hrHPV testing in a population-based setting and under uncontrolled conditions. Furthermore, we expect this study to contribute to other ongoing studies in the region which are evaluating the performance and cost-effectiveness of different triage tests to detect cervical CIN2+ and the impact of HPV vaccination in a hrHPV based primary screening program to extend the screening interval.26,27,28
Unfortunately, not all patients who underwent colposcopy adhered adequately to the program. Patients who failed to undergo colposcopy are considered as lost to follow-up. This lack of information will be considered as a possible source of bias.29 This constitutes one of the greatest problems and may result in program failure.5 Another cause of failure in screening programs is delayed and inadequate reporting of screening results to patients and providers. However, we expect the loss to follow-up to be non-differential with respect to the screening modalities being compared.
Clinical measurements of interest (endpoints)
Table I shows the main study variables. Primary endpoints will be CIN 2+, cytology, colposcopy and biopsy with positive results. Other important variables will be age, geographical zone and municipal marginalization, and age at first sexual intercourse.
Positive hrHPV: cases with a positive result on hrHPV-based screening, regardless of the used DNA extraction method. Results are reported as positive, negative and some cases invalid samples.
Positive cervical cytology: ASCUS (Atypical Squamous Cells of Undetermined Significance) or a more severe result in cytology. The guidelines recommend a systematically ongoing monitoring of these patients.30 All cytology results of ASCUS or more severe patients must undergo a colposcopy for diagnostic confirmation.
Colposcopy: the evaluation will be carried out according to the local clinical criteria. Typically, the biopsy specimen is collected on the cervical lesion with the worst apparent severity.
Positive biopsy: CIN 2+ in pathology results, referred for diagnostic confirmation due to having a cytology ≥ ASCUS. Such patients will be considered as cases.
Standard criteria for quality control of screening strategies included in SICAM | ||
Criteria |
Procedure |
|
HPV results |
The possible results of hrHPV testing, regardless of the extraction technique are: |
|
Negative |
||
Positive |
||
Invalid. When the sample is inadequate due to the lack of beta-globin. |
||
Timeline of cervical cytology |
First cytology |
|
3-year follow-up |
||
Subsequent cytology or triage |
||
Complementary hrHPV result |
||
Cervical sample characteristics |
Evaluation of sample quality: |
|
Adequate |
||
Inadequate |
||
Cervical cytological diagnosis |
The Richart nomenclature will be used.30 Categories will be created in accordance to the result, identifying whether it is for screening or triage. |
|
=ASCUS+ Atypical Squamous Cells of Undetermined Significance results will be referred to colposcopic evaluation |
||
Standard criteria of quality control in the evaluation and diagnostic confirmation | ||
Criteria |
Procedure |
|
Colposcopy assessment |
Assessment will depend on the transformation zone visualization: |
|
Satisfactory when a full visualization is available. |
||
Non-satisfactory when it is possible to have partial visualization or No visualization of the transformation zone. |
||
Colposcopy diagnosis |
Clinical colposcopic evaluation carried out during cervical exploration. Sample collection for histology will depend on the existence of detected cervical lesions. |
|
Histological diagnosis |
The Richart nomenclature will be used to create categories.30 |
|
Others covariates employed in study | ||
Risk factors |
Procedure |
|
Obstetric-gynecologic |
Basic information obtained from the cervical sample collection survey which is related to the use of birth control, hormonal medication and sexual history of the patient. |
|
Age |
Age in years |
|
Level of Education |
Elementary, middle, high or higher |
|
Marital status |
Reported marital status within the categories: |
|
Single, Married, Other |
||
Municipal marginalization |
Marginalization index, based on development indicators, describing the conditions of those residing in municipalities which form the national territory. Obtained from the 2010 population census. |
|
Geographic zones |
Categorization by 5 geographic zones through which 32 states are to be classified: Northwest, Center North, Center, Center South, South. |
|
Municipality |
Categorization by 2456 municipalities within 32 states. Obtained from the 2010 population census. |
SICAM (Spanish acronym): Woman Cancer Information System
ASCUS: Atypical Squamous Cells of Undetermined Significance
Additional information sources
Marginalization
According to Census of Population and Housing, degree of marginalization is formed by four indicators (education, housing, population distribution and income due to work) and is available in different scales and at different levels.31,32 On this study, we will use public information about the marginalization index at municipal level, which is categorized as very high, high, mid, low and very low. “Very high” category represents the lowest degree of marginalization.
Geographical zone
According to Instituto Nacional de Estadística y Geografía (INEGI) geographical statistics.33 The following geographical zones are proposed to describe the prevalence of hrHPV infection. We chose this organization because we hypothesize that the zones are relatively homogeneous in their socio-demographic, economic, and cultural characteristics which in turn may be related to the hrHPV infection.
Zone 1. Northwest: Baja California, Baja California Sur, Chihuahua, Sinaloa, Sonora, Coahuila, Nuevo León, Tamaulipas
Zone 2. Center-North: Aguascalientes, Colima, Durango, Guanajuato, Jalisco, Nayarit, San Luis Potosí y Zacatecas
Zone 3. Center: Ciudad de México, Edo. de México,
Zone 4. Center-South: Guerrero, Hidalgo, Michoacán, Morelos, Puebla, Querétaro y Tlaxcala
Zone 5. South-Southeast: Campeche, Chiapas, Oaxaca, Quintana Roo, Tabasco, Veracruz, Yucatán.
Statistical analysis plan
We will estimate the prevalence of hrHPV infection and CIN 2+ for the period from 2008 to 2018. We will compare the prevalence of hrHPV infection by socioeconomic characteristics, including health insurance status, as well as by year of diagnosis and degree of marginalization of the municipality of residence. To examine the hrHPV infection prevalence trend over time and its variations depending on sociodemographic and obstetric-gynecologic factors and the degree of municipal marginalization, we will use multilevel logistic regression models. The first level will be individual, and the second level will be municipal.
These regression models are justified since the response variable is dichotomous and women from the same municipality share common measurable and unmeasurable characteristics which influence hrHPV infection prevalence.34 The specification of the model will be the following for hrHPV, and we will proceed accordingly for the CIN2+ model:
Where:
Yij is an indicator for the presence of hrHPV
αj is the random intercept at municipal level
X’1ij is the vector of sociodemographic characteristics of women
X’2ij is the vector of obstetric-gynocologic characteristics of women
X’3ij is the degree of municipal marginalization
Tij is the year of diagnosis
єij, is the error term
Odds ratios (ORs) obtained with this model will be the estimators of adjusted prevalence ratios of hrHPV. Furthermore, in order to explore heterogeneous effects, we will evaluate the interaction between year of diagnosis and sociodemographic and obstetric-gynecologic factors of women as well as municipal marginalization. We assume that woman whose information is not available in the SICAM database have on average the same characteristics than women included in our analyses. We also assume that the mechanism by which we obtained the information is similar to a random sampling of women to whom the program is targeted.
We will evaluate the benefit of introducing the hrHPV testing in NCSP by means of testing the relative sensitivity of hrHPV testing with cytology triage compared with conventional cytology, using a procedure based on those described by Pepe.35 Briefly, we will estimate the ratio between the detection rates of CIN 2+ by hrHPV with cytology triage and conventional cytology. For the hrHPV with cytology triage strategy, we will define the detection rate of CIN 2+ as cases confirmed by biopsy and detected by hrHPV with cytology triage divided by all women with positive cytology and diagnosed with HPV who underwent colposcopy for diagnosis confirmation. For the conventional cytology strategy, we will define the detection rate of CIN 2+ as cases confirmed by biopsy and detected by cytology only divided by all women with positive cytology who underwent colposcopy for diagnosis confirmation.
We will employ a modified multilevel Poisson regression model to estimate relative sensitivity adjusting for individual and municipal characteristics of women. In the first level, we will include individual characteristics such as age, number of pregnancies and geographical zone. In the second level, we will include degrees of municipal marginalization. We will employ this model because the outcome measure will be the number of CIN 2+ confirmed cases and because women from the same municipalities tend towards similar measurable and non-measurable characteristics.35 In this regression model, the main explanatory variable will be the following screening strategy:
Where:
λij is the detection rate of CIN2+
αj is the random intercept at municipal level
Sij is an indicator for the strategy of detection
Mij is an indicator of effect modifier
X’1ij is the vector of sociodemographic characteristics of women
X’2ij is the vector of obstetric-gynecologic characteristics of women
X3ij is the degree of municipal marginalization
Tij is the year of diagnosis
єij, is the error term
The ratios of incidence rate obtained with this model will be estimators of the adjusted relative sensibility of hrHPV and cytology as compared against with conventional cytology. In addition, with the purpose of evaluating if the sensitivity to detect CIN 2+ varies according to age, geographical zone, and degree of municipal marginalization, we will include interaction terms between the indicator for the strategy of detection (Xlij) and these variables.
Another approach to comparing the performance of hrHPV with cytology triage and conventional cytology will be to select the analytical sample by matching. In this approach, we will obtain a set of matches for each women who participated in the hrHPV with cytology triage strategy from the pool of women who underwent conventional cytology only. We will select the matches based on propensity score matching using age, date of sample collection, geographical zone and degree of municipal marginalization as variables. As a balanced sample is expected within this approach, we will not adjust for covariates in regression models.
Partial verification is a variety of selection bias that could occur in screening test studies due to only positives cases being verified by gold standard (biopsy).36 Negative cases will not have more invasive procedures for verification of their disease status because this may be inefficient and unethical.37 The negative test results are differentiated by follow-up in the NCSP.38,39 It is also possible that the rate of completed colposcopy referrals may differ between screening strategies. We will use the method proposed by Begg and Greenes to correct the partial verification bias.40
Ethics and biosafety considerations
We will use the information of women recorded in the SICAM database. It is considered that this is minimal risk research, therefore the use of informed consent is not required to access the information. Biological samples will not be obtained or processed, therefore this protocol does not entail biological risk. The data will be accessed and analyzed only by those responsible for this research. The data used in this analysis will not be traceable to women’s personal information as recorded in SICAM. All procedures used in this protocol will be evaluated by the Institutional Review Board of the National Institute of Public Health.
Conclusion
In summary, we expect that the results of this research project will generate scientific evidence on the effectiveness of the introduction of hrHPV screening in a population-based setting under real-world deployment conditions. This information will also provide evidence about NCSP function and whether its strategies should be improved or removed. Authorities of health programs may use these results to gain a comprehensive perspective of the program and its information system in regards to of the benefit of introducing hrHPV testing in the framework of the NCSP.