Introduction
Isostichopus fuscus Ludwig 1875, is a commercially important species due to its high demand and price in Asian markets, where it is considered a delicacy. It is distributed in the eastern central Pacific coast from the Gulf of California, Mexico, to Ecuador including the Galapagos Islands (Maluf, 1988). Its shallow habitat and sedentary habits make this sea cucumber easily accessible to fishing efforts. Populations of I. fuscus have severely declined due to overfishing in México (Ibarra & Soberón, 2002) and Ecuador, including Galápagos (Toral-Granda, 2008). Consequently, I. fuscus has been included as endangered in the IUCN Red List. Despite its importance, the genetic status of its populations has been little studied, with only a handful of samples having been analyzed throughout its distribution (Lohr, 2003), and in Mexico it remains unknown. Here, we present the first I. fuscus species-specific polymorphic microsatellite loci identified from genomic data produced with next generation sequencing (NGA, Illumina), and use them to test genetic structure in the northern Gulf of California. These markers will prove useful for a variety of genetic studies.
Materials and methods
DNA extraction from the tentacles of 2 organisms collected in La Paz, Mexico (24°44’48.58” N, 110°40’36.27” W) was performed with lithium chloride protocol (Hong et al., 1995). De novo assembly of 84,843,094 reads obtained from NGS massive parallel sequencing (HiSeq 2500 Illumina) resulted in a total of 581,221 contigs averaging 898 base pairs (bp) in length. Contigs were processed with MSATCOMMANDER (Faircloth, 2008) searching for potential di- to hexa-nucleotide microsatellite loci with ≥ 6 repeats and enough flanking regions for primer design. A total of 11,224 microsatellite loci were obtained, and unique primers were designed for 4,920 of them. Subsequently, di-, tri- and tetra-nucleotide microsatellite loci were selected based on coverage (number of reads stacked per residue in contig) ranging from 10 to 100 and the number of motif repeats ≥ 10, to increase chances of high polymorphism in loci and their usefulness for population genetics studies (Gardner et al., 2011); producing 29 candidate microsatellites (15 di, 8 tri and 6 tetra-nucleotide) loci to be tested for amplification success and polymorphism.
DNA was extracted from tentacles and tube feet of 22 organisms from Bahía de los Ángeles (BLA) (24°44’48.58” N, 110°40’36.27” W), Mexico and from 19 organisms from San Felipe (SF) (29°48’18” N, 114°22’19” W), Mexico using PureLink ™ Genomic DNA Mini Kit (Invitrogen). The M13 primer sequence (TGTAAAACGACGGCCAGT) was attached to the 5’ end of each forward primer in order to use fluorescent dyes (FAM, NED, VIC, PET) following Schuelke (2000). PCR conditions consisted of 1 min initial denaturation at 94 °C, 30 cycles of 30 s denaturation at 94 °C, 30 s annealing at 63 °C (for all loci) and 30 s elongation at 72 °C, with 15 min elongation at 72 °C at the end of the last cycle. A second PCR was performed replacing the forward primer with FAM, NED, VIC or PET marked M13 primers, using the same conditions but an annealing temperature of 53 °C. Amplification products were subject to fragment analysis in an ABI Prism 3100 genetic analyzer (Applied Biosystems) and genotyped using GeneMarker v. 2.4.0 (Holland & Parson, 2011).
We used organisms from BLA to fully characterize microsatellite polymorphism in all loci, where the number of alleles per locus was evaluated with MStools (Park, 2001). Observed (HO), and expected heterozygosities (HE), Hardy-Weinberg equilibrium (HWE) and linkage disequilibrium (LD) were assessed using ARLEQUIN v. 3.5 (Excoffier et al., 2005) and heterozygote deficiency (Fis) with GENEPOP v. 4.4 (Rousset, 2008). Micro-checker v. 2.2.3 (Van Oosterhout et al., 2004) was used to infer the presence of scoring errors, large allele dropout and null alleles.
Finally a subset of 8 microsatellites (Ifus2-01, Ifus2-06, Ifus2-11, Ifus2-14, Ifus3-06, Ifus4-01, Ifus4-03 and Ifus4-06) was used to assess genetic diversity (NA, HO, HE and HWE) in SF, and its genetic differentiation (φst) with BLA with an analysis of molecular variance (Amova) using ARLEQUIN v. 3.5 (Excoffier et al., 2005).
Results
Optimization was possible for nineteen (di-, tri-, and tetra-nucleotide repeats) microsatellite loci (Table 1). High polymorphism was evident in all loci, showing a number of alleles from 5 to 22, with an average of 10.2. Heterozygosities ranged from 0.35 to 1 (HO) and from 0.56 to 0.97 (HE). After Bonferroni correction, 4 loci (Ifus2-05, Ifus2-06, Ifus3-02 and Ifus4-04) significantly deviated from HWE, and the pair Ifus2-01- Ifus2-03 showed to be potentially linked. Scoring errors and large allele dropout were undetected; however, evidence of null alleles was found in 6 loci.
Locus | Primer sequences (5' - 3') | Repeat motif | Size range (bp) | Dye | N | NA | HO | HE | PHW | Fis | Null |
---|---|---|---|---|---|---|---|---|---|---|---|
Ifus2-01 | F: AGCAGTTTCGGATGGTTGAG | (AC) 12 | 139-153 | FAM | 20 | 7 | 0.70 | 0.67 | 0.90 | -0.05 | -0.04 |
R: GTGACCGCGCCTTAATAACC | |||||||||||
Ifus2-02 | F: GCGCTTTCTAACACCTCGAC | (AC) 11 | 444-460 | NED | 19 | 7 | 0.68 | 0.77 | 0.01 | 0.11 | 0.02 |
R: AACCTTTGTATTCACGCGGG | |||||||||||
Ifus2-03 | F: AACCATCGTGATTCACCGTG | (AC) 11 | 440-468 | VIC | 20 | 10 | 0.95 | 0.86 | 0.02 | -0.11 | -0.08 |
R: TATGTCAAAGCCTCCCTCCC | |||||||||||
Ifus2-04 | F: AAGCGTCCATCATTGCTACC | (AC) 10 | 290-304 | VIC | 22 | 6 | 0.50 | 0.56 | 0.24 | 0.11 | 0.02 |
R: CGATTGTGTGGTACGGAAGC | |||||||||||
Ifus2-05 | F: TCTGATGGAGCTGGTCTAGG | (AG) 14 | 448-474 | VIC | 22 | 9 | 0.77 | 0.87 | <0.001* | 0.11 | 0.04 |
R: CGGTGGTTCTGAATCTGCAC | |||||||||||
Ifus2-06 | F: TGGTTCCCAGAAGATGAGTGG | (AG) 13 | 374-386 | NED | 20 | 6 | 0.35 | 0.64 | <0.001* | 0.46 | 0.21* |
R: TGACCAAGCCAGCCTATGAC | |||||||||||
Ifus2-07 | F: ACTGACTTACGTACATGCCAG | (AG) 10 | 441-453 | FAM | 22 | 7 | 0.91 | 0.81 | 0.06 | -0.13 | -0.09 |
R: ACAGAGCACGCGTAATGAGG | |||||||||||
Ifus2-10 | F: TGCCAACTAATTTCACGGGC | (AT) 10 | 392-402 | PET | 22 | 5 | 0.55 | 0.61 | 0.39 | 0.11 | 0.05 |
R: GGGAGGGCTGAGATTTGTAAG | |||||||||||
Ifus2-11 | F: TCACAATGATGCATGCGGTG | (AG)10 | 286-296 | PET | 20 | 6 | 0.40 | 0.57 | 0.06 | 0.31 | 0.15 |
R: TCATTACAACACTGGCAACCG | |||||||||||
Ifus2-14 | F: TTTACAGCAATCGTCACGCC | (AC)12 | 360-386 | VIC | 20 | 10 | 0.80 | 0.84 | 0.88 | 0.05 | 0.01 |
R: CACATTCATCTGGGCACGTG | |||||||||||
Ifus3-01 | F: AGTGCAGTGTACTCTCAGGC | (AGG)11 | 238-262 | NED | 20 | 9 | 0.65 | 0.88 | 0.004 | 0.26 | 0.12* |
R: TGGGTTTCCTGCTAAGCTTC | |||||||||||
Ifus3-02 | F: TCTTCGTCCTCTCGCTCAAC | (ATC)21 | 361-412 | FAM | 17 | 13 | 0.47 | 0.94 | <0.001* | 0.50 | 0.24* |
R: TGACGATGATGAAGGTGGCC | |||||||||||
Ifus3-04 | F: AATGGGTGTATTTGCCGCTG | (ATC)20 | 295-352 | FAM | 21 | 15 | 0.62 | 0.92 | 0.003 | 0.33 | 0.16* |
R: TCGCTTGCACCATCATTGTC | |||||||||||
Ifus3-06 | F: TGTGTTTGGCCGAGTTCAAC | (ATC)11 | 170-233 | NED | 19 | 10 | 0.68 | 0.82 | 0.08 | 0.17 | 0.07 |
R: AGCAGTAAGGTTGGTCTGAGG | |||||||||||
Ifus4-01 | F: CGACGTCACTATGCCACAAG | (ACAG)11 | 353-389 | NED | 20 | 9 | 0.60 | 0.85 | 0.04 | 0.30 | 0.14* |
R: CACACTGCTCTGACGTTCAC | |||||||||||
Ifus4-02 | F: GGGAGGCCAATCAGATGTTG | (ACAG)18 | 403-447 | VIC | 15 | 12 | 0.73 | 0.91 | 0.23 | 0.20 | 0.08 |
R: CATGTTTCAAGATGGCAACCC | |||||||||||
Ifus4-03 | F: ACAAAGCAGGCATCACAAAC | (ACAT)12 | 273-389 | VIC | 20 | 22 | 1.00 | 0.97 | 1.00 | -0.03 | -0.03 |
R: AGGACCAATGAAACCACTGTG | |||||||||||
Ifus4-04 | F: ACTGCATATAGAACGCGTGC | (AGAT)18 | 396-544 | VIC | 12 | 15 | 0.42 | 0.97 | <0.001* | 0.58 | 0.27* |
R: TGAGGTAAATTTGGGCCAAGG | |||||||||||
Ifus4-06 | F: ACGAGTTTACAGAGGTGCCC | (AGAT)18 | 186-278 | FAM | 20 | 16 | 0.90 | 0.91 | 0.12 | 0.02 | 0.00 |
R: CTGTCAATCCATCTGTCTGCC |
Dye, fluorescent dye of M13 primer; N, number of individuals with successful amplification; NA, number of alleles; HO, observed heterozygosity; HE, expected heterozygosity; PHW, P-value for Hardy-Weinberg equilibrium (* denotes significant deviation after Bonferroni correction, p < 0.0026); FIS, heterozygote deficiency; Null, null allele frequency (* inference of null alleles). GenBank accession numbers KX525270 to KX525288.
The 8 loci assessed in samples from SF showed levels of variation similar to BLA. The number of alleles ranged from 2 to 19 (9.5 average). Heterozygosities varied from 0.23 to 0.83 (HO) and from 0.32 to 0.95 (HE). Loci Ifus2-14 and Ifus4-01 deviated from HWE, showing heterozygote deficiency associated to null alleles and couples Ifus2-01 - Ifus2-11, Ifus3-06 - Ifus4-03, Ifus2-01 - Ifus2-06 and Ifus3-06 - Ifus4-01 presented potential LD. The Amova results showed no differentiation between BLA and SF localities with only 0.2 % of genetic variance related to interpopulation level (Table 2).
Discussion
Optimization was possible for 19 (di-, tri-, and tetra-nucleotide repeats) microsatellite loci and involved designing multiplexing panels of loci using the same annealing temperature, producing different amplicon sizes, as well as combining a variety of fluorescent dyes (Table 1).
The high genetic diversity and the absence of genetic differentiation between BLA and SF suggest the existence of a large panmictic population in the region and the presence of high connectivity in the recent evolutionary history. Despite the decrease of population density of I. fuscus in this area in the past 2 decades, genetic connectivity could have been sustained by the short distance between locations (≈150 km) and the highly dynamic circulation in the Ballenas Channel and the northern Gulf of California (Lavín & Marinone, 2003).
A broader evaluation considering a larger study area and several molecular markers is needed to determine the genetic status of I. fuscus in the Gulf of California. The new microsatellite markers reported herein will be useful for future population genetics analyses of this endangered and economically important sea cucumber along its distribution.