Brassica vegetables, including bok choy, cabbage, cauliflower, collards, broccoli, Brussels Sprouts, kale, kohlrabi, rutabagas, and turnips are popular around the.
The closely related species Brassica rapa and B. Oleracea encompass a wide range of vegetable, fodder and oil crops. The release of their reference genomes has facilitated resequencing collections of B. Oleracea aiming to build their variome datasets. These data can be used to investigate the evolutionary relationships between and within the different species and the domestication of the crops, hereafter named morphotypes. These data can also be used in genetic studies aiming at the identification of genes that influence agronomic traits.
We selected and resequenced 199 B. Rapa and 119 B. Oleracea accessions representing 12 and nine morphotypes, respectively.
Based on these resequencing data, we obtained 2,249,473 and 3,852,169 high quality SNPs (single-nucleotide polymorphisms), as well as 303,617 and 417,004 InDels for the B. Oleracea populations, respectively. The variome datasets of B. Oleracea represent valuable resources to researchers working on evolution, domestication or breeding of Brassica vegetable crops. Design Type(s)species comparison design.
genetic structural variation analysis objectiveMeasurement Type(s)DNA sequence variation detectionTechnology Type(s)whole genome sequencingFactor Type(s)SubspeciesSample Characteristic(s)Brassica rapa subsp. Pekinensis. Brassica rapa subsp. Rapa. Brassica rapa subsp. Chinensis. Brassica rapa var.
Parachinensis. Brassica rapa var. Purpuraria. Brassica rapa subsp. Oleifera.
Brassica rapa. Brassica rapa subsp. Narinosa. Brassica rapa var. Perviridis. Brassica rapa subsp. Nipposinica.
Brassica oleracea var. Capitata. Brassica oleracea var.
Gongylodes. Brassica oleracea var. Botrytis.
Brassica oleracea var. Italica. Brassica oleracea var. Alboglabra.
Brassica oleracea. Brassica oleracea var. Gemmifera(ISA-Tab format). Many important crops have been domesticated and are cultivated from the genus Brassica, including those used as oilseeds, condiments, fodder, or vegetables. The ‘triangle of U’ describes the relationships between the six economically important Brassica species. Three of the six species are diploids (A genome, Brassica rapa, n=10; B genome, B. Nigra, n=8; and C genome, B.
Oleracea, n=9), while the other three species are allotetraploids resulting from interspecific hybridization between the diploids (AB genomes, B. Juncea, n=18; AC genomes, B. Napus, n=19; BC genomes, B. Carinata, n=17). These different crops are characterized by their specialized development of organs, and are referred to as morphotypes. Interestingly, similar morphotypes are often selected for in two or more Brassica species, clearly illustrating convergent crop domestication. This includes the leafy heads in Chinese cabbage ( B.
Rapa) and cabbage ( B. Oleracea) and tubers at stems or hypocotyls/roots in kohlrabi’s ( B.
Oleracea), swede’s ( B. Napus) and turnips (B. Rapa), enlarged stems in marrow stem kale ( B. Oleracea) and also broccoletto ( B. Rapa), enlarged inflorescences in cauliflower and broccoli’s ( B. Oleracea), the many axillary shoots in Brussels sprouts ( B.
Oleracea) and mizuna/mibuna’s ( B. Rapa) and the increased numbers of enlarged seedpods in oilseed ( B. Napus).In recent years, the genomes of B.
Oleracea have been assembled and released –. Brassicaceae comparative genomics provided 24 genomic blocks (GBs) as the framework for genome studies in Brassicas. With the GBs system, the comparative genomic analysis of B.
Oleracea, and A. Thaliana revealed the whole genome triplication (WGT) event that is shared by all Brassicas –, and made it possible to deduce and reconstruct the diploid ancestor for the Brassiceae tribe. Further studies reported the phenomenon of sub-genome dominance among the three sub-genomes in Brassicas, and found that the biased distribution of small RNA-targeted TEs plays an important role in this phenomenon.
Furthermore, the release of the genome sequences has also contributed to the mapping, cloning and functional studies of agronomic genes in Brassica crops, such as two clubroot resistance genes, two blackleg resistance genes, as well as two key genes regulating the time of flowering –. The availability of these reference genomes also made it possible to build variome datasets by resequencing populations of B. Oleracea, to investigate domestication of the diverse morphotypes.Here, we selected 199 B. Rapa and 119 B. Oleracea accessions representing the 12 and nine morphotypes from these two species, respectively (, ).
These accessions originate from a wide geographic range, from North till South Asia and Europe. Whole genome resequencing data of these 199 B. Rapa and 119 B.oleracea accessions was generated using the Illumina HiSeq2000 platform, with paired-end reads from 350 bp insert libraries. After filtering low-quality or duplicated reads, totally, we produced 982.87 Gb resequencing data for the 199 B. Rapa accessions and 742.35 Gb for the 119 B. Oleracea accessions, which is about on average 10× coverage for accessions of both B.
Oleracea, ranging from 1.67× till 19.8× coverage. These reads were then mapped to the reference genomes of B. Rapa version V1.5 and B. Oleracae V1.0, respectively.
Finally, we called out 2,249,473 and 3,852,169 reliable SNPs (single-nucleotide polymorphisms), as well as 303,617 and 417,004 InDels for the B. Oleracea populations, respectively.
With these datasets, we have identified genomic selection signals for traits of leaf-heading and tuberous organs in both B. Oleracea, and found that sub-genome parallel selection was associated with morphotype diversification and convergent crop domestication in the two Brassica species. These methods are expanded versions of descriptions in our related work. Sample collection of B. OleraceaAccessions representing the 12 morphotypes of B. Rapa and nine morphotypes of B.
Oleracea were collected ( and ). Almost all morphotypes from the two Brassica species were included in this collection. Rapa accessions included 46 Chinese cabbage accessions ( B. Pekinensis) , 54 turnip accessions ( B. Rapa), 25 pak choi accessions ( B. Chinensis), 30 caixin accessions ( B.
Parachinensis), 13 zicaitai accessions ( B. Chinensis var. Purpurea Bailey), 14 oil seed accessions ( B. Oleifera), four taicai accessions ( B. Chinensis var. Tai-tsai Lin), seven wutacai accessions ( B. Narinosa), one edible flower accession ( B.
Broccoletto) and one yellow sarson ( B. Tricolaris), as well as two accessions each of komatsuna ( B. Perviridis) and mizuna ( B. Rapa population includes 75 DH (doubled haploid) lines, 18 inbred lines, and 106 germplasm lines. Oleracea accesions included 45 cabbage accessions ( B. Oleracea var.
Capitata) , 19 kohlrabi accessions ( B. Oleracea var. Gongylodes), 20 cauliflower accessions ( B.
Oleracea var. Botrytis), 23 broccoli accessions ( B. Oleracea var. Italica), four Chinese kale accessions ( B. Oleracea var.
Alboglabra), as well as two accessions each of kale ( B. Oleracea var. Acephala), Brussels sprouts ( B. Oleracea var.
Gemmifera), curly kale ( B. Oleracea var. Sabellica), and wild B. Oleracea population included 69 DH or inbred lines, 44 germplasm lines (inbred lines from genebank accessions), and six genebank accessions. We have grown all the 199 B. Rapa accessions with three replicates in the green house during autumn 2014 to confirm their morphotypes.
The phenotypes of these accessions were investigated till their maturity. Sample preparation and resequencingPlants of the 199 B. Watch white noise. Rapa accessions were grown in a greenhouse, each accession with five replicates. At the six-leaf stage, the two youngest leaves were collected from one of the five plants and DNA was extracted from these leaves. Oleracea, the DNA of the 44 germplasm lines and six genebank accessions was extracted as described for B. However for the DH and inbred lines, 50–100 seedlings per genotype were grown on moist filter paper.
Cotyledons and hypocotyls were harvested after 12 days and DNA was then extracted.DNA libraries with approximately 350 bp insert sizes were constructed following the manufacturer’s instructions (Illumina GAII) and paired-end resequencing reads were generated by commercial Illumina HiSeq 2000 service provided by Biomarker Technologies Corporation, Beijing, and BGI, Shenzhen. Raw data were filtered before alignmentWe filtered the raw reads before alignment to the corresponding reference genomes of B. First, low quality reads were filtered based on the following three rules. If one end of a paired-end read had 5% ‘N’ bases, then the paired-end read was removed. Secondly, for each paired-end read pair, if one of two reads had an average base-quality less than 20 (Phred-like score), then both reads were removed.
Thirdly, for each read, we trimmed its 3’ bases if the quality scores are below 13. The trimming was stopped at the base with quality score ≥13. After trimming, if the remaining read was less than 40, the paired-end reads were removed. Moreover, considering that the duplicates of paired-end reads were generated from a single amplicon, we further removed redundant duplicated reads and only kept a single pair. We completed this raw read filtering process using an in-house made Perl script. Alignment and variants callingFiltered reads of each re-sequenced sample were mapped onto the corresponding reference genomes using the software Burrows-Wheeler Aligner (BWA version 0.7.5a-r405). We used the B.
Rapa genome version 1.5 and B. Oleracae V1.0 as the references for the two species. The reference genomes were first indexed by using commands as ‘bwa index reference.fa’ and ‘samtools faidx reference.fa’. The clean reads of each accession were then mapped to the indexed reference genomes one by one with the algorithm ‘mem’ of BWA. The command line was ‘bwa mem reference.fa sample1.fq sample2.fqsample.sam’.
It generated a sam file ‘sample.sam’ as the mapping output for each accession. This sam file can be handled by Samtools to call variants.Samtools was used to call SNPs and InDels for each resequenced Brassica accession from the mapping results reported by BWA.Samtools (version 0.1.19–44428 cd) was first used to transfer the sam file to bam file by command ‘samtools view -bS sample.sam sample.bam’. The bam file was then sorted by ‘samtools sort sample.bam sample.sorted.bam’. After index the sorted bam file with ‘samtools index sample.sorted.bam’, candidate genomic variants were called out by using the algorithm ‘mpileup’ of Samtools. The full command was ‘samtools mpileup -q 20 -Q 30 -ugf reference.index sample.sorted.bam bcftools view -p 0.9 -cg -candidateVariants.list’. We set ‘-q 20’ to use nucleotides with Phred-like quality scores higher than 20 as reliable nucleotides of a read to report variants.
‘-Q 30’ denotes that mapping quality of reads should be higher than 30 to be considered as a reliable mapping. Bcftools was used to transfer the vcf file generated by ‘mpileup’ and reported candidate variants in an output file.
We used ‘-p 0.9’ to ask Bcftools to report variants at a locus with more than 10% reads showing a different genotype to that of the reference.We further filtered the candidate variant list for reliable variants. For each accession, we screened its variations one by one. Here, we called the genotype that is the same as the reference ‘reference allele’, while the one that is different to the reference as ‘alternative allele’. For each variant, only alleles that were covered by sufficient reads (≥3 reads) were considered as confident alleles, and the variant was kept as a reliable variant. We performed this process for all Brassica accessions.
This removed potentially false variants produced by sequencing errors. Since most of the Brassica samples (except six genebank accessions of B. Oleracea) are from homozygous or almost homozygous accessions, we filtered our data further by retaining the variant calls that were homozygous in individual homozygous accessions (loci were considered as heterozygous if 0.2D R/(D A+D R)0.8, where D R denotes the number of reads with the reference allele, and D A denotes the number of reads with the alternative allele). We developed in-house Perl scripts to complete the candidate variants filtering processes. In order to remove rare SNPs and use common SNPs to scan for selection signals, we filtered out SNPs that had a MAF (minor allele frequency). ( a) The genomic information of the B. Rapa population; i: the ten chromosomes of B.
Rapa, the physical positions are indicated in units of million bases; ii: the genomic heterozygosity of the B. Rapa population. Area charts quantify iii: functional SNPs/InDels, iv: InDels, v: SNPs. Vi: heat map for gene density. Vii: subgenome partition of the B. Rapa genome, red, green, and blue corresponding to subgenomes LF, MF1, and MF2, respectively. Viii: The triplicated 24 genomic blocks in B.
( b) The genomic information of the B. Oleracea population; ix: the nine chromosomes of B.
Oleracea; x: the genomic heterozygosity of the B. Oleracea population. Area charts quantify xi: functional SNPs/InDels, xii: InDels, xiii: SNPs. Xiv: heat map for gene density. Xv: subgenome partition in B. Oleracea genome, red, green, and blue corresponding to subgenomes LF, MF1, and MF2, respectively. Xvi: The triplicated 24 genomic blocks in B.
LF denotes the least fractionated subgenome, MF1 and MF2 denote for more fractionated sub-genomes 1 and 2. Functional annotation of genomic variantsThe variants identified in the two Brassica genomes were further annotated into different groups ( and ). According to the genomic positions of SNPs and InDels relevant to predicted gene models, we first separated them into 1) variants located at genic regions and 2) at inter-genic regions. Variants located at genic regions were further separated into three subgroups: a) variants in coding sequences (CDs), b) in introns and c) in untranslated regions (UTRs). The SNPs and InDels located at CDs (1a) were classified into two subgroups: the subgroup causing changes to the coding amino acids, including non-synonymous SNPs and frame shift InDels, and the subgroup causing no changes to the amino acids, including synonymous SNPs and InDels that do not cause frame-shifts. Intronic variants (1b) were also separated into two subgroups: causing (8 bp to the splice site) or not causing splice site mutations; UTR variants (1c) are divided into two subgroups: variants in 5′ or 3′UTR regions. The results show that among these 2,249,473 SNPs and 303,617 InDels in B.
Rapa, 161,319 SNPs (nonsynonymous and splice site) and 16,905 InDels (CDS and splice site) respectively, introduced changes to the protein sequences ; for the 3,852,169 SNPs and 417,004 InDels in B. Oleracea, 154,863 SNPs and 16,687 InDels respectively introduced changes to the protein sequences. Additionally, we analyzed the length distribution of InDels, and found that 1 bp deletions or insertions are the dominant InDels in both Brassica populations. The length distribution of InDels located in coding regions of genes was also investigated, and it was found that InDels of three or fold changes of three nucleotides are clearly the dominant types. These InDels will not introduce frame-shift mutations to genes, thus are under less stringent selection compared to InDels that don’t correspond to fold changes of three. We further investigated the genetic diversity within each morphotype group in both species with the annotated genomic variations. We performed the analysis by counting the number of polymorphic variants in each morphotype group.
The results showed that groups of turnip (54) and Chinese cabbages (46) in B. Rapa had most polymorphic loci based on both total SNPs as well as functional SNPs (non-synonymous SNPs or SNPs located at splicing sites). Interestingly, the groups of turnip and pak choi had the most polymorphic loci based on total InDels and functional InDels (InDels located at coding sequences or splicing sites). Oleracea, groups of cabbage (45) and kohlrabi (19) had both most polymorphic loci of SNPs and InDels.
The number of polymorphic variants reflects the genetic diversity in each group. However, the number of variants is also impacted by the numbers of samples studied. Code availabilityCustom perl scripts used to filter raw reads and candidate genomic variants can be downloaded through in Brassica database BRAD, or are freely available when requested by email to the authors.
Other tools used in this work including version details are BWA (version 0.7.5a-r405) and SAMtools (version 0.1.19–44428 cd). Command lines with details of parameters when using these tools are described in the method section. In order to measure the quality of variants called out from the resequencing data, we selected five SNPs , and genotyped 95 out of the 199 B. Rapa accessions for these SNPs using the method of KASP (kompetitive allele specific PCR).
With the same method, we also test the polymorphic level of five SNPs determined from 119 B. Oleracea accessions in another group of 281 B. Oleracea accessions. KASP is a homogeneous, fluorescence-based endpoint SNP genotyping platform (LGC Genomics LLC, Beverly, MA, USA). The five selected SNPs satisfied the following criteria of KASP experiments: 1) No other genomic matches were found for the 50 bp sequences flanking the two sides of each candidate SNP; 2) There are no other SNPs or InDels located at the 50 bp flanking regions of the candidate SNP. For each SNP, two allele-specific primers A1 and A2 and one common reverse primer C were designed by LGC Genomics. The PCR reaction mixture had a total volume of 5 μl and included 2.5 μl DNA, 2.5 μl 2×master mix with 0.07 μl primer mix according to the manufacturer’s guidelines.
Three no-template controls were included for each SNP locus. The amplification process was ran in Gene Amp PCR System 9700 (Applied Biosystems) using the following program: 94 °C for 15 min followed by 10 touchdown cycles of 94 °C for 20 s, 61–55 °C for 60 s (decreasing by 0.6 °C per cycle), followed by 26 cycles of 94 °C for 20 s, 55 °C for 60 s. Fluorescence detection was then performed in a 7900 HT Fast Real-Time PCR System (Applied Biosystems), and the results were analysed using SDS2.3 Software (supplied by Applied Biosystems).We further compared the genotypes of the five SNPs in the 95 samples that were called out by resequencing data, with the genotypes of these loci that were reported by the KASP experiments. Results show that all the 475 loci analyzed have consistent genotypes between the two methods of resequencing and the KASP experiment , supporting the fact that the genomic variants are of high quality. The polymorphism level of the five SNPs in B. Oleracea is shown in.
Also found in: Thesaurus, Medical, Encyclopedia, Wikipedia.
Related to brassica: Brassica juncea, Brassica rapa
bras·si·ca
(brăs′ĭ-kə)n. Any of various plants of the genus Brassica of the mustard family, including cabbage and broccoli.
[New Latin Brassica, genus name, from Latin brassica, cabbage.]
brassica
(ˈbræsɪkə) n (Plants) any plant of the genus Brassica, such as cabbage, rape, turnip, and mustard: family Brassicaceae (crucifers)
brassicaceousadj
bras•si•ca
(ˈbræs ɪ kə)n., pl. -cas.
any plant belonging to the genus Brassica, of the mustard family, including cabbage, kale, broccoli, cauliflower, turnip, and mustard.
Noun | 1. | Brassica - mustards: cabbages; cauliflowers; turnips; etc. dilleniid dicot genus - genus of more or less advanced dicotyledonous trees and shrubs and herbs Brassicaceae, Cruciferae, family Brassicaceae, family Cruciferae, mustard family - a large family of plants with four-petaled flowers; includes mustards, cabbages, broccoli, turnips, cresses, and their many relatives wild cabbage, Brassica oleracea - wild original of cultivated cabbages; common in western coastal Europe Brassica oleracea, cultivated cabbage, cabbage - any of various cultivars of the genus Brassica oleracea grown for their edible leaves or flowers Brassica oleracea italica, broccoli - plant with dense clusters of tight green flower buds borecole, Brassica oleracea acephala, cole, colewort, kail, kale - a hardy cabbage with coarse curly leaves that do not form a head Brassica oleracea gongylodes, kohlrabi - plant cultivated for its enlarged fleshy turnip-shaped edible stem Brassica rapa, turnip, white turnip - widely cultivated plant having a large fleshy edible white or yellow root Brassica napus napobrassica, rutabaga plant, Swedish turnip, turnip cabbage, swede, rutabaga - a cruciferous plant with a thick bulbous edible yellow root Brassica rapa ruvo, broccoli raab, broccoli rabe - plant grown for its pungent edible leafy shoots mustard - any of several cruciferous plants of the genus Brassica Brassica juncea, chinese mustard, gai choi, indian mustard, leaf mustard - Asiatic mustard used as a potherb Brassica rapa pekinensis, celery cabbage, Chinese cabbage, napa, pe-tsai - plant with an elongated head of broad stalked leaves resembling celery; used as a vegetable in east Asia bok choi, bok choy, Brassica rapa chinensis, Chinese white cabbage, pak choi, pakchoi - Asiatic plant grown for its cluster of edible white stalks with dark green leaves Brassica perviridis, Brassica rapa perviridis, spinach mustard, tendergreen - Asiatic plant cultivated for its swollen root crown and edible foliage black mustard, Brassica nigra - widespread Eurasian annual plant cultivated for its pungent seeds; a principal source of table mustard Brassica napus, colza - Eurasian plant cultivated for its seed and as a forage crop |
brassica
[ˈbræsɪkə]n → brassicacée f, crucifèrefWant to thank TFD for its existence? Tell a friend about us, add a link to this page, or visit the webmaster's page for free fun content.
Link to this page: