SNP haplotypes of the BADH1 gene and their association with aroma in rice (Oryza sativa L.) Anuradha Singh • Pradeep K. Singh • Rakesh Singh • Awadhesh Pandit • Ajay K. Mahato • Deepak K. Gupta • Kuldeep Tyagi • Ashok K. Singh • Nagendra K. Singh • Tilak R. Sharma Received: 5 July 2009 / Accepted: 2 March 2010 / Published online: 21 March 2010 Ó Springer Science+Business Media B.V. 2010 Abstract Betaine aldehyde dehydrogenase (BADH) is a key enzyme involved in the synthesis of glycin- ebetaine—a powerful osmoprotectant against salt and drought stress in a large number of species. Rice is not known to accumulate glycinebetaine but it has two functional genes coding for the BADH enzyme. A non- functional allele of the BADH2 gene located on chromosome 8 is a major factor associated with rice aroma. However, similar information is not available regarding the BADH1 gene located on chromosome 4 despite the similar biochemical function of the two genes. Here we report on the discovery and validation of SNPs in the BADH1 gene by re-sequencing of diverse rice varieties differing in aroma and salt tolerance. There were 17 SNPs in introns with an average density of one per 171 bp, but only three SNPs in exons at a density of one per 505 bp. Each of the three exonic SNPs led to changes in amino acids with functional significance. Multiplex SNP assays were used for genotyping of 127 diverse rice varieties and landraces. In total 15 SNP haplotypes were identified but only four of these, corresponding to two protein haplotypes, were common, representing more than 85% of the cultivars. Determination of population structure using 54 random SNPs classified the varieties into two groups broadly corresponding to indica and japonica cultivar groups, aromatic varieties clustering with the japonica group. There was no association between salt tolerance and the common BADH1 haplotypes, but aromatic varieties showed specific association with a BADH1 protein haplotype (PH2) having lysine 144 to asparagine 144 and lysine 345 to glutamine 345 substitutions. Protein modeling and ligand docking studies show that these two substitu- tions lead to reduction in the substrate binding capacity of the BADH1 enzyme towards gamma-aminobutyr- aldehyde (GABald), which is a precursor of the major aroma compound 2-acetyl-1-pyrroline (2-AP). This association requires further validation in segregating populations for potential utilization in the rice breeding programs. Keywords BADH1 Á Oryza sativa Á SNP haplotypes Á Aroma Electronic supplementary material The online version of this article (doi:10.1007/s11032-010-9425-1) contains supplementary material, which is available to authorized users. A. Singh Á P. K. Singh Á R. Singh Á A. Pandit Á A. K. Mahato Á D. K. Gupta Á K. Tyagi Á N. K. Singh (&) Á T. R. Sharma Rice Genome Laboratory, National Research Centre on Plant Biotechnology, Indian Agricultural Research Institute, New Delhi 110012, India e-mail: [email protected]Present Address: R. Singh National Bureau of Plant Genetic Resources, New Delhi 110012, India A. K. Singh Division of Genetics, Indian Agricultural Research Institute, New Delhi 110012, India 123 Mol Breeding (2010) 26:325–338 DOI 10.1007/s11032-010-9425-1
14
Embed
SNP haplotypes of the BADH1 gene and their association with aroma in rice (Oryza sativa L.)
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
SNP haplotypes of the BADH1 gene and their associationwith aroma in rice (Oryza sativa L.)
Electronic supplementary material The online version ofthis article (doi:10.1007/s11032-010-9425-1) containssupplementary material, which is available to authorized users.
A. Singh � P. K. Singh � R. Singh � A. Pandit �A. K. Mahato � D. K. Gupta � K. Tyagi �N. K. Singh (&) � T. R. Sharma
Rice Genome Laboratory, National Research Centre
on Plant Biotechnology, Indian Agricultural Research
Jaya G C G T T A A C C T T T T A T SH1 PH1 38ADT43 A C G T T A A G C T T C T A T SH2 PH1 19Basmati 370 G C A A C T G G T T C C C C T SH3 PH2 17Taraori Basmati G A A A T T G G T T C C C C T SH4 PH2 6Kalanamak 3119 G C A T T A A G C A T C C A C SH5 PH3 2Taipai 309 G C A A T T G G T T C C C C T SH6 PH2 1Jyothi G C G T C A G C C T T T T A T SH7 PH1 1Pusa 44 G C G T T A A G C T T T T A T SH8 PH1 1SKR 126 G C G T T A A G C T T C T A T SH9 PH1 1CSR 10 G C G T T T G G T T T C T A T SH10 PH4 1IR 64 G C G T T A A C C T T T C A T SH11 PH1 1Pusa 1266 G C A A C T G C T T C C C C T SH12 PH2 1Kasturi G C A T C T G G T T C C C C T SH13 PH2 1Pusa 1121 A C G T T A A C C T T C T A T SH14 PH1 1Pant Dhan 4 G C G T T A A C C T T T T C T SH15 PH5 1
Fig. 2 Haplotypes of the
BADH1 gene in 92 diverse
rice varieties based on 15
SNPs with no missing data
genotyped using Sequenom
MassARRAY system.
Protein haplotypes are
based on three exonic SNPs
(S6, S18 and S19)
Fig. 3 3D modeling of the
two common BADH1
protein haplotypes (PH1
and PH2) with ligand
docking (green color)
showing reduced number of
binding sites in the PH2 for
GABald, a precursor of the
rice aroma compound 2-AP
334 Mol Breeding (2010) 26:325–338
123
An important observation in the present investiga-
tion was the association of BADH1-PH2 protein
haplotype with the aromatic rice varieties. To com-
plement such analysis it was important to analyze the
127 rice varieties and landraces for their population
structure using the STRUCTURE 2 software (Prit-
chard et al. 2000). We used 54 validated genome-wide
SNP markers with four to six loci per rice chromosome
for genotyping using Sequenom Mass ARRAY in two
multiplex assays (Table S2). Genome-wide synony-
mous SNPs were identified by resequencing of the
intron-spanning regions of conserved single copy rice
genes (our unpublished data). After fixing the K value
at two, and 1000 bootstrap permutations, the 127 rice
varieties were classified in two population groups
(Fig. 4). The list of varieties in each group with their
BADH1 haplotype score, aroma, salt tolerance and
BADH2-exon 7 SNP score is shown in Table S5. Most
of the traditional aromatic rice varieties were present
in population group 2 along with the japonica type
varieties, broadly agreeing with the phylogenetic
grouping based on the BADH1 gene sequence (Fig.
S1). But the population structure grouping presented
here is based on 54 genome-wide SNPs and therefore
reflected the pedigree and selection strategy used in
the development of these varieties (Fig. 4a). Rooting
of the phylogenetic tree with O. nivara showed that the
indica cultivar group was closer to this wild progen-
itor, but O. rufipogon was closer to the japonica/
aromatic population group (Fig. 4b). In each of the
two population groups there were varieties having
significant proportion of genes from the other group
due to their crossbreeding pedigrees; those with more
than 10% genes from other groups are marked in Table
S5.
The association between the BADH1 haplotypes
and salt tolerance or aroma trait of the rice varieties
was assessed manually using the chi-squared test of
significance. The two common protein haplotypes of
the BADH1 gene were present in both the population
groups. Similarly, salt-tolerant and aromatic varieties
were also present across the population groups, but
there was predominance of aromatic varieties in the
population group 2. Frequencies of varieties with
different salt tolerance and aroma scores against the
two common BADH1 protein haplotypes present in
80 rice varieties are shown in Table 4. The frequen-
cies of two protein haplotypes were analyzed in each
of the four categories of aromatic, non-aromatic, salt-
tolerant and salt-susceptible varieties against the
observed overall distribution of 71.2% for PH1 and
28.8% for PH2 in the whole population. There was a
significant association between aroma score and the
BADH1 protein haplotype PH2, where lysine144 and
lysine345 residues of the common haplotype PH1 are
substituted by asparagine144 and glutamine345,
respectively (v2 = 6.985, P = 0.008 at df 1). It is
known from several independent genetic studies and
validation through genetic transformation that the
BADH2 gene is a major locus responsible for rice
aroma, where loss of function mutations in the gene
lead to accumulation of gamma-aminobutyraldehyde
(GABald), a precursor of the aroma compound 2-AP
in the rice grains (Bradbury et al. 2008). At least two
earlier studies have shown co-location of a QTL for
aroma and the BADH1 gene on rice chromosome 4
(Lorieux et al. 1996, Amarawathi et al. 2008). The
BADH1 haplotype aroma association indentified in
this study may explain the functional basis of the
aroma QTL on chromosome 4.
Due to their similar biochemical function it is
anticipated that loss of function mutations in the
BADH1 gene could also control rice aroma similar to
the BADH2 gene, particularly in salt and water stress
conditions (Bradbury et al. 2008). In this study we
provide evidence that the BADH1 protein haplotype
PH2 (SNP haplotypes SH1 and SH2) is associated
with the aromatic rice varieties. It is important to note
here that the loss of function mutation in the BADH2
gene is a primary requirement for aroma development
due to constitutive expression of the BADH2 gene
(Bradbury et al. 2005). However, just the loss of
function of the BADH2 gene is not enough; it may be
complemented with the BADH1 protein haplotype
PH2 (SNP haplotypes SH1 and SH2) for full aroma
expression. For example, the popular crossbred
basmati variety Pusa Basmati 1 has the badh2-exon
7 deletion mutation but has BADH1 haplotype PH1/
SH1, which could be the reason for its mild aroma,
whereas another popular crossbred basmati variety
Pusa 1121 has a rare allele of the BADH1 gene
(haplotype PH1/SH14) that might lead to a better
aroma development than Pusa Basmati 1. Thus, a
combination of loss of function mutation in the
BADH2 gene and a reduction in the substrate binding
capacity of the BADH1 enzyme to aroma precursor
compound GABald could be important for full aroma
development in rice.
Mol Breeding (2010) 26:325–338 335
123
Re-sequencing of the target gene from different
genotypes of a species is one of the most reliable
techniques for SNP discovery that was applied here
for identification for the first time of 20 new SNPs in
the BADH1 gene. However, for routine application of
SNPs for allele mining and marker-assisted breeding,
high-throughput methods of SNP genotyping are
required. Sequenom MassARRAY system allows
handling of a large number of samples using small
to medium numbers of well characterized SNPs in
0.1
BindliTetep99
JayaPantDha1299
42
Kanak10
PR106Prabhat36
5
PNR381Pokkali65
MI4825
Pothana22
ADT3714
ADT384
1
PR108Shiva29
Swarnamukhi8
CSR27MTU107556
IR64Jyothi36
23
SumatiVarsha35
4
DhoiabankoiAnnadaRudramma
Vikas36
Rajavadlu19
ADT438
NeelaPantDhan422
1
KaushalMTU529398
NLR3444917
SwarnaCSR36
MalviyaDhan3636
OrgalluPelalaVadlu43
IR3614
BhadrakaliDivya60
8
IR501
IntanPusa83411
BPT1768Narendra359100
PusaSugandh2PusaSugandh341
PusaSugandh530
Lunishree52
Pusa16914
SaleemTellahamsa71
Satya72
Rajendra55
Rasi57
KrishanaHamsaRatna84
PNR16250
TKM634
RedTriveni3
HeiBaoJD617
HeeraIR20100
Narendra11827
Pusa20514
HKR126Keshava33
PR111Pusa4415
PR113Phalguna24
1
Kalinga3Vanprabha67
ChandanErramallelu82
Varalu34
JGL11470JGL3855100
SagarSamba31
5
JGL13595WGL3210050
1
ChaitanyaHKR12096
Krisahnaveni68
SwarnaDhan30
MandyaVijayaVijetha59
MTU106445
2
KavyaWGL1497
JGL384445
SonaMahsuri21
SambaMahsuri4
JGL117279
CSR10MTU108125
4
SurekhaIndurSamba
NLR3049129
BPT22312
8
Sriranga
23
CSR30TaraoriBasmati89
Basmati37088
HasanSarai69
Kalanamak313120
Kalanamak31197
Pusa1121Sonasal42
Pusa117610
33
SeondBasmati
41
ShahPasand20
NipponbareTaipai30999
Tripura83
Sathi56
46
Orufipogon381932
43
JhumKhasaPechiBadam37
Nagina2281
Golmalati35
51
KasturiPusaBasmati129
49
Pusa1266Pusa134252
64
Onivara283160
A B
Fig. 4 a Population structure (K = 2) and b NJ phylogenetic
tree (using Nei’s similarity matrix) of 127 rice varieties based
on 54 synonymous SNPs present in conserved single-copy rice
genes evenly distributed on the 12 rice chromosomes. The
phylogenetic tree was rooted using Oryza nivara as an out
group
336 Mol Breeding (2010) 26:325–338
123
multiplex reactions. We designed two multiplex
genotyping assays for the 20 newly discovered SNPs.
Fifteen of these SNPs were validated successfully and
used for the genotyping of a large set of 127 rice
genotypes of diverse origin and agronomic trait
variation. This helped discovery of BADH1 haplo-
types showing significant association with the aro-
matic rice varieties that may compliment the role of
known loss of function alleles of the BADH2 locus
for rice aroma. However, this association needs
further validation in a segregating population.
Sequence submissions
The BADH1 sequences from sixteen rice varieties have
been submitted to the NCBI GenBank with accession
numbers: CSR10 (EU566870), CSR27 (bankit111
4275), CSR36 (bankit1114294), Jaya (EU566862),
Jyoti (bankit1114323), Kalanamak 3119 (EU566863),
Kalanamak 3131 (bankit1114319), MI48 (bankit111
4313), Pokkali (EU566865), Pusa 44 (EU566866),
Pusa 1121 (EU566867), Pusa 1266 (EU566864), Pusa
1342 (EU566868), Ratna (bankit1114305), Red Tri-
veni (bankit1114308), Taipei 309 (EU566869).
Acknowledgments This work was supported by the NPTC
project of the Indian Council of Agricultural Research and is
part of the M.Sc. thesis of the senior author.
References
Amarawathi Y, Singh R, Singh AK, Singh VP, Mohapatra T,
Sharma TR et al (2008) Mapping of quantitative trait loci
for basmati quality traits in rice (Oryza sativa L.). Mol
Breed 21:49–65
Belo A, Zheng P, Luck S, Shen B, Meyer DJ, Li B, Tingey S,
Rafalski A (2008) Whole genome scan detects an allelic
variant of fad2 associated with increased oleic acid levels
in maize. Mol Genet Genomics 279:1–10
Bradbury LMT, Fitzgerald TL, Henry RJ, Jin QS, Waters DLE
(2005) The gene for fragrance in rice. Plant Biotechnol J
3:363–370
Bradbury LMY, Gillies SA, Brushett DJ, Daniel LEW, Henry
RJ (2008) Inactivation of an aminoaldehyde dehydroge-
nase is responsible for fragrance in rice. Plant Mol Biol
68: 439–449
Chen S, Yang Y, Shi W, Ji Q, He F, Zhang Z, Cheng Z, Liu X,
Xu M (2008) BADH2, encoding betaine aldehyde dehy-
drogenase, inhibits the biosynthesis of 2-acetyl-1-pyrro-
line, a major component in rice fragrance. Plant Cell 20:
1850–1861
Ewing B, Green P (1988) Base calling of automated sequence
tracer using Phred I. Accuracy assessment. Genome Res
8:175–185
Fitzgerald TL, Waters DLE, Henry RJ (2008) The affect of salt
on betaine aldehyde dehydrogenase transcript levels and 2-
acetyl-1-pyrroline concentration in fragrant and non-fra-
grant rice (Oryza sativa). Plant Sci. doi:10.1016/j.plantsci.
2008.06.005
IRGSP (2005) The map based sequence of the rice genome.
Nature 436:793–800
Ishitani M, Arakawa K, Mizuno K, Kishitani S, Takabe T
(1993) Betaine aldehyde dehydrogenase in the Grami-
neae. Levels in leaves of both betaine accumulating and