Αr9 RNA

Last updated

αr9 is a family of bacterial small non-coding RNAs with representatives in a broad group of α-proteobacteria from the order Hyphomicrobiales. The first member of this family (Smr9C) was found in a Sinorhizobium meliloti 1021 locus located in the chromosome (C). Further homology and structure conservation analysis have identified full-length Smr9C homologs in several nitrogen-fixing symbiotic rhizobia (i.e. R. leguminosarum bv.viciae, R. leguminosarum bv. trifolii, R. etli, and several Mesorhizobium species), in the plant pathogens belonging to Agrobacterium species (i.e. A. tumefaciens , A. vitis , A. radiobacter, and Agrobacterium H13) as well as in a broad spectrum of Brucella species ( B. ovis , B. canis , B. abortus and B. microtis , and several biovars of B. melitensis ). αr9C RNA species are 144-158 nt long (Table 1) and share a well defined common secondary structure consisting of seven conserved regions (Figure 1). Most of the αr9 transcripts can be catalogued as trans-acting sRNAs expressed from well-defined promoter regions of independent transcription units within intergenic regions (IGRs) of the α-proteobacterial genomes (Figure 5).

Contents

Discovery and structure

Smr9C sRNA was described by del Val et al. [1] in the intergenic regions (IGRs) of the reference S. meliloti 1021 strain (http://iant.toulouse.inra.fr/bacteria/annotation/cgi/rhime.cgi). Northern hybridization experiments confirmed that the predicted smr9C locus did express a single transcript of the expected size, which accumulated differentially in free-living and endosymbiotic bacteria. TAP-based 5'-RACE experiments mapped the transcription start site (TSS) of the full-length Smr9C transcript to the 1,398,425 nt position in the S. meliloti 1021 genome (http://iant.toulouse.inra.fr/bacteria/annotation/cgi/rhime.cgi) whereas the 3'-end was assumed to be located at the 1,398,277 nt position matching the last residue of the consecutive stretch of Us of a bona fide Rho-independent terminator (Figure 5). Parallel and later studies [2] [3] in which Smr9C transcript is referred to as Igr#3 or sra32 independently confirmed the expression this sRNA in S. meliloti and in its closely related strain 2011. Recent deep sequencing-based characterization of the small RNA fraction (50-350 nt) of S. meliloti 2011 further confirmed the expression of Smr9C (here referred to as SmelC289), and mapped the 5'- and 3'-ends of the full-length transcript to positions 1,398,423 and 1,398,279, respectively, in the S. meliloti 1021 genome. [4]

The nucleotide sequence of Smr9C was initially used as query to search against the Rfam database (version 10.0; http://www.sanger.ac.uk/Software/Rfam). This homology search rendered no matches to known bacterial sRNA in this database. Smr9C was next BLASTed with default parameters against all the currently available bacterial genomes (1,615 sequences at 20 April 2011; https://www.ncbi.nlm.nih.gov). The regions exhibiting significant homology to the query sequence (78-89% similarity) were extracted to create a Covariance Model (CM) from a seed alignment using Infernal (version1.0) [5] (Figure 2). This CM was used in a further search for new members of the αr9 family in the existing bacterial genomic databases.

Figure 1: Covariance Model in stockholm format showing the consensus structure for the ar9 family. Each of the stems represented by the structure line#=GC SS_cons is in a different color. To download the covariance model click here. C9 stockholm.png
Figure 1: Covariance Model in stockholm format showing the consensus structure for the αr9 family. Each of the stems represented by the structure line#=GC SS_cons is in a different color. To download the covariance model click here.

The results were manually inspected to deduce a consensus secondary structure for the family (Figure 1 and Figure 2). The consensus structure was also independently predicted with the program locARNATE [6] with very similar predictions. The manual inspection of the sequences found with the CM using Infernal allowed finding 26 true homolog sequences, all of them present as single chromosomal copies in the α-proteobacterial genomes. The rhizobial species encoding the 12 closer homologs to Smr9C were: two R. leguminosarum trifolii strains (WSM2304 and WSM1235), two R. etli strains CFN 42 and CIAT 652, the reference R. leguminosarum bv. viciae 3841 strain, Rhizobium NGR234, and the Agrobacterium species A. vitis,A. tumefaciens, A. radiobacter and A. H13. All these sequences showed significant Infernal E-values (1.50e-39 – 14.02e-21) and bit-scores. The rest of the sequences found with the model showed high E-values between (3.40e-12 and 2.62e-04) but lower bit-scores and are encoded by Brucella species (B. ovis, B. canis, B. abortus, B. microtis, and several biobars of B. melitensis), Brucella anthropi and the Mesorhizobum species M. loti, M. ciceri and M. BNC.

Table 1: Smr9C homologs in other symbionts and pathogens
CM modelNameGI accession numberbeginendstrand%GClengthOrganism
αr9Smr9Cgi|15963753|ref|NC_003047.1|13982771398425-46149Sinorhizobium meliloti 1021
αr9Smedr9Cgi|150395228|ref|NC_009636.1|979747979902-48150Sinorhizobium medicae WSM419 chromosome
αr9Sfr9Cgi|227820587|ref|NC_012587.1|11300011130150-47150Sinorhizobium fredii NGR234 chromosome
αr9Atr9Cgi|159184118|ref|NC_003062.2|12752971275443-47147Agrobacterium tumefaciens str. C58 chromosome circular
αr9AH13r9Cgi|325291453|ref|NC_015183.1|12155801215726-48147Agrobacterium sp. H13-3 chromosome
αr9ReCIATr9Cgi|190889639|ref|NC_010994.1|16849291685072-48144Rhizobium etli CIAT 652
αr9Arr9CIgi|222084201|ref|NC_011985.1|14819751482119-49145Agrobacterium radiobacter K84 chromosome 1
αr9Rlt2304r9Cgi|209547612|ref|NC_011369.1|12965061296650-48145Rhizobium leguminosarum bv. trifolii WSM2304 chromosome
αr9Avr9CIgi|222147015|ref|NC_011989.1|14485101448659-45150Agrobacterium vitis S4 chromosome 1
αr9Rlvr9Cgi|116249766|ref|NC_008380.1|18029361803079-49144Rhizobium leguminosarum bv. viciae 3841
αr9Rlt1325r9Cgi|241202755|ref|NC_012850.1|13545931354736-47144Rhizobium leguminosarum bv. trifolii WSM1325
αr9ReCFNr9Cgi|86355669|ref|NC_007761.1|17032031703345-47143Rhizobium etli CFN 42
αr9Mlr9Cgi|57165207|ref|NC_002678.2|11272171127363+54145Mesorhizobium loti MAFF303099 chromosome
αr9Mcr9Cgi|319779749|ref|NC_014923.1|34496393449786-52148Mesorhizobium ciceri biovar biserrulae WSM1271 chromosome
αr9Bcr9CIgi|161617991|ref|NC_010103.1|796281796428-49148Brucella canis ATCC 23365 chromosome I
αr9Bs23445r9CIgi|163842277|ref|NC_010169.1|817762817909-49148Brucella suis ATCC 23445 chromosome I
αr9Bm16Mr9CIgi|17986284|ref|NC_003317.1|11875961187743+49146Brucella melitensis bv. 1 str. 16M chromosome I
αr9BaS19r9CIgi|189023268|ref|NC_010742.1|817898818045-49148Brucella abortus S19 chromosome 1
αr9Bm23457r9CIgi|225851546|ref|NC_012441.1|819377819523-50147Brucella melitensis ATCC 23457 chromosome I
αr9Bs1330r9CIgi|56968325|ref|NC_004310.3|797917798064-49148Brucella suis 1330 chromosome I
αr9Ba19941r9CIgi|62288991|ref|NC_006932.1|819597819744-49148Brucella abortus bv. 1 str. 9-941 chromosome I
αr9Bmar9CIgi|82698932|ref|NC_007618.1|815875816022-49148Brucella melitensis biovar Abortus 2308 chromosome I
αr9Bor9CIgi|148558820|ref|NC_009505.1|822712822859-49148Brucella ovis ATCC 25840 chromosome I
αr9Bmir9CIgi|256368465|ref|NC_013119.1|802078802225-49148Brucella microti CCM 4915 chromosome 1
αr9Oar9CIgi|153007346|ref|NC_009667.1|25291082529255+48146Brucella anthropi ATCC 49188 chromosome 1
αr9MsBCNr9Cgi|110632362|ref|NC_008254.1|11417761141933-50158Mesorhizobium sp. BNC1
Figure 2: Consensus secondary structure of the ar9 members predicted by RNA and RNAalifold. Smr9C is coloured according to base pair probabilities, the color probability scale is indicated in the drawing. The ar9 family structure is coloured following a base conservation scheme. Red: base pair occurring in all sequences used to generate the consensus; yellow: two types of base pairing occur; Green: three types of base pairing occur. The shading of base pairs represents: Saturated, no inconsistent sequences; Pale, one inconsistent sequence; Very pale, two inconsistent sequences. C9 annotated alignment structure conservation.png
Figure 2: Consensus secondary structure of the αr9 members predicted by RNA and RNAalifold. Smr9C is coloured according to base pair probabilities, the color probability scale is indicated in the drawing. The αr9 family structure is coloured following a base conservation scheme. Red: base pair occurring in all sequences used to generate the consensus; yellow: two types of base pairing occur; Green: three types of base pairing occur. The shading of base pairs represents: Saturated, no inconsistent sequences; Pale, one inconsistent sequence; Very pale, two inconsistent sequences.
Figure 3: Phylogenetic distribution of known and predicted ar9 genes. Gene numbers are based on computational analysis using the program Infernal. Legend: Smr9C = Sinorhizobium meliloti 1021 (NC_003047), Smedr9C = Sinorhizobium medicae WSM419 chromosome (NC_009636), Sfr9C = Sinorhizobium fredii NGR234 chromosome (NC_012587), Atr9C = Agrobacterium tumefaciens str. C58 chromosome circular (NC_003062), AH13r9C = Agrobacterium sp. H13-3 chromosome (NC_015183), ReCIATr9C = Rhizobium etli CIAT 652 (NC_010994), Arr9CI = Agrobacterium radiobacter K84 chromosome 1 (NC_011985), Rlt2304r9C = Rhizobium leguminosarum bv. trifolii WSM2304 chromosome (NC_011369), Avr9CI = Agrobacterium vitis S4 chromosome 1 (NC_011989), Rlvr9C = Rhizobium leguminosarum bv. viciae 3841 (NC_008380), Rlt1325r9C = Rhizobium leguminosarum bv. trifolii WSM1325 (NC_012850), ReCFNr9C = Rhizobium etli CFN 42 (NC_007761), Mlr9C = Mesorhizobium loti MAFF303099 chromosome (NC_002678), Mcr9C = Mesorhizobium ciceri biovar biserrulae WSM1271 chromosome (NC_014923), Bcr9CI = Brucella canis ATCC 23365 chromosome I (NC_010103), Bs23445r9CI = Brucella suis ATCC 23445 chromosome I (NC_010169), Bm16Mr9CI = Brucella melitensis bv. 1 str. 16M chromosome I (NC_003317), BaS19r9CI = Brucella abortus S19 chromosome 1 (NC_010742), Bm23457r9CI = Brucella melitensis ATCC 23457 chromosome I (NC_012441), Bs1330r9CI = Brucella suis 1330 chromosome I (NC_004310), Ba19941r9CI = Brucella abortus bv. 1 str. 9-941 chromosome I (NC_006932), Bmar9CI = Brucella melitensis biovar Abortus 2308 chromosome I (NC_007618), Bor9CI = Brucella ovis ATCC 25840 chromosome I (NC_009505), Bmir9CI = Brucella microti CCM 4915 chromosome 1 (NC_013119), Oar9CI = Brucella anthropi ATCC 49188 chromosome 1 (NC_009667), MsBCNr9C = Mesorhizobium sp. BNC1 (NC_008254). C9 unrooted.png
Figure 3: Phylogenetic distribution of known and predicted αr9 genes. Gene numbers are based on computational analysis using the program Infernal. Legend: Smr9C = Sinorhizobium meliloti 1021 (NC_003047), Smedr9C = Sinorhizobium medicae WSM419 chromosome (NC_009636), Sfr9C = Sinorhizobium fredii NGR234 chromosome (NC_012587), Atr9C = Agrobacterium tumefaciens str. C58 chromosome circular (NC_003062), AH13r9C = Agrobacterium sp. H13-3 chromosome (NC_015183), ReCIATr9C = Rhizobium etli CIAT 652 (NC_010994), Arr9CI = Agrobacterium radiobacter K84 chromosome 1 (NC_011985), Rlt2304r9C = Rhizobium leguminosarum bv. trifolii WSM2304 chromosome (NC_011369), Avr9CI = Agrobacterium vitis S4 chromosome 1 (NC_011989), Rlvr9C = Rhizobium leguminosarum bv. viciae 3841 (NC_008380), Rlt1325r9C = Rhizobium leguminosarum bv. trifolii WSM1325 (NC_012850), ReCFNr9C = Rhizobium etli CFN 42 (NC_007761), Mlr9C = Mesorhizobium loti MAFF303099 chromosome (NC_002678), Mcr9C = Mesorhizobium ciceri biovar biserrulae WSM1271 chromosome (NC_014923), Bcr9CI = Brucella canis ATCC 23365 chromosome I (NC_010103), Bs23445r9CI = Brucella suis ATCC 23445 chromosome I (NC_010169), Bm16Mr9CI = Brucella melitensis bv. 1 str. 16M chromosome I (NC_003317), BaS19r9CI = Brucella abortus S19 chromosome 1 (NC_010742), Bm23457r9CI = Brucella melitensis ATCC 23457 chromosome I (NC_012441), Bs1330r9CI = Brucella suis 1330 chromosome I (NC_004310), Ba19941r9CI = Brucella abortus bv. 1 str. 9-941 chromosome I (NC_006932), Bmar9CI = Brucella melitensis biovar Abortus 2308 chromosome I (NC_007618), Bor9CI = Brucella ovis ATCC 25840 chromosome I (NC_009505), Bmir9CI = Brucella microti CCM 4915 chromosome 1 (NC_013119), Oar9CI = Brucella anthropi ATCC 49188 chromosome 1 (NC_009667), MsBCNr9C = Mesorhizobium sp. BNC1 (NC_008254).

Expression information

Parallel studies assessed Smr9C expression in S. meliloti 1021 under different biological conditions; i.e. bacterial growth in TY, minimal medium (MM) and luteolin-MM broth and endosymbiotic bacteria (i.e. mature symbiotic alfalfa nodules) [1] and high salt stress, oxidative stress and cold and hot shock stresses. [3] Expression of Smr9C in free-living bacteria was found to be growth-dependent, being the gene strongly down-regulated when bacteria entered the stationary phase, whereas no expression was detected in endosymbiotic bacteria. Recent deep sequencing data [4] further revealed up-regulation of Smr9C upon salt, acidic, cold-shock and heat shock stresses. Recent co-inmuno precipitation experiments [9] corroborate that Smr9C, does bind the bacterial protein Hfq for efficient target binding.

Promoter analysis

All the promoter regions of the αr9 family members examined so far are very conserved in a sequence stretch extending up to 80 bp upstream of the transcription start site of the sRNA. All loci have recognizable σ70-dependent promoters showing a -35/-10 consensus motif CTTAGAC-n17-CTATAT, which has been previously shown to be widely conserved among several other genera in the α-subgroup of proteobacteria. [10] To identify binding sites for other known transcription factors we used the fasta sequences provided by RegPredict [11] (http://regpredict.lbl.gov/regpredict/help.html), and used those position weight matrices (PSWM) provided by RegulonDB [12] (http://regulondb.ccg.unam.mx). We built PSWM for each transcription factor from the RegPredict sequences using the Consensus/Patser program, choosing the best final matrix for motif lengths between 14 and 30 bps a threshold average E-value < 10E-10 for each matrix was established, (see "Thresholded consensus" in http://gps-tools2.its.yale.edu). Moreover, we searched for conserved unknown motifs using MEME [13] (http://meme.sdsc.edu/meme4_6_1/intro.html) and used relaxed regular expressions (i.e. pattern matching) over all Smr9C homologs promoters. This studies revealed the presence of 30 bp long region very conserved between positions -40 and -75, conserved MEME motif, (Figure 5). This sequence was present in all but one of the smrC9 homologs found, but no significant similarity to known transcription factor binding sites matrices could be established. [14]

Figure 4: Alignment of the promoter region of the ar9 members. All members presented putative s promoters with -35 and -10 boxes marked in green and red respectively. The motif found conserved in all ar9 family members is marked with an orange box. Promoters9C.png
Figure 4: Alignment of the promoter region of the αr9 members. All members presented putative σ promoters with -35 and -10 boxes marked in green and red respectively. The motif found conserved in all αr9 family members is marked with an orange box.

Genomic context

Most of the members of the αr9 family are trans-encoded sRNAs transcribed from independent promoters in chromosomal IGRs. Exceptions are the cis-encoded antisense Smr9C homologs of A. tumefaciens and B. microti, which are located in the opposite strand of annotated genes, partially overlapping ORFs. Most of the neighboring genes of the seed alignment's members were not annotated and thus were further manually curated. [15] [16] [17] The predicted protein products of these overlapping ORFs could not be assigned to any functional category on the basis of the amino acid sequence homology. However, the genomic regions of almost all αr9 sRNAs exhibited a great degree of conservation including the sRNA-coding sequence and the upstream and downstream genes which have been predicted to code for a prolyl-tRNA syntethase (proS) and a transmembrane protein, respectively. Partial synteny of the αr9 genomic regions was observed in a few cases such as, S. medicae where instead of a proS gene an FAD-dependent pyridine nucleotide-disulfide oxidoreductase encoding gene was found upstream of the αr9 locus, and Mesorhizobium loti where no transmembrane coding gene was recognizable downstream of the sRNA gene. A special case is the Brucella group, where primary automatic annotation over their genomes identified ORFs smaller than 30 aa overlapping with the predicted αr9 sRNA in the same strand. These predicted ORFs, neither show any similarity with database entries nor any motif or signatures when searched against family and motif databases such as Interpro, [18] PFAM [19] or Smart, [20] and thus, are considered here as missannotations not registered in the genomic context graph.

Figure 5: Genomic context scheme of Smr9C and its closest homologues in a-proteobacteria. The ar9 RNA genes are represented by red arrows and the flanking ORFs by arrows on different colors depending on their product function (legend). Numbers indicate the ar9 RNA gene's and flanking ORFs coordinates in each organism genome database. The gene strand is represented with the file direction. On the left of the figure identification names are used which correspond to a certain organism: ar9_Smr9C = Sinorhizobium meliloti 1021 (NC_003047), ar9_Smedr9C = Sinorhizobium medicae WSM419 chromosome (NC_009636), ar9_Sfr9C = Sinorhizobium fredii NGR234 chromosome (NC_012587), ar9_Atr9C = Agrobacterium tumefaciens str. C58 chromosome circular (NC_003062), ar9_AH13r9C = Agrobacterium sp. H13-3 chromosome (NC_015183), ar9_ReCIATr9C = Rhizobium etli CIAT 652 (NC_010994), ar9_Arr9CI = Agrobacterium radiobacter K84 chromosome 1 (NC_011985), ar9_Rlt2304r9C = Rhizobium leguminosarum bv. trifolii WSM2304 chromosome (NC_011369), ar9_Avr9CI = Agrobacterium vitis S4 chromosome 1 (NC_011989), ar9_Rlvr9C = Rhizobium leguminosarum bv. viciae 3841 (NC_008380), ar9_Rlt1325r9C = Rhizobium leguminosarum bv. trifolii WSM1325 (NC_012850), ar9_ReCFNr9C = Rhizobium etli CFN 42 (NC_007761), ar9_Mlr9C = Mesorhizobium loti MAFF303099 chromosome (NC_002678), ar9_Mcr9C = Mesorhizobium ciceri biovar biserrulae WSM1271 chromosome (NC_014923), ar9_Bcr9CI = Brucella canis ATCC 23365 chromosome I (NC_010103), ar9_Bs23445r9CI = Brucella suis ATCC 23445 chromosome I (NC_010169), ar9_Bm16Mr9CI = Brucella melitensis bv. 1 str. 16M chromosome I (NC_003317), ar9_BaS19r9CI = Brucella abortus S19 chromosome 1 (NC_010742), ar9_Bm23457r9CI = Brucella melitensis ATCC 23457 chromosome I (NC_012441), ar9_Bs1330r9CI = Brucella suis 1330 chromosome I (NC_004310), ar9_Ba19941r9CI = Brucella abortus bv. 1 str. 9-941 chromosome I (NC_006932), ar9_Bmar9CI = Brucella melitensis biovar Abortus 2308 chromosome I (NC_007618), ar9_Bor9CI = Brucella ovis ATCC 25840 chromosome I (NC_009505), ar9_Bmir9CI = Brucella microti CCM 4915 chromosome 1 (NC_013119), ar9_Oar9CI = Brucella anthropi ATCC 49188 chromosome 1 (NC_009667), ar9_MsBCNr9C = Mesorhizobium sp. BNC1 (NC_008254). Smbr9.png
Figure 5: Genomic context scheme of Smr9C and its closest homologues in α-proteobacteria. The αr9 RNA genes are represented by red arrows and the flanking ORFs by arrows on different colors depending on their product function (legend). Numbers indicate the αr9 RNA gene's and flanking ORFs coordinates in each organism genome database. The gene strand is represented with the file direction. On the left of the figure identification names are used which correspond to a certain organism: αr9_Smr9C = Sinorhizobium meliloti 1021 (NC_003047), αr9_Smedr9C = Sinorhizobium medicae WSM419 chromosome (NC_009636), αr9_Sfr9C = Sinorhizobium fredii NGR234 chromosome (NC_012587), αr9_Atr9C = Agrobacterium tumefaciens str. C58 chromosome circular (NC_003062), αr9_AH13r9C = Agrobacterium sp. H13-3 chromosome (NC_015183), αr9_ReCIATr9C = Rhizobium etli CIAT 652 (NC_010994), αr9_Arr9CI = Agrobacterium radiobacter K84 chromosome 1 (NC_011985), αr9_Rlt2304r9C = Rhizobium leguminosarum bv. trifolii WSM2304 chromosome (NC_011369), αr9_Avr9CI = Agrobacterium vitis S4 chromosome 1 (NC_011989), αr9_Rlvr9C = Rhizobium leguminosarum bv. viciae 3841 (NC_008380), αr9_Rlt1325r9C = Rhizobium leguminosarum bv. trifolii WSM1325 (NC_012850), αr9_ReCFNr9C = Rhizobium etli CFN 42 (NC_007761), αr9_Mlr9C = Mesorhizobium loti MAFF303099 chromosome (NC_002678), αr9_Mcr9C = Mesorhizobium ciceri biovar biserrulae WSM1271 chromosome (NC_014923), αr9_Bcr9CI = Brucella canis ATCC 23365 chromosome I (NC_010103), αr9_Bs23445r9CI = Brucella suis ATCC 23445 chromosome I (NC_010169), αr9_Bm16Mr9CI = Brucella melitensis bv. 1 str. 16M chromosome I (NC_003317), αr9_BaS19r9CI = Brucella abortus S19 chromosome 1 (NC_010742), αr9_Bm23457r9CI = Brucella melitensis ATCC 23457 chromosome I (NC_012441), αr9_Bs1330r9CI = Brucella suis 1330 chromosome I (NC_004310), αr9_Ba19941r9CI = Brucella abortus bv. 1 str. 9-941 chromosome I (NC_006932), αr9_Bmar9CI = Brucella melitensis biovar Abortus 2308 chromosome I (NC_007618), αr9_Bor9CI = Brucella ovis ATCC 25840 chromosome I (NC_009505), αr9_Bmir9CI = Brucella microti CCM 4915 chromosome 1 (NC_013119), αr9_Oar9CI = Brucella anthropi ATCC 49188 chromosome 1 (NC_009667), αr9_MsBCNr9C = Mesorhizobium sp. BNC1 (NC_008254).
Table 2: Detailed Genomic context information of the αr9 sRNA members.
FamilyFeatureNameStrandBeginEndProtein nameAnnotationOrganism
αr9_Smr9CgeneSMc01933R13978891398104NP_385391.1transmembrane proteinSinorhizobium meliloti 1021 (NC_003047)
αr9_Smr9CsRNASmr9CR13982771398425Sinorhizobium meliloti 1021 (NC_003047)
αr9_Smr9CgeneSMc01934D13988151400143NP_385392.1prolyl-tRNA sybthetaseSinorhizobium meliloti 1021 (NC_003047)
αr9_Rlt2304r9CgeneRleg2_1278D12960671296342YP_002280798.1transmembrane protein function DUF146Rhizobium leguminosarum bv trifolii WSM2304 chromosome (NC_011369)
αr9_Rlt2304r9CsRNARlt2304r9CR12965061296650Rhizobium leguminosarum bv trifolii WSM2304 chromosome (NC_011369)
αr9_Rlt2304r9CgeneRleg2_1279D12969351298257YP_002280799.1prolyl-tRNA sybthetaseRhizobium leguminosarum bv trifolii WSM2304 chromosome (NC_011369)
αr9_Rlt1325r9CgeneRleg_1370D13541551354430YP_002975201.1transmembrane proteinRhizobium leguminosarum bv trifolii WSM1325 (NC_012850)
αr9_Rlt1325r9CsRNARlt1325r9CR13545931354736Rhizobium leguminosarum bv trifolii WSM1325 (NC_012850)
αr9_Rlt1325r9CgeneRleg_1371D13550211356343YP_002975202.1prolyl-tRNA sybthetaseRhizobium leguminosarum bv trifolii WSM1325 (NC_012850)
αr9_ReCFNr9CgeneRHE_CH01622D17027001703041YP_469148.1transmembrane proteinRhizobium etli CFN 42 (NC_007761)
αr9_ReCFNr9CsRNAReCFNr9CR17032031703345Rhizobium etli CFN 42 (NC_007761)
αr9_ReCFNr9CgeneRHE_CH01623D17036291704951YP_469149.2prolyl-tRNA sybthetaseRhizobium etli CFN 42 (NC_007761)
αr9_Smedr9CgeneSmed_0907D979229979498YP_001326597.1transmembrane proteinSinorhizobium medicae WSM419 chromosome (NC_009636)
αr9_Smedr9CsRNASmedr9CR979747979753Sinorhizobium medicae WSM419 chromosome (NC_009636)
αr9_Smedr9CgeneSmed_0908D980177981469YP_001326598.1FAD-dependent pyridine nucleotide-disulfide oxidoreductasSinorhizobium medicae WSM419 chromosome (NC_009636)
αr9_Rlvr9CgeneRL1718D18024321802773YP_767322.1transmembrane proteinRhizobium leguminosarum bv. viciae 3841 (NC_008380)
αr9_Rlvr9CsRNARlvr9CR18029361803079Rhizobium leguminosarum bv. viciae 3841 (NC_008380)
αr9_Rlvr9CgeneRL1719D18033641804686YP_767323.1prolyl-tRNA sybthetaseRhizobium leguminosarum bv. viciae 3841 (NC_008380)
αr9_Atr9CgeneAtu1286D12748031275072NP_354296.2transmembrane proteinAgrobacterium tumefaciens str. C58 chromosome circular (NC_003062)
αr9_Atr9CsRNAAtr9CR12752971275443Agrobacterium tumefaciens str. C58 chromosome circular (NC_003062)
αr9_Atr9CgeneAtu1288D12757381277060NP_354297.2prolyl-tRNA sybthetaseAgrobacterium tumefaciens str. C58 chromosome circular (NC_003062)
αr9_ReCIATr9CgeneRHECIAT_CH0001694D16844731684766YP_001977846.1transmembrane proteinRhizobium etli CIAT 652 (NC_010994)
αr9_ReCIATr9CsRNAReCIATr9CR16849291685072Rhizobium etli CIAT 652 (NC_010994)
αr9_ReCIATr9CgeneRHECIAT_CH0001695D16853561686678YP_001977847.1prolyl-tRNA sybthetaseRhizobium etli CIAT 652 (NC_010994)
αr9_Arr9CIgeneArad_1869D14814921481770YP_002544124.1transmembrane proteinAgrobacterium radiobacter K84 chromosome 1 (NC_011985)
αr9_Arr9CIsRNAArr9CIR14819751482119Agrobacterium radiobacter K84 chromosome 1 (NC_011985)
αr9_Arr9CIgeneArad_1870D14823071483734YP_002544125.1prolyl-tRNA sybthetaseAgrobacterium radiobacter K84 chromosome 1 (NC_011985)
αr9_Sfr9CgeneNGR_c10670D11295711129840YP_002825606.1transmembrane proteinSinorhizobium fredii NGR234 chromosome (NC_012587)
αr9_Sfr9CsRNASfr9CR11300011130150Sinorhizobium fredii NGR234 chromosome (NC_012587)
αr9_Sfr9CgeneNGR_c10680D11303671131872YP_002825607.1prolyl-tRNA sybthetaseSinorhizobium fredii NGR234 chromosome (NC_012587)
αr9_AH13r9CgeneAGROH133_05703D12150321215337YP_004278514.1transmembrane proteinAgrobacterium sp. H13-3 chromosome (NC_015183)
αr9_AH13r9CsRNAAH13r9CR12155801215726Agrobacterium sp. H13-3 chromosome (NC_015183)
αr9_AH13r9CgeneAGROH133_05706D12159111217344YP_004278515.1Prolyl-tRNA sybthetaseAgrobacterium sp. H13-3 chromosome (NC_015183)
αr9_Avr9CIgeneAvi_1737D14481811448450YP_002549251.1transmembrane proteinAgrobacterium vitis S4 chromosome 1 (NC_011989)
αr9_Avr9CIsRNAAvr9CIR14485101448659Agrobacterium vitis S4 chromosome 1 (NC_011989)
αr9_Avr9CIgeneAvi_1739D14488951450391YP_002549252.1prolyl-tRNA sybthetaseAgrobacterium vitis S4 chromosome 1 (NC_011989)
αr9_Bor9CIgeneBOV_0814D822060822329YP_001258797.1transmembrane proteinBrucella ovis ATCC 25840 chromosome I (NC_009505)
αr9_Bor9CIsRNABor9CIR822712822859Brucella ovis ATCC 25840 chromosome I (NC_009505)
αr9_Bor9CIgeneBOV_0816D823177824505YP_001258799.1prolyl-tRNA sybthetaseBrucella ovis ATCC 25840 chromosome I (NC_009505)
αr9_Bcr9CIgeneBCAN_A0834D795629795898YP_001592670.1transmembrane proteinBrucella canis ATCC 23365 chromosome I (NC_010103)
αr9_Bcr9CIsRNABcr9CIR796281796428Brucella canis ATCC 23365 chromosome I (NC_010103)
αr9_Bcr9CIgeneBCAN_A0837D796746798074YP_001592673.1prolyl-tRNA sybthetaseBrucella canis ATCC 23365 chromosome I (NC_010103)
αr9_Bs23445r9CIgeneBSUIS_A0858D817110817379YP_001627496.1transmembrane proteinBrucella suis ATCC 23445 chromosome I (NC_010169)
αr9_Bs23445r9CIsRNABs23445r9CIR817762817909Brucella suis ATCC 23445 chromosome I (NC_010169)
αr9_Bs23445r9CIgeneBSUIS_A0861D818227819555YP_001627499.1prolyl-tRNA sybthetaseBrucella suis ATCC 23445 chromosome I (NC_010169)
αr9_Bm16Mr9CIgeneBMEI1140R11859501187287NP_540057.1prolyl-tRNA sybthetaseBrucella melitensis bv. 1 str. 16M chromosome I (NC_003317)
αr9_Bm16Mr9CIsRNABm16Mr9CID11875961187743Brucella melitensis bv. 1 str. 16M chromosome I (NC_003317)
αr9_Bm16Mr9CIgeneBMEI1141R11881261188395NP_540058.1transmembrane proteinBrucella melitensis bv. 1 str. 16M chromosome I (NC_003317)
αr9_BaS19r9CIgeneBAbS19_I07830D817246817515YP_001934781.1transmembrane proteinBrucella abortus S19 chromosome 1 (NC_010742)
αr9_BaS19r9CIsRNABaS19r9CIR817898818045Brucella abortus S19 chromosome 1 (NC_010742)
αr9_BaS19r9CIgeneBAbS19_I07840D818363819691YP_001934782.1prolyl-tRNA sybthetaseBrucella abortus S19 chromosome 1 (NC_010742)
αr9_Bs1330r9CIgeneBR0819D797265797534NP_697833.1transmembrane proteinBrucella suis 1330 chromosome I (NC_004310)
αr9_Bs1330r9CIsRNABs1330r9CIR797917798064Brucella suis 1330 chromosome I (NC_004310)
αr9_Bs1330r9CIgeneBR0822D798382799710NP_697836.1prolyl-tRNA sybthetaseBrucella suis 1330 chromosome I (NC_004310)
αr9_Ba19941r9CIgeneBruAb1_0833D818945819214YP_221561.1transmembrane proteinBrucella abortus bv. 1 str. 9-941 chromosome I (NC_006932)
αr9_Ba19941r9CIsRNABa19941r9CIR819597819744Brucella abortus bv. 1 str. 9-941 chromosome I (NC_006932)
αr9_Ba19941r9CIgeneBruAb1_0836D820062821390YP_221564.1prolyl-tRNA sybthetaseBrucella abortus bv. 1 str. 9-941 chromosome I (NC_006932)
αr9_Bmar9CIgeneBAB1_0839D815223815492YP_414270.1hypothetical membrane spanning proteinBrucella melitensis biovar Abortus 2308 chromosome I (NC_007618)
αr9_Bmar9CIsRNABmar9CIR815875816022Brucella melitensis biovar Abortus 2308 chromosome I (NC_007618)
αr9_Bmar9CIgeneBAB1_0842D816340817668YP_414273.1prolyl-tRNA sybthetaseBrucella melitensis biovar Abortus 2308 chromosome I (NC_007618)
αr9_Bmir9CIgeneBMI_I820D801426801695YP_003106759.1transmembrane proteinBrucella microti CCM 4915 chromosome 1 (NC_013119)
αr9_Bmir9CIsRNABmir9CIR802078802225Brucella microti CCM 4915 chromosome 1 (NC_013119)
αr9_Bmir9CIgeneBMI_I822D802543803871YP_003106761.1prolyl-tRNA sybthetaseBrucella microti CCM 4915 chromosome 1 (NC_013119)
αr9_Bm23457r9CIgeneBMEA_A0861D818725818994YP_002732568.1transmembrane proteinBrucella melitensis ATCC 23457 chromosome I (NC_012441)
αr9_Bm23457r9CIsRNABm23457r9CIR819377819523Brucella melitensis ATCC 23457 chromosome I (NC_012441)
αr9_Bm23457r9CIgeneBMEA_A0862D819842821170YP_002732569.1prolyl-tRNA sybthetaseBrucella melitensis ATCC 23457 chromosome I (NC_012441)
αr9_Mlr9Cgenemll1344R11255201126848NP_102953.1prolyl-tRNA sybthetaseMesorhizobium loti MAFF303099 chromosome (NC_002678)
αr9_Mlr9CsRNAMlr9CD11272171127363Mesorhizobium loti MAFF303099 chromosome (NC_002678)
αr9_Mlr9Cgenemsl1345R11275031127787NP_102954.1hypothetical membrane spanning proteinMesorhizobium loti MAFF303099 chromosome (NC_002678)
αr9_Oar9CIgeneOant_2405R25274622528790YP_001370948.1prolyl-tRNA sybthetaseBrucella anthropi ATCC 49188 chromosome 1 (NC_009667)
αr9_Oar9CIsRNAOar9CID25291082529255Brucella anthropi ATCC 49188 chromosome 1 (NC_009667)
αr9_Oar9CIgeneOant_2406R25295942529863YP_001370949.1transmembrane proteinBrucella anthropi ATCC 49188 chromosome 1 (NC_009667)
αr9_Mcr9CgeneMesci_3302D34492153449499YP_004142475.1transmembrane proteinMesorhizobium ciceri biovar biserrulae WSM1271 chromosome (NC_014923)
αr9_Mcr9CsRNAMcr9CR34496393449786Mesorhizobium ciceri biovar biserrulae WSM1271 chromosome (NC_014923)
αr9_Mcr9CgeneMesci_3303D34501333451461YP_004142476.1prolyl-tRNA sybthetaseMesorhizobium ciceri biovar biserrulae WSM1271 chromosome (NC_014923)
αr9_MsBCNr9CgeneMeso_1039D11414891141758YP_673601.1transmembrane proteinMesorhizobium sp. BNC1 (NC_008254)
αr9_MsBCNr9CsRNAMsBCNr9CR11417761141933Mesorhizobium sp. BNC1 (NC_008254)
αr9_MsBCNr9CgeneMeso_1040D11421391143467YP_673602.1prolyl-tRNA sybthetaseMesorhizobium sp. BNC1 (NC_008254)

References

  1. 1 2 del Val C, Rivas E, Torres-Quesada O, Toro N, Jiménez-Zurdo JI (2007). "Identification of differentially expressed small non-coding RNAs in the legume endosymbiont Sinorhizobium meliloti by comparative genomics". Mol Microbiol. 66 (5): 1080–1091. doi:10.1111/j.1365-2958.2007.05978.x. PMC   2780559 . PMID   17971083.
  2. Vincent M Ulvé; Emeric W Sevin; Angélique Chéron & Frédérique Barloy-Hubler (2007). "Identification of chromosomal alpha-proteobacterial small RNAs by comparative genome analysis and detection in Sinorhizobium meliloti strain 1021". BMC Genomics. 8 (467): 467. doi: 10.1186/1471-2164-8-467 . PMC   2245857 . PMID   18093320.
  3. 1 2 Valverde, Claudio; Livny, Jonathan; Schlüter, Jan-Philip; Reinkensmeier, Jan; Becker, Anke & Parisi, Gustavo (2009). "prediction of Sinorhizobium meliloti sRNA genes and experimental detection in strain 2011". BMC Genomics. 9 (406): 416. doi: 10.1186/1471-2164-9-416 . PMC   2573895 . PMID   18793445.
  4. 1 2 Schlüter JP, Reinkensmeier J, Daschkey S, Evguenieva-Hackenberg E, Janssen S, Jänicke S, Becker JD, Giegerich R, Becker A (2010). "A genome-wide survey of sRNAs in the symbiotic nitrogen-fixing alpha-proteobacterium Sinorhizobium meliloti". BMC Genomics. 11 (245): 436. doi: 10.1186/1471-2164-11-436 . PMC   3091635 . PMID   20637113.
  5. Nawrocki EP, Kolbe DL, Eddy SR (2009). "Infernal 1.0: inference of RNA alignments". Bioinformatics. 25 (10): 1335–1337. doi:10.1093/bioinformatics/btp157. PMC   2732312 . PMID   19307242.
  6. Will S, Reiche K, Hofacker IL, Stadler PF, Backofen R (2007). "Inferring Noncoding RNA Families and Classes by Means of Genome-Scale Structure-Based Clustering". PLOS Comput Biol. 4 (65): e65. Bibcode:2007PLSCB...3...65W. doi: 10.1371/journal.pcbi.0030065 . PMC   1851984 . PMID   17432929.
  7. I. L. Hofacker; W. Fontana; P. F. Stadler; L. S. Bonhoeffer; M. Tacker & P. Schuster (1994). "Fast folding and comparison of RNA secondary structures". Monatshefte für Chemie. 125 (2): 167–188. doi:10.1007/BF00818163. S2CID   19344304.
  8. Stephan H Bernhart; Ivo L Hofacker; Sebastian Will; Andreas R Gruber & Peter F Stadler (2008). "RNAalifold: improved consensus structure prediction for RNA alignments". BMC Bioinformatics. 9 (474): 474. doi: 10.1186/1471-2105-9-474 . PMC   2621365 . PMID   19014431.
  9. Torres-Quesada O, Oruezabal RI, Peregrina A, Jofre E, Lloret J, Rivilla R, Toro N, Jiménez-Zurdo JI (2010). "The Sinorhizobium meliloti RNA chaperone Hfq influences central carbon metabolism and the symbiotic interaction with alfalfa" (PDF). BMC Microbiol. 6.
  10. MacLellan SR, MacLean AM, Finan TM (2006). "Promoter prediction in the rhizobia". Microbiology. 152 (6): 1751–1763. doi: 10.1099/mic.0.28743-0 . PMID   16735738.
  11. Novichkov PS, Rodionov DA, Stavrovskaya ED, Novichkova ES, Kazakov AE, Gelfand MS, Arkin AP, Mironov AA, Dubchak I (2010). "RegPredict: an integrated system for regulon inference in prokaryotes by comparative genomics approach". Nucleic Acids Research. 38 (Web Server issue): W299 –W307. doi:10.1093/nar/gkq531. PMC   2896116 . PMID   20542910.
  12. Gama-Castro S, Salgado H, Peralta-Gil M, Santos-Zavaleta A, Muniz-Rascado L, Solano-Lira H, Jimenez-Jacinto V, Weiss V, Garcia-Sotelo JS, Lopez-Fuentes A, Porron-Sotelo L, Alquicira-Hernandez S, Medina-Rivera A, Martinez-Flores I, Alquicira-Hernandez K, Martinez-Adame R, Bonavides-Martinez C, Miranda-Rios J, Huerta AM, Mendoza-Vargas A, Collado-Torres L, Taboada B, Vega-Alvarado L, Olvera M, Olvera L, Grande R, Morett E, Collado-Vides J (2010). "RegulonDB version 7.0: transcriptional regulation of Escherichia coli K-12 integrated within genetic sensory response units (Gensor Units)". Nucleic Acids Research. 39 (Database issue): D98 –D105. doi:10.1093/nar/gkq1110. PMC   3013702 . PMID   21051347.
  13. Bailey TL, Elkan C (1994). "Fitting a mixture model by expectation maximization to discover motifs in biopolymers". Proceedings. International Conference on Intelligent Systems for Molecular Biology. 2. AAAI Press, Menlo Park, California: 28–36. PMID   7584402.
  14. Goethals K, Van Montagu M, Holsters M (1992). "Conserved motifs in a divergent nod box of Azorhizobium caulinodans ORS571 reveal a common structure in promoters regulated by LysR-type proteins". Proc Natl Acad Sci U S A. 89 (5): 1646–1650. Bibcode:1992PNAS...89.1646G. doi: 10.1073/pnas.89.5.1646 . PMC   48509 . PMID   1542656.
  15. Vinayagam A, del Val C, Schubert F, Eils R, Glatting KH, Suhai S, König R (2006). "GOPET: a tool for automated predictions of Gene Ontology terms". BMC Bioinformatics. 7 161: 171. doi: 10.1186/1471-2105-7-161 . PMC   1434778 . PMID   16549020.
  16. Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M (2005). "Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research". Bioinformatics. 21 (18): 3674–3676. doi: 10.1093/bioinformatics/bti610 . hdl: 10251/151293 . PMID   16081474.
  17. del Val C, Ernst P, Falkenhahn M, Fladerer C, Glatting KH, Suhai S, Hotz-Wagenblatt A (2007). "ProtSweep, 2Dsweep and DomainSweep: protein analysis suite at DKFZ". Nucleic Acids Res. 35 (Web Server issue): W444–50. doi:10.1093/nar/gkm364. PMC   1933246 . PMID   17526514.
  18. Hunter S, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Das U, Daugherty L, Duquenne L, Finn RD, Gough J, Haft D, Hulo N, Kahn D, Kelly E, Laugraud A, Letunic I, Lonsdale D, Lopez R, Madera M, Maslen J, McAnulla C, McDowall J, Mistry J, Mitchell A, Mulder N, Natale D, Orengo C, Quinn AF, Selengut JD, Sigrist CJ, Thimma M, Thomas PD, Valentin F, Wilson D, Wu CH, Yeats C (2009). "InterPro: the integrative protein signature database". Nucleic Acids Res. 37 (Database issue): D224–228. doi:10.1093/nar/gkn785. PMC   2686546 . PMID   18940856.
  19. Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K, Holm L, Sonnhammer EL, Eddy SR, Bateman A (2010). "The Pfam protein families database". Nucleic Acids Res. 38 (Database issue): D211–222. doi:10.1093/nar/gkp985. PMC   2808889 . PMID   19920124.
  20. Letunic I, Doerks T, Bork P (2008). "SMART 6: recent updates and new developments". Nucleic Acids Res. 37 (Database issue): D211–222. doi:10.1093/nar/gkn808. PMC   2686533 . PMID   18978020.