Query 001577
Match_columns 1050
No_of_seqs 360 out of 1111
Neff 3.7
Searched_HMMs 46136
Date Fri Mar 29 04:15:54 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/001577.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/001577hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1990 Poly(A)-specific exori 100.0 1.3E-63 2.9E-68 576.1 18.4 512 169-741 4-531 (564)
2 PRK10343 RNA-binding protein Y 99.9 3.2E-24 7E-29 200.0 12.9 89 593-685 2-90 (97)
3 TIGR00253 RNA_bind_YhbY putati 99.9 3.5E-24 7.6E-29 199.1 12.2 88 594-685 1-88 (95)
4 PF01985 CRS1_YhbY: CRS1 / Yhb 99.9 1.2E-23 2.6E-28 190.6 11.1 84 180-263 1-84 (84)
5 COG1534 Predicted RNA-binding 99.9 4.1E-23 9E-28 192.0 11.8 96 593-693 1-96 (97)
6 PF01985 CRS1_YhbY: CRS1 / Yhb 99.9 3.2E-23 6.9E-28 187.9 10.1 84 594-681 1-84 (84)
7 PRK10343 RNA-binding protein Y 99.9 1.1E-22 2.3E-27 189.9 12.8 89 178-266 1-89 (97)
8 TIGR00253 RNA_bind_YhbY putati 99.9 2.2E-22 4.8E-27 187.1 12.7 87 180-266 1-87 (95)
9 COG1534 Predicted RNA-binding 99.8 1.8E-20 3.9E-25 174.5 12.0 86 179-264 1-86 (97)
10 KOG1990 Poly(A)-specific exori 99.8 1.2E-19 2.7E-24 211.6 4.5 303 168-503 163-494 (564)
11 PF15337 Vasculin: Vascular pr 36.9 19 0.00041 35.1 1.4 41 399-440 34-74 (97)
12 PF03641 Lysine_decarbox: Poss 21.3 1.5E+02 0.0032 29.7 4.6 37 598-634 72-109 (133)
13 PF12207 DUF3600: Domain of un 20.4 32 0.00069 36.0 -0.2 101 553-653 24-139 (162)
No 1
>KOG1990 consensus Poly(A)-specific exoribonuclease PARN [Replication, recombination and repair]
Probab=100.00 E-value=1.3e-63 Score=576.09 Aligned_cols=512 Identities=43% Similarity=0.637 Sum_probs=444.8
Q ss_pred ccCCCchhhccCCHHHHHHHHhhccCCCCcEEeCCCCCCHHHHHHHHHHHHhCCeEEEEecCCchhhHHHHHHHHHHHhC
Q 001577 169 EKEVPTLAELKLSGKELRRLRTLGIGLRKKLKIGKAGITEGIVNGIHERWRHAEVVKIVCEDLCRLNMKRTHDSLERKTG 248 (1050)
Q Consensus 169 ~~~~PtlAEl~Lt~kErr~LRklA~~Lkp~V~IGK~GLTq~VVeeI~~~Wkk~ELVKIKv~g~~~~dmke~aE~LEekTG 248 (1050)
... |++|++++...++.+||..|..+.... +++|+|+.+++.|+..|+.+|+++++|......+|.+++++++..||
T Consensus 4 ~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~y~k~k~~~~~~~~~q~~~~~~~~~~~~~~~~~~~ 80 (564)
T KOG1990|consen 4 FES-LSLAELTVDEADLRRLRLVATGMTSAP--WKAGSTFDTVEAIYLKWKDNEEVYLKFQFGLCLFMKRSHEALEMSTG 80 (564)
T ss_pred ccc-hhHHHhhcCHHHHHHHhhhhccceecc--cccccchhhhHHHHHHHHhhhhhheeeccccchhHHHhhhHhhccCC
Confidence 445 999999999999999999999998887 99999999999999999999999999997777799999999999999
Q ss_pred C-EEEEEECcEEEEEecCCCCCCccccCCCCCCCCCCCCCCCcccccccccccccccCCCCCCCCCCCCCCchhhhhhhc
Q 001577 249 G-LVVWRSGSKIILYRGADYKYPYFLADESSTDGASSDDLPNQLVDDEGLDETKTHSSGADSAKPSGQSPTNKKVQQTLI 327 (1050)
Q Consensus 249 G-~VV~riG~tiVLYRGknY~~P~~~~~~~~~~e~~~~~~~~~~~d~e~~~~~~~~~ss~~~~~~~g~~~e~~~~~~~l~ 327 (1050)
| -+||+.|.....|++..|..|..-..+. ++ .....|
T Consensus 81 ~n~~~~~~g~~~s~~~~~~~~~~~~~~~~~--------------~~---------------------------~~~~~~- 118 (564)
T KOG1990|consen 81 GNFVVWSRGDSISSPEFLCQRSPVDFVARQ--------------QE---------------------------NQAGKW- 118 (564)
T ss_pred CceeeeecCccccCCccceeecchhhhhhh--------------ch---------------------------hhhhhh-
Confidence 9 9999999999999888887764322210 00 000000
Q ss_pred cccCCCCccccCCcchhhhHHHHHHhhhccCCCccCCCCCCCCccCCCCCCcccCCCCCCcccCCCCCCCCCCHHHHHHH
Q 001577 328 HSVGSPDKLRYQLPGEAELVEEADRLLDGLGPRFTDWWGYDPQPVDADLLPATVPGYRRPFRLLPYGVQPKLTNDEMTTL 407 (1050)
Q Consensus 328 ~~~~~~~~~~~~~p~E~~~e~E~~~LLd~LGPRF~dW~g~~PlPVDAdLLP~vVpgyk~PfR~lP~gvr~~LT~eErt~L 407 (1050)
...++.+.+.+++++||+|.+||+.+|+|+|+|++|++||+|.+|||.+|+|++++||..|.+++
T Consensus 119 ---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dl~~~~i~~~~~p~r~l~~~~~~~l~~~~~~~~ 183 (564)
T KOG1990|consen 119 ---------------PSELEKEKNELLDSLGPELSDWGGSDRLSVDADLLPEKIPDYMRPFRTLPVGSPPLLTSIESTLL 183 (564)
T ss_pred ---------------HHHHHHHHHHHhhccCcccccCCCCCCccchhhhchhhhhcccChhccCCCCChhhhhhHHHHHH
Confidence 13456789999999999999999999999999999999999999999999999999999999999
Q ss_pred HHcCCCCCceeEeccCcchHHHHHHHHHHhhcCcceEEEecCCCCcchHHHHHHHH--HhhhCCEEEEeeCCEEEEEcCC
Q 001577 408 RRLGRPLPCHFALGRNRNLQGLAAAIVKLWEKCEIAKIAVKRGAQNTNSEMMAQEL--KRLTGGTLLSRDREFIVFYRGK 485 (1050)
Q Consensus 408 Rklar~lpph~~LGRngv~~GLa~aI~k~we~~ELVKI~~K~gv~~td~e~ma~~L--k~lTGGvLVsr~k~~IIlYRGK 485 (1050)
|++|..+||||++|+++..+|++.+|+.+|++|+++++.+++|+..+..+.+|..| ...||++||++|+.++|+|||+
T Consensus 184 r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~k~~~~k~~~~rg~~~~~~~~~a~~l~~~~~tg~~lv~hN~~~dv~y~~~ 263 (564)
T KOG1990|consen 184 RRLGYKLPPHFALGRSRKLQGLAVAMVSFWEKHEFAKILIKRGVLETRKERMADELQELLLTGKVLVLHNKLLDVMYRYK 263 (564)
T ss_pred HHhcccccccceehhccccccchhHHHHHHHHHHHHHHHHHhcchhhhccchHHHHHHHHhcCCeEEeeccceeeeeehh
Confidence 99999999999999999999999999999999999999999999999999999999 9999999999999999999999
Q ss_pred CCCChhhhhHHHHHHhhhccccCCCCCccccCCC--CCCCCCCCccccc---cccccccc------hhhHHHHHHHHHHH
Q 001577 486 DFLPPAASSAIEERRKHEFSTSNDSKEEPELGNR--HDNSGDNTQDEFG---CTNDQKST------MHSEQKERRSAEVA 554 (1050)
Q Consensus 486 dflp~~V~~~L~eR~~l~~s~~~~~~~~~e~~~~--~~~s~~~t~~~~~---~~~~~~~~------~~~~~~~~~~~~~~ 554 (1050)
+|++ .+..+|.++++.....+............ ++....+|..++. +.++..-. ....+......++.
T Consensus 264 ~Fl~-~lp~~l~~f~~~~~~fp~~~~~~~~~~~~~~~~~~~~~t~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 342 (564)
T KOG1990|consen 264 NFLS-PLPSTLEEFTDSSSMFPNIEDTKRLAKLSEYQKLNLKATLLELARAKAKKEKEIERRSISSRLKLEFEKASSEKL 342 (564)
T ss_pred hccc-ccchhHHHhhhhhhhhhhhHHHHHhhccccccchhhhhhHHHHHHHhcccccCcccccccchhhhhhhccchhhH
Confidence 9999 89999999988632222221100000100 1112233333321 11111000 01222334566778
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcc-cCCCCcccCcCCCCHHHHHHHHhcccCCCceEEeccCCCCHHHHHHHHHHHh
Q 001577 555 IRRTNIRLSRVLEKKAEAEKLLAELEE-ERPEQYEVDKEGITEEERYMLRKVGLRMKAFLLMGRRGVFDGTVENMHLHWK 633 (1050)
Q Consensus 555 ~k~~e~kL~~~~~K~~kae~ll~kle~-~~P~~~~~d~E~LT~eERk~LRkiGhkmkPvV~IGK~GVtdgVIeeIh~ALk 633 (1050)
.+.+++++..+..|...++..+++++. ..|.+..+|+|.+|.+++.+++++|.+|++++.+|++|+|+|++.+||+||+
T Consensus 343 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~t~ee~~~~~k~g~k~~~~~~~~rrg~f~g~i~n~~l~wk 422 (564)
T KOG1990|consen 343 TEAIFHKLEKAKKKLASANRILAKLEDPKIPAELRYDPESITEEERLMLRKVGLKMKRRLLSGRRGVFDGVIENMHLHWK 422 (564)
T ss_pred HHHHHHHHhhhhhhccchhhhhhcccccccccccccchhhcChHHHHHHHHHHHhhccccccCCcccccceeecchhhhh
Confidence 899999999999999999999999998 8899998999999999999999999999999999999999999999999999
Q ss_pred ccceEEEEecCCCH-HHHHHHHHHHHHHhCCEEEEeeecccCcEEEEEccCCCCCCCcCCccccccHHHHHHHhHHHHHH
Q 001577 634 HRELVKIISKQRKI-EAALQEARTLEVESGGILVAVERVNKGYAIILYRGKNYERPACLRPKTLLTKREAMKRSLEAQRR 712 (1050)
Q Consensus 634 ~rELIKVkv~~n~~-ed~keiA~~Le~~SGgeLVqv~k~~IG~~IILYRgKNY~rP~~l~P~~lLtKrkAl~rs~e~qr~ 712 (1050)
+||++|++|+.... .++++.|..++.++|+++|+++++..|++|+.||++||.+|..++|.++|+|++|+.+++++|++
T Consensus 423 ~~e~~k~i~~~~~~~~~~~~~a~~le~esg~~~v~~~~~~~~~ai~~yr~k~y~~p~~l~P~~~l~~~k~~~~~~~~~~~ 502 (564)
T KOG1990|consen 423 SRELVKVICKEKNLPSQVKQYASALERESGGILVSIDKNPKGYAIIAYRGKNYDRPTSLRPRNLLSRRKALERSLEEQRK 502 (564)
T ss_pred hcccceeeeccccccHHHHHHHHHHHHHhCCceeeeccCCchhhHHHhhhhhccCCcccCchhhhcccCCccccHHHHHH
Confidence 99999999999877 99999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhccc
Q 001577 713 QSLKLHVLELTRNIEKLKLQLVKDKEANS 741 (1050)
Q Consensus 713 ~sl~~~i~~l~~~i~~l~~~l~~~~~~~~ 741 (1050)
++++.||..+..++++++.++..+...+.
T Consensus 503 ~a~~~~i~~~~~~~e~~~~~~~~~~~~~~ 531 (564)
T KOG1990|consen 503 EALKSHISDLEQEIEQLQASVEAMPAINK 531 (564)
T ss_pred HHHhhhcchhhhhHHHhhcchhccccccc
Confidence 99999999999999999999877766665
No 2
>PRK10343 RNA-binding protein YhbY; Provisional
Probab=99.91 E-value=3.2e-24 Score=199.99 Aligned_cols=89 Identities=21% Similarity=0.398 Sum_probs=86.9
Q ss_pred CCCHHHHHHHHhcccCCCceEEeccCCCCHHHHHHHHHHHhccceEEEEecCCCHHHHHHHHHHHHHHhCCEEEEeeecc
Q 001577 593 GITEEERYMLRKVGLRMKAFLLMGRRGVFDGTVENMHLHWKHRELVKIISKQRKIEAALQEARTLEVESGGILVAVERVN 672 (1050)
Q Consensus 593 ~LT~eERk~LRkiGhkmkPvV~IGK~GVtdgVIeeIh~ALk~rELIKVkv~~n~~ed~keiA~~Le~~SGgeLVqv~k~~ 672 (1050)
+||++||+|||++||+++|+|+||++||+|+|+.++++||++||||||++++++.++++++|.+||+.+||++||+
T Consensus 2 ~Lt~kqr~~LR~~ah~l~Pvv~IGk~Glt~~vi~ei~~aL~~hELIKvkv~~~~~~~~~e~~~~i~~~~~ae~Vq~---- 77 (97)
T PRK10343 2 NLSTKQKQHLKGLAHPLKPVVLLGSNGLTEGVLAEIEQALEHHELIKVKIATEDRETKTLIVEAIVRETGACNVQV---- 77 (97)
T ss_pred CCCHHHHHHHHHhcCCCCCeEEECCCCCCHHHHHHHHHHHHHCCcEEEEecCCChhHHHHHHHHHHHHHCCEEEee----
Confidence 4999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCcEEEEEccCCC
Q 001577 673 KGYAIILYRGKNY 685 (1050)
Q Consensus 673 IG~~IILYRgKNY 685 (1050)
||+++||||.++.
T Consensus 78 IG~~~vlYR~~~~ 90 (97)
T PRK10343 78 IGKTLVLYRPTKE 90 (97)
T ss_pred eCcEEEEEecCCC
Confidence 9999999999863
No 3
>TIGR00253 RNA_bind_YhbY putative RNA-binding protein, YhbY family. A combination of crystal structure, molecular modeling, and bioinformatic data together suggest that members of this family, including YhbY of E. coli, are RNA binding proteins.
Probab=99.91 E-value=3.5e-24 Score=199.08 Aligned_cols=88 Identities=28% Similarity=0.451 Sum_probs=86.1
Q ss_pred CCHHHHHHHHhcccCCCceEEeccCCCCHHHHHHHHHHHhccceEEEEecCCCHHHHHHHHHHHHHHhCCEEEEeeeccc
Q 001577 594 ITEEERYMLRKVGLRMKAFLLMGRRGVFDGTVENMHLHWKHRELVKIISKQRKIEAALQEARTLEVESGGILVAVERVNK 673 (1050)
Q Consensus 594 LT~eERk~LRkiGhkmkPvV~IGK~GVtdgVIeeIh~ALk~rELIKVkv~~n~~ed~keiA~~Le~~SGgeLVqv~k~~I 673 (1050)
||++||+|||++||+|+|+|+||++||||+||+++++||++||||||+++++++++++++|.+||+.+||++||+ |
T Consensus 1 Lt~kqr~~Lr~~ah~l~p~v~IGK~Glt~~vi~ei~~aL~~hELIKVkvl~~~~~~~~e~a~~i~~~~~a~~Vq~----i 76 (95)
T TIGR00253 1 LTGKQKRHLRGKAHHLKPVVLVGKNGLTEGVIKEIEQALEHRELIKVKVATEDREDKTLIAEALVKETGACNVQV----I 76 (95)
T ss_pred CCHHHHHHHHHHhCCCCCeEEECCCCCCHHHHHHHHHHHHhCCcEEEEecCCChhHHHHHHHHHHHHHCCEEEEE----E
Confidence 799999999999999999999999999999999999999999999999999999999999999999999999999 9
Q ss_pred CcEEEEEccCCC
Q 001577 674 GYAIILYRGKNY 685 (1050)
Q Consensus 674 G~~IILYRgKNY 685 (1050)
|+++||||+++.
T Consensus 77 G~~~vlYR~~~~ 88 (95)
T TIGR00253 77 GKTIVLYRPTKE 88 (95)
T ss_pred ccEEEEEecCCc
Confidence 999999999754
No 4
>PF01985 CRS1_YhbY: CRS1 / YhbY (CRM) domain; InterPro: IPR001890 The CRM domain is an ~100-amino acid RNA-binding domain. The name chloroplast RNA splicing and ribosome maturation (CRM) has been suggested to reflect the functions established for the four characterised members of the family: Zea mays (Maize) CRS1 (Q9FYT6 from SWISSPROT), CAF1 (Q84N49 from SWISSPROT) and CAF2 (Q84N48 from SWISSPROT) proteins and the Escherichia coli protein YhbY (P0AGK4 from SWISSPROT). The CRM domain is found in eubacteria, archaea, and plants. The CRM domain is represented as a stand-alone protein in archaea and bacteria, and in single- and multi-domain proteins in plants. It has been suggested that prokaryotic CRM proteins existed as ribosome-associated proteins prior to the divergence of archaea and bacteria, and that they were co-opted in the plant lineage as RNA binding modules by incorporation into diverse protein contexts. Plant CRM domains are predicted to reside not only in the chloroplast, but also in the mitochondrion and the nucleo/cytoplasmic compartment. The diversity of the CRM domain family in plants suggests a diverse set of RNA targets [, ]. The CRM domain is a compact alpha/beta domain consisting of a four-stranded beta sheet and three alpha helices with an alpha-beta-alpha-beta-alpha-beta-beta topology. The beta sheet face is basic, consistent with a role in RNA binding. Proximal to the basic beta sheet face is another moiety that could contribute to nucleic acid recognition. Connecting strand beta1 and helix alpha2 is a loop with a six amino acid motif, GxxG flanked by large aliphatic residues, within which one 'x' is typically a basic residue []. Escherichia coli YhbY is associated with pre-50S ribosomal subunits, which implies a function in ribosome assembly. GFP fused to a single-domain CRM protein from maize localises to the nucleolus, suggesting that an analogous activity may have been retained in plants []. A CRM domain containing protein in plant chloroplasts has been shown to function in group I and II intron splicing []. In vitro experiments with an isolated maize CRM domain have shown it to have RNA binding activity. These and other results suggest that the CRM domain evolved in the context of ribosome function prior to the divergence of Archaea and Bacteria, that this function has been maintained in extant prokaryotes, and that the domain was recruited to serve as an RNA binding module during the evolution of plant genomes []. YhbY has a fold similar to that of the C-terminal domain of translation initiation factor 3 (IF3C), which binds to 16S rRNA in the 30S ribosome [].; GO: 0003723 RNA binding; PDB: 1RQ8_A 1JO0_B 1LN4_A.
Probab=99.90 E-value=1.2e-23 Score=190.60 Aligned_cols=84 Identities=37% Similarity=0.580 Sum_probs=75.7
Q ss_pred CCHHHHHHHHhhccCCCCcEEeCCCCCCHHHHHHHHHHHHhCCeEEEEecCCchhhHHHHHHHHHHHhCCEEEEEECcEE
Q 001577 180 LSGKELRRLRTLGIGLRKKLKIGKAGITEGIVNGIHERWRHAEVVKIVCEDLCRLNMKRTHDSLERKTGGLVVWRSGSKI 259 (1050)
Q Consensus 180 Lt~kErr~LRklA~~Lkp~V~IGK~GLTq~VVeeI~~~Wkk~ELVKIKv~g~~~~dmke~aE~LEekTGG~VV~riG~ti 259 (1050)
||++|+++||+.||+|+|+|+|||+|+|++|+++|+++|++||||||+|.+++..+++++|++|+++|||++||++|+++
T Consensus 1 Lt~ke~~~Lr~~a~~l~p~v~IGk~Glt~~vi~~i~~~l~~~eLvKVk~~~~~~~~~~~~~~~l~~~t~~~~V~~iG~~~ 80 (84)
T PF01985_consen 1 LTSKERKFLRKLAHHLKPVVQIGKNGLTDGVIEEIDDALEKHELVKVKVLGNCREDRKEIAEQLAEKTGAEVVQVIGRTI 80 (84)
T ss_dssp --HHHHHHHHHHHTTC--SEEE-TTSS-HHHHHHHHHHHHHHSEEEEEETT--HHHHHHHHHHHHHHHTEEEEEEETTEE
T ss_pred CCHHHHHHHHHHhcCCCCeEEECCCCCCHHHHHHHHHHHHhCCeeEEEEccCCHHHHHHHHHHHHHHhCCEEEEEECCEE
Confidence 79999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEe
Q 001577 260 ILYR 263 (1050)
Q Consensus 260 VLYR 263 (1050)
||||
T Consensus 81 vlyR 84 (84)
T PF01985_consen 81 VLYR 84 (84)
T ss_dssp EEEE
T ss_pred EEEC
Confidence 9998
No 5
>COG1534 Predicted RNA-binding protein containing KH domain, possibly ribosomal protein [Translation, ribosomal structure and biogenesis]
Probab=99.89 E-value=4.1e-23 Score=192.04 Aligned_cols=96 Identities=27% Similarity=0.443 Sum_probs=89.4
Q ss_pred CCCHHHHHHHHhcccCCCceEEeccCCCCHHHHHHHHHHHhccceEEEEecCCCHHHHHHHHHHHHHHhCCEEEEeeecc
Q 001577 593 GITEEERYMLRKVGLRMKAFLLMGRRGVFDGTVENMHLHWKHRELVKIISKQRKIEAALQEARTLEVESGGILVAVERVN 672 (1050)
Q Consensus 593 ~LT~eERk~LRkiGhkmkPvV~IGK~GVtdgVIeeIh~ALk~rELIKVkv~~n~~ed~keiA~~Le~~SGgeLVqv~k~~ 672 (1050)
+||++|++|||+.||+++|+|+||++|||++||.+|+++|++||||||++.+++.++.+++|+.|++.+||.+||+
T Consensus 1 ~Lt~kq~~~Lrs~Ah~l~piv~IGk~Glte~vi~Ei~~aL~~reLIKVkvl~~~~edr~eia~~l~~~~~a~lVqv---- 76 (97)
T COG1534 1 MLTGKQKRFLRSKAHHLKPIVQIGKNGLTEGVIKEIDRALEARELIKVKVLQNAREDKKEIAEALAEETGAELVQV---- 76 (97)
T ss_pred CCcHHHHHHHHHhhccCCceEEecCCccCHHHHHHHHHHHHhCCcEEEEeeccchhhHHHHHHHHHHHhCCEEeee----
Confidence 6999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCcEEEEEccCCCCCCCcCCc
Q 001577 673 KGYAIILYRGKNYERPACLRP 693 (1050)
Q Consensus 673 IG~~IILYRgKNY~rP~~l~P 693 (1050)
||+++|||| .+..+++..+|
T Consensus 77 iG~~~vlyr-~~~e~~~i~l~ 96 (97)
T COG1534 77 IGKTLVLYR-ESKEKRKISLP 96 (97)
T ss_pred eeeEEEEEe-cCcccccccCC
Confidence 999999999 44444444444
No 6
>PF01985 CRS1_YhbY: CRS1 / YhbY (CRM) domain; InterPro: IPR001890 The CRM domain is an ~100-amino acid RNA-binding domain. The name chloroplast RNA splicing and ribosome maturation (CRM) has been suggested to reflect the functions established for the four characterised members of the family: Zea mays (Maize) CRS1 (Q9FYT6 from SWISSPROT), CAF1 (Q84N49 from SWISSPROT) and CAF2 (Q84N48 from SWISSPROT) proteins and the Escherichia coli protein YhbY (P0AGK4 from SWISSPROT). The CRM domain is found in eubacteria, archaea, and plants. The CRM domain is represented as a stand-alone protein in archaea and bacteria, and in single- and multi-domain proteins in plants. It has been suggested that prokaryotic CRM proteins existed as ribosome-associated proteins prior to the divergence of archaea and bacteria, and that they were co-opted in the plant lineage as RNA binding modules by incorporation into diverse protein contexts. Plant CRM domains are predicted to reside not only in the chloroplast, but also in the mitochondrion and the nucleo/cytoplasmic compartment. The diversity of the CRM domain family in plants suggests a diverse set of RNA targets [, ]. The CRM domain is a compact alpha/beta domain consisting of a four-stranded beta sheet and three alpha helices with an alpha-beta-alpha-beta-alpha-beta-beta topology. The beta sheet face is basic, consistent with a role in RNA binding. Proximal to the basic beta sheet face is another moiety that could contribute to nucleic acid recognition. Connecting strand beta1 and helix alpha2 is a loop with a six amino acid motif, GxxG flanked by large aliphatic residues, within which one 'x' is typically a basic residue []. Escherichia coli YhbY is associated with pre-50S ribosomal subunits, which implies a function in ribosome assembly. GFP fused to a single-domain CRM protein from maize localises to the nucleolus, suggesting that an analogous activity may have been retained in plants []. A CRM domain containing protein in plant chloroplasts has been shown to function in group I and II intron splicing []. In vitro experiments with an isolated maize CRM domain have shown it to have RNA binding activity. These and other results suggest that the CRM domain evolved in the context of ribosome function prior to the divergence of Archaea and Bacteria, that this function has been maintained in extant prokaryotes, and that the domain was recruited to serve as an RNA binding module during the evolution of plant genomes []. YhbY has a fold similar to that of the C-terminal domain of translation initiation factor 3 (IF3C), which binds to 16S rRNA in the 30S ribosome [].; GO: 0003723 RNA binding; PDB: 1RQ8_A 1JO0_B 1LN4_A.
Probab=99.89 E-value=3.2e-23 Score=187.87 Aligned_cols=84 Identities=32% Similarity=0.493 Sum_probs=75.6
Q ss_pred CCHHHHHHHHhcccCCCceEEeccCCCCHHHHHHHHHHHhccceEEEEecCCCHHHHHHHHHHHHHHhCCEEEEeeeccc
Q 001577 594 ITEEERYMLRKVGLRMKAFLLMGRRGVFDGTVENMHLHWKHRELVKIISKQRKIEAALQEARTLEVESGGILVAVERVNK 673 (1050)
Q Consensus 594 LT~eERk~LRkiGhkmkPvV~IGK~GVtdgVIeeIh~ALk~rELIKVkv~~n~~ed~keiA~~Le~~SGgeLVqv~k~~I 673 (1050)
||++||++||++||+|+|+|+|||+|||++|+++|++||++||||||+|.+++.++++++|..|++.+||++|++ +
T Consensus 1 Lt~ke~~~Lr~~a~~l~p~v~IGk~Glt~~vi~~i~~~l~~~eLvKVk~~~~~~~~~~~~~~~l~~~t~~~~V~~----i 76 (84)
T PF01985_consen 1 LTSKERKFLRKLAHHLKPVVQIGKNGLTDGVIEEIDDALEKHELVKVKVLGNCREDRKEIAEQLAEKTGAEVVQV----I 76 (84)
T ss_dssp --HHHHHHHHHHHTTC--SEEE-TTSS-HHHHHHHHHHHHHHSEEEEEETT--HHHHHHHHHHHHHHHTEEEEEE----E
T ss_pred CCHHHHHHHHHHhcCCCCeEEECCCCCCHHHHHHHHHHHHhCCeeEEEEccCCHHHHHHHHHHHHHHhCCEEEEE----E
Confidence 799999999999999999999999999999999999999999999999999999999999999999999999998 9
Q ss_pred CcEEEEEc
Q 001577 674 GYAIILYR 681 (1050)
Q Consensus 674 G~~IILYR 681 (1050)
|+++||||
T Consensus 77 G~~~vlyR 84 (84)
T PF01985_consen 77 GRTIVLYR 84 (84)
T ss_dssp TTEEEEEE
T ss_pred CCEEEEEC
Confidence 99999998
No 7
>PRK10343 RNA-binding protein YhbY; Provisional
Probab=99.89 E-value=1.1e-22 Score=189.89 Aligned_cols=89 Identities=27% Similarity=0.428 Sum_probs=86.7
Q ss_pred ccCCHHHHHHHHhhccCCCCcEEeCCCCCCHHHHHHHHHHHHhCCeEEEEecCCchhhHHHHHHHHHHHhCCEEEEEECc
Q 001577 178 LKLSGKELRRLRTLGIGLRKKLKIGKAGITEGIVNGIHERWRHAEVVKIVCEDLCRLNMKRTHDSLERKTGGLVVWRSGS 257 (1050)
Q Consensus 178 l~Lt~kErr~LRklA~~Lkp~V~IGK~GLTq~VVeeI~~~Wkk~ELVKIKv~g~~~~dmke~aE~LEekTGG~VV~riG~ 257 (1050)
|+||++|+++||++||+|+|+|+|||+|||++|+++|.++|++||||||++.+++.++.++++++|++.||+++||.||+
T Consensus 1 m~Lt~kqr~~LR~~ah~l~Pvv~IGk~Glt~~vi~ei~~aL~~hELIKvkv~~~~~~~~~e~~~~i~~~~~ae~Vq~IG~ 80 (97)
T PRK10343 1 MNLSTKQKQHLKGLAHPLKPVVLLGSNGLTEGVLAEIEQALEHHELIKVKIATEDRETKTLIVEAIVRETGACNVQVIGK 80 (97)
T ss_pred CCCCHHHHHHHHHhcCCCCCeEEECCCCCCHHHHHHHHHHHHHCCcEEEEecCCChhHHHHHHHHHHHHHCCEEEeeeCc
Confidence 46999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEEEecCC
Q 001577 258 KIILYRGAD 266 (1050)
Q Consensus 258 tiVLYRGkn 266 (1050)
++||||++.
T Consensus 81 ~~vlYR~~~ 89 (97)
T PRK10343 81 TLVLYRPTK 89 (97)
T ss_pred EEEEEecCC
Confidence 999999964
No 8
>TIGR00253 RNA_bind_YhbY putative RNA-binding protein, YhbY family. A combination of crystal structure, molecular modeling, and bioinformatic data together suggest that members of this family, including YhbY of E. coli, are RNA binding proteins.
Probab=99.88 E-value=2.2e-22 Score=187.14 Aligned_cols=87 Identities=31% Similarity=0.487 Sum_probs=85.2
Q ss_pred CCHHHHHHHHhhccCCCCcEEeCCCCCCHHHHHHHHHHHHhCCeEEEEecCCchhhHHHHHHHHHHHhCCEEEEEECcEE
Q 001577 180 LSGKELRRLRTLGIGLRKKLKIGKAGITEGIVNGIHERWRHAEVVKIVCEDLCRLNMKRTHDSLERKTGGLVVWRSGSKI 259 (1050)
Q Consensus 180 Lt~kErr~LRklA~~Lkp~V~IGK~GLTq~VVeeI~~~Wkk~ELVKIKv~g~~~~dmke~aE~LEekTGG~VV~riG~ti 259 (1050)
||++|+++||++||+|+|.|+|||+|||++|+++|+++|++||||||++..++..+.+++|++|++.||+++|+.+|+++
T Consensus 1 Lt~kqr~~Lr~~ah~l~p~v~IGK~Glt~~vi~ei~~aL~~hELIKVkvl~~~~~~~~e~a~~i~~~~~a~~Vq~iG~~~ 80 (95)
T TIGR00253 1 LTGKQKRHLRGKAHHLKPVVLVGKNGLTEGVIKEIEQALEHRELIKVKVATEDREDKTLIAEALVKETGACNVQVIGKTI 80 (95)
T ss_pred CCHHHHHHHHHHhCCCCCeEEECCCCCCHHHHHHHHHHHHhCCcEEEEecCCChhHHHHHHHHHHHHHCCEEEEEEccEE
Confidence 79999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEecCC
Q 001577 260 ILYRGAD 266 (1050)
Q Consensus 260 VLYRGkn 266 (1050)
||||++.
T Consensus 81 vlYR~~~ 87 (95)
T TIGR00253 81 VLYRPTK 87 (95)
T ss_pred EEEecCC
Confidence 9999964
No 9
>COG1534 Predicted RNA-binding protein containing KH domain, possibly ribosomal protein [Translation, ribosomal structure and biogenesis]
Probab=99.83 E-value=1.8e-20 Score=174.54 Aligned_cols=86 Identities=31% Similarity=0.499 Sum_probs=84.4
Q ss_pred cCCHHHHHHHHhhccCCCCcEEeCCCCCCHHHHHHHHHHHHhCCeEEEEecCCchhhHHHHHHHHHHHhCCEEEEEECcE
Q 001577 179 KLSGKELRRLRTLGIGLRKKLKIGKAGITEGIVNGIHERWRHAEVVKIVCEDLCRLNMKRTHDSLERKTGGLVVWRSGSK 258 (1050)
Q Consensus 179 ~Lt~kErr~LRklA~~Lkp~V~IGK~GLTq~VVeeI~~~Wkk~ELVKIKv~g~~~~dmke~aE~LEekTGG~VV~riG~t 258 (1050)
+||++|+++||+.||+++|.|+|||+|||++|+++|.++|++||||||++.+.+..+.+++|+.|++.+|+.+|+.+|++
T Consensus 1 ~Lt~kq~~~Lrs~Ah~l~piv~IGk~Glte~vi~Ei~~aL~~reLIKVkvl~~~~edr~eia~~l~~~~~a~lVqviG~~ 80 (97)
T COG1534 1 MLTGKQKRFLRSKAHHLKPIVQIGKNGLTEGVIKEIDRALEARELIKVKVLQNAREDKKEIAEALAEETGAELVQVIGKT 80 (97)
T ss_pred CCcHHHHHHHHHhhccCCceEEecCCccCHHHHHHHHHHHHhCCcEEEEeeccchhhHHHHHHHHHHHhCCEEeeeeeeE
Confidence 58999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEEec
Q 001577 259 IILYRG 264 (1050)
Q Consensus 259 iVLYRG 264 (1050)
+||||.
T Consensus 81 ~vlyr~ 86 (97)
T COG1534 81 LVLYRE 86 (97)
T ss_pred EEEEec
Confidence 999994
No 10
>KOG1990 consensus Poly(A)-specific exoribonuclease PARN [Replication, recombination and repair]
Probab=99.77 E-value=1.2e-19 Score=211.56 Aligned_cols=303 Identities=21% Similarity=0.284 Sum_probs=218.1
Q ss_pred hccCCCchhhccCCHHHHHHHHhhccCCCCcEEeCCCCCCHHHHHHHHHHHHhCCeEEEEec-CCchhhHHHHHHHH--H
Q 001577 168 KEKEVPTLAELKLSGKELRRLRTLGIGLRKKLKIGKAGITEGIVNGIHERWRHAEVVKIVCE-DLCRLNMKRTHDSL--E 244 (1050)
Q Consensus 168 k~~~~PtlAEl~Lt~kErr~LRklA~~Lkp~V~IGK~GLTq~VVeeI~~~Wkk~ELVKIKv~-g~~~~dmke~aE~L--E 244 (1050)
-.+.+|.+.+..|+..+...+|.++...+++|.+|.++.-+++.-.+...|.+|+.+|+.+. +.+....+.++.++ -
T Consensus 163 p~r~l~~~~~~~l~~~~~~~~r~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~k~~~~k~~~~rg~~~~~~~~~a~~l~~~ 242 (564)
T KOG1990|consen 163 PFRTLPVGSPPLLTSIESTLLRRLGYKLPPHFALGRSRKLQGLAVAMVSFWEKHEFAKILIKRGVLETRKERMADELQEL 242 (564)
T ss_pred hhccCCCCChhhhhhHHHHHHHHhcccccccceehhccccccchhHHHHHHHHHHHHHHHHHhcchhhhccchHHHHHHH
Confidence 35788999999999999999999999999999999999999999999999999999998886 88888888999999 8
Q ss_pred HHhCCEEEEEECcEEEEEecCCCCCCccccCCCCCCCCCCCCCCCcccccccccccccccCCCCCCCCCCCCCCchhhhh
Q 001577 245 RKTGGLVVWRSGSKIILYRGADYKYPYFLADESSTDGASSDDLPNQLVDDEGLDETKTHSSGADSAKPSGQSPTNKKVQQ 324 (1050)
Q Consensus 245 ekTGG~VV~riG~tiVLYRGknY~~P~~~~~~~~~~e~~~~~~~~~~~d~e~~~~~~~~~ss~~~~~~~g~~~e~~~~~~ 324 (1050)
..||+.||.++|-..|+||+++|.. ............ ... ..+...........+ .......++..+...+..
T Consensus 243 ~~tg~~lv~hN~~~dv~y~~~~Fl~-~lp~~l~~f~~~-~~~----fp~~~~~~~~~~~~~-~~~~~~~~t~~e~~~~~~ 315 (564)
T KOG1990|consen 243 LLTGKVLVLHNKLLDVMYRYKNFLS-PLPSTLEEFTDS-SSM----FPNIEDTKRLAKLSE-YQKLNLKATLLELARAKA 315 (564)
T ss_pred HhcCCeEEeeccceeeeeehhhccc-ccchhHHHhhhh-hhh----hhhhHHHHHhhcccc-ccchhhhhhHHHHHHHhc
Confidence 9999999999999999999999976 333322100000 001 111111111111111 111122344444445555
Q ss_pred hhccccCCCCcc-ccCCcchh----hh--------------HHHHHHhhhccCCCccCCCCCCCCccCCCCCCcccCCCC
Q 001577 325 TLIHSVGSPDKL-RYQLPGEA----EL--------------VEEADRLLDGLGPRFTDWWGYDPQPVDADLLPATVPGYR 385 (1050)
Q Consensus 325 ~l~~~~~~~~~~-~~~~p~E~----~~--------------e~E~~~LLd~LGPRF~dW~g~~PlPVDAdLLP~vVpgyk 385 (1050)
.|...+...+.. +..++.+. +. +..++..|++++ ....|+.
T Consensus 316 ~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~---------------~~~~~~~----- 375 (564)
T KOG1990|consen 316 KKEKEIERRSISSRLKLEFEKASSEKLTEAIFHKLEKAKKKLASANRILAKLE---------------DPKIPAE----- 375 (564)
T ss_pred ccccCcccccccchhhhhhhccchhhHHHHHHHHHhhhhhhccchhhhhhccc---------------ccccccc-----
Confidence 554444332211 11111111 11 111344444432 1112221
Q ss_pred CCcccCCCCCCCCCCHHHHHHHHHcCCCCCceeEeccCcchHHHHHHHHHHhhcCcceEEEecCCCCcchHHHHHHHHHh
Q 001577 386 RPFRLLPYGVQPKLTNDEMTTLRRLGRPLPCHFALGRNRNLQGLAAAIVKLWEKCEIAKIAVKRGAQNTNSEMMAQELKR 465 (1050)
Q Consensus 386 ~PfR~lP~gvr~~LT~eErt~LRklar~lpph~~LGRngv~~GLa~aI~k~we~~ELVKI~~K~gv~~td~e~ma~~Lk~ 465 (1050)
++.+.+.+|++|+.++++.|..+...+.+||.|+|+|++.++|.||++++++++.||........+..|..++.
T Consensus 376 ------~~~~~e~~t~ee~~~~~k~g~k~~~~~~~~rrg~f~g~i~n~~l~wk~~e~~k~i~~~~~~~~~~~~~a~~le~ 449 (564)
T KOG1990|consen 376 ------LRYDPESITEEERLMLRKVGLKMKRRLLSGRRGVFDGVIENMHLHWKSRELVKVICKEKNLPSQVKQYASALER 449 (564)
T ss_pred ------cccchhhcChHHHHHHHHHHHhhccccccCCcccccceeecchhhhhhcccceeeeccccccHHHHHHHHHHHH
Confidence 12355679999999999999999999999999999999999999999999999999988777999999999999
Q ss_pred hhCCEEEEeeC----CEEEEEcCCCCCChhh---hhHHHHHHhhh
Q 001577 466 LTGGTLLSRDR----EFIVFYRGKDFLPPAA---SSAIEERRKHE 503 (1050)
Q Consensus 466 lTGGvLVsr~k----~~IIlYRGKdflp~~V---~~~L~eR~~l~ 503 (1050)
+.||++|+.+. +.|++|||++|..|.. .+.|.+|..+.
T Consensus 450 esg~~~v~~~~~~~~~ai~~yr~k~y~~p~~l~P~~~l~~~k~~~ 494 (564)
T KOG1990|consen 450 ESGGILVSIDKNPKGYAIIAYRGKNYDRPTSLRPRNLLSRRKALE 494 (564)
T ss_pred HhCCceeeeccCCchhhHHHhhhhhccCCcccCchhhhcccCCcc
Confidence 99999999977 5589999999999876 24555554433
No 11
>PF15337 Vasculin: Vascular protein family Vasculin-like 1
Probab=36.92 E-value=19 Score=35.09 Aligned_cols=41 Identities=24% Similarity=0.419 Sum_probs=32.8
Q ss_pred CCHHHHHHHHHcCCCCCceeEeccCcchHHHHHHHHHHhhcC
Q 001577 399 LTNDEMTTLRRLGRPLPCHFALGRNRNLQGLAAAIVKLWEKC 440 (1050)
Q Consensus 399 LT~eErt~LRklar~lpph~~LGRngv~~GLa~aI~k~we~~ 440 (1050)
||.+|+++|--+.-.|. ---+||||++++..-++-..|...
T Consensus 34 lTEDElkEF~~kseQlr-rNGf~kngfl~~rs~slf~pWr~t 74 (97)
T PF15337_consen 34 LTEDELKEFQVKSEQLR-RNGFGKNGFLQSRSLSLFSPWRST 74 (97)
T ss_pred CcHHHHHHHHHHHHHHH-Hccccccchhhhhhhhcccccccc
Confidence 99999999987775544 356899999999888877778654
No 12
>PF03641 Lysine_decarbox: Possible lysine decarboxylase; InterPro: IPR005269 This entry represents a cytokinin-activating enzyme working in the direct activation pathway. It is a phosphoribohydrolase that converts inactive cytokinin nucleotides to the biologically active free-base forms [, ]. The proteins in this entry belong to the LOG family of proteins.; PDB: 1YDH_B 2Q4D_A 1RCU_C 1WEH_B 3SBX_F 3BQ9_B 2PMB_D 3GH1_D 1WEK_C 3QUA_A ....
Probab=21.32 E-value=1.5e+02 Score=29.66 Aligned_cols=37 Identities=22% Similarity=0.330 Sum_probs=31.1
Q ss_pred HHHHHHhcccCCC-ceEEeccCCCCHHHHHHHHHHHhc
Q 001577 598 ERYMLRKVGLRMK-AFLLMGRRGVFDGTVENMHLHWKH 634 (1050)
Q Consensus 598 ERk~LRkiGhkmk-PvV~IGK~GVtdgVIeeIh~ALk~ 634 (1050)
+--.+.++|.+-+ |++.++.+|.|+.+++.++.....
T Consensus 72 ~~~~~~~l~~~~~~Piil~~~~g~w~~l~~~l~~~~~~ 109 (133)
T PF03641_consen 72 EALTLMQLGRHNKVPIILLNIDGFWDPLLEFLDRMIEE 109 (133)
T ss_dssp HHHHHHHTTSSTS-EEEEEECGGCCHHHHHHHHHHHHT
T ss_pred HHHHHHhhccccCCCEEEeCCcchHHHHHHHHHHHHHC
Confidence 5566788998888 999999999999999999755543
No 13
>PF12207 DUF3600: Domain of unknown function (DUF3600); InterPro: IPR022019 This family of proteins is found in bacteria. Proteins in this family are approximately 230 amino acids in length. This domain is the C-terminal of the putative ecf-type sigma factor negative effector. ; PDB: 3FGG_A 3FH3_A.
Probab=20.39 E-value=32 Score=36.04 Aligned_cols=101 Identities=14% Similarity=0.147 Sum_probs=68.0
Q ss_pred HHHHHHHHHHHHHHHHH---H--HHHHHHHhhcc----cCCCCcccCcCCCCHHHHHHHHhcccCCCceEE------ecc
Q 001577 553 VAIRRTNIRLSRVLEKK---A--EAEKLLAELEE----ERPEQYEVDKEGITEEERYMLRKVGLRMKAFLL------MGR 617 (1050)
Q Consensus 553 ~~~k~~e~kL~~~~~K~---~--kae~ll~kle~----~~P~~~~~d~E~LT~eERk~LRkiGhkmkPvV~------IGK 617 (1050)
..--+++.||+.|+-.. + .=..+|++|-. ..-+.-..|.|-||..++..++++...+.|+|- --|
T Consensus 24 e~Y~~~eaKLqqAK~~lgeeEfeef~~lLK~lt~~kLkygD~NGnidye~ls~~eqee~k~~~~eLqPYFdKLN~~~SsK 103 (162)
T PF12207_consen 24 EDYLRLEAKLQQAKGELGEEEFEEFKELLKKLTNAKLKYGDKNGNIDYEKLSKEEQEEYKKLTMELQPYFDKLNGHKSSK 103 (162)
T ss_dssp HHHHHHHHHHHHHHHCS-HHHHHHHHHHHHHHHHHHHHHB-TTS-B-GGGS-HHHHHHHHHHHHHHHHHHHHHTT---HH
T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhHHhhcccCCCcCHHhCCHHHHHHHHHHHHhcchHHHHhcCCcchh
Confidence 34567899999998542 2 22346666654 346677889999999999999999999999873 344
Q ss_pred CCCCHHHHHHHHHHHhccceEEEEecCCCHHHHHHH
Q 001577 618 RGVFDGTVENMHLHWKHRELVKIISKQRKIEAALQE 653 (1050)
Q Consensus 618 ~GVtdgVIeeIh~ALk~rELIKVkv~~n~~ed~kei 653 (1050)
.=+++.=.+.--.||-..|.|+|+........+.++
T Consensus 104 ~vlt~~E~d~y~eALm~~e~v~vk~~~~~~~~ve~v 139 (162)
T PF12207_consen 104 EVLTQEEYDQYIEALMTYETVRVKTKSSGGITVEEV 139 (162)
T ss_dssp HHS-HHHHHHHHHHHHHHHHHHHHCT-SS---GGGS
T ss_pred hhcCHHHHHHHHHHHhhhheeeeeccCCCCCcHHhc
Confidence 456777777888899999999998886554444433
Done!