Query gi|254780527|ref|YP_003064940.1| hypothetical protein CLIBASIA_02070 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 397 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Sun May 29 22:53:45 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780527.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam02120 Flg_hook Flagellar h 90.3 1.5 3.9E-05 23.6 6.8 70 257-326 9-78 (85) 2 pfam11352 DUF3155 Protein of u 50.5 4.8 0.00012 20.3 0.2 16 362-377 55-70 (90) 3 COG5142 OXR1 Oxidation resista 46.9 4.1 0.0001 20.7 -0.6 65 184-248 15-92 (212) 4 COG0179 MhpD 2-keto-4-pentenoa 45.6 7.4 0.00019 19.0 0.5 26 171-196 65-91 (266) 5 COG4606 CeuB ABC-type enteroch 36.1 6.7 0.00017 19.2 -0.9 49 168-216 171-219 (321) 6 pfam11471 DUF3205 Protein of u 30.8 32 0.00082 14.6 1.9 65 64-139 73-139 (140) 7 KOG0927 consensus 29.6 26 0.00065 15.3 1.2 89 118-211 247-338 (614) 8 TIGR00781 ccoO cytochrome c ox 29.1 19 0.00048 16.2 0.4 43 177-219 117-165 (234) 9 KOG2107 consensus 27.2 18 0.00045 16.4 -0.0 64 22-109 48-113 (179) 10 KOG3870 consensus 26.9 13 0.00033 17.3 -0.7 74 122-213 31-104 (434) 11 pfam11080 DUF2622 Protein of u 24.4 24 0.00062 15.5 0.3 15 61-75 34-48 (96) 12 COG2993 CcoO Cbb3-type cytochr 24.1 24 0.0006 15.5 0.2 43 177-219 119-166 (227) 13 KOG3879 consensus 23.6 44 0.0011 13.7 1.5 46 309-357 173-224 (267) 14 pfam11629 Mst1_SARAH C termina 23.4 43 0.0011 13.8 1.4 22 292-313 26-47 (49) 15 COG1029 FwdB Formylmethanofura 23.3 35 0.0009 14.3 1.0 137 158-299 193-348 (429) 16 COG4085 Predicted RNA-binding 23.1 35 0.0009 14.4 0.9 49 231-279 89-137 (204) 17 TIGR02583 DevR_archaea CRISPR- 20.9 31 0.00079 14.8 0.3 29 224-252 248-280 (311) No 1 >pfam02120 Flg_hook Flagellar hook-length control protein FliK. This is the C terminal domain of FliK. FliK controls the length of the flagellar hook by directly measuring the hook length as a molecular ruler. This family also includes YscP of the Yersinia type III secretion system, and equivalent proteins in other pathogenic bacterial type III secretion systems. Probab=90.28 E-value=1.5 Score=23.62 Aligned_cols=70 Identities=17% Similarity=0.288 Sum_probs=62.6 Q ss_pred CCCEEEEEEEECCCCCCCEEEEEEECCCEEEEEEEECHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEHHH Q ss_conf 6524799999458982004554553484458999951157898876336767753043760110000011 Q gi|254780527|r 257 TGSIKALTIQMKPNSPDNIVATLCLSGEKLSVKLRVESDFLYKKIQSGRQNILDALNFSGYTMDRFDVEF 326 (397) Q Consensus 257 tgsikaltiqmkpnspdnivatlclsgeklsvklrvesdflykkiqsgrqnildalnfsgytmdrfdvef 326 (397) .+....++|++.|..-..+-+++-+.+..++|.+.+++.--...+++....+..+|.-.|+....++|.. T Consensus 9 ~~~~~~~~i~L~p~~LG~v~i~l~~~~~~l~v~~~~~~~~~~~~L~~~~~~L~~~L~~~G~~~~~~~v~~ 78 (85) T pfam02120 9 ANGVWRAELRLDPPELGPVEVRLSLDGGQLSVRFTAENPETRELLEQNLPRLREALAAQGLELADLSVSQ 78 (85) T ss_pred HCCCCEEEEEECCHHCCCEEEEEEEECCEEEEEEEECCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECC T ss_conf 8898289999883682958999999689799999978789999999989999999998699841579835 No 2 >pfam11352 DUF3155 Protein of unknown function (DUF3155). This family of proteins with unknown function appears to be restricted to Cyanobacteria. Probab=50.53 E-value=4.8 Score=20.26 Aligned_cols=16 Identities=38% Similarity=0.879 Sum_probs=12.7 Q ss_pred CCHHHHHHHHCCCCCC Q ss_conf 4101013321134333 Q gi|254780527|r 362 PERERTEKKFFWGEKR 377 (397) Q Consensus 362 perertekkffwgekr 377 (397) -..|.|...||||||- T Consensus 55 rRnehttdR~FW~ekG 70 (90) T pfam11352 55 RRNEHTTDRYFWAEKG 70 (90) T ss_pred EECCCCHHHHHHHCCC T ss_conf 4054311344332034 No 3 >COG5142 OXR1 Oxidation resistance protein [DNA replication, recombination, and repair] Probab=46.91 E-value=4.1 Score=20.71 Aligned_cols=65 Identities=31% Similarity=0.343 Sum_probs=44.2 Q ss_pred HCCCCCCHHHHHHH-HHCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH------------HCCCCEEEEEEC Q ss_conf 13788787888875-212123444654234475365000341026777535765------------350212578852 Q gi|254780527|r 184 SEDVPLNPVDFLLE-KQLDTASLGEEICNLKGFSVPSRYQSSDSLDALMSTESV------------LNQKSFMRVTMI 248 (397) Q Consensus 184 sedvplnpvdflle-kqldtaslgeeicnlkgfsvpsryqssdsldalmstesv------------lnqksfmrvtmi 248 (397) -+|.|+-|++..-. +.....-|-+|||.----+.|.|||++.+..-|.|++.- -|.+.|-||..| T Consensus 15 ~~~~~~~~~~l~~~~~s~K~~llt~e~~~~ire~lp~Ry~~~t~W~llySl~~~G~Sl~t~y~~~~~~~~~frrvg~V 92 (212) T COG5142 15 LPDLPELQFDLMHDGKSYKASLLTEEIVTRIRESLPDRYKYSTSWRLLYSLFENGFSLRTFYESFGENEWPFRRVGFV 92 (212) T ss_pred CCCCCCCCEECCCCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCEEE T ss_conf 666876753011556212124489999999998625788878899998988625652999998748344756673289 No 4 >COG0179 MhpD 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism] Probab=45.61 E-value=7.4 Score=18.98 Aligned_cols=26 Identities=42% Similarity=0.574 Sum_probs=21.6 Q ss_pred CHHHHHHHHHHHHHC-CCCCCHHHHHH Q ss_conf 067898888886313-78878788887 Q gi|254780527|r 171 SFSNYLRHRKEMFSE-DVPLNPVDFLL 196 (397) Q Consensus 171 sfsnylrhrkemfse-dvplnpvdfll 196 (397) -..||..|.+||... ++|..|+.|+- T Consensus 65 vG~NY~~Ha~E~~~~~~~p~~P~~F~K 91 (266) T COG0179 65 VGRNYADHAEEMGKDRDIPEEPVFFLK 91 (266) T ss_pred EECHHHHHHHHHCCCCCCCCCCEEECC T ss_conf 961199999986567788889765024 No 5 >COG4606 CeuB ABC-type enterochelin transport system, permease component [Inorganic ion transport and metabolism] Probab=36.10 E-value=6.7 Score=19.24 Aligned_cols=49 Identities=29% Similarity=0.505 Sum_probs=44.7 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCHHHHHHHHHCCCCCC Q ss_conf 4450678988888863137887878888752121234446542344753 Q gi|254780527|r 168 DSGSFSNYLRHRKEMFSEDVPLNPVDFLLEKQLDTASLGEEICNLKGFS 216 (397) Q Consensus 168 dsgsfsnylrhrkemfsedvplnpvdfllekqldtaslgeeicnlkgfs 216 (397) -.|+||+-+|+|-|..--..|+--+.++......-|.+|||+..--|.+ T Consensus 171 ~~G~Fs~v~~GrYElL~i~~~l~i~ay~~AdrfTi~GmGedfs~NLGLn 219 (321) T COG4606 171 LQGDFSGVLRGRYELLWISIPLAILAYLFADRFTIAGMGEDFSVNLGLN 219 (321) T ss_pred HCCCHHHHHCCCEEEEHHHHHHHHHHHHHHHHEEEECCCHHHHHHCCCC T ss_conf 6075387751532561678999999999860324520556666650724 No 6 >pfam11471 DUF3205 Protein of unknown function (DUF3205). This is a family of phage proteins with unknown function. Probab=30.80 E-value=32 Score=14.63 Aligned_cols=65 Identities=26% Similarity=0.228 Sum_probs=45.5 Q ss_pred HHCCCCCCCCCCHHHHHHCCCCEEECCCCCCCCCCCCEEEEEECCCCCCCCCCHHHH--HCCCCCCHHHHHHHCCCCE Q ss_conf 217775322230455320021100022453154634347875045432246124555--2367010455442208761 Q gi|254780527|r 64 TTIPDADNHIHEDEERLIDNDLIYNSYKGAMYPSASQYFTVRDLSASEKDFPSIESF--ASKSPQDIEDISGFLSPDF 139 (397) Q Consensus 64 ttipdadnhihedeerlidndliynsykgamypsasqyftvrdlsasekdfpsiesf--askspqdiedisgflspdf 139 (397) .+-|-|||-..|..|++|-+ ++---.++|-+.+|+- ..|-+.||+- |...+-.++|||-|+--|. T Consensus 73 a~S~vAD~~~~E~~e~~~~a--------~qptvka~Q~v~~~~~---~~dkktiE~~~~a~~~~~TLkdisk~vKddi 139 (140) T pfam11471 73 AKSEVADSFLAEKREQLILA--------PQPTVKAMQLYMQRFG---LTDKKTIEDLGNATRTNATLEEQSKKLKDDI 139 (140) T ss_pred CCCHHHHHHHHHCCCCCCCC--------CCCCCCCCCEEEEEEC---CCCCCCCCCCCCCCCCCCCHHHHHHHHHCCC T ss_conf 12465677764113201136--------8875430113665313---5876300022222458753999999863037 No 7 >KOG0927 consensus Probab=29.63 E-value=26 Score=15.31 Aligned_cols=89 Identities=21% Similarity=0.326 Sum_probs=60.2 Q ss_pred HHHHCCCCCCHHHHHHHCCCCEE-EEEECCCCCCCCCCHHH-HHHHHHH-HHCCCCCHHHHHHHHHHHHHCCCCCCHHHH Q ss_conf 55523670104554422087614-67624665531144101-8889975-312445067898888886313788787888 Q gi|254780527|r 118 ESFASKSPQDIEDISGFLSPDFS-ILLKNKTDRDYSDGLCT-ITDVFSQ-LKSDSGSFSNYLRHRKEMFSEDVPLNPVDF 194 (397) Q Consensus 118 esfaskspqdiedisgflspdfs-illknktdrdysdglct-itdvfsq-lksdsgsfsnylrhrkemfsedvplnpvdf 194 (397) |.++.--+..|.-+++||.---. +++---.++|.-+|+|| |-++..+ ++--.|.+..|.+.|-|.-+ |++.- T Consensus 247 EPtnhLDleA~~wLee~L~k~d~~~lVi~sh~QDfln~vCT~Ii~l~~kkl~~y~Gnydqy~~tr~E~~~-----~q~K~ 321 (614) T KOG0927 247 EPTNHLDLEAIVWLEEYLAKYDRIILVIVSHSQDFLNGVCTNIIHLDNKKLIYYEGNYDQYVKTRSELEE-----NQMKA 321 (614) T ss_pred CCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHEECCCCEEEECCCHHHHHHHHHHHHH-----HHHHH T ss_conf 8756788999999999998535761899935055554675465122356315643778997437888767-----88899 Q ss_pred HHHHHCCHHHHHHHHHC Q ss_conf 87521212344465423 Q gi|254780527|r 195 LLEKQLDTASLGEEICN 211 (397) Q Consensus 195 llekqldtaslgeeicn 211 (397) ....|-+-++.-.-|.+ T Consensus 322 ~~kqqk~i~~~K~~ia~ 338 (614) T KOG0927 322 YEKQQKQIAHMKDLIAR 338 (614) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99877577776678876 No 8 >TIGR00781 ccoO cytochrome c oxidase, cbb3-type, subunit II; InterPro: IPR003468 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO) . Cytochrome cbb3 oxidases are required both to support symbiotic nitrogen fixation, whilst ensuring that the oxygen-labile nitrogenase is not compromised. Cytochrome cbb3 oxidases consist of four subunits: FixN (or CcoN), FixO (or CcoO), FixP (or CcoP) and FixQ (or CcoQ). The catalytic core is comprised of subunits FixN, FixO and FixP, where FixN acts as the catalytic subunit, and Fix O and FixP are membrane-bound mono- and di-haem cytochromes c, respectively. The FixQ subunit protects the core complex in the presence of oxygen from proteolytic degradation . This entry represents the mono-haem FixO subunit.; GO: 0004129 cytochrome-c oxidase activity, 0006118 electron transport. Probab=29.07 E-value=19 Score=16.20 Aligned_cols=43 Identities=28% Similarity=0.394 Sum_probs=23.5 Q ss_pred HHHHHHHHC--CCCC---CHHHHHHHHHCCHHHHHHHHHCCCCC-CCCC Q ss_conf 888886313--7887---87888875212123444654234475-3650 Q gi|254780527|r 177 RHRKEMFSE--DVPL---NPVDFLLEKQLDTASLGEEICNLKGF-SVPS 219 (397) Q Consensus 177 rhrkemfse--dvpl---npvdfllekqldtaslgeeicnlkgf-svps 219 (397) -|++.|+.- -||- -.-.||.++.+|......++-..+-. .||- T Consensus 117 WH~~Hl~~P~~VVPeS~MPay~~L~~~~~d~~~~~~~~~~~~~VfGvPY 165 (234) T TIGR00781 117 WHVKHLLDPRSVVPESIMPAYKYLAEKKVDVDTVVAELKAQKKVFGVPY 165 (234) T ss_pred HHHHHCCCCCCCCCCCCCHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCC T ss_conf 8998504753348888750058760588859999998864003125678 No 9 >KOG2107 consensus Probab=27.20 E-value=18 Score=16.42 Aligned_cols=64 Identities=38% Similarity=0.622 Sum_probs=36.5 Q ss_pred CCHHHHHHHHHHHHCCCCCCCCCCCCC-CCCCCCCCEEEEEEEHHCCCCCCCCCCHHH-HHHCCCCEEECCCCCCCCCCC Q ss_conf 660123467776301443110468983-322334420144320217775322230455-320021100022453154634 Q gi|254780527|r 22 HNDDRLNSIQEEKRNNSTKTINKNPNK-QDFDVQNKSLFEIFLTTIPDADNHIHEDEE-RLIDNDLIYNSYKGAMYPSAS 99 (397) Q Consensus 22 hnddrlnsiqeekrnnstktinknpnk-qdfdvqnkslfeiflttipdadnhihedee-rlidndliynsykgamypsas 99 (397) +++.+|..|.+++.-+-..-..--|.+ ..||..-|++|| .|+|+||| |+| -..+ T Consensus 48 e~~~~L~~lr~e~~~~~~d~~~~~~e~~~nfdeKvk~FfE----------Ehlh~deeiR~i--------------l~Gt 103 (179) T KOG2107 48 ELDEELDRLREERGYSYMDICTVCPETLPNFDEKVKSFFE----------EHLHEDEEIRYI--------------LEGT 103 (179) T ss_pred HHHHHHHHHHHHCCCCEEEEEEECHHHCCCHHHHHHHHHH----------HHCCCHHHEEEE--------------EECC T ss_conf 7788999999972985046788763232368899999988----------855850314778--------------6063 Q ss_pred CEEEEEECCC Q ss_conf 3478750454 Q gi|254780527|r 100 QYFTVRDLSA 109 (397) Q Consensus 100 qyftvrdlsa 109 (397) .||-|||... T Consensus 104 gYfDVrd~dd 113 (179) T KOG2107 104 GYFDVRDKDD 113 (179) T ss_pred EEEEECCCCC T ss_conf 5776114788 No 10 >KOG3870 consensus Probab=26.92 E-value=13 Score=17.31 Aligned_cols=74 Identities=26% Similarity=0.308 Sum_probs=36.7 Q ss_pred CCCCCCHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCC Q ss_conf 36701045544220876146762466553114410188899753124450678988888863137887878888752121 Q gi|254780527|r 122 SKSPQDIEDISGFLSPDFSILLKNKTDRDYSDGLCTITDVFSQLKSDSGSFSNYLRHRKEMFSEDVPLNPVDFLLEKQLD 201 (397) Q Consensus 122 skspqdiedisgflspdfsillknktdrdysdglctitdvfsqlksdsgsfsnylrhrkemfsedvplnpvdfllekqld 201 (397) .+.|+-+.++--.|+-+-+-++ +|-...+-..+-.|...+++||.|- -.|.|+-| +.+++.| T Consensus 31 ~R~P~Ilt~vid~L~~~~s~l~-~k~g~~~~~d~k~ii~~l~~Lr~ei--------------~~drpl~p---~~~~~~D 92 (434) T KOG3870 31 DRWPIILTNVIDTLSRDKSDLI-EKYGSEVVEDGKQIIKLLSKLRYEI--------------QTDRPLQP---LTGEEPD 92 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHCCHHHHHHHHHHHHHHHHHHHHH--------------HCCCCCCC---CCCCCCC T ss_conf 2307999999999986279999-8605799987999999999999998--------------51897555---5568877 Q ss_pred HHHHHHHHHCCC Q ss_conf 234446542344 Q gi|254780527|r 202 TASLGEEICNLK 213 (397) Q Consensus 202 taslgeeicnlk 213 (397) +.+-.+.+-++. T Consensus 93 ~d~wN~~L~~l~ 104 (434) T KOG3870 93 IDSWNEFLKKLP 104 (434) T ss_pred HHHHHHHHHHCC T ss_conf 799999998587 No 11 >pfam11080 DUF2622 Protein of unknown function (DUF2622). This family is conserved in the Enterobacteriaceae family. Several members are named as YdiZ, a putative cytoplasmic protein. The function is not known. Probab=24.36 E-value=24 Score=15.47 Aligned_cols=15 Identities=40% Similarity=0.592 Sum_probs=12.7 Q ss_pred EEEHHCCCCCCCCCC Q ss_conf 320217775322230 Q gi|254780527|r 61 IFLTTIPDADNHIHE 75 (397) Q Consensus 61 iflttipdadnhihe 75 (397) -|++|..|++.++|| T Consensus 34 GF~~TltD~eG~pHe 48 (96) T pfam11080 34 GFLLTLTDEEGNPHE 48 (96) T ss_pred CEEEEEECCCCCCCC T ss_conf 516786358998201 No 12 >COG2993 CcoO Cbb3-type cytochrome oxidase, cytochrome c subunit [Energy production and conversion] Probab=24.07 E-value=24 Score=15.55 Aligned_cols=43 Identities=28% Similarity=0.573 Sum_probs=31.9 Q ss_pred HHHHHHHHC--CCC---CCHHHHHHHHHCCHHHHHHHHHCCCCCCCCC Q ss_conf 888886313--788---7878888752121234446542344753650 Q gi|254780527|r 177 RHRKEMFSE--DVP---LNPVDFLLEKQLDTASLGEEICNLKGFSVPS 219 (397) Q Consensus 177 rhrkemfse--dvp---lnpvdfllekqldtaslgeeicnlkgfsvps 219 (397) -||..|+.- -|| .-.-.||.+..||....+.+.-.++-|.||- T Consensus 119 WH~~Hl~~PR~vVPeSiMPsY~~L~~~~ld~~d~~~~~~~~r~~GVPY 166 (227) T COG2993 119 WHRAHLLDPRSVVPESIMPSYPWLFDNKLDVDDIGAELKALRKVGVPY 166 (227) T ss_pred HHHHHCCCCHHCCCCCCCCCCHHHHCCCCCHHHHHHHHHHHCCCCCCC T ss_conf 899872694200862237645777418774677988887520579999 No 13 >KOG3879 consensus Probab=23.56 E-value=44 Score=13.72 Aligned_cols=46 Identities=26% Similarity=0.243 Sum_probs=20.0 Q ss_pred HHHCCCCCC---EEEEEEHHHHHHHHHHHHHHHHHHHHHH---CCCCCCCCHHHH Q ss_conf 753043760---1100000110000078888767777752---176321000111 Q gi|254780527|r 309 LDALNFSGY---TMDRFDVEFVRKDSLDLLKDSMNYCAQQ---QGFTHSNEFEKK 357 (397) Q Consensus 309 ldalnfsgy---tmdrfdvefvrkdsldllkdsmnycaqq---qgfthsnefekk 357 (397) +.++-.+|+ .|=+| ++|||--..|-.-|-+-..- .--||++.-|.. T Consensus 173 ~t~i~~s~~kggn~wwF---GiRk~fcqflle~~pflreygnisy~th~~d~e~~ 224 (267) T KOG3879 173 LTAIATSGSKGGNHWWF---GIRKDFCQFLLEIFPFLREYGNISYDTHHEDNEPE 224 (267) T ss_pred HHHHHHHCCCCCCEEEE---EECCHHHHHHHHHCHHHHHHCCCCEECCCCCCCCC T ss_conf 99999954678854789---84142789999978788984486202343447887 No 14 >pfam11629 Mst1_SARAH C terminal SARAH domain of Mst1. This family of proteins represents the C terminal SARAH domain of Mst1. SARAH controls apoptosis and cell cycle arrest via the Ras, RASSF, MST pathway. The Mst1 SARAH domain interacts with Rassf1 and Rassf5 by forming a heterodimer which mediates the apoptosis process. Probab=23.36 E-value=43 Score=13.80 Aligned_cols=22 Identities=41% Similarity=0.520 Sum_probs=16.5 Q ss_pred ECHHHHHHHHHHHHHHHHHHCC Q ss_conf 5115789887633676775304 Q gi|254780527|r 292 VESDFLYKKIQSGRQNILDALN 313 (397) Q Consensus 292 vesdflykkiqsgrqnildaln 313 (397) .|-+-|-++-|.-||-||||.. T Consensus 26 ~Eieelr~RYq~KRqPILdAie 47 (49) T pfam11629 26 TEIEELRQRYQAKRQPILDAIE 47 (49) T ss_pred HHHHHHHHHHHHHHCCHHHHHH T ss_conf 9999999998874243999975 No 15 >COG1029 FwdB Formylmethanofuran dehydrogenase subunit B [Energy production and conversion] Probab=23.30 E-value=35 Score=14.35 Aligned_cols=137 Identities=24% Similarity=0.292 Sum_probs=64.2 Q ss_pred HHHHHHHHHCCCC--CHHHHH--HHHHHHH-HCCCCCCHHHHHHHH--HCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHH Q ss_conf 8889975312445--067898--8888863-137887878888752--12123444654234475365000341026777 Q gi|254780527|r 158 ITDVFSQLKSDSG--SFSNYL--RHRKEMF-SEDVPLNPVDFLLEK--QLDTASLGEEICNLKGFSVPSRYQSSDSLDAL 230 (397) Q Consensus 158 itdvfsqlksdsg--sfsnyl--rhrkemf-sedvplnpvdfllek--qldtaslgeeicnlkgfsvpsryqssdsldal 230 (397) +.|.+-|+|..|. -+|..+ -|-|+-- ||.|.--|+.-++|- -+..|..| |-.-|..+.+--..-..++.. T Consensus 193 lad~~~qi~p~sDyelisAl~~~l~G~~~~~~eev~gvp~~~i~e~a~~mKna~Fg---~if~GlGlt~S~gk~rN~e~a 269 (429) T COG1029 193 LADNHVQIKPNSDYELISALRAALHGKEPHRSEEVAGVPIEEIEELADMMKNAKFG---AIFVGLGLTSSRGKHRNVENA 269 (429) T ss_pred HHHHEEECCCCCCHHHHHHHHHHHCCCCCCCCHHHCCCCHHHHHHHHHHHHCCCCC---EEEEECCEEECCCCCCCHHHH T ss_conf 54321756899867999999998558999872421487989999999998639812---499830200034422357999 Q ss_pred HHHHHHHCCCCEEEEEECCCCEEECCCCCEEEEEEEE--------C----CCCCCCEEEEEEECCCEEEEEEEECHHHHH Q ss_conf 5357653502125788523422320465247999994--------5----898200455455348445899995115789 Q gi|254780527|r 231 MSTESVLNQKSFMRVTMIPESVHYTDTGSIKALTIQM--------K----PNSPDNIVATLCLSGEKLSVKLRVESDFLY 298 (397) Q Consensus 231 mstesvlnqksfmrvtmipesvhytdtgsikaltiqm--------k----pnspdnivatlclsgeklsvklrvesdfly 298 (397) |.--.-||+. -+.|+||-.-||.-+|--..++-.. . --.|...-+.--|-.+....-|-+-+|-.+ T Consensus 270 ~~Lv~~LNe~--ak~tli~mrgH~Nv~GFnqv~~~e~GYpf~vdF~rG~prynPgE~s~vdlL~~k~vDAalvi~sDp~a 347 (429) T COG1029 270 INLVKDLNEY--AKFTLIPMRGHYNVTGFNEVLSWETGYPFAVDFSRGYPRYNPGEFSAVDLLKRKEVDAALVIASDPGA 347 (429) T ss_pred HHHHHHHHHH--CEEEEEEECCEECCCCCCCCHHHHHCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCEEEEEECCCCC T ss_conf 9999998653--22999973040023452410143407750564256876779651008898750477737999538411 Q ss_pred H Q ss_conf 8 Q gi|254780527|r 299 K 299 (397) Q Consensus 299 k 299 (397) . T Consensus 348 h 348 (429) T COG1029 348 H 348 (429) T ss_pred C T ss_conf 2 No 16 >COG4085 Predicted RNA-binding protein, contains TRAM domain [General function prediction only] Probab=23.14 E-value=35 Score=14.36 Aligned_cols=49 Identities=27% Similarity=0.137 Sum_probs=28.3 Q ss_pred HHHHHHHCCCCEEEEEECCCCEEECCCCCEEEEEEEECCCCCCCEEEEE Q ss_conf 5357653502125788523422320465247999994589820045545 Q gi|254780527|r 231 MSTESVLNQKSFMRVTMIPESVHYTDTGSIKALTIQMKPNSPDNIVATL 279 (397) Q Consensus 231 mstesvlnqksfmrvtmipesvhytdtgsikaltiqmkpnspdnivatl 279 (397) -+||+++--++.|-++.--+.|.++..-++-.=+-+.|+|+|+.+++-- T Consensus 89 ~~tee~l~~n~~~p~~~eGe~veVtGrv~~yrG~~eVkvnq~~d~~~l~ 137 (204) T COG4085 89 RSTEETLELNEGMPVTVEGEIVEVTGRVEEYRGSSEVKVNQPNDSRPLP 137 (204) T ss_pred CCHHHHHHHCCCCCCCCCCCEEEEEEEEEEECCCCEEECCCCCCCCCCC T ss_conf 6736768644788732267589998888771797303106844321366 No 17 >TIGR02583 DevR_archaea CRISPR-associated regulatory protein, Csa2 family; InterPro: IPR002764 CRISPR is a term for Clustered Regularly Interspaced Short Palindromic Repeats. A number of protein families are encoded only in association with these repeats and are designated Cas (CRISPR associated) proteins. This entry is typified by MJ0381 of Methanococcus jannaschii. This archaeal clade is a member of the DevR family, which includes the DevR protein of Myxococcus xanthus, a protein whose expression appears to regulated through a number of means, including both location and autorepression; DevR mutants are incapable of fruiting body development . This entry is found in a CRISPR/Cas locus, so it is designated Csa2, for CRISPR/Cas Subtype Protein 2.. Probab=20.91 E-value=31 Score=14.75 Aligned_cols=29 Identities=28% Similarity=0.332 Sum_probs=18.2 Q ss_pred CHHHHHHHHHHHH----HCCCCEEEEEECCCCE Q ss_conf 1026777535765----3502125788523422 Q gi|254780527|r 224 SDSLDALMSTESV----LNQKSFMRVTMIPESV 252 (397) Q Consensus 224 sdsldalmstesv----lnqksfmrvtmipesv 252 (397) .-||+|||.+-+- -+|--|.-|.+|-|-| T Consensus 248 ~AaL~AL~~~L~gf~fGaK~~RflP~~~~~eLV 280 (311) T TIGR02583 248 EAALKALLPLLSGFAFGAKRARFLPVVKVEELV 280 (311) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEE T ss_conf 999999999888887654443206412468899 Done!