Query gi|254780932|ref|YP_003065345.1| hypothetical protein CLIBASIA_04155 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 236 No_of_seqs 221 out of 390 Neff 9.6 Searched_HMMs 39220 Date Mon May 30 03:50:06 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780932.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG3827 Uncharacterized protei 100.0 1.8E-37 4.5E-42 285.0 15.1 77 159-235 155-231 (231) 2 pfam10691 DUF2497 Protein of u 99.9 6.9E-23 1.8E-27 182.3 7.6 69 164-232 2-72 (72) 3 COG3827 Uncharacterized protei 96.1 0.09 2.3E-06 33.6 9.4 49 12-65 1-49 (231) 4 pfam00157 Pou Pou domain - N-t 71.6 4.6 0.00012 21.6 3.2 44 169-212 24-73 (75) 5 KOG3802 consensus 63.6 3.7 9.5E-05 22.2 1.4 15 198-212 256-272 (398) 6 PRK07680 late competence prote 61.5 14 0.00036 18.1 4.2 42 178-229 219-260 (273) 7 COG3682 Predicted transcriptio 55.5 10 0.00026 19.1 2.5 49 185-235 14-62 (123) 8 pfam09012 FeoC FeoC like trans 55.1 13 0.00034 18.3 3.0 32 180-211 3-40 (68) 9 PRK03643 altronate oxidoreduct 53.9 6.6 0.00017 20.5 1.3 45 187-231 172-228 (482) 10 COG1291 MotA Flagellar motor c 49.6 22 0.00055 16.8 5.3 57 174-230 77-137 (266) 11 PRK09072 short chain dehydroge 49.0 17 0.00043 17.6 2.8 30 206-235 233-262 (262) 12 pfam01941 AdoMet_Synthase S-ad 47.6 16 0.0004 17.8 2.5 60 165-226 316-396 (396) 13 pfam10151 DUF2359 Uncharacteri 46.9 13 0.00033 18.4 1.9 22 199-221 361-382 (469) 14 cd00822 TopoII_Trans_DNA_gyras 45.8 25 0.00063 16.4 4.9 44 192-235 124-170 (172) 15 cd06154 YjgF_YER057c_UK114_lik 45.6 21 0.00053 17.0 2.8 50 166-216 37-97 (119) 16 KOG3239 consensus 45.4 12 0.00031 18.6 1.6 19 206-224 34-52 (193) 17 smart00352 POU Found in Pit-Oc 45.3 11 0.00029 18.8 1.4 43 170-212 25-73 (75) 18 pfam05716 AKAP_110 A-kinase an 44.5 18 0.00046 17.3 2.4 34 181-214 652-685 (685) 19 smart00807 AKAP_110 A-kinase a 42.5 21 0.00053 17.0 2.4 34 181-214 818-851 (851) 20 PRK04439 S-adenosylmethionine 42.4 23 0.00059 16.6 2.6 60 165-226 317-397 (400) 21 pfam04305 DUF455 Protein of un 42.1 17 0.00042 17.6 1.9 24 211-234 177-200 (250) 22 pfam03562 MltA MltA specific i 41.2 21 0.00053 16.9 2.3 21 202-223 84-104 (113) 23 KOG1043 consensus 40.6 19 0.00047 17.3 1.9 45 180-225 346-394 (499) 24 KOG1463 consensus 40.5 28 0.00071 16.0 2.8 62 170-231 286-353 (411) 25 pfam00204 DNA_gyraseB DNA gyra 39.6 30 0.00078 15.8 4.9 38 193-230 126-166 (173) 26 PHA02130 hypothetical protein 38.9 13 0.00033 18.4 0.9 10 205-214 18-27 (81) 27 KOG1168 consensus 38.7 16 0.00042 17.7 1.4 25 193-217 267-291 (385) 28 smart00467 GS GS motif. Aa app 38.5 13 0.00034 18.3 0.9 22 205-226 2-30 (30) 29 pfam08171 Mad3_BUB1_II Mad3/BU 37.8 20 0.00052 17.0 1.8 20 5-24 39-58 (67) 30 pfam08490 DUF1744 Domain of un 36.9 20 0.0005 17.1 1.6 16 198-213 263-278 (396) 31 PRK13946 shikimate kinase; Pro 36.8 32 0.0008 15.7 2.6 28 183-211 162-189 (195) 32 COG2080 CoxS Aerobic-type carb 36.1 9.6 0.00025 19.3 -0.1 36 179-214 60-97 (156) 33 pfam08726 efhand_Ca_insen Ca2+ 36.1 34 0.00087 15.4 3.9 45 170-218 3-47 (69) 34 COG1491 Predicted RNA-binding 36.0 27 0.00069 16.1 2.2 26 206-231 92-117 (202) 35 cd06152 YjgF_YER057c_UK114_lik 34.5 21 0.00053 17.0 1.4 51 166-216 27-91 (114) 36 PRK00871 glutathione-regulated 34.2 19 0.00049 17.2 1.2 10 204-213 165-174 (176) 37 COG5406 Nucleosome binding fac 34.1 16 0.00041 17.7 0.8 15 14-28 326-340 (1001) 38 PRK04930 glutathione-regulated 33.5 21 0.00052 17.0 1.2 11 204-214 78-88 (184) 39 COG0246 MtlD Mannitol-1-phosph 33.5 25 0.00064 16.4 1.7 44 187-230 175-228 (473) 40 pfam02186 TFIIE_beta TFIIE bet 33.2 29 0.00073 16.0 1.9 39 178-217 3-48 (65) 41 PRK00808 hypothetical protein; 32.5 32 0.0008 15.7 2.0 21 196-217 99-119 (150) 42 PRK10996 thioredoxin 2; Provis 32.5 7.7 0.0002 20.0 -1.1 42 173-214 96-139 (139) 43 pfam05186 Dpy-30 Dpy-30 motif. 32.3 35 0.0009 15.3 2.3 18 206-223 2-20 (42) 44 pfam11838 DUF3358 Domain of un 32.0 40 0.001 15.0 5.9 65 170-234 347-422 (428) 45 PRK01917 hemerythrin family pr 31.6 40 0.001 14.9 5.2 26 196-222 100-125 (142) 46 pfam08369 PCP_red Proto-chloro 31.1 41 0.001 14.9 3.0 25 208-233 7-31 (45) 47 PTZ00109 DNA gyrase subunit b; 30.6 42 0.0011 14.8 4.5 34 193-227 505-538 (941) 48 pfam09028 Mac-1 Mac 1. The bac 30.1 31 0.0008 15.7 1.7 19 203-230 102-120 (333) 49 pfam05700 BCAS2 Breast carcino 29.6 37 0.00093 15.2 2.0 14 222-235 155-168 (221) 50 pfam10350 DUF2428 Putative dea 29.6 43 0.0011 14.7 4.3 45 174-226 105-166 (241) 51 pfam10092 DUF2330 Uncharacteri 29.3 23 0.00059 16.6 0.9 21 206-226 138-161 (337) 52 TIGR03198 pucE xanthine dehydr 29.1 20 0.00052 17.0 0.6 28 186-213 67-94 (151) 53 PRK12383 putative mutase; Prov 28.8 30 0.00077 15.8 1.4 49 167-221 274-327 (406) 54 COG5095 TAF6 Transcription ini 28.7 45 0.0011 14.6 3.1 18 193-210 182-199 (450) 55 pfam08283 Gemini_AL1_M Geminiv 28.5 27 0.00069 16.1 1.2 16 195-214 64-79 (106) 56 PRK05057 aroK shikimate kinase 28.0 45 0.0012 14.5 2.2 13 209-221 102-114 (172) 57 COG4229 Predicted enolase-phos 27.8 46 0.0012 14.5 3.7 45 190-234 29-79 (229) 58 cd07977 TFIIE_beta_winged_heli 27.4 47 0.0012 14.4 3.6 25 190-214 24-52 (75) 59 cd03160 CD37_CD82_like_LEL Tet 27.1 28 0.00071 16.0 1.0 18 197-214 99-116 (117) 60 pfam07533 BRK BRK domain. The 26.6 28 0.00071 16.0 0.9 19 187-214 20-38 (45) 61 pfam04690 YABBY YABBY protein. 26.3 48 0.0012 14.4 2.1 19 212-230 123-141 (170) 62 PRK13948 shikimate kinase; Pro 26.3 49 0.0013 14.3 2.6 29 183-212 151-179 (182) 63 pfam03858 Crust_neuro_H Crusta 26.0 34 0.00087 15.4 1.3 16 11-26 8-23 (41) 64 TIGR03454 partition_RepB plasm 26.0 35 0.0009 15.3 1.4 18 204-221 306-323 (325) 65 PRK11623 pcnB poly(A) polymera 25.8 50 0.0013 14.2 4.4 57 176-232 251-316 (471) 66 pfam06895 DUF1267 Protein of u 25.4 43 0.0011 14.7 1.7 16 211-226 3-18 (74) 67 TIGR01377 soxA_mon sarcosine o 24.9 18 0.00046 17.4 -0.3 19 205-223 303-323 (401) 68 KOG3995 consensus 24.7 26 0.00065 16.3 0.4 22 194-215 172-194 (279) 69 pfam10723 RepB-RCR_reg Replica 24.3 53 0.0014 14.1 3.1 30 199-228 45-81 (85) 70 COG4691 StbC Plasmid stability 24.2 50 0.0013 14.3 1.8 30 189-221 23-52 (80) 71 PRK13456 DNA protection protei 24.0 54 0.0014 14.0 3.0 14 216-229 140-153 (184) 72 KOG1588 consensus 23.5 55 0.0014 13.9 3.3 23 213-235 46-70 (259) 73 TIGR01689 EcbF-BcbF capsule bi 23.5 35 0.00089 15.4 0.9 27 188-214 48-77 (126) 74 pfam09494 Slx4 Slx4 endonuclea 23.5 55 0.0014 13.9 2.4 31 176-210 563-598 (627) 75 pfam06931 Adeno_E4_ORF3 Mastad 23.4 56 0.0014 13.9 2.2 38 175-220 6-43 (113) 76 PRK10696 C32 tRNA thiolase; Pr 23.4 56 0.0014 13.9 3.3 31 193-224 215-245 (311) 77 PRK13700 conjugal transfer pro 23.3 56 0.0014 13.9 3.7 19 207-225 694-712 (732) 78 pfam04564 U-box U-box domain. 22.6 47 0.0012 14.5 1.5 28 186-213 43-71 (74) 79 PRK13702 replication protein; 22.5 58 0.0015 13.8 2.9 31 198-228 44-81 (82) 80 PRK09381 trxA thioredoxin; Pro 22.4 15 0.00037 18.0 -1.2 32 183-214 77-108 (109) 81 KOG0100 consensus 22.0 44 0.0011 14.6 1.2 39 187-230 594-634 (663) 82 COG2249 MdaB Putative NADPH-qu 21.6 46 0.0012 14.5 1.2 11 204-214 84-94 (189) 83 pfam08482 HrpB_C ATP-dependent 21.5 58 0.0015 13.8 1.7 35 190-224 8-42 (133) 84 KOG4761 consensus 21.3 34 0.00087 15.4 0.5 24 212-235 132-156 (266) 85 COG0816 Predicted endonuclease 20.6 46 0.0012 14.5 1.1 21 192-215 119-139 (141) 86 TIGR02499 HrpE_YscL_not type I 20.4 64 0.0016 13.5 4.4 39 193-231 83-122 (183) 87 pfam10874 DUF2746 Protein of u 20.2 64 0.0016 13.5 4.8 41 193-235 16-56 (57) 88 TIGR01028 S7_S5_E_A ribosomal 20.2 26 0.00066 16.3 -0.3 12 223-234 195-206 (206) 89 pfam10065 DUF2303 Uncharacteri 20.2 47 0.0012 14.4 1.1 16 204-219 134-149 (272) 90 PRK13866 plasmid partitioning 20.1 64 0.0016 13.5 2.0 21 204-224 308-328 (336) 91 COG3437 Response regulator con 20.0 65 0.0016 13.5 4.5 58 170-231 297-356 (360) 92 PRK02318 mannitol-1-phosphate 20.0 52 0.0013 14.2 1.2 70 166-235 212-298 (381) 93 PRK01450 rpmC 50S ribosomal pr 20.0 63 0.0016 13.6 1.7 12 219-230 48-59 (71) No 1 >COG3827 Uncharacterized protein conserved in bacteria [Function unknown] Probab=100.00 E-value=1.8e-37 Score=285.05 Aligned_cols=77 Identities=51% Similarity=0.848 Sum_probs=73.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 22322112235567899987655556532656530789999973799999766103578999999999999603246 Q gi|254780932|r 159 CKGEEEALVSSDVGDQVASSFDQLVKALRESESRSLDQLSLDVLRPMLREWLDDNLPGIVERLVREEIERIARGPIR 235 (236) Q Consensus 159 ~~~~~~~l~S~~t~~~v~~~~~~l~~~~~~~~~~tle~lv~emLrPmLk~WLD~nLP~lVErlVr~EIerv~r~~~r 235 (236) ....+++++|.+++++++++|++|++.+.+..+||||+||++|||||||+|||+|||+|||||||+|||||+||+|| T Consensus 155 ~~s~paails~a~~rqvA~aF~dLs~a~~~~~rrsleE~a~eMLRPmLqdWLDkNLPtLVErLVrEEIeRv~RGskr 231 (231) T COG3827 155 IKSAPAAILSEAAGRQVADAFGDLSLAFNSSSRRSLEEMAAEMLRPMLQDWLDKNLPTLVERLVREEIERVVRGSKR 231 (231) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCC T ss_conf 01471367659999999999988898750465012999999998899999998133599999999999999702589 No 2 >pfam10691 DUF2497 Protein of unknown function (DUF2497). Members of this family belong to the Alphaproteobacteria. The function of the family is not known. Probab=99.88 E-value=6.9e-23 Score=182.25 Aligned_cols=69 Identities=57% Similarity=0.964 Sum_probs=64.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCC--CCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCC Q ss_conf 11223556789998765555653265--6530789999973799999766103578999999999999603 Q gi|254780932|r 164 EALVSSDVGDQVASSFDQLVKALRES--ESRSLDQLSLDVLRPMLREWLDDNLPGIVERLVREEIERIARG 232 (236) Q Consensus 164 ~~l~S~~t~~~v~~~~~~l~~~~~~~--~~~tle~lv~emLrPmLk~WLD~nLP~lVErlVr~EIerv~r~ 232 (236) ..|+|..++.+++++|+.|++.+.++ +++||||||++|||||||+|||+|||+|||||||+||+||+|+ T Consensus 2 e~LiS~~t~~av~~~f~~La~~i~~~~~~~~TlE~lv~emLrPmL~~WLd~nLp~lve~~V~~EI~ri~~~ 72 (72) T pfam10691 2 DALISEETAEAVAAAFGKLARAIRISRSGGRTLEDLVREMLRPMLKEWLDNNLPALVERLVREEIERIARK 72 (72) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 41005477999999999999998723335674999999998999999998613799999999999987539 No 3 >COG3827 Uncharacterized protein conserved in bacteria [Function unknown] Probab=96.08 E-value=0.09 Score=33.59 Aligned_cols=49 Identities=35% Similarity=0.463 Sum_probs=30.6 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 999998676762046778666666677777667665676666677667777788 Q gi|254780932|r 12 MEEIVNSIRRILENNDQEFSTSNNVQTQVPARDEGVIEKEDFVQEDKMSNHLFA 65 (236) Q Consensus 12 MeeILASIRRIIsdde~~~~~~~~~~~~~p~~~e~~~~~~~~~~~~~~~~~~~~ 65 (236) ||||||||||||+|+|-... ..+.++.++.....++...++..+..... T Consensus 1 meeilasirriie~~d~~~~-----~~e~~a~~~~a~~~~~~~~~~~~~~~~~~ 49 (231) T COG3827 1 MEEILASIRRIIEDSDFSRQ-----LDEDPASGEAAFAAEPVAPPDRKPQAVAA 49 (231) T ss_pred CHHHHHHHHHHHHCCHHHCC-----CCCCCCCCHHHHHCCCCCCHHHCCCCCCC T ss_conf 93789999999852011038-----65465654266650656771104421147 No 4 >pfam00157 Pou Pou domain - N-terminal to homeobox domain. Probab=71.65 E-value=4.6 Score=21.56 Aligned_cols=44 Identities=18% Similarity=0.422 Sum_probs=24.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCC-CHHHH---HHHH--HHHHHHHHHHH Q ss_conf 556789998765555653265653-07899---9997--37999997661 Q gi|254780932|r 169 SDVGDQVASSFDQLVKALRESESR-SLDQL---SLDV--LRPMLREWLDD 212 (236) Q Consensus 169 ~~t~~~v~~~~~~l~~~~~~~~~~-tle~l---v~em--LrPmLk~WLD~ 212 (236) .-|...|..+|+.|-....++... -+|.| .+.| |+|.|+.||.+ T Consensus 24 G~TQ~dVg~al~~l~G~~~SQttIcRFE~l~Ls~kn~~kLkP~L~~WL~e 73 (75) T pfam00157 24 GYTQADVGLALGALYGPVFSQTTICRFEALQLSFKNMCKLKPLLEKWLEE 73 (75) T ss_pred CCCHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 77599999999987086542358888885044799899889999999874 No 5 >KOG3802 consensus Probab=63.56 E-value=3.7 Score=22.20 Aligned_cols=15 Identities=33% Similarity=1.004 Sum_probs=12.4 Q ss_pred HHHH--HHHHHHHHHHH Q ss_conf 9997--37999997661 Q gi|254780932|r 198 SLDV--LRPMLREWLDD 212 (236) Q Consensus 198 v~em--LrPmLk~WLD~ 212 (236) .+.| |||.|+.||++ T Consensus 256 FKNMCKLKPLL~KWLeE 272 (398) T KOG3802 256 FKNMCKLKPLLEKWLEE 272 (398) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 78776637999999999 No 6 >PRK07680 late competence protein ComER; Validated Probab=61.45 E-value=14 Score=18.12 Aligned_cols=42 Identities=10% Similarity=0.249 Sum_probs=32.8 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHH Q ss_conf 7655556532656530789999973799999766103578999999999999 Q gi|254780932|r 178 SFDQLVKALRESESRSLDQLSLDVLRPMLREWLDDNLPGIVERLVREEIERI 229 (236) Q Consensus 178 ~~~~l~~~~~~~~~~tle~lv~emLrPmLk~WLD~nLP~lVErlVr~EIerv 229 (236) .|.+|...|.+-++.|.++|-. |++||+.++.+++++=++|- T Consensus 219 ~~~~L~~~V~SPgGtT~~gl~~----------Le~~~~~~f~~a~~~a~~r~ 260 (273) T PRK07680 219 TLPTLQEKVCVKGGVTGEGIRV----------LEEHVGDMFHKLFERTHEKF 260 (273) T ss_pred CHHHHHHCCCCCCHHHHHHHHH----------HHHHHHHHHHHHHHHHHHHH T ss_conf 9899998078997259999999----------99725999999999999999 No 7 >COG3682 Predicted transcriptional regulator [Transcription] Probab=55.46 E-value=10 Score=19.10 Aligned_cols=49 Identities=14% Similarity=0.267 Sum_probs=32.0 Q ss_pred HHCCCCCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 532656530789999973799999766103578999999999999603246 Q gi|254780932|r 185 ALRESESRSLDQLSLDVLRPMLREWLDDNLPGIVERLVREEIERIARGPIR 235 (236) Q Consensus 185 ~~~~~~~~tle~lv~emLrPmLk~WLD~nLP~lVErlVr~EIerv~r~~~r 235 (236) .+=..+..|..+|+..+... ++|=+----+|.-|||+|+.=-=-|..|+ T Consensus 14 ilW~~~~~t~~eI~~~l~~~--~ews~sTV~TLl~RL~KKg~l~~~kdgr~ 62 (123) T COG3682 14 ILWSRGPATVREIIEELPAD--REWSYSTVKTLLNRLVKKGLLTRKKDGRA 62 (123) T ss_pred HHHHCCCCCHHHHHHHHHHC--CCCCHHHHHHHHHHHHHCCCHHHHHCCCE T ss_conf 99972886699999998632--36008789999999986430431103872 No 8 >pfam09012 FeoC FeoC like transcriptional regulator. This family contains several transcriptional regulators, including FeoC, which contain a HTH motif. FeoC acts as a [Fe-S] dependant transcriptional repressor. Probab=55.09 E-value=13 Score=18.32 Aligned_cols=32 Identities=31% Similarity=0.541 Sum_probs=22.8 Q ss_pred HHHHHHHCCCCCCCHHHHHH------HHHHHHHHHHHH Q ss_conf 55556532656530789999------973799999766 Q gi|254780932|r 180 DQLVKALRESESRSLDQLSL------DVLRPMLREWLD 211 (236) Q Consensus 180 ~~l~~~~~~~~~~tle~lv~------emLrPmLk~WLD 211 (236) .+|..++.-++..++-+|.+ ++|+.||..|.- T Consensus 3 ~~lr~~l~~~g~~s~~~Ls~~f~~~~~~v~~ML~~wi~ 40 (68) T pfam09012 3 TELKQYLQERGRASLAELARHFKMSPDAVEAMLEVWIR 40 (68) T ss_pred HHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 89999999919847999998979399999999999998 No 9 >PRK03643 altronate oxidoreductase; Provisional Probab=53.89 E-value=6.6 Score=20.47 Aligned_cols=45 Identities=20% Similarity=0.487 Sum_probs=35.0 Q ss_pred CCCCCCCHHHHHHHHHH-----HHHHHHHHHHC---HHHHHHHHH----HHHHHHHC Q ss_conf 26565307899999737-----99999766103---578999999----99999960 Q gi|254780932|r 187 RESESRSLDQLSLDVLR-----PMLREWLDDNL---PGIVERLVR----EEIERIAR 231 (236) Q Consensus 187 ~~~~~~tle~lv~emLr-----PmLk~WLD~nL---P~lVErlVr----~EIerv~r 231 (236) +.+|+..|-.+|.+..+ +-|.+|+++|- =+||-|||= .+++.+.. T Consensus 172 l~~NG~~Lr~~Vl~~A~~~~~d~~~~~Wi~~~v~Fp~sMVDRIvP~~~~~~~~~~~~ 228 (482) T PRK03643 172 IDYNGEALKELVLRYAQEWNLPEAFIQWLEEANTFCSTLVDRIVTGYPRDEVAELEE 228 (482) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHHHCCCCCCCCHHHHHHHHH T ss_conf 577899999999999998478978999999727675333232558999899999999 No 10 >COG1291 MotA Flagellar motor component [Cell motility and secretion] Probab=49.57 E-value=22 Score=16.81 Aligned_cols=57 Identities=26% Similarity=0.429 Sum_probs=46.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHH----HHHCHHHHHHHHHHHHHHHH Q ss_conf 9998765555653265653078999997379999976----61035789999999999996 Q gi|254780932|r 174 QVASSFDQLVKALRESESRSLDQLSLDVLRPMLREWL----DDNLPGIVERLVREEIERIA 230 (236) Q Consensus 174 ~v~~~~~~l~~~~~~~~~~tle~lv~emLrPmLk~WL----D~nLP~lVErlVr~EIerv~ 230 (236) .+-+.|-+|+...+-.+-..||++..+.+-|.+|+.| |.|-|.-+|.+..+||+-+- T Consensus 77 ~li~~l~~la~~~Rk~GllaLE~~~~~~~d~Fi~~glrliVdG~~~~~I~~~me~Ei~~~e 137 (266) T COG1291 77 DLIALLYELAEKARKEGLLALEALADEIEDPFIKDGLRLIVDGNDPEEIEALMEEEIETME 137 (266) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 9999999999999870187799888624145899667987158988999999999999999 No 11 >PRK09072 short chain dehydrogenase; Provisional Probab=49.00 E-value=17 Score=17.56 Aligned_cols=30 Identities=37% Similarity=0.514 Sum_probs=26.7 Q ss_pred HHHHHHHHCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 999766103578999999999999603246 Q gi|254780932|r 206 LREWLDDNLPGIVERLVREEIERIARGPIR 235 (236) Q Consensus 206 Lk~WLD~nLP~lVErlVr~EIerv~r~~~r 235 (236) |--||-.=||.||.|++++-...|.|=.|| T Consensus 233 ~~~~l~~l~P~l~~r~l~k~l~~~~r~~~~ 262 (262) T PRK09072 233 LFVRLNGLLPGLVDRALRKQLPVIHRFARR 262 (262) T ss_pred HHHHHHHHCCHHHHHHHHHHCHHHHHHHCC T ss_conf 999999869299999999747788875139 No 12 >pfam01941 AdoMet_Synthase S-adenosylmethionine synthetase (AdoMet synthetase). This family consists of several archaebacterial S-adenosylmethionine synthetase C(AdoMet synthetase or MAT) (EC 2.5.1.6). S-Adenosylmethionine (AdoMet) occupies a central role in the metabolism of all cells. The biological roles of AdoMet include acting as the primary methyl group donor, as a precursor to the polyamines, and as a progenitor of a 5'-deoxyadenosyl radical. S-Adenosylmethionine synthetase catalyses the only known route of AdoMet biosynthesis. The synthetic process occurs in a unique reaction in which the complete triphosphate chain is displaced from ATP and a sulfonium ion formed. MATs from various organisms contain ~400-amino acid polypeptide chains. Probab=47.63 E-value=16 Score=17.77 Aligned_cols=60 Identities=13% Similarity=0.346 Sum_probs=39.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHH---------------------HCCCCCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHH Q ss_conf 122355678999876555565---------------------32656530789999973799999766103578999999 Q gi|254780932|r 165 ALVSSDVGDQVASSFDQLVKA---------------------LRESESRSLDQLSLDVLRPMLREWLDDNLPGIVERLVR 223 (236) Q Consensus 165 ~l~S~~t~~~v~~~~~~l~~~---------------------~~~~~~~tle~lv~emLrPmLk~WLD~nLP~lVErlVr 223 (236) -+++...++.+...++..... +....+..++++-++ .+=.+.+|| .|++.+.++||+ T Consensus 316 Nvla~~iA~~i~~~v~gv~ev~v~llSqIG~PId~P~~a~~~i~~~~g~~~~~~~~~-v~~I~~~~L-~~i~~~~~~li~ 393 (396) T pfam01941 316 NVLANEIARRIVEEVDGVKEVYVRLLSQIGKPIDQPLVASAQVITEDGVLLSDVERE-IEEIIDEEL-ANITKLTEELVE 393 (396) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEEEHHCCCCCCCCCCEEEEEEECCCCCCHHHHHHH-HHHHHHHHH-HCHHHHHHHHHC T ss_conf 999999999999854882499999842049988886078899982699883889999-999999998-453899999965 Q ss_pred HHH Q ss_conf 999 Q gi|254780932|r 224 EEI 226 (236) Q Consensus 224 ~EI 226 (236) -+| T Consensus 394 G~~ 396 (396) T pfam01941 394 GKL 396 (396) T ss_pred CCC T ss_conf 879 No 13 >pfam10151 DUF2359 Uncharacterized conserved protein (DUF2359). This is a 450 amino acid region of a family of proteins conserved from insects to humans. The mouse protein, Q8BM55, is annotated as being a putative Vitamin K-dependent carboxylation gamma-carboxyglutamic (GLA) domain containing protein, but this could not be confirmed. The function is not known. Probab=46.91 E-value=13 Score=18.36 Aligned_cols=22 Identities=45% Similarity=0.898 Sum_probs=15.9 Q ss_pred HHHHHHHHHHHHHHHCHHHHHHH Q ss_conf 99737999997661035789999 Q gi|254780932|r 199 LDVLRPMLREWLDDNLPGIVERL 221 (236) Q Consensus 199 ~emLrPmLk~WLD~nLP~lVErl 221 (236) .++-++ ++.|+.+|||.|+|++ T Consensus 361 ~~~~~~-~~~w~~~klP~lle~v 382 (469) T pfam10151 361 SEQSRP-LRAYLNDKLPELLERV 382 (469) T ss_pred HHHHHH-HHHHHHHHHHHHHHHH T ss_conf 997268-9999980226999999 No 14 >cd00822 TopoII_Trans_DNA_gyrase TopoIIA_Trans_DNA_gyrase: Transducer domain, having a ribosomal S5 domain 2-like fold, of the type found in proteins of the type IIA family of DNA topoisomerases similar to the B subunits of E. coli DNA gyrase and E. coli Topoisomerase IV which are heterodimers composed of two subunits. The type IIA enzymes are the predominant form of topoisomerase and are found in some bacteriophages, viruses and archaea, and in all bacteria and eukaryotes. All type IIA topoisomerases are related to each other at amino acid sequence level, though their oligomeric organization sometimes differs. TopoIIA enzymes cut both strands of the duplex DNA to remove (relax) both positive and negative supercoils in DNA. These enzymes covalently attach to the 5' ends of the cut DNA, separate the free ends of the cleaved strands, pass another region of the duplex through this gap, then rejoin the ends. TopoIIA enzymes also catenate/ decatenate duplex rings. E.coli DNA gyrase is a Probab=45.82 E-value=25 Score=16.42 Aligned_cols=44 Identities=25% Similarity=0.409 Sum_probs=32.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHC---HHHHHHHHHHHHHHHHCCCCC Q ss_conf 30789999973799999766103---578999999999999603246 Q gi|254780932|r 192 RSLDQLSLDVLRPMLREWLDDNL---PGIVERLVREEIERIARGPIR 235 (236) Q Consensus 192 ~tle~lv~emLrPmLk~WLD~nL---P~lVErlVr~EIerv~r~~~r 235 (236) ..+-.+|...++--|..||++|- =.|+++++...=.|.++.-.| T Consensus 124 ~ev~~~v~~~v~~~l~~~l~~np~~a~~Ii~k~i~~a~aR~~akkar 170 (172) T cd00822 124 SEVRSIVESAVREALEEWLEENPEEAKKILEKAILAAKAREAARKAR 170 (172) T ss_pred HHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 88999999999999999999899999999999999999999999734 No 15 >cd06154 YjgF_YER057c_UK114_like_6 This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site. Probab=45.63 E-value=21 Score=16.97 Aligned_cols=50 Identities=20% Similarity=0.274 Sum_probs=33.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH-----------HHHHHHHHHHHHHHCHH Q ss_conf 2235567899987655556532656530789999-----------97379999976610357 Q gi|254780932|r 166 LVSSDVGDQVASSFDQLVKALRESESRSLDQLSL-----------DVLRPMLREWLDDNLPG 216 (236) Q Consensus 166 l~S~~t~~~v~~~~~~l~~~~~~~~~~tle~lv~-----------emLrPmLk~WLD~nLP~ 216 (236) .++.+...|+..+|.++...+...+ -+++|||+ ..+.--.++|+.+|.|. T Consensus 37 ~~~~d~~~Q~~~~l~ni~~iL~~aG-~~~~dvvk~tvyltd~~~~~~~~~v~~~~f~~~~Pa 97 (119) T cd06154 37 VMPGDAYEQTRQCLEIIEAALAEAG-ASLEDVVRTRMYVTDIADFEAVGRAHGEVFGDIRPA 97 (119) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHCC-CCHHHHEEEEEEECCHHHHHHHHHHHHHHCCCCCCC T ss_conf 6489999999999999999999869-987781566678568889999999999876999993 No 16 >KOG3239 consensus Probab=45.44 E-value=12 Score=18.60 Aligned_cols=19 Identities=37% Similarity=0.840 Sum_probs=12.7 Q ss_pred HHHHHHHHCHHHHHHHHHH Q ss_conf 9997661035789999999 Q gi|254780932|r 206 LREWLDDNLPGIVERLVRE 224 (236) Q Consensus 206 Lk~WLD~nLP~lVErlVr~ 224 (236) -|+||-.|.|.|.++|--+ T Consensus 34 Ck~WL~~n~pdly~kL~~e 52 (193) T KOG3239 34 CKEWLEENHPDLYAKLYGE 52 (193) T ss_pred HHHHHHHCCHHHHHHHHCC T ss_conf 9999985083677765156 No 17 >smart00352 POU Found in Pit-Oct-Unc transcription factors. Probab=45.29 E-value=11 Score=18.80 Aligned_cols=43 Identities=19% Similarity=0.371 Sum_probs=22.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCC-CHHHH---HHHH--HHHHHHHHHHH Q ss_conf 56789998765555653265653-07899---9997--37999997661 Q gi|254780932|r 170 DVGDQVASSFDQLVKALRESESR-SLDQL---SLDV--LRPMLREWLDD 212 (236) Q Consensus 170 ~t~~~v~~~~~~l~~~~~~~~~~-tle~l---v~em--LrPmLk~WLD~ 212 (236) =|...|..+++.|-....++... -+|.| .+.| |||.|+.||.+ T Consensus 25 ~TQ~dVG~aL~~l~g~~~SQttIcRFE~l~LS~kn~~kLkPiL~~WL~e 73 (75) T smart00352 25 FTQADVGLALGALYGPDFSQTTICRFEALQLSFKNMCKLKPLLEKWLEE 73 (75) T ss_pred CCHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 7599999999986186431468889886054798899889999999874 No 18 >pfam05716 AKAP_110 A-kinase anchor protein 110 kDa (AKAP 110). This family consists of several mammalian protein kinase A anchoring protein 3 (PRKA3) or A-kinase anchor protein 110 kDa (AKAP 110) sequences. Agents that increase intracellular cAMP are potent stimulators of sperm motility. Anchoring inhibitor peptides, designed to disrupt the interaction of the cAMP-dependent protein kinase A (PKA) with A kinase-anchoring proteins (AKAPs), are potent inhibitors of sperm motility. PKA anchoring is a key biochemical mechanism controlling motility. AKAP110 shares compartments with both RI and RII isoforms of PKA and may function as a regulator of both motility- and head-associated functions such as capacitation and the acrosome reaction. Probab=44.54 E-value=18 Score=17.35 Aligned_cols=34 Identities=38% Similarity=0.495 Sum_probs=26.4 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 5556532656530789999973799999766103 Q gi|254780932|r 181 QLVKALRESESRSLDQLSLDVLRPMLREWLDDNL 214 (236) Q Consensus 181 ~l~~~~~~~~~~tle~lv~emLrPmLk~WLD~nL 214 (236) -|..++...+-|-|++-|-.|-|=-|-+||=-|| T Consensus 652 vLQ~Vlry~KERQLDEAVGNvtR~QLLDWLm~NL 685 (685) T pfam05716 652 VLQSVLRYEKERQLDEAVGNVTRLQLLDWLMANL 685 (685) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999999987630678765788999999997419 No 19 >smart00807 AKAP_110 A-kinase anchor protein 110 kDa. This family consists of several mammalian protein kinase A anchoring protein 3 (PRKA3) or A-kinase anchor protein 110 kDa (AKAP 110) sequences. Agents that increase intracellular cAMP are potent stimulators of sperm motility. Anchoring inhibitor peptides, designed to disrupt the interaction of the cAMP-dependent protein kinase A (PKA) with A kinase-anchoring proteins (AKAPs), are potent inhibitors of sperm motility. PKA anchoring is a key biochemical mechanism controlling motility. AKAP110 shares compartments with both RI and RII isoforms of PKA and may function as a regulator of both motility- and head-associated functions such as capacitation and the acrosome reaction PUBMED:10319321. Probab=42.52 E-value=21 Score=16.95 Aligned_cols=34 Identities=38% Similarity=0.495 Sum_probs=26.7 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 5556532656530789999973799999766103 Q gi|254780932|r 181 QLVKALRESESRSLDQLSLDVLRPMLREWLDDNL 214 (236) Q Consensus 181 ~l~~~~~~~~~~tle~lv~emLrPmLk~WLD~nL 214 (236) -|..++.-.+-|-|++-|-.|-|=-|-+||=-|| T Consensus 818 vLQsVlry~KERQLDEAVGNvtR~QLLDWLm~NL 851 (851) T smart00807 818 VLQSVLRYEKERQLDEAVGNVTRLQLLDWLMANL 851 (851) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999999887621677765688999999987419 No 20 >PRK04439 S-adenosylmethionine synthetase; Provisional Probab=42.40 E-value=23 Score=16.63 Aligned_cols=60 Identities=8% Similarity=0.240 Sum_probs=37.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHH---------------------HCCCCCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHH Q ss_conf 122355678999876555565---------------------32656530789999973799999766103578999999 Q gi|254780932|r 165 ALVSSDVGDQVASSFDQLVKA---------------------LRESESRSLDQLSLDVLRPMLREWLDDNLPGIVERLVR 223 (236) Q Consensus 165 ~l~S~~t~~~v~~~~~~l~~~---------------------~~~~~~~tle~lv~emLrPmLk~WLD~nLP~lVErlVr 223 (236) -+++...++.+...++..... +....+..++++-++. +=.+.+|| .|++.+.+.|++ T Consensus 317 NvlA~~iA~~I~~~v~gv~ev~V~llSqIG~PId~P~~a~~~v~~~~g~~~~~~~~~v-~~I~~~~L-~~i~~~~~~l~~ 394 (400) T PRK04439 317 NVLAHQIAREIVEEVEGVKEVYVRLLSQIGKPIDQPLIASAQVIPEDGTLESEVEEEV-EEIVDREL-ANITKFTELLLE 394 (400) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEEEHHHCCCCCCCCCEEEEEEECCCCCCHHHHHHHH-HHHHHHHH-HHHHHHHHHHHC T ss_conf 9999999999998638942799997530499789874688999747898737789999-99999998-515889999975 Q ss_pred HHH Q ss_conf 999 Q gi|254780932|r 224 EEI 226 (236) Q Consensus 224 ~EI 226 (236) -++ T Consensus 395 g~~ 397 (400) T PRK04439 395 GKV 397 (400) T ss_pred CCC T ss_conf 874 No 21 >pfam04305 DUF455 Protein of unknown function (DUF455). Probab=42.12 E-value=17 Score=17.63 Aligned_cols=24 Identities=33% Similarity=0.552 Sum_probs=19.6 Q ss_pred HHHCHHHHHHHHHHHHHHHHCCCC Q ss_conf 610357899999999999960324 Q gi|254780932|r 211 DDNLPGIVERLVREEIERIARGPI 234 (236) Q Consensus 211 D~nLP~lVErlVr~EIerv~r~~~ 234 (236) |+-.-.|++.|.++||-+|+.|.| T Consensus 177 D~~sa~iL~~I~~dEI~HVa~G~r 200 (250) T pfam04305 177 DNESAAILEVILRDEIGHVAIGNR 200 (250) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 867899999999998889998799 No 22 >pfam03562 MltA MltA specific insert domain. This beta barrel domain is found inserted in the MltA a murein degrading transglycosylase enzyme. This domain may be involved in peptidoglycan binding. Probab=41.17 E-value=21 Score=16.93 Aligned_cols=21 Identities=33% Similarity=0.670 Sum_probs=15.6 Q ss_pred HHHHHHHHHHHHCHHHHHHHHH Q ss_conf 3799999766103578999999 Q gi|254780932|r 202 LRPMLREWLDDNLPGIVERLVR 223 (236) Q Consensus 202 LrPmLk~WLD~nLP~lVErlVr 223 (236) =-.-+|.||.+| |.-+.++.. T Consensus 84 sm~~Ik~wl~~n-P~~~~~~l~ 104 (113) T pfam03562 84 SMQAIRAWLRAN-PDRVDELLE 104 (113) T ss_pred CHHHHHHHHHHC-HHHHHHHHH T ss_conf 999999999989-899999997 No 23 >KOG1043 consensus Probab=40.59 E-value=19 Score=17.29 Aligned_cols=45 Identities=38% Similarity=0.465 Sum_probs=29.9 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHH----HCHHHHHHHHHHH Q ss_conf 555565326565307899999737999997661----0357899999999 Q gi|254780932|r 180 DQLVKALRESESRSLDQLSLDVLRPMLREWLDD----NLPGIVERLVREE 225 (236) Q Consensus 180 ~~l~~~~~~~~~~tle~lv~emLrPmLk~WLD~----nLP~lVErlVr~E 225 (236) .+|-..|...+.+++ +++.|-||.=|++|||- ++|....-|+|-+ T Consensus 346 ~el~~aC~~rgmra~-gv~~e~l~~ql~~wldlsl~~~vps~lL~Lsr~~ 394 (499) T KOG1043 346 LELQIACRERGMRAL-GVSEERLREQLRVWLDLSLDKKVPSVLLLLSRTF 394 (499) T ss_pred HHHHHHHHHHHCCHH-CCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 999999986212020-4654443477899886000356854998875442 No 24 >KOG1463 consensus Probab=40.48 E-value=28 Score=16.05 Aligned_cols=62 Identities=26% Similarity=0.289 Sum_probs=41.6 Q ss_pred HHHHHHHHHHHHHH--HHH--CCC--CCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHC Q ss_conf 56789998765555--653--265--653078999997379999976610357899999999999960 Q gi|254780932|r 170 DVGDQVASSFDQLV--KAL--RES--ESRSLDQLSLDVLRPMLREWLDDNLPGIVERLVREEIERIAR 231 (236) Q Consensus 170 ~t~~~v~~~~~~l~--~~~--~~~--~~~tle~lv~emLrPmLk~WLD~nLP~lVErlVr~EIerv~r 231 (236) .+..+|+.+|++.+ .+. +.. .-.--+.+||.-|.=.-..-|++||=.++|---|-||.+|++ T Consensus 286 ~AmkavAeA~~nRSLkdF~~AL~~yk~eL~~D~ivr~Hl~~Lyd~lLEknl~riIEPyS~Vei~hIA~ 353 (411) T KOG1463 286 DAMKAVAEAFGNRSLKDFEKALADYKKELAEDPIVRSHLQSLYDNLLEKNLCRIIEPYSRVEISHIAE 353 (411) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 99999999863786899999999968987128489999999999999873777717613543999999 No 25 >pfam00204 DNA_gyraseB DNA gyrase B. This family represents the second domain of DNA gyrase B which has a ribosomal S5 domain 2-like fold. This family is structurally related to PF01119. Probab=39.60 E-value=30 Score=15.78 Aligned_cols=38 Identities=18% Similarity=0.306 Sum_probs=27.4 Q ss_pred CHHHHHHHHHHHHHHHHHHHH---CHHHHHHHHHHHHHHHH Q ss_conf 078999997379999976610---35789999999999996 Q gi|254780932|r 193 SLDQLSLDVLRPMLREWLDDN---LPGIVERLVREEIERIA 230 (236) Q Consensus 193 tle~lv~emLrPmLk~WLD~n---LP~lVErlVr~EIerv~ 230 (236) -+..+|..+++--|..||+.| .-.|+++++.+.=.|++ T Consensus 126 ev~~~v~~~v~~~l~~~l~~np~~a~~Ii~k~~~~~~aR~a 166 (173) T pfam00204 126 EVRSAVESVVSEKFAKFLEKNPEIAKKILEKALAAAKARLA 166 (173) T ss_pred HHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999999999999998999999999999999999999 No 26 >PHA02130 hypothetical protein Probab=38.91 E-value=13 Score=18.39 Aligned_cols=10 Identities=60% Similarity=1.292 Sum_probs=7.6 Q ss_pred HHHHHHHHHC Q ss_conf 9999766103 Q gi|254780932|r 205 MLREWLDDNL 214 (236) Q Consensus 205 mLk~WLD~nL 214 (236) -||+|||.|. T Consensus 18 sl~~wl~~~~ 27 (81) T PHA02130 18 SLREWLDERF 27 (81) T ss_pred HHHHHHHHCC T ss_conf 9999998202 No 27 >KOG1168 consensus Probab=38.70 E-value=16 Score=17.68 Aligned_cols=25 Identities=24% Similarity=0.671 Sum_probs=14.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHCHHH Q ss_conf 0789999973799999766103578 Q gi|254780932|r 193 SLDQLSLDVLRPMLREWLDDNLPGI 217 (236) Q Consensus 193 tle~lv~emLrPmLk~WLD~nLP~l 217 (236) ||-.---=-|||+|+.||..-=-.+ T Consensus 267 TLSHNNMiALKPILqaWLEeAE~a~ 291 (385) T KOG1168 267 TLSHNNMIALKPILQAWLEEAEAAM 291 (385) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 0245763106689999999999998 No 28 >smart00467 GS GS motif. Aa approx. 30 amino acid motif that precedes the kinase domain in types I and II TGF beta receptors. Mutation of two or more of the serines or threonines in the TTSGSGSG of TGF-beta type I receptor impairs phosphorylation and signaling activity. Probab=38.51 E-value=13 Score=18.34 Aligned_cols=22 Identities=41% Similarity=0.673 Sum_probs=16.9 Q ss_pred HHHHHHHH-------HCHHHHHHHHHHHH Q ss_conf 99997661-------03578999999999 Q gi|254780932|r 205 MLREWLDD-------NLPGIVERLVREEI 226 (236) Q Consensus 205 mLk~WLD~-------nLP~lVErlVr~EI 226 (236) .|++||++ -||.||.|-|.+-| T Consensus 2 ~~~d~~~~~tSGSGSGlp~LvQrTiarqi 30 (30) T smart00467 2 TLSDLLEDTTSGSGSGLPLLVQRTVARQI 30 (30) T ss_pred CHHHHHHHCCCCCCCCCHHHHHHHHHHCC T ss_conf 57888652257888886024365575139 No 29 >pfam08171 Mad3_BUB1_II Mad3/BUB1 homology region 2. This domain is found in checkpoint proteins which are involved in cell division. This region has been shown to be necessary and sufficient for the binding of MAD3 to BUB3 in Saccharomyces cerevisiae. This domain is present in BUB1 which also binds BUB3. Probab=37.76 E-value=20 Score=16.99 Aligned_cols=20 Identities=25% Similarity=0.313 Sum_probs=15.7 Q ss_pred CCCCCCCHHHHHHHHHHHHC Q ss_conf 13568879999986767620 Q gi|254780932|r 5 NAVSEPSMEEIVNSIRRILE 24 (236) Q Consensus 5 ~~~~EPSMeeILASIRRIIs 24 (236) ...+|=|+|||||-.|+|-. T Consensus 39 ~~~eE~~~eEiLA~sR~~y~ 58 (67) T pfam08171 39 EDDEEFCLEEILAISRNLYK 58 (67) T ss_pred CCCCHHHHHHHHHHHHHHHH T ss_conf 88523039999999887887 No 30 >pfam08490 DUF1744 Domain of unknown function (DUF1744). This domain is found on the epsilon catalytic subunit of DNA polymerase. It is found C terminal to pfam03104 and pfam00136. Probab=36.91 E-value=20 Score=17.09 Aligned_cols=16 Identities=50% Similarity=1.047 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9997379999976610 Q gi|254780932|r 198 SLDVLRPMLREWLDDN 213 (236) Q Consensus 198 v~emLrPmLk~WLD~n 213 (236) +--+||=|+|.||++- T Consensus 263 af~iLR~mV~~W~~da 278 (396) T pfam08490 263 ALRVLKSMVKEWWDDA 278 (396) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999998 No 31 >PRK13946 shikimate kinase; Provisional Probab=36.83 E-value=32 Score=15.67 Aligned_cols=28 Identities=18% Similarity=0.278 Sum_probs=12.9 Q ss_pred HHHHCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 56532656530789999973799999766 Q gi|254780932|r 183 VKALRESESRSLDQLSLDVLRPMLREWLD 211 (236) Q Consensus 183 ~~~~~~~~~~tle~lv~emLrPmLk~WLD 211 (236) ++.++....++.+++|.++|+ .|+.||| T Consensus 162 Ad~~I~t~~~s~~~ia~eIi~-~L~~~~~ 189 (195) T PRK13946 162 ADLTVESRDVPHEVIADEVIE-ALAAYLE 189 (195) T ss_pred CCEEEECCCCCHHHHHHHHHH-HHHHHHH T ss_conf 898988998999999999999-9999985 No 32 >COG2080 CoxS Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion] Probab=36.12 E-value=9.6 Score=19.29 Aligned_cols=36 Identities=17% Similarity=0.277 Sum_probs=28.2 Q ss_pred HHHHHHHHCCCCCCCHHHHHH--HHHHHHHHHHHHHHC Q ss_conf 655556532656530789999--973799999766103 Q gi|254780932|r 179 FDQLVKALRESESRSLDQLSL--DVLRPMLREWLDDNL 214 (236) Q Consensus 179 ~~~l~~~~~~~~~~tle~lv~--emLrPmLk~WLD~nL 214 (236) ---++....|.+-+|+|+|.+ .-|.||-+-|++.+- T Consensus 60 Cl~~a~~~~G~~ItTiEGl~~~~~~l~~vQ~Af~e~~~ 97 (156) T COG2080 60 CLTLAVQAEGAEITTIEGLAKKDGGLHPVQQAFLEHDA 97 (156) T ss_pred HHHHHHHHCCCEEEEEECCCCCCCCCCHHHHHHHHCCC T ss_conf 88999995798599742226788875789999998299 No 33 >pfam08726 efhand_Ca_insen Ca2+ insensitive EF hand. EF hands are helix-loop-helix binding motifs involved in the regulation of many cellular processes. EF hands usually bind to Ca2+ ions which causes a major conformational change that allows the protein to interact with its designated targets. This domain corresponds to an EF hand which has partially or entirely lost its calcium-binding properties. The calcium insensitive EF hand is still able to mediate protein-protein recognition. Probab=36.08 E-value=34 Score=15.41 Aligned_cols=45 Identities=20% Similarity=0.241 Sum_probs=35.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCHHHH Q ss_conf 5678999876555565326565307899999737999997661035789 Q gi|254780932|r 170 DVGDQVASSFDQLVKALRESESRSLDQLSLDVLRPMLREWLDDNLPGIV 218 (236) Q Consensus 170 ~t~~~v~~~~~~l~~~~~~~~~~tle~lv~emLrPmLk~WLD~nLP~lV 218 (236) ++.++|..+|..|+ .+..-.|-++|-+++..=+. ++|=+|.|... T Consensus 3 ~TaEqv~~sFr~lA---~~KpYVT~~dLr~~L~pdqa-eyci~~M~~y~ 47 (69) T pfam08726 3 DTAEQVEQSFRALA---EGKPYVTEEDLRRALTPEQA-EYCIARMPPYS 47 (69) T ss_pred CCHHHHHHHHHHHH---CCCCCCCHHHHHHHCCHHHH-HHHHHHCCCCC T ss_conf 52999999999984---89981079999976899999-99998686666 No 34 >COG1491 Predicted RNA-binding protein [Translation, ribosomal structure and biogenesis] Probab=35.99 E-value=27 Score=16.13 Aligned_cols=26 Identities=31% Similarity=0.580 Sum_probs=19.6 Q ss_pred HHHHHHHHCHHHHHHHHHHHHHHHHC Q ss_conf 99976610357899999999999960 Q gi|254780932|r 206 LREWLDDNLPGIVERLVREEIERIAR 231 (236) Q Consensus 206 Lk~WLD~nLP~lVErlVr~EIerv~r 231 (236) |..---.|||++||.+|++-=+|-++ T Consensus 92 LT~~Ar~eLpyvve~iV~~~E~rFV~ 117 (202) T COG1491 92 LTDTARSELPYVVEKIVKENEDRFVK 117 (202) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 56777755189999999861567888 No 35 >cd06152 YjgF_YER057c_UK114_like_4 YjgF, YER057c, and UK114 belong to a large family of proteins present in bacteria, archaea, and eukaryotes with no definitive function. The conserved domain is similar in structure to chorismate mutase but there is no sequence similarity and no functional connection. Members of this family have been implicated in isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF) biosynthesis, as well as threonine anaerobic degradation (tdcF) and mitochondrial DNA maintenance (Ibm1). This domain homotrimerizes forming a distinct intersubunit cavity that may serve as a small molecule binding site. Probab=34.55 E-value=21 Score=16.95 Aligned_cols=51 Identities=16% Similarity=0.429 Sum_probs=33.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH--------------HHHHHHHHHHHHHCHH Q ss_conf 22355678999876555565326565307899999--------------7379999976610357 Q gi|254780932|r 166 LVSSDVGDQVASSFDQLVKALRESESRSLDQLSLD--------------VLRPMLREWLDDNLPG 216 (236) Q Consensus 166 l~S~~t~~~v~~~~~~l~~~~~~~~~~tle~lv~e--------------mLrPmLk~WLD~nLP~ 216 (236) .+......|+..+|.++...+...++.+++|+||- .+.-..++|+.+|.|. T Consensus 27 ~v~~d~~~Q~~q~~~Ni~~~L~~aG~~~~~dVVk~~vyltd~~~~~~~~~~~~~~~~~~~~~~Pa 91 (114) T cd06152 27 KIPEDLEEEIDQAFDNVELALKAAGGKGWEQVYKVNSYHVDIKNEEAFGLMVENFKKWMPNHQPI 91 (114) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHEEEEEEEEECCCCHHHHHHHHHHHHHHCCCCCCC T ss_conf 45889999999999999999998069675368999999954886899999999999657999996 No 36 >PRK00871 glutathione-regulated potassium-efflux system ancillary protein KefF; Provisional Probab=34.24 E-value=19 Score=17.21 Aligned_cols=10 Identities=30% Similarity=0.657 Sum_probs=5.0 Q ss_pred HHHHHHHHHH Q ss_conf 9999976610 Q gi|254780932|r 204 PMLREWLDDN 213 (236) Q Consensus 204 PmLk~WLD~n 213 (236) =.|++|+|.+ T Consensus 165 ~~L~~~~~~~ 174 (176) T PRK00871 165 QRLLEWQEAH 174 (176) T ss_pred HHHHHHHHHH T ss_conf 9999998754 No 37 >COG5406 Nucleosome binding factor SPN, SPT16 subunit [Transcription / DNA replication, recombination, and repair / Chromatin structure and dynamics] Probab=34.14 E-value=16 Score=17.70 Aligned_cols=15 Identities=13% Similarity=0.235 Sum_probs=6.6 Q ss_pred HHHHHHHHHHCCCCC Q ss_conf 999867676204677 Q gi|254780932|r 14 EIVNSIRRILENNDQ 28 (236) Q Consensus 14 eILASIRRIIsdde~ 28 (236) +|..+|-+-|+.+-+ T Consensus 326 ~iY~~~~~yi~~~~p 340 (1001) T COG5406 326 IIYSEAEKYISSNGP 340 (1001) T ss_pred HHHHHHHHHHHHCCC T ss_conf 689999999872398 No 38 >PRK04930 glutathione-regulated potassium-efflux system ancillary protein KefG; Provisional Probab=33.54 E-value=21 Score=16.97 Aligned_cols=11 Identities=55% Similarity=1.144 Sum_probs=7.4 Q ss_pred HHHHHHHHHHC Q ss_conf 99999766103 Q gi|254780932|r 204 PMLREWLDDNL 214 (236) Q Consensus 204 PmLk~WLD~nL 214 (236) ++||.|+|+=| T Consensus 78 allK~W~D~V~ 88 (184) T PRK04930 78 ALLKEWLDRVL 88 (184) T ss_pred HHHHHHHHHHH T ss_conf 99999999985 No 39 >COG0246 MtlD Mannitol-1-phosphate/altronate dehydrogenases [Carbohydrate transport and metabolism] Probab=33.53 E-value=25 Score=16.36 Aligned_cols=44 Identities=20% Similarity=0.406 Sum_probs=32.3 Q ss_pred CCCCCCCHHHHHHHHHHHHHH---HHHHHHC---HHHHHHHHH----HHHHHHH Q ss_conf 265653078999997379999---9766103---578999999----9999996 Q gi|254780932|r 187 RESESRSLDQLSLDVLRPMLR---EWLDDNL---PGIVERLVR----EEIERIA 230 (236) Q Consensus 187 ~~~~~~tle~lv~emLrPmLk---~WLD~nL---P~lVErlVr----~EIerv~ 230 (236) +..|+-.|-..|..++..--+ .||++|. =+||-|||= .++.+|. T Consensus 175 ~~~NG~~lk~~Vl~~a~~~~~~~a~wi~~~v~FpnsmVDRIVP~~t~~~~~~i~ 228 (473) T COG0246 175 IPRNGEVLKEAVLRFASEWDLALAAWIEENVGFPNSMVDRIVPATTDDERDEIE 228 (473) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 742069999999999876335799999961779840026457999757788888 No 40 >pfam02186 TFIIE_beta TFIIE beta subunit core domain. General transcription factor TFIIE consists of two subunits, TFIIE alpha pfam02002 and TFIIE beta. TFIIE beta has been found to bind to the region where the promoter starts to open to be single-stranded upon transcription initiation by RNA polymerase II. The structure of the DNA binding core region has been solved and has a winged helix fold. Probab=33.16 E-value=29 Score=15.95 Aligned_cols=39 Identities=26% Similarity=0.439 Sum_probs=24.6 Q ss_pred HHHHHHHHHC----CCCCCCHHHHHHHHH---HHHHHHHHHHHCHHH Q ss_conf 7655556532----656530789999973---799999766103578 Q gi|254780932|r 178 SFDQLVKALR----ESESRSLDQLSLDVL---RPMLREWLDDNLPGI 217 (236) Q Consensus 178 ~~~~l~~~~~----~~~~~tle~lv~emL---rPmLk~WLD~nLP~l 217 (236) .|+.|+-+|- -+.-.|||+|--++= -+.++.||- |+|.| T Consensus 3 ~f~~L~~aVeymK~r~~PltleeIld~lsldi~~~~~~~Lk-~n~kI 48 (65) T pfam02186 3 LFTVLAKAVEYMKERDHPLTVEEILDYLSLDIGPTVLPWLK-NNPKI 48 (65) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCCHHHHHHHH-HCCCE T ss_conf 88999999999994499747999999974684588999985-09974 No 41 >PRK00808 hypothetical protein; Provisional Probab=32.48 E-value=32 Score=15.67 Aligned_cols=21 Identities=19% Similarity=0.422 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHHHHHHCHHH Q ss_conf 9999973799999766103578 Q gi|254780932|r 196 QLSLDVLRPMLREWLDDNLPGI 217 (236) Q Consensus 196 ~lv~emLrPmLk~WLD~nLP~l 217 (236) ++..++|. .|++||-+|.=.. T Consensus 99 ~~~~el~~-fL~~WL~~HIl~~ 119 (150) T PRK00808 99 DVADELRN-MLSRWLFNHIRND 119 (150) T ss_pred HHHHHHHH-HHHHHHHHHHHHH T ss_conf 79999999-9999999985888 No 42 >PRK10996 thioredoxin 2; Provisional Probab=32.46 E-value=7.7 Score=19.99 Aligned_cols=42 Identities=17% Similarity=0.377 Sum_probs=29.4 Q ss_pred HHHHHHHH--HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 89998765--5556532656530789999973799999766103 Q gi|254780932|r 173 DQVASSFD--QLVKALRESESRSLDQLSLDVLRPMLREWLDDNL 214 (236) Q Consensus 173 ~~v~~~~~--~l~~~~~~~~~~tle~lv~emLrPmLk~WLD~nL 214 (236) ..+++.|+ .+...++-.+++-++.++--|=+.-|+.|||++| T Consensus 96 ~~la~~~~I~siPTlilfk~G~~v~r~~Ga~p~~~l~~wl~q~L 139 (139) T PRK10996 96 RELSARFGIRSIPTIMIFKNGQVVDMLNGAVPKAPFDSWLNEAL 139 (139) T ss_pred HHHHHHCCCCCCCEEEEEECCEEEEEEECCCCHHHHHHHHHHHC T ss_conf 78888707671885999989989777768999999999999649 No 43 >pfam05186 Dpy-30 Dpy-30 motif. This motif is found in a wide variety of domain contexts. It is found in the Dpy-30 proteins hence the motifs name. It is about 40 residues long and is probably formed of two alpha-helices. It may be a dimerization motif analogous to pfam02197 (Bateman A pers obs). Probab=32.28 E-value=35 Score=15.32 Aligned_cols=18 Identities=28% Similarity=0.654 Sum_probs=13.1 Q ss_pred HHHHHHHHC-HHHHHHHHH Q ss_conf 999766103-578999999 Q gi|254780932|r 206 LREWLDDNL-PGIVERLVR 223 (236) Q Consensus 206 Lk~WLD~nL-P~lVErlVr 223 (236) +++||++|+ |.|.+.|.. T Consensus 2 ~R~YL~~~v~P~Ll~GL~e 20 (42) T pfam05186 2 ARQYLNDTVAPILLQGLTE 20 (42) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 7899998747999999999 No 44 >pfam11838 DUF3358 Domain of unknown function (DUF3358). This domain is functionally uncharacterized. This domain is found in eukaryotes and bacteria. This presumed domain is typically between 99 to 428 amino acids in length. Probab=32.02 E-value=40 Score=14.97 Aligned_cols=65 Identities=20% Similarity=0.251 Sum_probs=39.7 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH---------HHHHHHHHHHH--CHHHHHHHHHHHHHHHHCCCC Q ss_conf 567899987655556532656530789999973---------79999976610--357899999999999960324 Q gi|254780932|r 170 DVGDQVASSFDQLVKALRESESRSLDQLSLDVL---------RPMLREWLDDN--LPGIVERLVREEIERIARGPI 234 (236) Q Consensus 170 ~t~~~v~~~~~~l~~~~~~~~~~tle~lv~emL---------rPmLk~WLD~n--LP~lVErlVr~EIerv~r~~~ 234 (236) .+..-+..+|+.|..+-..+..--...++.-++ .=.+..||+.| +|..+-|+|.+-+..+.|.=| T Consensus 347 ll~~y~~~yf~~l~~i~~~~~~~~a~~~~~~l~P~~~~s~~~~~~~~~~L~~~pd~~~~LrR~i~e~~D~l~Ral~ 422 (428) T pfam11838 347 LLAPYVERYFAALPGIWETRSSEFAQRFVTGLYPAWDVSAETLDAADAWLAAHPDLPAALRRLVLEGRDGLERALR 422 (428) T ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 9998899999999999986899999999997388766999999999999865998788999999983417799998 No 45 >PRK01917 hemerythrin family protein; Provisional Probab=31.57 E-value=40 Score=14.92 Aligned_cols=26 Identities=27% Similarity=0.484 Sum_probs=17.8 Q ss_pred HHHHHHHHHHHHHHHHHHCHHHHHHHH Q ss_conf 999997379999976610357899999 Q gi|254780932|r 196 QLSLDVLRPMLREWLDDNLPGIVERLV 222 (236) Q Consensus 196 ~lv~emLrPmLk~WLD~nLP~lVErlV 222 (236) .|+++ |.=.|++||.+|.-++=-.++ T Consensus 100 ~l~~~-l~~~L~~WL~~HI~~~D~~la 125 (142) T PRK01917 100 ELGRR-LVAELPEWFDQHVRTMDSMMV 125 (142) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHH T ss_conf 79999-999999999999898889999 No 46 >pfam08369 PCP_red Proto-chlorophyllide reductase 57 kD subunit. This domain is found in bacteria and plant chloroplast proteins. It often appears at the C-terminal of Nitrogenase component 1 type Oxidoreductases (pfam00148) and sometimes independently in bacterial proteins such as the Proto-chlorophyllide reductase 57 kD subunit of the Cyanobacterium Synechocystis. Probab=31.06 E-value=41 Score=14.86 Aligned_cols=25 Identities=32% Similarity=0.433 Sum_probs=20.1 Q ss_pred HHHHHHCHHHHHHHHHHHHHHHHCCC Q ss_conf 97661035789999999999996032 Q gi|254780932|r 208 EWLDDNLPGIVERLVREEIERIARGP 233 (236) Q Consensus 208 ~WLD~nLP~lVErlVr~EIerv~r~~ 233 (236) ..| ++.|.+|-+.||.-||+.||.- T Consensus 7 ~~L-~~iP~fvR~~~r~~~E~~A~~~ 31 (45) T pfam08369 7 AEL-KKIPFFVRKKVRRNAEKYAREA 31 (45) T ss_pred HHH-HHCCHHHHHHHHHHHHHHHHHC T ss_conf 998-7598988999999999999985 No 47 >PTZ00109 DNA gyrase subunit b; Provisional Probab=30.63 E-value=42 Score=14.81 Aligned_cols=34 Identities=15% Similarity=0.236 Sum_probs=25.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHH Q ss_conf 07899999737999997661035789999999999 Q gi|254780932|r 193 SLDQLSLDVLRPMLREWLDDNLPGIVERLVREEIE 227 (236) Q Consensus 193 tle~lv~emLrPmLk~WLD~nLP~lVErlVr~EIe 227 (236) .+-.+|..+++--|..||++| |.+-..||.+-+. T Consensus 505 ev~~~Ve~iv~e~L~~~leen-p~~ak~Ii~ka~~ 538 (941) T PTZ00109 505 FLKPILESAIFEKLSEIFDFE-PNLLNSIYFKALQ 538 (941) T ss_pred HHHHHHHHHHHHHHHHHHHHC-HHHHHHHHHHHHH T ss_conf 899999999999999999979-7999999999999 No 48 >pfam09028 Mac-1 Mac 1. The bacterial protein Mac 1 adopts an alpha/beta fold, with 14 beta strands and 9 alpha helices. The N-terminal domain is made up predominantly of alpha helices, whereas the C-terminal domain consists predominantly of beta sheets. Mac 1 blocks polymorphonuclear opsonophagocytosis, inhibits the production of reactive oxygen species and contains IgG endopeptidase activity. Probab=30.06 E-value=31 Score=15.70 Aligned_cols=19 Identities=32% Similarity=0.650 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHCHHHHHHHHHHHHHHHH Q ss_conf 7999997661035789999999999996 Q gi|254780932|r 203 RPMLREWLDDNLPGIVERLVREEIERIA 230 (236) Q Consensus 203 rPmLk~WLD~nLP~lVErlVr~EIerv~ 230 (236) -=||-=|||+| |.+|||-. T Consensus 102 sNMlHwWfdqN---------kd~Ie~yL 120 (333) T pfam09028 102 SNMLHWWFEQN---------KDYIEQYL 120 (333) T ss_pred HHHHHHHHHHC---------HHHHHHHH T ss_conf 42515787647---------68999985 No 49 >pfam05700 BCAS2 Breast carcinoma amplified sequence 2 (BCAS2). This family consists of several eukaryotic sequences of unknown function. The mammalian members of this family are annotated as breast carcinoma amplified sequence 2 (BCAS2) proteins. BCAS2 is a putative spliceosome associated protein. Probab=29.55 E-value=37 Score=15.22 Aligned_cols=14 Identities=36% Similarity=0.510 Sum_probs=9.6 Q ss_pred HHHHHHHHHCCCCC Q ss_conf 99999999603246 Q gi|254780932|r 222 VREEIERIARGPIR 235 (236) Q Consensus 222 Vr~EIerv~r~~~r 235 (236) +|++|+-|-|+.+. T Consensus 155 ~k~~ie~VN~~RK~ 168 (221) T pfam05700 155 LKEEIEEVNRQRKY 168 (221) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999999 No 50 >pfam10350 DUF2428 Putative death-receptor fusion protein (DUF2428). This is a family of proteins conserved from plants to humans. The function is not known. Several members have been annotated as being HEAT repeat-containing proteins while others are designated as death-receptor interacting proteins, but neither of these could be confirmed. Probab=29.55 E-value=43 Score=14.69 Aligned_cols=45 Identities=29% Similarity=0.477 Sum_probs=23.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHC-----------------HHHHHHHHHHHH Q ss_conf 99987655556532656530789999973799999766103-----------------578999999999 Q gi|254780932|r 174 QVASSFDQLVKALRESESRSLDQLSLDVLRPMLREWLDDNL-----------------PGIVERLVREEI 226 (236) Q Consensus 174 ~v~~~~~~l~~~~~~~~~~tle~lv~emLrPmLk~WLD~nL-----------------P~lVErlVr~EI 226 (236) +|..+|..+-..+...+.. -|.=+...||+++| |++|.-|+..|. T Consensus 105 ~v~~~F~~lC~~l~~s~~~--------~l~~LP~~Wl~~~l~~i~~~~~~~TRRSAGlP~~i~ail~ae~ 166 (241) T pfam10350 105 SVYPGFTALCQRLWRSRDP--------ELPALPEEWLEELLDAIFSKAQKITRRSAGLPYLVTAILSSEP 166 (241) T ss_pred HHHHHHHHHHHHHHCCCCH--------HHHHHHHHHHHHHHHHHHCCCCCCCHHCCCCHHHHHHHHHCCC T ss_conf 9999999999999717966--------7777489999999999853363100422672799999996077 No 51 >pfam10092 DUF2330 Uncharacterized protein conserved in bacteria (DUF2330). Members of this family of hypothetical bacterial proteins have no known function. Probab=29.30 E-value=23 Score=16.61 Aligned_cols=21 Identities=43% Similarity=0.767 Sum_probs=12.6 Q ss_pred HHHHHHHH---CHHHHHHHHHHHH Q ss_conf 99976610---3578999999999 Q gi|254780932|r 206 LREWLDDN---LPGIVERLVREEI 226 (236) Q Consensus 206 Lk~WLD~n---LP~lVErlVr~EI 226 (236) |.+||++| ||.=.+.+++.=| T Consensus 138 L~~WL~~NgY~ip~ga~~~L~pYv 161 (337) T pfam10092 138 LETWLDDNGYAIPPGAEAALRPYV 161 (337) T ss_pred HHHHHHHCCCCCCCCHHHHHHHHH T ss_conf 999998679768954578899998 No 52 >TIGR03198 pucE xanthine dehydrogenase E subunit. This gene has been characterized in B. subtilis as the Iron-sulfur cluster binding-subunit of xanthine dehydrogenase (pucE), acting in conjunction with pucC, the FAD-binding subunit and pucD, the molybdopterin binding subunit. The more common XDH complex (GenProp0640) includes the xdhA gene as the Fe-S cluster binding component. Probab=29.14 E-value=20 Score=17.01 Aligned_cols=28 Identities=14% Similarity=0.341 Sum_probs=11.7 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 3265653078999997379999976610 Q gi|254780932|r 186 LRESESRSLDQLSLDVLRPMLREWLDDN 213 (236) Q Consensus 186 ~~~~~~~tle~lv~emLrPmLk~WLD~n 213 (236) +.|..-.|+|+|..+=|-|+-+.|++.| T Consensus 67 ~~G~~I~TvEGl~~~~l~~vq~af~~~~ 94 (151) T TIGR03198 67 ADGHEITTIEGIAENELDPCQTAFLEEG 94 (151) T ss_pred HCCCEEEECCCCCCCCHHHHHHHHHHHC T ss_conf 3897899604456422429999999839 No 53 >PRK12383 putative mutase; Provisional Probab=28.78 E-value=30 Score=15.79 Aligned_cols=49 Identities=14% Similarity=0.173 Sum_probs=33.9 Q ss_pred HHHHHHHHHHHHHH-----HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHH Q ss_conf 23556789998765-----55565326565307899999737999997661035789999 Q gi|254780932|r 167 VSSDVGDQVASSFD-----QLVKALRESESRSLDQLSLDVLRPMLREWLDDNLPGIVERL 221 (236) Q Consensus 167 ~S~~t~~~v~~~~~-----~l~~~~~~~~~~tle~lv~emLrPmLk~WLD~nLP~lVErl 221 (236) +...+.+.+...|+ +|.+.-+.+-+|-.++..+.| +|+|..||.|..+| T Consensus 274 ~~~~t~~~~~~~~~gliF~NlVdfDl~GHRrD~~GYa~aL------e~fD~~L~ell~~l 327 (406) T PRK12383 274 VLDITLDEFNTHPTAFICANIQETDLAGHAEDVARYAERL------EVVDRNLARLLEAM 327 (406) T ss_pred HHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCHHHHHHHH------HHHHHHHHHHHHHC T ss_conf 9999999996069956997222312544555888999999------99999899999838 No 54 >COG5095 TAF6 Transcription initiation factor TFIID, subunit TAF6 (also component of histone acetyltransferase SAGA) [Transcription] Probab=28.70 E-value=45 Score=14.59 Aligned_cols=18 Identities=28% Similarity=0.372 Sum_probs=6.2 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 078999997379999976 Q gi|254780932|r 193 SLDQLSLDVLRPMLREWL 210 (236) Q Consensus 193 tle~lv~emLrPmLk~WL 210 (236) .+--|||-||-=-|+=++ T Consensus 182 E~KplvkHvLsKELq~YF 199 (450) T COG5095 182 ELKPLVKHVLSKELQMYF 199 (450) T ss_pred CCCHHHHHHHHHHHHHHH T ss_conf 020899999789999999 No 55 >pfam08283 Gemini_AL1_M Geminivirus rep protein central domain. This is the cetral domain of the geminivirus rep proteins. Probab=28.53 E-value=27 Score=16.12 Aligned_cols=16 Identities=31% Similarity=0.509 Sum_probs=10.9 Q ss_pred HHHHHHHHHHHHHHHHHHHC Q ss_conf 89999973799999766103 Q gi|254780932|r 195 DQLSLDVLRPMLREWLDDNL 214 (236) Q Consensus 195 e~lv~emLrPmLk~WLD~nL 214 (236) .+.|-+ .|++|+++|+ T Consensus 64 f~~vpe----el~~Wa~~nv 79 (106) T pfam08283 64 FLNVPE----EIQEWADDNF 79 (106) T ss_pred HCCCCH----HHHHHHHHHC T ss_conf 335879----9999999835 No 56 >PRK05057 aroK shikimate kinase I; Reviewed Probab=28.00 E-value=45 Score=14.55 Aligned_cols=13 Identities=15% Similarity=0.406 Sum_probs=7.2 Q ss_pred HHHHHCHHHHHHH Q ss_conf 7661035789999 Q gi|254780932|r 209 WLDDNLPGIVERL 221 (236) Q Consensus 209 WLD~nLP~lVErl 221 (236) |||..+-.+++|+ T Consensus 102 ~L~~~~~~~~~R~ 114 (172) T PRK05057 102 YLETTIEKQLART 114 (172) T ss_pred EEECCHHHHHHHH T ss_conf 9959989999980 No 57 >COG4229 Predicted enolase-phosphatase [Energy production and conversion] Probab=27.78 E-value=46 Score=14.48 Aligned_cols=45 Identities=22% Similarity=0.259 Sum_probs=30.8 Q ss_pred CCCCHHHHHHHHHH-HHHHHHHHHHCHHH-----HHHHHHHHHHHHHCCCC Q ss_conf 65307899999737-99999766103578-----99999999999960324 Q gi|254780932|r 190 ESRSLDQLSLDVLR-PMLREWLDDNLPGI-----VERLVREEIERIARGPI 234 (236) Q Consensus 190 ~~~tle~lv~emLr-PmLk~WLD~nLP~l-----VErlVr~EIerv~r~~~ 234 (236) -++-|.++|++--+ +-+|.|||++|-.. +|+||.-=.+-|+.+++ T Consensus 29 a~~~lp~fv~e~~e~~~v~~~v~~v~~e~g~~~s~E~lva~~~~wiaed~K 79 (229) T COG4229 29 AARKLPDFVRENTEDSEVKKIVDEVLSEFGIANSEEALVALLLEWIAEDSK 79 (229) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCC T ss_conf 887768999861248706689999999857463699999999999864465 No 58 >cd07977 TFIIE_beta_winged_helix TFIIE_beta_winged_helix domain, located at the central core region of TFIIE beta, with double-stranded DNA binding activity. Transcription Factor IIE (TFIIE) beta winged-helix (or forkhead) domain is located at the central core region of TFIIE beta. The winged-helix is a form of helix-turn-helix (HTH) domain which typically binds DNA with the 3rd helix. The winged-helix domain is distinguished by the presence of a C-terminal beta-strand hairpin unit (the wing) that packs against the cleft of the tri-helical core. Although most winged-helix domains are multi-member families, TFIIE beta winged-helix domain is typically found as a single orthologous group. TFIIE is one of the six eukaryotic general transcription factors (TFIIA, TFIIB, TFIID, TFIIE, TFIIF and TFIIH) that are required for transcription initiation of protein-coding genes. TFIIE is a heterotetramer consisting of two copies each of alpha and beta subunits. TFIIE beta contains several functional Probab=27.43 E-value=47 Score=14.44 Aligned_cols=25 Identities=28% Similarity=0.514 Sum_probs=20.1 Q ss_pred CCCCHHHHHHHHH----HHHHHHHHHHHC Q ss_conf 6530789999973----799999766103 Q gi|254780932|r 190 ESRSLDQLSLDVL----RPMLREWLDDNL 214 (236) Q Consensus 190 ~~~tle~lv~emL----rPmLk~WLD~nL 214 (236) .-.|||+|.-++= .+++|+||-.+. T Consensus 24 ~Pltl~EIld~~~~~d~~~~~k~wL~~~~ 52 (75) T cd07977 24 HPLTLDEILDYLSLLDIGPKLKEWLKSEA 52 (75) T ss_pred CCCCHHHHHHHHHCCCCCHHHHHHHHHCC T ss_conf 88259999999731686689999998612 No 59 >cd03160 CD37_CD82_like_LEL Tetraspanin, extracellular domain or large extracellular loop (LEL), CD37_CD82_Like family. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". CD37 is a leukocyte-specific protein, and its restricted expression pattern suggests a role in the immune system. A regulatory role in T-cell proliferation has been suggested. CD82 is a metastasis suppressor implicated in biological processes ranging from fusion, adhesion, and migration to apoptos Probab=27.06 E-value=28 Score=16.04 Aligned_cols=18 Identities=28% Similarity=0.541 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHHHHHHHC Q ss_conf 999973799999766103 Q gi|254780932|r 197 LSLDVLRPMLREWLDDNL 214 (236) Q Consensus 197 lv~emLrPmLk~WLD~nL 214 (236) |=.+==..-|++||.+|| T Consensus 99 IY~~GC~~kl~~Wl~~Nl 116 (117) T cd03160 99 VYQEGCMEKLQSWLNENL 116 (117) T ss_pred HHHHHHHHHHHHHHHHHC T ss_conf 678975999999999757 No 60 >pfam07533 BRK BRK domain. The function of this domain is unknown. It is often found associated with helicases and transcription factors. Probab=26.56 E-value=28 Score=16.04 Aligned_cols=19 Identities=32% Similarity=0.648 Sum_probs=12.0 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 2656530789999973799999766103 Q gi|254780932|r 187 RESESRSLDQLSLDVLRPMLREWLDDNL 214 (236) Q Consensus 187 ~~~~~~tle~lv~emLrPmLk~WLD~nL 214 (236) .|..+.++.+ |..||+.|= T Consensus 20 ~G~~AP~~~~---------L~~WL~~nP 38 (45) T pfam07533 20 TGDEAPKLKD---------LERWLQENP 38 (45) T ss_pred CCCCCCCHHH---------HHHHHHHCC T ss_conf 5566968789---------999999885 No 61 >pfam04690 YABBY YABBY protein. YABBY proteins are a group of plant-specific transcription involved in the specification of abaxial polarity in lateral organs. Probab=26.29 E-value=48 Score=14.37 Aligned_cols=19 Identities=37% Similarity=0.767 Sum_probs=15.6 Q ss_pred HHCHHHHHHHHHHHHHHHH Q ss_conf 1035789999999999996 Q gi|254780932|r 212 DNLPGIVERLVREEIERIA 230 (236) Q Consensus 212 ~nLP~lVErlVr~EIerv~ 230 (236) +-+|.---|-.|+||.||= T Consensus 123 qRvPSAYNrFIKEEIQRIK 141 (170) T pfam04690 123 QRVPSAYNRFIKEEIQRIK 141 (170) T ss_pred CCCCHHHHHHHHHHHHHHH T ss_conf 0683677777999998987 No 62 >PRK13948 shikimate kinase; Provisional Probab=26.28 E-value=49 Score=14.30 Aligned_cols=29 Identities=17% Similarity=0.375 Sum_probs=18.3 Q ss_pred HHHHCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 565326565307899999737999997661 Q gi|254780932|r 183 VKALRESESRSLDQLSLDVLRPMLREWLDD 212 (236) Q Consensus 183 ~~~~~~~~~~tle~lv~emLrPmLk~WLD~ 212 (236) ++..+...+.|++++|.++|+ .|+.|+|. T Consensus 151 A~~~I~td~~~~~eiv~~Ii~-~L~a~~~~ 179 (182) T PRK13948 151 GTIHVHSDGRPVEEIVEEVVE-RLWAWAEA 179 (182) T ss_pred CCEEEECCCCCHHHHHHHHHH-HHHHHHHC T ss_conf 788988998999999999999-99999866 No 63 >pfam03858 Crust_neuro_H Crustacean neurohormone H. These proteins are referred to as precursor-related peptides as they are typically co-transcribed and translated with the CHH neurohormone (pfam01147). However, in some species this neuropeptide is synthesized as a separate protein. Furthermore, neurohormone H can undergo proteolysis to give rise to 5 different neuropeptides. Probab=26.03 E-value=34 Score=15.41 Aligned_cols=16 Identities=31% Similarity=0.490 Sum_probs=12.0 Q ss_pred CHHHHHHHHHHHHCCC Q ss_conf 7999998676762046 Q gi|254780932|r 11 SMEEIVNSIRRILENN 26 (236) Q Consensus 11 SMeeILASIRRIIsdd 26 (236) -||-+|||+|-..+.. T Consensus 8 RMeRLLaslrg~a~~~ 23 (41) T pfam03858 8 RMERLLASLRGSAESS 23 (41) T ss_pred HHHHHHHHHHCCCCCC T ss_conf 6999999975235688 No 64 >TIGR03454 partition_RepB plasmid partitioning protein RepB. Members of this family are the RepB protein involved in replicon partitioning. RepB is found, in general, as part of a repABC operon in plasmids and small chromosomes, separate from the main chromosome, in various bacteria. This model describes a rather narrow clade of proteins; it should be noted that additional homologs scoring below the trusted cutoff have very similar functions, although they may be named differently. Probab=25.96 E-value=35 Score=15.33 Aligned_cols=18 Identities=22% Similarity=0.519 Sum_probs=12.8 Q ss_pred HHHHHHHHHHCHHHHHHH Q ss_conf 999997661035789999 Q gi|254780932|r 204 PMLREWLDDNLPGIVERL 221 (236) Q Consensus 204 PmLk~WLD~nLP~lVErl 221 (236) |-.-+||..||+.|.++- T Consensus 306 ~~F~~~l~~~L~~Ly~~f 323 (325) T TIGR03454 306 PAFGEFITARLDDLYEAF 323 (325) T ss_pred CCHHHHHHHHHHHHHHHH T ss_conf 408999998899999997 No 65 >PRK11623 pcnB poly(A) polymerase I; Provisional Probab=25.85 E-value=50 Score=14.25 Aligned_cols=57 Identities=21% Similarity=0.326 Sum_probs=36.9 Q ss_pred HHHHHHHHHHHCCCC-CCCHHHHHH----HHHHHHHHHHHHHHCHHHHHHHH----HHHHHHHHCC Q ss_conf 987655556532656-530789999----97379999976610357899999----9999999603 Q gi|254780932|r 176 ASSFDQLVKALRESE-SRSLDQLSL----DVLRPMLREWLDDNLPGIVERLV----REEIERIARG 232 (236) Q Consensus 176 ~~~~~~l~~~~~~~~-~~tle~lv~----emLrPmLk~WLD~nLP~lVErlV----r~EIerv~r~ 232 (236) ++.|+++--.+.+|. ..|++.|.+ .-|-|.|..||+++.-...+++| +.==+||.-| T Consensus 251 aRLfdE~lKLf~sG~a~~~~~~L~~~gL~~~LfP~~~~~l~~~~~~~~~~~i~~aL~nTD~RI~~g 316 (471) T PRK11623 251 ARLFEESLKLLQAGYGYETYKLLCEYHLFQPLFPTITRYFTENGDSPMERIIEQVLKNTDTRIHND 316 (471) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCC T ss_conf 678999999986585899999999868767651527877631267089999999999799998688 No 66 >pfam06895 DUF1267 Protein of unknown function (DUF1267). This family consists of several Lactococcus lactis and Lactococcus phage proteins of around 74 residues in length. The function of this family is unknown. Probab=25.38 E-value=43 Score=14.73 Aligned_cols=16 Identities=44% Similarity=0.598 Sum_probs=9.8 Q ss_pred HHHCHHHHHHHHHHHH Q ss_conf 6103578999999999 Q gi|254780932|r 211 DDNLPGIVERLVREEI 226 (236) Q Consensus 211 D~nLP~lVErlVr~EI 226 (236) |+-+.-++|||||=|- T Consensus 3 de~~~ei~ERLvRIET 18 (74) T pfam06895 3 DELIREILERLARIET 18 (74) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 3789999999999998 No 67 >TIGR01377 soxA_mon sarcosine oxidase, monomeric form; InterPro: IPR006281 This set of sequences describe the alpha subunit of a family of known and putative heterotetrameric sarcosine oxidases. Five operons of such oxidases are found in Mesorhizobium loti and three in Agrobacterium tumefaciens, a high enough copy number to suggest that not all members share the same function. Sarcosine oxidase catalyzes the oxidative demethylation of sarcosine to glycine. The reaction converts tetrahydrofolate to 5,10-methylene-tetrahydrofolate . Bacterial sarcosine oxidases have been isolated from over a dozen different organisms and fall into two major classes (1) monomeric form that contains only covalent flavin and (2) heterotetrameric (alpha, beta, gamma, delta) form that contain a covalent and noncovalent flavin, this entry represents the monomeric form.; GO: 0008115 sarcosine oxidase activity, 0046653 tetrahydrofolate metabolic process. Probab=24.88 E-value=18 Score=17.39 Aligned_cols=19 Identities=32% Similarity=0.704 Sum_probs=15.7 Q ss_pred HHHHHHHHHCHHHH--HHHHH Q ss_conf 99997661035789--99999 Q gi|254780932|r 205 MLREWLDDNLPGIV--ERLVR 223 (236) Q Consensus 205 mLk~WLD~nLP~lV--ErlVr 223 (236) .|++||-++||.|+ +.++. T Consensus 303 ~L~~f~~~~~pg~~ds~ep~~ 323 (401) T TIGR01377 303 ILKKFVRDHLPGLKDSEEPKA 323 (401) T ss_pred HHHHHHHHHCCCCCCCCCCCC T ss_conf 999999985788656656401 No 68 >KOG3995 consensus Probab=24.67 E-value=26 Score=16.31 Aligned_cols=22 Identities=27% Similarity=0.623 Sum_probs=17.5 Q ss_pred HHHHHHHHHHHH-HHHHHHHHCH Q ss_conf 789999973799-9997661035 Q gi|254780932|r 194 LDQLSLDVLRPM-LREWLDDNLP 215 (236) Q Consensus 194 le~lv~emLrPm-Lk~WLD~nLP 215 (236) ++--.+++.+|| +|+|||+|.- T Consensus 172 f~~~t~~~~~P~s~~~~~~~h~~ 194 (279) T KOG3995 172 FPLSTRSIMEPMSLDAWLDSHHR 194 (279) T ss_pred CCCCCCCCCCCCCHHHHHHHHHH T ss_conf 66530236666534678888888 No 69 >pfam10723 RepB-RCR_reg Replication regulatory protein RepB. This is a family of proteins which regulate replication of rolling circle replication (RCR) plasmids that have a double-strand replication origin (dso). Regulation of replication of RCR plasmids occurs mainly at initiation of leading strand synthesis at the dso, such that Rep protein concentration controls plasmid replication. Probab=24.34 E-value=53 Score=14.06 Aligned_cols=30 Identities=20% Similarity=0.398 Sum_probs=22.8 Q ss_pred HHHHHHHHHHHHHH-----HC--HHHHHHHHHHHHHH Q ss_conf 99737999997661-----03--57899999999999 Q gi|254780932|r 199 LDVLRPMLREWLDD-----NL--PGIVERLVREEIER 228 (236) Q Consensus 199 ~emLrPmLk~WLD~-----nL--P~lVErlVr~EIer 228 (236) .=.+..-||+||.. -| -.++|+|++.|..| T Consensus 45 ~v~i~n~lK~~L~elc~~~GlTQAe~IE~LIe~e~~~ 81 (85) T pfam10723 45 KVFIPNPLKDDLQELCEEEGLTQAEVIEQLIERELQG 81 (85) T ss_pred EEEECHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 1353689999999999872971999999999999997 No 70 >COG4691 StbC Plasmid stability protein [General function prediction only] Probab=24.22 E-value=50 Score=14.29 Aligned_cols=30 Identities=20% Similarity=0.279 Sum_probs=23.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHH Q ss_conf 565307899999737999997661035789999 Q gi|254780932|r 189 SESRSLDQLSLDVLRPMLREWLDDNLPGIVERL 221 (236) Q Consensus 189 ~~~~tle~lv~emLrPmLk~WLD~nLP~lVErl 221 (236) .-+|+.|+=+|++||-||+. +|-|.-|.=. T Consensus 23 ~HgrS~EaE~ReIL~~al~~---E~~p~~~alA 52 (80) T COG4691 23 RHGRSVEAEAREILTAALLG---EEVPMPVALA 52 (80) T ss_pred HCCCCHHHHHHHHHHHHHCC---CCCCCHHHHH T ss_conf 60665378999999997554---5777478988 No 71 >PRK13456 DNA protection protein DPS; Provisional Probab=23.99 E-value=54 Score=14.01 Aligned_cols=14 Identities=29% Similarity=0.411 Sum_probs=8.9 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 78999999999999 Q gi|254780932|r 216 GIVERLVREEIERI 229 (236) Q Consensus 216 ~lVErlVr~EIerv 229 (236) .|++.|.++|||+= T Consensus 140 ~la~~IL~dE~EHe 153 (184) T PRK13456 140 DLALAILNEEIEHE 153 (184) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999989 No 72 >KOG1588 consensus Probab=23.48 E-value=55 Score=13.95 Aligned_cols=23 Identities=35% Similarity=0.400 Sum_probs=17.8 Q ss_pred HCHH--HHHHHHHHHHHHHHCCCCC Q ss_conf 0357--8999999999999603246 Q gi|254780932|r 213 NLPG--IVERLVREEIERIARGPIR 235 (236) Q Consensus 213 nLP~--lVErlVr~EIerv~r~~~r 235 (236) .++. +++||+-+||+||--.-|. T Consensus 46 ~~~~~~~~~rLL~~Ei~rv~~~~~~ 70 (259) T KOG1588 46 PFFPFPHAERLLDEEIERVQTSGRQ 70 (259) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 6411679999999999998864320 No 73 >TIGR01689 EcbF-BcbF capsule biosynthesis phosphatase; InterPro: IPR010039 This entry describes a small family of highly conserved proteins (>60 0dentity). Two of these, BcbF and EcbF of Pasteurella multocida are believed to be part of the capsule polysaccharide biosynthesis machinery because they are co-transcribed from a locus devoted to that purpose . In P. multocida there are six different variant capsules (A-F), and these proteins are found only in B and E. The other two species in which this gene is (currently) found are both also pathogenic. These proteins are also members of the IIIC subfamily of the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. All of the characterised enzymes within subfamily III are phosphatases, and all of the active site residues characteristic of HAD-superfamily phosphatases are present in this subfamily.. Probab=23.48 E-value=35 Score=15.36 Aligned_cols=27 Identities=15% Similarity=0.394 Sum_probs=16.6 Q ss_pred CCCCCCHHHHHHHH---HHHHHHHHHHHHC Q ss_conf 65653078999997---3799999766103 Q gi|254780932|r 188 ESESRSLDQLSLDV---LRPMLREWLDDNL 214 (236) Q Consensus 188 ~~~~~tle~lv~em---LrPmLk~WLD~nL 214 (236) +.|-||-++=|--+ -=|.+-.|||+|- T Consensus 48 sRNMRTY~GNvG~INI~TLP~I~~WL~~H~ 77 (126) T TIGR01689 48 SRNMRTYEGNVGKINIHTLPIIIDWLNQHN 77 (126) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 656532137614121202126887875069 No 74 >pfam09494 Slx4 Slx4 endonuclease. The Slx4 protein is a heteromeric structure-specific endonuclease found in fungi. Slx4 with Slx1 acts as a nuclease on branched DNA substrates, particularly simple-Y, 5'-flap, or replication fork structures by cleaving the strand bearing the 5' non-homologous arm at the branch junction and thus generating ligatable nicked products from 5'-flap or replication fork substrates. Probab=23.46 E-value=55 Score=13.94 Aligned_cols=31 Identities=26% Similarity=0.671 Sum_probs=19.4 Q ss_pred HHHHHHHHHHHCCCC-----CCCHHHHHHHHHHHHHHHHH Q ss_conf 987655556532656-----53078999997379999976 Q gi|254780932|r 176 ASSFDQLVKALRESE-----SRSLDQLSLDVLRPMLREWL 210 (236) Q Consensus 176 ~~~~~~l~~~~~~~~-----~~tle~lv~emLrPmLk~WL 210 (236) ..-|+.|.+.|..+. -.|+|-|+-+. |++|| T Consensus 563 ~~if~~lt~LVkq~p~l~ekI~tFEPI~leE----L~~wL 598 (627) T pfam09494 563 SAVFDRLTQLVKQNPDLLEKIYTFEPIRLEE----LQEWL 598 (627) T ss_pred HHHHHHHHHHHHHCHHHHHHHHCCCCCCHHH----HHHHH T ss_conf 9999999999864889986531046602999----99999 No 75 >pfam06931 Adeno_E4_ORF3 Mastadenovirus E4 ORF3 protein. This family consists of several Mastadenovirus E4 ORF3 proteins. Early proteins E4 ORF3 and E4 ORF6 have complementary functions during viral infection. Both proteins facilitate efficient viral DNA replication, late protein expression, and prevention of concatenation of viral genomes. A unique function of E4 ORF3 is the reorganisation of nuclear structures known as PML oncogenic domains (PODs). The function of these domains is unclear, but PODs have been implicated in a number of important cellular processes, including transcriptional regulation, apoptosis, transformation, and response to interferon. Probab=23.39 E-value=56 Score=13.94 Aligned_cols=38 Identities=26% Similarity=0.573 Sum_probs=24.0 Q ss_pred HHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCHHHHHH Q ss_conf 9987655556532656530789999973799999766103578999 Q gi|254780932|r 175 VASSFDQLVKALRESESRSLDQLSLDVLRPMLREWLDDNLPGIVER 220 (236) Q Consensus 175 v~~~~~~l~~~~~~~~~~tle~lv~emLrPmLk~WLD~nLP~lVEr 220 (236) |.+|+.+|=.+. +..| .+.|+-.++.|-+.|-=+||+. T Consensus 6 VEgAL~elF~m~----Gl~l----q~~~~~ii~~Wk~ENYLGmVq~ 43 (113) T pfam06931 6 VEGALSELFNMH----GLNL----QQLLRDIIRGWKNENYLGMVQD 43 (113) T ss_pred HHHHHHHHHHHC----CCCH----HHHHHHHHHHHHHCCCHHHHHH T ss_conf 487999999981----7887----9999999997645450256664 No 76 >PRK10696 C32 tRNA thiolase; Provisional Probab=23.36 E-value=56 Score=13.93 Aligned_cols=31 Identities=32% Similarity=0.604 Sum_probs=24.8 Q ss_pred CHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHH Q ss_conf 07899999737999997661035789999999 Q gi|254780932|r 193 SLDQLSLDVLRPMLREWLDDNLPGIVERLVRE 224 (236) Q Consensus 193 tle~lv~emLrPmLk~WLD~nLP~lVErlVr~ 224 (236) +-+.|-|...|=||.+|. ..-|+.++-+.+. T Consensus 215 sq~~lqR~~ik~ml~~~~-~~~p~r~~~~~~a 245 (311) T PRK10696 215 SQPNLQRQVIADMLRDWD-KRYPGRIETMFSA 245 (311) T ss_pred CCHHHHHHHHHHHHHHHH-HHCCCCHHHHHHH T ss_conf 836789999999999999-9888828999999 No 77 >PRK13700 conjugal transfer protein TraD; Provisional Probab=23.28 E-value=56 Score=13.92 Aligned_cols=19 Identities=32% Similarity=0.735 Sum_probs=9.4 Q ss_pred HHHHHHHCHHHHHHHHHHH Q ss_conf 9976610357899999999 Q gi|254780932|r 207 REWLDDNLPGIVERLVREE 225 (236) Q Consensus 207 k~WLD~nLP~lVErlVr~E 225 (236) ..|..++-+..-..+-|+| T Consensus 694 ~~~~~~~~~~~~~~~~~~~ 712 (732) T PRK13700 694 EAWQQENHPDIQQQMQRRE 712 (732) T ss_pred HHHHHHCCCCHHHHHHHHH T ss_conf 9999752433287877766 No 78 >pfam04564 U-box U-box domain. This domain is related to the Ring finger pfam00097 but lacks the zinc binding residues. Probab=22.64 E-value=47 Score=14.46 Aligned_cols=28 Identities=29% Similarity=0.370 Sum_probs=18.9 Q ss_pred HCCCCCCCHHHHH-HHHHHHHHHHHHHHH Q ss_conf 3265653078999-997379999976610 Q gi|254780932|r 186 LRESESRSLDQLS-LDVLRPMLREWLDDN 213 (236) Q Consensus 186 ~~~~~~~tle~lv-~emLrPmLk~WLD~n 213 (236) ...+...+..+|+ ---||=++.+||.+| T Consensus 43 P~T~~~l~~~~l~pN~~Lk~~I~~w~~~n 71 (74) T pfam04564 43 PFTREPLTHDQLIPNLELKEKIDAWLEEN 71 (74) T ss_pred CCCCCCCCCCCEEECHHHHHHHHHHHHHC T ss_conf 87787488565042399999999999986 No 79 >PRK13702 replication protein; Provisional Probab=22.46 E-value=58 Score=13.81 Aligned_cols=31 Identities=23% Similarity=0.407 Sum_probs=22.4 Q ss_pred HHHHHHHHHHHHHHH-----HC--HHHHHHHHHHHHHH Q ss_conf 999737999997661-----03--57899999999999 Q gi|254780932|r 198 SLDVLRPMLREWLDD-----NL--PGIVERLVREEIER 228 (236) Q Consensus 198 v~emLrPmLk~WLD~-----nL--P~lVErlVr~EIer 228 (236) ++=.+++-||+||.+ -| -.|+|+|.+.|+.| T Consensus 44 i~Vfi~~~lK~~l~e~C~~~GlTQAe~ie~LIe~E~~~ 81 (82) T PRK13702 44 IKVFIQNPLKDKLLELCEEEGLTQAEMIEKLIERELAR 81 (82) T ss_pred HHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC T ss_conf 52100358899999999883864999999999998843 No 80 >PRK09381 trxA thioredoxin; Provisional Probab=22.43 E-value=15 Score=18.02 Aligned_cols=32 Identities=22% Similarity=0.173 Sum_probs=22.7 Q ss_pred HHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 56532656530789999973799999766103 Q gi|254780932|r 183 VKALRESESRSLDQLSLDVLRPMLREWLDDNL 214 (236) Q Consensus 183 ~~~~~~~~~~tle~lv~emLrPmLk~WLD~nL 214 (236) ...+.-.++..+..++-.+=+-.|++|||.|| T Consensus 77 PT~~~fk~G~~v~~~~G~~~~~~l~~~i~~~L 108 (109) T PRK09381 77 PTLLLFKNGEVAATKVGALSKGQLKEFLDANL 108 (109) T ss_pred CEEEEEECCEEEEEEECCCCHHHHHHHHHHHH T ss_conf 84999999989788727999999999999860 No 81 >KOG0100 consensus Probab=22.02 E-value=44 Score=14.64 Aligned_cols=39 Identities=21% Similarity=0.513 Sum_probs=23.8 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHH--HHHHHHHHH Q ss_conf 26565307899999737999997661035789999--999999996 Q gi|254780932|r 187 RESESRSLDQLSLDVLRPMLREWLDDNLPGIVERL--VREEIERIA 230 (236) Q Consensus 187 ~~~~~~tle~lv~emLrPmLk~WLD~nLP~lVErl--Vr~EIerv~ 230 (236) .+....|+|+.|.+.| +||+.|-=..-|.. -++|.|-|+ T Consensus 594 ~~edKe~~e~av~e~~-----eWL~~n~~a~~Ee~~ek~kele~vv 634 (663) T KOG0100 594 SDEDKETIEDAVEEAL-----EWLESNQDASKEEFKEKKKELEAVV 634 (663) T ss_pred CHHHHHHHHHHHHHHH-----HHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 8667899999999999-----9876166656778999998899998 No 82 >COG2249 MdaB Putative NADPH-quinone reductase (modulator of drug activity B) [General function prediction only] Probab=21.59 E-value=46 Score=14.54 Aligned_cols=11 Identities=27% Similarity=0.915 Sum_probs=7.6 Q ss_pred HHHHHHHHHHC Q ss_conf 99999766103 Q gi|254780932|r 204 PMLREWLDDNL 214 (236) Q Consensus 204 PmLk~WLD~nL 214 (236) .|||.|+|+=| T Consensus 84 aiLKg~iDrV~ 94 (189) T COG2249 84 ALLKGWIDRVF 94 (189) T ss_pred HHHHHHHHHHH T ss_conf 99998999993 No 83 >pfam08482 HrpB_C ATP-dependent helicase C-terminal. This domain is found near the C-terminus of bacterial ATP-dependent helicases such as HrpB. Probab=21.51 E-value=58 Score=13.81 Aligned_cols=35 Identities=23% Similarity=0.081 Sum_probs=22.2 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHH Q ss_conf 65307899999737999997661035789999999 Q gi|254780932|r 190 ESRSLDQLSLDVLRPMLREWLDDNLPGIVERLVRE 224 (236) Q Consensus 190 ~~~tle~lv~emLrPmLk~WLD~nLP~lVErlVr~ 224 (236) +.+++.+|-+--|...|+.||+-+.-..++++.=. T Consensus 8 g~~s~~~l~~~~~~~~L~~~L~w~~~~~Ld~~aP~ 42 (133) T pfam08482 8 GVRSLADLKKLDLLEALLALLPWEQQQRLDELAPT 42 (133) T ss_pred CCCCHHHHHCCCHHHHHHHHCCHHHHHHHHHHCCC T ss_conf 78898987527999999987999999999986976 No 84 >KOG4761 consensus Probab=21.30 E-value=34 Score=15.42 Aligned_cols=24 Identities=38% Similarity=0.497 Sum_probs=12.7 Q ss_pred HHCHHHHHHHHHHHHHHHH-CCCCC Q ss_conf 1035789999999999996-03246 Q gi|254780932|r 212 DNLPGIVERLVREEIERIA-RGPIR 235 (236) Q Consensus 212 ~nLP~lVErlVr~EIerv~-r~~~r 235 (236) .||=.||+||+++=|..+. ++.|| T Consensus 132 kn~dklvtrlrsEiidpl~~~~tr~ 156 (266) T KOG4761 132 KNLDKLVTRLRSEIIDPLVTVGTRR 156 (266) T ss_pred CCHHHHHHHHHHHHCCCCCCCCCEE T ss_conf 5799999999986057443355232 No 85 >COG0816 Predicted endonuclease involved in recombination (possible Holliday junction resolvase in Mycoplasmas and B. subtilis) [DNA replication, recombination, and repair] Probab=20.57 E-value=46 Score=14.50 Aligned_cols=21 Identities=29% Similarity=0.482 Sum_probs=14.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHCH Q ss_conf 307899999737999997661035 Q gi|254780932|r 192 RSLDQLSLDVLRPMLREWLDDNLP 215 (236) Q Consensus 192 ~tle~lv~emLrPmLk~WLD~nLP 215 (236) ..+|.++.-+ +|+.|||.|.- T Consensus 119 ~~iD~~AA~~---ILq~~ld~~~~ 139 (141) T COG0816 119 GVIDSLAAVL---ILESYLDRNHA 139 (141) T ss_pred CHHHHHHHHH---HHHHHHHHHHH T ss_conf 1438999999---99999975200 No 86 >TIGR02499 HrpE_YscL_not type III secretion apparatus protein, HrpE/YscL family; InterPro: IPR012842 This entry is related to IPR009335 from INTERPRO, but is broader. IPR009335 from INTERPRO describes HrpE-like proteins, components of bacterial type III secretion systems primarily in bacteria that infect plants. This entry also includes the homologous proteins of animal pathogens, such as YscL of Yersinia pestis. This entry excludes the related protein FliH of the bacterial flagellar apparatus (see IPR000563 from INTERPRO).. Probab=20.36 E-value=64 Score=13.52 Aligned_cols=39 Identities=28% Similarity=0.400 Sum_probs=33.5 Q ss_pred CHHHHHHHHHHHHHHHHHHH-HCHHHHHHHHHHHHHHHHC Q ss_conf 07899999737999997661-0357899999999999960 Q gi|254780932|r 193 SLDQLSLDVLRPMLREWLDD-NLPGIVERLVREEIERIAR 231 (236) Q Consensus 193 tle~lv~emLrPmLk~WLD~-nLP~lVErlVr~EIerv~r 231 (236) .+.++|.+-|+=.|++|+|+ +-+.+.-++||.=+..+.- T Consensus 83 ~~~~~v~~a~~~~l~~~~~qQ~~~~~~~~~lr~~~~~~~~ 122 (183) T TIGR02499 83 RLAELVLQALEQILGETLGQQDESERLVRLLRQLLAAVAN 122 (183) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHC T ss_conf 9999999999998656335899889999999999998744 No 87 >pfam10874 DUF2746 Protein of unknown function (DUF2746). This family of proteins has no known function. Probab=20.25 E-value=64 Score=13.51 Aligned_cols=41 Identities=32% Similarity=0.484 Sum_probs=30.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 0789999973799999766103578999999999999603246 Q gi|254780932|r 193 SLDQLSLDVLRPMLREWLDDNLPGIVERLVREEIERIARGPIR 235 (236) Q Consensus 193 tle~lv~emLrPmLk~WLD~nLP~lVErlVr~EIerv~r~~~r 235 (236) -|++|+ +|.|=++++ .+..+=+|-|.|--+-+|||+--.|| T Consensus 16 DlDel~-e~v~~gf~~-i~rdigglrEel~tER~ERIeGDrrr 56 (57) T pfam10874 16 DLDELV-ETVREGFKE-VRRDIGGLREELNTERRERIEGDRRR 56 (57) T ss_pred HHHHHH-HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 799999-999999999-99998889999979999985121035 No 88 >TIGR01028 S7_S5_E_A ribosomal protein S7; InterPro: IPR005716 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family describes the members from the eukaryotic cytosol and the Archaea of the family that includes ribosomal protein S7 of bacteria and S5 of eukaryotes.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=20.17 E-value=26 Score=16.28 Aligned_cols=12 Identities=42% Similarity=0.764 Sum_probs=5.8 Q ss_pred HHHHHHHHCCCC Q ss_conf 999999960324 Q gi|254780932|r 223 REEIERIARGPI 234 (236) Q Consensus 223 r~EIerv~r~~~ 234 (236) ++|+||||+-.| T Consensus 195 K~E~ERvA~SaR 206 (206) T TIGR01028 195 KEELERVAKSAR 206 (206) T ss_pred CCCCHHHCCCCC T ss_conf 521000101479 No 89 >pfam10065 DUF2303 Uncharacterized conserved protein (DUF2303). Members of this family of hypothetical bacterial proteins have no known function. Probab=20.16 E-value=47 Score=14.45 Aligned_cols=16 Identities=38% Similarity=0.706 Sum_probs=13.0 Q ss_pred HHHHHHHHHHCHHHHH Q ss_conf 9999976610357899 Q gi|254780932|r 204 PMLREWLDDNLPGIVE 219 (236) Q Consensus 204 PmLk~WLD~nLP~lVE 219 (236) =-+.+||.+|.+.||. T Consensus 134 ~~fAefLEd~~~~i~a 149 (272) T pfam10065 134 KEFAEFIEDNAPDIVA 149 (272) T ss_pred HHHHHHHHHCHHHHCC T ss_conf 9999999855465303 No 90 >PRK13866 plasmid partitioning protein RepB; Provisional Probab=20.10 E-value=64 Score=13.48 Aligned_cols=21 Identities=19% Similarity=0.555 Sum_probs=16.0 Q ss_pred HHHHHHHHHHCHHHHHHHHHH Q ss_conf 999997661035789999999 Q gi|254780932|r 204 PMLREWLDDNLPGIVERLVRE 224 (236) Q Consensus 204 PmLk~WLD~nLP~lVErlVr~ 224 (236) |-.-+||.+||+.|.+.-.+. T Consensus 308 ~~F~~fL~~~L~~Ly~eF~~~ 328 (336) T PRK13866 308 PTFGEYVADQLDSLYAQFIET 328 (336) T ss_pred CCHHHHHHHHHHHHHHHHHHH T ss_conf 308999998899999999986 No 91 >COG3437 Response regulator containing a CheY-like receiver domain and an HD-GYP domain [Transcription / Signal transduction mechanisms] Probab=20.02 E-value=65 Score=13.47 Aligned_cols=58 Identities=29% Similarity=0.306 Sum_probs=37.0 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHH--HHHHHHHC Q ss_conf 567899987655556532656530789999973799999766103578999999--99999960 Q gi|254780932|r 170 DVGDQVASSFDQLVKALRESESRSLDQLSLDVLRPMLREWLDDNLPGIVERLVR--EEIERIAR 231 (236) Q Consensus 170 ~t~~~v~~~~~~l~~~~~~~~~~tle~lv~emLrPmLk~WLD~nLP~lVErlVr--~EIerv~r 231 (236) .---+|+.-|+.|..-.....+-|.++....+.+=-=+ |.| |.+||-.|+ .+|..|.+ T Consensus 297 arI~aiADvfDAL~s~r~Yk~~~~~~eA~~~i~~~~Gr-~fd---P~lVeaFi~~~d~i~~i~~ 356 (360) T COG3437 297 ARIVAIADVFDALVSGRPYKEAMSTEEALEIIRAQSGR-LFD---PKLVEAFIQVEDEIIDIAR 356 (360) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCC-CCC---HHHHHHHHHHHHHHHHHHH T ss_conf 77999999999996488777778989999999984378-278---7999999998999999998 No 92 >PRK02318 mannitol-1-phosphate 5-dehydrogenase; Provisional Probab=20.02 E-value=52 Score=14.15 Aligned_cols=70 Identities=17% Similarity=0.369 Sum_probs=33.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHC---CCCCCCHHHHHHHHHH----HHHHHH------HHHHCHHHHHH----HHHHHHHH Q ss_conf 2235567899987655556532---6565307899999737----999997------66103578999----99999999 Q gi|254780932|r 166 LVSSDVGDQVASSFDQLVKALR---ESESRSLDQLSLDVLR----PMLREW------LDDNLPGIVER----LVREEIER 228 (236) Q Consensus 166 l~S~~t~~~v~~~~~~l~~~~~---~~~~~tle~lv~emLr----PmLk~W------LD~nLP~lVEr----lVr~EIer 228 (236) +-.-.++.++.++++-|..+-. .-+...+-.+|+.+|. -+.+.| |+.---.+++| .++-++.| T Consensus 212 L~~lNg~Ha~~Ay~G~l~G~~tI~Ea~~D~~i~~~v~~~~~E~~~~L~~~~~~~~~el~~Y~~~il~RF~Np~l~d~v~R 291 (381) T PRK02318 212 LFTVNTGHAITAYLGYLKGYKTIREAILDPSIRAVVKGALEESGAVLIKRYGFDEDEHAAYIEKILKRFENPYLSDDVER 291 (381) T ss_pred HHHHCHHHHHHHHHHHHCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHE T ss_conf 98735277999999998691359999759999999999999989976202397999999999999997189989987310 Q ss_pred HHCCCCC Q ss_conf 9603246 Q gi|254780932|r 229 IARGPIR 235 (236) Q Consensus 229 v~r~~~r 235 (236) |+|.|-| T Consensus 292 Var~p~r 298 (381) T PRK02318 292 VGRQPLR 298 (381) T ss_pred EECCCHH T ss_conf 2327020 No 93 >PRK01450 rpmC 50S ribosomal protein L29; Provisional Probab=20.01 E-value=63 Score=13.55 Aligned_cols=12 Identities=42% Similarity=0.548 Sum_probs=10.2 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999996 Q gi|254780932|r 219 ERLVREEIERIA 230 (236) Q Consensus 219 ErlVr~EIerv~ 230 (236) =|.||++|-||- T Consensus 48 ~r~VrrdIARIk 59 (71) T PRK01450 48 FSLVKKSIARIK 59 (71) T ss_pred HHHHHHHHHHHH T ss_conf 999999999999 Done!