Query gi|254781224|ref|YP_003065637.1| hypothetical protein CLIBASIA_05655 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 103 No_of_seqs 54 out of 56 Neff 4.0 Searched_HMMs 39220 Date Mon May 30 08:17:06 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781224.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam01442 Apolipoprotein Apoli 97.5 0.006 1.5E-07 36.1 13.6 82 6-87 17-99 (191) 2 pfam01442 Apolipoprotein Apoli 97.4 0.0085 2.2E-07 35.2 14.2 86 5-90 38-124 (191) 3 pfam07798 DUF1640 Protein of u 97.2 0.015 3.8E-07 33.8 13.9 75 21-95 72-171 (177) 4 pfam07798 DUF1640 Protein of u 96.8 0.036 9.2E-07 31.6 14.5 55 21-75 46-107 (177) 5 PRK10884 putative signal trans 88.7 1.6 4.1E-05 22.0 15.1 94 5-98 90-191 (206) 6 pfam07665 consensus 88.3 1.7 4.4E-05 21.8 6.1 31 42-76 10-43 (57) 7 pfam06895 DUF1267 Protein of u 82.9 3.2 8.2E-05 20.2 10.3 35 64-98 36-70 (74) 8 KOG3156 consensus 77.2 5 0.00013 19.1 10.6 26 63-88 178-207 (220) 9 PRK00409 recombination and DNA 74.1 6.1 0.00015 18.6 11.5 70 5-74 525-594 (780) 10 PRK09174 F0F1 ATP synthase sub 72.2 6.7 0.00017 18.4 15.5 69 7-75 86-154 (204) 11 pfam05478 Prominin Prominin. T 64.7 9.7 0.00025 17.4 14.5 73 26-98 347-432 (809) 12 PRK11546 zraP zinc resistance 59.3 12 0.00031 16.9 8.1 17 63-79 89-105 (139) 13 pfam07246 Phlebovirus_NSM Phle 58.8 12 0.00032 16.8 7.4 41 31-71 180-225 (264) 14 pfam03954 Lectin_N Hepatic lec 56.5 14 0.00035 16.6 8.2 43 11-53 58-103 (138) 15 TIGR03545 conserved hypothetic 51.7 16 0.00042 16.1 8.8 53 28-84 214-269 (554) 16 pfam10805 DUF2730 Protein of u 51.5 17 0.00042 16.1 5.6 26 17-42 49-74 (106) 17 cd03195 GST_C_4 GST_C family, 50.8 15 0.00037 16.4 2.7 50 36-85 8-68 (114) 18 pfam04880 NUDE_C NUDE protein, 45.6 21 0.00052 15.6 4.8 10 21-30 8-17 (166) 19 pfam08317 Spc7 Spc7 kinetochor 45.1 21 0.00053 15.5 9.7 38 6-43 181-218 (322) 20 pfam12128 DUF3584 Protein of u 44.0 22 0.00055 15.4 13.6 52 24-75 699-750 (1192) 21 PRK00488 pheS phenylalanyl-tRN 41.9 24 0.0006 15.2 8.0 54 26-79 29-82 (338) 22 pfam04799 Fzo_mitofusin fzo-li 41.1 24 0.00062 15.1 12.2 47 21-67 93-139 (171) 23 COG1561 Uncharacterized stress 41.0 24 0.00062 15.1 10.1 77 19-100 201-277 (290) 24 PRK05759 F0F1 ATP synthase sub 37.6 27 0.0007 14.8 15.4 66 7-72 37-102 (156) 25 TIGR02977 phageshock_pspA phag 37.4 28 0.00071 14.8 6.6 16 60-75 141-156 (222) 26 PRK13454 F0F1 ATP synthase sub 37.0 28 0.00072 14.8 15.5 80 7-86 64-154 (181) 27 CHL00118 atpG ATP synthase CF0 35.8 29 0.00075 14.7 15.4 61 7-67 55-115 (156) 28 pfam11629 Mst1_SARAH C termina 34.9 30 0.00078 14.6 4.0 36 40-75 11-46 (49) 29 pfam09726 Macoilin Transmembra 34.6 31 0.00079 14.5 7.4 14 77-90 574-587 (680) 30 TIGR02941 Sigma_B RNA polymera 29.7 33 0.00084 14.4 1.8 13 89-101 61-73 (256) 31 pfam04971 Lysis_S Lysis protei 28.5 39 0.001 13.9 2.7 21 80-100 19-42 (68) 32 PRK11820 hypothetical protein; 28.3 40 0.001 13.9 6.0 27 71-97 246-272 (288) 33 pfam02912 Phe_tRNA-synt_N Amin 27.1 42 0.0011 13.8 6.3 54 24-77 12-65 (73) 34 pfam10146 zf-C4H2 Zinc finger- 26.9 42 0.0011 13.8 12.7 46 29-75 35-80 (220) 35 PRK08476 F0F1 ATP synthase sub 26.7 42 0.0011 13.7 14.8 45 8-52 41-85 (141) 36 PRK07353 F0F1 ATP synthase sub 26.7 42 0.0011 13.7 15.3 61 7-67 38-98 (140) 37 pfam07106 TBPIP Tat binding pr 26.5 43 0.0011 13.7 7.9 39 32-70 92-131 (169) 38 pfam03938 OmpH Outer membrane 24.4 47 0.0012 13.5 14.7 76 22-97 38-124 (157) 39 COG4980 GvpP Gas vesicle prote 23.8 48 0.0012 13.4 10.0 71 4-77 31-101 (115) 40 PRK12750 cpxP periplasmic repr 23.5 49 0.0012 13.4 4.9 37 32-68 58-98 (170) 41 pfam04546 Sigma70_ner Sigma-70 23.3 49 0.0013 13.4 4.7 37 5-41 109-145 (210) 42 COG4890 Predicted outer membra 23.1 50 0.0013 13.3 3.3 20 79-98 4-23 (37) 43 pfam10473 Cenp-F_leu_zip Leuci 22.8 50 0.0013 13.3 11.0 69 3-72 19-90 (140) 44 PRK03100 sec-independent trans 22.6 51 0.0013 13.3 6.8 20 46-65 48-67 (131) 45 pfam04484 DUF566 Family of unk 20.6 56 0.0014 13.0 12.2 48 45-93 189-249 (312) No 1 >pfam01442 Apolipoprotein Apolipoprotein A1/A4/E domain. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, and Apolipoprotein E. Probab=97.53 E-value=0.006 Score=36.11 Aligned_cols=82 Identities=17% Similarity=0.265 Sum_probs=36.9 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999-999999988899999999999999999999999999999999999999999999999999999999 Q gi|254781224|r 6 VRQKVQKDSVEIR-FTKLETALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELKDAINSQTKWFMG 84 (103) Q Consensus 6 lr~~v~~~~~e~r-~~~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~ei~~~~~~l~G 84 (103) |+.++.-...+++ -+-.++..++-.+..|+..+|..+......|++.+..++..||+.+.....+|+..++.++-.|.. T Consensus 17 l~~~l~P~t~~~~~~l~~~~~~lr~~l~~dleelr~~l~p~~~el~~~~~~~~eelr~~l~p~~~el~~~~~~~~eeLr~ 96 (191) T pfam01442 17 LQEQLGPYAQEFWAQLSKETEALREELQKDLEEVRARLQPYLDELKAKVGQNLEELRQRLAPYAEELRERLNRDAEELRR 96 (191) T ss_pred HHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99770516899998888999999999985899999987899999999999899999998777899999988865999999 Q ss_pred HHH Q ss_conf 999 Q gi|254781224|r 85 IIV 87 (103) Q Consensus 85 ~~~ 87 (103) .+. T Consensus 97 ~l~ 99 (191) T pfam01442 97 KLA 99 (191) T ss_pred HHH T ss_conf 999 No 2 >pfam01442 Apolipoprotein Apolipoprotein A1/A4/E domain. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, and Apolipoprotein E. Probab=97.41 E-value=0.0085 Score=35.24 Aligned_cols=86 Identities=22% Similarity=0.272 Sum_probs=44.7 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 79999999999999-99999998889999999999999999999999999999999999999999999999999999999 Q gi|254781224|r 5 AVRQKVQKDSVEIR-FTKLETALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELKDAINSQTKWFM 83 (103) Q Consensus 5 ~lr~~v~~~~~e~r-~~~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~ei~~~~~~l~ 83 (103) .||.++.++-.++| -...-...++..+..++..+|+.+..-+..||..++.+...||.-+.....++|+.+..+...+. T Consensus 38 ~lr~~l~~dleelr~~l~p~~~el~~~~~~~~eelr~~l~p~~~el~~~~~~~~eeLr~~l~~~~eelr~~~~~~ve~lr 117 (191) T pfam01442 38 ALREELQKDLEEVRARLQPYLDELKAKVGQNLEELRQRLAPYAEELRERLNRDAEELRRKLAPYAEELRDRLRQNVEALR 117 (191) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999985899999987899999999999899999998777899999988865999999999999999999997799999 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254781224|r 84 GIIVSVL 90 (103) Q Consensus 84 G~~~al~ 90 (103) ..+.+.. T Consensus 118 ~~l~P~~ 124 (191) T pfam01442 118 ARLGPYV 124 (191) T ss_pred HHHHHHH T ss_conf 9977389 No 3 >pfam07798 DUF1640 Protein of unknown function (DUF1640). This family consists of sequences derived from hypothetical eukaryotic proteins. A region approximately 100 residues in length is featured. Probab=97.20 E-value=0.015 Score=33.81 Aligned_cols=75 Identities=19% Similarity=0.346 Sum_probs=35.9 Q ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHHHHHH--- Q ss_conf 9999988899---999999999999999999999999999999999------------------9999999999999--- Q gi|254781224|r 21 KLETALPYLA---TKADLADVRTELKQDIANVRTELKADIADVRTE------------------LACTKSELKDAIN--- 76 (103) Q Consensus 21 ~~e~~~~~l~---~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~E------------------l~~~~~~LR~ei~--- 76 (103) +.++..++.+ ++.|+..|+++++.++..++.+.+-|++.=|+. +-.++..||.++. T Consensus 72 ~~e~~~lr~e~e~L~~dve~L~~~L~~ei~k~~a~~klDlNleK~riree~~~~e~ki~e~~~kId~ei~~lrt~iEs~K 151 (177) T pfam07798 72 KSEFSNLQAENEKLKNDLEKLKNRLRDEIDKVTAGVRLDLNLEKGRIREELAEQELKIKETDTKIDTEIAGLKTQLESVK 151 (177) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 75899999999999999999999999999986152576545552077899987999999999899999999999999999 Q ss_pred -HHHHHHHHHHHHHHHHHHH Q ss_conf -9999999999999999999 Q gi|254781224|r 77 -SQTKWFMGIIVSVLVSTIG 95 (103) Q Consensus 77 -~~~~~l~G~~~al~~a~iG 95 (103) ..++|+.|.+....+-..| T Consensus 152 ~~~ir~l~g~~~~~~a~~la 171 (177) T pfam07798 152 LDTIKYLIGVVTGCLALALA 171 (177) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999 No 4 >pfam07798 DUF1640 Protein of unknown function (DUF1640). This family consists of sequences derived from hypothetical eukaryotic proteins. A region approximately 100 residues in length is featured. Probab=96.78 E-value=0.036 Score=31.60 Aligned_cols=55 Identities=24% Similarity=0.416 Sum_probs=24.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999988899999999999999----9999999999---99999999999999999999999 Q gi|254781224|r 21 KLETALPYLATKADLADVRTEL----KQDIANVRTE---LKADIADVRTELACTKSELKDAI 75 (103) Q Consensus 21 ~~e~~~~~l~~k~di~~LR~El----k~~i~~LR~E---lk~di~~LR~El~~~~~~LR~ei 75 (103) +.+..-.....+++++.||.|| +.++..||.| ++.|+..|++++..++..++.+. T Consensus 46 k~~~e~~~y~~~~~~selr~el~~~~~~e~~~lr~e~e~L~~dve~L~~~L~~ei~k~~a~~ 107 (177) T pfam07798 46 KEQQEKTEYQQKVDFAKLKSELQSLEKSEFSNLQAENEKLKNDLEKLKNRLRDEIDKVTAGV 107 (177) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999877758999999999999999999999999999861525 No 5 >PRK10884 putative signal transduction protein; Provisional Probab=88.73 E-value=1.6 Score=21.98 Aligned_cols=94 Identities=15% Similarity=0.170 Sum_probs=39.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7999999999999999999998889999999999999999---9999999---999999999999999999999999999 Q gi|254781224|r 5 AVRQKVQKDSVEIRFTKLETALPYLATKADLADVRTELKQ---DIANVRT---ELKADIADVRTELACTKSELKDAINSQ 78 (103) Q Consensus 5 ~lr~~v~~~~~e~r~~~~e~~~~~l~~k~di~~LR~Elk~---~i~~LR~---Elk~di~~LR~El~~~~~~LR~ei~~~ 78 (103) .+|..+.++..++...+.+.+.+..+..+..+++.+.+.. .+..|.. +++.+...++.++......+.+.-+.. T Consensus 90 s~r~rl~~Le~ql~~l~~~L~~l~~~~~q~~~el~~~~~~~~~~i~~L~~en~~L~~eL~~~~~~~~~~~~~l~~~~~~~ 169 (206) T PRK10884 90 SLRTRVPDLENQVKTLTDKLANIDNTWNQRTAEMQQKVAQSDSVINGLKEENQKLKNELIVAQKKVDAANLQLDDKQRTI 169 (206) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 67889999999999999999999999999999999999989999999999999999999999999999999998876679 Q ss_pred H--HHHHHHHHHHHHHHHHHHH Q ss_conf 9--9999999999999999999 Q gi|254781224|r 79 T--KWFMGIIVSVLVSTIGILL 98 (103) Q Consensus 79 ~--~~l~G~~~al~~a~iGvv~ 98 (103) . ..+.|..++..=..+|++. T Consensus 170 ~~~wF~~Gg~vl~~GlllGlIl 191 (206) T PRK10884 170 IMQWFMYGGGVLGIGLLLGLVL 191 (206) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999997289999999999987 No 6 >pfam07665 consensus Probab=88.29 E-value=1.7 Score=21.81 Aligned_cols=31 Identities=32% Similarity=0.554 Sum_probs=12.8 Q ss_pred HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999---999999999999999999999999 Q gi|254781224|r 42 LKQDIANVRTE---LKADIADVRTELACTKSELKDAIN 76 (103) Q Consensus 42 lk~~i~~LR~E---lk~di~~LR~El~~~~~~LR~ei~ 76 (103) ++++++.|.++ +|.|+..|..+ ++.|+.+|. T Consensus 10 lk~Dv~~LK~Dmv~vK~Di~~LK~d----V~~Lk~Dvs 43 (57) T pfam07665 10 LKTDVAVLKTDMVEVKSDIKILKED----VAVLKKDVS 43 (57) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHH T ss_conf 9988999999999999999999999----999999899 No 7 >pfam06895 DUF1267 Protein of unknown function (DUF1267). This family consists of several Lactococcus lactis and Lactococcus phage proteins of around 74 residues in length. The function of this family is unknown. Probab=82.95 E-value=3.2 Score=20.24 Aligned_cols=35 Identities=26% Similarity=0.324 Sum_probs=26.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999 Q gi|254781224|r 64 LACTKSELKDAINSQTKWFMGIIVSVLVSTIGILL 98 (103) Q Consensus 64 l~~~~~~LR~ei~~~~~~l~G~~~al~~a~iGvv~ 98 (103) ++.+-.+-=.+|+.|-+|+++.++-+.++.++-+. T Consensus 36 ~aknn~~~i~eik~NqKW~WrtiiG~~iaii~~ll 70 (74) T pfam06895 36 IAKNNEKDIAEIKENQKWVWRTIAGIGVAVILYLL 70 (74) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98844878999987548899999989999999999 No 8 >KOG3156 consensus Probab=77.17 E-value=5 Score=19.12 Aligned_cols=26 Identities=31% Similarity=0.700 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHH---HH-HHHHHHHHH Q ss_conf 999999999999999---99-999999999 Q gi|254781224|r 63 ELACTKSELKDAINS---QT-KWFMGIIVS 88 (103) Q Consensus 63 El~~~~~~LR~ei~~---~~-~~l~G~~~a 88 (103) .+-.+++.||..|.+ ++ .|+.|.+.. T Consensus 178 kId~Ev~~lk~qi~s~K~qt~qw~~g~v~~ 207 (220) T KOG3156 178 KIDQEVTNLKTQIESVKTQTIQWLIGVVTG 207 (220) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 899999999999999899999999999887 No 9 >PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed Probab=74.13 E-value=6.1 Score=18.64 Aligned_cols=70 Identities=11% Similarity=0.104 Sum_probs=53.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7999999999999999999998889999999999999999999999999999999999999999999999 Q gi|254781224|r 5 AVRQKVQKDSVEIRFTKLETALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELKDA 74 (103) Q Consensus 5 ~lr~~v~~~~~e~r~~~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~e 74 (103) ..|.+..+...+++-.+.+...+..++......+..+-+.-....+.|...-+...|.++...+.++|+. T Consensus 525 ~~~~~~e~~~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~~~~~~~ea~~~i~~a~~e~~~~i~~lk~~ 594 (780) T PRK00409 525 ELERELEEKAEEAEKLLKEAEKLKEELEEQKEKLQEREDKLLLEAEEEAQQAIKEAKKEAAEIIKELRQL 594 (780) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999999999999999999999999999999999999999999999999974 No 10 >PRK09174 F0F1 ATP synthase subunit B'; Validated Probab=72.24 E-value=6.7 Score=18.37 Aligned_cols=69 Identities=19% Similarity=0.186 Sum_probs=43.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999988899999999999999999999999999999999999999999999999 Q gi|254781224|r 7 RQKVQKDSVEIRFTKLETALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELKDAI 75 (103) Q Consensus 7 r~~v~~~~~e~r~~~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~ei 75 (103) ++.|..|-.+.+-.+.|.....-+.+..+.+-|.|-+.-+..-|.+.+.++..-|.++...+...-.+- T Consensus 86 ~~~I~~DLe~Ae~lk~EAe~~~a~Ye~~LaeAR~eA~~Ii~~Ar~~a~~~~e~~~a~~ea~L~~kia~A 154 (204) T PRK09174 86 RDRIAQDLDQAARLKQEADAAVAAYEQELAQARSKAASIAQAAREAAKAKAEAERAEIEASLEKKLKEA 154 (204) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999987999999999999999999999999999999999999999999999999999999999999999 No 11 >pfam05478 Prominin Prominin. The prominins are an emerging family of proteins that among the multispan membrane proteins display a novel topology. Mouse prominin and human prominin (mouse)-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration. Probab=64.73 E-value=9.7 Score=17.45 Aligned_cols=73 Identities=18% Similarity=0.296 Sum_probs=41.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHH Q ss_conf 8889999999999999999999999999999999999999999999999999-------------999999999999999 Q gi|254781224|r 26 LPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELKDAINS-------------QTKWFMGIIVSVLVS 92 (103) Q Consensus 26 ~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~ei~~-------------~~~~l~G~~~al~~a 92 (103) .....++..+.....+++.++...+..++.....++..+.+..+.+..+... .-||+.|.++...+. T Consensus 347 ~ip~~v~~~~~~~i~~~~~~l~~~~~~~~~~a~~i~~~i~~~~s~i~~~t~~~~~~~~~~~~~y~~yR~~~~liv~~lll 426 (809) T pfam05478 347 EIPSKVKNQTSSVVPDVKAALDSLGTDIKSVAEDLPLQVLSVLSQILNNTQSSSNPYLPYVEQYDTYRWYAGLVVCSLVL 426 (809) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999987442689999999899999999872699998899999989886442157899998899999999999999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254781224|r 93 TIGILL 98 (103) Q Consensus 93 ~iGvv~ 98 (103) .|-++. T Consensus 427 LIv~~l 432 (809) T pfam05478 427 LVVLCN 432 (809) T ss_pred HHHHHH T ss_conf 999999 No 12 >PRK11546 zraP zinc resistance protein; Provisional Probab=59.30 E-value=12 Score=16.87 Aligned_cols=17 Identities=18% Similarity=0.323 Sum_probs=8.5 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254781224|r 63 ELACTKSELKDAINSQT 79 (103) Q Consensus 63 El~~~~~~LR~ei~~~~ 79 (103) .+..+++.||+.+-.+. T Consensus 89 avakEis~LR~kl~e~r 105 (139) T PRK11546 89 AVAKEMETLRQSLDEQR 105 (139) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999999999999999 No 13 >pfam07246 Phlebovirus_NSM Phlebovirus nonstructural protein NS-M. This family consists of several Phlebovirus nonstructural NS-M proteins which represent the N-terminal region of the M polyprotein precursor. The function of this family is unknown. Probab=58.81 E-value=12 Score=16.82 Aligned_cols=41 Identities=27% Similarity=0.365 Sum_probs=17.5 Q ss_pred HHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999-----9999999999999999999999999999 Q gi|254781224|r 31 TKADLADVRTELK-----QDIANVRTELKADIADVRTELACTKSEL 71 (103) Q Consensus 31 ~k~di~~LR~Elk-----~~i~~LR~Elk~di~~LR~El~~~~~~L 71 (103) .|.++..+|.|++ -|..-+-.|.-+-..+||-+++.+..++ T Consensus 180 ~Kn~~~K~Rn~~~Rgqe~RDAk~~~~e~MAr~~~l~~~~~~ltde~ 225 (264) T pfam07246 180 KKNEIQKLRNDLKRGQEHRDAKLRVEESMARVKGLKEELKQLTDEL 225 (264) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999762654003441199999999987442788878887999 No 14 >pfam03954 Lectin_N Hepatic lectin, N-terminal domain. Probab=56.53 E-value=14 Score=16.59 Aligned_cols=43 Identities=14% Similarity=0.163 Sum_probs=27.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH Q ss_conf 9999999999999998889999999999999---999999999999 Q gi|254781224|r 11 QKDSVEIRFTKLETALPYLATKADLADVRTE---LKQDIANVRTEL 53 (103) Q Consensus 11 ~~~~~e~r~~~~e~~~~~l~~k~di~~LR~E---lk~~i~~LR~El 53 (103) +|...|++..|..+.-....|++||..|-+- +...+..|-..+ T Consensus 58 sqlq~dL~tLretfsNfts~t~~evqaL~sqGgs~~~kv~sleakl 103 (138) T pfam03954 58 SQLQEELRTLRETFSNFTSSTLAEVQALSTQGGSVGRKMTSLESKL 103 (138) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 9999999999998731439899999999972552888999999999 No 15 >TIGR03545 conserved hypothetical protein TIGR03545. This model represents a relatively rare but broadly distributed uncharacterized protein family, distributed in 1-2 percent of bacterial genomes, all of which have outer membranes. In many of these genomes, it is part of a two-gene pair. Probab=51.67 E-value=16 Score=16.13 Aligned_cols=53 Identities=21% Similarity=0.338 Sum_probs=25.0 Q ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 899999999999999999---999999999999999999999999999999999999999 Q gi|254781224|r 28 YLATKADLADVRTELKQD---IANVRTELKADIADVRTELACTKSELKDAINSQTKWFMG 84 (103) Q Consensus 28 ~l~~k~di~~LR~Elk~~---i~~LR~Elk~di~~LR~El~~~~~~LR~ei~~~~~~l~G 84 (103) .+..+.++..|..|++++ +..++.++..+..++ ++....|+..-+.+..-|.+ T Consensus 214 l~~~k~~~d~lk~e~~adk~~i~~~k~~l~~~~~~l----~~~~~~lk~Apq~D~~~L~~ 269 (554) T TIGR03545 214 LQKAKEEFDKLKKEGKADKQQIISAKNDLQNDLKQL----KAQLAELKKAPQKDLNRLEN 269 (554) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHCCHHHHHHHHH T ss_conf 999999999999999999999999999999859999----99999998533867999997 No 16 >pfam10805 DUF2730 Protein of unknown function (DUF2730). This family of proteins with unknown function appears to be restricted to Gammaproteobacteria. Probab=51.55 E-value=17 Score=16.11 Aligned_cols=26 Identities=35% Similarity=0.515 Sum_probs=15.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999988899999999999999 Q gi|254781224|r 17 IRFTKLETALPYLATKADLADVRTEL 42 (103) Q Consensus 17 ~r~~~~e~~~~~l~~k~di~~LR~El 42 (103) -|....|+....+-+..|+..|+-++ T Consensus 49 ~Rl~~lE~~v~~lPt~~dv~~L~~~i 74 (106) T pfam10805 49 KRLTEMETKLESLPTAKDVHALKIEI 74 (106) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 68999999985399687899999999 No 17 >cd03195 GST_C_4 GST_C family, unknown subfamily 4; composed of uncharacterized proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Probab=50.77 E-value=15 Score=16.41 Aligned_cols=50 Identities=12% Similarity=0.183 Sum_probs=26.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999-----------9999999999999999999999999999 Q gi|254781224|r 36 ADVRTELKQDIANVRTELKADI-----------ADVRTELACTKSELKDAINSQTKWFMGI 85 (103) Q Consensus 36 ~~LR~Elk~~i~~LR~Elk~di-----------~~LR~El~~~~~~LR~ei~~~~~~l~G~ 85 (103) -++-.||+.++..||+|+-.|+ ...|.++.........-+...--+|+|. T Consensus 8 R~vqA~m~S~f~~lR~e~p~~vvf~~~~~~pLs~a~~ad~~rl~~~a~~ll~~g~p~LFG~ 68 (114) T cd03195 8 RQVQAWLRSDLLPIRVERSTEVVFAGAKAEPLSEAAQAAAEKLIAVAEALLPPGAANLFGE 68 (114) T ss_pred HHHHHHHHCCHHHHHHHCCHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 9999999864499885398256752578899999999999999999999874689864253 No 18 >pfam04880 NUDE_C NUDE protein, C-terminal conserved region. This family represents the C-terminal conserved region of the NUDE proteins. NUDE proteins are involved in nuclear migration. Probab=45.56 E-value=21 Score=15.56 Aligned_cols=10 Identities=50% Similarity=0.388 Sum_probs=3.9 Q ss_pred HHHHHHHHHH Q ss_conf 9999988899 Q gi|254781224|r 21 KLETALPYLA 30 (103) Q Consensus 21 ~~e~~~~~l~ 30 (103) |++.++.+-. T Consensus 8 k~N~aiEr~a 17 (166) T pfam04880 8 KLNQALERNA 17 (166) T ss_pred HHHHHHHHHH T ss_conf 9999999988 No 19 >pfam08317 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association. Probab=45.08 E-value=21 Score=15.51 Aligned_cols=38 Identities=26% Similarity=0.258 Sum_probs=14.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999888999999999999999 Q gi|254781224|r 6 VRQKVQKDSVEIRFTKLETALPYLATKADLADVRTELK 43 (103) Q Consensus 6 lr~~v~~~~~e~r~~~~e~~~~~l~~k~di~~LR~Elk 43 (103) ++.+-..+..++.-.++--..+...-+.++..+|+++. T Consensus 181 l~~~~~~L~~e~~~l~~~~~e~~~~D~~eL~~~r~eL~ 218 (322) T pfam08317 181 LRKKLQALKEEIANLRQLADELNLCDPLELEKARQELR 218 (322) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 99999999999999997365552179999999999999 No 20 >pfam12128 DUF3584 Protein of unknown function (DUF3584). This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. There are two conserved sequence motifs: GKT and YLP. Probab=44.05 E-value=22 Score=15.42 Aligned_cols=52 Identities=23% Similarity=0.324 Sum_probs=19.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9988899999999999999999999999999999999999999999999999 Q gi|254781224|r 24 TALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELKDAI 75 (103) Q Consensus 24 ~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~ei 75 (103) +.....+.+.+...++.+.++....+=++...-+..+.++++...+.....+ T Consensus 699 ~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 750 (1192) T pfam12128 699 QQAFLEELKDDFRELRTERKAKWQVVESDLDAQIAQLSAAIEAARTNAKARL 750 (1192) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999998999999999999899999999999999998899999 No 21 >PRK00488 pheS phenylalanyl-tRNA synthetase subunit alpha; Validated Probab=41.85 E-value=24 Score=15.22 Aligned_cols=54 Identities=17% Similarity=0.228 Sum_probs=36.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 888999999999999999999999999999999999999999999999999999 Q gi|254781224|r 26 LPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELKDAINSQT 79 (103) Q Consensus 26 ~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~ei~~~~ 79 (103) .-++.=|+.++.+-..|+.=-..=|-++...++++++++...+.+-+..++... T Consensus 29 ~~yLGKKG~Lt~llk~Lk~L~~EEKk~~Gk~iNelK~~Ie~~i~~k~~~l~~~~ 82 (338) T PRK00488 29 VKYLGKKGELTELLKGLGKLPPEERKAAGALINEAKQAVQEALNARKAELEAAE 82 (338) T ss_pred HHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999496069999998763699999999999999999999999999999999999 No 22 >pfam04799 Fzo_mitofusin fzo-like conserved region. Family of putative transmembrane GTPase. The fzo protein is a mediator of mitochondrial fusion. This conserved region is also found in the human mitofusin protein. Probab=41.10 E-value=24 Score=15.15 Aligned_cols=47 Identities=19% Similarity=0.247 Sum_probs=30.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999888999999999999999999999999999999999999999 Q gi|254781224|r 21 KLETALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACT 67 (103) Q Consensus 21 ~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~ 67 (103) -+-.+--...+++|++.-..-+.+....-+.||..+|.+|..|++.. T Consensus 93 s~TS~ncshQVqqELs~tfarLc~qVd~~~~~le~ei~~L~~ei~~L 139 (171) T pfam04799 93 SLTSSNCSHQVQQELASTFARLCQQVDVTSKDLEEEIAELTKEIQRL 139 (171) T ss_pred HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 77605806899999999999999998689999999999999999999 No 23 >COG1561 Uncharacterized stress-induced protein [Function unknown] Probab=41.03 E-value=24 Score=15.14 Aligned_cols=77 Identities=21% Similarity=0.207 Sum_probs=37.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999998889999999999999999999999999999999999999999999999999999999999999999999999 Q gi|254781224|r 19 FTKLETALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELKDAINSQTKWFMGIIVSVLVSTIGILL 98 (103) Q Consensus 19 ~~~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~ei~~~~~~l~G~~~al~~a~iGvv~ 98 (103) -.+.+.....++.|+||.+==+-+++-+...|.-++..- -+..---=|-|+|+.+.|.|-...++.-+...|+=. T Consensus 201 ~~rl~qE~al~a~K~DI~EEldRL~sHv~~~~~iL~~~g-----~vGRkLDFl~QE~nREaNTl~SKS~~~~it~~~vEl 275 (290) T COG1561 201 EDRLEQEVALLAQKADIAEELDRLKSHVKEFRNILEKGG-----PVGRKLDFLMQEFNREANTLGSKSNAAEITAAVVEL 275 (290) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-----CCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 678999999998871089899999999999999986188-----610468999999857887662122369999999999 Q ss_pred HH Q ss_conf 96 Q gi|254781224|r 99 KL 100 (103) Q Consensus 99 ~l 100 (103) |. T Consensus 276 K~ 277 (290) T COG1561 276 KV 277 (290) T ss_pred HH T ss_conf 99 No 24 >PRK05759 F0F1 ATP synthase subunit B; Validated Probab=37.63 E-value=27 Score=14.82 Aligned_cols=66 Identities=17% Similarity=0.221 Sum_probs=33.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999988899999999999999999999999999999999999999999999 Q gi|254781224|r 7 RQKVQKDSVEIRFTKLETALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELK 72 (103) Q Consensus 7 r~~v~~~~~e~r~~~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR 72 (103) +++|...-.+.+-.+.+...+.-+.+..+...+.|.+.-+...|.+........+.+...+...+. T Consensus 37 ~~~I~~~l~~Ae~~~~~a~~~~~e~~~~l~~a~~ea~~ii~~a~~~a~~~~~~~~~~a~~e~~~~~ 102 (156) T PRK05759 37 QKKIADGLAAAERAKKELEAAQAKAEEQLAEARAEAAEIIEQAKKRAAQIIEEAKAEAEAEAARIK 102 (156) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999998999999999999999999999999999999999999999999999999999999999999 No 25 >TIGR02977 phageshock_pspA phage shock protein A; InterPro: IPR014319 Members of this protein are the phage shock protein PspA, from the phage shock operon. PspA appears to maintain the proton motive force under stress conditions that include over expression of certain phage secretins, heat shock, ethanol and protein export defects.. Probab=37.42 E-value=28 Score=14.81 Aligned_cols=16 Identities=13% Similarity=0.466 Sum_probs=5.7 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999999 Q gi|254781224|r 60 VRTELACTKSELKDAI 75 (103) Q Consensus 60 LR~El~~~~~~LR~ei 75 (103) +|..-++.+..+|..+ T Consensus 141 ~R~~~A~~R~~~r~~~ 156 (222) T TIGR02977 141 IREQAASSRLDVRRQL 156 (222) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999988 No 26 >PRK13454 F0F1 ATP synthase subunit B'; Provisional Probab=37.00 E-value=28 Score=14.77 Aligned_cols=80 Identities=20% Similarity=0.168 Sum_probs=37.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999888999999999999999999999999999-----------99999999999999999999 Q gi|254781224|r 7 RQKVQKDSVEIRFTKLETALPYLATKADLADVRTELKQDIANVRTELKA-----------DIADVRTELACTKSELKDAI 75 (103) Q Consensus 7 r~~v~~~~~e~r~~~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~-----------di~~LR~El~~~~~~LR~ei 75 (103) ++.|..|-.+..-.+.|......+....+.+=|.|-+.=+..-|.+.+. ++.....+-...+...|+.- T Consensus 64 ~~~I~~dL~~Ae~~k~eAe~~~~~ye~~L~~Ar~eA~~ii~eAr~~a~~~~~~~~~~a~~el~~~i~~AE~~I~~~K~~A 143 (181) T PRK13454 64 QGTITNDLAAAEELKQKAVEAEKAYNKALADARAEAQRIVAETRAEIQAELDVAIAKADAEIAAKAAESEKRIAEIRAGA 143 (181) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999889999999999999999999999999999999999999999999999999999999999999999999999988 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q gi|254781224|r 76 NSQTKWFMGII 86 (103) Q Consensus 76 ~~~~~~l~G~~ 86 (103) -++.+.+-..+ T Consensus 144 m~~i~~iA~d~ 154 (181) T PRK13454 144 LESVEEVAKDT 154 (181) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 27 >CHL00118 atpG ATP synthase CF0 B' subunit; Validated Probab=35.79 E-value=29 Score=14.65 Aligned_cols=61 Identities=10% Similarity=0.049 Sum_probs=33.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999888999999999999999999999999999999999999999 Q gi|254781224|r 7 RQKVQKDSVEIRFTKLETALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACT 67 (103) Q Consensus 7 r~~v~~~~~e~r~~~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~ 67 (103) ++.|..+..+..-.+.+......+....+.+-|.|-+.-+...|.+-+........+-+.+ T Consensus 55 ~~~I~~~l~~Ae~~~~ea~~~~~eye~~L~~Ar~Ea~~ii~~A~~~a~~~~~~i~~~A~~e 115 (156) T CHL00118 55 EEYIRSNLAKASSKLAKANELTAQYEEQLSKARKEAQLLIAQSQKEAQEIVQEELKQAQKN 115 (156) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999988999999999999999999999999999999999999999999999999999999 No 28 >pfam11629 Mst1_SARAH C terminal SARAH domain of Mst1. This family of proteins represents the C terminal SARAH domain of Mst1. SARAH controls apoptosis and cell cycle arrest via the Ras, RASSF, MST pathway. The Mst1 SARAH domain interacts with Rassf1 and Rassf5 by forming a heterodimer which mediates the apoptosis process. Probab=34.86 E-value=30 Score=14.56 Aligned_cols=36 Identities=28% Similarity=0.537 Sum_probs=21.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999999999 Q gi|254781224|r 40 TELKQDIANVRTELKADIADVRTELACTKSELKDAI 75 (103) Q Consensus 40 ~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~ei 75 (103) .|+..-.++|-.+|..+|..||.--+.-+.-+=++| T Consensus 11 eeL~~rl~sLD~~ME~Eieelr~RYq~KRqPILdAi 46 (49) T pfam11629 11 EELEQRLASLDPEMETEIEELRQRYQAKRQPILDAI 46 (49) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 999999986678899999999999887424399997 No 29 >pfam09726 Macoilin Transmembrane protein. This entry is a highly conserved protein present in eukaryotes. Probab=34.55 E-value=31 Score=14.53 Aligned_cols=14 Identities=14% Similarity=0.268 Sum_probs=7.0 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999999999 Q gi|254781224|r 77 SQTKWFMGIIVSVL 90 (103) Q Consensus 77 ~~~~~l~G~~~al~ 90 (103) .++-.|.+++.+.+ T Consensus 574 ~d~E~LmsAL~amQ 587 (680) T pfam09726 574 KETEVLMSALQAMQ 587 (680) T ss_pred CCHHHHHHHHHHHH T ss_conf 53999999999988 No 30 >TIGR02941 Sigma_B RNA polymerase sigma-B factor; InterPro: IPR014288 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This entry represents the sigma factor, sigmaB. It is restricted to certain species within the order Bacillales including Staphylococcus aureus , Listeria monocytogenes and Bacillus cereus (strain ATCC 14579 / DSM 31) .. Probab=29.75 E-value=33 Score=14.37 Aligned_cols=13 Identities=31% Similarity=0.496 Sum_probs=10.4 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999965 Q gi|254781224|r 89 VLVSTIGILLKLS 101 (103) Q Consensus 89 l~~a~iGvv~~l~ 101 (103) .|+||+|+++|+- T Consensus 61 vQVGM~GLlgAir 73 (256) T TIGR02941 61 VQVGMVGLLGAIR 73 (256) T ss_pred HHHHHHHHHHHHH T ss_conf 5676899999986 No 31 >pfam04971 Lysis_S Lysis protein S. The lysis S protein is a cytotoxic protein forming holes in membranes causing cell lysis. The action of Lysis S is independent of the proportion of acidic phospholipids in the membrane. Probab=28.48 E-value=39 Score=13.93 Aligned_cols=21 Identities=29% Similarity=0.632 Sum_probs=11.1 Q ss_pred HHHHHHH---HHHHHHHHHHHHHH Q ss_conf 9999999---99999999999996 Q gi|254781224|r 80 KWFMGII---VSVLVSTIGILLKL 100 (103) Q Consensus 80 ~~l~G~~---~al~~a~iGvv~~l 100 (103) -|++..+ .+-+++.|||+.+| T Consensus 19 yw~~qlLd~~sp~qWaAIGVlgsl 42 (68) T pfam04971 19 YWFLQLLDKVSPSQWAAIGVLGSL 42 (68) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 999999803899999999999999 No 32 >PRK11820 hypothetical protein; Provisional Probab=28.30 E-value=40 Score=13.91 Aligned_cols=27 Identities=7% Similarity=0.020 Sum_probs=13.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999 Q gi|254781224|r 71 LKDAINSQTKWFMGIIVSVLVSTIGIL 97 (103) Q Consensus 71 LR~ei~~~~~~l~G~~~al~~a~iGvv 97 (103) |=|+|+.++|.+-.-.....+....|- T Consensus 246 l~QEm~RE~NTigsK~~~~~i~~~vv~ 272 (288) T PRK11820 246 LMQELNREANTLGSKSNDAEITNAVVE 272 (288) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 999875899988880386999999999 No 33 >pfam02912 Phe_tRNA-synt_N Aminoacyl tRNA synthetase class II, N-terminal domain. Probab=27.12 E-value=42 Score=13.78 Aligned_cols=54 Identities=20% Similarity=0.260 Sum_probs=33.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 998889999999999999999999999999999999999999999999999999 Q gi|254781224|r 24 TALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELKDAINS 77 (103) Q Consensus 24 ~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~ei~~ 77 (103) +..-++.=|+.++.+...|+.=-..=|.++...++.+++++...+..-+..+.. T Consensus 12 ~r~~~lGKkG~l~~~~k~l~~l~~eekk~~G~~iN~~K~~i~~~~~~k~~~l~~ 65 (73) T pfam02912 12 IRVKYLGKKGPLTELLKGLGKLSPEERPKVGALINEAKEAVEEALEEKKAALEE 65 (73) T ss_pred HHHHHHCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999992751599999997706999999998999999999999999999999999 No 34 >pfam10146 zf-C4H2 Zinc finger-containing protein. This is a family of proteins which appears to have a highly conserved zinc finger domain at the C terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members are annotated as being tumour-associated antigen HCA127 in humans but this could not confirmed. Probab=26.89 E-value=42 Score=13.76 Aligned_cols=46 Identities=20% Similarity=0.328 Sum_probs=30.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999 Q gi|254781224|r 29 LATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELKDAI 75 (103) Q Consensus 29 l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~ei 75 (103) .+++.+|..|-+|=-+-...||+ +.+||+.+-..+++.-++..... T Consensus 35 ~Ey~~em~~L~qEKmaHvEELRq-IhaDIN~mE~iIKqse~e~~~~~ 80 (220) T pfam10146 35 KEYNKEMEELLMEKMSHVEELRQ-IHADINDMENIIKQSESELEKRK 80 (220) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999989999999999-97119999999998899999999 No 35 >PRK08476 F0F1 ATP synthase subunit B'; Validated Probab=26.74 E-value=42 Score=13.74 Aligned_cols=45 Identities=13% Similarity=0.104 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999888999999999999999999999999 Q gi|254781224|r 8 QKVQKDSVEIRFTKLETALPYLATKADLADVRTELKQDIANVRTE 52 (103) Q Consensus 8 ~~v~~~~~e~r~~~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~E 52 (103) +.|..+-.+..-.+.+...+.-+.+.-+.+-|.|-..-+...+.+ T Consensus 41 ~~I~~~l~~A~~~~~ea~~l~~e~e~~L~~Ar~ea~~ii~~A~~~ 85 (141) T PRK08476 41 ASIKNDLEKVKTNSSESVEINAEIEAILKNAREEANKIRQEAIAA 85 (141) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999988999999899999999999999999999999999999999 No 36 >PRK07353 F0F1 ATP synthase subunit B'; Validated Probab=26.68 E-value=42 Score=13.74 Aligned_cols=61 Identities=18% Similarity=0.145 Sum_probs=31.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999888999999999999999999999999999999999999999 Q gi|254781224|r 7 RQKVQKDSVEIRFTKLETALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACT 67 (103) Q Consensus 7 r~~v~~~~~e~r~~~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~ 67 (103) ++.|..+-.+..-.+.+...+..+.+..+.+-|.|-+.-+..-|.+-+........+.+.+ T Consensus 38 ~~~I~~~l~~A~~~~~ea~~~~~e~e~~l~~Ar~ea~~ii~~A~~~a~~~~~~~~~~A~~e 98 (140) T PRK07353 38 EDYIRTNRAEAKEKLAEAEKLEAQYEQQLASARKQAQQVIAEAEAEADKLYAEALAEAQAE 98 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999888999999999999999999999999999999999999999999999999999999 No 37 >pfam07106 TBPIP Tat binding protein 1(TBP-1)-interacting protein (TBPIP). This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation. Probab=26.49 E-value=43 Score=13.72 Aligned_cols=39 Identities=23% Similarity=0.401 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999-99999999999999999999 Q gi|254781224|r 32 KADLADVRTELKQDIANVR-TELKADIADVRTELACTKSE 70 (103) Q Consensus 32 k~di~~LR~Elk~~i~~LR-~Elk~di~~LR~El~~~~~~ 70 (103) +.++..|+.|++.=...+- .|+...+..|+.|+...... T Consensus 92 ~~~~~~l~~el~~L~s~~t~eel~~~i~~L~~e~~~l~~k 131 (169) T pfam07106 92 KQDCSTLEIELKSLTSDLTTEELQEEIQELKKEVREIEEK 131 (169) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999875899999999999999999999999 No 38 >pfam03938 OmpH Outer membrane protein (OmpH-like). This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery. Probab=24.41 E-value=47 Score=13.49 Aligned_cols=76 Identities=11% Similarity=0.193 Sum_probs=29.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999988899999999999999999999999-----------999999999999999999999999999999999999999 Q gi|254781224|r 22 LETALPYLATKADLADVRTELKQDIANVRT-----------ELKADIADVRTELACTKSELKDAINSQTKWFMGIIVSVL 90 (103) Q Consensus 22 ~e~~~~~l~~k~di~~LR~Elk~~i~~LR~-----------Elk~di~~LR~El~~~~~~LR~ei~~~~~~l~G~~~al~ 90 (103) .++...+...+.++..++.+++.....|.. +...++.....+++.....+.+++......+...+.... T Consensus 38 ~~l~~~~~~~~~~l~~~~~~l~~~~~~l~~~~~~ls~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~i~~~i 117 (157) T pfam03938 38 KQLEKEFKKLQAELQKKQKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYDKI 117 (157) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999999999876516999999999999999999999999999999999999999999999 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254781224|r 91 VSTIGIL 97 (103) Q Consensus 91 ~a~iGvv 97 (103) ...+.-+ T Consensus 118 ~~~i~~i 124 (157) T pfam03938 118 DKAIKEV 124 (157) T ss_pred HHHHHHH T ss_conf 9999999 No 39 >COG4980 GvpP Gas vesicle protein [General function prediction only] Probab=23.84 E-value=48 Score=13.42 Aligned_cols=71 Identities=14% Similarity=0.202 Sum_probs=31.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 67999999999999999999998889999999999999999999999999999999999999999999999999 Q gi|254781224|r 4 TAVRQKVQKDSVEIRFTKLETALPYLATKADLADVRTELKQDIANVRTELKADIADVRTELACTKSELKDAINS 77 (103) Q Consensus 4 ~~lr~~v~~~~~e~r~~~~e~~~~~l~~k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR~ei~~ 77 (103) -.+|+.++.-.-.+..+..+.....+.... .+-.+......+.-++++.-+.+.+-+++.+...|.+++.. T Consensus 31 kelR~~~K~~~~~~~~~ae~~~~~~~~~a~---~~s~~~a~~~~~~~~~ik~~v~~~~e~~q~~~~~l~~ei~~ 101 (115) T COG4980 31 KELRKKLKKSGDALFELAEDKGTDILMIAD---KLSKESAETLKDQGGEIKESVKKWKEDIQPEIERLKSEIED 101 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 899999999898769999998778999999---98688999999844999999998576316568999988787 No 40 >PRK12750 cpxP periplasmic repressor CpxP; Reviewed Probab=23.47 E-value=49 Score=13.38 Aligned_cols=37 Identities=14% Similarity=0.212 Sum_probs=15.0 Q ss_pred HHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999----999999999999999999999 Q gi|254781224|r 32 KADLADVRTELKQDIA----NVRTELKADIADVRTELACTK 68 (103) Q Consensus 32 k~di~~LR~Elk~~i~----~LR~Elk~di~~LR~El~~~~ 68 (103) |+.+-+||.+-+++.- +-|....++....|.++++++ T Consensus 58 q~qlK~lre~~R~emk~~~~~~r~~~~ae~~A~r~qmq~Li 98 (170) T PRK12750 58 KEQLKEMREANRAEMKAKFSGNREAKHAERKAHHAKVQALL 98 (170) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99999999998987200022217777999999999999887 No 41 >pfam04546 Sigma70_ner Sigma-70, non-essential region. The domain is found in the primary vegetative sigma factor. The function of this domain is unclear and can be removed without loss of function. Probab=23.33 E-value=49 Score=13.36 Aligned_cols=37 Identities=11% Similarity=0.121 Sum_probs=18.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7999999999999999999998889999999999999 Q gi|254781224|r 5 AVRQKVQKDSVEIRFTKLETALPYLATKADLADVRTE 41 (103) Q Consensus 5 ~lr~~v~~~~~e~r~~~~e~~~~~l~~k~di~~LR~E 41 (103) +.++++++.++.|+|+---+..+-..++.-+..+|.- T Consensus 109 k~~~~l~e~f~~~Kl~pk~~d~L~~~lR~~v~~IR~~ 145 (210) T pfam04546 109 KAREALAEEFKQFRLVPKQFDRLVDNLRGMVDRVRAQ 145 (210) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999986176899999999999999999999 No 42 >COG4890 Predicted outer membrane lipoprotein [Function unknown] Probab=23.11 E-value=50 Score=13.34 Aligned_cols=20 Identities=20% Similarity=0.602 Sum_probs=14.9 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999 Q gi|254781224|r 79 TKWFMGIIVSVLVSTIGILL 98 (103) Q Consensus 79 ~~~l~G~~~al~~a~iGvv~ 98 (103) +.|++|...+-.++.|..+. T Consensus 4 FaWiLG~lLAcAFgiinAlw 23 (37) T COG4890 4 FAWILGLLLACAFGIINALW 23 (37) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 99999889999999999999 No 43 >pfam10473 Cenp-F_leu_zip Leucine-rich repeats of kinetochore protein Cenp-F/LEK1. Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance. Probab=22.85 E-value=50 Score=13.31 Aligned_cols=69 Identities=17% Similarity=0.333 Sum_probs=38.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH--HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 66799999999999999999999--888999-99999999999999999999999999999999999999999 Q gi|254781224|r 3 KTAVRQKVQKDSVEIRFTKLETA--LPYLAT-KADLADVRTELKQDIANVRTELKADIADVRTELACTKSELK 72 (103) Q Consensus 3 ~~~lr~~v~~~~~e~r~~~~e~~--~~~l~~-k~di~~LR~Elk~~i~~LR~Elk~di~~LR~El~~~~~~LR 72 (103) .-.|.|+|-.+.-|+.....+-. ++-+++ |+++..|-+++..-...|| .+..++..+|.+..+...+|. T Consensus 19 ~d~L~d~ie~LErELe~seen~e~~ileaEnskaEvetLk~eie~m~~~L~-~Le~EL~~~rsEkenL~keLq 90 (140) T pfam10473 19 ADLLKDKVENLERELEMSEENQELVILEAENSKAEVETLKTKIEEMAESLK-DLELDLVTVRSEKENLTKQLQ 90 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH T ss_conf 688998999999999877861999999988668789999999999998889-889989989988998999999 No 44 >PRK03100 sec-independent translocase; Provisional Probab=22.60 E-value=51 Score=13.28 Aligned_cols=20 Identities=25% Similarity=0.537 Sum_probs=9.0 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999 Q gi|254781224|r 46 IANVRTELKADIADVRTELA 65 (103) Q Consensus 46 i~~LR~Elk~di~~LR~El~ 65 (103) -.+||.|+..++.+||.-+. T Consensus 48 ~~~l~~elGpef~dlrk~l~ 67 (131) T PRK03100 48 TSQLREDLGPEFDDLRKPLG 67 (131) T ss_pred HHHHHHHHCCCHHHHHHHHH T ss_conf 99999987711999998899 No 45 >pfam04484 DUF566 Family of unknown function (DUF566). Family of related proteins that is plant specific. Probab=20.62 E-value=56 Score=13.03 Aligned_cols=48 Identities=25% Similarity=0.414 Sum_probs=20.8 Q ss_pred HHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHH Q ss_conf 999999999---9999999999999999999999----------999999999999999999 Q gi|254781224|r 45 DIANVRTEL---KADIADVRTELACTKSELKDAI----------NSQTKWFMGIIVSVLVST 93 (103) Q Consensus 45 ~i~~LR~El---k~di~~LR~El~~~~~~LR~ei----------~~~~~~l~G~~~al~~a~ 93 (103) .|..||.-+ +-++.-+|+++| +...|+..| +.....|.|++-+|..++ T Consensus 189 ~is~Lr~sV~~kRi~lq~l~q~~K-L~~il~~Qm~~Le~W~~lE~~hs~sL~~a~eaL~ast 249 (312) T pfam04484 189 SISELRNSVAQKRIELQRLRQELK-LNSILKSQMSYLEEWSDLERDHSSSLSGAIEALKAST 249 (312) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999999-9999999887899999999887999999999999763 Done!