Query psy4837
Match_columns 100
No_of_seqs 107 out of 420
Neff 4.0
Searched_HMMs 46136
Date Fri Aug 16 19:01:03 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4837.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4837hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF04442 CtaG_Cox11: Cytochrom 100.0 8E-42 1.7E-46 251.9 4.8 86 3-88 53-143 (152)
2 PTZ00128 cytochrome c oxidase 100.0 1.4E-40 3E-45 259.1 8.3 87 3-89 124-215 (232)
3 PRK05089 cytochrome C oxidase 100.0 1.4E-39 3E-44 246.9 8.1 77 3-79 80-161 (188)
4 COG3175 COX11 Cytochrome oxida 100.0 4.2E-39 9.1E-44 245.2 7.1 87 3-89 79-170 (195)
5 KOG2540|consensus 100.0 5.3E-34 1.1E-38 223.9 5.8 88 4-91 150-242 (269)
6 PF06030 DUF916: Bacterial pro 84.0 2.8 6.2E-05 29.5 4.8 69 10-79 20-106 (121)
7 PF14874 PapD-like: Flagellar- 82.3 4.1 8.9E-05 26.4 4.8 55 12-74 15-73 (102)
8 PF07610 DUF1573: Protein of u 81.1 4.1 9E-05 23.8 4.0 36 23-71 2-41 (45)
9 PF10633 NPCBM_assoc: NPCBM-as 81.0 0.93 2E-05 28.7 1.3 62 13-80 1-64 (78)
10 TIGR02756 TraK_Ftype type-F co 80.7 2.6 5.7E-05 32.7 3.9 47 17-71 182-228 (232)
11 PRK15218 fimbrial chaperone pr 66.4 19 0.00041 27.9 5.5 67 12-82 27-102 (226)
12 PRK15192 fimbrial chaperone Bc 66.2 19 0.00042 28.1 5.6 66 12-83 31-109 (234)
13 PRK15224 pili assembly chapero 63.6 22 0.00047 27.9 5.4 66 12-83 37-108 (237)
14 PRK15246 fimbrial assembly cha 63.5 22 0.00048 27.6 5.4 64 13-82 20-95 (233)
15 PF13473 Cupredoxin_1: Cupredo 63.2 24 0.00053 23.1 4.9 45 8-71 34-78 (104)
16 PRK15295 fimbrial assembly cha 62.0 24 0.00051 27.2 5.3 67 12-82 28-100 (226)
17 PF06586 TraK: TraK protein; 61.9 12 0.00026 28.2 3.6 47 17-71 186-232 (234)
18 PF07705 CARDB: CARDB; InterP 60.9 16 0.00036 22.6 3.6 55 9-71 11-65 (101)
19 PRK15233 putative fimbrial cha 60.1 27 0.00059 27.7 5.5 65 12-82 49-119 (246)
20 PRK15253 putative fimbrial ass 58.2 29 0.00063 27.2 5.3 67 12-82 42-117 (242)
21 PF07177 Neuralized: Neuralize 57.1 5.7 0.00012 25.6 1.0 17 55-71 31-47 (69)
22 PRK15211 fimbrial chaperone pr 57.0 32 0.0007 26.7 5.3 66 12-82 31-102 (229)
23 PRK11385 putativi pili assembl 52.4 41 0.00088 26.3 5.2 72 12-83 35-114 (236)
24 PF11611 DUF4352: Domain of un 51.9 27 0.00058 22.8 3.6 64 16-80 35-105 (123)
25 PF00207 A2M: Alpha-2-macroglo 51.9 33 0.00071 22.2 4.0 26 12-37 65-90 (92)
26 PF06205 GT36_AF: Glycosyltran 51.8 21 0.00045 23.7 3.1 19 7-25 64-82 (90)
27 PF01345 DUF11: Domain of unkn 50.8 35 0.00077 21.1 3.9 35 9-43 33-67 (76)
28 PRK13736 conjugal transfer pro 49.5 33 0.00072 27.0 4.4 47 17-71 185-233 (245)
29 PRK09926 putative chaperone pr 49.4 45 0.00098 25.9 5.1 64 13-82 35-110 (246)
30 TIGR03079 CH4_NH3mon_ox_B meth 47.4 57 0.0012 28.1 5.7 55 15-71 280-348 (399)
31 PF14646 MYCBPAP: MYCBP-associ 46.6 68 0.0015 26.6 5.9 64 8-71 238-305 (426)
32 PF02752 Arrestin_C: Arrestin 44.9 52 0.0011 21.2 4.2 29 9-37 12-40 (136)
33 PF04744 Monooxygenase_B: Mono 44.4 47 0.001 28.4 4.7 57 14-71 260-329 (381)
34 PRK15249 fimbrial chaperone pr 43.4 66 0.0014 25.2 5.1 65 14-83 39-115 (253)
35 smart00588 NEUZ domain in neur 41.8 25 0.00054 24.8 2.4 17 55-71 32-48 (123)
36 PF09624 DUF2393: Protein of u 39.2 1.3E+02 0.0029 21.0 6.4 59 13-71 58-127 (149)
37 PF14310 Fn3-like: Fibronectin 39.0 52 0.0011 20.4 3.3 19 9-27 25-43 (71)
38 TIGR02656 cyanin_plasto plasto 39.0 1.1E+02 0.0024 20.0 5.5 56 9-71 17-72 (99)
39 PRK02710 plastocyanin; Provisi 37.5 1E+02 0.0022 21.0 4.9 46 9-71 47-92 (119)
40 PRK15274 putative periplasmic 32.8 1.2E+02 0.0025 24.2 5.1 65 12-82 35-109 (257)
41 TIGR01451 B_ant_repeat conserv 32.8 1.1E+02 0.0023 18.4 3.9 35 10-44 5-39 (53)
42 PF08308 PEGA: PEGA domain; I 32.4 90 0.002 18.9 3.6 22 5-26 47-68 (71)
43 PF00362 Integrin_beta: Integr 32.0 77 0.0017 26.9 4.2 37 2-38 72-108 (426)
44 PF00127 Copper-bind: Copper b 30.3 1.4E+02 0.0031 19.4 4.5 56 8-71 16-72 (99)
45 TIGR03096 nitroso_cyanin nitro 29.8 80 0.0017 23.1 3.5 44 9-71 61-104 (135)
46 KOG3960|consensus 29.6 12 0.00026 30.8 -1.0 9 86-94 149-157 (284)
47 PF02018 CBM_4_9: Carbohydrate 29.5 1.4E+02 0.003 19.0 4.2 25 9-33 54-78 (131)
48 PRK11055 galM galactose-1-epim 29.5 1.1E+02 0.0024 24.8 4.5 44 8-62 265-308 (342)
49 cd06184 flavohem_like_fad_nad_ 29.4 1.5E+02 0.0032 21.8 4.9 24 2-25 24-49 (247)
50 PF14415 DUF4424: Domain of un 29.0 87 0.0019 24.9 3.8 23 17-39 1-23 (253)
51 PRK15195 fimbrial chaperone pr 28.7 1.6E+02 0.0036 22.7 5.2 63 14-82 36-105 (229)
52 smart00391 MBD Methyl-CpG bind 28.1 25 0.00055 23.0 0.6 16 49-64 56-71 (77)
53 PF11325 DUF3127: Domain of un 27.4 61 0.0013 22.0 2.3 30 12-56 52-81 (84)
54 PRK15188 fimbrial chaperone pr 26.1 2E+02 0.0043 22.4 5.3 65 12-82 36-107 (228)
55 PF06650 DUF1162: Protein of u 26.0 2.1E+02 0.0045 21.4 5.2 22 55-76 252-275 (277)
56 PF00345 PapD_N: Pili and flag 25.2 2E+02 0.0042 19.1 4.5 71 8-81 6-81 (122)
57 PF00963 Cohesin: Cohesin doma 25.2 2.3E+02 0.0049 19.3 5.1 36 9-44 6-41 (141)
58 PF01429 MBD: Methyl-CpG bindi 25.1 31 0.00067 22.2 0.5 18 48-65 59-76 (77)
59 KOG0360|consensus 24.9 36 0.00078 30.3 1.0 29 51-79 235-265 (545)
60 PRK13202 ureB urease subunit b 24.7 1E+02 0.0022 22.0 3.1 67 5-71 7-80 (104)
61 cd05711 Ig_FcalphaRI Immunoglo 24.4 1.2E+02 0.0027 19.5 3.3 14 37-50 58-71 (94)
62 PF04314 DUF461: Protein of un 24.4 1.5E+02 0.0033 20.0 3.9 53 17-71 14-73 (110)
63 TIGR02636 galM_Leloir galactos 24.3 1.4E+02 0.003 23.9 4.2 46 8-64 260-305 (335)
64 PRK00415 rps27e 30S ribosomal 24.0 30 0.00065 22.3 0.3 25 47-71 6-31 (59)
65 smart00187 INB Integrin beta s 23.7 1.2E+02 0.0026 26.2 3.9 34 2-35 69-102 (423)
66 KOG0121|consensus 23.7 29 0.00062 26.3 0.2 13 49-61 72-85 (153)
67 PF00635 Motile_Sperm: MSP (Ma 23.4 1.5E+02 0.0032 18.9 3.5 61 2-71 2-63 (109)
68 PTZ00485 aldolase 1-epimerase; 23.1 94 0.002 26.0 3.1 48 7-64 292-339 (376)
69 PF13815 Dzip-like_N: Iguana/D 23.1 18 0.00038 25.1 -1.0 40 56-99 1-46 (118)
70 PF06280 DUF1034: Fn3-like dom 21.2 1.5E+02 0.0033 19.6 3.3 26 4-29 57-82 (112)
71 PF07883 Cupin_2: Cupin domain 20.9 1.9E+02 0.004 16.7 4.3 35 4-38 32-70 (71)
72 PRK13203 ureB urease subunit b 20.7 2.5E+02 0.0054 19.9 4.4 66 5-71 7-79 (102)
73 COG1470 Predicted membrane pro 20.6 2.6E+02 0.0056 25.0 5.4 64 7-71 274-337 (513)
74 PF12690 BsuPI: Intracellular 20.5 1.3E+02 0.0029 19.6 2.9 49 19-71 2-64 (82)
No 1
>PF04442 CtaG_Cox11: Cytochrome c oxidase assembly protein CtaG/Cox11; InterPro: IPR007533 Cytochrome c oxidase assembly protein is essential for the assembly of functional cytochrome oxidase protein. In eukaryotes it is an integral protein of the mitochondrial inner membrane. Cox11 is essential for the insertion of Cu(I) ions to form the CuB site. This is essential for the stability of other structures in subunit I, for example haems a and a3, and the magnesium/manganese centre. Cox11 is probably only required in sub-stoichiometric amounts relative to the structural units []. The C-terminal region of the protein is known to form a dimer. Each monomer coordinates one Cu(I) ion via three conserved cysteine residues (111, 208 and 210) in Saccharomyces cerevisiae (P19516 from SWISSPROT). Met 224 is also thought to play a role in copper transfer or stabilising the copper site [].; GO: 0005507 copper ion binding; PDB: 1SO9_A 1SP0_A.
Probab=100.00 E-value=8e-42 Score=251.94 Aligned_cols=86 Identities=45% Similarity=0.785 Sum_probs=64.5
Q ss_pred eeceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec-----cCCCCC
Q psy4837 3 TTHRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-----FAPKKP 77 (100)
Q Consensus 3 ~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-----vDP~i~ 77 (100)
..+|+|++|.|||||+++++|+|+|++|++++|||+|||+|++||.||||||||||+||+|+|||+++| |||+|+
T Consensus 53 ~F~P~q~~v~V~pGe~~~~~y~a~N~s~~~i~g~A~~nV~P~~a~~YF~KieCFCF~eQ~L~pgE~~~mPv~F~IDp~i~ 132 (152)
T PF04442_consen 53 EFKPEQRSVKVHPGETALVFYEATNPSDKPITGQAIPNVTPGEAGKYFNKIECFCFEEQTLAPGETVDMPVVFYIDPDIP 132 (152)
T ss_dssp EEE-S-SEEEEETT--EEEEEEEEE-SSS-EE---EEEE-SSS-STTECCS-TTS-S--EE-TT-EEEEEEEEEE-GGGG
T ss_pred EEEeeeeeEEeCCCCEEEEEEEEECCCCCcEEEEEeeeECHHHhhhhccccceEeccCcCcCCCCeEEEEEEEEECCccc
Confidence 568999999999999999999999999999999999999999999999999999999999999999999 999999
Q ss_pred CCCchhhhccc
Q psy4837 78 HHNEHKMRSQI 88 (100)
Q Consensus 78 ~d~~~~~~~~~ 88 (100)
+|.++++++.|
T Consensus 133 ~d~~~~~v~~i 143 (152)
T PF04442_consen 133 EDPDMKDVKTI 143 (152)
T ss_dssp SSTTTTT--BE
T ss_pred CCcccCCcCEE
Confidence 99999999987
No 2
>PTZ00128 cytochrome c oxidase assembly protein-like; Provisional
Probab=100.00 E-value=1.4e-40 Score=259.07 Aligned_cols=87 Identities=48% Similarity=0.872 Sum_probs=84.8
Q ss_pred eeceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec-----cCCCCC
Q psy4837 3 TTHRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-----FAPKKP 77 (100)
Q Consensus 3 ~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-----vDP~i~ 77 (100)
..+|+|++|.|||||+++++|+|+|++|++++|||+|||+|++||.||||||||||+||+|+|||+++| |||+|+
T Consensus 124 ~F~P~q~~v~V~pGE~~lv~Y~a~N~sd~~i~G~A~ynV~P~~Ag~YFnKieCFCF~eQ~L~pgE~~~MPV~F~IDP~i~ 203 (232)
T PTZ00128 124 EFEPLQKEVEVLPGETALAFYRAKNRSDKPVIGVATYHIAPPEAGLYFNKIQCFCFEEQRLNPHEEVDMPVFFYIDPDIL 203 (232)
T ss_pred eEEeeeeEEEEcCCCeEEEEEEEECCCCCcEEEEEecccCHHHHhhhccceeeecccccccCCCCeEecCEEEEECCCCC
Confidence 568999999999999999999999999999999999999999999999999999999999999999999 999999
Q ss_pred CCCchhhhcccc
Q psy4837 78 HHNEHKMRSQIL 89 (100)
Q Consensus 78 ~d~~~~~~~~~~ 89 (100)
+|.++++++.|-
T Consensus 204 ~D~~~~~v~~IT 215 (232)
T PTZ00128 204 NDPRLKWVDEIT 215 (232)
T ss_pred CCcccCCcCEEE
Confidence 999999999874
No 3
>PRK05089 cytochrome C oxidase assembly protein; Provisional
Probab=100.00 E-value=1.4e-39 Score=246.89 Aligned_cols=77 Identities=42% Similarity=0.747 Sum_probs=75.9
Q ss_pred eeceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec-----cCCCCC
Q psy4837 3 TTHRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-----FAPKKP 77 (100)
Q Consensus 3 ~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-----vDP~i~ 77 (100)
..+|+|++|+|||||+++++|+|+|++|++++|||+|||+|++||.||||||||||+||+|+|||+++| |||+|+
T Consensus 80 ~F~P~q~~v~V~pGE~~~~~y~a~N~sd~~i~g~A~~nV~P~~a~~YF~KieCFCF~eQ~L~pgE~~~mPV~F~IDP~i~ 159 (188)
T PRK05089 80 EFKPEQRSVDVHPGELNLVFYEAENLSDRPIVGQAIPSVTPGQAGAYFNKIECFCFTQQTLQPGETREMPVVFYVDPDLP 159 (188)
T ss_pred eEEeeeeEEEEcCCCeEEEEEEEECCCCCcEEEEEecccCHHHHhhhccceeeecccCcccCCCCeEecCEEEEECCCcc
Confidence 568999999999999999999999999999999999999999999999999999999999999999999 999999
Q ss_pred CC
Q psy4837 78 HH 79 (100)
Q Consensus 78 ~d 79 (100)
+|
T Consensus 160 ~d 161 (188)
T PRK05089 160 KD 161 (188)
T ss_pred cc
Confidence 99
No 4
>COG3175 COX11 Cytochrome oxidase assembly factor [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=4.2e-39 Score=245.23 Aligned_cols=87 Identities=43% Similarity=0.712 Sum_probs=84.5
Q ss_pred eeceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec-----cCCCCC
Q psy4837 3 TTHRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-----FAPKKP 77 (100)
Q Consensus 3 ~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-----vDP~i~ 77 (100)
...|.|++|.|||||+++++|+|+|++|++++|||+|||+|++||+||||+|||||+||+|+|||+++| ||||+.
T Consensus 79 ~F~p~q~~v~v~pGet~~~~y~a~N~sd~~itg~A~~nv~P~~Ag~YF~KveCFCFteq~L~pgE~vemPV~FfVDpd~~ 158 (195)
T COG3175 79 RFRPVQREVYVRPGETNLIFYEAENLSDKPITGQATYNVAPGQAGAYFNKVECFCFTEQTLKPGETVEMPVVFFVDPDFA 158 (195)
T ss_pred eeEecCceeEeccCceEEEEEEEecCCCCCceeEEecccChhHhhhheeeeeEEEeeecccCCCCeEeccEEEEECcccc
Confidence 357999999999999999999999999999999999999999999999999999999999999999999 999999
Q ss_pred CCCchhhhcccc
Q psy4837 78 HHNEHKMRSQIL 89 (100)
Q Consensus 78 ~d~~~~~~~~~~ 89 (100)
+|++++|++.|.
T Consensus 159 ~dPe~kdvk~iT 170 (195)
T COG3175 159 DDPEMKDVKTIT 170 (195)
T ss_pred cCcccCCCCeEE
Confidence 999999999874
No 5
>KOG2540|consensus
Probab=100.00 E-value=5.3e-34 Score=223.95 Aligned_cols=88 Identities=49% Similarity=0.819 Sum_probs=85.4
Q ss_pred eceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec-----cCCCCCC
Q psy4837 4 THRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-----FAPKKPH 78 (100)
Q Consensus 4 ~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-----vDP~i~~ 78 (100)
..|.|++|.|+|||++++||.|+|.||++++|.|+||++|++||.||||||||||+||.|.|||+++| |||+...
T Consensus 150 f~PqQrEiyV~PGEtALaFYta~N~sdkpIiGvstYni~P~~Aa~YFnKiqCFCFEEQ~L~pgE~vDmPVFFyIDPefa~ 229 (269)
T KOG2540|consen 150 FTPQQREIYVLPGETALAFYTAENPSDKPIIGVSTYNITPGQAAVYFNKIQCFCFEEQKLNPGEQVDMPVFFYIDPEFAT 229 (269)
T ss_pred ccccceEEEEcCCcceeeeEeccCCCCCCceeeEeeccCccHhhhheeceeEEeehhhccCCCcccCcceEEEeCccccc
Confidence 57999999999999999999999999999999999999999999999999999999999999999999 9999999
Q ss_pred CCchhhhcccccc
Q psy4837 79 HNEHKMRSQILPK 91 (100)
Q Consensus 79 d~~~~~~~~~~~~ 91 (100)
|..+++++.||=.
T Consensus 230 DP~m~~id~i~Ls 242 (269)
T KOG2540|consen 230 DPAMDGIDDILLS 242 (269)
T ss_pred CcccccccceEEE
Confidence 9999999998743
No 6
>PF06030 DUF916: Bacterial protein of unknown function (DUF916); InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function.
Probab=83.97 E-value=2.8 Score=29.51 Aligned_cols=69 Identities=20% Similarity=0.223 Sum_probs=45.1
Q ss_pred EEEeecCCeEEEEEEEECCCCCcEEEEEeC-----------cccccchh------hcccceeeeeccccccCCCCeeec-
Q psy4837 10 TLHVYPGETALAFYTAKNPTNSPIVGISTY-----------NVVPFDAG------QYFNKIQCFCFEEQQLNPHEELQI- 71 (100)
Q Consensus 10 ~v~V~pGE~~~v~y~a~N~sd~~v~GqAvp-----------sVsP~~A~------~YF~KieCFCF~eQ~L~pgE~~~M- 71 (100)
.+.+.||+...+...++|.++++++-.+.. .+++.... .-|.++-=.=- +-+|.|||++..
T Consensus 20 dL~~~P~q~~~l~v~i~N~s~~~~tv~v~~~~A~Tn~nG~I~Y~~~~~~~d~sl~~~~~~~v~~~~-~Vtl~~~~sk~V~ 98 (121)
T PF06030_consen 20 DLKVKPGQKQTLEVRITNNSDKEITVKVSANTATTNDNGVIDYSQNNPKKDKSLKYPFSDLVKIPK-EVTLPPNESKTVT 98 (121)
T ss_pred EEEeCCCCEEEEEEEEEeCCCCCEEEEEEEeeeEecCCEEEEECCCCcccCcccCcchHHhccCCc-EEEECCCCEEEEE
Confidence 477999999999999999999998754432 22222211 13333322222 278999999988
Q ss_pred cCCCCCCC
Q psy4837 72 FAPKKPHH 79 (100)
Q Consensus 72 vDP~i~~d 79 (100)
+.=.+|+.
T Consensus 99 ~~i~~P~~ 106 (121)
T PF06030_consen 99 FTIKMPKK 106 (121)
T ss_pred EEEEcCCC
Confidence 65555654
No 7
>PF14874 PapD-like: Flagellar-associated PapD-like
Probab=82.32 E-value=4.1 Score=26.36 Aligned_cols=55 Identities=25% Similarity=0.413 Sum_probs=38.5
Q ss_pred EeecCCeEEEEEEEECCCCCcEEEEEeCccc-ccchhhcccceeeeeccccccCCCCeeec---cCC
Q psy4837 12 HVYPGETALAFYTAKNPTNSPIVGISTYNVV-PFDAGQYFNKIQCFCFEEQQLNPHEELQI---FAP 74 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVs-P~~A~~YF~KieCFCF~eQ~L~pgE~~~M---vDP 74 (100)
.|..|+.+.....++|.+..+.. |.+. |.....+| .+. ..+..|.||++.++ +.|
T Consensus 15 ~v~~g~~~~~~v~l~N~s~~p~~----f~v~~~~~~~~~~---~v~-~~~g~l~PG~~~~~~V~~~~ 73 (102)
T PF14874_consen 15 NVFVGQTYSRTVTLTNTSSIPAR----FRVRQPESLSSFF---SVE-PPSGFLAPGESVELEVTFSP 73 (102)
T ss_pred EEccCCEEEEEEEEEECCCCCEE----EEEEeCCcCCCCE---EEE-CCCCEECCCCEEEEEEEEEe
Confidence 57899999999999999999854 3332 22222222 222 35789999999999 663
No 8
>PF07610 DUF1573: Protein of unknown function (DUF1573); InterPro: IPR011467 These hypothetical proteins from bacteria, such as Rhodopirellula baltica, Bacteroides thetaiotaomicron and Porphyromonas gingivalis, share a region of conserved sequence towards their N termini.
Probab=81.07 E-value=4.1 Score=23.84 Aligned_cols=36 Identities=25% Similarity=0.484 Sum_probs=28.1
Q ss_pred EEEECCCCCcEEEEEeCcccccchhhcccceeeeec----cccccCCCCeeec
Q psy4837 23 YTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCF----EEQQLNPHEELQI 71 (100)
Q Consensus 23 y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF----~eQ~L~pgE~~~M 71 (100)
|.++|.+++++.=..+ +..|=|. ...+|+|||+..+
T Consensus 2 F~~~N~g~~~L~I~~v-------------~tsCgCt~~~~~~~~i~PGes~~i 41 (45)
T PF07610_consen 2 FEFTNTGDSPLVITDV-------------QTSCGCTTAEYSKKPIAPGESGKI 41 (45)
T ss_pred EEEEECCCCcEEEEEe-------------eEccCCEEeeCCcceECCCCEEEE
Confidence 8899999999876554 3567774 5688999998765
No 9
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=80.99 E-value=0.93 Score=28.69 Aligned_cols=62 Identities=23% Similarity=0.318 Sum_probs=29.9
Q ss_pred eecCCeEEEEEEEECCCCCcEEE-EEeCcccccchhhcccceeeeeccccccCCCCeeec-cCCCCCCCC
Q psy4837 13 VYPGETALAFYTAKNPTNSPIVG-ISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI-FAPKKPHHN 80 (100)
Q Consensus 13 V~pGE~~~v~y~a~N~sd~~v~G-qAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M-vDP~i~~d~ 80 (100)
|.||+...+...++|..+.++.. ....++ |.-...-+.--+- ..|.|||++.. +.=..|.|.
T Consensus 1 v~~G~~~~~~~tv~N~g~~~~~~v~~~l~~-P~GW~~~~~~~~~-----~~l~pG~s~~~~~~V~vp~~a 64 (78)
T PF10633_consen 1 VTPGETVTVTLTVTNTGTAPLTNVSLSLSL-PEGWTVSASPASV-----PSLPPGESVTVTFTVTVPADA 64 (78)
T ss_dssp --TTEEEEEEEEEE--SSS-BSS-EEEEE---TTSE---EEEEE-------B-TTSEEEEEEEEEE-TT-
T ss_pred CCCCCEEEEEEEEEECCCCceeeEEEEEeC-CCCccccCCcccc-----ccCCCCCEEEEEEEEECCCCC
Confidence 57999999999999999877543 222333 5322211111111 18999999988 444445543
No 10
>TIGR02756 TraK_Ftype type-F conjugative transfer system secretin TraK. The TraK protein is predicted to interact with the TraV and TraB proteins as part of the scaffold which extends from the inner membrane, through the periplasm to the cell envelope and through which the F-type conjugative pilus passes. TraK is homologous to the P-type IV secretion system protein TrbG, the Ti-type protein VirB9 and the I-type TraN protein. The protein is related to the secretin family especially the HrcC subgroup of the type III secretion system. The protein is hypothesized to oligomerize to form a ring structure akin to other secretins.
Probab=80.70 E-value=2.6 Score=32.69 Aligned_cols=47 Identities=23% Similarity=0.236 Sum_probs=36.6
Q ss_pred CeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837 17 ETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI 71 (100)
Q Consensus 17 E~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M 71 (100)
+..-.-|+++|.+++++.-.--=-.+|+-.|.-|.+ +.|.|||+..+
T Consensus 182 ~l~g~~y~l~N~s~~~~~L~E~~F~~~gv~AVa~~~--------~~L~PGe~t~v 228 (232)
T TIGR02756 182 HLKGERFELENKTNSPLELTESWFWQPGTRAVALSK--------PQLAPGETADL 228 (232)
T ss_pred CcEEEEEEEEcCCCCCeEechHHhCCcCcEEEEecc--------CccCCCCEEEE
Confidence 455678999999999998876666677666665544 79999998765
No 11
>PRK15218 fimbrial chaperone protein PegB; Provisional
Probab=66.37 E-value=19 Score=27.94 Aligned_cols=67 Identities=13% Similarity=0.057 Sum_probs=42.8
Q ss_pred EeecCCeEEEEEEEECCCCCcEEEEEeCcc-----cccchhhcccceeeeeccccccCCCCeeec----cCCCCCCCCch
Q psy4837 12 HVYPGETALAFYTAKNPTNSPIVGISTYNV-----VPFDAGQYFNKIQCFCFEEQQLNPHEELQI----FAPKKPHHNEH 82 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsV-----sP~~A~~YF~KieCFCF~eQ~L~pgE~~~M----vDP~i~~d~~~ 82 (100)
-|.+|....+...+.|.++.+...|+--.- +|.....-|-=+ =-=..|+||+...+ ....||+|+|.
T Consensus 27 vIy~~~~~~~si~i~N~~~~pyLvQsWvd~~~~~~~~~~~~~pFivt----PPlfRl~p~~~~~lRI~~~~~~LP~DRES 102 (226)
T PRK15218 27 IIYPAQKKDITVQLMNDGKRSSLIQAWIDNGDTSLPPEKLQVPFIMT----PPVIRVAANSGQQLKIKKLANNLPGDRES 102 (226)
T ss_pred EEEcCCCcEEEEEEEcCCCCcEEEEEEEeCCCCCCCcccccCCEEEC----CCeEEECCCCceEEEEEECCCCCCcceeE
Confidence 467788889999999999999999984321 111110111100 01246889887777 66778888765
No 12
>PRK15192 fimbrial chaperone BcfG; Provisional
Probab=66.15 E-value=19 Score=28.14 Aligned_cols=66 Identities=11% Similarity=0.012 Sum_probs=42.8
Q ss_pred EeecCCeEEEEEEEECCCCCcEEEEEeCc-------ccccchhhcccceeeeecc--ccccCCCCeeec----cCCCCCC
Q psy4837 12 HVYPGETALAFYTAKNPTNSPIVGISTYN-------VVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAPKKPH 78 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvps-------VsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP~i~~ 78 (100)
-+.+|....+...+.|.++.+...|+--. -.+..... =|=-+ -..|+||++..+ ....||+
T Consensus 31 vIy~~~~k~~sv~l~N~~~~p~LvQswv~~~~~w~~~~~~~~~~------PFivtPPlfrl~p~~~~~lRI~~~~~~LP~ 104 (234)
T PRK15192 31 FIYHAGAPALSVPVSNHSEASWLIDTHILPGGRWPGTKNEGNIT------PFVVTPPLFMLSARQENSMRVVYTGAPLPA 104 (234)
T ss_pred EEEcCCCceEEEEEEeCCCCcEEEEEEeccCccccccCCccccC------CEEEcCCeEEECCCCceEEEEEECCCCCCC
Confidence 46677888999999999999999998421 11101000 02111 246789988777 7777888
Q ss_pred CCchh
Q psy4837 79 HNEHK 83 (100)
Q Consensus 79 d~~~~ 83 (100)
|+|.-
T Consensus 105 DRESl 109 (234)
T PRK15192 105 DRESL 109 (234)
T ss_pred cceEE
Confidence 87653
No 13
>PRK15224 pili assembly chaperone protein SafB; Provisional
Probab=63.62 E-value=22 Score=27.93 Aligned_cols=66 Identities=23% Similarity=0.234 Sum_probs=43.1
Q ss_pred EeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeecc--ccccCCCCeeec----cCCCCCCCCchh
Q psy4837 12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAPKKPHHNEHK 83 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP~i~~d~~~~ 83 (100)
-+.+|....+...++|.++.+...|+--.=.-+.. +. =|=-+ -..|+||+...+ ....||.|+|.=
T Consensus 37 vIy~~~~k~~sl~v~N~~~~pyLvQsWvd~~~~~~-----~~-pFivtPPlfRlep~~~~~lRI~~~~~~LP~DRESl 108 (237)
T PRK15224 37 VIYHAGTAGATLSVSNPQNYPILVQSSVKAADKSS-----PA-PFLVMPPLFRLEANQQSQLRIVRTGGDMPTDRETL 108 (237)
T ss_pred EEEeCCCcEEEEEEEcCCCCcEEEEEEEeCCCCCc-----cC-CEEECCCeEEECCCCceEEEEEECCCCCCCceeEE
Confidence 45677778899999999999999998443111110 00 01111 246889987777 666789887753
No 14
>PRK15246 fimbrial assembly chaperone StbE; Provisional
Probab=63.51 E-value=22 Score=27.63 Aligned_cols=64 Identities=11% Similarity=0.129 Sum_probs=39.5
Q ss_pred eecCCeEEEEEEEECCCCCcEEEEEeCc-----ccccchhhcccceeeeec--cccccCCCCeeec----c-CCCCCCCC
Q psy4837 13 VYPGETALAFYTAKNPTNSPIVGISTYN-----VVPFDAGQYFNKIQCFCF--EEQQLNPHEELQI----F-APKKPHHN 80 (100)
Q Consensus 13 V~pGE~~~v~y~a~N~sd~~v~GqAvps-----VsP~~A~~YF~KieCFCF--~eQ~L~pgE~~~M----v-DP~i~~d~ 80 (100)
+.++....+...++|.++.+...|+--. ..|.....- |=- --..|+||++..+ . +..||+|+
T Consensus 20 I~~~~~~~~sv~l~N~~~~p~LvQsWvd~~~~~~~p~~~~~p------FivtPPlfrl~~~~~~~lRI~~~~~~~LP~DR 93 (233)
T PRK15246 20 IFASDDVAQSLTLSNDNTTPMLLQVWTDAGNIDASPDNSKTP------LVALPPVFKMQPGELRTLRLLLSSRQQLATDR 93 (233)
T ss_pred EEcCCCceEEEEEEeCCCCcEEEEEEEeCCCCccCcccccCc------EEECCcceEECCCCceEEEEEECCCCCCCCCc
Confidence 4556678899999999999999998221 111110000 111 1246788887776 5 35788887
Q ss_pred ch
Q psy4837 81 EH 82 (100)
Q Consensus 81 ~~ 82 (100)
|.
T Consensus 94 ES 95 (233)
T PRK15246 94 ES 95 (233)
T ss_pred eE
Confidence 65
No 15
>PF13473 Cupredoxin_1: Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=63.19 E-value=24 Score=23.15 Aligned_cols=45 Identities=18% Similarity=0.203 Sum_probs=26.0
Q ss_pred ceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837 8 LLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI 71 (100)
Q Consensus 8 q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M 71 (100)
-..++|+.|+.+++.+ +|..+.+ ..-.++. + .....|.||++..+
T Consensus 34 P~~i~v~~G~~v~l~~--~N~~~~~-h~~~i~~---------------~-~~~~~l~~g~~~~~ 78 (104)
T PF13473_consen 34 PSTITVKAGQPVTLTF--TNNDSRP-HEFVIPD---------------L-GISKVLPPGETATV 78 (104)
T ss_dssp S-EEEEETTCEEEEEE--EE-SSS--EEEEEGG---------------G-TEEEEE-TT-EEEE
T ss_pred cCEEEEcCCCeEEEEE--EECCCCc-EEEEECC---------------C-ceEEEECCCCEEEE
Confidence 3589999999888664 6776665 3333333 1 12267888887766
No 16
>PRK15295 fimbrial assembly chaperone SthB; Provisional
Probab=62.02 E-value=24 Score=27.22 Aligned_cols=67 Identities=18% Similarity=0.160 Sum_probs=41.0
Q ss_pred EeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeec--cccccCCCCeeec----cCCCCCCCCch
Q psy4837 12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCF--EEQQLNPHEELQI----FAPKKPHHNEH 82 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF--~eQ~L~pgE~~~M----vDP~i~~d~~~ 82 (100)
-+.+|+...+...+.|.++.+...|+--.-..... .+-.=|=- --.+|+||++-.+ ..+.||.|.|.
T Consensus 28 vI~~~~~~~~si~i~N~~~~p~LvQsWv~~~~~~~----~~~~pFivtPPl~rl~p~~~q~lRI~~~~~~LP~DrEs 100 (226)
T PRK15295 28 LVFDGNNDESSINVENKDSKANLVQSWLSVVDPQV----TNKQAFIITPPLFRLDAGQKNSIRVIRSGAPLPADRES 100 (226)
T ss_pred EEEeCCCceeEEEEEeCCCCcEEEEEEEeCCCCCC----CCCCCEEEcCCeEEECCCCceEEEEEECCCCCCCCceE
Confidence 35666778899999999999999987431110000 00000111 1235788987777 66778888765
No 17
>PF06586 TraK: TraK protein; InterPro: IPR010563 This family consists of several TraK proteins from Escherichia coli, Salmonella typhi and Salmonella typhimurium. TraK is known to be essential for pilus assembly but its exact role in this process is unknown [].
Probab=61.91 E-value=12 Score=28.18 Aligned_cols=47 Identities=15% Similarity=0.081 Sum_probs=34.1
Q ss_pred CeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837 17 ETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI 71 (100)
Q Consensus 17 E~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M 71 (100)
...-.-|+++|.++.++.-.--==.+|+..|..| +...|.|||+.+.
T Consensus 186 ~l~~~~y~v~N~~~~~v~l~E~~f~~~~v~AVa~--------~~~~L~PGe~t~v 232 (234)
T PF06586_consen 186 GLRGEVYRVTNTSDQPVELDERDFYSPGVRAVAL--------WPPTLAPGESTEV 232 (234)
T ss_pred ceEEEEEEEEeCCCCCEEecHHHhCCCCcEEEEe--------cccccCCCCEEEE
Confidence 4556789999999998876554444576666655 4458999998764
No 18
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=60.89 E-value=16 Score=22.63 Aligned_cols=55 Identities=20% Similarity=0.231 Sum_probs=30.5
Q ss_pred eEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837 9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI 71 (100)
Q Consensus 9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M 71 (100)
..-.+.+|+..++.+.++|.......+.-+ . .|...-..==..-..|+|||+...
T Consensus 11 ~~~~~~~g~~~~i~~~V~N~G~~~~~~~~v-~-------~~~~~~~~~~~~i~~L~~g~~~~v 65 (101)
T PF07705_consen 11 SPSNVVPGEPVTITVTVKNNGTADAENVTV-R-------LYLDGNSVSTVTIPSLAPGESETV 65 (101)
T ss_dssp C-SEEETTSEEEEEEEEEE-SSS-BEEEEE-E-------EEETTEEEEEEEESEB-TTEEEEE
T ss_pred CCCcccCCCEEEEEEEEEECCCCCCCCEEE-E-------EEECCceeccEEECCcCCCcEEEE
Confidence 345678999999999999998876433221 1 122211110011167888888776
No 19
>PRK15233 putative fimbrial chaperone protein SefB; Provisional
Probab=60.13 E-value=27 Score=27.66 Aligned_cols=65 Identities=11% Similarity=0.083 Sum_probs=41.7
Q ss_pred EeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeec--cccccCCCCeeec----cCCCCCCCCch
Q psy4837 12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCF--EEQQLNPHEELQI----FAPKKPHHNEH 82 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF--~eQ~L~pgE~~~M----vDP~i~~d~~~ 82 (100)
-+.+|....+...+.|.++.+...|+--.=..... +. =|== --..|+||+.-.+ ....||+|+|.
T Consensus 49 vIy~~~~~~~sl~i~N~~~~p~LvQsWvd~~~~~~-----~~-pFiVtPPLfRLep~~~~~lRIi~~~~~LP~DRES 119 (246)
T PRK15233 49 VIYKEDAPSTSFWIMNEKEYPILVQTQVYNDDKSS-----KA-PFIVTPPILKVESNARTRLKVIPTSNLFNKNEES 119 (246)
T ss_pred EEEeCCCcEEEEEEEcCCCCcEEEEEEEecCCCCc-----cC-CEEECCCeEEECCCCceEEEEEECCCCCCcCceE
Confidence 45677778999999999999999998432111110 00 0110 1235788887777 67778888665
No 20
>PRK15253 putative fimbrial assembly chaperone protein StcB; Provisional
Probab=58.16 E-value=29 Score=27.23 Aligned_cols=67 Identities=16% Similarity=0.112 Sum_probs=42.1
Q ss_pred EeecCCeEEEEEEEECCCCCcEEEEEeCcc-----cccchhhcccceeeeeccccccCCCCeeec----cCCCCCCCCch
Q psy4837 12 HVYPGETALAFYTAKNPTNSPIVGISTYNV-----VPFDAGQYFNKIQCFCFEEQQLNPHEELQI----FAPKKPHHNEH 82 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsV-----sP~~A~~YF~KieCFCF~eQ~L~pgE~~~M----vDP~i~~d~~~ 82 (100)
-|.+|....+.+.+.|.++.+...|+--.= +|......|-=+ =--..|+||+...+ ....||+|+|.
T Consensus 42 vIy~~~~k~~sv~i~N~~~~pyLvQsWvd~~~~~~~~~~~~~pFivt----PPlfRl~p~~~~~lRI~~~~~~LP~DRES 117 (242)
T PRK15253 42 VIYPAEKKEVVVQLVNQGEQASLVQSWIDDGNTSLPPEKIQVPFMLT----PPVARVAAESGQQIKIKKMPNSLPDNKES 117 (242)
T ss_pred EEEeCCCceEEEEEEcCCCCcEEEEEEEECCCCCCCcccccCCEEEC----CCeEEECCCCceEEEEEECCCCCCcceeE
Confidence 456777788999999999999999984421 111110111100 01246888887777 55568888765
No 21
>PF07177 Neuralized: Neuralized; InterPro: IPR006573 NEUZ is a domain of unknown function found in neuralized proteins, i.e. proteins involved in the specification of the neuroblast during cellular differentiation. ; PDB: 2YUE_A 2E63_A.
Probab=57.13 E-value=5.7 Score=25.59 Aligned_cols=17 Identities=18% Similarity=0.386 Sum_probs=12.5
Q ss_pred eeeccccccCCCCeeec
Q psy4837 55 CFCFEEQQLNPHEELQI 71 (100)
Q Consensus 55 CFCF~eQ~L~pgE~~~M 71 (100)
+.+|++++|.+||.+++
T Consensus 31 giVFS~rPl~~~E~~~v 47 (69)
T PF07177_consen 31 GIVFSSRPLRIGEKFEV 47 (69)
T ss_dssp -EEEESS-B-TT-EEEE
T ss_pred eEEEecCCccCCCEEEE
Confidence 78999999999999998
No 22
>PRK15211 fimbrial chaperone protein PefD; Provisional
Probab=56.98 E-value=32 Score=26.72 Aligned_cols=66 Identities=12% Similarity=0.113 Sum_probs=42.4
Q ss_pred EeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeecc--ccccCCCCeeec----cCCCCCCCCch
Q psy4837 12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAPKKPHHNEH 82 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP~i~~d~~~ 82 (100)
-+.++....+...++|.++.+...|+--.-....... .=|=-+ -..|+||++..+ .+..||+|+|.
T Consensus 31 vIy~~~~~~~si~i~N~~~~p~LvQswv~~~~~~~~~-----~pFivtPPlfrl~p~~~q~lRI~~~~~~LP~DRES 102 (229)
T PRK15211 31 FIYDEGRKNISFEVTNQADQTYGGQVWIDNTTQGSST-----VYMVPAPPFFKVRPKEKQIIRIMKTDSALPKDRES 102 (229)
T ss_pred EEEcCCCceEEEEEEeCCCCcEEEEEEEecCCCCCcc-----CCEEEcCCeEEECCCCceEEEEEECCCCCCCCceE
Confidence 3566777889999999999999999854322111110 002221 246788987777 66678888765
No 23
>PRK11385 putativi pili assembly chaperone; Provisional
Probab=52.44 E-value=41 Score=26.27 Aligned_cols=72 Identities=14% Similarity=0.015 Sum_probs=42.5
Q ss_pred EeecCCeEEEEEEEECCCCCcEEEEEeCcccccch-hhcccceeeeecc--ccccCCCCeeec----cC-CCCCCCCchh
Q psy4837 12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDA-GQYFNKIQCFCFE--EQQLNPHEELQI----FA-PKKPHHNEHK 83 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A-~~YF~KieCFCF~--eQ~L~pgE~~~M----vD-P~i~~d~~~~ 83 (100)
-+.++....+...+.|.++++...|+--.-.+..+ +.--++..=|=-+ =..|+||++..+ .. ..||+|+|.-
T Consensus 35 vIy~~~~~~~sv~l~N~~~~p~LvQswv~~~~~~~~~~~~~~~~pFivtPPlfrl~p~~~q~lRIi~~~~~~LP~DRESl 114 (236)
T PRK11385 35 FIFPADRESISILLTNTSQESWLINSKINRPTRWAGGEASTVPAPLLAAPPLILLKPGTTGTLRLLRTESDILPVDRETL 114 (236)
T ss_pred EEEcCCCceEEEEEEeCCCCcEEEEEEcccCccccCcccccccCCEEEcCCeEEECCCCceEEEEEECCCCCCCCCceEE
Confidence 35567778899999999999999999443111000 0000000112222 236789998777 54 4789997753
No 24
>PF11611 DUF4352: Domain of unknown function (DUF4352); InterPro: IPR021652 This entry is represented by Bacteriophage A118, Gp32. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry represents a group of putative lipoproteins of unknown function.; PDB: 3CFU_A.
Probab=51.94 E-value=27 Score=22.81 Aligned_cols=64 Identities=13% Similarity=0.133 Sum_probs=30.9
Q ss_pred CCeEEEEEEEECCCCCcEEEEEeCcccc-cchhhcccceeeee-----ccccccCCCCeeec-cCCCCCCCC
Q psy4837 16 GETALAFYTAKNPTNSPIVGISTYNVVP-FDAGQYFNKIQCFC-----FEEQQLNPHEELQI-FAPKKPHHN 80 (100)
Q Consensus 16 GE~~~v~y~a~N~sd~~v~GqAvpsVsP-~~A~~YF~KieCFC-----F~eQ~L~pgE~~~M-vDP~i~~d~ 80 (100)
++...|.+.++|.+++++.-... ...= ...+.-+.--.=.. +..+.|+||++++- +-=++|++.
T Consensus 35 ~~fv~v~v~v~N~~~~~~~~~~~-~f~l~d~~g~~~~~~~~~~~~~~~~~~~~i~pG~~~~g~l~F~vp~~~ 105 (123)
T PF11611_consen 35 NKFVVVDVTVKNNGDEPLDFSPS-DFKLYDSDGNKYDPDFSASSNDNDLFSETIKPGESVTGKLVFEVPKDD 105 (123)
T ss_dssp SEEEEEEEEEEE-SSS-EEEEGG-GEEEE-TT--B--EEE-CCCTTTB--EEEE-TT-EEEEEEEEEESTT-
T ss_pred CEEEEEEEEEEECCCCcEEeccc-ceEEEeCCCCEEcccccchhccccccccEECCCCEEEEEEEEEECCCC
Confidence 56788999999999999864322 1111 11111111111111 55689999999988 333556653
No 25
>PF00207 A2M: Alpha-2-macroglobulin family; InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0004866 endopeptidase inhibitor activity; PDB: 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 2PN5_A 3FRP_G 3HRZ_B ....
Probab=51.88 E-value=33 Score=22.20 Aligned_cols=26 Identities=15% Similarity=-0.055 Sum_probs=20.9
Q ss_pred EeecCCeEEEEEEEECCCCCcEEEEE
Q psy4837 12 HVYPGETALAFYTAKNPTNSPIVGIS 37 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~~v~GqA 37 (100)
.+..||...+...+.|.+++.+....
T Consensus 65 ~l~~GD~~~i~v~v~N~~~~~~~v~V 90 (92)
T PF00207_consen 65 SLRRGDQIQIPVTVFNYTDKDQEVTV 90 (92)
T ss_dssp EEETTSEEEEEEEEEE-SSS-EEEEE
T ss_pred EEecCCEEEEEEEEEeCCCCCEEEEE
Confidence 47899999999999999999887653
No 26
>PF06205 GT36_AF: Glycosyltransferase 36 associated family ; InterPro: IPR010403 This domain is found in the NvdB protein (P20471 from SWISSPROT), which is involved in the production of beta-(1-->2)-glucan.; PDB: 1V7V_A 1V7W_A 1V7X_A 3ACT_B 2CQT_A 3QFY_B 3QFZ_A 2CQS_A 3QG0_B 3AFJ_A ....
Probab=51.78 E-value=21 Score=23.71 Aligned_cols=19 Identities=16% Similarity=0.315 Sum_probs=14.9
Q ss_pred eceEEEeecCCeEEEEEEE
Q psy4837 7 YLLTLHVYPGETALAFYTA 25 (100)
Q Consensus 7 ~q~~v~V~pGE~~~v~y~a 25 (100)
.+..+++.|||...+.|..
T Consensus 64 l~~~v~L~PGe~~~v~f~l 82 (90)
T PF06205_consen 64 LQVRVTLEPGEEKEVVFLL 82 (90)
T ss_dssp EEEEEEE-TT-EEEEEEEE
T ss_pred EEEEEEECCCCEEEEEEEE
Confidence 5778999999999999975
No 27
>PF01345 DUF11: Domain of unknown function DUF11; InterPro: IPR001434 This group of sequences is represented by a conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis (Streptococcus faecalis), and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydia trachomatis outer membrane proteins. In C. trachomatis, three cysteine-rich proteins (also believed to be lipoproteins), MOMP, OMP6 and OMP3, make up the extracellular matrix of the outer membrane []. They are involved in the essential structural integrity of both the elementary body (EB) and recticulate body (RB) phase. They are thought to be involved in porin formation and, as these bacteria lack the peptidoglycan layer common to most Gram-negative microbes, such proteins are highly important in the pathogenicity of the organism.; GO: 0005727 extrachromosomal circular DNA
Probab=50.81 E-value=35 Score=21.08 Aligned_cols=35 Identities=20% Similarity=0.299 Sum_probs=28.7
Q ss_pred eEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccc
Q psy4837 9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVP 43 (100)
Q Consensus 9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP 43 (100)
..-.+.||+.......++|..+.+..+..+-..-|
T Consensus 33 ~~~~~~~Gd~v~ytitvtN~G~~~a~nv~v~D~lp 67 (76)
T PF01345_consen 33 NPSTANPGDTVTYTITVTNTGPAPATNVVVTDTLP 67 (76)
T ss_pred CCCcccCCCEEEEEEEEEECCCCeeEeEEEEEcCC
Confidence 44568999999999999999999988776655444
No 28
>PRK13736 conjugal transfer protein TraK; Provisional
Probab=49.47 E-value=33 Score=27.05 Aligned_cols=47 Identities=19% Similarity=0.150 Sum_probs=35.7
Q ss_pred CeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccc--cccCCCCeeec
Q psy4837 17 ETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEE--QQLNPHEELQI 71 (100)
Q Consensus 17 E~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~e--Q~L~pgE~~~M 71 (100)
+..-+-|+++|.+++++.-.--==.+|+-- -+=|.. +.|.|||+..+
T Consensus 185 ~l~g~~y~l~N~~~~~v~L~E~~F~~~gvr--------AVa~~~~~~~L~PG~~t~v 233 (245)
T PRK13736 185 HLKVVRYRVENPTLSARNLRESDFWQPGTR--------AVMFSQPARQLLAGGRMDV 233 (245)
T ss_pred CcEEEEEEEEcCCCCCeEechHHhCCCCce--------EEEecCCcccCCCCCEEEE
Confidence 466788999999999988765555555433 345677 99999999887
No 29
>PRK09926 putative chaperone protein EcpD; Provisional
Probab=49.40 E-value=45 Score=25.89 Aligned_cols=64 Identities=20% Similarity=0.288 Sum_probs=39.3
Q ss_pred eecCCeEEEEEEEECCCCCcEEEEEeCc-----ccccchhhcccceeeeec--cccccCCCCeeec----cCC-CCCCCC
Q psy4837 13 VYPGETALAFYTAKNPTNSPIVGISTYN-----VVPFDAGQYFNKIQCFCF--EEQQLNPHEELQI----FAP-KKPHHN 80 (100)
Q Consensus 13 V~pGE~~~v~y~a~N~sd~~v~GqAvps-----VsP~~A~~YF~KieCFCF--~eQ~L~pgE~~~M----vDP-~i~~d~ 80 (100)
+.++....+...++|.++++...|+--. .+|... |.. |=- --..|+||++..+ ..+ .+|.|+
T Consensus 35 I~~~~~~~~sv~l~N~~~~p~LvQ~Wvd~~~~~~~p~~~-----~~p-fivtPPl~rl~p~~~q~lRIi~~~~~~lP~Dr 108 (246)
T PRK09926 35 IYKSDQKDVNVRLENKGNNPLLVQSWLDTGDDNAEPGSI-----KVP-FTATPPVSRIDPKRGQTIKLMYTASTALPKDR 108 (246)
T ss_pred EEeCCCceEEEEEEeCCCCcEEEEEEecCCCCccCcccc-----CCC-EEEcCCeEEECCCCccEEEEEeCCCCCCCCCc
Confidence 4456667888899999999999997331 111110 000 111 1246788887776 666 688887
Q ss_pred ch
Q psy4837 81 EH 82 (100)
Q Consensus 81 ~~ 82 (100)
|.
T Consensus 109 ES 110 (246)
T PRK09926 109 ES 110 (246)
T ss_pred eE
Confidence 65
No 30
>TIGR03079 CH4_NH3mon_ox_B methane monooxygenase/ammonia monooxygenase, subunit B. Both ammonia oxidizers such as Nitrosomonas europaea and methanotrophs (obligate methane oxidizers) such as Methylococcus capsulatus each can grow only on their own characteristic substrate. However, both groups have the ability to oxidize both substrates, and so the relevant enzymes must be named here according to their ability to oxidze both. The protein family represented here reflects subunit B of both the particulate methane monooxygenase of methylotrophs and the ammonia monooxygenase of nitrifying bacteria.
Probab=47.42 E-value=57 Score=28.14 Aligned_cols=55 Identities=13% Similarity=0.153 Sum_probs=38.6
Q ss_pred cCCeEEEEEEEECCCCCcEEEE-------------EeCcccccchhhcccceeeeec-cccccCCCCeeec
Q psy4837 15 PGETALAFYTAKNPTNSPIVGI-------------STYNVVPFDAGQYFNKIQCFCF-EEQQLNPHEELQI 71 (100)
Q Consensus 15 pGE~~~v~y~a~N~sd~~v~Gq-------------AvpsVsP~~A~~YF~KieCFCF-~eQ~L~pgE~~~M 71 (100)
||-.-++...++|.+|+++.-- .+|...|..-..+.-+ ---. ..-+++|||+++.
T Consensus 280 PGR~l~~~~~VTN~g~~~vrlgEF~TA~vRFlN~~~v~~~~~~yP~~lla~--GL~v~d~~pI~PGETr~v 348 (399)
T TIGR03079 280 PGRALRVTMEITNNGDQVISIGEFTTAGIRFMNANGVRVLDPDYPRELLAE--GLEVDDQSAIAPGETVEV 348 (399)
T ss_pred CCcEEEEEEEEEcCCCCceEEEeEeecceEeeCcccccccCCCChHHHhhc--cceeCCCCCcCCCcceEE
Confidence 8889999999999999998531 3555555544444444 3333 3447999999987
No 31
>PF14646 MYCBPAP: MYCBP-associated protein family
Probab=46.58 E-value=68 Score=26.62 Aligned_cols=64 Identities=19% Similarity=0.116 Sum_probs=44.6
Q ss_pred ceEEEeecCCeEEEEEE-EECCCCCcEEEEEeCcccccch-hhcccceeeeecccc--ccCCCCeeec
Q psy4837 8 LLTLHVYPGETALAFYT-AKNPTNSPIVGISTYNVVPFDA-GQYFNKIQCFCFEEQ--QLNPHEELQI 71 (100)
Q Consensus 8 q~~v~V~pGE~~~v~y~-a~N~sd~~v~GqAvpsVsP~~A-~~YF~KieCFCF~eQ--~L~pgE~~~M 71 (100)
.-....+|||...=.-. ++|.....+.=+=.-.-.+... ...-...+||=|+.. +|.|||+..+
T Consensus 238 ~l~Fe~~p~e~~~~~v~~l~N~Gt~~I~y~W~~~~~~~~~~~~~~~~~~~F~Fd~~~gvilPGe~~~~ 305 (426)
T PF14646_consen 238 RLTFECHPGERVSKEVVRLENNGTTAIYYSWRRVPFFKNFGSLFRAQDQRFYFDTSSGVILPGETRNF 305 (426)
T ss_pred EEEEEcccCceeeEEEEEEecCCceEEEEEEEecccccccchhccccCCeEEEeCCCCEECCCceEEE
Confidence 34567899999888888 9999888776543332222222 333445888888765 6899999987
No 32
>PF02752 Arrestin_C: Arrestin (or S-antigen), C-terminal domain; InterPro: IPR011022 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ]. The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin. The C-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=44.92 E-value=52 Score=21.24 Aligned_cols=29 Identities=24% Similarity=0.193 Sum_probs=20.6
Q ss_pred eEEEeecCCeEEEEEEEECCCCCcEEEEE
Q psy4837 9 LTLHVYPGETALAFYTAKNPTNSPIVGIS 37 (100)
Q Consensus 9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqA 37 (100)
..-...|||...+...+.|.+++.+.+.-
T Consensus 12 ~~~~~~~Ge~i~v~v~i~n~s~~~i~~I~ 40 (136)
T PF02752_consen 12 PRTAYVPGETIPVNVEIDNQSKKKIKKIK 40 (136)
T ss_dssp S-SEEETT--EEEEEEEEE-SSSEEEEEE
T ss_pred CCCEECCCCEEEEEEEEEECCCCEEEEEE
Confidence 34456799999999999999999887743
No 33
>PF04744 Monooxygenase_B: Monooxygenase subunit B protein; InterPro: IPR006833 Ammonia monooxygenase and the particulate methane monooxygenase are both integral membrane proteins, occurring in ammonia oxidisers and methanotrophs respectively, which are thought to be evolutionarily related []. These enzymes have a relatively wide substrate specificity and can catalyse the oxidation of a range of substrates including ammonia, methane, halogenated hydrocarbons and aromatic molecules []. These enzymes are composed of 3 subunits - A (IPR003393 from INTERPRO), B (IPR006833 from INTERPRO) and C (IPR006980 from INTERPRO) - and contain various metal centres, including copper. Particulate methane monooxygenase from Methylococcus capsulatus str. Bath is an ABC homotrimer, which contains mononuclear and dinuclear copper metal centres, and a third metal centre containing a metal ion whose identity in vivo is not certain[]. The soluble regions of these enzymes derive primarily from the B subunit. This subunit forms two antiparallel beta-barrel-like structures and contains the mono- and di- nuclear copper metal centres [].; PDB: 3CHX_E 3RFR_A 3RGB_A 1YEW_A.
Probab=44.35 E-value=47 Score=28.45 Aligned_cols=57 Identities=19% Similarity=0.198 Sum_probs=30.3
Q ss_pred ecCCeEEEEEEEECCCCCcEEEEE---------eCcccccchh---hcccceeeeec-cccccCCCCeeec
Q psy4837 14 YPGETALAFYTAKNPTNSPIVGIS---------TYNVVPFDAG---QYFNKIQCFCF-EEQQLNPHEELQI 71 (100)
Q Consensus 14 ~pGE~~~v~y~a~N~sd~~v~GqA---------vpsVsP~~A~---~YF~KieCFCF-~eQ~L~pgE~~~M 71 (100)
-||-.-++...++|.+++++.-.- -+.|. .... .++-..+---. ...+++|||++++
T Consensus 260 vpgR~l~~~l~VtN~g~~pv~LgeF~tA~vrFln~~v~-~~~~~~P~~l~A~~gL~vs~~~pI~PGETrtl 329 (381)
T PF04744_consen 260 VPGRTLTMTLTVTNNGDSPVRLGEFNTANVRFLNPDVP-TDDPDYPDELLAERGLSVSDNSPIAPGETRTL 329 (381)
T ss_dssp SSSSEEEEEEEEEEESSS-BEEEEEESSS-EEE-TTT--SS-S---TTTEETT-EEES--S-B-TT-EEEE
T ss_pred cCCcEEEEEEEEEcCCCCceEeeeEEeccEEEeCcccc-cCCCCCchhhhccCcceeCCCCCcCCCceEEE
Confidence 489999999999999999987543 11111 1111 12222222233 3459999999998
No 34
>PRK15249 fimbrial chaperone protein StbB; Provisional
Probab=43.36 E-value=66 Score=25.19 Aligned_cols=65 Identities=18% Similarity=0.232 Sum_probs=38.7
Q ss_pred ecCCeEEEEEEEECCCCCcEEEEEe-----Ccccccchhhcccceeeeecc--ccccCCCCeeec----cCC-CCCCCCc
Q psy4837 14 YPGETALAFYTAKNPTNSPIVGIST-----YNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAP-KKPHHNE 81 (100)
Q Consensus 14 ~pGE~~~v~y~a~N~sd~~v~GqAv-----psVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP-~i~~d~~ 81 (100)
.+|....+...++|.++++...|+- -+.+|.... +. =|=-+ -.+|+||++..+ .+. .+|+|.|
T Consensus 39 y~~~~~~~sl~l~N~~~~p~LvQsWv~~~~~~~~p~~~~----~~-pFivtPPlfrl~p~~~q~lRI~~~~~~~lP~DRE 113 (253)
T PRK15249 39 YPSTASSVDVQLKNNDAIPYIVQTWFDDGDMNTSPENSS----AM-PFIATPPVFRIQPKAGQVVRVIYNNTKKLPQDRE 113 (253)
T ss_pred EeCCCcceeEEEEcCCCCcEEEEEEEeCCCCCCCccccc----cC-cEEEcCCeEEecCCCceEEEEEEcCCCCCCCCce
Confidence 3555678888999999999999974 122221110 00 01111 135778887666 553 7888876
Q ss_pred hh
Q psy4837 82 HK 83 (100)
Q Consensus 82 ~~ 83 (100)
.-
T Consensus 114 Sl 115 (253)
T PRK15249 114 SV 115 (253)
T ss_pred EE
Confidence 53
No 35
>smart00588 NEUZ domain in neuralized proteins.
Probab=41.80 E-value=25 Score=24.76 Aligned_cols=17 Identities=18% Similarity=0.423 Sum_probs=15.6
Q ss_pred eeeccccccCCCCeeec
Q psy4837 55 CFCFEEQQLNPHEELQI 71 (100)
Q Consensus 55 CFCF~eQ~L~pgE~~~M 71 (100)
+.||+..+|++||.+.+
T Consensus 32 givFS~rPl~~~E~~~v 48 (123)
T smart00588 32 ALVFSARPLRINELFEV 48 (123)
T ss_pred eEEecCCCCcCCCEEEE
Confidence 57999999999999887
No 36
>PF09624 DUF2393: Protein of unknown function (DUF2393); InterPro: IPR013417 The function of this protein is unknown. It is always found as part of a two-gene operon with IPR013416 from INTERPRO, a protein that appears to span the membrane seven times. It has so far been found in the bacteria Anabaena sp. (strain PCC 7120), Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus.
Probab=39.23 E-value=1.3e+02 Score=20.97 Aligned_cols=59 Identities=20% Similarity=0.115 Sum_probs=44.6
Q ss_pred eecCCeEEEEEEEECCCCCcEEEEE----eCc---ccccchhhcccceeeeec----cccccCCCCeeec
Q psy4837 13 VYPGETALAFYTAKNPTNSPIVGIS----TYN---VVPFDAGQYFNKIQCFCF----EEQQLNPHEELQI 71 (100)
Q Consensus 13 V~pGE~~~v~y~a~N~sd~~v~GqA----vps---VsP~~A~~YF~KieCFCF----~eQ~L~pgE~~~M 71 (100)
+.-++..-+.+.++|.+++++.+-- +++ +....-..|+.|..=|=- -...|.|||+++.
T Consensus 58 l~~~~~~~v~g~V~N~g~~~i~~c~i~~~l~~~~~~~~n~~~~~~~~~~~f~~~~~~i~~~L~~~e~~~f 127 (149)
T PF09624_consen 58 LQYSESFYVDGTVTNTGKFTIKKCKITVKLYNDKQVSGNKFKEIFYQQIPFVKKSIPIADNLKPGESKEF 127 (149)
T ss_pred eeeccEEEEEEEEEECCCCEeeEEEEEEEEEeCCCccCchhhhhhccccchhccceeHHhhcCcccceeE
Confidence 4578999999999999999987643 233 666777888888876411 1334999999999
No 37
>PF14310 Fn3-like: Fibronectin type III-like domain; PDB: 3ABZ_D 3AC0_D 2X40_A 2X41_A 2X42_A.
Probab=38.98 E-value=52 Score=20.38 Aligned_cols=19 Identities=21% Similarity=0.317 Sum_probs=14.0
Q ss_pred eEEEeecCCeEEEEEEEEC
Q psy4837 9 LTLHVYPGETALAFYTAKN 27 (100)
Q Consensus 9 ~~v~V~pGE~~~v~y~a~N 27 (100)
..+.+.|||..++.|.+..
T Consensus 25 ~rv~l~pGes~~v~~~l~~ 43 (71)
T PF14310_consen 25 ERVSLAPGESKTVSFTLPP 43 (71)
T ss_dssp EEEEE-TT-EEEEEEEEEH
T ss_pred EEEEECCCCEEEEEEEECH
Confidence 4677999999999998864
No 38
>TIGR02656 cyanin_plasto plastocyanin. Members of this family are plastocyanin, a blue copper protein related to pseudoazurin, halocyanin, amicyanin, etc. This protein, located in the thylakoid luman, performs electron transport to photosystem I in Cyanobacteria and chloroplasts.
Probab=38.95 E-value=1.1e+02 Score=20.03 Aligned_cols=56 Identities=14% Similarity=0.256 Sum_probs=27.7
Q ss_pred eEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837 9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI 71 (100)
Q Consensus 9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M 71 (100)
..++|++|+++ .|. |....+-.-.+.-.=.|..+......+. -+...|+|||+.++
T Consensus 17 ~~i~v~~G~~V--~~~--N~~~~~H~~~~~~~~~~~~~~~~~~~~~---~~~~~~~pG~t~~~ 72 (99)
T TIGR02656 17 AKISIAAGDTV--EWV--NNKGGPHNVVFDEDAVPAGVKELAKSLS---HKDLLNSPGESYEV 72 (99)
T ss_pred CEEEECCCCEE--EEE--ECCCCCceEEECCCCCccchhhhccccc---ccccccCCCCEEEE
Confidence 57999999975 343 6543332222211112322211111110 13457899999988
No 39
>PRK02710 plastocyanin; Provisional
Probab=37.46 E-value=1e+02 Score=21.04 Aligned_cols=46 Identities=22% Similarity=0.239 Sum_probs=26.8
Q ss_pred eEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837 9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI 71 (100)
Q Consensus 9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M 71 (100)
..++|++|+++ .|. |..+.+-. +..... +=+.++...++|||+.+.
T Consensus 47 ~~i~v~~Gd~V--~~~--N~~~~~H~------v~~~~~-------~~~~~~~~~~~pg~t~~~ 92 (119)
T PRK02710 47 STLTIKAGDTV--KWV--NNKLAPHN------AVFDGA-------KELSHKDLAFAPGESWEE 92 (119)
T ss_pred CEEEEcCCCEE--EEE--ECCCCCce------EEecCC-------ccccccccccCCCCEEEE
Confidence 67999999974 443 55433222 111111 112345677899999887
No 40
>PRK15274 putative periplasmic fimbrial chaperone protein SteC; Provisional
Probab=32.84 E-value=1.2e+02 Score=24.19 Aligned_cols=65 Identities=18% Similarity=0.225 Sum_probs=40.8
Q ss_pred EeecCCeEEEEEEEECCCCC-cEEEEEeCcccccchhhcccceee-eecc--ccccCCCCeeec----cC--CCCCCCCc
Q psy4837 12 HVYPGETALAFYTAKNPTNS-PIVGISTYNVVPFDAGQYFNKIQC-FCFE--EQQLNPHEELQI----FA--PKKPHHNE 81 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~-~v~GqAvpsVsP~~A~~YF~KieC-FCF~--eQ~L~pgE~~~M----vD--P~i~~d~~ 81 (100)
-|.+|....+.+.+.|.++. +...|+--. -+... +... |=-+ -..|+||+.-.+ .+ ..||+|+|
T Consensus 35 vIy~e~~~~~sv~v~N~~~~~p~LVQsWvd--d~~~~----~~~~pFivtPPLfRlep~~~q~lRI~~~~~~~~LP~DRE 108 (257)
T PRK15274 35 VIFNGNENSITVTLKNGNATLPYLAQAWLE--DDKFA----KDTRYFTALPPLQRIEPKSDGQVKVQPLPAAASLPQDRE 108 (257)
T ss_pred EEEeCCCceEEEEEEeCCCCCcEEEEEEcc--CCCCC----cccCCEEEcCCeEEECCCCceEEEEEECCCCCCCCCcee
Confidence 45667778899999999866 899998432 21100 1101 1111 236789987777 53 67999976
Q ss_pred h
Q psy4837 82 H 82 (100)
Q Consensus 82 ~ 82 (100)
.
T Consensus 109 S 109 (257)
T PRK15274 109 S 109 (257)
T ss_pred E
Confidence 4
No 41
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=32.76 E-value=1.1e+02 Score=18.36 Aligned_cols=35 Identities=20% Similarity=0.227 Sum_probs=29.8
Q ss_pred EEEeecCCeEEEEEEEECCCCCcEEEEEeCccccc
Q psy4837 10 TLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPF 44 (100)
Q Consensus 10 ~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~ 44 (100)
.-.+.||+..+-.-.++|..+.+..+..+-..-|.
T Consensus 5 ~~~~~~Gd~v~Yti~v~N~g~~~a~~v~v~D~lP~ 39 (53)
T TIGR01451 5 KTVATIGDTITYTITVTNNGNVPATNVVVTDILPS 39 (53)
T ss_pred ccccCCCCEEEEEEEEEECCCCceEeEEEEEcCCC
Confidence 44678999999999999999999988888777663
No 42
>PF08308 PEGA: PEGA domain; InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=32.41 E-value=90 Score=18.88 Aligned_cols=22 Identities=36% Similarity=0.413 Sum_probs=17.8
Q ss_pred ceeceEEEeecCCeEEEEEEEE
Q psy4837 5 HRYLLTLHVYPGETALAFYTAK 26 (100)
Q Consensus 5 ~p~q~~v~V~pGE~~~v~y~a~ 26 (100)
.+....+.|.+|+...+.+.++
T Consensus 47 ~~~~~~v~v~~~~~~~v~~~L~ 68 (71)
T PF08308_consen 47 EPYTKTVTVKPGETTTVNVTLE 68 (71)
T ss_pred eeEEEEEEECCCCEEEEEEEEE
Confidence 3566789999999999988765
No 43
>PF00362 Integrin_beta: Integrin, beta chain; InterPro: IPR002369 Integrins are the major metazoan receptors for cell adhesion to extracellular matrix proteins and, in vertebrates, also play important roles in certain cell-cell adhesions, make transmembrane connections to the cytoskeleton and activate many intracellular signalling pathways [, ]. The integrin receptors are composed of alpha and beta subunit heterodimers. Each subunit crosses the membrane once, with most of the polypeptide residing in the extracellular space, and has two short cytoplasmic domains. Some members of this family have EGF repeats at the C terminus and also have a vWA domain inserted within the integrin domain at the N terminus. Most integrins recognise relatively short peptide motifs, and in general require an acidic amino acid to be present. Ligand specificity depends upon both the alpha and beta subunits []. There are at least 18 types of alpha and 8 types of beta subunits recognised in humans []. Each alpha subunit tends to associate only with one type of beta subunit, but there are exceptions to this rule []. Each association of alpha and beta subunits has its own binding specificity and signalling properties. Many integrins require activation on the cell surface before they can bind ligands. Integrins frequently intercommunicate, and binding at one integrin receptor activate or inhibit another. The structure of unliganded alphaV beta3 showed the molecule to be folded, with the head bent over towards the C termini of the legs which would normally be inserted into the membrane []. The head comprises a beta propeller domain at the end terminus of the alphaV subunit and an I/A domain inserted into a loop on the top of the hybrid domain in the beta subunit. The I/A domain consists of a Rossman fold with a core of beta parallel sheets surrounded by amphipathic alpha helices. Integrins are important therapeutic targets in conditions such as atherosclerosis, thrombosis, cancer and asthma []. At the N terminus of the beta subunit is a cysteine-containing domain reminiscent of that found in presenillins and semaphorins, which has hence been termed the PSI domain. C-terminal to the PSI domain is an A-domain, which has been predicted to adopt a Rossmann fold similar to that of the alpha subunit, but with additional loops between the second and third beta strands []. The murine gene Pactolus shares significant similarity with the beta subunit [], but lacks either one or both of the inserted loops. The C-terminal portion of the beta subunit extracellular domain contains an internally disulphide-bonded cysteine-rich region, while the intracellular tail contains putative sites of interaction with a variety of intracellular signalling and cytoskeletal proteins, such as focal adhesion kinase and alpha-actinin respectively []. Integrin cytoplasmic domains are normally less than 50 amino acids in length, with the beta-subunit sequences exhibiting greater homology to each other than the alpha-subunit sequences. This is consistent with current evidence that the beta subunit is the principal site for binding of cytoskeletal and signalling molecules, whereas the alpha subunit has a regulatory role. The first 20 amino acids of the beta-subunit cytoplasmic domain are also alpha helical, but the final 25 residues are disordered and, apart from a turn that follows a conserved NPxY motif, appear to lack defined structure, suggesting that this is adopted on effector binding. The two membrane-proximal helices mediate the link between the subunits via a series of hydrophobic and electrostatic contacts. This entry represents the N-terminal portion of the extracellular region of integrin beta subunits.; GO: 0005488 binding, 0007155 cell adhesion, 0007160 cell-matrix adhesion; PDB: 3VI4_B 3VI3_B 2VDQ_B 3IJE_B 1M1X_B 2VDR_B 3NIF_B 3NID_D 1TYE_F 2Q6W_F ....
Probab=32.00 E-value=77 Score=26.91 Aligned_cols=37 Identities=16% Similarity=0.177 Sum_probs=29.4
Q ss_pred eeeceeceEEEeecCCeEEEEEEEECCCCCcEEEEEe
Q psy4837 2 ITTHRYLLTLHVYPGETALAFYTAKNPTNSPIVGIST 38 (100)
Q Consensus 2 ~~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAv 38 (100)
+-+.|..-+++++||+...+.+.++=..+.|+--.-.
T Consensus 72 vQl~PQ~v~l~LRpG~~~~f~v~~~~a~~yPvDLYyL 108 (426)
T PF00362_consen 72 VQLSPQKVSLKLRPGEPVTFNVTVRPAEDYPVDLYYL 108 (426)
T ss_dssp BSEESSEEEEEEETTBEEEEEEEEEBSSS--EEEEEE
T ss_pred eeeccceeEEEeecceeEEEEEEEeeccccceeEEEE
Confidence 3478999999999999999999999888888765433
No 44
>PF00127 Copper-bind: Copper binding proteins, plastocyanin/azurin family; InterPro: IPR000923 Blue (type 1) copper proteins are small proteins which bind a single copper atom and which are characterised by an intense electronic absorption band near 600 nm [, ]. The most well known members of this class of proteins are the plant chloroplastic plastocyanins, which exchange electrons with cytochrome c6, and the distantly related bacterial azurins, which exchange electrons with cytochrome c551. This family of proteins also includes amicyanin from bacteria such as Methylobacterium extorquens or Paracoccus versutus (Thiobacillus versutus) that can grow on methylamine; auracyanins A and B from Chloroflexus aurantiacus []; blue copper protein from Alcaligenes faecalis; cupredoxin (CPC) from Cucumis sativus (Cucumber) peelings []; cusacyanin (basic blue protein; plantacyanin, CBP) from cucumber; halocyanin from Natronomonas pharaonis (Natronobacterium pharaonis) [], a membrane associated copper-binding protein; pseudoazurin from Pseudomonas; rusticyanin from Thiobacillus ferrooxidans []; stellacyanin from Rhus vernicifera (Japanese lacquer tree); umecyanin from the roots of Armoracia rusticana (Horseradish); and allergen Ra3 from ragweed. This pollen protein is evolutionary related to the above proteins, but seems to have lost the ability to bind copper. Although there is an appreciable amount of divergence in the sequences of all these proteins, the copper ligand sites are conserved.; GO: 0005507 copper ion binding, 0009055 electron carrier activity; PDB: 1UAT_A 1CUO_A 1PLC_A 4PCY_A 3PCY_A 1PND_A 1PNC_A 1JXG_A 6PCY_A 1TKW_A ....
Probab=30.31 E-value=1.4e+02 Score=19.39 Aligned_cols=56 Identities=16% Similarity=0.252 Sum_probs=32.1
Q ss_pred ceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccc-cchhhcccceeeeeccccccCCCCeeec
Q psy4837 8 LLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVP-FDAGQYFNKIQCFCFEEQQLNPHEELQI 71 (100)
Q Consensus 8 q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP-~~A~~YF~KieCFCF~eQ~L~pgE~~~M 71 (100)
...++|++|++ |.|...+...+.++.-+ .-.| +....+...-. ....|+||++...
T Consensus 16 P~~i~V~~G~t--V~~~n~~~~~Hnv~~~~--~~~~~~~~~~~~~~~~----~~~~~~~G~~~~~ 72 (99)
T PF00127_consen 16 PSEITVKAGDT--VTFVNNDSMPHNVVFVA--DGMPAGADSDYVPPGD----SSPLLAPGETYSV 72 (99)
T ss_dssp SSEEEEETTEE--EEEEEESSSSBEEEEET--TSSHTTGGHCHHSTTC----EEEEBSTTEEEEE
T ss_pred CCEEEECCCCE--EEEEECCCCCceEEEec--ccccccccccccCccc----cceecCCCCEEEE
Confidence 36799999996 45666666666666554 1111 12222222222 4556788887665
No 45
>TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins.
Probab=29.81 E-value=80 Score=23.14 Aligned_cols=44 Identities=20% Similarity=0.111 Sum_probs=28.4
Q ss_pred eEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837 9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI 71 (100)
Q Consensus 9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M 71 (100)
..++|+.|+.+++.+. |.++- ..+-.+.. | ..+..|.|||+...
T Consensus 61 ~~I~VkaGD~Vtl~vt--N~d~~-~H~f~i~~--------~--------gis~~I~pGet~Ti 104 (135)
T TIGR03096 61 EALVVKKGTPVKVTVE--NKSPI-SEGFSIDA--------Y--------GISEVIKAGETKTI 104 (135)
T ss_pred CEEEECCCCEEEEEEE--eCCCC-ccceEECC--------C--------CcceEECCCCeEEE
Confidence 5688999998888775 87774 22222221 1 12677888988754
No 46
>KOG3960|consensus
Probab=29.57 E-value=12 Score=30.77 Aligned_cols=9 Identities=78% Similarity=0.999 Sum_probs=6.2
Q ss_pred cccccceeE
Q psy4837 86 SQILPKVEI 94 (100)
Q Consensus 86 ~~~~~~~~~ 94 (100)
+|-||||||
T Consensus 149 NQRLPKVEI 157 (284)
T KOG3960|consen 149 NQRLPKVEI 157 (284)
T ss_pred cccccHHHH
Confidence 466777776
No 47
>PF02018 CBM_4_9: Carbohydrate binding domain; InterPro: IPR003305 The 1,4-beta-glucanase CenC from Cellulomonas fimi contains two cellulose-binding domains, CBD(N1) and CBD(N2), arranged in tandem at its N terminus. These homologous CBDs are distinct in their selectivity for binding amorphous and not crystalline cellulose []. Multidimensional heteronuclear nuclear magnetic resonance (NMR) spectroscopy was used to determine the tertiary structure of the 152 amino acid N-terminal cellulose-binding domain from C. fimi 1,4-beta-glucanase CenC (CBDN1) []. The tertiary structure of CBDN1 is strikingly similar to that of the bacterial 1,3-1,4-beta-glucanases, as well as other sugar-binding proteins with jelly-roll folds.; GO: 0016798 hydrolase activity, acting on glycosyl bonds; PDB: 3OEA_B 2ZEX_B 3OEB_A 2ZEY_A 2ZEW_A 1GUI_A 2W5F_A 2WZE_A 2WYS_A 2ZEZ_B ....
Probab=29.53 E-value=1.4e+02 Score=19.02 Aligned_cols=25 Identities=20% Similarity=0.326 Sum_probs=20.1
Q ss_pred eEEEeecCCeEEEEEEEECCCCCcE
Q psy4837 9 LTLHVYPGETALAFYTAKNPTNSPI 33 (100)
Q Consensus 9 ~~v~V~pGE~~~v~y~a~N~sd~~v 33 (100)
..+.|.+|+.+++.|.++-.....+
T Consensus 54 ~~~~l~~G~~Y~~s~~vk~~~~~~~ 78 (131)
T PF02018_consen 54 QTISLKPGKTYTVSFWVKADSGGTV 78 (131)
T ss_dssp EEEEE-TTSEEEEEEEEEESSSEEE
T ss_pred cceEecCCCEEEEEEEEEeCCCCEE
Confidence 3489999999999999999888333
No 48
>PRK11055 galM galactose-1-epimerase; Provisional
Probab=29.45 E-value=1.1e+02 Score=24.84 Aligned_cols=44 Identities=25% Similarity=0.416 Sum_probs=31.8
Q ss_pred ceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccc
Q psy4837 8 LLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQ 62 (100)
Q Consensus 8 q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~ 62 (100)
.++|.|..-+...+.|.+....+. ++..|.-+.|-.+||+|-|.
T Consensus 265 g~~l~v~t~~p~lqvYT~n~~~~~-----------~~~~g~~~~~~~gialE~q~ 308 (342)
T PRK11055 265 KLQMKVYTTAPALQFYSGNFLAGT-----------PSRGGGPYADYAGLALESQF 308 (342)
T ss_pred CeEEEEEcCCCEEEEecCCccCCc-----------cCCCCcEeCCCceEEEEccc
Confidence 467888888888888887544321 24456777889999999664
No 49
>cd06184 flavohem_like_fad_nad_binding FAD_NAD(P)H binding domain of flavohemoglobin. Flavohemoglobins have a globin domain containing a B-type heme fused with a ferredoxin reductase-like FAD/NAD-binding domain. Flavohemoglobins detoxify nitric oxide (NO) via an NO dioxygenase reaction. The hemoglobin domain adopts a globin fold with an embedded heme molecule. Flavohemoglobins also have a C-terminal reductase domain with bindiing sites for FAD and NAD(P)H. This domain catalyzes the conversion of NO + O2 + NAD(P)H to NO3- + NAD(P)+. Instead of the oxygen transport function of hemoglobins, flavohemoglobins seem to act in NO dioxygenation and NO signalling.
Probab=29.38 E-value=1.5e+02 Score=21.80 Aligned_cols=24 Identities=8% Similarity=-0.168 Sum_probs=15.0
Q ss_pred eeeceeceE--EEeecCCeEEEEEEE
Q psy4837 2 ITTHRYLLT--LHVYPGETALAFYTA 25 (100)
Q Consensus 2 ~~~~p~q~~--v~V~pGE~~~v~y~a 25 (100)
+++++.... ...+|||...+.+..
T Consensus 24 l~l~~~~~~~~~~~~pGQ~v~l~~~~ 49 (247)
T cd06184 24 FYLEPADGGPLPPFLPGQYLSVRVKL 49 (247)
T ss_pred EEEEeCCCCcCCCCCCCCEEEEEEec
Confidence 455554432 578888887777543
No 50
>PF14415 DUF4424: Domain of unknown function (DUF4424)
Probab=29.00 E-value=87 Score=24.87 Aligned_cols=23 Identities=22% Similarity=0.364 Sum_probs=18.3
Q ss_pred CeEEEEEEEECCCCCcEEEEEeC
Q psy4837 17 ETALAFYTAKNPTNSPIVGISTY 39 (100)
Q Consensus 17 E~~~v~y~a~N~sd~~v~GqAvp 39 (100)
+.++|.|+.+|.++++++..-.|
T Consensus 1 ~~I~V~Y~F~N~t~~dv~~~VaF 23 (253)
T PF14415_consen 1 ERIRVRYVFRNPTDQDVTVTVAF 23 (253)
T ss_pred CEEEEEEEEeCCCCCcEEEEEEE
Confidence 45789999999999998555444
No 51
>PRK15195 fimbrial chaperone protein FimC; Provisional
Probab=28.70 E-value=1.6e+02 Score=22.66 Aligned_cols=63 Identities=16% Similarity=0.151 Sum_probs=35.9
Q ss_pred ecCCeEEEEEEEECCCCC-cEEEEEeCcccccchhhcccceeeeecc--ccccCCCCeeec----cCCCCCCCCch
Q psy4837 14 YPGETALAFYTAKNPTNS-PIVGISTYNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAPKKPHHNEH 82 (100)
Q Consensus 14 ~pGE~~~v~y~a~N~sd~-~v~GqAvpsVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP~i~~d~~~ 82 (100)
.+++...+.+.++|.++. +...|+--. -+... +..=|=-+ =..|+||++..+ .++.||+|+|.
T Consensus 36 y~~~~~~~si~l~N~~~~~~~LvQsWv~--~~~~~----~~~pfivtPPlfrl~p~~~q~lRIi~~~~~LP~DrES 105 (229)
T PRK15195 36 YPADAKQTSLAIRNSHTNERYLVNSWIE--NSSGV----KEKSFIVTPPLFVSEPKSENTLRIIYAGPPLAADRES 105 (229)
T ss_pred EeCCCceEEEEEEeCCCCccEEEEEEec--CCCCC----ccCCEEEcCCeEEECCCCceEEEEEECCCCCCCCeeE
Confidence 345555588999999865 677776331 11100 00112111 135788887777 77778888665
No 52
>smart00391 MBD Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain
Probab=28.14 E-value=25 Score=23.02 Aligned_cols=16 Identities=19% Similarity=0.343 Sum_probs=11.6
Q ss_pred cccceeeeeccccccC
Q psy4837 49 YFNKIQCFCFEEQQLN 64 (100)
Q Consensus 49 YF~KieCFCF~eQ~L~ 64 (100)
++-.++||||+-+.+.
T Consensus 56 ~~~~~~~F~F~~~~~~ 71 (77)
T smart00391 56 LSLDLECFDFNATVPV 71 (77)
T ss_pred cccccccccCcCCccc
Confidence 4455688999987654
No 53
>PF11325 DUF3127: Domain of unknown function (DUF3127); InterPro: IPR021474 This bacterial family of proteins has no known function.
Probab=27.40 E-value=61 Score=21.99 Aligned_cols=30 Identities=17% Similarity=0.406 Sum_probs=21.9
Q ss_pred EeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeee
Q psy4837 12 HVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCF 56 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCF 56 (100)
.+++|+...|+|.++=+. -. ++||+-+.+.
T Consensus 52 ~~~~Gd~V~Vsf~i~~RE---~~------------gr~fn~i~aW 81 (84)
T PF11325_consen 52 NFQVGDEVKVSFNIEGRE---WN------------GRWFNSIRAW 81 (84)
T ss_pred cCCCCCEEEEEEEeeccE---ec------------ceEeeEeEEE
Confidence 578999999999776432 22 7888877653
No 54
>PRK15188 fimbrial chaperone protein BcfB; Provisional
Probab=26.12 E-value=2e+02 Score=22.41 Aligned_cols=65 Identities=17% Similarity=0.163 Sum_probs=38.9
Q ss_pred EeecCCeEEEEEEEECCCCC-cEEEEEeCcccccchhhcccceeeeecc--ccccCCCCeeec----cCCCCCCCCch
Q psy4837 12 HVYPGETALAFYTAKNPTNS-PIVGISTYNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI----FAPKKPHHNEH 82 (100)
Q Consensus 12 ~V~pGE~~~v~y~a~N~sd~-~v~GqAvpsVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M----vDP~i~~d~~~ 82 (100)
-+.+|....+.+.++|.+++ +...|+--. ...... + .=|=-+ -..|+||++..+ ....+|+|+|.
T Consensus 36 vIy~~~~~~~sv~i~N~~~~~p~LvQsWv~-~~~~~~----~-~pFivtPPlfrl~~~~~~~lRI~~~~~~lP~DRES 107 (228)
T PRK15188 36 VIYPQGSKQTSLPIINSSASNVFLIQSWVA-NADGSR----S-TDFIITPPLFVIQPKKENILRIMYVGPSLPTDRES 107 (228)
T ss_pred EEEcCCCceEEEEEEeCCCCccEEEEEEEe-cCCCCc----c-CCEEEcCCeEEECCCCceEEEEEECCCCCCCCceE
Confidence 34566777899999999855 677777442 111100 0 001111 235788887777 67778888765
No 55
>PF06650 DUF1162: Protein of unknown function (DUF1162); InterPro: IPR009543 Proteins in this entry may play a role in the control of protein cycling through the trans-Golgi network. Vacuolar sorting protein is an ATPase required for endosomal trafficking []. Defects in the human protein VPS13A cause chorea-acanthocytosis, an autosomal recessive neurodegenerative disorder characterised by the gradual onset of hyperkinetic movements and abnormal erythrocyte morphology [].; GO: 0008104 protein localization
Probab=26.00 E-value=2.1e+02 Score=21.44 Aligned_cols=22 Identities=18% Similarity=0.007 Sum_probs=15.2
Q ss_pred eeeccccccCCCCeeec--cCCCC
Q psy4837 55 CFCFEEQQLNPHEELQI--FAPKK 76 (100)
Q Consensus 55 CFCF~eQ~L~pgE~~~M--vDP~i 76 (100)
|.+-....|.||++... =||..
T Consensus 252 ~~~~~~~~l~p~~~~~yaWd~P~~ 275 (277)
T PF06650_consen 252 GGSSEWYVLPPGSSMPYAWDDPTA 275 (277)
T ss_pred cCCeeeEEECCCcceeEEccCcCC
Confidence 45555668999998876 55544
No 56
>PF00345 PapD_N: Pili and flagellar-assembly chaperone, PapD N-terminal domain; InterPro: IPR016147 Most Gram-negative bacteria possess a supramolecular structure - the pili - on their surface, which mediates attachment to specific receptors. Many interactive subunits are required to assemble pili, but their assembly only takes place after translocation across the cytoplasmic membrane. Periplasmic chaperones assist pili assembly by binding to the subunits, thereby preventing premature aggregation [, ]. Pili chaperones are structurally, and possibly evolutionarily, related to the immunoglobulin superfamily [, ]: they contain two globular domains, with a topology identical to an immunoglobulin fold. This entry represents the N-terminal domain of pili assembly chaperone, and has a beta-sandwich fold consisting of seven strands in two sheets with a Greek key topology.; GO: 0007047 cellular cell wall organization, 0030288 outer membrane-bounded periplasmic space; PDB: 2CO6_B 2CO7_B 1L4I_B 3GFU_A 3F65_F 3F6L_A 3F6I_A 3GEW_B 3DSN_D 2OS7_B ....
Probab=25.25 E-value=2e+02 Score=19.13 Aligned_cols=71 Identities=15% Similarity=0.039 Sum_probs=41.0
Q ss_pred ceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeecc--ccccCCCCeeec---cCCCCCCCCc
Q psy4837 8 LLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFE--EQQLNPHEELQI---FAPKKPHHNE 81 (100)
Q Consensus 8 q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~--eQ~L~pgE~~~M---vDP~i~~d~~ 81 (100)
...+.+..++. .+.+.+.|.+++++..|+...=... ..-..+..-|=++ .-.|+||++..+ ..+.++.|.|
T Consensus 6 ~trii~~~~~~-~~~i~v~N~~~~~~~vq~~v~~~~~--~~~~~~~~~~~vsPp~~~L~pg~~q~vRv~~~~~~~~~~E 81 (122)
T PF00345_consen 6 PTRIIFNESQR-SASITVTNNSDQPYLVQVWVYDQDD--EDEDEPTDPFIVSPPIFRLEPGESQTVRVYRGSKLPIDRE 81 (122)
T ss_dssp SSEEEEETTSS-EEEEEEEESSSSEEEEEEEEEETTS--TTSSSSSSSEEEESSEEEEETTEEEEEEEEECSGS-SSS-
T ss_pred cEEEEEeCCCC-EEEEEEEcCCCCcEEEEEEEEcCCC--cccccccccEEEeCCceEeCCCCcEEEEEEecCCCCCCce
Confidence 34566666554 7899999999999999987643111 0011122223332 346888887777 4344444443
No 57
>PF00963 Cohesin: Cohesin domain; InterPro: IPR002102 Cohesin domains interact with a complementary domain, termed the dockerin domain (see IPR002105 from INTERPRO). The cohesin-dockerin interaction is the crucial interaction for complex formation in the cellulosome []. The scaffoldin component of the cellulolytic bacterium Clostridium thermocellum is a non-hydrolytic protein which organises the hydrolytic enzymes in a large complex, called the cellulosome. Scaffoldin comprises a series of functional domains, amongst which is a single cellulose-binding domain and nine cohesin domains which are responsible for integrating the individual enzymatic subunits into the complex.; GO: 0030246 carbohydrate binding, 0000272 polysaccharide catabolic process; PDB: 2BM3_A 3P0D_I 3KCP_A 2B59_A 3L8Q_B 3FNK_C 3GHP_B 2CCL_A 1ANU_A 1OHZ_A ....
Probab=25.20 E-value=2.3e+02 Score=19.31 Aligned_cols=36 Identities=25% Similarity=0.103 Sum_probs=25.5
Q ss_pred eEEEeecCCeEEEEEEEECCCCCcEEEEEeCccccc
Q psy4837 9 LTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPF 44 (100)
Q Consensus 9 ~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~ 44 (100)
.+..++||++..+.-.+.|..+.....+...++=|.
T Consensus 6 ~~~~a~~G~tv~V~V~v~~~~~~i~~~~~~l~yDp~ 41 (141)
T PF00963_consen 6 DSVSAKPGETVTVPVNVSNVSNSIAGMQFTLSYDPS 41 (141)
T ss_dssp SECEE-TTSEEEEEEEEESCTTTEEEEEEEEEE-TT
T ss_pred CCceECCCCEEEEEEEEEcCCCcEEEEEEEEEeCCc
Confidence 455679999999999999998874455555555554
No 58
>PF01429 MBD: Methyl-CpG binding domain; InterPro: IPR001739 Methylation at CpG dinucleotide, the most common DNA modification in eukaryotes, has been correlated with gene silencing associated with various phenomena such as genomic imprinting, transposon and chromosome X inactivation, differentiation, and cancer. Effects of DNA methylation are mediated through proteins which bind to symmetrically methylated CpGs. Such proteins contain a specific domain of ~70 residues, the methyl-CpG-binding domain (MBD), which is linked to additional domains associated with chromatin, such as the bromodomain, the AT hook motif,the SET domain, or the PHD finger. MBD-containing proteins appear to act as structural proteins, which recruit a variety of histone deacetylase (HDAC) complexes and chromatin remodelling factors, leading to chromatin compaction and, consequently, to transcriptional repression. The MBD of MeCP2, MBD1, MBD2, MBD4 and BAZ2 mediates binding to DNA, in case of MeCP2, MBD1 and MBD2 preferentially to methylated CpG. In case of human MBD3 and SETDB1 the MBD has been shown to mediate protein-protein interactions [, ]. The MBD folds into an alpha/beta sandwich structure comprising a layer of twisted beta sheet, backed by another layer formed by the alpha1 helix and a hairpin loop at the C terminus. These layers are both amphipathic, with the alpha1 helix and the beta sheet lying parallel and the hydrophobic faces tightly packed against each other. The beta sheet is composed of two long inner strands (beta2 and beta3) sandwiched by two shorter outer strands (beta1 and beta4) [].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 2KY8_A 1UB1_A 1D9N_A 1IG4_A 1QK9_A 3C2I_A.
Probab=25.08 E-value=31 Score=22.23 Aligned_cols=18 Identities=17% Similarity=0.117 Sum_probs=10.6
Q ss_pred hcccceeeeeccccccCC
Q psy4837 48 QYFNKIQCFCFEEQQLNP 65 (100)
Q Consensus 48 ~YF~KieCFCF~eQ~L~p 65 (100)
.+.-++++|||+.+...+
T Consensus 59 ~~~l~~~~F~F~~~~~~~ 76 (77)
T PF01429_consen 59 EHDLKPENFSFSKRLIML 76 (77)
T ss_dssp --SS-CTTBBTTTTB---
T ss_pred cccCCHhHCCCCCCcccC
Confidence 456679999999887654
No 59
>KOG0360|consensus
Probab=24.89 E-value=36 Score=30.35 Aligned_cols=29 Identities=24% Similarity=0.402 Sum_probs=20.9
Q ss_pred cceeeeeccccc--cCCCCeeeccCCCCCCC
Q psy4837 51 NKIQCFCFEEQQ--LNPHEELQIFAPKKPHH 79 (100)
Q Consensus 51 ~KieCFCF~eQ~--L~pgE~~~MvDP~i~~d 79 (100)
-||.||||+-|. .+=|+.+-..||+-.++
T Consensus 235 ~Kiacld~~Lqk~km~lGv~vvv~dp~kle~ 265 (545)
T KOG0360|consen 235 AKIACLDFTLQKTKMKLGVQVVVDDPEKLEQ 265 (545)
T ss_pred ceeeeEecccccccccccceEEEcChHHHHH
Confidence 499999999885 34456655588876554
No 60
>PRK13202 ureB urease subunit beta; Reviewed
Probab=24.67 E-value=1e+02 Score=22.02 Aligned_cols=67 Identities=7% Similarity=-0.000 Sum_probs=40.1
Q ss_pred ceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeee-------ccccccCCCCeeec
Q psy4837 5 HRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFC-------FEEQQLNPHEELQI 71 (100)
Q Consensus 5 ~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFC-------F~eQ~L~pgE~~~M 71 (100)
.+....+.+.+|..-.+.-.++|..|+|+---+=|-..=...+.-|.--..+= =+--.++|||+++.
T Consensus 7 ~~~~~~I~ln~grr~~~~l~V~NtGDRPIQVGSHyHF~E~N~aL~FDR~~A~G~RLdIpaGTavRFEPG~~k~V 80 (104)
T PRK13202 7 FYGSGDIEMNAAALSRLQMRIINAGDRPVQVGSHVHLPQANRALSFDRATAHGYRLDIPAATAVRFEPGIPQIV 80 (104)
T ss_pred ecCCCCEEeCCCCCceEEEEEEeCCCCceEEccccchhhcCcceeecHhHhcCcccccCCCCeEEECCCCeEEE
Confidence 34556789999965667789999999998655554433332222222222221 23345778877765
No 61
>cd05711 Ig_FcalphaRI Immunoglobulin (IG)-like domain of of FcalphaRI. IG_FcalphaRI : immunoglobulin (IG)-like domain of of FcalphaRI. FcalphaRI (CD89) is an IgA-specific receptor that is expressed on monocytes, eosinophils, neutrophils and macrophages. FcalphaRI mediates IgA-induced immune effector responses such as phagocytosis, antibody-dependent cell-mediated cytotoxicity and respiratory burst. Both monomeric and dimeric IgA can bind to FcalphaRI, and monomeric or dimeric IgA immune complexes can activate phagocytosis and other immune responses through the clustering of FcalphaRI. The Fc RI ectodomain is comprised of two Ig-like domains oriented at about 90 degree to each another.
Probab=24.43 E-value=1.2e+02 Score=19.47 Aligned_cols=14 Identities=36% Similarity=0.403 Sum_probs=11.2
Q ss_pred EeCcccccchhhcc
Q psy4837 37 STYNVVPFDAGQYF 50 (100)
Q Consensus 37 AvpsVsP~~A~~YF 50 (100)
-+.+++|..+|.|+
T Consensus 58 ~I~~~~~~~~G~Y~ 71 (94)
T cd05711 58 PLGPVTPAHAGTYR 71 (94)
T ss_pred EecCCCcccCEEEE
Confidence 56677889999996
No 62
>PF04314 DUF461: Protein of unknown function (DUF461); InterPro: IPR007410 This entry represents a domain found in of proteins of unknown function, including DR1885 from Deinococcus radiodurans and CC3502 from Caulobacter crescentus (Caulobacter vibrioides), which share a potential metal binding motif H(M)X10MX21HXM. DR1885 was found to bind copper(I) through a histidine and three Mets in a cupredoxin-like fold []. The surface location of the copper-binding site as well as the type of coordination are well poised for metal transfer chemistry, suggesting that DR1885 might transfer copper, taking the role of Cox17 in bacteria (Cox17 being an accessory protein required for correct assembly of eukaryotic cyochrome c oxidase). ; PDB: 2K6W_A 2K6Z_A 2K6Y_A 2K70_A 1X9L_A 2JQA_A.
Probab=24.37 E-value=1.5e+02 Score=19.98 Aligned_cols=53 Identities=6% Similarity=0.037 Sum_probs=29.2
Q ss_pred CeEEEEEEEECCCCCcEEEEEeCcccccchhhccccee-------eeeccccccCCCCeeec
Q psy4837 17 ETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQ-------CFCFEEQQLNPHEELQI 71 (100)
Q Consensus 17 E~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~Kie-------CFCF~eQ~L~pgE~~~M 71 (100)
...-+||.++|.+++++.-.++-+=.-+. .=++... =.=.+.-.+.||+++.+
T Consensus 14 ~~~a~y~ti~N~g~~~~~L~~v~s~~a~~--v~lh~~~~~~g~~~m~~v~~i~ipa~~~v~l 73 (110)
T PF04314_consen 14 NVTAAYFTITNNGDQDDRLVGVSSPAAAR--VELHETVMEDGVMKMRPVDSIPIPAGSTVEL 73 (110)
T ss_dssp SEEEEEEEEE-CSSSEEEEEEEE-TTCCE--EEEEEECCCCCEEEECCSS-EEEETT-EEEE
T ss_pred ccEEEEEEEEeCCCCCeEEEEEEcCCCce--EEEEEEEccCCeEEEEECCCEEECCCCeEEe
Confidence 58899999999999987666555422222 2222211 11122556777777777
No 63
>TIGR02636 galM_Leloir galactose mutarotase. Members of this protein family act as galactose mutarotase (D-galactose 1-epimerase) and participate in the Leloir pathway for galactose/glucose interconversion. All members of the seed alignment for this model are found in gene clusters with other enzymes of the Leloir pathway. This enzyme family belongs to the aldose 1-epimerase family, described by pfam model pfam01263. However, the enzyme described as aldose 1-epimerase itself (EC 5.1.3.3) is called broadly specific for D-glucose, L-arabinose, D-xylose, D-galactose, maltose and lactose. The restricted genome context for genes in this family suggests members should act primarily on D-galactose.
Probab=24.27 E-value=1.4e+02 Score=23.94 Aligned_cols=46 Identities=26% Similarity=0.367 Sum_probs=30.6
Q ss_pred ceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccC
Q psy4837 8 LLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLN 64 (100)
Q Consensus 8 q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~ 64 (100)
.++|+|..-+...+.|......+ .++..|.-+.+=.+||+|-|.+-
T Consensus 260 g~~l~v~t~~p~~~vyT~~~~~~-----------~~~~~g~~~~~~~gialE~q~~p 305 (335)
T TIGR02636 260 DLSLEVFTTQPALQIYTGNFLAG-----------TPNRGGKKYVDHAGLALETQFLP 305 (335)
T ss_pred CcEEEEecCCCEEEEecCCCcCC-----------ccCCCCcEeCCCcEEEEecccCC
Confidence 35677777777777777644422 12334566678899999988764
No 64
>PRK00415 rps27e 30S ribosomal protein S27e; Reviewed
Probab=24.01 E-value=30 Score=22.32 Aligned_cols=25 Identities=28% Similarity=0.452 Sum_probs=19.9
Q ss_pred hhcccceee-eeccccccCCCCeeec
Q psy4837 47 GQYFNKIQC-FCFEEQQLNPHEELQI 71 (100)
Q Consensus 47 ~~YF~KieC-FCF~eQ~L~pgE~~~M 71 (100)
.-||-|+.| =|.++|++=.+-+...
T Consensus 6 ~S~F~~VkCp~C~n~q~vFsha~t~V 31 (59)
T PRK00415 6 RSRFLKVKCPDCGNEQVVFSHASTVV 31 (59)
T ss_pred CCeEEEEECCCCCCeEEEEecCCcEE
Confidence 359999999 7999998877665544
No 65
>smart00187 INB Integrin beta subunits (N-terminal portion of extracellular region). Portion of beta integrins that lies N-terminal to their EGF-like repeats. Integrins are cell adhesion molecules that mediate cell-extracellular matrix and cell-cell interactions. They contain both alpha and beta subunits. Beta integrins are proposed to have a von Willebrand factor type-A "insert" or "I" -like domain (although this remains to be confirmed).
Probab=23.72 E-value=1.2e+02 Score=26.25 Aligned_cols=34 Identities=18% Similarity=0.241 Sum_probs=27.9
Q ss_pred eeeceeceEEEeecCCeEEEEEEEECCCCCcEEE
Q psy4837 2 ITTHRYLLTLHVYPGETALAFYTAKNPTNSPIVG 35 (100)
Q Consensus 2 ~~~~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~G 35 (100)
+.+.|..-.++++||+..++...++=..+.|+--
T Consensus 69 vQi~PQ~v~l~LRpG~~~~f~~~~~~a~~yPvDL 102 (423)
T smart00187 69 VQVSPQRVRLKLRPGEPQNFTLTVRQAEDYPVDL 102 (423)
T ss_pred eEeccceEEEEeccCCcEEEEEEEEecccCccce
Confidence 4568899999999999999998888777766643
No 66
>KOG0121|consensus
Probab=23.70 E-value=29 Score=26.28 Aligned_cols=13 Identities=54% Similarity=1.162 Sum_probs=9.6
Q ss_pred cccceee-eecccc
Q psy4837 49 YFNKIQC-FCFEEQ 61 (100)
Q Consensus 49 YF~KieC-FCF~eQ 61 (100)
=|.|+-| |||-+-
T Consensus 72 r~kktpCGFCFVey 85 (153)
T KOG0121|consen 72 RFKKTPCGFCFVEY 85 (153)
T ss_pred cCCcCccceEEEEE
Confidence 3677777 999764
No 67
>PF00635 Motile_Sperm: MSP (Major sperm protein) domain; InterPro: IPR000535 Major sperm proteins (MSP) are central components in molecular interactions underlying sperm motility in Caenorhabditis elegans, whose sperm employ an amoebae-like crawling motion using a MSP-containing lamellipod, rather than the flagellar-based swimming motion associated with other sperm. These proteins oligomerise to form an extensive filament system that extends from sperm villipoda, along the leading edge of the pseudopod. About 30 MSP isoforms may exist in C. elegans. MSPs form a fibrous network, whereby MSP dimers form helical subfilaments that coil around one another to produce filaments, which in turn form supercoils to produce bundles. The crystal structure of MSP from C. elegans reveals an immunoglobulin (Ig)-like seven-stranded beta sandwich fold []. ; GO: 0005198 structural molecule activity; PDB: 1MSP_A 3MSP_B 2BVU_B 2MSP_C 1Z9O_F 1Z9L_A 3IKK_A 1WIC_A 2CRI_A 2RR3_A ....
Probab=23.41 E-value=1.5e+02 Score=18.92 Aligned_cols=61 Identities=18% Similarity=0.283 Sum_probs=33.8
Q ss_pred eeeceeceE-EEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837 2 ITTHRYLLT-LHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI 71 (100)
Q Consensus 2 ~~~~p~q~~-v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M 71 (100)
|.+.|...- .....+........++|.++.++. |.|-=.....|.-+-. .=.|+|||++++
T Consensus 2 l~v~P~~~i~F~~~~~~~~~~~l~l~N~s~~~i~----fKiktt~~~~y~v~P~-----~G~i~p~~~~~i 63 (109)
T PF00635_consen 2 LSVEPSELIFFNAPFNKQQSCELTLTNPSDKPIA----FKIKTTNPNRYRVKPS-----YGIIEPGESVEI 63 (109)
T ss_dssp CEEESSSEEEEESSTSS-EEEEEEEEE-SSSEEE----EEEEES-TTTEEEESS-----EEEE-TTEEEEE
T ss_pred eEEeCCcceEEcCCCCceEEEEEEEECCCCCcEE----EEEEcCCCceEEecCC-----CEEECCCCEEEE
Confidence 345566333 333346778889999999999653 3333333334443322 235899998888
No 68
>PTZ00485 aldolase 1-epimerase; Provisional
Probab=23.15 E-value=94 Score=25.97 Aligned_cols=48 Identities=23% Similarity=0.296 Sum_probs=35.1
Q ss_pred eceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccC
Q psy4837 7 YLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLN 64 (100)
Q Consensus 7 ~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~ 64 (100)
.-++|+|.--+...+.|.+-.+.+ . .++..|.-|.+-.+||||-|.+-
T Consensus 292 sg~~l~v~T~~P~~qiYT~n~l~~-~---------~~~~~g~~~~~~~giclE~Q~~P 339 (376)
T PTZ00485 292 TNICMKVYSTFPCMWVYTANNKPL-P---------ASGGPGQRYARWTGMGLEPQYFP 339 (376)
T ss_pred CCCEEEEEECCCEEEEECCCCCCc-c---------ccCCCCcCcCCCCEEEEeccCCC
Confidence 346899999999999998755532 1 12344556788899999999874
No 69
>PF13815 Dzip-like_N: Iguana/Dzip1-like DAZ-interacting protein N-terminal
Probab=23.09 E-value=18 Score=25.08 Aligned_cols=40 Identities=10% Similarity=0.187 Sum_probs=31.6
Q ss_pred eeccccccCCCCeeec-----cCC-CCCCCCchhhhcccccceeEeeccc
Q psy4837 56 FCFEEQQLNPHEELQI-----FAP-KKPHHNEHKMRSQILPKVEIANVSS 99 (100)
Q Consensus 56 FCF~eQ~L~pgE~~~M-----vDP-~i~~d~~~~~~~~~~~~~~~~~~~~ 99 (100)
|||.. ..|.+++ ||+ .|.++.+.+.+..+++.+-.++++.
T Consensus 1 F~f~~----r~~~iDWr~i~~iDvd~i~~~~Di~~Lq~~i~~vtf~~l~~ 46 (118)
T PF13815_consen 1 FQFRP----RREPIDWRLISAIDVDRIVRELDIDTLQENIENVTFCDLEN 46 (118)
T ss_pred CCCCC----CCCCCcHHHHhccCHHHHHhccCHHHHHHHHHhcceeccCh
Confidence 67763 3577777 898 6888899999999999888887764
No 70
>PF06280 DUF1034: Fn3-like domain (DUF1034); InterPro: IPR010435 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain of unknown function is present in bacterial and plant peptidases belonging to MEROPS peptidase family S8 (subfamily S8A subtilisin, clan SB). It is C-terminal to and adjacent to the S8 peptidase domain and can be found in conjunction with the PA (Protease associated) domain (IPR003137 from INTERPRO) and additionally in Gram-positive bacteria with the surface protein anchor domain (IPR001899 from INTERPRO).; GO: 0004252 serine-type endopeptidase activity, 0005618 cell wall, 0016020 membrane; PDB: 3EIF_A 1XF1_B.
Probab=21.17 E-value=1.5e+02 Score=19.60 Aligned_cols=26 Identities=19% Similarity=0.103 Sum_probs=17.3
Q ss_pred eceeceEEEeecCCeEEEEEEEECCC
Q psy4837 4 THRYLLTLHVYPGETALAFYTAKNPT 29 (100)
Q Consensus 4 ~~p~q~~v~V~pGE~~~v~y~a~N~s 29 (100)
..-...+++|.||+...+...+.-.+
T Consensus 57 ~~~~~~~vTV~ag~s~~v~vti~~p~ 82 (112)
T PF06280_consen 57 VSFSPDTVTVPAGQSKTVTVTITPPS 82 (112)
T ss_dssp EE---EEEEE-TTEEEEEEEEEE--G
T ss_pred EEeCCCeEEECCCCEEEEEEEEEehh
Confidence 34456899999999999999988755
No 71
>PF07883 Cupin_2: Cupin domain; InterPro: IPR013096 This family represents the conserved barrel domain of the cupin superfamily [] (cupa is the Latin term for a small barrel). ; PDB: 2OPK_C 3BU7_B 2PHD_D 3NVC_A 3NKT_A 3NJZ_A 3NW4_A 3NST_A 3NL1_A 2H0V_A ....
Probab=20.91 E-value=1.9e+02 Score=16.72 Aligned_cols=35 Identities=14% Similarity=0.144 Sum_probs=22.0
Q ss_pred eceeceEEEeecCCeEEE----EEEEECCCCCcEEEEEe
Q psy4837 4 THRYLLTLHVYPGETALA----FYTAKNPTNSPIVGIST 38 (100)
Q Consensus 4 ~~p~q~~v~V~pGE~~~v----~y~a~N~sd~~v~GqAv 38 (100)
+......+.+++|+...+ -+.+.|.++.+..--.|
T Consensus 32 ~~~~~~~~~l~~Gd~~~i~~~~~H~~~n~~~~~~~~l~V 70 (71)
T PF07883_consen 32 LTVDGERVELKPGDAIYIPPGVPHQVRNPGDEPARFLVV 70 (71)
T ss_dssp EEETTEEEEEETTEEEEEETTSEEEEEEESSSEEEEEEE
T ss_pred EEEccEEeEccCCEEEEECCCCeEEEEECCCCCEEEEEE
Confidence 334456777777776664 46667777776654443
No 72
>PRK13203 ureB urease subunit beta; Reviewed
Probab=20.71 E-value=2.5e+02 Score=19.94 Aligned_cols=66 Identities=14% Similarity=0.143 Sum_probs=40.3
Q ss_pred ceeceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeee-------ccccccCCCCeeec
Q psy4837 5 HRYLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFC-------FEEQQLNPHEELQI 71 (100)
Q Consensus 5 ~p~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFC-------F~eQ~L~pgE~~~M 71 (100)
.+....+.+.+|... +.-.++|..|||+---+=|-..=...+.-|.--..|= =+--.++|||+++.
T Consensus 7 ~~~~~~I~ln~gr~~-~~l~V~NtGDRPIQVGSH~HF~E~N~aL~FDR~~A~G~RLdIpaGTavRFEPG~~k~V 79 (102)
T PRK13203 7 ITADGEIELNAGRET-VTLTVANTGDRPIQVGSHYHFFEVNPALSFDREAARGMRLNIPAGTAVRFEPGQTREV 79 (102)
T ss_pred ecCCCCEEeCCCCCE-EEEEEEeCCCCceEEccccchhhcCcchhccHhhhcCcccccCCCCeEeECCCCeEEE
Confidence 345567888888654 6778999999998765555444333333333222222 23456778887765
No 73
>COG1470 Predicted membrane protein [Function unknown]
Probab=20.62 E-value=2.6e+02 Score=25.01 Aligned_cols=64 Identities=17% Similarity=0.073 Sum_probs=47.8
Q ss_pred eceEEEeecCCeEEEEEEEECCCCCcEEEEEeCcccccchhhcccceeeeeccccccCCCCeeec
Q psy4837 7 YLLTLHVYPGETALAFYTAKNPTNSPIVGISTYNVVPFDAGQYFNKIQCFCFEEQQLNPHEELQI 71 (100)
Q Consensus 7 ~q~~v~V~pGE~~~v~y~a~N~sd~~v~GqAvpsVsP~~A~~YF~KieCFCF~eQ~L~pgE~~~M 71 (100)
.+....+++|++....-.++|+...+-.--.--+=.|.-....|. =+=---+.-.|+|||+++.
T Consensus 274 s~~~~~i~~~~t~sf~V~IeN~g~~~d~y~Le~~g~pe~w~~~Ft-eg~~~vt~vkL~~gE~kdv 337 (513)
T COG1470 274 SDIYLEISPSTTASFTVSIENRGKQDDEYALELSGLPEGWTAEFT-EGELRVTSVKLKPGEEKDV 337 (513)
T ss_pred ccceeEEccCCceEEEEEEccCCCCCceeEEEeccCCCCcceEEe-eCceEEEEEEecCCCceEE
Confidence 345578999999999999999999887665555546777777776 2222334567999999999
No 74
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=20.51 E-value=1.3e+02 Score=19.61 Aligned_cols=49 Identities=22% Similarity=0.368 Sum_probs=23.9
Q ss_pred EEEEEEEECCCCCcEEEEEe------Ccc--------cccchhhcccceeeeeccccccCCCCeeec
Q psy4837 19 ALAFYTAKNPTNSPIVGIST------YNV--------VPFDAGQYFNKIQCFCFEEQQLNPHEELQI 71 (100)
Q Consensus 19 ~~v~y~a~N~sd~~v~GqAv------psV--------sP~~A~~YF~KieCFCF~eQ~L~pgE~~~M 71 (100)
..+.+.++|.++++++-+=. +.| --.+.+.-|... +.+-+|+|||++..
T Consensus 2 v~~~l~v~N~s~~~v~l~f~sgq~~D~~v~d~~g~~vwrwS~~~~FtQa----l~~~~l~pGe~~~~ 64 (82)
T PF12690_consen 2 VEFTLTVTNNSDEPVTLQFPSGQRYDFVVKDKEGKEVWRWSDGKMFTQA----LQEETLEPGESLTY 64 (82)
T ss_dssp EEEEEEEEE-SSS-EEEEESSS--EEEEEE-TT--EEEETTTT-----------EEEEE-TT-EEEE
T ss_pred EEEEEEEEeCCCCeEEEEeCCCCEEEEEEECCCCCEEEEecCCchhhhe----eeEEEECCCCEEEE
Confidence 45778888888888775511 111 122355555544 33678999999888
Done!