Query psy2082
Match_columns 314
No_of_seqs 230 out of 568
Neff 6.4
Searched_HMMs 46136
Date Fri Aug 16 23:50:19 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2082.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2082hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3865|consensus 100.0 5E-113 1E-117 789.9 30.1 305 1-306 1-354 (402)
2 KOG3780|consensus 99.9 5.5E-25 1.2E-29 216.6 30.6 258 18-299 8-317 (427)
3 PF02752 Arrestin_C: Arrestin 99.7 3.4E-17 7.4E-22 135.1 14.3 110 193-302 2-136 (136)
4 PF00339 Arrestin_N: Arrestin 99.6 1.5E-14 3.3E-19 121.4 14.3 122 18-161 2-133 (149)
5 PF03643 Vps26: Vacuolar prote 98.4 0.00018 4E-09 68.0 24.9 209 32-282 37-272 (275)
6 PF13002 LDB19: Arrestin_N ter 98.2 4.9E-05 1.1E-09 67.8 13.2 123 105-242 47-181 (191)
7 PF08737 Rgp1: Rgp1; InterPro 97.2 0.077 1.7E-06 53.1 20.8 49 192-240 300-348 (415)
8 PF00339 Arrestin_N: Arrestin 95.3 0.025 5.4E-07 46.9 4.4 100 199-300 2-145 (149)
9 PF07070 Spo0M: SpoOM protein; 94.9 0.39 8.6E-06 44.0 11.2 91 29-150 23-116 (218)
10 PF01835 A2M_N: MG2 domain; I 88.8 1 2.2E-05 35.2 5.2 27 198-224 2-28 (99)
11 KOG3063|consensus 88.2 22 0.00049 33.2 18.8 211 46-281 42-278 (301)
12 KOG2717|consensus 88.0 23 0.0005 33.1 19.1 126 106-240 85-222 (313)
13 KOG3780|consensus 87.7 9.8 0.00021 37.5 12.6 100 198-299 7-156 (427)
14 PF01345 DUF11: Domain of unkn 85.8 3.2 6.9E-05 31.0 6.2 42 192-233 22-63 (76)
15 PF00207 A2M: Alpha-2-macroglo 85.0 2.9 6.3E-05 32.5 5.8 41 192-234 52-92 (92)
16 TIGR01451 B_ant_repeat conserv 83.5 2.5 5.4E-05 29.9 4.4 32 202-233 3-34 (53)
17 PF03643 Vps26: Vacuolar prote 82.1 25 0.00054 33.4 11.9 90 206-299 34-145 (275)
18 PF02752 Arrestin_C: Arrestin 78.0 33 0.00072 27.3 10.1 106 17-150 7-115 (136)
19 PF00927 Transglut_C: Transglu 73.5 9.6 0.00021 30.2 5.6 38 197-235 2-39 (107)
20 PF04425 Bul1_N: Bul1 N termin 68.2 32 0.0007 34.9 9.0 118 30-149 160-345 (438)
21 PF10633 NPCBM_assoc: NPCBM-as 68.1 5.3 0.00011 30.0 2.8 28 208-235 2-29 (78)
22 PF07070 Spo0M: SpoOM protein; 67.5 1E+02 0.0022 28.3 13.4 50 191-240 8-58 (218)
23 PF05688 DUF824: Salmonella re 67.3 8.8 0.00019 26.8 3.5 30 208-237 10-39 (47)
24 PF07705 CARDB: CARDB; InterP 59.8 36 0.00079 25.7 6.2 40 196-235 4-43 (101)
25 PF06030 DUF916: Bacterial pro 58.5 23 0.0005 29.3 5.1 27 208-235 24-50 (121)
26 PF09624 DUF2393: Protein of u 58.2 28 0.00061 29.4 5.8 38 196-234 48-85 (149)
27 PF07703 A2M_N_2: Alpha-2-macr 55.7 17 0.00036 29.7 3.9 30 195-224 94-123 (136)
28 PF09478 CBM49: Carbohydrate b 55.5 31 0.00067 26.2 5.0 39 197-235 2-41 (80)
29 PF04744 Monooxygenase_B: Mono 54.3 14 0.0003 36.6 3.5 42 198-242 249-291 (381)
30 PF00963 Cohesin: Cohesin doma 51.9 43 0.00093 27.8 5.8 37 198-235 1-37 (141)
31 PF05326 SVA: Seminal vesicle 47.8 37 0.00081 28.5 4.7 61 192-254 30-90 (124)
32 cd00258 GM2-AP GM2 activator p 47.2 72 0.0016 28.1 6.5 78 52-149 62-143 (162)
33 smart00809 Alpha_adaptinC2 Ada 45.1 72 0.0016 24.7 5.9 40 192-235 3-42 (104)
34 TIGR03079 CH4_NH3mon_ox_B meth 45.1 30 0.00065 34.3 4.2 47 194-243 264-311 (399)
35 COG5554 NifU Nitrogen fixation 44.5 61 0.0013 23.9 4.7 39 6-54 2-40 (69)
36 PF13199 Glyco_hydro_66: Glyco 44.2 38 0.00082 35.4 5.1 27 201-227 1-27 (559)
37 PF02014 Reeler: Reeler domain 43.1 53 0.0011 27.1 5.0 38 194-235 19-56 (132)
38 PF07703 A2M_N_2: Alpha-2-macr 43.1 31 0.00068 28.0 3.6 25 198-222 1-25 (136)
39 PF06159 DUF974: Protein of un 41.4 37 0.00081 31.6 4.2 29 207-235 10-38 (249)
40 PF04426 Bul1_C: Bul1 C termin 39.4 62 0.0013 30.7 5.4 36 262-298 235-270 (272)
41 COG4326 Spo0M Sporulation cont 39.3 28 0.0006 32.0 2.8 20 107-126 104-124 (270)
42 COG2373 Large extracellular al 38.5 1.5E+02 0.0033 35.1 9.2 43 191-233 493-535 (1621)
43 KOG0452|consensus 37.5 35 0.00077 35.8 3.6 88 144-246 794-881 (892)
44 PF02883 Alpha_adaptinC2: Adap 35.0 1.5E+02 0.0032 23.5 6.3 44 189-234 4-47 (115)
45 PRK12676 bifunctional inositol 34.6 27 0.00058 32.4 2.1 49 31-83 84-132 (263)
46 KOG2540|consensus 34.6 74 0.0016 29.5 4.8 28 204-231 155-183 (269)
47 cd08548 Type_I_cohesin_like Ty 33.8 91 0.002 26.1 5.0 39 199-237 2-40 (135)
48 PF13595 DUF4138: Domain of un 33.3 58 0.0013 30.4 4.1 36 198-234 130-167 (246)
49 cd08544 Reeler Reeler, the N-t 32.6 89 0.0019 25.7 4.8 37 198-236 21-57 (135)
50 COG4326 Spo0M Sporulation cont 32.5 1.2E+02 0.0026 27.9 5.8 43 192-234 29-72 (270)
51 cd01641 Bacterial_IMPase_like_ 30.3 37 0.00081 31.0 2.3 46 33-82 77-122 (248)
52 PF06988 NifT: NifT/FixU prote 30.3 50 0.0011 24.6 2.5 21 7-27 3-23 (64)
53 cd01639 IMPase IMPase, inosito 30.3 34 0.00074 31.1 2.0 47 32-82 79-125 (244)
54 PF11355 DUF3157: Protein of u 30.0 1.3E+02 0.0028 27.4 5.5 36 198-234 100-135 (199)
55 PRK10757 inositol monophosphat 29.3 37 0.00081 31.5 2.1 47 32-82 82-128 (267)
56 TIGR02934 nifT_nitrog probable 28.9 57 0.0012 24.5 2.6 18 10-27 6-23 (67)
57 PF07919 Gryzun: Gryzun, putat 28.7 1.7E+02 0.0037 29.8 7.0 39 191-229 168-208 (554)
58 TIGR01331 bisphos_cysQ 3'(2'), 27.9 35 0.00075 31.3 1.6 47 32-82 80-126 (249)
59 PF14796 AP3B1_C: Clathrin-ada 27.9 1.6E+02 0.0034 25.4 5.5 44 191-234 64-108 (145)
60 PLN02911 inositol-phosphate ph 27.2 40 0.00087 32.0 2.0 48 31-82 112-159 (296)
61 cd01515 Arch_FBPase_1 Archaeal 27.2 40 0.00087 31.1 1.9 48 31-82 79-128 (257)
62 cd01517 PAP_phosphatase PAP-ph 26.1 55 0.0012 30.4 2.7 47 31-82 75-128 (274)
63 cd08546 cohesin_like Cohesin d 25.9 2.1E+02 0.0046 22.8 5.9 35 198-235 3-37 (135)
64 TIGR02067 his_9_proposed histi 24.9 44 0.00095 30.6 1.7 47 32-82 78-124 (251)
65 PLN02553 inositol-phosphate ph 24.4 52 0.0011 30.5 2.1 47 32-82 89-135 (270)
66 PF12389 Peptidase_M73: Camely 24.4 1.6E+02 0.0034 26.8 5.1 30 205-234 59-88 (199)
67 COG2309 AmpS Leucyl aminopepti 23.9 1.5E+02 0.0033 29.6 5.4 54 10-64 193-267 (385)
68 cd01643 Bacterial_IMPase_like_ 23.7 52 0.0011 30.1 2.0 47 32-82 75-121 (242)
69 PRK03298 hypothetical protein; 23.6 5.9E+02 0.013 23.6 8.7 46 138-183 7-55 (224)
70 PRK05089 cytochrome C oxidase 22.5 1.2E+02 0.0027 27.2 4.0 25 207-231 90-114 (188)
71 TIGR03780 Bac_Flav_CT_N Bacter 21.9 1.2E+02 0.0027 29.0 4.1 37 198-235 169-207 (285)
72 KOG2625|consensus 21.7 53 0.0012 30.6 1.6 28 207-234 11-38 (348)
73 PRK10931 adenosine-3'(2'),5'-b 21.3 52 0.0011 30.1 1.4 47 32-82 81-127 (246)
74 PF04442 CtaG_Cox11: Cytochrom 20.7 1E+02 0.0022 26.8 3.0 25 207-231 63-87 (152)
75 cd08908 START_STARD12-like C-t 20.1 2.5E+02 0.0054 25.4 5.6 36 4-39 84-127 (204)
76 cd08759 Type_III_cohesin_like 20.1 2.2E+02 0.0048 24.9 5.0 35 198-234 3-37 (160)
No 1
>KOG3865|consensus
Probab=100.00 E-value=5.2e-113 Score=789.91 Aligned_cols=305 Identities=54% Similarity=0.930 Sum_probs=294.7
Q ss_pred CCccceEEEeecCCCcEEEEecceeeEecCCCceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEE
Q psy2082 1 MVANFKVFKKCSPNGKLTLYMGKRDFVDYVTGVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYL 80 (314)
Q Consensus 1 ~~~~~~vfkk~s~n~~~t~yl~~rd~~d~~~~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~ 80 (314)
|.+++|||||+|||||||+||||||||||++.||||||||+|||||+|+||||+||+|+|||||||+|||||+||||||+
T Consensus 1 ~~~~~rVFKK~SpNgkiT~YLgkRDFvDhvd~vdPvDGvVlvDpeYlK~RKvfv~L~caFRYGREDldVlGLtFrKdL~~ 80 (402)
T KOG3865|consen 1 MKVGTRVFKKASPNGKITVYLGKRDFVDHVDQVDPVDGVVLVDPEYLKDRKVFVQLTCAFRYGREDLDVLGLTFRKDLYL 80 (402)
T ss_pred CCcceeEeEecCCCCcEEEEecccccccccccccccceeEEEChHHhccceEEEEEEeeeecccccceeeeeEEEeeeEE
Confidence 67899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEEEecCCCCCCCCcchhhhhhhccCCCeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCcccc
Q psy2082 81 ASEQIYPRSEKQHTSLTKMQDCLLKKLGPKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHRR 160 (314)
Q Consensus 81 ~~~qv~p~~~~~~~~lt~lq~~l~~k~G~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K~ 160 (314)
++.|+||+.+ ...|||+|||||++|+|.|+|||.|++|+++|||++|||+++|.|++|||+|+||||++++.+||+|||
T Consensus 81 ~~~Qv~Pp~~-~~~plT~lQErLlkKLG~nAyPF~f~~pp~~P~SVtLQp~p~D~gKpcGVdyevkaF~~~s~edk~hKr 159 (402)
T KOG3865|consen 81 ATVQVYPPPE-DSRPLTRLQERLLKKLGSNAYPFTFEFPPNLPCSVTLQPGPEDTGKPCGVDYEVKAFVADSEEDKIHKR 159 (402)
T ss_pred EEEEeeCCCc-CCCcccHHHHHHHHHhCCCCCceEEeCCCCCCceEEeccCCccCCCcccceEEEEEEecCCcccccccc
Confidence 9999999843 356799999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ceEEEeeeeccCCCCCCCCCCeEEEeEEEEeeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEE
Q psy2082 161 STVALGIRKIQYAPSKNGRQPCTVVRKDFMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDV 240 (314)
Q Consensus 161 ~~v~l~irkl~~~p~~~~~~p~~e~~k~f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~ 240 (314)
|+|+|+|||+||||..++++|+++++|.|+||+|+|+|+|+|||++|||||+|.+||+|+|||+|+||+||+.+.|++++
T Consensus 160 ~sVrL~IRKvqyAP~~~GpqP~~~v~k~FlmS~~~lhLevsLDkEiYyHGE~isvnV~V~NNsnKtVKkIK~~V~Q~adi 239 (402)
T KOG3865|consen 160 NSVRLVIRKVQYAPLEPGPQPSAEVSKQFLMSDGPLHLEVSLDKEIYYHGEPISVNVHVTNNSNKTVKKIKISVRQVADI 239 (402)
T ss_pred cceeeeeeeeeecCCCCCCCchhHhhHhhccCCCceEEEEEecchheecCCceeEEEEEecCCcceeeeeEEEeEeeceE
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEecCeEEEEEEEEeec------------------------------------------------cCCCCCCCceeEEE
Q psy2082 241 VLFQNGQFRHSVDCVETH------------------------------------------------LASPDPKDAFGIVV 272 (314)
Q Consensus 241 ~~~~~~~~~~~Va~~E~~------------------------------------------------L~~~~~~~~~~I~V 272 (314)
++|++++|+|+||.+|+. +.+++.++.+||.|
T Consensus 240 ~Lfs~aqy~~~VA~~E~~eGc~v~Pgstl~Kvf~l~PllanN~dkrGlALDG~lKhEDtnLASSTii~~~~~re~lGI~V 319 (402)
T KOG3865|consen 240 CLFSTAQYKKPVAMEETDEGCPVAPGSTLSKVFTLTPLLANNKDKRGLALDGKLKHEDTNLASSTIIREGADREALGILV 319 (402)
T ss_pred EEEecccccceeeeeecccCCccCCCCeeeeeEEechhhhcCcccccccccccccccccccchhheecCCCCcceeEEEE
Confidence 999999999999999876 26778899999999
Q ss_pred EEEEEEEEEECC-CCceeEEEEeEEEecCCCCCcc
Q psy2082 273 SYMVKVKLYLGA-LGGELSAELPFILMHPKVRLKM 306 (314)
Q Consensus 273 sY~vkV~l~v~~-~~~~v~velPi~I~~p~p~~~~ 306 (314)
||+|+|+|.+++ .++++.+||||+||||+|.+++
T Consensus 320 sY~VkVkL~vs~ll~ge~~~ElPF~LmhPkP~~~p 354 (402)
T KOG3865|consen 320 SYKVKVKLVVSRLLGGEVAAELPFTLMHPKPGEEP 354 (402)
T ss_pred EEEEEEEEEEecccCCceeeecceEEecCCCCCCc
Confidence 999999999994 7899999999999999997753
No 2
>KOG3780|consensus
Probab=99.95 E-value=5.5e-25 Score=216.56 Aligned_cols=258 Identities=26% Similarity=0.339 Sum_probs=178.7
Q ss_pred EEEecceeeEecCCCceeeeeEEEEe-cceeeceeEEEEEEEeeeccccccc---------------ccCcccc-ceEEE
Q psy2082 18 TLYMGKRDFVDYVTGVEPIDGVVVFE-EDYIQDRKVFGQVVCSFRYGREEDE---------------ILGLNFQ-KELYL 80 (314)
Q Consensus 18 t~yl~~rd~~d~~~~vd~v~GvV~vd-~~~lk~rkv~~~~~~~fryG~~d~d---------------v~gl~f~-k~l~~ 80 (314)
.+.|..++ --...-++|.|-|++. .+.++.|.|.+++.+..+= .|... --...|. +..|+
T Consensus 8 ~i~~d~~~--~iy~~G~~vsG~v~l~~~~~~~~~~i~l~~~G~~~t-~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~ 84 (427)
T KOG3780|consen 8 EIVLDNPE--AIYFPGEPVSGSVVLSTKEPIKVRAIKLQLKGRART-SWSESERGTKLNSKSEGSIKSSTVNYTAKETYL 84 (427)
T ss_pred EEEeCCCc--cccCCCCeEEEEEEEEeCCccceeEEEEEEEEeEEE-eecccccccccccccccccccceEEeeceEEEe
Confidence 34444444 1224558999999999 8889999999998877752 11110 0111222 23333
Q ss_pred -EEEEEecCCCCCCCCcchhhhhhhccCCCeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCccc
Q psy2082 81 -ASEQIYPRSEKQHTSLTKMQDCLLKKLGPKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHR 159 (314)
Q Consensus 81 -~~~qv~p~~~~~~~~lt~lq~~l~~k~G~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K 159 (314)
....+|-..+.... .+..+|.|.|||+|+||.+||+||+ |++|+|||.|+|.+ ||+|+
T Consensus 85 ~~~~~~~~~~~g~~~--------~~l~~G~~~~pF~~~LP~~~P~Sfe--------g~~G~irY~vk~~i-----dr~~~ 143 (427)
T KOG3780|consen 85 DSKTILWTSSNGSNS--------RVLPPGNYEFPFSFTLPLNLPPSFE--------GKFGHVRYFVKAEI-----DRPWK 143 (427)
T ss_pred eeeeEEeeccCCCCc--------eecCCCceEEeEeccCCCCCCCcee--------eCCceEEEEEEEEE-----ecCCC
Confidence 34555552221111 0345999999999999999999999 99999999999995 78999
Q ss_pred cceE-EE---eeee--ccCCCCCCCCCCeE--EEeEEEEeeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEE
Q psy2082 160 RSTV-AL---GIRK--IQYAPSKNGRQPCT--VVRKDFMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIK 231 (314)
Q Consensus 160 ~~~v-~l---~irk--l~~~p~~~~~~p~~--e~~k~f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ik 231 (314)
.+.. +. +++. ++..|....+.-.. +...+++|..|++.++++|++.+|++||.|.+++.|+|.|++.+++++
T Consensus 144 ~~~~~~~~~~V~~~~~ln~~p~~~~~~~~~~~k~~~~~~~~~g~v~~~~~ip~~~~~~ge~i~~~~~i~n~ss~~~~~~~ 223 (427)
T KOG3780|consen 144 LNKKNRKPFTVIETVDLNSSPSLLEPIISKASKKLGCVCFSSGPVSLELTIPKTGYVPGETIPVTLEIENKSSRTIKKVK 223 (427)
T ss_pred CCccceeeEEEecccccccCccccCcchhhhhheeeEEEecCCcEEEEEEcccccCcCCccEEEEEEEecCCCCcceeeE
Confidence 8842 22 2222 23233333222111 112235689999999999999999999999999999999999999999
Q ss_pred EEEEEeEEEEEEecC---eE---EEEEEEEeec----------------c---CCCCCCCceeEEEEEEEEEEEEECCC-
Q psy2082 232 AMVQQGVDVVLFQNG---QF---RHSVDCVETH----------------L---ASPDPKDAFGIVVSYMVKVKLYLGAL- 285 (314)
Q Consensus 232 v~L~Q~~~~~~~~~~---~~---~~~Va~~E~~----------------L---~~~~~~~~~~I~VsY~vkV~l~v~~~- 285 (314)
++|.|.+++.++..+ +. .+.++..... + .|.....|.+|.|+|.++|.+.++++
T Consensus 224 ~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~iP~~~Ps~~~~~~~i~v~y~l~v~~~~~~~~ 303 (427)
T KOG3780|consen 224 AKLIQKISYLAFSYGEHTKTKKSEKTLIKSRGSLEVAPRSEDKFEKELRIPPVPPSILPDTPIIRVEYELKVTLKTSSLR 303 (427)
T ss_pred EEEEEEEEEEeecCCccccceeeeeEEeeeccccccCCCCccccceEEEcCCCCCccCCCCceEEEEEEEEEEEecCccc
Confidence 999999999988532 11 1223221111 2 33344457899999999999999743
Q ss_pred CceeEEEEeEEEec
Q psy2082 286 GGELSAELPFILMH 299 (314)
Q Consensus 286 ~~~v~velPi~I~~ 299 (314)
++++.+++|++|++
T Consensus 304 ~~~~~l~~pi~igt 317 (427)
T KOG3780|consen 304 HSELALELPIIIGT 317 (427)
T ss_pred ccceeeeeceEEec
Confidence 68999999999996
No 3
>PF02752 Arrestin_C: Arrestin (or S-antigen), C-terminal domain; InterPro: IPR011022 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ]. The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin. The C-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=99.75 E-value=3.4e-17 Score=135.15 Aligned_cols=110 Identities=23% Similarity=0.325 Sum_probs=82.5
Q ss_pred CCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEec----CeEEEEEEEEeec----------
Q psy2082 193 PGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQN----GQFRHSVDCVETH---------- 258 (314)
Q Consensus 193 ~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~----~~~~~~Va~~E~~---------- 258 (314)
+|++++++++||.+|.|||.|+++++|+|.|++.|++|+++|.|..+|.+... ....+.+++....
T Consensus 2 ~g~i~~~~~i~~~~~~~Ge~i~v~v~i~n~s~~~i~~I~v~L~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 81 (136)
T PF02752_consen 2 SGKISLSISIPRTAYVPGETIPVNVEIDNQSKKKIKKIKVSLVERITYKAKGGKDESKSEKRVVAKSKNCGVDPGSSGSF 81 (136)
T ss_dssp TEEEEEEEEES-SEEETT--EEEEEEEEE-SSSEEEEEEEEEEEEEEE-SS----S-EEEEEEEEEEECCEB-B-TTEEE
T ss_pred CCEEEEEEEECCCEECCCCEEEEEEEEEECCCCEEEEEEEEEEEEEEEEEeeccccceEEEEEEEEEecCCccCCCCceE
Confidence 69999999999999999999999999999999999999999999998877521 2233455542211
Q ss_pred -------cC----CCCCCCceeEEEEEEEEEEEEECCCCceeEEEEeEEEecCCC
Q psy2082 259 -------LA----SPDPKDAFGIVVSYMVKVKLYLGALGGELSAELPFILMHPKV 302 (314)
Q Consensus 259 -------L~----~~~~~~~~~I~VsY~vkV~l~v~~~~~~v~velPi~I~~p~p 302 (314)
|+ +++...+..|.|+|.|+|++.++++..++.+++||+|+++++
T Consensus 82 ~~~~~l~lP~~~~~s~~~~~~~i~v~Y~l~v~~~~~~~~~~~~~~~PI~I~~~p~ 136 (136)
T PF02752_consen 82 EFNIQLQLPSNLPPSTSTNSRLIQVEYQLEVTVKLSGCTSDLRLELPITIGSSPC 136 (136)
T ss_dssp EEEEEE-----B-----CGGGSEEEEEEEEEEEEEETTSEEEEEEEEEEEEB-SS
T ss_pred EEEEEEcCCCccCcccccCCcEEEEEEEEEEEEEECCceeEEEEEccEEEEecCC
Confidence 22 333347888999999999999987789999999999998753
No 4
>PF00339 Arrestin_N: Arrestin (or S-antigen), N-terminal domain; InterPro: IPR011021 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ]. The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin. The N-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=99.61 E-value=1.5e-14 Score=121.44 Aligned_cols=122 Identities=21% Similarity=0.334 Sum_probs=77.3
Q ss_pred EEEecceeeEecCCCceeeeeEEEEe-cceeeceeEEEEEEEeeecccccccccCccccceEEEEEEEEecCCC-CCCCC
Q psy2082 18 TLYMGKRDFVDYVTGVEPIDGVVVFE-EDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSE-KQHTS 95 (314)
Q Consensus 18 t~yl~~rd~~d~~~~vd~v~GvV~vd-~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~-~~~~~ 95 (314)
.++|.+.+-+-+-+ |.|.|.|+++ .+.++.+.|.+++.|..++.-.+....+..+.+ +..+... .....
T Consensus 2 ~I~ld~~~~~y~~G--e~I~G~V~l~~~~~~~i~~i~v~l~G~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~ 72 (149)
T PF00339_consen 2 EIELDNPKPVYFPG--EVISGKVVLELSKPIKIKSIKVRLKGRAKTKWSESKSSGSTFRK-------QTTPKVQYSEKKE 72 (149)
T ss_dssp EEEES-SEEEEESS----EEEEEEECTTT-TTTSEEEEEEEEEEEESSSSTTSTTCEEEE-------EEESTSSS-SSSS
T ss_pred EEEECCCCCEECCC--CEEEEEEEEEECCccceeEEEEEEEEEEEEEecCCCcceeeeee-------EEeccccccccee
Confidence 46666666666666 9999999999 889999999999999998755544433433333 3333332 12234
Q ss_pred cchhhhhhhc--------cCCCeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCccccc
Q psy2082 96 LTKMQDCLLK--------KLGPKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHRRS 161 (314)
Q Consensus 96 lt~lq~~l~~--------k~G~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K~~ 161 (314)
+...+..+.. .+|.|.|||+|+||.+||+||+ |.+|+|+|.|+|.+ +++|+.+
T Consensus 73 ~~~~~~~l~~~~~~~~~l~~G~~~fpF~f~LP~~lP~S~~--------~~~g~I~Y~l~a~l-----~~~~~~~ 133 (149)
T PF00339_consen 73 YFDHESQLWGSEDGPNILPPGEYEFPFEFQLPSNLPSSFE--------GSHGSIRYKLKATL-----DRPGKKD 133 (149)
T ss_dssp SSHHHHHHHHH--------C-TTEEEEEE---TTS--SEE--------EE-SEEEEEEEEEE-----SSTTSE-
T ss_pred eccceeEeeeeccceecccCCCEEEEEEEECCCCCCceEe--------ccCcCEEEEEEEEE-----ECCCCCC
Confidence 4444444433 3899999999999999999999 88889999999997 5566555
No 5
>PF03643 Vps26: Vacuolar protein sorting-associated protein 26 ; InterPro: IPR005377 The movement of lipid and protein components between intracellular organelles requires the regulated interactions of many molecules. Vacuolar protein sorting-associated protein (Vps)5 is a yeast protein that is a subunit of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. Sorting nexin (SNX) 1 and SNX2 are its mammalian orthologs []. To carry out its biological functions, Vps5 forms the retromer complex with at least four other proteins: Vps17, Vps26, Vps29, and Vps35 []. This family of Vps26-proteins also contains Down syndrome critical region 3/A.; GO: 0007034 vacuolar transport, 0030904 retromer complex; PDB: 3LHA_A 3LH9_A 2R51_A 3LH8_B 2FAU_A.
Probab=98.44 E-value=0.00018 Score=67.97 Aligned_cols=209 Identities=17% Similarity=0.213 Sum_probs=111.6
Q ss_pred CceeeeeEEEEe-cc--eeeceeEEEEEEEeeecccccccccCccccceEEEEEEEEecCCCCCCCCcchhhhhhhccCC
Q psy2082 32 GVEPIDGVVVFE-ED--YIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSEKQHTSLTKMQDCLLKKLG 108 (314)
Q Consensus 32 ~vd~v~GvV~vd-~~--~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~~~~~~lt~lq~~l~~k~G 108 (314)
.-|.|.|+|.|. +. .++-..|.+++.+.+.-=.+... .-+++-.+.++=|+.. |+ .|
T Consensus 37 ~gE~V~G~V~I~~~~gk~~~H~GI~l~lvG~ie~~~~~~k------~~~f~~~~~eL~~~G~-----l~---------~~ 96 (275)
T PF03643_consen 37 DGETVSGKVVITSKPGKSLEHQGIKLELVGQIEAFYDSGK------PIEFLSLSIELAPPGK-----LP---------EG 96 (275)
T ss_dssp TC--EEEEEEEEESSTS-EEES-EEEEEEEEEEEGCCTT-------EEEEEEEEEEEE-SEE-----E----------S-
T ss_pred CCCEEEEEEEEEECCCCceEEeeEEEEEEEeEeEeccCCC------ceEeEEeeEEEcCCcc-----cC---------CC
Confidence 349999999999 44 57777889988877631111111 1134445666645431 22 56
Q ss_pred CeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCccccc--eEEEeeeeccCCCCCCCCCCeEEEe
Q psy2082 109 PKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHRRS--TVALGIRKIQYAPSKNGRQPCTVVR 186 (314)
Q Consensus 109 ~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K~~--~v~l~irkl~~~p~~~~~~p~~e~~ 186 (314)
.. |||.|.+-+.-=.|+. |.+-.|+|.++|.+. |.++.- +..+.+......|.... +-+.++
T Consensus 97 ~t-~pFeF~~~~k~yETY~--------G~~v~i~Y~lrv~v~-----R~~~~i~k~~ef~V~~~~~~p~~~~-~ik~ev- 160 (275)
T PF03643_consen 97 KT-FPFEFPLVEKPYETYH--------GVNVNIRYFLRVTVK-----RSYKDISKEQEFWVQNFSITPESNQ-PIKMEV- 160 (275)
T ss_dssp EE-EEEEE-SB---S--EE---------SSEEEEEEEEEEE-------SSS-EEEEEEEEEE-EB---------EEEEE-
T ss_pred cE-EeeEeCCCCCCCccEe--------eeEEEEEEEEEEEEE-----ccCCCcceEEEEEEEeccCCCCCCC-Cccccc-
Confidence 66 9999987332224555 999999999999974 444211 23455554444443321 111121
Q ss_pred EEEEeeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEecCeEE--EEEEEEeec------
Q psy2082 187 KDFMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQNGQFR--HSVDCVETH------ 258 (314)
Q Consensus 187 k~f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~~~~~--~~Va~~E~~------ 258 (314)
=-.+-++++..++|.-|.=.+.|.=.+.+.-.. ..|+.|.+.|++..++-. ..+..+ .++...|.-
T Consensus 161 ----gie~~lhief~~~k~~~~l~d~i~G~i~f~lv~-~kIk~~elqLiR~Et~g~-~~~~~~e~t~i~~~eImDG~p~r 234 (275)
T PF03643_consen 161 ----GIEDCLHIEFEYDKSKYHLKDVITGKIYFLLVR-IKIKSMELQLIRVETCGC-GENYAKESTEIQKIEIMDGAPCR 234 (275)
T ss_dssp ----CETTTEEEEEEES-SEEETT-EEEEEEEEEEES-S-EEEEEEEEEEEEEECE-CCCEEEEEEEEEEEEEESS---T
T ss_pred ----CCCccEEEEEEEcccceECCCCEEEEEEEEEEe-ecceEEEEEEEEEEEEec-CCcccccceEEEEEEeecCCccc
Confidence 134678999999999999999987666654333 669999999998876533 222222 245554432
Q ss_pred ---------c----CCCCCCC-ceeEEEEEEEEEEEEE
Q psy2082 259 ---------L----ASPDPKD-AFGIVVSYMVKVKLYL 282 (314)
Q Consensus 259 ---------L----~~~~~~~-~~~I~VsY~vkV~l~v 282 (314)
| .-|+..+ +....|+|+|.+-+..
T Consensus 235 ge~IPirl~l~~l~l~Pt~~~~~~~FsV~y~lnlvlid 272 (275)
T PF03643_consen 235 GESIPIRLFLPRLFLCPTYKNVNNKFSVEYELNLVLID 272 (275)
T ss_dssp T-EEEEEEECCCT-----EEEECTTEEEEEEEEEEEEE
T ss_pred cceeeEEEEcCCcccCCcchhcCCcEEEEEEEEEEEEc
Confidence 1 2233333 3358999999987654
No 6
>PF13002 LDB19: Arrestin_N terminal like; InterPro: IPR024391 This entry represents a predicted Ig-like beta sandwich domain found towards the N terminus of protein LDB19 []. It is also found in other sequences and is related to the arrestin N-terminal fold [].
Probab=98.16 E-value=4.9e-05 Score=67.79 Aligned_cols=123 Identities=19% Similarity=0.296 Sum_probs=78.8
Q ss_pred ccCCCeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEccCC---ccCccccceEEE----ee-eeccCCCCC
Q psy2082 105 KKLGPKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVGENE---TDRSHRRSTVAL----GI-RKIQYAPSK 176 (314)
Q Consensus 105 ~k~G~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~---~dr~~K~~~v~l----~i-rkl~~~p~~ 176 (314)
...|.|+|||++-||-+||+|..| |. +..+.|.|+++|...... ...+-|...+.+ .| |-| +
T Consensus 47 l~~G~h~fPFS~LiPG~LPaS~~l--gs---~~l~~I~Yel~A~a~~~~~~~~~~~~~~~~~~~~~pl~V~Rsi---~-- 116 (191)
T PF13002_consen 47 LTKGSHAFPFSYLIPGHLPASMDL--GS---TPLVSIKYELKAEATYKDPRRGSSSSKPRVLKLKRPLPVKRSI---L-- 116 (191)
T ss_pred cCCCcccCCeeEECCCCCcccccc--CC---CCcEEEEEEEEEEEEEccCccccCCCcceeEEEeeeEEEEEec---C--
Confidence 448999999999999999999986 21 457899999999975410 001122221221 11 111 1
Q ss_pred CCCCCeEEEeEEEEeeCCcEEEEEEECCccccCCCeEEEEEEEEccccce----eEEEEEEEEEeEEEEE
Q psy2082 177 NGRQPCTVVRKDFMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKV----VKKIKAMVQQGVDVVL 242 (314)
Q Consensus 177 ~~~~p~~e~~k~f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~----Vk~Ikv~L~Q~~~~~~ 242 (314)
++ ++.. .--.|-.-.|...|.||. +.+|--+.+|++.++|-++.+ ++++.=+|..++.+.+
T Consensus 117 ~g-pd~~---S~RvFPPT~l~a~a~lP~-VI~P~gtfpvel~LdGv~~~~~rWRlrKltWRiEE~~kv~a 181 (191)
T PF13002_consen 117 PG-PDKN---SLRVFPPTNLTASAVLPN-VIHPKGTFPVELRLDGVVSKDRRWRLRKLTWRIEETIKVRA 181 (191)
T ss_pred CC-CCcc---cEEecCCCCcEEEEEcCC-eeCCCCcccEEEEEecccCCCCEEEEEeeeEEEeeEEEEEc
Confidence 11 1111 112466677899999996 556777799999999998765 5566666666655544
No 7
>PF08737 Rgp1: Rgp1; InterPro: IPR014848 Rgp1 forms heterodimer with Ric1 (IPR009771 from INTERPRO) which associates with Golgi membranes and functions as a guanyl-nucleotide exchange factor [].
Probab=97.17 E-value=0.077 Score=53.07 Aligned_cols=49 Identities=20% Similarity=0.233 Sum_probs=41.9
Q ss_pred eCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEE
Q psy2082 192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDV 240 (314)
Q Consensus 192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~ 240 (314)
.+|..-..++|.|..|.-||+|...++.++.+...+-.+.+.|.-...+
T Consensus 300 ~n~~~va~~~LsK~~yrlGE~I~g~idf~~~~~~~c~~v~~~LEs~E~v 348 (415)
T PF08737_consen 300 RNGQRVARLSLSKPAYRLGEDIVGTIDFNDASTIPCYQVSASLESEETV 348 (415)
T ss_pred ECCeEEEEEEecCCCcccCCeEEEEEEcCCCCcceeEEEEEEEEEEEEe
Confidence 6788889999999999999999999999999866677788888765543
No 8
>PF00339 Arrestin_N: Arrestin (or S-antigen), N-terminal domain; InterPro: IPR011021 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ]. The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin. The N-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=95.28 E-value=0.025 Score=46.93 Aligned_cols=100 Identities=18% Similarity=0.227 Sum_probs=58.3
Q ss_pred EEEEC--CccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEec---CeEEEE-EE-------------------
Q psy2082 199 EVTLD--KQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQN---GQFRHS-VD------------------- 253 (314)
Q Consensus 199 ~asld--K~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~---~~~~~~-Va------------------- 253 (314)
++.|| +..|.+||.|.=+|.+...+...++.|++.|.=.+.+..... ...... ..
T Consensus 2 ~I~ld~~~~~y~~Ge~I~G~V~l~~~~~~~i~~i~v~l~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 81 (149)
T PF00339_consen 2 EIELDNPKPVYFPGEVISGKVVLELSKPIKIKSIKVRLKGRAKTKWSESKSSGSTFRKQTTPKVQYSEKKEYFDHESQLW 81 (149)
T ss_dssp EEEES-SEEEEESS--EEEEEEECTTT-TTTSEEEEEEEEEEEESSSSTTSTTCEEEEEEESTSSS-SSSSSSHHHHHHH
T ss_pred EEEECCCCCEECCCCEEEEEEEEEECCccceeEEEEEEEEEEEEEecCCCcceeeeeeEEecccccccceeeccceeEee
Confidence 44555 999999999999999988888999999999987765543211 111100 00
Q ss_pred -------EEeec---------c---CCCCCCCceeEEEEEEEEEEEEECCCCceeEEEEeEEEecC
Q psy2082 254 -------CVETH---------L---ASPDPKDAFGIVVSYMVKVKLYLGALGGELSAELPFILMHP 300 (314)
Q Consensus 254 -------~~E~~---------L---~~~~~~~~~~I~VsY~vkV~l~v~~~~~~v~velPi~I~~p 300 (314)
.++.+ | .|++..... -.|.|.|++.+..++ ..+...+-+|++..+
T Consensus 82 ~~~~~~~~l~~G~~~fpF~f~LP~~lP~S~~~~~-g~I~Y~l~a~l~~~~-~~~~~~~~~~~v~~~ 145 (149)
T PF00339_consen 82 GSEDGPNILPPGEYEFPFEFQLPSNLPSSFEGSH-GSIRYKLKATLDRPG-KKDHKAKREFTVVEP 145 (149)
T ss_dssp HH--------C-TTEEEEEE---TTS--SEEEE--SEEEEEEEEEESSTT-SE--CGGEEEEEEEE
T ss_pred eeccceecccCCCEEEEEEEECCCCCCceEeccC-cCEEEEEEEEEECCC-CCCcEEEEEEEEECc
Confidence 00000 2 233332222 389999999886543 567778888887654
No 9
>PF07070 Spo0M: SpoOM protein; InterPro: IPR009776 This family consists of several bacterial SpoOM proteins which are thought to control sporulation in Bacillus subtilis.Spo0M exerts certain negative effects on sporulation and its gene expression is controlled by sigmaH [].
Probab=94.86 E-value=0.39 Score=44.03 Aligned_cols=91 Identities=19% Similarity=0.308 Sum_probs=59.7
Q ss_pred cCCCceeeeeEEEEe--cceeeceeEEEEEEEeeecccccccccCccccceEEEEEEEEecCCCCCCCCcchhhhhhhcc
Q psy2082 29 YVTGVEPIDGVVVFE--EDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSEKQHTSLTKMQDCLLKK 106 (314)
Q Consensus 29 ~~~~vd~v~GvV~vd--~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~~~~~~lt~lq~~l~~k 106 (314)
++..-+.|.|.|.|. ...-+=.+|++.|.+.+....+|.+ ++.+.-+.+.++ .+ ....+
T Consensus 23 ~~~pGe~v~G~V~i~GG~v~Q~I~~I~l~L~t~~~~e~~d~~-----~~~~~~~~~~~v---~~-----------~f~I~ 83 (218)
T PF07070_consen 23 SVRPGETVRGEVHIKGGSVDQEIDRIYLELVTRYEVESDDKE-----YTQEVELARVRV---SG-----------PFTIE 83 (218)
T ss_pred CccCCCEEEEEEEEEeCCcceEEeEEEEEEEEEEEEecCCCe-----EEEEEEEEEEEe---CC-----------CEEEC
Confidence 344558999999998 3444567888888887765544443 333444444444 21 11122
Q ss_pred CC-CeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEc
Q psy2082 107 LG-PKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVG 150 (314)
Q Consensus 107 ~G-~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~ 150 (314)
+| .+.+||+|+||.++|-|. ++.+|.++..++
T Consensus 84 ~ge~~~iPF~~~lP~etPiT~------------~~~~v~l~T~Ld 116 (218)
T PF07070_consen 84 PGEEKEIPFSFPLPWETPITE------------GGMRVWLRTGLD 116 (218)
T ss_pred CCCEEEEeEEEECCCCCCccC------------CCcEEEEEEEEE
Confidence 44 589999999999999886 366777777764
No 10
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=88.76 E-value=1 Score=35.20 Aligned_cols=27 Identities=19% Similarity=0.402 Sum_probs=22.5
Q ss_pred EEEEECCccccCCCeEEEEEEEEcccc
Q psy2082 198 LEVTLDKQLYAHNEKIAVSICIRNNSN 224 (314)
Q Consensus 198 l~asldK~~Y~pGE~I~v~v~I~N~Ss 224 (314)
+-+..||.+|.|||+|.+.+-+.+..+
T Consensus 2 ~~i~TDr~iYrPGetV~~~~~~~~~~~ 28 (99)
T PF01835_consen 2 IFIQTDRPIYRPGETVHFRAIVRDLDN 28 (99)
T ss_dssp EEEEESSSEE-TTSEEEEEEEEEEECT
T ss_pred EEEECCccCcCCCCEEEEEEEEecccc
Confidence 346789999999999999999888873
No 11
>KOG3063|consensus
Probab=88.25 E-value=22 Score=33.24 Aligned_cols=211 Identities=18% Similarity=0.246 Sum_probs=102.8
Q ss_pred eeeceeEEEEEEEeeecccccccccCccccceEEEEEEEEecCCCCCCCCcchhhhhhhccCCC----eeEeEEeeCCCC
Q psy2082 46 YIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSEKQHTSLTKMQDCLLKKLGP----KAYPFTFNITPA 121 (314)
Q Consensus 46 ~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~~~~~~lt~lq~~l~~k~G~----h~fPF~f~LP~~ 121 (314)
+-.|..|-+.+..+.+=|. -+|-.|+. --|+.+..+|-..+ +....+.+-..|. .||+ .+|||.|.
T Consensus 42 f~dgEtv~G~V~l~lk~gk-kleH~Gik---iefiGqIe~~~drg-n~~eF~~lv~eLa-~pGel~~~~~fpFeF~---- 111 (301)
T KOG3063|consen 42 FYDGETVSGKVNLRLKDGK-KLEHQGIK---IEFIGQIEMYYDRG-NFHEFTSLVRELA-RPGELTQSQSFPFEFP---- 111 (301)
T ss_pred EecCCeeeeEEEEEEcCCc-ccccCceE---EEEEEEEEEEecCC-cHHHHHHHHHhhc-CCcceeecccCCcccc----
Confidence 3344555555555554333 22333322 12333444433332 1223343333333 3664 67888873
Q ss_pred CCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCccccceEEEeeeeccCCCCCCCCCCeEEEeEEEEeeCCcEEEEEE
Q psy2082 122 APPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHRRSTVALGIRKIQYAPSKNGRQPCTVVRKDFMLSPGDLELEVT 201 (314)
Q Consensus 122 lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K~~~v~l~irkl~~~p~~~~~~p~~e~~k~f~~s~G~l~l~as 201 (314)
+++ .|-..+.|+.-.+||.+||++.+...|-..- ..+++..+.-.|.... +-..++ + -..-+|++..
T Consensus 112 ---~ve-kpyEsY~G~NV~lrY~lkvTv~Rr~~di~ke---~d~~V~~~~~~P~~nn-~IkmeV--G---IedCLHIEFE 178 (301)
T KOG3063|consen 112 ---HVE-KPYESYIGKNVRLRYFLKVTVSRRLTDIVKE---KDLVVHNLSTYPEINN-SIKMEV--G---IEDCLHIEFE 178 (301)
T ss_pred ---ccc-cchhhhcCcceEEEEEEEEEEEechhhhhhh---hheeeEecccCCCCCC-ceeEee--c---hhhceEEEEE
Confidence 222 2333577999999999999987765433222 2244444444454432 112222 1 1223678888
Q ss_pred ECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEecCeEE--EEEEEEeec--------------------c
Q psy2082 202 LDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQNGQFR--HSVDCVETH--------------------L 259 (314)
Q Consensus 202 ldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~~~~~--~~Va~~E~~--------------------L 259 (314)
-.|+-|.--|.|-=.+-.- .-.-.|+.+.++++.....-. ...++. .++|+-|.- |
T Consensus 179 YnKskYhLkdvIvGkIYFl-LvRikIk~Mel~iikrEstG~-gpn~~~e~eTiakyeIMDGapvrGEsIPiRlFLagYdl 256 (301)
T KOG3063|consen 179 YNKSKYHLKDVIVGKIYFL-LVRIKIKHMELSIIKRESTGT-GPNTYVETETIAKYEIMDGAPVRGESIPIRLFLAGYDL 256 (301)
T ss_pred ecccccchhheEEeeEEEE-EEEEEeeeeEEEEEEeecccC-CCcceeccceeeeEEeccCCCcCCCeeeeEEEecccCC
Confidence 8888887666653221110 001234555555554432211 111222 366665432 3
Q ss_pred CCCCCCCceeEEEEEEEEEEEE
Q psy2082 260 ASPDPKDAFGIVVSYMVKVKLY 281 (314)
Q Consensus 260 ~~~~~~~~~~I~VsY~vkV~l~ 281 (314)
+|.+..-|.-..|.|.|..-|.
T Consensus 257 TPtmrdinkkFsVkyyLnLVlv 278 (301)
T KOG3063|consen 257 TPTMRDINKKFSVKYYLNLVLV 278 (301)
T ss_pred CcchhhhcceeeeeeEEEEEEE
Confidence 5555434556888998876554
No 12
>KOG2717|consensus
Probab=88.04 E-value=23 Score=33.13 Aligned_cols=126 Identities=10% Similarity=0.160 Sum_probs=70.9
Q ss_pred cCCCeeEeEEeeCCC-CCCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCccccceEEEeeeeccCCCCCCC-CCCeE
Q psy2082 106 KLGPKAYPFTFNITP-AAPPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHRRSTVALGIRKIQYAPSKNG-RQPCT 183 (314)
Q Consensus 106 k~G~h~fPF~f~LP~-~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K~~~v~l~irkl~~~p~~~~-~~p~~ 183 (314)
++|...+||.|.|-. +=|--+- ..+.|-+-.|+|.+.+-+.++-.-|+.-+ +.++.+ ++.|-..+ .+|..
T Consensus 85 p~G~tEipFelpL~~kge~~~lY----ETyHGvfiNiqY~LtcdikR~~L~K~ltk-t~eFiv---~s~pv~l~e~~p~i 156 (313)
T KOG2717|consen 85 PPGTTEIPFELPLREKGEGEKLY----ETYHGVFINIQYLLTCDIKRGYLHKPLTK-TMEFIV---ESGPVDLPERPPEI 156 (313)
T ss_pred CCCceeeeeeeeeccCCCccEee----eeecceEEEEEEEEEEecccchhcCchhh-hheeee---ccCCcccccCCCcc
Confidence 389999999987732 1221121 12559999999999999877644443322 122222 22221100 00000
Q ss_pred -------EE-eEEEE--eeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEE
Q psy2082 184 -------VV-RKDFM--LSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDV 240 (314)
Q Consensus 184 -------e~-~k~f~--~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~ 240 (314)
++ .+... .+-+..-+.-.||++-++--+++.=.+.|+- |...|++|.+.|.+..+.
T Consensus 157 V~F~itpdtlq~~~ker~~~p~FlvtG~Ld~t~c~~t~PltGeltVe~-seaaI~Sie~qLvRVEtc 222 (313)
T KOG2717|consen 157 VIFYITPDTLQHPLKERIKTPGFLVTGKLDATQCSLTDPLTGELTVEA-SEAAITSIEIQLVRVETC 222 (313)
T ss_pred eEEEEChHHhhccchhhccCCceEEEeeecceeeEecCCccceEEEEe-eccceeEEEEEEEEEEEe
Confidence 00 00000 2234455666777777777777766666664 467899999999987653
No 13
>KOG3780|consensus
Probab=87.72 E-value=9.8 Score=37.49 Aligned_cols=100 Identities=19% Similarity=0.294 Sum_probs=68.6
Q ss_pred EEEEECCc--cccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEec------------------CeEE--EE----
Q psy2082 198 LEVTLDKQ--LYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQN------------------GQFR--HS---- 251 (314)
Q Consensus 198 l~asldK~--~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~------------------~~~~--~~---- 251 (314)
+++.+|+. +|.+||+|.=++.+.+.....++.|++.+.=.++...... ..|+ ..
T Consensus 7 ~~i~~d~~~~iy~~G~~vsG~v~l~~~~~~~~~~i~l~~~G~~~t~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~ 86 (427)
T KOG3780|consen 7 FEIVLDNPEAIYFPGEPVSGSVVLSTKEPIKVRAIKLQLKGRARTSWSESERGTKLNSKSEGSIKSSTVNYTAKETYLDS 86 (427)
T ss_pred EEEEeCCCccccCCCCeEEEEEEEEeCCccceeEEEEEEEEeEEEeecccccccccccccccccccceEEeeceEEEeee
Confidence 45566666 6999999999999999999999999999987765543210 0011 00
Q ss_pred ---EEEE----ee-----c---------c---CCCCCCCceeEEEEEEEEEEEEECCCCceeEEEEeEEEec
Q psy2082 252 ---VDCV----ET-----H---------L---ASPDPKDAFGIVVSYMVKVKLYLGALGGELSAELPFILMH 299 (314)
Q Consensus 252 ---Va~~----E~-----~---------L---~~~~~~~~~~I~VsY~vkV~l~v~~~~~~v~velPi~I~~ 299 (314)
+... .. + | .|++.....| .|.|.|++.+.-.+ ..+-..+-+|++..
T Consensus 87 ~~~~~~~~~g~~~~~l~~G~~~~pF~~~LP~~~P~Sfeg~~G-~irY~vk~~idr~~-~~~~~~~~~~~V~~ 156 (427)
T KOG3780|consen 87 KTILWTSSNGSNSRVLPPGNYEFPFSFTLPLNLPPSFEGKFG-HVRYFVKAEIDRPW-KLNKKNRKPFTVIE 156 (427)
T ss_pred eeEEeeccCCCCceecCCCceEEeEeccCCCCCCCceeeCCc-eEEEEEEEEEecCC-CCCccceeeEEEec
Confidence 0000 00 0 2 3556666678 88999999888765 66778888888665
No 14
>PF01345 DUF11: Domain of unknown function DUF11; InterPro: IPR001434 This group of sequences is represented by a conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis (Streptococcus faecalis), and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydia trachomatis outer membrane proteins. In C. trachomatis, three cysteine-rich proteins (also believed to be lipoproteins), MOMP, OMP6 and OMP3, make up the extracellular matrix of the outer membrane []. They are involved in the essential structural integrity of both the elementary body (EB) and recticulate body (RB) phase. They are thought to be involved in porin formation and, as these bacteria lack the peptidoglycan layer common to most Gram-negative microbes, such proteins are highly important in the pathogenicity of the organism.; GO: 0005727 extrachromosomal circular DNA
Probab=85.76 E-value=3.2 Score=30.95 Aligned_cols=42 Identities=12% Similarity=0.202 Sum_probs=36.8
Q ss_pred eCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEE
Q psy2082 192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAM 233 (314)
Q Consensus 192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~ 233 (314)
....+.+.-+.++....+||.+...+.+.|..+.....+.+.
T Consensus 22 ~~~~~~~~k~~~~~~~~~Gd~v~ytitvtN~G~~~a~nv~v~ 63 (76)
T PF01345_consen 22 AIPDLSITKTVNPSTANPGDTVTYTITVTNTGPAPATNVVVT 63 (76)
T ss_pred CCCCEEEEEecCCCcccCCCEEEEEEEEEECCCCeeEeEEEE
Confidence 456788888899999999999999999999999998887654
No 15
>PF00207 A2M: Alpha-2-macroglobulin family; InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0004866 endopeptidase inhibitor activity; PDB: 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 2PN5_A 3FRP_G 3HRZ_B ....
Probab=84.98 E-value=2.9 Score=32.53 Aligned_cols=41 Identities=10% Similarity=0.223 Sum_probs=29.1
Q ss_pred eCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082 192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMV 234 (314)
Q Consensus 192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L 234 (314)
...|+.++..+|+ ....||.+.+.+.|.|+..+.++ ++++|
T Consensus 52 v~~p~~i~~~lP~-~l~~GD~~~i~v~v~N~~~~~~~-v~V~l 92 (92)
T PF00207_consen 52 VFKPFFIQLNLPR-SLRRGDQIQIPVTVFNYTDKDQE-VTVTL 92 (92)
T ss_dssp EB-SEEEEEE--S-EEETTSEEEEEEEEEE-SSS-EE-EEEEE
T ss_pred EEeeEEEEcCCCc-EEecCCEEEEEEEEEeCCCCCEE-EEEEC
Confidence 3468889999998 55799999999999999988865 45543
No 16
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=83.52 E-value=2.5 Score=29.87 Aligned_cols=32 Identities=22% Similarity=0.340 Sum_probs=27.9
Q ss_pred ECCccccCCCeEEEEEEEEccccceeEEEEEE
Q psy2082 202 LDKQLYAHNEKIAVSICIRNNSNKVVKKIKAM 233 (314)
Q Consensus 202 ldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~ 233 (314)
.|+....+||.|..++.|.|........+.+.
T Consensus 3 ~d~~~~~~Gd~v~Yti~v~N~g~~~a~~v~v~ 34 (53)
T TIGR01451 3 VDKTVATIGDTITYTITVTNNGNVPATNVVVT 34 (53)
T ss_pred cCccccCCCCEEEEEEEEEECCCCceEeEEEE
Confidence 57888999999999999999999888876654
No 17
>PF03643 Vps26: Vacuolar protein sorting-associated protein 26 ; InterPro: IPR005377 The movement of lipid and protein components between intracellular organelles requires the regulated interactions of many molecules. Vacuolar protein sorting-associated protein (Vps)5 is a yeast protein that is a subunit of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. Sorting nexin (SNX) 1 and SNX2 are its mammalian orthologs []. To carry out its biological functions, Vps5 forms the retromer complex with at least four other proteins: Vps17, Vps26, Vps29, and Vps35 []. This family of Vps26-proteins also contains Down syndrome critical region 3/A.; GO: 0007034 vacuolar transport, 0030904 retromer complex; PDB: 3LHA_A 3LH9_A 2R51_A 3LH8_B 2FAU_A.
Probab=82.05 E-value=25 Score=33.41 Aligned_cols=90 Identities=18% Similarity=0.287 Sum_probs=46.2
Q ss_pred cccCCCeEEEEEEEEccccce--eEEEEEEEEEeEEEEEE-ecCe-EE--EEEEEE--------------eeccCCCCCC
Q psy2082 206 LYAHNEKIAVSICIRNNSNKV--VKKIKAMVQQGVDVVLF-QNGQ-FR--HSVDCV--------------ETHLASPDPK 265 (314)
Q Consensus 206 ~Y~pGE~I~v~v~I~N~Ssk~--Vk~Ikv~L~Q~~~~~~~-~~~~-~~--~~Va~~--------------E~~L~~~~~~ 265 (314)
+|.+||+|.=.|.|+..++|. =.+|++.|.=. +.++ ..++ +. +....+ |..+.+....
T Consensus 34 iY~~gE~V~G~V~I~~~~gk~~~H~GI~l~lvG~--ie~~~~~~k~~~f~~~~~eL~~~G~l~~~~t~pFeF~~~~k~yE 111 (275)
T PF03643_consen 34 IYSDGETVSGKVVITSKPGKSLEHQGIKLELVGQ--IEAFYDSGKPIEFLSLSIELAPPGKLPEGKTFPFEFPLVEKPYE 111 (275)
T ss_dssp EEETC--EEEEEEEEESSTS-EEES-EEEEEEEE--EEEGCCTT-EEEEEEEEEEEE-SEEE-S-EEEEEEE-SB---S-
T ss_pred eEcCCCEEEEEEEEEECCCCceEEeeEEEEEEEe--EeEeccCCCceEeEEeeEEEcCCcccCCCcEEeeEeCCCCCCCc
Confidence 789999998888888777555 45677666433 3333 2221 11 111111 1112222233
Q ss_pred Ccee--EEEEEEEEEEEEECCCCceeEEEEeEEEec
Q psy2082 266 DAFG--IVVSYMVKVKLYLGALGGELSAELPFILMH 299 (314)
Q Consensus 266 ~~~~--I~VsY~vkV~l~v~~~~~~v~velPi~I~~ 299 (314)
.-.| +.+.|.|+|.+.-+. ++++-+.+|.+-.
T Consensus 112 TY~G~~v~i~Y~lrv~v~R~~--~~i~k~~ef~V~~ 145 (275)
T PF03643_consen 112 TYHGVNVNIRYFLRVTVKRSY--KDISKEQEFWVQN 145 (275)
T ss_dssp -EE-SSEEEEEEEEEEE--SS--S-EEEEEEEEEE-
T ss_pred cEeeeEEEEEEEEEEEEEccC--CCcceEEEEEEEe
Confidence 3444 899999999776654 8999999998764
No 18
>PF02752 Arrestin_C: Arrestin (or S-antigen), C-terminal domain; InterPro: IPR011022 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ]. The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin. The C-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=77.99 E-value=33 Score=27.29 Aligned_cols=106 Identities=14% Similarity=0.173 Sum_probs=45.2
Q ss_pred EEEEecceeeEecCCCceeeeeEEEEe-cceeeceeEEEEEEEeeecccccccccCccccceEEEEEEEEecCCCCCCCC
Q psy2082 17 LTLYMGKRDFVDYVTGVEPIDGVVVFE-EDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSEKQHTS 95 (314)
Q Consensus 17 ~t~yl~~rd~~d~~~~vd~v~GvV~vd-~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~~~~~~ 95 (314)
+++-|.++=|. .-|.|..-+-|+ ....+=++|.++|.=..+|...+... .-. +..-.+.. +....-
T Consensus 7 ~~~~i~~~~~~----~Ge~i~v~v~i~n~s~~~i~~I~v~L~~~~~~~~~~~~~-~~~-~~~~~v~~-~~~~~~------ 73 (136)
T PF02752_consen 7 LSISIPRTAYV----PGETIPVNVEIDNQSKKKIKKIKVSLVERITYKAKGGKD-ESK-SEKRVVAK-SKNCGV------ 73 (136)
T ss_dssp EEEEES-SEEE----TT--EEEEEEEEE-SSSEEEEEEEEEEEEEEE-SS-----S-E-EEEEEEEE-EECCEB------
T ss_pred EEEEECCCEEC----CCCEEEEEEEEEECCCCEEEEEEEEEEEEEEEEEeeccc-cce-EEEEEEEE-EecCCc------
Confidence 45556666654 336777777777 44445566777665555553322210 000 00011111 111100
Q ss_pred cchhhhhhhccCCCeeEe--EEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEc
Q psy2082 96 LTKMQDCLLKKLGPKAYP--FTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVG 150 (314)
Q Consensus 96 lt~lq~~l~~k~G~h~fP--F~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~ 150 (314)
...+...|- ..|+||.++|+|.. ..|..-.|.|.|++.+.
T Consensus 74 ---------~~~~~~~~~~~~~l~lP~~~~~s~~------~~~~~i~v~Y~l~v~~~ 115 (136)
T PF02752_consen 74 ---------DPGSSGSFEFNIQLQLPSNLPPSTS------TNSRLIQVEYQLEVTVK 115 (136)
T ss_dssp ----------B-TTEEEEEEEEE-----B-----------CGGGSEEEEEEEEEEEE
T ss_pred ---------cCCCCceEEEEEEEcCCCccCcccc------cCCcEEEEEEEEEEEEE
Confidence 112334444 88999988887773 12889999999999974
No 19
>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds. Members of the transglutaminase family usually have three domains: N-terminal (IPR001102 from INTERPRO), middle (IPR013808 from INTERPRO) and C-terminal. The middle domain is usually well conserved, but family members can display major differences in their N- and C-terminal domains, although their overall structure is conserved []. This entry represents the C-terminal domain found in transglutaminases, which consists of an immunoglobulin-like beta-sandwich consisting of seven strands in two sheets with a Greek key topology. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ].; GO: 0003810 protein-glutamine gamma-glutamyltransferase activity, 0018149 peptide cross-linking; PDB: 2XZZ_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B 1L9N_B ....
Probab=73.48 E-value=9.6 Score=30.23 Aligned_cols=38 Identities=16% Similarity=0.284 Sum_probs=30.4
Q ss_pred EEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082 197 ELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ 235 (314)
Q Consensus 197 ~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~ 235 (314)
++++.++.... -|+.+.+.+.+.|.|+..++.+++.|.
T Consensus 2 ~~~i~~~~~~~-vG~d~~v~v~~~N~~~~~l~~v~~~l~ 39 (107)
T PF00927_consen 2 EIKIKLPGDPV-VGQDFTVSVSFTNPSSEPLRNVSLNLC 39 (107)
T ss_dssp EEEEEEESEEB-TTSEEEEEEEEEE-SSS-EECEEEEEE
T ss_pred eEEEEECCCcc-CCCCEEEEEEEEeCCcCccccceeEEE
Confidence 46666766665 799999999999999999999998884
No 20
>PF04425 Bul1_N: Bul1 N terminus; InterPro: IPR007519 This domain is the N terminus of Saccharomyces cerevisiae (Baker's yeast) Bul1. Bul1 binds the ubiquitin ligase Rsp5, via an N-terminal PPSY motif (157-160 in P48524 from SWISSPROT) []. The complex containing Bul1 and Rsp5 is involved in intracellular trafficking of the general amino acid permease Gap1 [], degradation of Rog1 in cooperation with Bul2 and GSK-3 [], and mitochondrial inheritance []. Bul1 may contain HEAT repeats. The C terminus is IPR007520 from INTERPRO.
Probab=68.24 E-value=32 Score=34.88 Aligned_cols=118 Identities=17% Similarity=0.107 Sum_probs=0.0
Q ss_pred CCCceeeeeEEEEe---cceeeceeEEEEEEEeeecccc------------------cccccCccccceEEEEEEEEecC
Q psy2082 30 VTGVEPIDGVVVFE---EDYIQDRKVFGQVVCSFRYGRE------------------EDEILGLNFQKELYLASEQIYPR 88 (314)
Q Consensus 30 ~~~vd~v~GvV~vd---~~~lk~rkv~~~~~~~fryG~~------------------d~dv~gl~f~k~l~~~~~qv~p~ 88 (314)
...-|-|.|.|+|. .++++=-=.+|.+.+++--+.. |+.-.+-+-.=+.-.... |.+
T Consensus 160 y~qGD~I~GyvtI~N~S~~pIpFdMFyV~lEG~~~v~~~~~~~~~~~~~~kkFL~M~D~sASws~~~i~~~~~~~--~~~ 237 (438)
T PF04425_consen 160 YTQGDIIHGYVTIENTSSKPIPFDMFYVSLEGTISVVDSKSPSSKKPRTVKKFLRMFDFSASWSYANIDRLVGDN--YCP 237 (438)
T ss_pred ccCCCEEEEEEEEEECCCCCcccceEEEEEEEEEEEcccccccccccHHHHHHHHhhcceecccccccccccccc--cCC
Q ss_pred CCCCCCCcchhh---hhhhccCCCeeEeEEeeCCC---------------CCCCeEEecCCC------------------
Q psy2082 89 SEKQHTSLTKMQ---DCLLKKLGPKAYPFTFNITP---------------AAPPSVTLQPGP------------------ 132 (314)
Q Consensus 89 ~~~~~~~lt~lq---~~l~~k~G~h~fPF~f~LP~---------------~lPsSf~lq~~~------------------ 132 (314)
..-++-.=|-++ +|++.+--.|..+|+|.||. .|||||-+....
T Consensus 238 ~~~Dp~Dgt~lgl~~~r~l~p~~~Yk~fF~FkiP~~LLd~~C~h~~~~H~~lPPs~G~~k~~~~~~~~~i~~n~~lGyg~ 317 (438)
T PF04425_consen 238 GEVDPYDGTYLGLPNKRILEPGVKYKKFFTFKIPEQLLDSACKHNLSSHCLLPPSLGIDKNEFNGKYSSIKINKALGYGH 317 (438)
T ss_pred ccccCCCCeeEeCCCCceecCCCeEeceeEEeCCchhccccccCcccccCcCCCCCCcChhhccchhhhhhcccccCccc
Q ss_pred -----------CCCCCCeEEEEEEEEEE
Q psy2082 133 -----------DETGEPCGVTYYVKLFV 149 (314)
Q Consensus 133 -----------~~~G~~g~V~Y~Vka~i 149 (314)
+..-..-.|.|.|.|.+
T Consensus 318 l~~~gspil~~D~s~~d~SIsY~V~AR~ 345 (438)
T PF04425_consen 318 LGTKGSPILTNDFSFDDTSISYSVEARF 345 (438)
T ss_pred cccCCCcceecccCCCCceEEEEEEEEE
No 21
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=68.08 E-value=5.3 Score=29.99 Aligned_cols=28 Identities=11% Similarity=0.308 Sum_probs=20.5
Q ss_pred cCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082 208 AHNEKIAVSICIRNNSNKVVKKIKAMVQ 235 (314)
Q Consensus 208 ~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~ 235 (314)
.+||++.+.+.|.|..+..+..++++|.
T Consensus 2 ~~G~~~~~~~tv~N~g~~~~~~v~~~l~ 29 (78)
T PF10633_consen 2 TPGETVTVTLTVTNTGTAPLTNVSLSLS 29 (78)
T ss_dssp -TTEEEEEEEEEE--SSS-BSS-EEEEE
T ss_pred CCCCEEEEEEEEEECCCCceeeEEEEEe
Confidence 4899999999999999888888887774
No 22
>PF07070 Spo0M: SpoOM protein; InterPro: IPR009776 This family consists of several bacterial SpoOM proteins which are thought to control sporulation in Bacillus subtilis.Spo0M exerts certain negative effects on sporulation and its gene expression is controlled by sigmaH [].
Probab=67.51 E-value=1e+02 Score=28.31 Aligned_cols=50 Identities=16% Similarity=0.286 Sum_probs=43.4
Q ss_pred eeCCcEEEEEEECCccccCCCeEEEEEEEEccc-cceeEEEEEEEEEeEEE
Q psy2082 191 LSPGDLELEVTLDKQLYAHNEKIAVSICIRNNS-NKVVKKIKAMVQQGVDV 240 (314)
Q Consensus 191 ~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~S-sk~Vk~Ikv~L~Q~~~~ 240 (314)
+.-|..+++..|++..|.|||+|.=.|.|...+ ...|..|.+.|.-.+..
T Consensus 8 ~GiG~akVDT~L~~~~~~pGe~v~G~V~i~GG~v~Q~I~~I~l~L~t~~~~ 58 (218)
T PF07070_consen 8 IGIGGAKVDTVLEKPSVRPGETVRGEVHIKGGSVDQEIDRIYLELVTRYEV 58 (218)
T ss_pred cCCCCceEEEEECCCCccCCCEEEEEEEEEeCCcceEEeEEEEEEEEEEEE
Confidence 345888999999999999999999999999996 57899999999766543
No 23
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=67.25 E-value=8.8 Score=26.77 Aligned_cols=30 Identities=13% Similarity=0.325 Sum_probs=25.6
Q ss_pred cCCCeEEEEEEEEccccceeEEEEEEEEEe
Q psy2082 208 AHNEKIAVSICIRNNSNKVVKKIKAMVQQG 237 (314)
Q Consensus 208 ~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~ 237 (314)
.-||+|+++|++.|.....+-..-+.|.+-
T Consensus 10 K~Ge~I~ltVt~kda~G~pv~n~~f~l~r~ 39 (47)
T PF05688_consen 10 KVGETIPLTVTVKDANGNPVPNAPFTLTRG 39 (47)
T ss_pred ecCCeEEEEEEEECCCCCCcCCceEEEEec
Confidence 359999999999999999999888777543
No 24
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=59.82 E-value=36 Score=25.66 Aligned_cols=40 Identities=13% Similarity=0.313 Sum_probs=28.7
Q ss_pred EEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082 196 LELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ 235 (314)
Q Consensus 196 l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~ 235 (314)
|.+.....-....+|+.+.+++.|.|.-......+.+.+.
T Consensus 4 L~v~~~~~~~~~~~g~~~~i~~~V~N~G~~~~~~~~v~~~ 43 (101)
T PF07705_consen 4 LTVSITVSPSNVVPGEPVTITVTVKNNGTADAENVTVRLY 43 (101)
T ss_dssp EEE-EEEC-SEEETTSEEEEEEEEEE-SSS-BEEEEEEEE
T ss_pred EEEEEeeCCCcccCCCEEEEEEEEEECCCCCCCCEEEEEE
Confidence 3445566667778999999999999998888888777753
No 25
>PF06030 DUF916: Bacterial protein of unknown function (DUF916); InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function.
Probab=58.52 E-value=23 Score=29.33 Aligned_cols=27 Identities=26% Similarity=0.390 Sum_probs=22.7
Q ss_pred cCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082 208 AHNEKIAVSICIRNNSNKVVKKIKAMVQ 235 (314)
Q Consensus 208 ~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~ 235 (314)
-||+...+.+.|.|+|++.++ +++.+.
T Consensus 24 ~P~q~~~l~v~i~N~s~~~~t-v~v~~~ 50 (121)
T PF06030_consen 24 KPGQKQTLEVRITNNSDKEIT-VKVSAN 50 (121)
T ss_pred CCCCEEEEEEEEEeCCCCCEE-EEEEEe
Confidence 589999999999999998876 566654
No 26
>PF09624 DUF2393: Protein of unknown function (DUF2393); InterPro: IPR013417 The function of this protein is unknown. It is always found as part of a two-gene operon with IPR013416 from INTERPRO, a protein that appears to span the membrane seven times. It has so far been found in the bacteria Anabaena sp. (strain PCC 7120), Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus.
Probab=58.20 E-value=28 Score=29.39 Aligned_cols=38 Identities=29% Similarity=0.323 Sum_probs=28.9
Q ss_pred EEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082 196 LELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMV 234 (314)
Q Consensus 196 l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L 234 (314)
..+++.-.+. ..-+|.+.|..+|.|.+++++++.++.+
T Consensus 48 ~~~~~~~~~~-l~~~~~~~v~g~V~N~g~~~i~~c~i~~ 85 (149)
T PF09624_consen 48 IELTLTSQKR-LQYSESFYVDGTVTNTGKFTIKKCKITV 85 (149)
T ss_pred ceEEEeeeee-eeeccEEEEEEEEEECCCCEeeEEEEEE
Confidence 3444444444 4458999999999999999999888766
No 27
>PF07703 A2M_N_2: Alpha-2-macroglobulin family N-terminal region; InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=55.74 E-value=17 Score=29.70 Aligned_cols=30 Identities=27% Similarity=0.366 Sum_probs=24.6
Q ss_pred cEEEEEEECCccccCCCeEEEEEEEEcccc
Q psy2082 195 DLELEVTLDKQLYAHNEKIAVSICIRNNSN 224 (314)
Q Consensus 195 ~l~l~asldK~~Y~pGE~I~v~v~I~N~Ss 224 (314)
.++++++.+++.|.|||++.++++...+|.
T Consensus 94 ~~~v~l~~~~~~~~Pg~~~~~~i~~~~~s~ 123 (136)
T PF07703_consen 94 ELKVELTASPDEYKPGEEVTLRIKAPPNSL 123 (136)
T ss_dssp SSSEEEEESSSSBTTTSEEEEEEEESTTEE
T ss_pred cceEEEEEecceeCCCCEEEEEEEeCCCCE
Confidence 456778889999999999999999955543
No 28
>PF09478 CBM49: Carbohydrate binding domain CBM49; InterPro: IPR019028 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This domain is found at the C-terminal of cellulases and in vitro binding studies have shown it to binds to crystalline cellulose []. ; GO: 0030246 carbohydrate binding, 0005576 extracellular region
Probab=55.47 E-value=31 Score=26.18 Aligned_cols=39 Identities=21% Similarity=0.309 Sum_probs=28.7
Q ss_pred EEEEEECCccccCCCe-EEEEEEEEccccceeEEEEEEEE
Q psy2082 197 ELEVTLDKQLYAHNEK-IAVSICIRNNSNKVVKKIKAMVQ 235 (314)
Q Consensus 197 ~l~asldK~~Y~pGE~-I~v~v~I~N~Ssk~Vk~Ikv~L~ 235 (314)
+++.++..+--..|.+ -...+.|.|+++++|+.+.+.+.
T Consensus 2 ~i~q~~~~sW~~~g~~y~qy~v~I~N~~~~~I~~~~i~~~ 41 (80)
T PF09478_consen 2 TITQTLVNSWTENGQTYTQYDVTITNNGSKPIKSLKISID 41 (80)
T ss_pred EEEEEEEeEEEeCCEEEEEEEEEEEECCCCeEEEEEEEEC
Confidence 3445555555555553 46799999999999999998886
No 29
>PF04744 Monooxygenase_B: Monooxygenase subunit B protein; InterPro: IPR006833 Ammonia monooxygenase and the particulate methane monooxygenase are both integral membrane proteins, occurring in ammonia oxidisers and methanotrophs respectively, which are thought to be evolutionarily related []. These enzymes have a relatively wide substrate specificity and can catalyse the oxidation of a range of substrates including ammonia, methane, halogenated hydrocarbons and aromatic molecules []. These enzymes are composed of 3 subunits - A (IPR003393 from INTERPRO), B (IPR006833 from INTERPRO) and C (IPR006980 from INTERPRO) - and contain various metal centres, including copper. Particulate methane monooxygenase from Methylococcus capsulatus str. Bath is an ABC homotrimer, which contains mononuclear and dinuclear copper metal centres, and a third metal centre containing a metal ion whose identity in vivo is not certain[]. The soluble regions of these enzymes derive primarily from the B subunit. This subunit forms two antiparallel beta-barrel-like structures and contains the mono- and di- nuclear copper metal centres [].; PDB: 3CHX_E 3RFR_A 3RGB_A 1YEW_A.
Probab=54.31 E-value=14 Score=36.59 Aligned_cols=42 Identities=14% Similarity=0.298 Sum_probs=32.3
Q ss_pred EEEEECCccc-cCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEE
Q psy2082 198 LEVTLDKQLY-AHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVL 242 (314)
Q Consensus 198 l~asldK~~Y-~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~ 242 (314)
+++.+.+..| +||-++.++++|+|++++.|+ +.=.|.+.+++
T Consensus 249 V~~~v~~A~Y~vpgR~l~~~l~VtN~g~~pv~---LgeF~tA~vrF 291 (381)
T PF04744_consen 249 VKVKVTDATYRVPGRTLTMTLTVTNNGDSPVR---LGEFNTANVRF 291 (381)
T ss_dssp EEEEEEEEEEESSSSEEEEEEEEEEESSS-BE---EEEEESSS-EE
T ss_pred eEEEEeccEEecCCcEEEEEEEEEcCCCCceE---eeeEEeccEEE
Confidence 7888888888 899999999999999999865 34455666654
No 30
>PF00963 Cohesin: Cohesin domain; InterPro: IPR002102 Cohesin domains interact with a complementary domain, termed the dockerin domain (see IPR002105 from INTERPRO). The cohesin-dockerin interaction is the crucial interaction for complex formation in the cellulosome []. The scaffoldin component of the cellulolytic bacterium Clostridium thermocellum is a non-hydrolytic protein which organises the hydrolytic enzymes in a large complex, called the cellulosome. Scaffoldin comprises a series of functional domains, amongst which is a single cellulose-binding domain and nine cohesin domains which are responsible for integrating the individual enzymatic subunits into the complex.; GO: 0030246 carbohydrate binding, 0000272 polysaccharide catabolic process; PDB: 2BM3_A 3P0D_I 3KCP_A 2B59_A 3L8Q_B 3FNK_C 3GHP_B 2CCL_A 1ANU_A 1OHZ_A ....
Probab=51.86 E-value=43 Score=27.75 Aligned_cols=37 Identities=16% Similarity=0.294 Sum_probs=29.3
Q ss_pred EEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082 198 LEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ 235 (314)
Q Consensus 198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~ 235 (314)
+++++++.--.+||++.|.+.++|.++. |..+.+.|.
T Consensus 1 v~l~~~~~~a~~G~tv~V~V~v~~~~~~-i~~~~~~l~ 37 (141)
T PF00963_consen 1 VTLSVDSVSAKPGETVTVPVNVSNVSNS-IAGMQFTLS 37 (141)
T ss_dssp EEEEESECEE-TTSEEEEEEEEESCTTT-EEEEEEEEE
T ss_pred CEEEeCCceECCCCEEEEEEEEEcCCCc-EEEEEEEEE
Confidence 4667788777999999999999999866 777776663
No 31
>PF05326 SVA: Seminal vesicle autoantigen (SVA); InterPro: IPR007990 This family consists of seminal vesicle autoantigen and prolactin-inducible (PIP) proteins. Seminal vesicle autoantigen (SVA) is specifically present in the seminal plasma of mice. This 19 kDa secretory glycoprotein suppresses the motility of spermatozoa by interacting with phospholipid. PIP has several known functions. In saliva, this protein plays a role in host defence by binding to microorganisms such as Streptococcus. PIP is an aspartyl proteinase and it acts as a factor capable of suppressing T-cell apoptosis through its interaction with CD4 [].; GO: 0005576 extracellular region; PDB: 3ES6_B.
Probab=47.84 E-value=37 Score=28.50 Aligned_cols=61 Identities=13% Similarity=0.176 Sum_probs=31.6
Q ss_pred eCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEecCeEEEEEEE
Q psy2082 192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQNGQFRHSVDC 254 (314)
Q Consensus 192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~~~~~~~Va~ 254 (314)
+..+|.|+..+++.. .+||.+.+.++|.|+=..-+- ||+.|.-...+.+..+-.|+.+++.
T Consensus 30 ~~k~l~l~l~~~~~~-~~~eev~v~l~V~t~~~eCmv-Vk~yl~sn~~i~~~fny~YTaCLC~ 90 (124)
T PF05326_consen 30 SRKPLSLNLQVPQTA-KANEEVTVTLTVTTELRECMV-VKIYLESNPPIDGSFNYKYTACLCD 90 (124)
T ss_dssp -----EE-----SEE--SS--EEEEEEEEE--SS-EE-EEEEEEESS---SGGG-EEEEEE-S
T ss_pred cCccEEEEeecCCCC-CCCCEEEEEEEEEcchheeEE-EEEEeccCCCccccccceEEEEeCC
Confidence 457899999999988 899999999999999877765 7888876666655445667766753
No 32
>cd00258 GM2-AP GM2 activator protein (GM2-AP) is a non-enzymatic lysosomal protein that acts as cofactor in the sequential degradation of gangliosides. GM2A is an essential cofactor for beta-hexosaminidase A (Hex A) in the enzymatic hydrolysis of GM2 ganglioside to GM3. Mutation of the gene results in the AB variant of Tay-Sachs disease. GM2-AP and similar proteins belong to the ML domain family.
Probab=47.19 E-value=72 Score=28.05 Aligned_cols=78 Identities=15% Similarity=0.188 Sum_probs=45.4
Q ss_pred EEEEEEEeeecccccccccCccccceEEEEEEEEecCCCCCCCCcchhhh--hhhccCCCeeEeEE-eeCCC-CCCCeEE
Q psy2082 52 VFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSEKQHTSLTKMQD--CLLKKLGPKAYPFT-FNITP-AAPPSVT 127 (314)
Q Consensus 52 v~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~~~~~~lt~lq~--~l~~k~G~h~fPF~-f~LP~-~lPsSf~ 127 (314)
+|+.+=|...+| .....|+.--..++.|+.+.=..++...+- +=--++|.|+.|=+ |.||+ +||+...
T Consensus 62 ~Wi~iPC~~~~G--------SCty~d~C~~l~~~~~~~~~Cp~~l~~~g~pC~CP~~~G~y~lp~s~f~lP~~~LPs~l~ 133 (162)
T cd00258 62 LWMKIPCLDNIG--------SCTYDNACDLLDTLIPPGQQCPEPLRTYGLPCHCPFKEGVYSLPDSTFTLPNVDLPSWLT 133 (162)
T ss_pred EEEEcccccccC--------cccccchhhchhcccCCCCCCCchhhccCCccCCCCCCcceEccceeeecccccCCCccC
Confidence 355666655443 344555555666777776522223321110 01145899998654 69997 6887764
Q ss_pred ecCCCCCCCCCeEEEEEEEEEE
Q psy2082 128 LQPGPDETGEPCGVTYYVKLFV 149 (314)
Q Consensus 128 lq~~~~~~G~~g~V~Y~Vka~i 149 (314)
.| .|.+++.+
T Consensus 134 -------~G-----~Y~i~~~l 143 (162)
T cd00258 134 -------NG-----NYRITGIL 143 (162)
T ss_pred -------CC-----cEEEEEEE
Confidence 24 39999986
No 33
>smart00809 Alpha_adaptinC2 Adaptin C-terminal domain. Adaptins are components of the adaptor complexes which link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. Gamma-adaptin is a subunit of the golgi adaptor. Alpha adaptin is a heterotetramer that regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This Ig-fold domain is found in alpha, beta and gamma adaptins and consists of a beta-sandwich containing 7 strands in 2 beta-sheets in a greek-key topology PUBMED:10430869, PUBMED:12176391. The adaptor appendage contains an additional N-terminal strand.
Probab=45.15 E-value=72 Score=24.71 Aligned_cols=40 Identities=10% Similarity=0.253 Sum_probs=31.9
Q ss_pred eCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082 192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ 235 (314)
Q Consensus 192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~ 235 (314)
.+.-|++.+.+.+. +..+.+.+...|.|...+.++.+.+.
T Consensus 3 ~~~~l~I~~~~~~~----~~~~~i~~~~~N~s~~~it~f~~~~a 42 (104)
T smart00809 3 EKNGLQIGFKFERR----PGLIRITLTFTNKSPSPITNFSFQAA 42 (104)
T ss_pred cCCCEEEEEEEEcC----CCeEEEEEEEEeCCCCeeeeEEEEEE
Confidence 45567888888765 45689999999999999999887664
No 34
>TIGR03079 CH4_NH3mon_ox_B methane monooxygenase/ammonia monooxygenase, subunit B. Both ammonia oxidizers such as Nitrosomonas europaea and methanotrophs (obligate methane oxidizers) such as Methylococcus capsulatus each can grow only on their own characteristic substrate. However, both groups have the ability to oxidize both substrates, and so the relevant enzymes must be named here according to their ability to oxidze both. The protein family represented here reflects subunit B of both the particulate methane monooxygenase of methylotrophs and the ammonia monooxygenase of nitrifying bacteria.
Probab=45.06 E-value=30 Score=34.26 Aligned_cols=47 Identities=15% Similarity=0.247 Sum_probs=33.9
Q ss_pred CcEEEEEEECCccc-cCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEE
Q psy2082 194 GDLELEVTLDKQLY-AHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLF 243 (314)
Q Consensus 194 G~l~l~asldK~~Y-~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~ 243 (314)
++=.+++.+.+..| +||-.+.++++|+|++++.|+ +.=.+.+.+++-
T Consensus 264 ~~~~V~~kv~~a~Y~VPGR~l~~~~~VTN~g~~~vr---lgEF~TA~vRFl 311 (399)
T TIGR03079 264 APNPVSINVTKANYDVPGRALRVTMEITNNGDQVIS---IGEFTTAGIRFM 311 (399)
T ss_pred CCCceEEEEeccEEecCCcEEEEEEEEEcCCCCceE---EEeEeecceEee
Confidence 34456666667666 999999999999999998865 344455555543
No 35
>COG5554 NifU Nitrogen fixation protein [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=44.48 E-value=61 Score=23.87 Aligned_cols=39 Identities=31% Similarity=0.584 Sum_probs=29.2
Q ss_pred eEEEeecCCCcEEEEecceeeEecCCCceeeeeEEEEecceeeceeEEE
Q psy2082 6 KVFKKCSPNGKLTLYMGKRDFVDYVTGVEPIDGVVVFEEDYIQDRKVFG 54 (314)
Q Consensus 6 ~vfkk~s~n~~~t~yl~~rd~~d~~~~vd~v~GvV~vd~~~lk~rkv~~ 54 (314)
||.=.....| |+.|+.|.|. . +|| |.|+.+.+-|-+|..
T Consensus 2 kVmiR~t~ag-LsaYiPKKDL----E--Epi---i~V~nedlwGG~ilL 40 (69)
T COG5554 2 KVMIRVTDAG-LSAYIPKKDL----E--EPI---IVVKNEDLWGGKILL 40 (69)
T ss_pred eEEEEEcCCc-eEEEcccccc----c--cce---EEEeccccccceEEe
Confidence 3444455667 9999999984 3 565 889999999988874
No 36
>PF13199 Glyco_hydro_66: Glycosyl hydrolase family 66; PDB: 3VMO_A 3VMN_A 3VMP_A.
Probab=44.21 E-value=38 Score=35.45 Aligned_cols=27 Identities=26% Similarity=0.437 Sum_probs=17.4
Q ss_pred EECCccccCCCeEEEEEEEEcccccee
Q psy2082 201 TLDKQLYAHNEKIAVSICIRNNSNKVV 227 (314)
Q Consensus 201 sldK~~Y~pGE~I~v~v~I~N~Ssk~V 227 (314)
+.||..|.|||+|.++++..|.+....
T Consensus 1 ~tDKA~Y~PGe~V~l~~~~~~~~~~~~ 27 (559)
T PF13199_consen 1 TTDKARYRPGEKVTLTASLKNTTGSDF 27 (559)
T ss_dssp EES-SSB-TTS-EEEE-EEE--SSS-E
T ss_pred CCCcceeCCCCeEEEEEEeccCccccc
Confidence 579999999999999999999976653
No 37
>PF02014 Reeler: Reeler domain Schematic picture including Reeler domain; InterPro: IPR002861 Extracellular matrix (ECM) proteins play an important role in early cortical development, specifically in the formation of neural connections and in controlling the cyto-architecture of the central nervous system. The product of the reeler gene in mouse is reelin,a large extracellular protein secreted by pioneer neurons that coordinates cell positioning during neurodevelopment []. F-spondin and mindin are a family of matrix-attached adhesion molecules that share structural similarities and overlapping domains of expression. Both F-spondin and mindin promote adhesion and outgrowth of hippocampal embryonic neurons and bind to a putative receptor(s) expressed on both hippocampal and sensory neurons []. This domain of unknown function is found at the N terminus of reelin and F-spondin.; PDB: 2ZOT_B 2ZOU_B 3COO_A.
Probab=43.13 E-value=53 Score=27.08 Aligned_cols=38 Identities=5% Similarity=0.188 Sum_probs=25.2
Q ss_pred CcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082 194 GDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ 235 (314)
Q Consensus 194 G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~ 235 (314)
++..|. ++.+.|.|||.+.|++ .+.++...+.+-+...
T Consensus 19 ~py~i~--~~~~~y~pg~~~~Vtl--~~~~~~~F~GFllqAr 56 (132)
T PF02014_consen 19 SPYSIS--VSPSSYEPGQTYTVTL--SSSGSSSFRGFLLQAR 56 (132)
T ss_dssp TSEEEE--ET-SSB-TTBEEEEEE--EETTTEEBSEEEEEEE
T ss_pred CCEEEE--eCCCeEcCCCEEEEEE--ECCCCCceeEEEEEEE
Confidence 444444 4489999999999998 6666777777555443
No 38
>PF07703 A2M_N_2: Alpha-2-macroglobulin family N-terminal region; InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=43.05 E-value=31 Score=28.02 Aligned_cols=25 Identities=24% Similarity=0.319 Sum_probs=20.5
Q ss_pred EEEEECCccccCCCeEEEEEEEEcc
Q psy2082 198 LEVTLDKQLYAHNEKIAVSICIRNN 222 (314)
Q Consensus 198 l~asldK~~Y~pGE~I~v~v~I~N~ 222 (314)
|++.+||..|.+||++.+.+.-.-.
T Consensus 1 l~i~~~~~~~~~Ge~~~v~v~~~~~ 25 (136)
T PF07703_consen 1 LQISTDKDSYKPGETAKVTVQSPFP 25 (136)
T ss_dssp EEEEE-SSSB-TTSEEEEEEEEESC
T ss_pred CEEEcCCCCcCCCCEEEEEEEcCCC
Confidence 6789999999999999999987766
No 39
>PF06159 DUF974: Protein of unknown function (DUF974); InterPro: IPR010378 This is a family of uncharacterised eukaryotic proteins.
Probab=41.43 E-value=37 Score=31.56 Aligned_cols=29 Identities=24% Similarity=0.401 Sum_probs=26.1
Q ss_pred ccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082 207 YAHNEKIAVSICIRNNSNKVVKKIKAMVQ 235 (314)
Q Consensus 207 Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~ 235 (314)
.|-||+-.-.+++.|.|+..|+.+.+++.
T Consensus 10 iylGEtF~~~l~~~N~s~~~v~~v~ikve 38 (249)
T PF06159_consen 10 IYLGETFSCYLSVNNDSNKPVRNVRIKVE 38 (249)
T ss_pred EeecCCEEEEEEeecCCCCceEEeEEEEE
Confidence 46799999999999999999999888775
No 40
>PF04426 Bul1_C: Bul1 C terminus; InterPro: IPR022794 This domain contains the C terminus of Saccharomyces cerevisiae Bul1. Bul1 binds the ubiquitin ligase Rsp5, via an N-terminal PPSY motif (157-160 in P48524 from SWISSPROT) []. The complex containing Bul1 and Rsp5 is involved in intracellular trafficking of the general amino acid permease Gap1 [], degradation of Rog1 in cooperation with Bul2 and GSK-3 [], and mitochondrial inheritance []. Bul1 may contain HEAT repeats. This domain is found C-terminal to PF04425 from PFAM. This signature also matches the C terminus of some bacterial Methyl-accepting chemotaxis proteins.
Probab=39.35 E-value=62 Score=30.72 Aligned_cols=36 Identities=19% Similarity=0.360 Sum_probs=27.2
Q ss_pred CCCCCceeEEEEEEEEEEEEECCCCceeEEEEeEEEe
Q psy2082 262 PDPKDAFGIVVSYMVKVKLYLGALGGELSAELPFILM 298 (314)
Q Consensus 262 ~~~~~~~~I~VsY~vkV~l~v~~~~~~v~velPi~I~ 298 (314)
|+...| .+-=.|+|+|.+.++..++.+.+++|+.|.
T Consensus 235 PsFqSC-l~~R~Y~lrv~ik~~~~~~~~~l~VPv~i~ 270 (272)
T PF04426_consen 235 PSFQSC-LCSRLYYLRVNIKFKNHGGSVSLKVPVQIQ 270 (272)
T ss_pred CCchhe-eeeeEEEEEEEEEEcCCCCEEEEEeceEEe
Confidence 455444 455579999999995445679999999885
No 41
>COG4326 Spo0M Sporulation control protein [General function prediction only]
Probab=39.31 E-value=28 Score=31.99 Aligned_cols=20 Identities=20% Similarity=0.622 Sum_probs=17.7
Q ss_pred CC-CeeEeEEeeCCCCCCCeE
Q psy2082 107 LG-PKAYPFTFNITPAAPPSV 126 (314)
Q Consensus 107 ~G-~h~fPF~f~LP~~lPsSf 126 (314)
|| +|.|||+|.||.+.|-++
T Consensus 104 pgEe~~fpf~l~lP~~tPvT~ 124 (270)
T COG4326 104 PGEERNFPFELSLPWNTPVTI 124 (270)
T ss_pred CCceEeccEEEecCCCCceee
Confidence 45 599999999999999887
No 42
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=38.50 E-value=1.5e+02 Score=35.06 Aligned_cols=43 Identities=14% Similarity=0.303 Sum_probs=38.0
Q ss_pred eeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEE
Q psy2082 191 LSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAM 233 (314)
Q Consensus 191 ~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~ 233 (314)
|-.-.+.++++++|..+.+|+++.+++...|.+...+.+-++.
T Consensus 493 f~p~r~~i~l~~~k~~~~~g~~v~~~v~~~yL~GaPa~g~~~~ 535 (1621)
T COG2373 493 FIPDRFKINLTLDKTEWVPGKDVKIKVDLRYLYGAPAAGLTVQ 535 (1621)
T ss_pred hCCceEEEecccccccccCCCcEEEEEEEEecCCCcccCceee
Confidence 4556788999999999999999999999999999888877766
No 43
>KOG0452|consensus
Probab=37.53 E-value=35 Score=35.84 Aligned_cols=88 Identities=22% Similarity=0.368 Sum_probs=59.3
Q ss_pred EEEEEEccCCccCccccceEEEeeeeccCCCCCCCCCCeEEEeEEEEeeCCcEEEEEEECCccccCCCeEEEEEEEEccc
Q psy2082 144 YVKLFVGENETDRSHRRSTVALGIRKIQYAPSKNGRQPCTVVRKDFMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNS 223 (314)
Q Consensus 144 ~Vka~i~~~~~dr~~K~~~v~l~irkl~~~p~~~~~~p~~e~~k~f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~S 223 (314)
-|||.|+++- +|+|+.|-+-|.|-.++|-|-.. +++ +=-+|.=.-.+.||.++..||+.|.|+. +|..
T Consensus 794 GvKAViaeS~-ErIHrsnLvGmGIiPl~f~~Ge~-----Adt----LgLtG~E~yti~lP~~~lkPgq~i~v~~--dtGk 861 (892)
T KOG0452|consen 794 GVKAVIAESY-ERIHRSNLVGMGIIPLQFLPGED-----ADT----LGLTGRERYTIHLPENILKPGQDITVTT--DTGK 861 (892)
T ss_pred hhHHHHHHHH-HHHHhhccccceeeeeeecCCCC-----hhh----cCcccceeEEEECCcccCCCCceEEEEe--cCCc
Confidence 4677776654 79999998889998899877433 121 1124666677889999999999987653 3332
Q ss_pred cceeEEEEEEEEEeEEEEEEecC
Q psy2082 224 NKVVKKIKAMVQQGVDVVLFQNG 246 (314)
Q Consensus 224 sk~Vk~Ikv~L~Q~~~~~~~~~~ 246 (314)
. =...+++.--+++..|.++
T Consensus 862 --~-F~~~~rFdteVeltyy~~G 881 (892)
T KOG0452|consen 862 --V-FVCTLRFDTEVELTYYKNG 881 (892)
T ss_pred --E-EEEEEEecceEEEEEEecC
Confidence 2 2234566666777777665
No 44
>PF02883 Alpha_adaptinC2: Adaptin C-terminal domain; InterPro: IPR008152 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. AP (adaptor protein) complexes are found in coated vesicles and clathrin-coated pits. AP complexes connect cargo proteins and lipids to clathrin at vesicle budding sites, as well as binding accessory proteins that regulate coat assembly and disassembly (such as AP180, epsins and auxilin). There are different AP complexes in mammals. AP1 is responsible for the transport of lysosomal hydrolases between the TGN and endosomes []. AP2 associates with the plasma membrane and is responsible for endocytosis []. AP3 is responsible for protein trafficking to lysosomes and other related organelles []. AP4 is less well characterised. AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). For example, in AP1 these subunits are gamma-1-adaptin, beta-1-adaptin, mu-1 and sigma-1, while in AP2 they are alpha-adaptin, beta-2-adaptin, mu-2 and sigma-2. Each subunit has a specific function. Adaptins recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal ear (appendage) domains. Mu recognises tyrosine-based sorting signals within the cytoplasmic domains of transmembrane cargo proteins []. One function of clathrin and AP2 complex-mediated endocytosis is to regulate the number of GABA(A) receptors available at the cell surface []. GGAs (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) are a family of monomeric clathrin adaptor proteins that are conserved from yeasts to humans. GGAs regulate clathrin-mediated the transport of proteins (such as mannose 6-phosphate receptors) from the TGN to endosomes and lysosomes through interactions with TGN-sorting receptors, sometimes in conjunction with AP-1 [, ]. GGAs bind cargo, membranes, clathrin and accessory factors. GGA1, GGA2 and GGA3 all contain a domain homologous to the ear domain of gamma-adaptin. GGAs are composed of a single polypeptide with four domains: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The VHS domain is responsible for endocytosis and signal transduction, recognising transmembrane cargo through the ACLL sequence in the cytoplasmic domains of sorting receptors []. The GAT domain (also found in Tom1 proteins) interacts with ARF (ADP-ribosylation factor) to regulate membrane trafficking [], and with ubiquitin for receptor sorting []. The hinge region contains a clathrin box for recognition and binding to clathrin, similar to that found in AP adaptins. The GAE domain is similar to the AP gamma-adaptin ear domain, and is responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. This entry represents a beta-sandwich structural motif found in the appendage (ear) domain of alpha-, beta- and gamma-adaptin from AP clathrin adaptor complexes, and the GAE (gamma-adaptin ear) domain of GGA adaptor proteins. These domains have an immunoglobulin-like beta-sandwich fold containing 7 or 8 strands in 2 beta-sheets in a Greek key topology [, ]. Although these domains share a similar fold, there is little sequence identity between the alpha/beta-adaptins and gamma-adaptin/GAE. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030131 clathrin adaptor complex; PDB: 3MNM_B 3ZY7_B 1GYU_A 1GYW_B 2A7B_A 1GYV_A 2E9G_A 1E42_B 2G30_A 2IV9_B ....
Probab=35.03 E-value=1.5e+02 Score=23.46 Aligned_cols=44 Identities=9% Similarity=0.194 Sum_probs=35.1
Q ss_pred EEeeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082 189 FMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMV 234 (314)
Q Consensus 189 f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L 234 (314)
.++.+.-|.+.+...+ -..+..+.+.+...|.|+..+.++.+.+
T Consensus 4 ~~ye~~~l~I~~~~~~--~~~~~~~~i~~~f~N~s~~~it~f~~q~ 47 (115)
T PF02883_consen 4 VLYEDNGLQIGFKSEK--SPNPNQGRIKLTFGNKSSQPITNFSFQA 47 (115)
T ss_dssp EEEEETTEEEEEEEEE--CCETTEEEEEEEEEE-SSS-BEEEEEEE
T ss_pred EEEeCCCEEEEEEEEe--cCCCCEEEEEEEEEECCCCCcceEEEEE
Confidence 3567778888888887 5668899999999999999999988765
No 45
>PRK12676 bifunctional inositol-1 monophosphatase/fructose-1,6-bisphosphatase; Reviewed
Probab=34.61 E-value=27 Score=32.37 Aligned_cols=49 Identities=16% Similarity=0.243 Sum_probs=37.4
Q ss_pred CCceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEEE
Q psy2082 31 TGVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASE 83 (314)
Q Consensus 31 ~~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~ 83 (314)
=.||||||.- .+++|.-.|+...+-++-|+-..=|+-..+..++|.+..
T Consensus 84 WvIDPiDGT~----nfv~g~p~~~vsial~~~g~p~~gvV~~P~~~e~~~A~~ 132 (263)
T PRK12676 84 VVLDPLDGTY----NAINGIPFYAISIAVFKGGKPVYGYVYNLATGDFYEAIP 132 (263)
T ss_pred EEEeccCCch----HHhcCCCceEEEEEEEECCeEEEEEEEecCCCCEEEEEC
Confidence 3578888865 899999999977777776766666777788888888653
No 46
>KOG2540|consensus
Probab=34.55 E-value=74 Score=29.48 Aligned_cols=28 Identities=18% Similarity=0.267 Sum_probs=24.3
Q ss_pred Cccc-cCCCeEEEEEEEEccccceeEEEE
Q psy2082 204 KQLY-AHNEKIAVSICIRNNSNKVVKKIK 231 (314)
Q Consensus 204 K~~Y-~pGE~I~v~v~I~N~Ssk~Vk~Ik 231 (314)
+++| .|||+...--+..|.|.+.|-+|.
T Consensus 155 rEiyV~PGEtALaFYta~N~sdkpIiGvs 183 (269)
T KOG2540|consen 155 REIYVLPGETALAFYTAENPSDKPIIGVS 183 (269)
T ss_pred eEEEEcCCcceeeeEeccCCCCCCceeeE
Confidence 3456 799999999999999999998865
No 47
>cd08548 Type_I_cohesin_like Type I cohesin domain, interaction partner of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I cohesins; their interactions with dockerin mediate assembly of a range of dockerin-borne enzymes to the complex.
Probab=33.79 E-value=91 Score=26.09 Aligned_cols=39 Identities=13% Similarity=0.176 Sum_probs=32.1
Q ss_pred EEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEe
Q psy2082 199 EVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQG 237 (314)
Q Consensus 199 ~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~ 237 (314)
++.+.+--=.+||++.|-|.++|..++.|..+.+.|.--
T Consensus 2 ~v~ig~v~~~~G~tv~VpV~~~~v~~~~i~~~~f~l~yD 40 (135)
T cd08548 2 EVKIGSVSGKPGDTVTVPVTLSNVPSKGIGACDFVLSYD 40 (135)
T ss_pred eEEeccEEecCCCEEEEEEEEecCCccCEEEEEEEEEeC
Confidence 556666666799999999999999998899888877543
No 48
>PF13595 DUF4138: Domain of unknown function (DUF4138)
Probab=33.31 E-value=58 Score=30.37 Aligned_cols=36 Identities=22% Similarity=0.484 Sum_probs=29.8
Q ss_pred EEEEECCccccCCCeEEEEEEEEccccce--eEEEEEEE
Q psy2082 198 LEVTLDKQLYAHNEKIAVSICIRNNSNKV--VKKIKAMV 234 (314)
Q Consensus 198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~--Vk~Ikv~L 234 (314)
|++.|. .+|++|+-+-+.++|.|.|+.+ |.-|++.+
T Consensus 130 i~~~L~-~Iy~~~d~lyf~~~i~N~S~i~ydID~irf~i 167 (246)
T PF13595_consen 130 IQFRLN-NIYVHGDYLYFHLSIKNKSNIPYDIDFIRFKI 167 (246)
T ss_pred eEEEEE-eEEEECCEEEEEEEEEcCCCCCeEEeEEEEEE
Confidence 555554 6999999999999999999865 77777765
No 49
>cd08544 Reeler Reeler, the N-terminal domain of reelin, F-spondin, and a variety of other proteins. This domain is found at the N-terminus of F-spondin, a protein attached to the extracellular matrix, which plays roles in neuronal development and vascular remodelling. The F-spondin reeler domain has been reported to bind heparin. The reeler domain is also found at the N-terminus of reelin, an extracellular glycoprotein involved in the development of the brain cortex, and in a variety of other eukaryotic proteins with different domain architectures, including the animal ferric-chelate reductase 1 or stromal cell-derived receptor 2, a member of the cytochrome B561 family, which reduces ferric iron before its transport from the endosome to the cytoplasm. Also included is the insect putative defense protein 1, which is expressed upon bacterial infection and appears to contain a single reeler domain.
Probab=32.64 E-value=89 Score=25.71 Aligned_cols=37 Identities=11% Similarity=0.179 Sum_probs=26.0
Q ss_pred EEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEE
Q psy2082 198 LEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQ 236 (314)
Q Consensus 198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q 236 (314)
..++++...|.|||.+.|++.-.|. ..-+.+-+....
T Consensus 21 y~i~~~~~~y~pG~~~~Vtl~~~~~--~~F~GF~lqAr~ 57 (135)
T cd08544 21 YSITISGNSYVPGETYTVTLSGSSP--SPFRGFLLQARD 57 (135)
T ss_pred EEEEeCCCEECCCCEEEEEEECCCC--CceeEEEEEEEc
Confidence 3445666699999999999988776 455555544443
No 50
>COG4326 Spo0M Sporulation control protein [General function prediction only]
Probab=32.50 E-value=1.2e+02 Score=27.95 Aligned_cols=43 Identities=7% Similarity=0.218 Sum_probs=35.4
Q ss_pred eCCcEEEEEEECCccccCCCeEEEEEEEEcccc-ceeEEEEEEE
Q psy2082 192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSN-KVVKKIKAMV 234 (314)
Q Consensus 192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ss-k~Vk~Ikv~L 234 (314)
.=|.-++.--|.+++|+|||.+...+.|.-.-. .+|..|.++|
T Consensus 29 GIGaAKVDt~L~~~~~~PG~~v~g~vhv~GG~~AQdI~~I~LkL 72 (270)
T COG4326 29 GIGAAKVDTVLQQEVLYPGQSVKGIVHVYGGATAQDIDNIELKL 72 (270)
T ss_pred CcchhhhhhhhhhccccCCceEEEEEEEecCchHhhhhhhhhhh
Confidence 346777888899999999999999999987763 5688887777
No 51
>cd01641 Bacterial_IMPase_like_1 Predominantly bacterial family of Mg++ dependend phosphatases, related to inositol monophosphatases. These enzymes may dephosphorylate fructose-1,6-bisphosphate, inositol monophospate, 3'-phosphoadenosine-5'-phosphate, or similar substrates.
Probab=30.34 E-value=37 Score=31.03 Aligned_cols=46 Identities=22% Similarity=0.460 Sum_probs=35.9
Q ss_pred ceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082 33 VEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS 82 (314)
Q Consensus 33 vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~ 82 (314)
||||||.- .+++|.--|....+-++-|+-..=++...+..++|.+.
T Consensus 77 iDPIDGT~----nf~~g~p~~~vsial~~~g~p~~gvV~~P~~~~~~~A~ 122 (248)
T cd01641 77 LDPIDGTK----SFIRGLPVWGTLIALLHDGRPVLGVIDQPALGERWIGA 122 (248)
T ss_pred EecCcCch----hHhcCCCceEEEEEEEECCEEEEEEEccCccCCEEEEe
Confidence 79999965 88999888887666666676666677778888888864
No 52
>PF06988 NifT: NifT/FixU protein; InterPro: IPR009727 This family consists of several NifT and FixU bacterial proteins. The function of NifT is unknown although it is thought that the protein may be involved in biosynthesis of the FeMo cofactor of nitrogenase although perturbation of nifT expression in Klebsiella pneumoniae has only a limited effect on nitrogen fixation [].; GO: 0009399 nitrogen fixation; PDB: 2JN4_A.
Probab=30.27 E-value=50 Score=24.56 Aligned_cols=21 Identities=29% Similarity=0.479 Sum_probs=13.6
Q ss_pred EEEeecCCCcEEEEecceeeE
Q psy2082 7 VFKKCSPNGKLTLYMGKRDFV 27 (314)
Q Consensus 7 vfkk~s~n~~~t~yl~~rd~~ 27 (314)
|.=.-..+|.|++|+.|+|.=
T Consensus 3 VmiR~~~~G~ls~YVpKKDLE 23 (64)
T PF06988_consen 3 VMIRKNGAGGLSAYVPKKDLE 23 (64)
T ss_dssp EEEEE-SS--EEEEETTTTEE
T ss_pred EEEEeCCCcCEEEEEeCCccc
Confidence 334456789999999999953
No 53
>cd01639 IMPase IMPase, inositol monophosphatase and related domains. A family of Mg++ dependent phosphatases, inhibited by lithium, many of which may act on inositol monophosphate substrate. They dephosphorylate inositol phosphate to generate inositol, which may be recycled into inositol lipids; in eukaryotes IMPase plays a vital role in intracellular signaling. IMPase is one of the proposed targets of Li+ therapy in manic-depressive illness. This family contains some bacterial members of the inositol monophosphatase family classified as SuhB-like. E. coli SuhB has been suggested to participate in posstranscriptional control of gene expression, and its inositol monophosphatase activity doesn't appear to be sufficient for its cellular function. It has been proposed, that SuhB plays a role in the biosynthesis of phosphatidylinositol in mycobacteria.
Probab=30.25 E-value=34 Score=31.09 Aligned_cols=47 Identities=17% Similarity=0.299 Sum_probs=35.6
Q ss_pred CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082 32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS 82 (314)
Q Consensus 32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~ 82 (314)
.||||||.- .+++|.--|....+-++-|+-..=++...+..++|...
T Consensus 79 vIDPIDGT~----nf~~g~p~~~vsial~~~g~p~~gvV~~P~~~~~~~a~ 125 (244)
T cd01639 79 IIDPLDGTT----NFVHGFPHFAVSIALAVKGEPVVGVVYDPIRNELFTAV 125 (244)
T ss_pred EEecccChh----HHhcCCCcEEEEEEEEECCEEEEEEEEeCCCCcEEEEE
Confidence 478999964 89999999997777777676556667777777777764
No 54
>PF11355 DUF3157: Protein of unknown function (DUF3157); InterPro: IPR021501 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=30.05 E-value=1.3e+02 Score=27.36 Aligned_cols=36 Identities=19% Similarity=0.401 Sum_probs=29.2
Q ss_pred EEEEECCccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082 198 LEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMV 234 (314)
Q Consensus 198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L 234 (314)
+++.|...-|-.| ..-+...+.|+|+..|-.|.+.+
T Consensus 100 VdV~l~~~~y~~~-~L~l~~~ltnqSsqsVv~Vel~v 135 (199)
T PF11355_consen 100 VDVSLGASQYEDG-QLGLPFSLTNQSSQSVVLVELEV 135 (199)
T ss_pred eeEEEeccceeCC-eEEEEEEEecCCCceEEEEEEEE
Confidence 6677777666666 99999999999999998777655
No 55
>PRK10757 inositol monophosphatase; Provisional
Probab=29.26 E-value=37 Score=31.49 Aligned_cols=47 Identities=23% Similarity=0.419 Sum_probs=35.2
Q ss_pred CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082 32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS 82 (314)
Q Consensus 32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~ 82 (314)
.||||||.. .+++|.--|....+-++-|+-..=|+-..+..++|.+.
T Consensus 82 iIDPIDGT~----nf~~g~p~~~vsial~~~g~pv~GvV~~P~~~~~~~A~ 128 (267)
T PRK10757 82 VIDPLDGTT----NFIKRLPHFAVSIAVRIKGRTEVAVVYDPMRNELFTAT 128 (267)
T ss_pred EEeCccCch----HHHhCCCcEEEEEEEEECCEEEEEEEEcCCCCCEEEEE
Confidence 389999975 89999888887666665566555566677778888764
No 56
>TIGR02934 nifT_nitrog probable nitrogen fixation protein FixT. This largely uncharacterized protein family is assigned a role in nitrogen fixation by two criteria. First, its gene occurs, generally, among genes essential for expression of active nitrogenase. Second, its phylogenetic profile closely matches that of nitrogen-fixing bacteria. However, mutational studies in Klebsiella pneumoniae failed to demonstrate any phenotype for deletion or overexpression of the protein.
Probab=28.86 E-value=57 Score=24.50 Aligned_cols=18 Identities=28% Similarity=0.571 Sum_probs=14.7
Q ss_pred eecCCCcEEEEecceeeE
Q psy2082 10 KCSPNGKLTLYMGKRDFV 27 (314)
Q Consensus 10 k~s~n~~~t~yl~~rd~~ 27 (314)
..+.+|.|++|+.|+|.=
T Consensus 6 R~~~~g~l~~YvpKKDLE 23 (67)
T TIGR02934 6 RRNRAGELSAYVPKKDLE 23 (67)
T ss_pred EeCCCCCEEEEEECCcch
Confidence 456679999999999953
No 57
>PF07919 Gryzun: Gryzun, putative trafficking through Golgi; InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function. The region in question is approximately 150 residues long.
Probab=28.73 E-value=1.7e+02 Score=29.78 Aligned_cols=39 Identities=21% Similarity=0.272 Sum_probs=32.1
Q ss_pred eeCCcEEEEEEE--CCccccCCCeEEEEEEEEccccceeEE
Q psy2082 191 LSPGDLELEVTL--DKQLYAHNEKIAVSICIRNNSNKVVKK 229 (314)
Q Consensus 191 ~s~G~l~l~asl--dK~~Y~pGE~I~v~v~I~N~Ssk~Vk~ 229 (314)
....|-+|++++ .+.-|+-||.+.+.+.|.|+.......
T Consensus 168 I~p~pp~v~I~~~~~~~~~l~gE~~~i~i~I~n~e~~~~~~ 208 (554)
T PF07919_consen 168 ILPRPPKVSIKLPNHKPPALTGEFYPIPITISNNEDEEASG 208 (554)
T ss_pred EECCCCCeEEEeCCCCCCeEcCCEEEEEEEEEcCCCcccee
Confidence 456677777777 677899999999999999999877654
No 58
>TIGR01331 bisphos_cysQ 3'(2'),5'-bisphosphate nucleotidase, bacterial. Sulfate is incorporated into 3-phosphoadenylylsulfate, PAPS, for utilization in pathways such as methionine biosynthesis. Transfer of sulfate from PAPS to an acceptor leaves adenosine 3'-5'-bisphosphate, APS. This model describes a form found in bacteria of the enzyme 3'(2'),5'-bisphosphate nucleotidase, which removes the 3'-phosphate from APS to regenerate AMP and help drive the cycle.
Probab=27.88 E-value=35 Score=31.31 Aligned_cols=47 Identities=19% Similarity=0.253 Sum_probs=37.0
Q ss_pred CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082 32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS 82 (314)
Q Consensus 32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~ 82 (314)
.||||||.- .+++|.-.|....+-++-|+--+=|+-..+..++|.+.
T Consensus 80 vIDPIDGT~----nF~~G~p~~~vsIal~~~g~pv~gvI~~P~~~~~~~A~ 126 (249)
T TIGR01331 80 LVDPLDGTK----EFINRNGDFTVNIALVEHGVPVLGVVYAPATGVTYFAT 126 (249)
T ss_pred EEcCCcChH----HHhcCCCcEEEEEEEEECCEEEEEEEEecCCCCEEEEE
Confidence 489999965 89999888988777777677666677778888888764
No 59
>PF14796 AP3B1_C: Clathrin-adaptor complex-3 beta-1 subunit C-terminal
Probab=27.86 E-value=1.6e+02 Score=25.42 Aligned_cols=44 Identities=18% Similarity=0.267 Sum_probs=36.6
Q ss_pred eeCCcEEEEEEECCccc-cCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082 191 LSPGDLELEVTLDKQLY-AHNEKIAVSICIRNNSNKVVKKIKAMV 234 (314)
Q Consensus 191 ~s~G~l~l~asldK~~Y-~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L 234 (314)
+..+=|.++-...|+-+ +..--+.|++.++|+|...+++|++.=
T Consensus 64 v~G~GL~v~Y~F~RqP~~~s~~mvsIql~ftN~s~~~i~~I~i~~ 108 (145)
T PF14796_consen 64 VNGKGLSVEYRFSRQPSLYSPSMVSIQLTFTNNSDEPIKNIHIGE 108 (145)
T ss_pred cCCCceeEEEEEccCCcCCCCCcEEEEEEEEecCCCeecceEECC
Confidence 45667888888899877 556788999999999999999988654
No 60
>PLN02911 inositol-phosphate phosphatase
Probab=27.21 E-value=40 Score=31.98 Aligned_cols=48 Identities=23% Similarity=0.435 Sum_probs=35.9
Q ss_pred CCceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082 31 TGVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS 82 (314)
Q Consensus 31 ~~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~ 82 (314)
=.||||||.. .+++|.-.|....+-++=|+-.+-|+-..+.+++|.+.
T Consensus 112 WiIDPIDGT~----NFv~G~p~favsIal~~~g~pv~GvV~~P~~~e~y~A~ 159 (296)
T PLN02911 112 WVLDPIDGTK----SFITGKPLFGTLIALLYKGKPVLGIIDQPVLKERWVGV 159 (296)
T ss_pred EEEeCCcChH----HHhcCCCceEEEEEEEECCEEEEEEEecCCCCCEEEEE
Confidence 3469999975 89999998887666665565555677777888888865
No 61
>cd01515 Arch_FBPase_1 Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family (FBPase class IV). These are Mg++ dependent phosphatases. Members in this family may have both fructose-1,6-bisphosphatase and inositol-monophosphatase activity. In hyperthermophilic archaea, inositol monophosphatase is thought to play a role in the biosynthesis of di-myo-inositol-1,1'-phosphate, an osmolyte unique to hyperthermophiles.
Probab=27.19 E-value=40 Score=31.08 Aligned_cols=48 Identities=19% Similarity=0.261 Sum_probs=36.0
Q ss_pred CCceeeeeEEEEecceeeceeEEEEEEEeeeccc--ccccccCccccceEEEEE
Q psy2082 31 TGVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGR--EEDEILGLNFQKELYLAS 82 (314)
Q Consensus 31 ~~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~--~d~dv~gl~f~k~l~~~~ 82 (314)
=.||||||.- .+++|.-.|+...+-.+.|+ -.+-|+...+..++|.+.
T Consensus 79 WvIDPIDGT~----nfv~g~p~~~isial~~~g~~~p~~gvv~~P~~~~~~~a~ 128 (257)
T cd01515 79 VVLDPLDGTY----NAINGIPFYSVSVAVFKIDKSDPYYGYVYNLATGDLYYAI 128 (257)
T ss_pred EEEeCcCChh----HHhcCCCceEEEEEEEeCCCCCeEEEEEEecCCCCeEEEE
Confidence 3478888854 78999999998878887787 555566667777787754
No 62
>cd01517 PAP_phosphatase PAP-phosphatase_like domains. PAP-phosphatase is a member of the inositol monophosphatase family, and catalyses the hydrolysis of 3'-phosphoadenosine-5'-phosphate (PAP) to AMP. In Saccharomyces cerevisiae, HAL2 (MET22) is involved in methionine biosynthesis and provides increased salt tolerance when over-expressed. Bacterial members of this domain family may differ in their substrate specificity and dephosphorylate different targets, as the substrate binding site does not appear to be conserved in that sub-set.
Probab=26.10 E-value=55 Score=30.41 Aligned_cols=47 Identities=19% Similarity=0.328 Sum_probs=32.9
Q ss_pred CCceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccc-------cceEEEEE
Q psy2082 31 TGVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNF-------QKELYLAS 82 (314)
Q Consensus 31 ~~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f-------~k~l~~~~ 82 (314)
=.||||||.- .+++|. -|....+-.+-|+--+-|+-..+ ..++|.+.
T Consensus 75 WiIDPIDGT~----nfv~g~-~~~vsIal~~~g~pv~GvI~~P~~~~~~~~~~~~~~A~ 128 (274)
T cd01517 75 WVLDPIDGTK----GFLRGD-QFAVALALIEDGEVVLGVIGCPNLPLDDGGGGDLFSAV 128 (274)
T ss_pred EEEcCCcCch----hhhcCC-ceEEEEEEEECCEEEEEEEeCCCccccCCCCCcEEEEE
Confidence 3489999965 899999 88866666654554555666666 77777754
No 63
>cd08546 cohesin_like Cohesin domain, interaction parter of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. Cohesin modules are phylogenetically distributed into three groups: type I cohesin-dockerin interactions mediate assembly of a range of dockerin-borne enzymes to the complex, while type-II interactions mediate attachment of the cellulosome complex to the bacterial cell wall. Recently discovered type-III cohesins, such as found in the anchoring scaffoldin ScaE, appears to contribute to increased stability of the elaborate cellulosome complex. While the p
Probab=25.93 E-value=2.1e+02 Score=22.82 Aligned_cols=35 Identities=11% Similarity=0.172 Sum_probs=26.7
Q ss_pred EEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082 198 LEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ 235 (314)
Q Consensus 198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~ 235 (314)
+.+..+... .+||++.+.+.++|.+ .+....+.|.
T Consensus 3 ~~~~~~~~~-~~G~~~~v~v~~~~~~--~~~~~~~~l~ 37 (135)
T cd08546 3 VSLGAPSTV-KVGETVTVTVKVNNVP--NVAAADFTLS 37 (135)
T ss_pred EEEeccccc-cCCCEEEEEEEEecCC--CeEEEEEEEE
Confidence 444555555 9999999999999998 7777666664
No 64
>TIGR02067 his_9_proposed histidinol-phosphate phosphatase HisN, inositol monophosphatase family. This subfamily belongs to the inositol monophosphatase family (pfam00459). The members of this family consist of no more than one per species and are found only in species in which histidine is synthesized de novo but no histidinol phosphatase can be found in either of the two described families (TIGR01261, TIGR01856). In at least one species, the member of this family is found near known histidine biosynthesis genes.
Probab=24.86 E-value=44 Score=30.62 Aligned_cols=47 Identities=21% Similarity=0.322 Sum_probs=33.6
Q ss_pred CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082 32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS 82 (314)
Q Consensus 32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~ 82 (314)
.||||||.- .+++|.--|+...+-.+-|+-..-|+-..+..++|.+.
T Consensus 78 iiDPIDGT~----nF~~g~p~~~vsial~~~g~p~~gvv~~P~~~~~~~A~ 124 (251)
T TIGR02067 78 VLDPIDGTK----SFIRGVPVWGTLIALVEGGMPVLGVIFQPATGERWWAA 124 (251)
T ss_pred EEecCcChh----HHhcCCCceEEEEEEEECCEEEEEEEEEcCCCCEEEEe
Confidence 379999965 78888888886556565565555566667777777754
No 65
>PLN02553 inositol-phosphate phosphatase
Probab=24.43 E-value=52 Score=30.48 Aligned_cols=47 Identities=15% Similarity=0.128 Sum_probs=35.8
Q ss_pred CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082 32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS 82 (314)
Q Consensus 32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~ 82 (314)
.||||||.. .+++|.-.|....+-++-|+-..=|+-..+..++|.+.
T Consensus 89 iIDPIDGT~----NF~~g~p~~avsIal~~~g~pv~GvV~~P~~~e~~~A~ 135 (270)
T PLN02553 89 IVDPLDGTT----NFVHGFPFVCVSIGLTIGKVPVVGVVYNPILDELFTAV 135 (270)
T ss_pred EEecccchh----hHhhcCCceEEEEEEEECCEEEEEEEecCCCCCeEEEE
Confidence 689999965 88899888887666676666556666677788888865
No 66
>PF12389 Peptidase_M73: Camelysin metallo-endopeptidase; InterPro: IPR022121 Camelysin is a novel surface metallopeptidase from Bacillus cereus []. Camelysin prefers cleavage sites in front of aliphatic and hydrophilic amino acid residues (-OH, -SO3H, amido group), and requires zinc for activity [, ].
Probab=24.42 E-value=1.6e+02 Score=26.78 Aligned_cols=30 Identities=7% Similarity=0.139 Sum_probs=25.6
Q ss_pred ccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082 205 QLYAHNEKIAVSICIRNNSNKVVKKIKAMV 234 (314)
Q Consensus 205 ~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L 234 (314)
+-..||+++.-.+.|.|.-+-+++.|.+..
T Consensus 59 ~nlkPGD~v~k~f~l~N~Gtldi~~v~l~~ 88 (199)
T PF12389_consen 59 SNLKPGDTVEKEFTLKNSGTLDIKDVLLKT 88 (199)
T ss_pred ccCCCCCeEEEEEEEEeCCeeeeeeEEEEE
Confidence 467899999999999999999988776654
No 67
>COG2309 AmpS Leucyl aminopeptidase (aminopeptidase T) [Amino acid transport and metabolism]
Probab=23.94 E-value=1.5e+02 Score=29.59 Aligned_cols=54 Identities=24% Similarity=0.375 Sum_probs=40.1
Q ss_pred eecCCC-cEEEEecceeeEecCC-------------------CceeeeeEEEEe-cceeeceeEEEEEEEeeeccc
Q psy2082 10 KCSPNG-KLTLYMGKRDFVDYVT-------------------GVEPIDGVVVFE-EDYIQDRKVFGQVVCSFRYGR 64 (314)
Q Consensus 10 k~s~n~-~~t~yl~~rd~~d~~~-------------------~vd~v~GvV~vd-~~~lk~rkv~~~~~~~fryG~ 64 (314)
-++|.| .||+.|+.|--...-. +=|.|+|+|+.| |-+-+|.-+ =.+..+|+=|+
T Consensus 193 ~~tp~gTDLt~~l~~~~~~~~~g~~~~~~~~NmPtgEVFtaP~~~sv~G~V~~~~Pl~~~G~li-e~i~l~FedG~ 267 (385)
T COG2309 193 ITTPGGTDLTFGLAGRHAWNDAGSVNKGGNANMPTGEVFTAPHEDSVNGVVVFDKPLSYQGTLI-EGIRLTFEDGK 267 (385)
T ss_pred EeCCCCceeEEEcCCceeeccCCccCCCCCCCCCCcceEeccccCccceEEEECcCCCcCCEee-eceEEEEECCE
Confidence 478988 9999999988775544 346899999999 777677663 34777775444
No 68
>cd01643 Bacterial_IMPase_like_2 Bacterial family of Mg++ dependent phosphatases, related to inositol monophosphatases. These enzymes may dephosphorylate inositol monophosphate or similar substrates.
Probab=23.74 E-value=52 Score=30.09 Aligned_cols=47 Identities=15% Similarity=0.337 Sum_probs=34.4
Q ss_pred CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082 32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS 82 (314)
Q Consensus 32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~ 82 (314)
.||||||.- .+++|.-.|....+-++=|+--.=++-..+.+++|.+.
T Consensus 75 iIDPIDGT~----nF~~g~p~~~vsial~~~g~pv~GvV~~P~~~~~~~A~ 121 (242)
T cd01643 75 VIDPIDGTT----NFARGIPIWAISIALLYRGEPVFGVIALPALNQTFVAF 121 (242)
T ss_pred EEeCccChH----HHhcCCCceEEEEEEEECCEEEEEEEecCCCCCEEEEE
Confidence 389999965 88899988887766665565555566667777887764
No 69
>PRK03298 hypothetical protein; Provisional
Probab=23.59 E-value=5.9e+02 Score=23.56 Aligned_cols=46 Identities=22% Similarity=0.266 Sum_probs=27.0
Q ss_pred CeEEEEEEEEEEccCCccC--ccccceEEEeeee-ccCCCCCCCCCCeE
Q psy2082 138 PCGVTYYVKLFVGENETDR--SHRRSTVALGIRK-IQYAPSKNGRQPCT 183 (314)
Q Consensus 138 ~g~V~Y~Vka~i~~~~~dr--~~K~~~v~l~irk-l~~~p~~~~~~p~~ 183 (314)
.|.|+|+=+|.-.-+..+| ..|.|-..++-.+ -.+-|.+|.++|+.
T Consensus 7 ~C~V~Y~GRa~s~L~~~~RLiiiK~DGsvlVH~d~~~~kPlNW~pp~~~ 55 (224)
T PRK03298 7 RCSVDYVGRLTAHLPSARRLLLVKADGSVSVHADDRAYKPLNWMSPPCT 55 (224)
T ss_pred EEEEEEccceeccCCCCceEEEEccCCeEEEecCCCCCCCcccCCCCeE
Confidence 4888888777633233333 3455533344444 56778888877754
No 70
>PRK05089 cytochrome C oxidase assembly protein; Provisional
Probab=22.54 E-value=1.2e+02 Score=27.21 Aligned_cols=25 Identities=16% Similarity=-0.057 Sum_probs=22.4
Q ss_pred ccCCCeEEEEEEEEccccceeEEEE
Q psy2082 207 YAHNEKIAVSICIRNNSNKVVKKIK 231 (314)
Q Consensus 207 Y~pGE~I~v~v~I~N~Ssk~Vk~Ik 231 (314)
-.|||...+.-.+.|.|++.|....
T Consensus 90 V~pGE~~~~~y~a~N~sd~~i~g~A 114 (188)
T PRK05089 90 VHPGELNLVFYEAENLSDRPIVGQA 114 (188)
T ss_pred EcCCCeEEEEEEEECCCCCcEEEEE
Confidence 3689999999999999999998754
No 71
>TIGR03780 Bac_Flav_CT_N Bacteroides conjugative transposon TraN protein. Members of this family are the TraN protein encoded by transfer region genes of conjugative transposons of Bacteroides. The family is related to conjugative transfer proteins VirB9 and TrbG of Agrobacterium Ti plasmids.
Probab=21.92 E-value=1.2e+02 Score=29.01 Aligned_cols=37 Identities=19% Similarity=0.387 Sum_probs=29.4
Q ss_pred EEEEECCccccCCCeEEEEEEEEccccce--eEEEEEEEE
Q psy2082 198 LEVTLDKQLYAHNEKIAVSICIRNNSNKV--VKKIKAMVQ 235 (314)
Q Consensus 198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~--Vk~Ikv~L~ 235 (314)
+++.| +++|.||+-+-++.++.|+|+-. |.-|++++.
T Consensus 169 i~~~l-~~Iy~~~d~lyf~~~l~N~Sni~ydID~irfki~ 207 (285)
T TIGR03780 169 IQFLL-NGIYTHNDLLYFHTSLENKTNIPFDIDFIRFKIV 207 (285)
T ss_pred cEEEE-EeEEEECCEEEEEEEEEcCCCCCcEeEEEEEEEE
Confidence 44444 47999999999999999999854 887777664
No 72
>KOG2625|consensus
Probab=21.72 E-value=53 Score=30.60 Aligned_cols=28 Identities=29% Similarity=0.290 Sum_probs=24.4
Q ss_pred ccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082 207 YAHNEKIAVSICIRNNSNKVVKKIKAMV 234 (314)
Q Consensus 207 Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L 234 (314)
.+-||+...-++|.|.|+++|+.|.++-
T Consensus 11 iflgetfs~yinv~nds~k~v~~i~lk~ 38 (348)
T KOG2625|consen 11 IFLGETFSFYINVHNDSEKTVKDILLKA 38 (348)
T ss_pred eeeccceEEEEEEecchhhhhhhheeee
Confidence 3569999999999999999999887654
No 73
>PRK10931 adenosine-3'(2'),5'-bisphosphate nucleotidase; Provisional
Probab=21.30 E-value=52 Score=30.08 Aligned_cols=47 Identities=19% Similarity=0.330 Sum_probs=34.8
Q ss_pred CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082 32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS 82 (314)
Q Consensus 32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~ 82 (314)
.||||||.. .+++|.--|....+-++-|+-.+=|+-..+..++|.+.
T Consensus 81 iIDPIDGT~----nF~~g~p~~~vsIal~~~g~p~~GvV~~P~~~~~y~A~ 127 (246)
T PRK10931 81 LVDPLDGTK----EFIKRNGEFTVNIALIEQGKPVLGVVYAPVMNVMYSAA 127 (246)
T ss_pred EEecCcChH----HHHcCCCCEEEEEEEEECCEEEEEEEeecCCCCEEEEE
Confidence 489999965 88899888887666666566555566667777887764
No 74
>PF04442 CtaG_Cox11: Cytochrome c oxidase assembly protein CtaG/Cox11; InterPro: IPR007533 Cytochrome c oxidase assembly protein is essential for the assembly of functional cytochrome oxidase protein. In eukaryotes it is an integral protein of the mitochondrial inner membrane. Cox11 is essential for the insertion of Cu(I) ions to form the CuB site. This is essential for the stability of other structures in subunit I, for example haems a and a3, and the magnesium/manganese centre. Cox11 is probably only required in sub-stoichiometric amounts relative to the structural units []. The C-terminal region of the protein is known to form a dimer. Each monomer coordinates one Cu(I) ion via three conserved cysteine residues (111, 208 and 210) in Saccharomyces cerevisiae (P19516 from SWISSPROT). Met 224 is also thought to play a role in copper transfer or stabilising the copper site [].; GO: 0005507 copper ion binding; PDB: 1SO9_A 1SP0_A.
Probab=20.70 E-value=1e+02 Score=26.83 Aligned_cols=25 Identities=20% Similarity=0.049 Sum_probs=16.7
Q ss_pred ccCCCeEEEEEEEEccccceeEEEE
Q psy2082 207 YAHNEKIAVSICIRNNSNKVVKKIK 231 (314)
Q Consensus 207 Y~pGE~I~v~v~I~N~Ssk~Vk~Ik 231 (314)
-.|||...+.-.+.|.|++.|....
T Consensus 63 V~pGe~~~~~y~a~N~s~~~i~g~A 87 (152)
T PF04442_consen 63 VHPGETALVFYEATNPSDKPITGQA 87 (152)
T ss_dssp EETT--EEEEEEEEE-SSS-EE---
T ss_pred eCCCCEEEEEEEEECCCCCcEEEEE
Confidence 3799999999999999999998754
No 75
>cd08908 START_STARD12-like C-terminal lipid-binding START domain of mammalian STARD12 and related proteins, which also have an N-terminal Rho GTPase-activating protein (RhoGAP) domain. This subgroup includes the steroidogenic acute regulatory protein (StAR)-related lipid transfer (START) domains of STARD12 (also known as DLC-1, Arhgap7, and p122-RhoGAP) and related proteins. It belongs to the START domain family, and in turn to the SRPBCC (START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC) domain superfamily of proteins that bind hydrophobic ligands. SRPBCC domains have a deep hydrophobic ligand-binding pocket. Proteins belonging to this subgroup also have an N-terminal SAM (sterile alpha motif) domain and a RhoGAP domain, and have a SAM-RhoGAP-START domain organization. The precise function of the START domain in this subgroup is unclear.
Probab=20.11 E-value=2.5e+02 Score=25.37 Aligned_cols=36 Identities=19% Similarity=0.222 Sum_probs=26.5
Q ss_pred cceEEEeecCCCcEEEE-------ecceeeEec-CCCceeeeeE
Q psy2082 4 NFKVFKKCSPNGKLTLY-------MGKRDFVDY-VTGVEPIDGV 39 (314)
Q Consensus 4 ~~~vfkk~s~n~~~t~y-------l~~rd~~d~-~~~vd~v~Gv 39 (314)
..+|.++.++|..|.=| +..||||-+ ...-|.-+|-
T Consensus 84 e~~vIe~ld~~~~I~Yy~~~~PwP~~~RD~V~~Rs~~~~~~~g~ 127 (204)
T cd08908 84 DSKVIEILDSQTEIYQYVQNSMAPHPARDYVVLRTWRTNLPKGA 127 (204)
T ss_pred heEeeEecCCCceEEEEEccCCCCCCCcEEEEEEEEEEeCCCCe
Confidence 45788999999988777 568999988 4333444563
No 76
>cd08759 Type_III_cohesin_like Cohesin domain, interaction partner of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. Two specific calcium-dependent interactions between cohesin and dockerin appear to be essential for cellulosome assembly, type I and type II. This subfamily represents type III cohesins and closely related domains.
Probab=20.06 E-value=2.2e+02 Score=24.93 Aligned_cols=35 Identities=17% Similarity=0.196 Sum_probs=26.7
Q ss_pred EEEEECCccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082 198 LEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMV 234 (314)
Q Consensus 198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L 234 (314)
|.+.-+|.-|-+||+|.|.+...+.. .|....+.|
T Consensus 3 l~l~~~k~~~~aGetvti~vkg~~l~--~VnA~~~~l 37 (160)
T cd08759 3 LSLVPSKTTVKAGETITIDLYGQGLK--NVNALGAHI 37 (160)
T ss_pred eEEeccccccCCCCEEEEEEEecccc--eeeeeeeEE
Confidence 55667788899999999999999886 555544443
Done!