Query psy1607
Match_columns 241
No_of_seqs 135 out of 1484
Neff 9.1
Searched_HMMs 46136
Date Fri Aug 16 20:05:45 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1607.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1607hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1427|consensus 100.0 2.1E-32 4.5E-37 221.8 13.3 193 16-217 43-247 (443)
2 COG5184 ATS1 Alpha-tubulin sup 100.0 1.7E-29 3.6E-34 216.4 16.7 192 15-217 88-303 (476)
3 KOG1427|consensus 100.0 1.4E-29 3E-34 205.5 10.7 209 2-217 79-301 (443)
4 COG5184 ATS1 Alpha-tubulin sup 100.0 1.2E-28 2.6E-33 211.1 16.8 208 1-217 144-359 (476)
5 KOG0783|consensus 99.9 1.1E-24 2.4E-29 195.9 9.1 203 4-217 146-354 (1267)
6 KOG1428|consensus 99.9 3.3E-21 7.2E-26 180.0 15.8 204 6-216 586-895 (3738)
7 KOG0783|consensus 99.8 5.2E-21 1.1E-25 172.5 6.9 165 43-216 137-303 (1267)
8 KOG1428|consensus 99.7 6.1E-17 1.3E-21 152.0 14.2 179 23-217 563-838 (3738)
9 PF00415 RCC1: Regulator of ch 99.4 7.1E-13 1.5E-17 82.4 5.5 51 160-213 1-51 (51)
10 PF00415 RCC1: Regulator of ch 99.3 9.6E-13 2.1E-17 81.8 4.0 50 47-98 1-51 (51)
11 PF13540 RCC1_2: Regulator of 99.3 3.6E-12 7.7E-17 69.9 4.2 30 144-173 1-30 (30)
12 PF13540 RCC1_2: Regulator of 99.3 4.5E-12 9.7E-17 69.5 4.1 30 85-114 1-30 (30)
13 KOG0941|consensus 99.3 3.7E-14 8E-19 129.2 -7.1 189 18-219 3-199 (850)
14 KOG0941|consensus 98.9 9.6E-12 2.1E-16 113.7 -7.7 132 74-216 5-143 (850)
15 KOG0943|consensus 95.5 0.0012 2.6E-08 63.8 -3.3 137 27-173 372-509 (3015)
16 KOG3669|consensus 95.4 0.3 6.6E-06 44.4 11.7 107 36-166 190-299 (705)
17 KOG3669|consensus 94.7 0.94 2E-05 41.4 12.6 70 30-107 228-299 (705)
18 PF11725 AvrE: Pathogenicity f 89.3 3.1 6.7E-05 42.8 9.8 135 76-234 696-832 (1774)
19 PF07569 Hira: TUP1-like enhan 88.4 3.6 7.7E-05 33.4 8.2 30 82-111 12-41 (219)
20 smart00706 TECPR Beta propelle 82.4 3.5 7.6E-05 22.6 3.9 24 29-52 8-32 (35)
21 KOG0943|consensus 80.3 2.4 5.2E-05 42.4 4.3 83 82-168 373-455 (3015)
22 smart00706 TECPR Beta propelle 79.2 5.1 0.00011 21.9 3.8 25 83-107 8-33 (35)
23 PF12341 DUF3639: Protein of u 72.2 12 0.00026 19.5 3.9 25 28-52 1-25 (27)
24 PF07569 Hira: TUP1-like enhan 69.7 6.7 0.00014 31.8 3.8 28 142-169 13-40 (219)
25 KOG0315|consensus 69.5 65 0.0014 26.9 11.6 89 6-109 103-196 (311)
26 TIGR01062 parC_Gneg DNA topois 65.0 1.4E+02 0.003 29.0 12.9 132 27-176 523-661 (735)
27 KOG0293|consensus 64.9 67 0.0015 28.7 9.0 30 140-169 439-470 (519)
28 PF06739 SBBP: Beta-propeller 64.0 8.3 0.00018 21.8 2.4 18 93-110 15-32 (38)
29 PRK05560 DNA gyrase subunit A; 63.4 1.6E+02 0.0034 29.0 12.5 125 29-169 537-670 (805)
30 KOG1900|consensus 63.2 1.1E+02 0.0023 31.5 11.0 153 42-217 93-262 (1311)
31 PF11725 AvrE: Pathogenicity f 57.8 28 0.00062 36.3 6.2 114 26-161 700-815 (1774)
32 KOG1900|consensus 56.7 2E+02 0.0044 29.6 11.7 83 82-169 242-340 (1311)
33 PF13418 Kelch_4: Galactose ox 55.1 13 0.00028 21.7 2.3 18 151-168 3-20 (49)
34 TIGR03300 assembly_YfgL outer 49.6 1.2E+02 0.0027 26.1 8.5 15 152-166 362-376 (377)
35 PF03785 Peptidase_C25_C: Pept 48.4 31 0.00066 23.2 3.4 36 81-116 14-50 (81)
36 PF03785 Peptidase_C25_C: Pept 45.6 40 0.00087 22.7 3.6 34 142-175 16-50 (81)
37 KOG0649|consensus 44.5 1.9E+02 0.0041 24.2 8.4 36 28-64 62-98 (325)
38 KOG1408|consensus 42.3 3.3E+02 0.0073 26.5 10.8 102 87-219 137-243 (1080)
39 TIGR01063 gyrA DNA gyrase, A s 42.2 3.5E+02 0.0076 26.7 17.1 129 29-173 535-672 (800)
40 PF11399 DUF3192: Protein of u 40.1 21 0.00046 25.2 1.7 25 149-173 77-101 (102)
41 KOG1034|consensus 39.5 94 0.002 26.9 5.7 58 42-109 323-382 (385)
42 PF01436 NHL: NHL repeat; Int 39.2 57 0.0012 16.7 3.0 19 93-111 4-22 (28)
43 PHA02146 hypothetical protein 39.1 39 0.00085 21.9 2.7 33 88-120 23-56 (86)
44 PLN03215 ascorbic acid mannose 39.1 1.4E+02 0.003 26.4 7.0 62 30-108 161-225 (373)
45 COG4257 Vgb Streptogramin lyas 37.4 2.6E+02 0.0057 23.9 8.7 108 37-174 62-173 (353)
46 COG4257 Vgb Streptogramin lyas 36.9 1.8E+02 0.004 24.8 6.9 70 30-110 94-167 (353)
47 PF07312 DUF1459: Protein of u 36.0 24 0.00052 23.5 1.4 12 103-114 57-69 (84)
48 KOG0291|consensus 32.8 4.8E+02 0.01 25.5 13.5 99 96-216 313-413 (893)
49 PF07646 Kelch_2: Kelch motif; 31.7 47 0.001 19.4 2.2 17 152-169 4-20 (49)
50 PF02239 Cytochrom_D1: Cytochr 30.9 3.7E+02 0.008 23.6 10.1 68 27-109 25-96 (369)
51 PF13854 Kelch_5: Kelch motif 30.5 55 0.0012 18.5 2.3 18 93-111 7-24 (42)
52 PF08887 GAD-like: GAD-like do 28.8 55 0.0012 23.4 2.4 21 150-170 78-98 (109)
53 PLN03215 ascorbic acid mannose 28.5 2.7E+02 0.0059 24.6 7.1 61 85-167 162-225 (373)
54 TIGR01061 parC_Gpos DNA topois 26.6 6.1E+02 0.013 24.8 15.4 134 28-176 530-673 (738)
55 PF08887 GAD-like: GAD-like do 26.6 68 0.0015 22.9 2.5 21 37-57 78-98 (109)
56 PRK02529 petN cytochrome b6-f 26.5 67 0.0014 17.5 1.9 13 163-175 20-32 (33)
57 PF09081 DUF1921: Domain of un 24.1 48 0.001 19.7 1.1 21 146-166 30-50 (51)
58 PF07494 Reg_prop: Two compone 24.1 1.1E+02 0.0023 15.1 2.8 15 94-108 8-22 (24)
59 PF13938 DUF4213: Domain of un 23.7 1.1E+02 0.0025 20.5 3.2 23 26-48 9-31 (87)
60 KOG0289|consensus 23.6 4.1E+02 0.0088 24.1 7.1 65 93-168 350-418 (506)
61 PF03984 DUF346: Repeat of unk 22.3 1.7E+02 0.0037 16.8 3.6 26 37-62 1-26 (40)
62 PF13964 Kelch_6: Kelch motif 21.6 80 0.0017 18.3 1.9 19 93-112 4-22 (50)
63 PF06462 Hyd_WA: Propeller; I 21.3 1.1E+02 0.0023 16.4 2.1 13 40-52 2-14 (32)
64 PHA02713 hypothetical protein; 20.8 6.2E+02 0.014 23.6 8.4 20 37-56 341-360 (557)
65 KOG1408|consensus 20.6 3E+02 0.0065 26.8 6.0 24 139-162 215-242 (1080)
66 PRK14131 N-acetylneuraminic ac 20.0 5.8E+02 0.013 22.1 7.9 17 40-56 214-230 (376)
No 1
>KOG1427|consensus
Probab=100.00 E-value=2.1e-32 Score=221.78 Aligned_cols=193 Identities=25% Similarity=0.376 Sum_probs=165.0
Q ss_pred eEecCeeeeccCCccEEEEecC--CCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCC
Q psy1607 16 VFHMPSRMDHFLDLVVTDIACG--FDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGW 93 (241)
Q Consensus 16 ~~~~p~~i~~~~~~~i~~i~~G--~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~ 93 (241)
+...|.++..+.+.+|+.|++| ..|+++|+-+|++|+||.|..||||+++.. ....|+.|..|...+|++.+||++
T Consensus 43 NL~sphR~~~l~gv~iR~VasG~~aaH~vli~megk~~~wGRNekGQLGhgD~k--~~e~Ptvi~gL~~~~iv~AA~Grn 120 (443)
T KOG1427|consen 43 NLVSPHRLRPLVGVNIRFVASGCAAAHCVLIDMEGKCYTWGRNEKGQLGHGDMK--QRERPTVISGLSKHKIVKAAAGRN 120 (443)
T ss_pred ccccceeccccccceEEEEecccchhhEEEEecccceeecccCccCccCccchh--hccCCchhhhhhhhhHHHHhhccC
Confidence 5668899999999999999977 489999999999999999999999999776 899999999999999999999999
Q ss_pred eEEEEEcCCCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCceEEEecCCcEEEEecCCCCc
Q psy1607 94 HSAARTQDGFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALCDDNSLWGCGWNEYNQ 173 (241)
Q Consensus 94 ~s~~lt~~g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt~~g~v~~wG~n~~gq 173 (241)
|+++||++|.||++|.|++||||+++....+.. |.++-. ....|+.|+||.++++.|...+.|..+|-..|||
T Consensus 121 HTl~ltdtG~v~afGeNK~GQlGlgn~~~~v~s-~~~~~~------~~~~v~~v~cga~ftv~l~~~~si~t~glp~ygq 193 (443)
T KOG1427|consen 121 HTLVLTDTGQVLAFGENKYGQLGLGNAKNEVES-TPLPCV------VSDEVTNVACGADFTVWLSSTESILTAGLPQYGQ 193 (443)
T ss_pred cEEEEecCCcEEEeccccccccccccccccccc-CCCccc------cCccceeeccccceEEEeecccceeecCCccccc
Confidence 999999999999999999999999987665433 333322 2468999999999999999999999999999999
Q ss_pred ccCCCCCCce----------ecceEecccccccCcCeeEEeccCCeeEEEecCC
Q psy1607 174 LSMGCGVSQV----------SVMTRLSVPEHVKNCPIVKLFAEHWNSARLSVPE 217 (241)
Q Consensus 174 lG~~~~~~~~----------~~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~~ 217 (241)
||++.+.... ....+...+..+.+..|+++|||.+|++++.++.
T Consensus 194 lgh~td~~~~~~~~~~~~~~e~~pr~~~i~~~dgvqiv~~acg~nhtvavd~nk 247 (443)
T KOG1427|consen 194 LGHGTDNEFNMKDSSVRLAYEAQPRPKAIASLDGVQIVKVACGTNHTVAVDKNK 247 (443)
T ss_pred cccCcchhhccccccceeeeecCCCccccccccceeeEEEeccCcceeeecCCc
Confidence 9999764331 1122233444667778999999999999998873
No 2
>COG5184 ATS1 Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton]
Probab=99.97 E-value=1.7e-29 Score=216.44 Aligned_cols=192 Identities=24% Similarity=0.382 Sum_probs=153.8
Q ss_pred eeEecCeeeecc--CCccEEEEecCCCEEEEEECCCcEEEEeCCCCCCCCCCCC--------------CCCCCccCeEee
Q psy1607 15 QVFHMPSRMDHF--LDLVVTDIACGFDHCLALTQEGIVFSWGNGSRGQLGYPSS--------------GTDANMEPREIR 78 (241)
Q Consensus 15 ~~~~~p~~i~~~--~~~~i~~i~~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~--------------~~~~~~~p~~v~ 78 (241)
...+.|+..++. +...|++++||..|+++|+++|+||+||.|..|+||.... .......|..++
T Consensus 88 ~~~~~P~~~~~~~~d~~~i~~~acGg~hsl~ld~Dg~lyswG~N~~G~Lgr~~~~~~~~~~~~~~~~~~~~~~~tP~~v~ 167 (476)
T COG5184 88 TKVDRPQLNPFGRIDKASIIKIACGGNHSLGLDHDGNLYSWGDNDDGALGRDIHKDICDQNNDIIDFDDYELESTPFKVP 167 (476)
T ss_pred hcccCceecCcccccceeeEEeecCCceEEeecCCCCEEEeccCcccccccccccccccccccccccchhhcccCCceee
Confidence 457889999987 6789999999999999999999999999999999998762 223466788887
Q ss_pred e----cCCCCeEEEEcCCCeEEEEEcCCCEEEEeCCCCCccCCCCCCC--c--eeeCCccccccccCCCCCCcEEEEEcc
Q psy1607 79 A----LSGLGVVQISAGGWHSAARTQDGFLYAWGWNNGGQVGVEGKLD--L--VVHDPHPMSWSEDFDSQNIKVVDVACG 150 (241)
Q Consensus 79 ~----~~~~~i~~I~~G~~~s~~lt~~g~vy~~G~n~~gqlg~~~~~~--~--~~~~p~~v~~~~~~~~~~~~i~~i~~G 150 (241)
. ....++++++||++++++|+++|+||.||....+.++.+.... + ....|..+. ...|+++++|
T Consensus 168 ~~s~~~s~~~vv~l~cg~e~svil~~~G~V~~~gt~r~~e~~~g~~~~s~k~~~~~~p~~v~--------~~~i~qla~G 239 (476)
T COG5184 168 GGSSAKSHLRVVKLACGWEISVILTADGRVYSWGTFRCGELGQGSYKNSQKTSIQFTPLKVP--------KKAIVQLAAG 239 (476)
T ss_pred ccccccCChheEEeecCCceEEEEccCCcEEEecCccccccccccccccccceeeeeeeecC--------chheeeeccC
Confidence 6 2334899999999999999999999999998888887773322 2 223344443 2589999999
Q ss_pred CCceEEEecCCcEEEEecCCCCcccCCCCCCceecceEecccccccCcCeeEEeccCCeeEEEecCC
Q psy1607 151 SRHTLALCDDNSLWGCGWNEYNQLSMGCGVSQVSVMTRLSVPEHVKNCPIVKLFAEHWNSARLSVPE 217 (241)
Q Consensus 151 ~~h~~~lt~~g~v~~wG~n~~gqlG~~~~~~~~~~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~~ 217 (241)
.+|.++|+++|++|.||++..||||.... +....+..+..+.... .|..|+||.+|+++|.+++
T Consensus 240 ~dh~i~lt~~G~vy~~Gs~qkgqlG~~~~-e~~~~~~lv~~~f~i~--~i~~vacG~~h~~al~~~G 303 (476)
T COG5184 240 ADHLIALTNEGKVYGWGSNQKGQLGRPTS-ERLKLVVLVGDPFAIR--NIKYVACGKDHSLALDEDG 303 (476)
T ss_pred CceEEEEecCCcEEEecCCcccccCCchh-hhcccccccCChhhhh--hhhhcccCcceEEEEcCCC
Confidence 99999999999999999999999998754 3344444443333332 3789999999999999986
No 3
>KOG1427|consensus
Probab=99.96 E-value=1.4e-29 Score=205.45 Aligned_cols=209 Identities=19% Similarity=0.279 Sum_probs=164.3
Q ss_pred CccCCCCccccCc----eeEecCeeeeccCCccEEEEecCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEe
Q psy1607 2 CDHLASGPLKMLS----QVFHMPSRMDHFLDLVVTDIACGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREI 77 (241)
Q Consensus 2 ~~~~~~g~l~~~g----~~~~~p~~i~~~~~~~i~~i~~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v 77 (241)
|=.|+.++-.++| +....|+.|+-++..+|++.+||++|+++||+.|.+|.||.|.+||||.+.... .+..|. +
T Consensus 79 ~~~wGRNekGQLGhgD~k~~e~Ptvi~gL~~~~iv~AA~GrnHTl~ltdtG~v~afGeNK~GQlGlgn~~~-~v~s~~-~ 156 (443)
T KOG1427|consen 79 CYTWGRNEKGQLGHGDMKQRERPTVISGLSKHKIVKAAAGRNHTLVLTDTGQVLAFGENKYGQLGLGNAKN-EVESTP-L 156 (443)
T ss_pred eeecccCccCccCccchhhccCCchhhhhhhhhHHHHhhccCcEEEEecCCcEEEeccccccccccccccc-ccccCC-C
Confidence 4456666655555 455589999999999999999999999999999999999999999999998762 233333 3
Q ss_pred eecCCCCeEEEEcCCCeEEEEEcCCCEEEEeCCCCCccCCCCCCCcee----------eCCccccccccCCCCCCcEEEE
Q psy1607 78 RALSGLGVVQISAGGWHSAARTQDGFLYAWGWNNGGQVGVEGKLDLVV----------HDPHPMSWSEDFDSQNIKVVDV 147 (241)
Q Consensus 78 ~~~~~~~i~~I~~G~~~s~~lt~~g~vy~~G~n~~gqlg~~~~~~~~~----------~~p~~v~~~~~~~~~~~~i~~i 147 (241)
....+..|+.|+||..|++.|+..+.+...|.-.+||||++.+..... ..|++-.+. +.....|+++
T Consensus 157 ~~~~~~~v~~v~cga~ftv~l~~~~si~t~glp~ygqlgh~td~~~~~~~~~~~~~~e~~pr~~~i~---~~dgvqiv~~ 233 (443)
T KOG1427|consen 157 PCVVSDEVTNVACGADFTVWLSSTESILTAGLPQYGQLGHGTDNEFNMKDSSVRLAYEAQPRPKAIA---SLDGVQIVKV 233 (443)
T ss_pred ccccCccceeeccccceEEEeecccceeecCCccccccccCcchhhccccccceeeeecCCCccccc---cccceeeEEE
Confidence 334556899999999999999999999999999999999998765322 112222111 3467789999
Q ss_pred EccCCceEEEecCCcEEEEecCCCCcccCCCCCCceecceEecccccccCcCeeEEeccCCeeEEEecCC
Q psy1607 148 ACGSRHTLALCDDNSLWGCGWNEYNQLSMGCGVSQVSVMTRLSVPEHVKNCPIVKLFAEHWNSARLSVPE 217 (241)
Q Consensus 148 ~~G~~h~~~lt~~g~v~~wG~n~~gqlG~~~~~~~~~~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~~ 217 (241)
+||.+|++++.++++||+||.+.||.||+.+..+ .-.|+.+++++. .+.--..+.||.-.|+.+..-+
T Consensus 234 acg~nhtvavd~nkrVysWGFGGyGRLGHaEqKD-EmvpRlik~Fd~-~~rg~~~~~~g~t~Sl~v~e~G 301 (443)
T KOG1427|consen 234 ACGTNHTVAVDKNKRVYSWGFGGYGRLGHAEQKD-EMVPRLIKVFDR-NNRGPPNAILGYTGSLNVAEGG 301 (443)
T ss_pred eccCcceeeecCCccEEEeccccccccccccchh-hHHHHHHHHhcC-CCCCCcceeeecccceeecccc
Confidence 9999999999999999999999999999987643 556777776643 3334567888888888776653
No 4
>COG5184 ATS1 Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton]
Probab=99.96 E-value=1.2e-28 Score=211.14 Aligned_cols=208 Identities=27% Similarity=0.381 Sum_probs=158.4
Q ss_pred CCccCCCCccccCceeEecCeeeecc----CCccEEEEecCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCC--ccC
Q psy1607 1 MCDHLASGPLKMLSQVFHMPSRMDHF----LDLVVTDIACGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDAN--MEP 74 (241)
Q Consensus 1 ~~~~~~~g~l~~~g~~~~~p~~i~~~----~~~~i~~i~~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~--~~p 74 (241)
||+.=+++.+..+-....+|.+++.. ...++++++||++++++|+++|+||.||....+.++.+...+... ..+
T Consensus 144 ~~~~~~~~~~~~~~~~~~tP~~v~~~s~~~s~~~vv~l~cg~e~svil~~~G~V~~~gt~r~~e~~~g~~~~s~k~~~~~ 223 (476)
T COG5184 144 ICDQNNDIIDFDDYELESTPFKVPGGSSAKSHLRVVKLACGWEISVILTADGRVYSWGTFRCGELGQGSYKNSQKTSIQF 223 (476)
T ss_pred cccccccccccchhhcccCCceeeccccccCChheEEeecCCceEEEEccCCcEEEecCccccccccccccccccceeee
Confidence 45555555566665566788888872 345899999999999999999999999998888887774332233 333
Q ss_pred eEeeecCCCCeEEEEcCCCeEEEEEcCCCEEEEeCCCCCccCCCCCCCce--eeCCccccccccCCCCCCcEEEEEccCC
Q psy1607 75 REIRALSGLGVVQISAGGWHSAARTQDGFLYAWGWNNGGQVGVEGKLDLV--VHDPHPMSWSEDFDSQNIKVVDVACGSR 152 (241)
Q Consensus 75 ~~v~~~~~~~i~~I~~G~~~s~~lt~~g~vy~~G~n~~gqlg~~~~~~~~--~~~p~~v~~~~~~~~~~~~i~~i~~G~~ 152 (241)
+++... ...|+++++|.+|.++|+++|+||.||+|.+||||........ ...|.++.+ ..|+.|+||.+
T Consensus 224 ~p~~v~-~~~i~qla~G~dh~i~lt~~G~vy~~Gs~qkgqlG~~~~e~~~~~~lv~~~f~i--------~~i~~vacG~~ 294 (476)
T COG5184 224 TPLKVP-KKAIVQLAAGADHLIALTNEGKVYGWGSNQKGQLGRPTSERLKLVVLVGDPFAI--------RNIKYVACGKD 294 (476)
T ss_pred eeeecC-chheeeeccCCceEEEEecCCcEEEecCCcccccCCchhhhcccccccCChhhh--------hhhhhcccCcc
Confidence 443332 4679999999999999999999999999999999987654321 122333332 25899999999
Q ss_pred ceEEEecCCcEEEEecCCCCcccCCCCCCceecceEecccccccCcCeeEEeccCCeeEEEecCC
Q psy1607 153 HTLALCDDNSLWGCGWNEYNQLSMGCGVSQVSVMTRLSVPEHVKNCPIVKLFAEHWNSARLSVPE 217 (241)
Q Consensus 153 h~~~lt~~g~v~~wG~n~~gqlG~~~~~~~~~~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~~ 217 (241)
|+++|.++|+||+||.|.+||||.+++......-++........+..|..+++|..|+++|.+++
T Consensus 295 h~~al~~~G~i~a~G~n~fgqlg~~~~~~~~a~~tk~~~~~~~~~~~i~~is~ge~H~l~L~~~G 359 (476)
T COG5184 295 HSLALDEDGEIYAWGVNIFGQLGAGSDGEIGALTTKPNYKQLLSGVTICSISAGESHSLILRKDG 359 (476)
T ss_pred eEEEEcCCCeEEEeccchhcccccCcccccceeeccccccccCCCceEEEEecCcceEEEEecCc
Confidence 99999999999999999999999984433333333444445566667999999999999999995
No 5
>KOG0783|consensus
Probab=99.91 E-value=1.1e-24 Score=195.86 Aligned_cols=203 Identities=26% Similarity=0.383 Sum_probs=168.8
Q ss_pred cCCCCccccCc----eeEecCeeeeccC--CccEEEEecCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEe
Q psy1607 4 HLASGPLKMLS----QVFHMPSRMDHFL--DLVVTDIACGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREI 77 (241)
Q Consensus 4 ~~~~g~l~~~g----~~~~~p~~i~~~~--~~~i~~i~~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v 77 (241)
.||++..+++| +....|..+.++. +.-+.+|+.+..|++++++.|+||.||.+.-|+||++... ..+.|.+|
T Consensus 146 ~wG~N~N~tLGign~~~~~~Pe~Vdlf~~Sg~~~~qV~l~kfHSvfl~~kgqvY~cGhG~GGRlG~gdeq--~~~iPkrV 223 (1267)
T KOG0783|consen 146 GWGTNVNNTLGIGNGKEPSSPERVDLFKTSGQLFSQVQLSKFHSVFLTEKGQVYVCGHGAGGRLGFGDEQ--YNFIPKRV 223 (1267)
T ss_pred EecccccccccccCCCCCCChHHhHHHHhccHHHHHHHHhhceeeEecCCCcEEEeccCCCCccCcCccc--cccccccc
Confidence 46777777776 6777899998874 5678899999999999999999999999999999999887 88999999
Q ss_pred eecCCCCeEEEEcCCCeEEEEEcCCCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCceEEE
Q psy1607 78 RALSGLGVVQISAGGWHSAARTQDGFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLAL 157 (241)
Q Consensus 78 ~~~~~~~i~~I~~G~~~s~~lt~~g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~l 157 (241)
+.+.+-++.+|+....|+++||++|.||+||.|.++|||+.++..+ ...|..+...... .-..|+.|++|..|+++.
T Consensus 224 ~gL~gh~~~qisvs~~HslvLT~~g~Vys~GlN~~hqLG~~~~~~~-~~~p~qI~a~r~k--g~~~iIgvaAg~~hsVaw 300 (1267)
T KOG0783|consen 224 PGLIGHKVIQISVSHTHSLVLTKFGSVYSWGLNGSHQLGLSNDELK-KDDPIQITARRIK--GFKQIIGVAAGKSHSVAW 300 (1267)
T ss_pred ccccccceEEEEeecceeEEEeecceEEEeecCcccccCCcCchhh-cCchhhhhhHhhc--chhhhhhhhcccceeeee
Confidence 9999999999999999999999999999999999999999876553 2345555433211 112799999999999999
Q ss_pred ecCCcEEEEecCCCCcccCCCCCCceecceEecccccccCcCeeEEeccCCeeEEEecCC
Q psy1607 158 CDDNSLWGCGWNEYNQLSMGCGVSQVSVMTRLSVPEHVKNCPIVKLFAEHWNSARLSVPE 217 (241)
Q Consensus 158 t~~g~v~~wG~n~~gqlG~~~~~~~~~~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~~ 217 (241)
|+.. ||+||.| .||||..+....+..|+.+. -...+|..|+|-..-|++++.+.
T Consensus 301 t~~~-VY~wGlN-~GQlGi~~n~~~Vt~Pr~l~----~~~~~v~~v~a~~~ATVc~~~~~ 354 (1267)
T KOG0783|consen 301 TDTD-VYSWGLN-NGQLGISDNISVVTTPRRLA----GLLSPVIHVVATTRATVCLLQNN 354 (1267)
T ss_pred ecce-EEEeccc-CceecCCCCCceeecchhhc----ccccceEEEEecCccEEEEecCC
Confidence 9655 9999998 89999988777777776542 12235899999999999998875
No 6
>KOG1428|consensus
Probab=99.87 E-value=3.3e-21 Score=179.99 Aligned_cols=204 Identities=23% Similarity=0.300 Sum_probs=156.7
Q ss_pred CCCccccCce----eEecCeeeeccCCccEEEEecCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCC------------
Q psy1607 6 ASGPLKMLSQ----VFHMPSRMDHFLDLVVTDIACGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTD------------ 69 (241)
Q Consensus 6 ~~g~l~~~g~----~~~~p~~i~~~~~~~i~~i~~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~------------ 69 (241)
++|++++.|. .-...+.+.-+++.-|.+++.|+.|.++++.+|.||.||.|+.+|+|+-.....
T Consensus 586 enGkifM~G~~tm~~n~SSqmln~L~~~~isslAlGKsH~~av~rNG~l~T~GlNN~~QCGRVEs~sTt~s~~~s~~~e~ 665 (3738)
T KOG1428|consen 586 ENGKIFMGGLHTMRVNVSSQMLNGLDNVMISSLALGKSHGVAVTRNGHLFTWGLNNMNQCGRVESTSTTSSPRHSGRQEY 665 (3738)
T ss_pred cCCeEEeecceeEEecchHHHhhccccceeehhhccccceeEEEeCCeEEEEecCCcccccccccccccCCcccccceee
Confidence 6788887773 223456777789999999999999999999999999999999999987422100
Q ss_pred ------------------------------------------------------------------------------CC
Q psy1607 70 ------------------------------------------------------------------------------AN 71 (241)
Q Consensus 70 ------------------------------------------------------------------------------~~ 71 (241)
..
T Consensus 666 ~iCP~G~HtW~~dt~~VCa~CG~Cs~~GvaC~~~~RP~G~mC~CG~GES~C~~CG~Cr~C~e~tE~~QPG~aqHvQ~~st 745 (3738)
T KOG1428|consen 666 QICPIGEHTWLTDTPSVCAQCGLCSARGVACGRVPRPKGTMCHCGVGESTCLRCGLCRPCGEVTEPAQPGRAQHVQFSST 745 (3738)
T ss_pred cccCCccceeecCCcchhhhcccccccccccccCCCCCCcccccCCCcccceeccccccccCcCCcCCCCHHHhheeccc
Confidence 00
Q ss_pred cc-------CeEeee---cCCCCeEEEEcCCCeEEEEEcCCCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCC
Q psy1607 72 ME-------PREIRA---LSGLGVVQISAGGWHSAARTQDGFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQN 141 (241)
Q Consensus 72 ~~-------p~~v~~---~~~~~i~~I~~G~~~s~~lt~~g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~ 141 (241)
.. |..|.. ..+.++.+|+||..|+++|.+|++||.+|.|.+||||+++...+ ..|+.+.++ .+
T Consensus 746 aa~QR~~~HPs~V~~sq~~Hdvkv~sVSCG~~HtVlL~sd~~VfTFG~~~HGQLG~GDt~Sk--~~Pq~V~~~-----~~ 818 (3738)
T KOG1428|consen 746 AAPQRSTLHPSRVILSQGPHDVKVSSVSCGNFHTVLLASDRRVFTFGSNCHGQLGVGDTLSK--NTPQQVILP-----SD 818 (3738)
T ss_pred ccccccccCchheeeccCCcceeEEEEeccCceEEEEecCCcEEEecCCcccccCcCccccC--CCcceEEcC-----CC
Confidence 00 111110 11347899999999999999999999999999999999998875 578888877 55
Q ss_pred CcEEEEEccCCceEEEecCCcEEEEecCCCCcccCCCCCCc--eecceEecccccccCcCeeEEeccCCeeEEEecC
Q psy1607 142 IKVVDVACGSRHTLALCDDNSLWGCGWNEYNQLSMGCGVSQ--VSVMTRLSVPEHVKNCPIVKLFAEHWNSARLSVP 216 (241)
Q Consensus 142 ~~i~~i~~G~~h~~~lt~~g~v~~wG~n~~gqlG~~~~~~~--~~~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~ 216 (241)
..|++|++|.+|++++..||.||.||.-..|||+....... ...|.+++.+-.-.+.+..-|-+-++.+++-...
T Consensus 819 t~~vQVaAGSNHT~l~~~DGsVFTFGaF~KGQL~RP~~e~~~WNA~Pe~v~~~G~~f~~~A~WIGAdGDss~i~~~t 895 (3738)
T KOG1428|consen 819 TVIVQVAAGSNHTILRANDGSVFTFGAFGKGQLARPAGEKAGWNAIPEKVSGFGPGFNAFAGWIGADGDSSIIHSHT 895 (3738)
T ss_pred CceEEEecCCCceEEEecCCcEEEeccccCccccCccccccccccCCCcCCCCCccccccceeeccCCCcceeehhh
Confidence 68999999999999999999999999999999998643221 3355666655555556677787777776665443
No 7
>KOG0783|consensus
Probab=99.83 E-value=5.2e-21 Score=172.49 Aligned_cols=165 Identities=20% Similarity=0.279 Sum_probs=142.1
Q ss_pred EEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeec--CCCCeEEEEcCCCeEEEEEcCCCEEEEeCCCCCccCCCCC
Q psy1607 43 ALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRAL--SGLGVVQISAGGWHSAARTQDGFLYAWGWNNGGQVGVEGK 120 (241)
Q Consensus 43 ~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~--~~~~i~~I~~G~~~s~~lt~~g~vy~~G~n~~gqlg~~~~ 120 (241)
+++.-+.||+||.|...-||++..+ ....|..|+.+ .+.-+.+|..+.+|+++|++.|.||++|...-|++|++..
T Consensus 137 ~~d~pndvy~wG~N~N~tLGign~~--~~~~Pe~Vdlf~~Sg~~~~qV~l~kfHSvfl~~kgqvY~cGhG~GGRlG~gde 214 (1267)
T KOG0783|consen 137 VLDLPNDVYGWGTNVNNTLGIGNGK--EPSSPERVDLFKTSGQLFSQVQLSKFHSVFLTEKGQVYVCGHGAGGRLGFGDE 214 (1267)
T ss_pred ccCCccceeEecccccccccccCCC--CCCChHHhHHHHhccHHHHHHHHhhceeeEecCCCcEEEeccCCCCccCcCcc
Confidence 6677799999999999999999888 67778877754 4667889999999999999999999999999999999965
Q ss_pred CCceeeCCccccccccCCCCCCcEEEEEccCCceEEEecCCcEEEEecCCCCcccCCCCCCceecceEecccccccCcCe
Q psy1607 121 LDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALCDDNSLWGCGWNEYNQLSMGCGVSQVSVMTRLSVPEHVKNCPI 200 (241)
Q Consensus 121 ~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt~~g~v~~wG~n~~gqlG~~~~~~~~~~~~~~~~~~~~~~~~i 200 (241)
.. ...|++++.+ .+.+|.+|++...|+++||++|.||+||-|..+|||..++......|..+...+......|
T Consensus 215 q~--~~iPkrV~gL-----~gh~~~qisvs~~HslvLT~~g~Vys~GlN~~hqLG~~~~~~~~~~p~qI~a~r~kg~~~i 287 (1267)
T KOG0783|consen 215 QY--NFIPKRVPGL-----IGHKVIQISVSHTHSLVLTKFGSVYSWGLNGSHQLGLSNDELKKDDPIQITARRIKGFKQI 287 (1267)
T ss_pred cc--cccccccccc-----cccceEEEEeecceeEEEeecceEEEeecCcccccCCcCchhhcCchhhhhhHhhcchhhh
Confidence 54 4678888765 6789999999999999999999999999999999999987777777777664433333379
Q ss_pred eEEeccCCeeEEEecC
Q psy1607 201 VKLFAEHWNSARLSVP 216 (241)
Q Consensus 201 ~~vacG~~~s~~lt~~ 216 (241)
+-|+||..|+++.|..
T Consensus 288 IgvaAg~~hsVawt~~ 303 (1267)
T KOG0783|consen 288 IGVAAGKSHSVAWTDT 303 (1267)
T ss_pred hhhhcccceeeeeecc
Confidence 9999999999999876
No 8
>KOG1428|consensus
Probab=99.73 E-value=6.1e-17 Score=152.04 Aligned_cols=179 Identities=21% Similarity=0.308 Sum_probs=129.0
Q ss_pred eeccCCccEEEEecCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCCeEEEEEcCC
Q psy1607 23 MDHFLDLVVTDIACGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGWHSAARTQDG 102 (241)
Q Consensus 23 i~~~~~~~i~~i~~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~~s~~lt~~g 102 (241)
....+..+|+.+-+...-.-++.++|++|..|....- .-...+.+..+.+..|.+++-|..|.++++.+|
T Consensus 563 ~~P~n~rKIv~v~~s~~VY~~vSenGkifM~G~~tm~----------~n~SSqmln~L~~~~isslAlGKsH~~av~rNG 632 (3738)
T KOG1428|consen 563 LVPSNRRKIVHVCASGHVYGYVSENGKIFMGGLHTMR----------VNVSSQMLNGLDNVMISSLALGKSHGVAVTRNG 632 (3738)
T ss_pred cCCCCcceeEEEeeeeEEEEEEccCCeEEeecceeEE----------ecchHHHhhccccceeehhhccccceeEEEeCC
Confidence 3334456777774433333567899999998853210 112334566788889999999999999999999
Q ss_pred CEEEEeCCCCCccCCCCCCCcee---------------------------------------------------------
Q psy1607 103 FLYAWGWNNGGQVGVEGKLDLVV--------------------------------------------------------- 125 (241)
Q Consensus 103 ~vy~~G~n~~gqlg~~~~~~~~~--------------------------------------------------------- 125 (241)
.||.||.|+.+|+|.-.......
T Consensus 633 ~l~T~GlNN~~QCGRVEs~sTt~s~~~s~~~e~~iCP~G~HtW~~dt~~VCa~CG~Cs~~GvaC~~~~RP~G~mC~CG~G 712 (3738)
T KOG1428|consen 633 HLFTWGLNNMNQCGRVESTSTTSSPRHSGRQEYQICPIGEHTWLTDTPSVCAQCGLCSARGVACGRVPRPKGTMCHCGVG 712 (3738)
T ss_pred eEEEEecCCcccccccccccccCCcccccceeecccCCccceeecCCcchhhhcccccccccccccCCCCCCcccccCCC
Confidence 99999999999998642211000
Q ss_pred ---------------------------------eCCcc-------ccccccCCCCCCcEEEEEccCCceEEEecCCcEEE
Q psy1607 126 ---------------------------------HDPHP-------MSWSEDFDSQNIKVVDVACGSRHTLALCDDNSLWG 165 (241)
Q Consensus 126 ---------------------------------~~p~~-------v~~~~~~~~~~~~i~~i~~G~~h~~~lt~~g~v~~ 165 (241)
..|++ +.+.. ..-+.++.+|+||..|+++|-+|++||.
T Consensus 713 ES~C~~CG~Cr~C~e~tE~~QPG~aqHvQ~~staa~QR~~~HPs~V~~sq--~~Hdvkv~sVSCG~~HtVlL~sd~~VfT 790 (3738)
T KOG1428|consen 713 ESTCLRCGLCRPCGEVTEPAQPGRAQHVQFSSTAAPQRSTLHPSRVILSQ--GPHDVKVSSVSCGNFHTVLLASDRRVFT 790 (3738)
T ss_pred cccceeccccccccCcCCcCCCCHHHhheecccccccccccCchheeecc--CCcceeEEEEeccCceEEEEecCCcEEE
Confidence 11111 11111 1124578999999999999999999999
Q ss_pred EecCCCCcccCCCCCCceecceEecccccccCcCeeEEeccCCeeEEEecCC
Q psy1607 166 CGWNEYNQLSMGCGVSQVSVMTRLSVPEHVKNCPIVKLFAEHWNSARLSVPE 217 (241)
Q Consensus 166 wG~n~~gqlG~~~~~~~~~~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~~ 217 (241)
||.|-+||||.++... .+.|+.+-++.+. .|++|++|.+|++++.-++
T Consensus 791 FG~~~HGQLG~GDt~S-k~~Pq~V~~~~~t---~~vQVaAGSNHT~l~~~DG 838 (3738)
T KOG1428|consen 791 FGSNCHGQLGVGDTLS-KNTPQQVILPSDT---VIVQVAAGSNHTILRANDG 838 (3738)
T ss_pred ecCCcccccCcCcccc-CCCcceEEcCCCC---ceEEEecCCCceEEEecCC
Confidence 9999999999998854 4556666666554 4999999999999998775
No 9
>PF00415 RCC1: Regulator of chromosome condensation (RCC1) repeat; InterPro: IPR000408 The regulator of chromosome condensation (RCC1) [] is a eukaryotic protein which binds to chromatin and interacts with ran, a nuclear GTP-binding protein IPR002041 from INTERPRO, to promote the loss of bound GDP and the uptake of fresh GTP, thus acting as a guanine-nucleotide dissociation stimulator (GDS). The interaction of RCC1 with ran probably plays an important role in the regulation of gene expression. RCC1, known as PRP20 or SRM1 in yeast, pim1 in fission yeast and BJ1 in Drosophila, is a protein that contains seven tandem repeats of a domain of about 50 to 60 amino acids. As shown in the following schematic representation, the repeats make up the major part of the length of the protein. Outside the repeat region, there is just a small N-terminal domain of about 40 to 50 residues and, in the Drosophila protein only, a C-terminal domain of about 130 residues. +----+-------+-------+-------+-------+-------+-------+-------+-------------+ |N-t.|Rpt. 1 |Rpt. 2 |Rpt. 3 |Rpt. 4 |Rpt. 5 |Rpt. 6 |Rpt. 7 | C-terminal | +----+-------+-------+-------+-------+-------+-------+-------+-------------+ The RCC1-type of repeat is also found in the X-linked retinitis pigmentosa GTPase regulator []. The RCC repeats form a beta-propeller structure.; PDB: 3MVD_L 3OF7_A 1I2M_D 1A12_B 3KCI_A 4DNV_B 4DNU_A 4D9S_B 4DNW_A.
Probab=99.39 E-value=7.1e-13 Score=82.40 Aligned_cols=51 Identities=27% Similarity=0.358 Sum_probs=43.7
Q ss_pred CCcEEEEecCCCCcccCCCCCCceecceEecccccccCcCeeEEeccCCeeEEE
Q psy1607 160 DNSLWGCGWNEYNQLSMGCGVSQVSVMTRLSVPEHVKNCPIVKLFAEHWNSARL 213 (241)
Q Consensus 160 ~g~v~~wG~n~~gqlG~~~~~~~~~~~~~~~~~~~~~~~~i~~vacG~~~s~~l 213 (241)
||+||+||+|.+||||..........|+++..+.. .+|++|+||.+||++|
T Consensus 1 dG~vy~wG~n~~GqLG~~~~~~~~~~P~~v~~~~~---~~i~~va~G~~ht~~l 51 (51)
T PF00415_consen 1 DGRVYSWGSNDYGQLGSGGDNKNVSVPTKVPFLSG---VRIVQVACGSDHTLAL 51 (51)
T ss_dssp TSEEEEEEEETTSTTSSSSSSSEEEEEEEEGGGTT---SEEEEEEEESSEEEEE
T ss_pred CCcEEEEECCCCCCCCCCCCCCceeEEEEECCCCC---CCEEEEEeCcceEEEC
Confidence 68999999999999994455577888888887766 5699999999999987
No 10
>PF00415 RCC1: Regulator of chromosome condensation (RCC1) repeat; InterPro: IPR000408 The regulator of chromosome condensation (RCC1) [] is a eukaryotic protein which binds to chromatin and interacts with ran, a nuclear GTP-binding protein IPR002041 from INTERPRO, to promote the loss of bound GDP and the uptake of fresh GTP, thus acting as a guanine-nucleotide dissociation stimulator (GDS). The interaction of RCC1 with ran probably plays an important role in the regulation of gene expression. RCC1, known as PRP20 or SRM1 in yeast, pim1 in fission yeast and BJ1 in Drosophila, is a protein that contains seven tandem repeats of a domain of about 50 to 60 amino acids. As shown in the following schematic representation, the repeats make up the major part of the length of the protein. Outside the repeat region, there is just a small N-terminal domain of about 40 to 50 residues and, in the Drosophila protein only, a C-terminal domain of about 130 residues. +----+-------+-------+-------+-------+-------+-------+-------+-------------+ |N-t.|Rpt. 1 |Rpt. 2 |Rpt. 3 |Rpt. 4 |Rpt. 5 |Rpt. 6 |Rpt. 7 | C-terminal | +----+-------+-------+-------+-------+-------+-------+-------+-------------+ The RCC1-type of repeat is also found in the X-linked retinitis pigmentosa GTPase regulator []. The RCC repeats form a beta-propeller structure.; PDB: 3MVD_L 3OF7_A 1I2M_D 1A12_B 3KCI_A 4DNV_B 4DNU_A 4D9S_B 4DNW_A.
Probab=99.35 E-value=9.6e-13 Score=81.80 Aligned_cols=50 Identities=36% Similarity=0.611 Sum_probs=45.8
Q ss_pred CCcEEEEeCCCCCCCC-CCCCCCCCCccCeEeeecCCCCeEEEEcCCCeEEEE
Q psy1607 47 EGIVFSWGNGSRGQLG-YPSSGTDANMEPREIRALSGLGVVQISAGGWHSAAR 98 (241)
Q Consensus 47 ~g~v~~~G~n~~gqlg-~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~~s~~l 98 (241)
+|+||+||.|.+|||| ..... ....|++++.+.+.+|++|+||.+|+++|
T Consensus 1 dG~vy~wG~n~~GqLG~~~~~~--~~~~P~~v~~~~~~~i~~va~G~~ht~~l 51 (51)
T PF00415_consen 1 DGRVYSWGSNDYGQLGSGGDNK--NVSVPTKVPFLSGVRIVQVACGSDHTLAL 51 (51)
T ss_dssp TSEEEEEEEETTSTTSSSSSSS--EEEEEEEEGGGTTSEEEEEEEESSEEEEE
T ss_pred CCcEEEEECCCCCCCCCCCCCC--ceeEEEEECCCCCCCEEEEEeCcceEEEC
Confidence 6899999999999999 55555 88999999999999999999999999987
No 11
>PF13540 RCC1_2: Regulator of chromosome condensation (RCC1) repeat; PDB: 3QI0_D 1JTD_B 3QHY_B.
Probab=99.30 E-value=3.6e-12 Score=69.87 Aligned_cols=30 Identities=47% Similarity=0.761 Sum_probs=26.1
Q ss_pred EEEEEccCCceEEEecCCcEEEEecCCCCc
Q psy1607 144 VVDVACGSRHTLALCDDNSLWGCGWNEYNQ 173 (241)
Q Consensus 144 i~~i~~G~~h~~~lt~~g~v~~wG~n~~gq 173 (241)
|++|+||.+|+++|+++|+||+||.|.+||
T Consensus 1 V~~ia~G~~ht~al~~~g~v~~wG~n~~GQ 30 (30)
T PF13540_consen 1 VVQIACGGYHTCALTSDGEVYCWGDNNYGQ 30 (30)
T ss_dssp EEEEEEESSEEEEEE-TTEEEEEE--TTST
T ss_pred CEEEEecCCEEEEEEcCCCEEEEcCCcCCC
Confidence 789999999999999999999999999998
No 12
>PF13540 RCC1_2: Regulator of chromosome condensation (RCC1) repeat; PDB: 3QI0_D 1JTD_B 3QHY_B.
Probab=99.29 E-value=4.5e-12 Score=69.47 Aligned_cols=30 Identities=60% Similarity=0.991 Sum_probs=25.9
Q ss_pred eEEEEcCCCeEEEEEcCCCEEEEeCCCCCc
Q psy1607 85 VVQISAGGWHSAARTQDGFLYAWGWNNGGQ 114 (241)
Q Consensus 85 i~~I~~G~~~s~~lt~~g~vy~~G~n~~gq 114 (241)
|++|+||.+|+++|+++|+||+||.|.+||
T Consensus 1 V~~ia~G~~ht~al~~~g~v~~wG~n~~GQ 30 (30)
T PF13540_consen 1 VVQIACGGYHTCALTSDGEVYCWGDNNYGQ 30 (30)
T ss_dssp EEEEEEESSEEEEEE-TTEEEEEE--TTST
T ss_pred CEEEEecCCEEEEEEcCCCEEEEcCCcCCC
Confidence 689999999999999999999999999987
No 13
>KOG0941|consensus
Probab=99.28 E-value=3.7e-14 Score=129.22 Aligned_cols=189 Identities=25% Similarity=0.304 Sum_probs=140.5
Q ss_pred ecCeeeeccCCccEEEEecCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCCeEEE
Q psy1607 18 HMPSRMDHFLDLVVTDIACGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGWHSAA 97 (241)
Q Consensus 18 ~~p~~i~~~~~~~i~~i~~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~~s~~ 97 (241)
+.|+++..+...+|.+++||.+|+++++..|+++.||.|.+||+|.+... .... |.+++.+.+.+..+|++|..|+++
T Consensus 3 ~~~~~~~~l~~k~~lq~~cGn~hclal~~~g~~~~wg~~~~g~~~~~~~~-~~~~-p~~~~sl~g~p~a~v~~g~~hs~~ 80 (850)
T KOG0941|consen 3 RAPRLVLILNYKHILQVGCGNNHCLALSCAGELFVWGMNNNGQLGRALYF-PDAK-PEPVESLKGVPLAQVSAGEAHSFA 80 (850)
T ss_pred chhHHHHHHhhhhhhhhccccHHHHhhhccCCeeeccCCccchhhhhccC-CCCC-CccchhhcCCcHHHHhcCCCcchh
Confidence 45666677777789999999999999999999999999999999999444 2333 999999999999999999999877
Q ss_pred EEc-------CCCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCceEEEec-CCcEEEEecC
Q psy1607 98 RTQ-------DGFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALCD-DNSLWGCGWN 169 (241)
Q Consensus 98 lt~-------~g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt~-~g~v~~wG~n 169 (241)
+.. +|+++.+|....+|+|+...... ..|..+.-. ...++.+|+||..|+.+... -|++|.+|.+
T Consensus 81 lS~~~~~lt~e~~~fs~Ga~~~~q~~h~~~~~~--~~~~~v~e~-----i~~~~t~ia~~~~ht~a~v~~l~qsf~~~~~ 153 (850)
T KOG0941|consen 81 LSSHTVLLTDEGKVFSFGAGSTGQLGHSLTENE--VLPLLVLEL-----IGSRVTRIACVRGHTLAIVPRLGQSFSFGKG 153 (850)
T ss_pred hhhchhhcchhccccccCCcccccccccccccc--cccHHHHHH-----HhhhhHHHHHHHHHHHhhhhhhcceeecccC
Confidence 666 99999999999999999544442 345554332 35689999999999987664 6899999998
Q ss_pred CCCcccCCCCCCceecceEecccccccCcCeeEEeccCCeeEEEecCCCC
Q psy1607 170 EYNQLSMGCGVSQVSVMTRLSVPEHVKNCPIVKLFAEHWNSARLSVPEHA 219 (241)
Q Consensus 170 ~~gqlG~~~~~~~~~~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~~~~ 219 (241)
..|.. .- .....+......+......+..+++|++.++.+......
T Consensus 154 ~sGk~-~i---~s~s~~~~l~~~d~~~~~~~~~~~~g~dq~~~l~~~~~~ 199 (850)
T KOG0941|consen 154 ASGKG-VI---VSLSGEDLLRDHDSEKDHRCSLAFAGGDQTFSLSSKGEN 199 (850)
T ss_pred CCCCc-ee---eccchhhhcccccHHHHHHHHHHhcCCCceEEEEeeccc
Confidence 87710 00 001111111222222223466789999999999887543
No 14
>KOG0941|consensus
Probab=98.93 E-value=9.6e-12 Score=113.71 Aligned_cols=132 Identities=20% Similarity=0.318 Sum_probs=102.2
Q ss_pred CeEeeecCCCCeEEEEcCCCeEEEEEcCCCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCc
Q psy1607 74 PREIRALSGLGVVQISAGGWHSAARTQDGFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRH 153 (241)
Q Consensus 74 p~~v~~~~~~~i~~I~~G~~~s~~lt~~g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h 153 (241)
|..+..+...++.+++||.+|+++++..|++++||.|.+||+|.+..... ..|.++..+ .+.+..+|++|.+|
T Consensus 5 ~~~~~~l~~k~~lq~~cGn~hclal~~~g~~~~wg~~~~g~~~~~~~~~~--~~p~~~~sl-----~g~p~a~v~~g~~h 77 (850)
T KOG0941|consen 5 PRLVLILNYKHILQVGCGNNHCLALSCAGELFVWGMNNNGQLGRALYFPD--AKPEPVESL-----KGVPLAQVSAGEAH 77 (850)
T ss_pred hHHHHHHhhhhhhhhccccHHHHhhhccCCeeeccCCccchhhhhccCCC--CCCccchhh-----cCCcHHHHhcCCCc
Confidence 34444455568999999999999999999999999999999999844443 237777755 67789999999998
Q ss_pred eEEEec-------CCcEEEEecCCCCcccCCCCCCceecceEecccccccCcCeeEEeccCCeeEEEecC
Q psy1607 154 TLALCD-------DNSLWGCGWNEYNQLSMGCGVSQVSVMTRLSVPEHVKNCPIVKLFAEHWNSARLSVP 216 (241)
Q Consensus 154 ~~~lt~-------~g~v~~wG~n~~gqlG~~~~~~~~~~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~ 216 (241)
++++.. .|.++.+|+...||+|+........ |.. ..+.-+..+..++||-.|++++..+
T Consensus 78 s~~lS~~~~~lt~e~~~fs~Ga~~~~q~~h~~~~~~~~-~~~---v~e~i~~~~t~ia~~~~ht~a~v~~ 143 (850)
T KOG0941|consen 78 SFALSSHTVLLTDEGKVFSFGAGSTGQLGHSLTENEVL-PLL---VLELIGSRVTRIACVRGHTLAIVPR 143 (850)
T ss_pred chhhhhchhhcchhccccccCCcccccccccccccccc-cHH---HHHHHhhhhHHHHHHHHHHHhhhhh
Confidence 877665 9999999999999999954433222 111 2234455689999999999888665
No 15
>KOG0943|consensus
Probab=95.45 E-value=0.0012 Score=63.84 Aligned_cols=137 Identities=20% Similarity=0.268 Sum_probs=96.6
Q ss_pred CCccEEEEecCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEe-eecCCCCeEEEEcCCCeEEEEEcCCCEE
Q psy1607 27 LDLVVTDIACGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREI-RALSGLGVVQISAGGWHSAARTQDGFLY 105 (241)
Q Consensus 27 ~~~~i~~i~~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v-~~~~~~~i~~I~~G~~~s~~lt~~g~vy 105 (241)
...+++.|.+-.+..++|..+|++|.|-+...--+-.+.........|... -.+.+.+|+.+++..--.-++|++|.|-
T Consensus 372 dan~~I~I~A~s~el~AlhrkGelYqWaWdESEglddplai~kn~dHPd~a~iG~hge~ii~lSanniR~si~T~nghla 451 (3015)
T KOG0943|consen 372 DANKFICIGALSSELLALHRKGELYQWAWDESEGLDDPLAINKNLDHPDAAFIGLHGEKIILLSANNIRASIATENGHLA 451 (3015)
T ss_pred CCCeeEEeehhHHHHHHHhhCCceeeeecccccCCCChhhcccCCCCCccceecccCCeeEEeecCceeeeeeecCCchh
Confidence 346788888888899999999999999987665555443332233334322 2467889999999999999999999999
Q ss_pred EEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCceEEEecCCcEEEEecCCCCc
Q psy1607 106 AWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALCDDNSLWGCGWNEYNQ 173 (241)
Q Consensus 106 ~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt~~g~v~~wG~n~~gq 173 (241)
.|=+ .+|.+..........+.+.. .+..+++.-|-..|+++.-+++.+|=||--.+-|
T Consensus 452 sWlD----EcgagV~fkLa~ea~Tkiee------d~~maVqd~~~adhlaAf~~dniihWcGiVPf~e 509 (3015)
T KOG0943|consen 452 SWLD----ECGAGVAFKLAHEAQTKIEE------DGEMAVQDHCCADHLAAFLEDNIIHWCGIVPFSE 509 (3015)
T ss_pred hHHh----hhhhhhhhhhhhhhhhhhhh------hhHHHHHHHHHHHHHHHHhhhceeeEEeeeeehh
Confidence 9943 44444443322222333332 3456777778888999999999999999755444
No 16
>KOG3669|consensus
Probab=95.44 E-value=0.3 Score=44.44 Aligned_cols=107 Identities=23% Similarity=0.222 Sum_probs=70.0
Q ss_pred cCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecC-CCCeEEEEcCC-CeEEEEEcCCCEEE-EeCCCC
Q psy1607 36 CGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALS-GLGVVQISAGG-WHSAARTQDGFLYA-WGWNNG 112 (241)
Q Consensus 36 ~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~-~~~i~~I~~G~-~~s~~lt~~g~vy~-~G~n~~ 112 (241)
.|..-..+|+.+|++|- +-|..... +.-...+... ...+.+|++|. .-..+|+.+|.||. -|-..+
T Consensus 190 ~g~~~awAI~s~Gd~y~-------RtGvs~~~----P~GraW~~i~~~t~L~qISagPtg~VwAvt~nG~vf~R~GVsRq 258 (705)
T KOG3669|consen 190 LGDDTAWAIRSSGDLYL-------RTGVSVDR----PCGRAWKVICPYTDLSQISAGPTGVVWAVTENGAVFYREGVSRQ 258 (705)
T ss_pred CCceEEEEEecCCcEEE-------eccccCCC----CCCceeeecCCCCccceEeecCcceEEEEeeCCcEEEEeccccc
Confidence 56677788899999884 12222211 1222222222 22588999999 56669999999874 477776
Q ss_pred CccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCceEEEecCCcEEEE
Q psy1607 113 GQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALCDDNSLWGC 166 (241)
Q Consensus 113 gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt~~g~v~~w 166 (241)
.+.|..=. .++.+.... .++.|+.|.....+|+.+|++|.-
T Consensus 259 Np~GdsWk-----dI~tP~~a~--------~~v~iSvGt~t~Waldndg~lwfr 299 (705)
T KOG3669|consen 259 NPEGDSWK-----DIVTPRQAL--------EPVCISVGTQTLWALDNDGNLWFR 299 (705)
T ss_pred CCCCchhh-----hccCccccc--------ceEEEEeccceEEEEecCCcEEEE
Confidence 67664322 123333332 499999999999999999999863
No 17
>KOG3669|consensus
Probab=94.70 E-value=0.94 Score=41.40 Aligned_cols=70 Identities=27% Similarity=0.220 Sum_probs=53.5
Q ss_pred cEEEEecCC-CEEEEEECCCcEE-EEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCCeEEEEEcCCCEEEE
Q psy1607 30 VVTDIACGF-DHCLALTQEGIVF-SWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGWHSAARTQDGFLYAW 107 (241)
Q Consensus 30 ~i~~i~~G~-~h~~~lt~~g~v~-~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~~s~~lt~~g~vy~~ 107 (241)
.+.+|++|. .-..+++.+|+|| ..|-..+++.|..-. ..+++...++ ++.|+.|....-+||.+|++|.-
T Consensus 228 ~L~qISagPtg~VwAvt~nG~vf~R~GVsRqNp~GdsWk-----dI~tP~~a~~---~v~iSvGt~t~Waldndg~lwfr 299 (705)
T KOG3669|consen 228 DLSQISAGPTGVVWAVTENGAVFYREGVSRQNPEGDSWK-----DIVTPRQALE---PVCISVGTQTLWALDNDGNLWFR 299 (705)
T ss_pred ccceEeecCcceEEEEeeCCcEEEEecccccCCCCchhh-----hccCcccccc---eEEEEeccceEEEEecCCcEEEE
Confidence 589999999 8888999999976 567677777665432 2333333322 89999998888899999999975
No 18
>PF11725 AvrE: Pathogenicity factor; InterPro: IPR021085 This family is secreted by Gram-negative Gammaproteobacteria such as Pseudomonas syringae of tomato and Erwinia amylovora (Fire blight bacteria), amongst others. It is an essential pathogenicity factor of approximately 198 kDa. Its injection into the host-plant is dependent upon the bacterial type III or Hrp secretion system []. The family is long and carries a number of predicted functional regions, including an ERMS or endoplasmic reticulum membrane retention signal at both the C- and the N-termini, a leucine-zipper motif from residues 539-560, and a nuclear localisation signal at 1358-1361. This conserved AvrE-family of effectors is among the few that are required for full virulence of many phytopathogenic pseudomonads, erwinias and pantoeas [].
Probab=89.32 E-value=3.1 Score=42.83 Aligned_cols=135 Identities=17% Similarity=0.254 Sum_probs=86.2
Q ss_pred EeeecCCCCeEEEE-cCCCeEEEEEcCCCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCce
Q psy1607 76 EIRALSGLGVVQIS-AGGWHSAARTQDGFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHT 154 (241)
Q Consensus 76 ~v~~~~~~~i~~I~-~G~~~s~~lt~~g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~ 154 (241)
.|..+.+..|..++ .+.++.++|+++|++-+.- +- ..|..+... -....|++|++-..|.
T Consensus 696 ~l~Gl~~~~i~a~Avv~~~~fvald~qg~lt~h~--k~-------------g~p~~l~~~----gl~G~ik~l~lD~~~n 756 (1774)
T PF11725_consen 696 ALEGLEDRVITAFAVVNDNKFVALDDQGDLTAHQ--KP-------------GRPVPLSRP----GLSGEIKDLALDEKQN 756 (1774)
T ss_pred cccCCCcCcceeEEEEcCCceEEeccCCcccccc--CC-------------CCCccCCCC----CCCcchhheeeccccc
Confidence 34455555666665 4778999999999876531 10 114444332 1235799999998865
Q ss_pred -EEEecCCcEEEEecCCCCcccCCCCCCceecceEecccccccCcCeeEEeccCCeeEEEecCCCCCCCceeEEeccccc
Q psy1607 155 -LALCDDNSLWGCGWNEYNQLSMGCGVSQVSVMTRLSVPEHVKNCPIVKLFAEHWNSARLSVPEHAKDSSIVKLFAGHWN 233 (241)
Q Consensus 155 -~~lt~~g~v~~wG~n~~gqlG~~~~~~~~~~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~~~~~~~~~~~~~~~~~~ 233 (241)
+++|.+|+||.-=.-.+.+.-.++ .....|+++.++.+ .+|..+....+|.+.++..+.....+.-..-.+.|.
T Consensus 757 L~Alt~~G~Lf~~~k~~WQ~~~~~~--~~~~~W~~v~lP~~---~~v~~l~~~~~~~l~~~~~d~~~~~~~~qlk~~~W~ 831 (1774)
T PF11725_consen 757 LYALTSTGELFRLPKEAWQGNAEGD--QMAAKWQKVALPDE---QPVKSLRTNDDNHLSAQIEDGANGQGLHQLKAGEWQ 831 (1774)
T ss_pred eeEecCCCceeecCHHHhhCcccCC--ccccCceeccCCCC---CchhhhhcCCCCceEEEecCCCcchhhhhhhccccc
Confidence 699999999985443333332221 12357888887744 359999999999999998875333333344555665
Q ss_pred c
Q psy1607 234 S 234 (241)
Q Consensus 234 s 234 (241)
.
T Consensus 832 ~ 832 (1774)
T PF11725_consen 832 A 832 (1774)
T ss_pred c
Confidence 4
No 19
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=88.44 E-value=3.6 Score=33.36 Aligned_cols=30 Identities=30% Similarity=0.485 Sum_probs=25.9
Q ss_pred CCCeEEEEcCCCeEEEEEcCCCEEEEeCCC
Q psy1607 82 GLGVVQISAGGWHSAARTQDGFLYAWGWNN 111 (241)
Q Consensus 82 ~~~i~~I~~G~~~s~~lt~~g~vy~~G~n~ 111 (241)
+.++..+.|-..+-++||++|.+|+|--..
T Consensus 12 gs~~~~l~~~~~~Ll~iT~~G~l~vWnl~~ 41 (219)
T PF07569_consen 12 GSPVSFLECNGSYLLAITSSGLLYVWNLKK 41 (219)
T ss_pred CCceEEEEeCCCEEEEEeCCCeEEEEECCC
Confidence 357888999999999999999999997654
No 20
>smart00706 TECPR Beta propeller repeats in Physarum polycephalum tectonins, Limulus lectin L-6 and animal hypothetical proteins.
Probab=82.40 E-value=3.5 Score=22.62 Aligned_cols=24 Identities=17% Similarity=0.369 Sum_probs=21.5
Q ss_pred ccEEEEecCC-CEEEEEECCCcEEE
Q psy1607 29 LVVTDIACGF-DHCLALTQEGIVFS 52 (241)
Q Consensus 29 ~~i~~i~~G~-~h~~~lt~~g~v~~ 52 (241)
-++++|++|. +...+++.+|.+|.
T Consensus 8 g~l~~isvg~~~~vW~V~~~g~i~~ 32 (35)
T smart00706 8 GELVQVSVGPSDTVWAVNSDGNIYR 32 (35)
T ss_pred CCEEEEEECCCCeEEEEcCCCCEEE
Confidence 4689999999 99999999999995
No 21
>KOG0943|consensus
Probab=80.30 E-value=2.4 Score=42.41 Aligned_cols=83 Identities=13% Similarity=0.151 Sum_probs=58.2
Q ss_pred CCCeEEEEcCCCeEEEEEcCCCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCceEEEecCC
Q psy1607 82 GLGVVQISAGGWHSAARTQDGFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALCDDN 161 (241)
Q Consensus 82 ~~~i~~I~~G~~~s~~lt~~g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt~~g 161 (241)
..+++.|.+-++-.++|..+|++|.|-+....-+-......+...-|..-.+ ..-+.+|+.+++..-..-++|++|
T Consensus 373 an~~I~I~A~s~el~AlhrkGelYqWaWdESEglddplai~kn~dHPd~a~i----G~hge~ii~lSanniR~si~T~ng 448 (3015)
T KOG0943|consen 373 ANKFICIGALSSELLALHRKGELYQWAWDESEGLDDPLAINKNLDHPDAAFI----GLHGEKIILLSANNIRASIATENG 448 (3015)
T ss_pred CCeeEEeehhHHHHHHHhhCCceeeeecccccCCCChhhcccCCCCCcccee----cccCCeeEEeecCceeeeeeecCC
Confidence 4578899888888999999999999998765444332222111112222211 124578999999998999999999
Q ss_pred cEEEEec
Q psy1607 162 SLWGCGW 168 (241)
Q Consensus 162 ~v~~wG~ 168 (241)
.|-+|=.
T Consensus 449 hlasWlD 455 (3015)
T KOG0943|consen 449 HLASWLD 455 (3015)
T ss_pred chhhHHh
Confidence 9999843
No 22
>smart00706 TECPR Beta propeller repeats in Physarum polycephalum tectonins, Limulus lectin L-6 and animal hypothetical proteins.
Probab=79.16 E-value=5.1 Score=21.93 Aligned_cols=25 Identities=32% Similarity=0.278 Sum_probs=21.3
Q ss_pred CCeEEEEcCC-CeEEEEEcCCCEEEE
Q psy1607 83 LGVVQISAGG-WHSAARTQDGFLYAW 107 (241)
Q Consensus 83 ~~i~~I~~G~-~~s~~lt~~g~vy~~ 107 (241)
..+++|++|. ...-+++.+|.||..
T Consensus 8 g~l~~isvg~~~~vW~V~~~g~i~~r 33 (35)
T smart00706 8 GELVQVSVGPSDTVWAVNSDGNIYRR 33 (35)
T ss_pred CCEEEEEECCCCeEEEEcCCCCEEEE
Confidence 4689999999 777799999999964
No 23
>PF12341 DUF3639: Protein of unknown function (DUF3639) ; InterPro: IPR022100 This domain family is found in eukaryotes, and is approximately 30 amino acids in length. The family is found in association with PF00400 from PFAM. There are two completely conserved residues (E and R) that may be functionally important.
Probab=72.22 E-value=12 Score=19.53 Aligned_cols=25 Identities=20% Similarity=0.134 Sum_probs=20.8
Q ss_pred CccEEEEecCCCEEEEEECCCcEEE
Q psy1607 28 DLVVTDIACGFDHCLALTQEGIVFS 52 (241)
Q Consensus 28 ~~~i~~i~~G~~h~~~lt~~g~v~~ 52 (241)
+++|..|++|.....+.|+.+-|-.
T Consensus 1 gE~i~aia~g~~~vavaTS~~~lRi 25 (27)
T PF12341_consen 1 GEEIEAIAAGDSWVAVATSAGYLRI 25 (27)
T ss_pred CceEEEEEccCCEEEEEeCCCeEEe
Confidence 3579999999999999998886643
No 24
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=69.70 E-value=6.7 Score=31.77 Aligned_cols=28 Identities=18% Similarity=0.218 Sum_probs=24.4
Q ss_pred CcEEEEEccCCceEEEecCCcEEEEecC
Q psy1607 142 IKVVDVACGSRHTLALCDDNSLWGCGWN 169 (241)
Q Consensus 142 ~~i~~i~~G~~h~~~lt~~g~v~~wG~n 169 (241)
.++..+.|-..+.+++|++|.+|+|=-.
T Consensus 13 s~~~~l~~~~~~Ll~iT~~G~l~vWnl~ 40 (219)
T PF07569_consen 13 SPVSFLECNGSYLLAITSSGLLYVWNLK 40 (219)
T ss_pred CceEEEEeCCCEEEEEeCCCeEEEEECC
Confidence 5788899988999999999999999543
No 25
>KOG0315|consensus
Probab=69.54 E-value=65 Score=26.85 Aligned_cols=89 Identities=17% Similarity=0.273 Sum_probs=48.7
Q ss_pred CCCccccCc-eeEecCeeeeccCCccEEEEecCCC--EEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCC
Q psy1607 6 ASGPLKMLS-QVFHMPSRMDHFLDLVVTDIACGFD--HCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSG 82 (241)
Q Consensus 6 ~~g~l~~~g-~~~~~p~~i~~~~~~~i~~i~~G~~--h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~ 82 (241)
+||...-|. +....++.+.+. ..|..|..-.+ +-|.-+++|+|+.|-.... ... ....|.. +
T Consensus 103 eDgt~kIWdlR~~~~qR~~~~~--spVn~vvlhpnQteLis~dqsg~irvWDl~~~------~c~--~~liPe~-----~ 167 (311)
T KOG0315|consen 103 EDGTVKIWDLRSLSCQRNYQHN--SPVNTVVLHPNQTELISGDQSGNIRVWDLGEN------SCT--HELIPED-----D 167 (311)
T ss_pred CCceEEEEeccCcccchhccCC--CCcceEEecCCcceEEeecCCCcEEEEEccCC------ccc--cccCCCC-----C
Confidence 444444443 333333444443 35666665554 4455578899999985322 111 1112211 2
Q ss_pred CCeEEEEcCCCe--EEEEEcCCCEEEEeC
Q psy1607 83 LGVVQISAGGWH--SAARTQDGFLYAWGW 109 (241)
Q Consensus 83 ~~i~~I~~G~~~--s~~lt~~g~vy~~G~ 109 (241)
..|.+++-..+- .++.+++|+.|+|-.
T Consensus 168 ~~i~sl~v~~dgsml~a~nnkG~cyvW~l 196 (311)
T KOG0315|consen 168 TSIQSLTVMPDGSMLAAANNKGNCYVWRL 196 (311)
T ss_pred cceeeEEEcCCCcEEEEecCCccEEEEEc
Confidence 345566655554 457888999999975
No 26
>TIGR01062 parC_Gneg DNA topoisomerase IV, A subunit, proteobacterial. Operationally, topoisomerase IV is a type II topoisomerase required for the decatenation of chromosome segregation. Not every bacterium has both a topo II and a topo IV. The topo IV families of the Gram-positive bacteria and the Gram-negative bacteria appear not to represent a single clade among the type II topoisomerases, and are represented by separate models for this reason.
Probab=65.00 E-value=1.4e+02 Score=29.00 Aligned_cols=132 Identities=11% Similarity=0.087 Sum_probs=71.4
Q ss_pred CCccEEEEecC--CCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCC--eEEEEEcCC
Q psy1607 27 LDLVVTDIACG--FDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGW--HSAARTQDG 102 (241)
Q Consensus 27 ~~~~i~~i~~G--~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~--~s~~lt~~g 102 (241)
.+..+..+..+ .++.+++|++|++|.+-...-. .|.+.- .+....+....+.+|+.+.+... +.+++|+.|
T Consensus 523 egD~L~~~~~~~t~d~LllfTs~Gr~yrf~v~eIP-~GR~aG----gpV~~~L~L~~gE~Iv~~~~v~~~~~lLlaT~~G 597 (735)
T TIGR01062 523 AGDSEKAIIEGKSNQKVVFIDSTGRSYALDPDNLP-SARGQG----EPLTGKLLLPIGATITNILMYSPNQLLLMASDAG 597 (735)
T ss_pred CCCeEEEEEEecCCCEEEEEECCCeEEEEEhHhcC-cCccCC----ceeEeeecCCCCCEEEEEEEecCCcEEEEEEcCC
Confidence 34556555444 4568999999999999776543 232211 11112222235667887777543 477889999
Q ss_pred CEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEE--EEccC-CceEEEecCCcEEEEecCCCCcccC
Q psy1607 103 FLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVD--VACGS-RHTLALCDDNSLWGCGWNEYNQLSM 176 (241)
Q Consensus 103 ~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~--i~~G~-~h~~~lt~~g~v~~wG~n~~gqlG~ 176 (241)
..+..-.+.+.....+... ... +. .+..++. ...+. .+.+++|++|++..+-.++--.++.
T Consensus 598 yGKrt~lse~~~~~RaGKg--vi~------Lk-----~~d~lv~v~~v~~~dd~V~liT~~GrlLrf~v~EIp~~gR 661 (735)
T TIGR01062 598 YGFLCNFNDLIARNKAGKA--LIN------LP-----ENASVIAPLPVNGDSDMIAAITEAGRMLVFPIDDLPELSK 661 (735)
T ss_pred cEEEEEhHhccccCcCCeE--EEE------eC-----CCCEEEEEEEEcCCCCEEEEEeCCCcEEEEEHHHCCccCC
Confidence 8776654433221111110 001 11 1122322 22333 3578899999999887665444443
No 27
>KOG0293|consensus
Probab=64.86 E-value=67 Score=28.69 Aligned_cols=30 Identities=7% Similarity=0.083 Sum_probs=22.2
Q ss_pred CCCcEEEEEccCCceEEEe--cCCcEEEEecC
Q psy1607 140 QNIKVVDVACGSRHTLALC--DDNSLWGCGWN 169 (241)
Q Consensus 140 ~~~~i~~i~~G~~h~~~lt--~~g~v~~wG~n 169 (241)
...-|.+-..|.+..++.. +|++||.|=.-
T Consensus 439 ~~fiIrSCFgg~~~~fiaSGSED~kvyIWhr~ 470 (519)
T KOG0293|consen 439 GHFIIRSCFGGGNDKFIASGSEDSKVYIWHRI 470 (519)
T ss_pred cceEEEeccCCCCcceEEecCCCceEEEEEcc
Confidence 3445777777777677665 78999999764
No 28
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=63.97 E-value=8.3 Score=21.77 Aligned_cols=18 Identities=22% Similarity=0.399 Sum_probs=15.4
Q ss_pred CeEEEEEcCCCEEEEeCC
Q psy1607 93 WHSAARTQDGFLYAWGWN 110 (241)
Q Consensus 93 ~~s~~lt~~g~vy~~G~n 110 (241)
.+.++++.+|.||+.|.-
T Consensus 15 ~~~IavD~~GNiYv~G~T 32 (38)
T PF06739_consen 15 GNGIAVDSNGNIYVTGYT 32 (38)
T ss_pred EEEEEECCCCCEEEEEee
Confidence 367899999999999974
No 29
>PRK05560 DNA gyrase subunit A; Validated
Probab=63.40 E-value=1.6e+02 Score=29.01 Aligned_cols=125 Identities=11% Similarity=0.007 Sum_probs=66.2
Q ss_pred ccEEEEe--cCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCC-----CeEEEEEcC
Q psy1607 29 LVVTDIA--CGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGG-----WHSAARTQD 101 (241)
Q Consensus 29 ~~i~~i~--~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~-----~~s~~lt~~ 101 (241)
..+..+. ...++.+++|+.|++|..-...-...+..... ......+....+++|+.+.+-. ...+++|++
T Consensus 537 D~l~~~~~~~t~d~LllfTs~Grv~~l~v~~iP~~~~~~~G---~~i~~ll~L~~~E~Iv~~i~~~~~~~e~~lvlvTk~ 613 (805)
T PRK05560 537 DFVEHLFVASTHDTLLFFTNRGRVYRLKVYEIPEASRTARG---RPIVNLLPLEPGEKITAILPVREFDDDKYLFFATKN 613 (805)
T ss_pred CeeEEEEEecCCCeEEEEecCCeEEEEEhhhCcCCCcCCCC---eEHHHhcCCCCCceEEEEEeccCCCCCCEEEEEeCC
Confidence 3454443 44577889999999998876533333211111 0110112223566777776644 346889999
Q ss_pred CCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEE--ccCCceEEEecCCcEEEEecC
Q psy1607 102 GFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVA--CGSRHTLALCDDNSLWGCGWN 169 (241)
Q Consensus 102 g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~--~G~~h~~~lt~~g~v~~wG~n 169 (241)
|.+--.-.+.+.....+ . ...+.+. .+..++.+. ....+.+++|++|++|.+=..
T Consensus 614 GyiKRi~l~~~~~~~r~--G------~~~ikLk-----e~D~lv~v~~~~~~d~lll~T~~Gr~~r~~~~ 670 (805)
T PRK05560 614 GTVKKTSLSEFSNIRSN--G------IIAINLD-----EGDELIGVRLTDGDDDILLATKNGKAIRFPES 670 (805)
T ss_pred CEEEEEEhHHhhhcccC--C------ceeeccC-----CCCEEEEEEEeCCCCEEEEEECCCcEEEEEhh
Confidence 98775543322111000 0 0111111 123455443 334568999999999876443
No 30
>KOG1900|consensus
Probab=63.20 E-value=1.1e+02 Score=31.48 Aligned_cols=153 Identities=15% Similarity=0.165 Sum_probs=75.7
Q ss_pred EEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCCeEEEEEcCCCEEEEeCCCCCccCCCCCC
Q psy1607 42 LALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGWHSAARTQDGFLYAWGWNNGGQVGVEGKL 121 (241)
Q Consensus 42 ~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~~s~~lt~~g~vy~~G~n~~gqlg~~~~~ 121 (241)
+=+|.++++|-|-.++.+++-.-+........-..+...++.=+-.| .|-++|..--+|+..|-......+.....
T Consensus 93 aWiTiDn~L~lWny~~~~e~~~~d~~shtIl~V~LvkPkpgvFv~~I----qhlLvvaT~~ei~ilgV~~~~~~~~~~~f 168 (1311)
T KOG1900|consen 93 AWITIDNNLFLWNYESDNELAEYDGLSHTILKVGLVKPKPGVFVPEI----QHLLVVATPVEIVILGVSFDEFTGELSIF 168 (1311)
T ss_pred eEEEeCCeEEEEEcCCCCccccccchhhhheeeeeecCCCCcchhhh----heeEEecccceEEEEEEEeccccCccccc
Confidence 34688999999999887766543332111111111221222222233 58888888889998886443222221111
Q ss_pred CceeeCCccccccccCCCCCCcEEEEEccCCceEEEecCCcEEEEecCC-------------CCc----ccCCCCCCcee
Q psy1607 122 DLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALCDDNSLWGCGWNE-------------YNQ----LSMGCGVSQVS 184 (241)
Q Consensus 122 ~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt~~g~v~~wG~n~-------------~gq----lG~~~~~~~~~ 184 (241)
... +.+. ..+..|..|.+ |++|+||.-|+.. +++ ..+....-..-
T Consensus 169 ~~~------~~i~----~dg~~V~~I~~--------t~nGRIF~~G~dg~lyEl~Yq~~~gWf~~rc~Kiclt~s~ls~l 230 (1311)
T KOG1900|consen 169 NTS------FKIS----VDGVSVNCITY--------TENGRIFFAGRDGNLYELVYQAEDGWFGSRCRKICLTKSVLSSL 230 (1311)
T ss_pred ccc------eeee----cCCceEEEEEe--------ccCCcEEEeecCCCEEEEEEeccCchhhcccccccCchhHHHHh
Confidence 110 1111 11234444442 5666666665543 111 11111111122
Q ss_pred cceEecccccccCcCeeEEeccCCeeEEEecCC
Q psy1607 185 VMTRLSVPEHVKNCPIVKLFAEHWNSARLSVPE 217 (241)
Q Consensus 185 ~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~~ 217 (241)
.|+.+..+ .....+|.+++-+....+.++.++
T Consensus 231 vPs~~~~~-~~~~dpI~qi~ID~SR~IlY~lse 262 (1311)
T KOG1900|consen 231 VPSLLSVP-GSSKDPIRQITIDNSRNILYVLSE 262 (1311)
T ss_pred hhhhhcCC-CCCCCcceeeEeccccceeeeecc
Confidence 34433333 223346999999988777777664
No 31
>PF11725 AvrE: Pathogenicity factor; InterPro: IPR021085 This family is secreted by Gram-negative Gammaproteobacteria such as Pseudomonas syringae of tomato and Erwinia amylovora (Fire blight bacteria), amongst others. It is an essential pathogenicity factor of approximately 198 kDa. Its injection into the host-plant is dependent upon the bacterial type III or Hrp secretion system []. The family is long and carries a number of predicted functional regions, including an ERMS or endoplasmic reticulum membrane retention signal at both the C- and the N-termini, a leucine-zipper motif from residues 539-560, and a nuclear localisation signal at 1358-1361. This conserved AvrE-family of effectors is among the few that are required for full virulence of many phytopathogenic pseudomonads, erwinias and pantoeas [].
Probab=57.77 E-value=28 Score=36.33 Aligned_cols=114 Identities=16% Similarity=0.151 Sum_probs=67.0
Q ss_pred cCCccEEEEe-cCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCCeEE-EEEcCCC
Q psy1607 26 FLDLVVTDIA-CGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGWHSA-ARTQDGF 103 (241)
Q Consensus 26 ~~~~~i~~i~-~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~~s~-~lt~~g~ 103 (241)
+++..|..++ .+.++.++|+++|++-..- ..+ .+.|-....+.+ .|++|++=..|.+ ||+.+|+
T Consensus 700 l~~~~i~a~Avv~~~~fvald~qg~lt~h~--k~g-----------~p~~l~~~gl~G-~ik~l~lD~~~nL~Alt~~G~ 765 (1774)
T PF11725_consen 700 LEDRVITAFAVVNDNKFVALDDQGDLTAHQ--KPG-----------RPVPLSRPGLSG-EIKDLALDEKQNLYALTSTGE 765 (1774)
T ss_pred CCcCcceeEEEEcCCceEEeccCCcccccc--CCC-----------CCccCCCCCCCc-chhheeeccccceeEecCCCc
Confidence 3555666655 5678888888888876432 111 112222223443 7999999888876 8999999
Q ss_pred EEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCceEEEecCC
Q psy1607 104 LYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALCDDN 161 (241)
Q Consensus 104 vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt~~g 161 (241)
+|..-.-.+...-.+.... ..+.|..++ .+.++..+....+|.+.+.-++
T Consensus 766 Lf~~~k~~WQ~~~~~~~~~-~~W~~v~lP-------~~~~v~~l~~~~~~~l~~~~~d 815 (1774)
T PF11725_consen 766 LFRLPKEAWQGNAEGDQMA-AKWQKVALP-------DEQPVKSLRTNDDNHLSAQIED 815 (1774)
T ss_pred eeecCHHHhhCcccCCccc-cCceeccCC-------CCCchhhhhcCCCCceEEEecC
Confidence 9976543332221111100 112333333 2467888988888887777554
No 32
>KOG1900|consensus
Probab=56.74 E-value=2e+02 Score=29.62 Aligned_cols=83 Identities=16% Similarity=0.159 Sum_probs=47.0
Q ss_pred CCCeEEEEcCCCeEE--EEEcCCCEEEEeCCCCCccCCCCCCC-------ceeeCCccccccccCCCCCCcEEEEE----
Q psy1607 82 GLGVVQISAGGWHSA--ARTQDGFLYAWGWNNGGQVGVEGKLD-------LVVHDPHPMSWSEDFDSQNIKVVDVA---- 148 (241)
Q Consensus 82 ~~~i~~I~~G~~~s~--~lt~~g~vy~~G~n~~gqlg~~~~~~-------~~~~~p~~v~~~~~~~~~~~~i~~i~---- 148 (241)
..+|.+|+-+....+ .+++.|.|-+|-....|+-+...-.. +....-.++. ...-..|++|+
T Consensus 242 ~dpI~qi~ID~SR~IlY~lsek~~v~~Y~i~~~G~~~~r~~~~~~~~i~~qa~~~~~~~~-----~s~f~~IvsI~~l~~ 316 (1311)
T KOG1900|consen 242 KDPIRQITIDNSRNILYVLSEKGTVSAYDIGGNGLGGPRFVSVSRNYIDVQALSLKNPLD-----DSVFFSIVSISPLSA 316 (1311)
T ss_pred CCcceeeEeccccceeeeeccCceEEEEEccCCCccceeeeehhHHHHHHHhhhccccCC-----CcccceeEEecccCc
Confidence 558999999888765 77788988888655555543211000 0000000010 01223455554
Q ss_pred --ccCCceEEEecCC-cEEEEecC
Q psy1607 149 --CGSRHTLALCDDN-SLWGCGWN 169 (241)
Q Consensus 149 --~G~~h~~~lt~~g-~v~~wG~n 169 (241)
.-+-|.+++|..| ++|.-|..
T Consensus 317 ~es~~l~LvA~ts~GvRlYfs~s~ 340 (1311)
T KOG1900|consen 317 SESNDLHLVAITSTGVRLYFSTSS 340 (1311)
T ss_pred ccccceeEEEEecCCeEEEEeccC
Confidence 3345999999999 67876664
No 33
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=55.11 E-value=13 Score=21.73 Aligned_cols=18 Identities=28% Similarity=0.307 Sum_probs=11.9
Q ss_pred CCceEEEecCCcEEEEec
Q psy1607 151 SRHTLALCDDNSLWGCGW 168 (241)
Q Consensus 151 ~~h~~~lt~~g~v~~wG~ 168 (241)
..|+++...+++||.+|=
T Consensus 3 ~~h~~~~~~~~~i~v~GG 20 (49)
T PF13418_consen 3 YGHSAVSIGDNSIYVFGG 20 (49)
T ss_dssp BS-EEEEE-TTEEEEE--
T ss_pred ceEEEEEEeCCeEEEECC
Confidence 368888888899999984
No 34
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=49.57 E-value=1.2e+02 Score=26.15 Aligned_cols=15 Identities=20% Similarity=0.271 Sum_probs=11.9
Q ss_pred CceEEEecCCcEEEE
Q psy1607 152 RHTLALCDDNSLWGC 166 (241)
Q Consensus 152 ~h~~~lt~~g~v~~w 166 (241)
++.++.+.+|.||+|
T Consensus 362 ~~l~v~~~dG~l~~~ 376 (377)
T TIGR03300 362 DGLLVQTRDGDLYAF 376 (377)
T ss_pred CEEEEEeCCceEEEe
Confidence 567788888888876
No 35
>PF03785 Peptidase_C25_C: Peptidase family C25, C terminal ig-like domain; InterPro: IPR005536 This domain is found in almost all members of MEROPS peptidase family C25, (clan CD). Peptidase family C25 is a protein family found in the bacteria Porphyromonas gingivalis (Bacteroides gingivalis) a Gram-negative anaerobic bacterial species strongly associated with adult periodontitis. One of its distinguishing characteristics and putative virulence properties is the ability to agglutinate erythrocytes []. It is a highly proteolytic organism which metabolises small peptides and amino acids. Indirect evidence suggests that the proteases produced by this microorganism constitute an important virulence factor []. Protease-encoding genes have been shown to contain multiple copies of repeated nucleotide sequences. These conserved sequences have also been found in haemagglutinin genes [].; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 1CVR_A.
Probab=48.42 E-value=31 Score=23.22 Aligned_cols=36 Identities=28% Similarity=0.274 Sum_probs=26.9
Q ss_pred CCCCeEEEEcC-CCeEEEEEcCCCEEEEeCCCCCccC
Q psy1607 81 SGLGVVQISAG-GWHSAARTQDGFLYAWGWNNGGQVG 116 (241)
Q Consensus 81 ~~~~i~~I~~G-~~~s~~lt~~g~vy~~G~n~~gqlg 116 (241)
.+..=..|+|. ....++|++||.+|.-+.-..|.+-
T Consensus 14 ~~~tS~~Vs~~~~gs~ValS~dg~l~G~ai~~sG~at 50 (81)
T PF03785_consen 14 LGQTSISVSCDVPGSYVALSQDGDLYGKAIVNSGNAT 50 (81)
T ss_dssp TT-SEEEEEESSTT-EEEEEETTEEEEEEE-BTTEEE
T ss_pred ccccEEEEEecCCCcEEEEecCCEEEEEEEecCceEE
Confidence 34556799999 8888999999999998875566654
No 36
>PF03785 Peptidase_C25_C: Peptidase family C25, C terminal ig-like domain; InterPro: IPR005536 This domain is found in almost all members of MEROPS peptidase family C25, (clan CD). Peptidase family C25 is a protein family found in the bacteria Porphyromonas gingivalis (Bacteroides gingivalis) a Gram-negative anaerobic bacterial species strongly associated with adult periodontitis. One of its distinguishing characteristics and putative virulence properties is the ability to agglutinate erythrocytes []. It is a highly proteolytic organism which metabolises small peptides and amino acids. Indirect evidence suggests that the proteases produced by this microorganism constitute an important virulence factor []. Protease-encoding genes have been shown to contain multiple copies of repeated nucleotide sequences. These conserved sequences have also been found in haemagglutinin genes [].; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 1CVR_A.
Probab=45.62 E-value=40 Score=22.67 Aligned_cols=34 Identities=21% Similarity=0.372 Sum_probs=26.2
Q ss_pred CcEEEEEcc-CCceEEEecCCcEEEEecCCCCccc
Q psy1607 142 IKVVDVACG-SRHTLALCDDNSLWGCGWNEYNQLS 175 (241)
Q Consensus 142 ~~i~~i~~G-~~h~~~lt~~g~v~~wG~n~~gqlG 175 (241)
..=..|+|. ....++|++||.+|+-+--+.|++-
T Consensus 16 ~tS~~Vs~~~~gs~ValS~dg~l~G~ai~~sG~at 50 (81)
T PF03785_consen 16 QTSISVSCDVPGSYVALSQDGDLYGKAIVNSGNAT 50 (81)
T ss_dssp -SEEEEEESSTT-EEEEEETTEEEEEEE-BTTEEE
T ss_pred ccEEEEEecCCCcEEEEecCCEEEEEEEecCceEE
Confidence 456789999 8899999999999999886666653
No 37
>KOG0649|consensus
Probab=44.54 E-value=1.9e+02 Score=24.19 Aligned_cols=36 Identities=22% Similarity=0.337 Sum_probs=24.7
Q ss_pred CccEEEEecCCCEEEEEECCCcEEEEeCCCCCC-CCCC
Q psy1607 28 DLVVTDIACGFDHCLALTQEGIVFSWGNGSRGQ-LGYP 64 (241)
Q Consensus 28 ~~~i~~i~~G~~h~~~lt~~g~v~~~G~n~~gq-lg~~ 64 (241)
+-+|-+++.-..|-+. .-+|+||.|-++..-. ++..
T Consensus 62 dgpiy~~~f~d~~Lls-~gdG~V~gw~W~E~~es~~~K 98 (325)
T KOG0649|consen 62 DGPIYYLAFHDDFLLS-GGDGLVYGWEWNEEEESLATK 98 (325)
T ss_pred CCCeeeeeeehhheee-ccCceEEEeeehhhhhhccch
Confidence 3467777777666543 4469999999987765 4443
No 38
>KOG1408|consensus
Probab=42.33 E-value=3.3e+02 Score=26.48 Aligned_cols=102 Identities=14% Similarity=0.098 Sum_probs=56.5
Q ss_pred EEEcCCCeEEEEEcCCCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCceEEEecCCc-EEE
Q psy1607 87 QISAGGWHSAARTQDGFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALCDDNS-LWG 165 (241)
Q Consensus 87 ~I~~G~~~s~~lt~~g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt~~g~-v~~ 165 (241)
-++.|..|-+++. ||.|-.|..+- . ..+. .+|..++-.+.-++.+|.-++ |-.
T Consensus 137 vvSVGsQHDMIVn----v~dWr~N~~~a------s-------nkis---------s~Vsav~fsEdgSYfvT~gnrHvk~ 190 (1080)
T KOG1408|consen 137 VVSVGSQHDMIVN----VNDWRVNSSGA------S-------NKIS---------SVVSAVAFSEDGSYFVTSGNRHVKL 190 (1080)
T ss_pred EEeeccccceEEE----hhhhhhccccc------c-------cccc---------eeEEEEEEccCCceeeeeeeeeEEE
Confidence 3566888888775 77777664311 1 1121 245555555555555554332 333
Q ss_pred EecCCCCcccCCCCCCceecceEecccccccCcCeeEEeccCC----eeEEEecCCCC
Q psy1607 166 CGWNEYNQLSMGCGVSQVSVMTRLSVPEHVKNCPIVKLFAEHW----NSARLSVPEHA 219 (241)
Q Consensus 166 wG~n~~gqlG~~~~~~~~~~~~~~~~~~~~~~~~i~~vacG~~----~s~~lt~~~~~ 219 (241)
|= -|.+. ...+..+.|-+-.+.-++......+|+||.. .+++||..+++
T Consensus 191 wy----l~~~~-KykdpiPl~gRs~~lg~lr~n~f~avaCg~gicAestfait~qGhL 243 (1080)
T KOG1408|consen 191 WY----LQIQS-KYKDPIPLPGRSYFLGNLRFNEFLAVACGVGICAESTFAITAQGHL 243 (1080)
T ss_pred EE----eeccc-cccCCccccchhhhccccccchhhhhhhcCcccccceEEEecccce
Confidence 31 12221 1112233333334445566667899999987 99999998765
No 39
>TIGR01063 gyrA DNA gyrase, A subunit. This model describes the common type II DNA topoisomerase (DNA gyrase). Two apparently independently arising families, one in the Proteobacteria and one in Gram-positive lineages, are both designated toposisomerase IV.
Probab=42.16 E-value=3.5e+02 Score=26.65 Aligned_cols=129 Identities=9% Similarity=0.022 Sum_probs=66.6
Q ss_pred ccEEEEe--cCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcC-----CCeEEEEEcC
Q psy1607 29 LVVTDIA--CGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAG-----GWHSAARTQD 101 (241)
Q Consensus 29 ~~i~~i~--~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G-----~~~s~~lt~~ 101 (241)
..+..+. ...++.+++|+.|++|..-...-...+..... ......+....+++|+.+.+- ....+++|++
T Consensus 535 D~l~~~~~~~t~d~LllfTs~Grv~~l~~~~IP~~~r~~~G---~~i~~ll~L~~~E~Iv~~i~~~~~~~~~~lvliT~~ 611 (800)
T TIGR01063 535 DFIEQLLVASTHDYLLFFTNRGKVYWLKVYQIPEASRTAKG---KPIVNLLPLQPDERITAILSVKEFDDGLYLFFATKN 611 (800)
T ss_pred CeeEEEEEecCCCeEEEEeCCCcEEEEEhhhCcCCCcCCCC---cCHHHhccCCCCCeEEEEEEeccCCCCCEEEEEeCC
Confidence 3454443 44577889999999999854333332221111 011111222355677766652 2356788999
Q ss_pred CCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEE--ccCCceEEEecCCcEEEEecCCCCc
Q psy1607 102 GFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVA--CGSRHTLALCDDNSLWGCGWNEYNQ 173 (241)
Q Consensus 102 g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~--~G~~h~~~lt~~g~v~~wG~n~~gq 173 (241)
|.+--.-.+.+..... .. -..+.+. .+..++.+. ....+.+++|++|++|.+=..+--.
T Consensus 612 GyiKRi~l~~~~~~~r---~G-----~~aiklk-----e~D~lv~v~~~~~~d~lll~Ts~Gr~~r~~v~eIp~ 672 (800)
T TIGR01063 612 GVVKKTSLTEFSNIRS---NG-----IIAIKLD-----DGDELISVRLTSGDDEVMLGSKNGKAVRFPEEDVRP 672 (800)
T ss_pred CEEEEEEhHHhhhhcc---CC-----cccccCC-----CCCEEEEEEEeCCCCEEEEEECCCcEEEEEhhhcCC
Confidence 9887654433211100 00 0011111 123455443 3345689999999999875544333
No 40
>PF11399 DUF3192: Protein of unknown function (DUF3192); InterPro: IPR021534 Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed.
Probab=40.08 E-value=21 Score=25.16 Aligned_cols=25 Identities=24% Similarity=0.311 Sum_probs=20.0
Q ss_pred ccCCceEEEecCCcEEEEecCCCCc
Q psy1607 149 CGSRHTLALCDDNSLWGCGWNEYNQ 173 (241)
Q Consensus 149 ~G~~h~~~lt~~g~v~~wG~n~~gq 173 (241)
.-+..+-++..+|+|.+||+..+.|
T Consensus 77 TkDECTplvF~n~~LvgWG~~ay~~ 101 (102)
T PF11399_consen 77 TKDECTPLVFKNGKLVGWGDDAYSQ 101 (102)
T ss_pred cCCceEEEEEECCEEEEEcHHhhhc
Confidence 3456778889999999999977654
No 41
>KOG1034|consensus
Probab=39.50 E-value=94 Score=26.93 Aligned_cols=58 Identities=21% Similarity=0.202 Sum_probs=38.4
Q ss_pred EEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCC--eEEEEEcCCCEEEEeC
Q psy1607 42 LALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGW--HSAARTQDGFLYAWGW 109 (241)
Q Consensus 42 ~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~--~s~~lt~~g~vy~~G~ 109 (241)
++..+.|+||.|-..... +...++......+..|.+.+-... ..+++.+++.||.|-+
T Consensus 323 a~gnq~g~v~vwdL~~~e----------p~~~ttl~~s~~~~tVRQ~sfS~dgs~lv~vcdd~~Vwrwdr 382 (385)
T KOG1034|consen 323 ALGNQSGKVYVWDLDNNE----------PPKCTTLTHSKSGSTVRQTSFSRDGSILVLVCDDGTVWRWDR 382 (385)
T ss_pred hhccCCCcEEEEECCCCC----------CccCceEEeccccceeeeeeecccCcEEEEEeCCCcEEEEEe
Confidence 344678999999853221 224455555666677777765544 4567788999999964
No 42
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=39.19 E-value=57 Score=16.71 Aligned_cols=19 Identities=32% Similarity=0.324 Sum_probs=14.7
Q ss_pred CeEEEEEcCCCEEEEeCCC
Q psy1607 93 WHSAARTQDGFLYAWGWNN 111 (241)
Q Consensus 93 ~~s~~lt~~g~vy~~G~n~ 111 (241)
.|.++++.+|+||+.-.+.
T Consensus 4 P~gvav~~~g~i~VaD~~n 22 (28)
T PF01436_consen 4 PHGVAVDSDGNIYVADSGN 22 (28)
T ss_dssp EEEEEEETTSEEEEEECCC
T ss_pred CcEEEEeCCCCEEEEECCC
Confidence 4678888999999876543
No 43
>PHA02146 hypothetical protein
Probab=39.13 E-value=39 Score=21.93 Aligned_cols=33 Identities=24% Similarity=0.334 Sum_probs=24.7
Q ss_pred EEcCCCeEE-EEEcCCCEEEEeCCCCCccCCCCC
Q psy1607 88 ISAGGWHSA-ARTQDGFLYAWGWNNGGQVGVEGK 120 (241)
Q Consensus 88 I~~G~~~s~-~lt~~g~vy~~G~n~~gqlg~~~~ 120 (241)
|.-|..|++ -|+++|.+|..|..-+|.-|....
T Consensus 23 i~ng~ef~v~~~d~dgd~~s~~iswng~dg~s~~ 56 (86)
T PHA02146 23 ITNGTEFTVTNIDDDGDLYTYDISWNGRDGKSAF 56 (86)
T ss_pred cCCCcEEEeeccccCCCeEeecccccCccCCccc
Confidence 445666666 688999999999888777775443
No 44
>PLN03215 ascorbic acid mannose pathway regulator 1; Provisional
Probab=39.08 E-value=1.4e+02 Score=26.43 Aligned_cols=62 Identities=16% Similarity=0.265 Sum_probs=43.2
Q ss_pred cEEEEecCCCE---EEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCCeEEEEEcCCCEEE
Q psy1607 30 VVTDIACGFDH---CLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGWHSAARTQDGFLYA 106 (241)
Q Consensus 30 ~i~~i~~G~~h---~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~~s~~lt~~g~vy~ 106 (241)
.+..+.++..| .+++...|++.-|..+.+. .++ .....+.+|.-=....+|++..|+||.
T Consensus 161 ~~~~~~~~~~~~~~vl~i~~~g~l~~w~~~~Wt----------------~l~-~~~~~~~DIi~~kGkfYAvD~~G~l~~ 223 (373)
T PLN03215 161 ALVKVKEGDNHRDGVLGIGRDGKINYWDGNVLK----------------ALK-QMGYHFSDIIVHKGQTYALDSIGIVYW 223 (373)
T ss_pred EEEEeecCCCcceEEEEEeecCcEeeecCCeee----------------Ecc-CCCceeeEEEEECCEEEEEcCCCeEEE
Confidence 44456777775 6777788999888643222 222 244568888887778888888899998
Q ss_pred Ee
Q psy1607 107 WG 108 (241)
Q Consensus 107 ~G 108 (241)
+.
T Consensus 224 i~ 225 (373)
T PLN03215 224 IN 225 (373)
T ss_pred Ee
Confidence 86
No 45
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=37.44 E-value=2.6e+02 Score=23.86 Aligned_cols=108 Identities=20% Similarity=0.186 Sum_probs=59.0
Q ss_pred CCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcC---CCeEEEEEcCCCEEEEeCCCCC
Q psy1607 37 GFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAG---GWHSAARTQDGFLYAWGWNNGG 113 (241)
Q Consensus 37 G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G---~~~s~~lt~~g~vy~~G~n~~g 113 (241)
+.-|.++...+|.||--+. ..+-+|+-+.. .+ +++.+..| .-|.+++..||..|+.-... .
T Consensus 62 ~ap~dvapapdG~VWft~q-g~gaiGhLdP~-------------tG-ev~~ypLg~Ga~Phgiv~gpdg~~Witd~~~-a 125 (353)
T COG4257 62 SAPFDVAPAPDGAVWFTAQ-GTGAIGHLDPA-------------TG-EVETYPLGSGASPHGIVVGPDGSAWITDTGL-A 125 (353)
T ss_pred CCccccccCCCCceEEecC-ccccceecCCC-------------CC-ceEEEecCCCCCCceEEECCCCCeeEecCcc-e
Confidence 4467788899999995433 33444443222 11 23333332 34888888999999885432 1
Q ss_pred ccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCceEEEecCCcEEEEecCC-CCcc
Q psy1607 114 QVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALCDDNSLWGCGWNE-YNQL 174 (241)
Q Consensus 114 qlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt~~g~v~~wG~n~-~gql 174 (241)
..-+..+.-.+..+|.+.+ -+-++-.+.+++..|+||..|.+. ||.|
T Consensus 126 I~R~dpkt~evt~f~lp~~--------------~a~~nlet~vfD~~G~lWFt~q~G~yGrL 173 (353)
T COG4257 126 IGRLDPKTLEVTRFPLPLE--------------HADANLETAVFDPWGNLWFTGQIGAYGRL 173 (353)
T ss_pred eEEecCcccceEEeecccc--------------cCCCcccceeeCCCccEEEeeccccceec
Confidence 1111111222222232222 222345678889999999998843 4554
No 46
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=36.86 E-value=1.8e+02 Score=24.75 Aligned_cols=70 Identities=16% Similarity=0.141 Sum_probs=40.6
Q ss_pred cEEEEecC---CCEEEEEECCCcEEEEeCCC-CCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCCeEEEEEcCCCEE
Q psy1607 30 VVTDIACG---FDHCLALTQEGIVFSWGNGS-RGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGWHSAARTQDGFLY 105 (241)
Q Consensus 30 ~i~~i~~G---~~h~~~lt~~g~v~~~G~n~-~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~~s~~lt~~g~vy 105 (241)
+++.+..| .-|.+++..+|..|.+-... -++++....+ ++.+.- -.+-+-+.--+.+++..|.||
T Consensus 94 ev~~ypLg~Ga~Phgiv~gpdg~~Witd~~~aI~R~dpkt~e---------vt~f~l--p~~~a~~nlet~vfD~~G~lW 162 (353)
T COG4257 94 EVETYPLGSGASPHGIVVGPDGSAWITDTGLAIGRLDPKTLE---------VTRFPL--PLEHADANLETAVFDPWGNLW 162 (353)
T ss_pred ceEEEecCCCCCCceEEECCCCCeeEecCcceeEEecCcccc---------eEEeec--ccccCCCcccceeeCCCccEE
Confidence 56666555 36889999999999765431 1111111111 111110 123344555678899999999
Q ss_pred EEeCC
Q psy1607 106 AWGWN 110 (241)
Q Consensus 106 ~~G~n 110 (241)
.-|.+
T Consensus 163 Ft~q~ 167 (353)
T COG4257 163 FTGQI 167 (353)
T ss_pred Eeecc
Confidence 99873
No 47
>PF07312 DUF1459: Protein of unknown function (DUF1459); InterPro: IPR009924 This family consists of several hypothetical Caenorhabditis elegans proteins of around 85 residues in length. The function of this family is unknown.
Probab=35.96 E-value=24 Score=23.52 Aligned_cols=12 Identities=42% Similarity=0.822 Sum_probs=9.4
Q ss_pred CE-EEEeCCCCCc
Q psy1607 103 FL-YAWGWNNGGQ 114 (241)
Q Consensus 103 ~v-y~~G~n~~gq 114 (241)
.+ |.||+|+..+
T Consensus 57 sv~waWGSNKnk~ 69 (84)
T PF07312_consen 57 SVYWAWGSNKNKQ 69 (84)
T ss_pred ceeeeeccCCCCC
Confidence 46 9999998654
No 48
>KOG0291|consensus
Probab=32.81 E-value=4.8e+02 Score=25.51 Aligned_cols=99 Identities=14% Similarity=0.013 Sum_probs=49.4
Q ss_pred EEEEcCCCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCceEEEe--cCCcEEEEecCCCCc
Q psy1607 96 AARTQDGFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALC--DDNSLWGCGWNEYNQ 173 (241)
Q Consensus 96 ~~lt~~g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt--~~g~v~~wG~n~~gq 173 (241)
+++...|.=.+.|..+-|||.+-.......-..+.-+. .++..++-..+-.++.| +||+|-+|-...
T Consensus 313 ~~~N~tGDWiA~g~~klgQLlVweWqsEsYVlKQQgH~--------~~i~~l~YSpDgq~iaTG~eDgKVKvWn~~S--- 381 (893)
T KOG0291|consen 313 VSFNSTGDWIAFGCSKLGQLLVWEWQSESYVLKQQGHS--------DRITSLAYSPDGQLIATGAEDGKVKVWNTQS--- 381 (893)
T ss_pred EEecccCCEEEEcCCccceEEEEEeeccceeeeccccc--------cceeeEEECCCCcEEEeccCCCcEEEEeccC---
Confidence 44444566566666666666654333221111111111 25666655555444444 678888885422
Q ss_pred ccCCCCCCceecceEecccccccCcCeeEEeccCCeeEEEecC
Q psy1607 174 LSMGCGVSQVSVMTRLSVPEHVKNCPIVKLFAEHWNSARLSVP 216 (241)
Q Consensus 174 lG~~~~~~~~~~~~~~~~~~~~~~~~i~~vacG~~~s~~lt~~ 216 (241)
..-.+.+.++..+...++++.-++..+-.+.+
T Consensus 382 -----------gfC~vTFteHts~Vt~v~f~~~g~~llssSLD 413 (893)
T KOG0291|consen 382 -----------GFCFVTFTEHTSGVTAVQFTARGNVLLSSSLD 413 (893)
T ss_pred -----------ceEEEEeccCCCceEEEEEEecCCEEEEeecC
Confidence 12223334555555666666666655544444
No 49
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=31.70 E-value=47 Score=19.37 Aligned_cols=17 Identities=18% Similarity=0.245 Sum_probs=12.1
Q ss_pred CceEEEecCCcEEEEecC
Q psy1607 152 RHTLALCDDNSLWGCGWN 169 (241)
Q Consensus 152 ~h~~~lt~~g~v~~wG~n 169 (241)
.|++++ -+++||++|=-
T Consensus 4 ~hs~~~-~~~kiyv~GG~ 20 (49)
T PF07646_consen 4 GHSAVV-LDGKIYVFGGY 20 (49)
T ss_pred ceEEEE-ECCEEEEECCc
Confidence 455554 48889999864
No 50
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=30.91 E-value=3.7e+02 Score=23.58 Aligned_cols=68 Identities=25% Similarity=0.289 Sum_probs=36.4
Q ss_pred CCccEEEEecCCC-EE-EEEECCCc-EEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCC-eEEEEEcCC
Q psy1607 27 LDLVVTDIACGFD-HC-LALTQEGI-VFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGW-HSAARTQDG 102 (241)
Q Consensus 27 ~~~~i~~i~~G~~-h~-~~lt~~g~-v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~-~s~~lt~~g 102 (241)
....+..|..|.. |. ++.+.+|+ +|..+. .|. -..++......+..|..|.. +.++++.||
T Consensus 25 t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~r--dg~-------------vsviD~~~~~~v~~i~~G~~~~~i~~s~DG 89 (369)
T PF02239_consen 25 TNKVVARIPTGGAPHAGLKFSPDGRYLYVANR--DGT-------------VSVIDLATGKVVATIKVGGNPRGIAVSPDG 89 (369)
T ss_dssp T-SEEEEEE-STTEEEEEE-TT-SSEEEEEET--TSE-------------EEEEETTSSSEEEEEE-SSEEEEEEE--TT
T ss_pred CCeEEEEEcCCCCceeEEEecCCCCEEEEEcC--CCe-------------EEEEECCcccEEEEEecCCCcceEEEcCCC
Confidence 3445777887754 66 44577776 777652 221 13344445556778887766 567888888
Q ss_pred CEEEEeC
Q psy1607 103 FLYAWGW 109 (241)
Q Consensus 103 ~vy~~G~ 109 (241)
+...-+.
T Consensus 90 ~~~~v~n 96 (369)
T PF02239_consen 90 KYVYVAN 96 (369)
T ss_dssp TEEEEEE
T ss_pred CEEEEEe
Confidence 7554443
No 51
>PF13854 Kelch_5: Kelch motif
Probab=30.53 E-value=55 Score=18.50 Aligned_cols=18 Identities=28% Similarity=0.327 Sum_probs=13.0
Q ss_pred CeEEEEEcCCCEEEEeCCC
Q psy1607 93 WHSAARTQDGFLYAWGWNN 111 (241)
Q Consensus 93 ~~s~~lt~~g~vy~~G~n~ 111 (241)
.|++++. ++++|++|-..
T Consensus 7 ~hs~~~~-~~~iyi~GG~~ 24 (42)
T PF13854_consen 7 GHSAVVV-GNNIYIFGGYS 24 (42)
T ss_pred ceEEEEE-CCEEEEEcCcc
Confidence 5666665 48899998644
No 52
>PF08887 GAD-like: GAD-like domain; InterPro: IPR014983 This domain is functionally uncharacterised, but it appears to be distantly related to the GAD domain IPR004115 from INTERPRO.
Probab=28.84 E-value=55 Score=23.36 Aligned_cols=21 Identities=19% Similarity=0.090 Sum_probs=18.3
Q ss_pred cCCceEEEecCCcEEEEecCC
Q psy1607 150 GSRHTLALCDDNSLWGCGWNE 170 (241)
Q Consensus 150 G~~h~~~lt~~g~v~~wG~n~ 170 (241)
-..|.+++|.-|.||.|+.+.
T Consensus 78 ~~~~~ia~tAFGdl~~w~e~~ 98 (109)
T PF08887_consen 78 DNYIPIARTAFGDLYVWGENT 98 (109)
T ss_pred ceEEEEEEcccccEEEEEcCC
Confidence 457999999999999999864
No 53
>PLN03215 ascorbic acid mannose pathway regulator 1; Provisional
Probab=28.46 E-value=2.7e+02 Score=24.62 Aligned_cols=61 Identities=18% Similarity=0.190 Sum_probs=35.6
Q ss_pred eEEEEcCCCe---EEEEEcCCCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEEEEccCCceEEEecCC
Q psy1607 85 VVQISAGGWH---SAARTQDGFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVDVACGSRHTLALCDDN 161 (241)
Q Consensus 85 i~~I~~G~~~---s~~lt~~g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~~~lt~~g 161 (241)
+..+++|..+ .+++..+|++..|..+....+.. ....+.+|.--....+|++..|
T Consensus 162 ~~~~~~~~~~~~~vl~i~~~g~l~~w~~~~Wt~l~~----------------------~~~~~~DIi~~kGkfYAvD~~G 219 (373)
T PLN03215 162 LVKVKEGDNHRDGVLGIGRDGKINYWDGNVLKALKQ----------------------MGYHFSDIIVHKGQTYALDSIG 219 (373)
T ss_pred EEEeecCCCcceEEEEEeecCcEeeecCCeeeEccC----------------------CCceeeEEEEECCEEEEEcCCC
Confidence 3345677775 66777889998886443322210 1235666665555666666666
Q ss_pred cEEEEe
Q psy1607 162 SLWGCG 167 (241)
Q Consensus 162 ~v~~wG 167 (241)
+||.+-
T Consensus 220 ~l~~i~ 225 (373)
T PLN03215 220 IVYWIN 225 (373)
T ss_pred eEEEEe
Confidence 666654
No 54
>TIGR01061 parC_Gpos DNA topoisomerase IV, A subunit, Gram-positive. Operationally, topoisomerase IV is a type II topoisomerase required for the decatenation of chromosome segregation. Not every bacterium has both a topo II and a topo IV. The topo IV families of the Gram-positive bacteria and the Gram-negative bacteria appear not to represent a single clade among the type II topoisomerases, and are represented by separate models for this reason.
Probab=26.56 E-value=6.1e+02 Score=24.75 Aligned_cols=134 Identities=11% Similarity=0.013 Sum_probs=70.1
Q ss_pred CccEEEE--ecCCCEEEEEECCCcEEEEeCCCCCCCCCCCCCCCCCccCeEeeecCCCCeEEEEcCCC------eEEEEE
Q psy1607 28 DLVVTDI--ACGFDHCLALTQEGIVFSWGNGSRGQLGYPSSGTDANMEPREIRALSGLGVVQISAGGW------HSAART 99 (241)
Q Consensus 28 ~~~i~~i--~~G~~h~~~lt~~g~v~~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~i~~I~~G~~------~s~~lt 99 (241)
+..++.+ ..+.++.+++|++|++|.+-...-...+..... ++....+..-.+.+|+.+.+-.. ..+++|
T Consensus 530 ~Delv~v~~~~~~d~IllfT~~Gkv~r~~~~eIp~~gr~a~G---v~Ivk~i~L~~~D~Iv~~~~v~~~~~~~~~ll~vT 606 (738)
T TIGR01061 530 DDILFAQTIANTTDQILIFTSLGNIINIPVHKLADIRWKDLG---EHLSNKITFDENETIVFVGTMNEFDVDQPILVLAS 606 (738)
T ss_pred CCEEEEEEEecCCCEEEEEeCCCcEEEEEHHHCcCCCCCCCC---cChhhcccCCCCCeEEEEEEeccccCCCcEEEEEe
Confidence 3444443 445667899999999999887655555443322 11111122224556766665432 478999
Q ss_pred cCCCEEEEeCCCCCccCCCCCCCceeeCCccccccccCCCCCCcEEE--EEccCCceEEEecCCcEEEEecCCCCcccC
Q psy1607 100 QDGFLYAWGWNNGGQVGVEGKLDLVVHDPHPMSWSEDFDSQNIKVVD--VACGSRHTLALCDDNSLWGCGWNEYNQLSM 176 (241)
Q Consensus 100 ~~g~vy~~G~n~~gqlg~~~~~~~~~~~p~~v~~~~~~~~~~~~i~~--i~~G~~h~~~lt~~g~v~~wG~n~~gqlG~ 176 (241)
++|.+--.-...+... .+..... .+.+. .+..++. +..+....+++|.+|.+.-+=.++.-..|.
T Consensus 607 ~~G~~KRt~l~e~~~~-r~~kGv~------~ikLk-----~~d~lV~a~~v~~~d~IlliT~~G~~iRf~~~dI~~~gR 673 (738)
T TIGR01061 607 KLGMVKRIELTELNIK-RNSKATL------CIKLK-----DKDHLISAFLQQKDKLICLVSDLGYALVFHTNEIPVVGA 673 (738)
T ss_pred cCCeEEEeEHHHhccc-cCCCceE------EEecc-----CCCcEEEEEEeCCCCEEEEEECCCeEEEEEHHHCCcccc
Confidence 9997776543332211 0011110 01111 0112221 224566789999999887765554444443
No 55
>PF08887 GAD-like: GAD-like domain; InterPro: IPR014983 This domain is functionally uncharacterised, but it appears to be distantly related to the GAD domain IPR004115 from INTERPRO.
Probab=26.56 E-value=68 Score=22.91 Aligned_cols=21 Identities=24% Similarity=0.360 Sum_probs=18.1
Q ss_pred CCCEEEEEECCCcEEEEeCCC
Q psy1607 37 GFDHCLALTQEGIVFSWGNGS 57 (241)
Q Consensus 37 G~~h~~~lt~~g~v~~~G~n~ 57 (241)
-..|.++.|.-|+||.|+.+.
T Consensus 78 ~~~~~ia~tAFGdl~~w~e~~ 98 (109)
T PF08887_consen 78 DNYIPIARTAFGDLYVWGENT 98 (109)
T ss_pred ceEEEEEEcccccEEEEEcCC
Confidence 357999999999999999753
No 56
>PRK02529 petN cytochrome b6-f complex subunit PetN; Provisional
Probab=26.51 E-value=67 Score=17.47 Aligned_cols=13 Identities=15% Similarity=0.130 Sum_probs=10.1
Q ss_pred EEEEecCCCCccc
Q psy1607 163 LWGCGWNEYNQLS 175 (241)
Q Consensus 163 v~~wG~n~~gqlG 175 (241)
+.+||+|..|.+.
T Consensus 20 lVVWGRnG~g~~~ 32 (33)
T PRK02529 20 MVVWGRNGDGSID 32 (33)
T ss_pred EEEEecCCccccC
Confidence 6789999877653
No 57
>PF09081 DUF1921: Domain of unknown function (DUF1921); InterPro: IPR015165 This domain, which is found in a set of prokaryotic amylases, has no known function []. ; PDB: 1QI5_A 1JDC_A 2AMG_A 1QPK_A 1JDD_A 1QI4_A 1JDA_A 1GCY_A 1QI3_A.
Probab=24.14 E-value=48 Score=19.74 Aligned_cols=21 Identities=24% Similarity=0.118 Sum_probs=12.5
Q ss_pred EEEccCCceEEEecCCcEEEE
Q psy1607 146 DVACGSRHTLALCDDNSLWGC 166 (241)
Q Consensus 146 ~i~~G~~h~~~lt~~g~v~~w 166 (241)
+|+.|...-++=.++|+|-.|
T Consensus 30 qVasGsfs~a~N~dnG~vRiW 50 (51)
T PF09081_consen 30 QVASGSFSQAVNEDNGQVRIW 50 (51)
T ss_dssp GT-SS--EEEEEETTTTEEEE
T ss_pred cccccchHhhhhccCCcEEee
Confidence 456675555555667888888
No 58
>PF07494 Reg_prop: Two component regulator propeller; InterPro: IPR011110 A large group of two component regulator proteins appear to have the same N-terminal structure of 14 tandem repeats. These repeats show homology to members of IPR002372 from INTERPRO and IPR001680 from INTERPRO indicating that they are likely to form a beta-propeller. This family has been built with artificially high cut-offs in order to avoid overlaps with other beta-propeller families. The fourteen repeats are likely to form two propellers; it is not clear if these structures are likely to recruit other proteins or interact with DNA.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=24.07 E-value=1.1e+02 Score=15.07 Aligned_cols=15 Identities=27% Similarity=0.180 Sum_probs=10.7
Q ss_pred eEEEEEcCCCEEEEe
Q psy1607 94 HSAARTQDGFLYAWG 108 (241)
Q Consensus 94 ~s~~lt~~g~vy~~G 108 (241)
++++.+++|+||+--
T Consensus 8 ~~i~~D~~G~lWigT 22 (24)
T PF07494_consen 8 YSIYEDSDGNLWIGT 22 (24)
T ss_dssp EEEEE-TTSCEEEEE
T ss_pred EEEEEcCCcCEEEEe
Confidence 567888899999743
No 59
>PF13938 DUF4213: Domain of unknown function (DUF4213); PDB: 3NPG_A 3L5O_B.
Probab=23.67 E-value=1.1e+02 Score=20.48 Aligned_cols=23 Identities=22% Similarity=0.341 Sum_probs=19.0
Q ss_pred cCCccEEEEecCCCEEEEEECCC
Q psy1607 26 FLDLVVTDIACGFDHCLALTQEG 48 (241)
Q Consensus 26 ~~~~~i~~i~~G~~h~~~lt~~g 48 (241)
.++.+|+++..|..++.+..++|
T Consensus 9 ~~~~~V~~~~iG~~~t~V~~~~G 31 (87)
T PF13938_consen 9 APDIRVEDVCIGLHWTAVELSDG 31 (87)
T ss_dssp CGC-EEEEEEEBSSEEEEEETT-
T ss_pred CCCCEEEEEEEcCCEEEEEeCCC
Confidence 34788999999999999999998
No 60
>KOG0289|consensus
Probab=23.57 E-value=4.1e+02 Score=24.12 Aligned_cols=65 Identities=22% Similarity=0.192 Sum_probs=38.0
Q ss_pred CeEEEEEcCCCEEEEeCCCCCccCCCCCCC--ceeeCCccccccccCCCCCCcEEEEEccCCce--EEEecCCcEEEEec
Q psy1607 93 WHSAARTQDGFLYAWGWNNGGQVGVEGKLD--LVVHDPHPMSWSEDFDSQNIKVVDVACGSRHT--LALCDDNSLWGCGW 168 (241)
Q Consensus 93 ~~s~~lt~~g~vy~~G~n~~gqlg~~~~~~--~~~~~p~~v~~~~~~~~~~~~i~~i~~G~~h~--~~lt~~g~v~~wG~ 168 (241)
++++++..||.+|.-|.-. +++-+-+-.. ....+|. -..+|+.|+-+++-- +.-++|+.|..|--
T Consensus 350 ~ts~~fHpDgLifgtgt~d-~~vkiwdlks~~~~a~Fpg----------ht~~vk~i~FsENGY~Lat~add~~V~lwDL 418 (506)
T KOG0289|consen 350 YTSAAFHPDGLIFGTGTPD-GVVKIWDLKSQTNVAKFPG----------HTGPVKAISFSENGYWLATAADDGSVKLWDL 418 (506)
T ss_pred eEEeeEcCCceEEeccCCC-ceEEEEEcCCccccccCCC----------CCCceeEEEeccCceEEEEEecCCeEEEEEe
Confidence 6788888899999888643 4442211111 1111222 234788887776544 34446788999954
No 61
>PF03984 DUF346: Repeat of unknown function (DUF346) ; InterPro: IPR007132 This repeat was found as seven tandem copies in one protein. It is predicted to be composed of beta-strands. Thus it is likely that it forms a beta-propeller structure. It is found in association with BNR repeats, which also form a beta-propeller.
Probab=22.34 E-value=1.7e+02 Score=16.82 Aligned_cols=26 Identities=27% Similarity=0.445 Sum_probs=20.2
Q ss_pred CCCEEEEEECCCcEEEEeCCCCCCCC
Q psy1607 37 GFDHCLALTQEGIVFSWGNGSRGQLG 62 (241)
Q Consensus 37 G~~h~~~lt~~g~v~~~G~n~~gqlg 62 (241)
+-.|+++.+.+|.|+-|-.+.-+.+.
T Consensus 1 ~QqHVF~r~~dg~L~H~ww~~~~~~~ 26 (40)
T PF03984_consen 1 GQQHVFARGTDGTLHHWWWDSTGGPS 26 (40)
T ss_pred CcceEEEECCCCCEEEEEECCCCCCc
Confidence 35799999999999988876555443
No 62
>PF13964 Kelch_6: Kelch motif
Probab=21.64 E-value=80 Score=18.35 Aligned_cols=19 Identities=32% Similarity=0.469 Sum_probs=12.7
Q ss_pred CeEEEEEcCCCEEEEeCCCC
Q psy1607 93 WHSAARTQDGFLYAWGWNNG 112 (241)
Q Consensus 93 ~~s~~lt~~g~vy~~G~n~~ 112 (241)
.|+++ .-+++||++|-...
T Consensus 4 ~~s~v-~~~~~iyv~GG~~~ 22 (50)
T PF13964_consen 4 GHSAV-VVGGKIYVFGGYDN 22 (50)
T ss_pred cCEEE-EECCEEEEECCCCC
Confidence 45554 45779999986543
No 63
>PF06462 Hyd_WA: Propeller; InterPro: IPR006624 Tectonins I and II are two dominant proteins in the nuclei and nuclear matrix from plasmodia of Physarum polycephalum (Slime mold) which encode 217 and 353 amino acids, respectively. Tectonin I is homologous to the C-terminal two-thirds of tectonin II. Both proteins contain six tandem repeats that are each 33-37 amino acids in length and define a new consensus sequence. Homologous repeats are found in L-6, a bacterial lipopolysaccharide-binding lectin from horseshoe crab hemocytes. The repetitive sequences of the tectonins and L-6 are reminiscent of the WD repeats of the beta-subunit of G proteins, suggesting that they form beta-propeller domains. The tectonins may be lectins that function as part of a transmembrane signalling complex during phagocytosis [].
Probab=21.32 E-value=1.1e+02 Score=16.45 Aligned_cols=13 Identities=31% Similarity=0.529 Sum_probs=10.3
Q ss_pred EEEEEECCCcEEE
Q psy1607 40 HCLALTQEGIVFS 52 (241)
Q Consensus 40 h~~~lt~~g~v~~ 52 (241)
+..+++.+|+||.
T Consensus 2 ~VWav~~~G~v~~ 14 (32)
T PF06462_consen 2 QVWAVTSDGSVYF 14 (32)
T ss_pred eEEEEcCCCCEEE
Confidence 5678888999884
No 64
>PHA02713 hypothetical protein; Provisional
Probab=20.79 E-value=6.2e+02 Score=23.57 Aligned_cols=20 Identities=5% Similarity=0.054 Sum_probs=12.7
Q ss_pred CCCEEEEEECCCcEEEEeCC
Q psy1607 37 GFDHCLALTQEGIVFSWGNG 56 (241)
Q Consensus 37 G~~h~~~lt~~g~v~~~G~n 56 (241)
.+.+..+..-+|+||.+|-.
T Consensus 341 ~R~~~~~~~~~g~IYviGG~ 360 (557)
T PHA02713 341 NRCRFSLAVIDDTIYAIGGQ 360 (557)
T ss_pred hhhceeEEEECCEEEEECCc
Confidence 33333445558899999953
No 65
>KOG1408|consensus
Probab=20.62 E-value=3e+02 Score=26.77 Aligned_cols=24 Identities=25% Similarity=0.390 Sum_probs=18.7
Q ss_pred CCCCcEEEEEccCC----ceEEEecCCc
Q psy1607 139 SQNIKVVDVACGSR----HTLALCDDNS 162 (241)
Q Consensus 139 ~~~~~i~~i~~G~~----h~~~lt~~g~ 162 (241)
++...+..|+||.. .+++||..|.
T Consensus 215 lr~n~f~avaCg~gicAestfait~qGh 242 (1080)
T KOG1408|consen 215 LRFNEFLAVACGVGICAESTFAITAQGH 242 (1080)
T ss_pred cccchhhhhhhcCcccccceEEEecccc
Confidence 35557889999987 8999998553
No 66
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=20.01 E-value=5.8e+02 Score=22.13 Aligned_cols=17 Identities=0% Similarity=-0.011 Sum_probs=11.6
Q ss_pred EEEEEECCCcEEEEeCC
Q psy1607 40 HCLALTQEGIVFSWGNG 56 (241)
Q Consensus 40 h~~~lt~~g~v~~~G~n 56 (241)
+..++..+++||..|-.
T Consensus 214 ~~a~v~~~~~iYv~GG~ 230 (376)
T PRK14131 214 GSAVVIKGNKLWLINGE 230 (376)
T ss_pred cceEEEECCEEEEEeee
Confidence 43445557999999953
Done!