Query psy7340
Match_columns 440
No_of_seqs 481 out of 2021
Neff 5.1
Searched_HMMs 46136
Date Sat Aug 17 00:16:21 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7340.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7340hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3431|consensus 100.0 1.9E-28 4.1E-33 212.8 9.7 96 6-102 21-117 (129)
2 PRK04239 hypothetical protein; 99.9 9E-27 2E-31 201.2 10.0 76 15-90 26-101 (110)
3 PF01984 dsDNA_bind: Double-st 99.9 7.2E-28 1.6E-32 208.0 1.0 90 11-100 17-106 (107)
4 COG2118 DNA-binding protein [G 99.9 3.1E-25 6.7E-30 191.1 10.6 81 20-100 34-114 (116)
5 PRK04239 hypothetical protein; 99.9 9.4E-23 2E-27 176.3 7.1 95 200-296 7-101 (110)
6 KOG3431|consensus 99.8 9.1E-22 2E-26 171.3 5.9 98 200-298 9-106 (129)
7 PF01984 dsDNA_bind: Double-st 99.8 1.2E-22 2.6E-27 175.5 -0.8 97 200-298 2-98 (107)
8 COG2118 DNA-binding protein [G 99.8 4.4E-21 9.6E-26 165.5 7.2 95 200-297 10-105 (116)
9 PF13715 DUF4480: Domain of un 99.6 1.7E-14 3.7E-19 118.0 11.1 86 316-403 1-87 (88)
10 cd03863 M14_CPD_II The second 99.5 6.9E-14 1.5E-18 144.4 11.2 94 299-392 281-375 (375)
11 cd03864 M14_CPN Peptidase M14 99.4 7.8E-13 1.7E-17 137.3 11.0 94 298-391 299-392 (392)
12 cd06245 M14_CPD_III The third 99.4 1.4E-12 2.9E-17 134.3 11.8 92 299-391 271-362 (363)
13 cd03865 M14_CPE_H Peptidase M1 99.4 1.1E-12 2.3E-17 136.6 10.6 94 298-391 309-402 (402)
14 PF13620 CarboxypepD_reg: Carb 99.3 1.1E-11 2.4E-16 99.6 8.5 76 316-391 1-82 (82)
15 cd03867 M14_CPZ Peptidase M14- 99.2 4E-11 8.6E-16 124.7 10.8 91 300-390 303-395 (395)
16 cd03868 M14_CPD_I The first ca 99.2 6.1E-11 1.3E-15 122.2 10.3 92 299-390 280-372 (372)
17 KOG2649|consensus 99.2 6.2E-11 1.3E-15 124.2 9.9 99 298-396 361-459 (500)
18 cd03869 M14_CPX_like Peptidase 99.1 1.1E-10 2.3E-15 121.8 9.1 92 299-390 313-405 (405)
19 cd03858 M14_CP_N-E_like Carbox 99.1 2.1E-10 4.5E-15 118.0 11.1 89 302-390 285-374 (374)
20 cd03866 M14_CPM Peptidase M14 99.1 2.5E-10 5.5E-15 118.1 10.7 82 299-380 279-362 (376)
21 PF08400 phage_tail_N: Prophag 97.8 0.00011 2.4E-09 66.4 9.6 66 315-380 3-79 (134)
22 PRK15036 hydroxyisourate hydro 97.2 0.00069 1.5E-08 61.5 5.6 56 316-371 28-96 (137)
23 PF08308 PEGA: PEGA domain; I 96.8 0.0063 1.4E-07 48.0 7.3 59 329-392 10-69 (71)
24 cd00421 intradiol_dioxygenase 96.5 0.012 2.7E-07 53.4 8.0 47 315-361 12-79 (146)
25 PF07210 DUF1416: Protein of u 96.4 0.031 6.7E-07 46.9 9.0 51 316-367 9-63 (85)
26 PF02369 Big_1: Bacterial Ig-l 95.8 0.053 1.1E-06 46.2 8.0 59 315-373 25-95 (100)
27 KOG1948|consensus 95.2 0.053 1.1E-06 61.6 7.6 58 315-373 316-378 (1165)
28 PF14686 fn3_3: Polysaccharide 95.1 0.1 2.2E-06 44.5 7.6 53 316-368 4-70 (95)
29 cd03458 Catechol_intradiol_dio 94.8 0.12 2.5E-06 51.7 8.2 46 315-360 105-169 (256)
30 smart00634 BID_1 Bacterial Ig- 94.8 0.17 3.7E-06 42.1 8.0 59 315-373 20-89 (92)
31 cd03462 1,2-CCD chlorocatechol 94.5 0.16 3.5E-06 50.5 8.3 46 315-360 100-164 (247)
32 TIGR02465 chlorocat_1_2 chloro 94.5 0.16 3.6E-06 50.4 8.3 46 315-360 99-163 (246)
33 PF00775 Dioxygenase_C: Dioxyg 94.4 0.18 3.9E-06 47.9 8.2 47 315-361 30-97 (183)
34 TIGR02439 catechol_proteo cate 94.3 0.17 3.8E-06 51.2 8.3 46 315-360 129-193 (285)
35 cd03463 3,4-PCD_alpha Protocat 94.3 0.19 4.1E-06 47.9 7.9 47 315-361 37-106 (185)
36 cd03459 3,4-PCD Protocatechuat 94.1 0.23 5E-06 46.1 8.0 47 315-361 16-86 (158)
37 cd03460 1,2-CTD Catechol 1,2 d 94.0 0.22 4.8E-06 50.4 8.2 46 315-360 125-189 (282)
38 COG3485 PcaH Protocatechuate 3 93.6 0.14 3.1E-06 50.3 5.8 47 314-360 72-141 (226)
39 TIGR02438 catachol_actin catec 93.6 0.25 5.5E-06 50.0 7.8 47 315-361 133-198 (281)
40 TIGR02423 protocat_alph protoc 93.4 0.44 9.6E-06 45.7 8.8 47 315-361 40-110 (193)
41 cd03464 3,4-PCD_beta Protocate 93.0 0.38 8.2E-06 47.1 7.8 46 315-360 66-135 (220)
42 TIGR02422 protocat_beta protoc 93.0 0.51 1.1E-05 46.2 8.7 46 315-360 61-130 (220)
43 cd03461 1,2-HQD Hydroxyquinol 92.9 0.33 7.1E-06 49.1 7.5 47 315-361 121-186 (277)
44 PF05738 Cna_B: Cna protein B- 92.6 0.43 9.3E-06 37.1 6.3 46 329-374 1-55 (70)
45 PF10670 DUF4198: Domain of un 91.7 0.44 9.5E-06 44.5 6.2 51 316-367 152-212 (215)
46 PF09430 DUF2012: Protein of u 91.2 0.55 1.2E-05 41.5 6.0 51 329-379 7-61 (123)
47 PF12866 DUF3823: Protein of u 90.6 1.4 3.1E-05 43.1 8.8 76 315-392 22-113 (222)
48 TIGR02962 hdxy_isourate hydrox 89.3 0.63 1.4E-05 41.0 4.8 54 317-370 3-70 (112)
49 KOG1948|consensus 88.7 1.5 3.2E-05 50.5 8.1 57 315-371 119-179 (1165)
50 PF01190 Pollen_Ole_e_I: Polle 88.1 1.2 2.6E-05 37.4 5.5 33 326-358 18-58 (97)
51 PF11974 MG1: Alpha-2-macroglo 87.5 1.6 3.4E-05 37.2 5.9 40 317-356 15-60 (97)
52 PF12985 DUF3869: Domain of un 86.5 3.4 7.5E-05 36.1 7.4 72 313-395 20-98 (104)
53 COG5266 CbiK ABC-type Co2+ tra 84.6 2.7 5.9E-05 42.2 6.7 49 316-365 173-239 (264)
54 PF00576 Transthyretin: HIUase 81.3 1.5 3.3E-05 38.6 3.2 53 316-368 2-69 (112)
55 PF06488 L_lac_phage_MSP: Lact 78.8 3.6 7.8E-05 40.4 5.1 38 342-379 259-298 (301)
56 smart00095 TR_THY Transthyreti 77.5 4.3 9.3E-05 36.4 4.8 54 316-369 5-72 (121)
57 cd05822 TLP_HIUase HIUase (5-h 76.1 4.6 0.0001 35.6 4.6 54 317-370 3-70 (112)
58 cd05821 TLP_Transthyretin Tran 75.9 4.7 0.0001 36.2 4.6 55 316-370 8-76 (121)
59 cd03457 intradiol_dioxygenase_ 75.6 6.1 0.00013 37.8 5.7 47 315-361 27-100 (188)
60 cd05469 Transthyretin_like Tra 74.0 5.9 0.00013 35.1 4.7 53 317-369 3-69 (113)
61 PF10794 DUF2606: Protein of u 64.0 47 0.001 30.0 8.2 51 316-366 43-106 (131)
62 PF13953 PapC_C: PapC C-termin 63.9 15 0.00033 28.9 4.8 47 320-366 2-49 (68)
63 PF05688 DUF824: Salmonella re 63.7 13 0.00027 28.2 4.0 23 315-337 15-37 (47)
64 COG1470 Predicted membrane pro 62.7 26 0.00056 38.2 7.5 74 317-392 189-263 (513)
65 PF11008 DUF2846: Protein of u 59.6 50 0.0011 28.6 7.6 52 329-381 40-94 (117)
66 PRK15310 fimbrial outer membra 58.8 17 0.00037 42.3 5.7 46 314-362 785-830 (895)
67 PF14347 DUF4399: Domain of un 58.5 32 0.00069 29.0 6.0 52 317-371 22-75 (87)
68 COG2351 Transthyretin-like pro 51.8 36 0.00079 30.7 5.4 54 316-369 10-77 (124)
69 PF03785 Peptidase_C25_C: Pept 51.7 64 0.0014 27.2 6.5 46 330-376 26-79 (81)
70 smart00088 PINT motif in prote 50.3 12 0.00025 30.3 2.1 53 24-76 7-59 (88)
71 smart00753 PAM PCI/PINT associ 50.3 12 0.00025 30.3 2.1 53 24-76 7-59 (88)
72 PF01060 DUF290: Transthyretin 50.2 24 0.00053 28.7 3.9 30 325-354 7-44 (80)
73 PF07523 Big_3: Bacterial Ig-l 50.1 26 0.00057 27.3 3.9 38 319-368 21-58 (67)
74 KOG3006|consensus 39.4 51 0.0011 29.9 4.4 50 316-365 22-85 (132)
75 PF08863 YolD: YolD-like prote 35.4 1.1E+02 0.0024 24.7 5.7 57 52-113 2-59 (92)
76 PF10642 Tom5: Mitochondrial i 32.9 1.1E+02 0.0023 23.5 4.7 30 1-32 1-30 (49)
77 PRK14981 DNA-directed RNA poly 32.9 1.4E+02 0.003 26.2 6.2 58 23-89 51-109 (112)
78 PF05751 FixH: FixH; InterPro 32.4 1.8E+02 0.0038 25.7 6.9 50 315-364 69-130 (146)
79 cd00390 Urease_gamma Urease ga 30.9 1.2E+02 0.0025 26.4 5.1 71 38-113 15-93 (96)
80 KOG4659|consensus 30.5 1.8E+02 0.0039 36.0 8.1 87 308-394 45-134 (1899)
81 PF15248 DUF4587: Domain of un 29.7 25 0.00055 29.2 0.9 18 54-71 6-23 (76)
82 PF01835 A2M_N: MG2 domain; I 29.1 3.2E+02 0.0069 22.3 8.0 53 315-367 17-86 (99)
83 PF13115 YtkA: YtkA-like 28.7 1.7E+02 0.0037 23.4 5.7 21 317-338 24-44 (86)
84 PF10577 UPF0560: Uncharacteri 28.4 1.4E+02 0.0031 34.6 6.8 60 316-375 2-68 (807)
85 PF07550 DUF1533: Protein of u 27.5 1.5E+02 0.0032 23.3 4.9 20 356-375 45-64 (65)
86 COG3422 Uncharacterized conser 26.4 68 0.0015 25.5 2.7 31 343-373 7-37 (59)
87 smart00561 MBT Present in Dros 24.3 37 0.00081 28.9 1.0 31 97-127 58-95 (96)
88 COG0042 tRNA-dihydrouridine sy 24.0 69 0.0015 33.0 3.1 32 76-107 119-151 (323)
89 PHA02545 45 sliding clamp; Pro 23.8 3.7E+02 0.0079 26.8 7.8 111 255-394 100-220 (223)
90 PF02820 MBT: mbt repeat; Int 23.2 11 0.00024 30.2 -2.3 26 102-127 33-64 (73)
91 PF13754 Big_3_4: Bacterial Ig 22.9 1.6E+02 0.0034 22.3 4.1 28 340-367 3-33 (54)
92 PF07411 DUF1508: Domain of un 22.8 73 0.0016 23.9 2.2 27 346-372 1-27 (49)
93 PF01399 PCI: PCI domain; Int 21.5 46 0.001 27.0 1.0 22 56-77 75-96 (105)
94 cd00390 Urease_gamma Urease ga 20.4 1.3E+02 0.0027 26.2 3.5 34 243-276 14-48 (96)
No 1
>KOG3431|consensus
Probab=99.95 E-value=1.9e-28 Score=212.81 Aligned_cols=96 Identities=51% Similarity=0.729 Sum_probs=85.4
Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHhhhccccCchHHHHHHHHHHHHHHcCCCCCCCCHHHHHHHHH
Q psy7340 6 GGGASGEQQKAAQERQEQIKDMKNSILSQVLDQSARARLNTIMLCKPEKAQQIENMICQMAQTGQIMNKLGENELIGLLE 85 (440)
Q Consensus 6 ~~~~~~e~~~~~~~~~~~~e~~~~~~L~qiL~~eAreRL~~i~lvkPe~a~~vE~~li~~aq~G~i~~kv~e~~l~~iL~ 85 (440)
||++++.++++.+++++ ++|||++||+|||++.||+||+||+|||||||++||+|||+||++|||.+||||++||.||+
T Consensus 21 G~~~~~~~q~a~q~~~~-q~e~r~~~lsQvLdqqAr~RLsrlAlvkpekAq~VE~~lirma~~gQvs~Kise~~lisiLe 99 (129)
T KOG3431|consen 21 GGANDAAQQQANQEQQE-QEEMRQSMLSQVLDQQARERLSRLALVKPEKAQAVENYLIRMAQTGQVSHKISEAELISILE 99 (129)
T ss_pred CCCccccccchhhhhhh-HHHHHHhHHHHHhhHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHhCCccccccHHHHHHHHH
Confidence 34444555666665555 89999999999999999999999999999999999999999999999999999999999999
Q ss_pred Hhhhhh-cccceeEeccc
Q psy7340 86 QISNRE-EKKSSVKTPKF 102 (440)
Q Consensus 86 ~i~~~~-~~~~~vk~~~~ 102 (440)
+|+.++ ++.++|||..+
T Consensus 100 ~is~Qt~qk~tkV~f~RR 117 (129)
T KOG3431|consen 100 KISAQTNQKNTKVKFDRR 117 (129)
T ss_pred HHHHhhccccceeeeecc
Confidence 999998 88999998544
No 2
>PRK04239 hypothetical protein; Provisional
Probab=99.94 E-value=9e-27 Score=201.17 Aligned_cols=76 Identities=39% Similarity=0.593 Sum_probs=70.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCHHHHHHhhhccccCchHHHHHHHHHHHHHHcCCCCCCCCHHHHHHHHHHhhhh
Q psy7340 15 KAAQERQEQIKDMKNSILSQVLDQSARARLNTIMLCKPEKAQQIENMICQMAQTGQIMNKLGENELIGLLEQISNR 90 (440)
Q Consensus 15 ~~~~~~~~~~e~~~~~~L~qiL~~eAreRL~~i~lvkPe~a~~vE~~li~~aq~G~i~~kv~e~~l~~iL~~i~~~ 90 (440)
+++++++++++++|++||++||||+||+||+||+|||||+|++||+|||+|||+|+|++||||++||+||++|+.+
T Consensus 26 ~~~~~~~~~~e~qk~~iL~qiLt~eAreRL~rI~lvkPe~A~~VE~~liqlAq~G~i~~ki~e~~L~~lL~~v~~~ 101 (110)
T PRK04239 26 EEQEEAQAQAEAQKQAILRQILTPEARERLNRIKLVKPEFAEQVEQQLIQLAQSGRIQGPIDDEQLKEILEQLTPQ 101 (110)
T ss_pred hhHHHHHHHHHHHHHHHHHHHCCHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHcCCCCCCcCHHHHHHHHHHHhhc
Confidence 3345566778999999999999999999999999999999999999999999999999999999999999999865
No 3
>PF01984 dsDNA_bind: Double-stranded DNA-binding domain; InterPro: IPR002836 This protein family is found in archaea and eukaryota. The human TFAR19 (TF-1 cell apoptosis-related protein 19) encodes a protein which shares significant homology to the corresponding proteins of species ranging from yeast to mice. TFAR19 exhibits a ubiquitous expression pattern and its expression is up-regulated in the tumour cells undergoing apoptosis. TFAR19 may play a general role in the apoptotic process []. Also included in this family is a DNA-binding protein from the archaea, Methanobacterium thermoautotrophicum.; GO: 0003677 DNA binding; PDB: 1EIJ_A 2K6B_A 2CRU_A 1YYB_A 2JXN_A 2FH0_A.
Probab=99.93 E-value=7.2e-28 Score=207.96 Aligned_cols=90 Identities=41% Similarity=0.651 Sum_probs=75.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHhhhccccCchHHHHHHHHHHHHHHcCCCCCCCCHHHHHHHHHHhhhh
Q psy7340 11 GEQQKAAQERQEQIKDMKNSILSQVLDQSARARLNTIMLCKPEKAQQIENMICQMAQTGQIMNKLGENELIGLLEQISNR 90 (440)
Q Consensus 11 ~e~~~~~~~~~~~~e~~~~~~L~qiL~~eAreRL~~i~lvkPe~a~~vE~~li~~aq~G~i~~kv~e~~l~~iL~~i~~~ 90 (440)
.+.++++++++++++++|++||++||||+||+||+||+|||||+|++||+|||+||++|+|++||||++|++||++|+.+
T Consensus 17 ~~~~~~~~~~~~~~ee~~~~iL~qiLt~eAreRL~rI~lvkPe~A~~VE~~Liqlaq~G~l~~kI~d~~L~~iL~~i~~~ 96 (107)
T PF01984_consen 17 QQKQQQQEEQREQQEEQRRAILRQILTPEARERLNRIKLVKPEKARQVENQLIQLAQSGQLRGKIDDEQLKEILEQISEQ 96 (107)
T ss_dssp SSTCHHCHHCHCCCHHCCHHHHHTCB-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCTSSSS-B-HHHHHHHHHHHCCC
T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHcCHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHcCCCCCCcCHHHHHHHHHHHhhc
Confidence 33445556667788999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred hcccceeEec
Q psy7340 91 EEKKSSVKTP 100 (440)
Q Consensus 91 ~~~~~~vk~~ 100 (440)
++++++|+|.
T Consensus 97 ~~~~~~I~~~ 106 (107)
T PF01984_consen 97 KQKETKIKIK 106 (107)
T ss_dssp -SS-SS-S--
T ss_pred ccCCCeeeee
Confidence 8877788774
No 4
>COG2118 DNA-binding protein [General function prediction only]
Probab=99.92 E-value=3.1e-25 Score=191.11 Aligned_cols=81 Identities=36% Similarity=0.567 Sum_probs=74.2
Q ss_pred HHHHHHHHHHHHHHHhcCHHHHHHhhhccccCchHHHHHHHHHHHHHHcCCCCCCCCHHHHHHHHHHhhhhhcccceeEe
Q psy7340 20 RQEQIKDMKNSILSQVLDQSARARLNTIMLCKPEKAQQIENMICQMAQTGQIMNKLGENELIGLLEQISNREEKKSSVKT 99 (440)
Q Consensus 20 ~~~~~e~~~~~~L~qiL~~eAreRL~~i~lvkPe~a~~vE~~li~~aq~G~i~~kv~e~~l~~iL~~i~~~~~~~~~vk~ 99 (440)
.+++.+++|++||++||||+||+||+||+|||||+|++||++||+|||+|+|++||||++||+||++|+.++++.+.+.|
T Consensus 34 ~r~~~eaqkqaiLrqiLtpeAreRL~~irLvRPe~AeavE~qLi~LaqtGri~~~I~e~~lk~IL~~i~~~~rre~kI~~ 113 (116)
T COG2118 34 ARQEEEAQKQAILRQILTPEARERLARVRLVRPELAEAVENQLIQLAQTGRITHKIDEEELKEILERISPQTRREFKIRR 113 (116)
T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHhhhcCHHHHHHHHHHHHHHHHcCCCCCCCCHHHHHHHHHHHhHHhccchheec
Confidence 33556899999999999999999999999999999999999999999999999999999999999999998866666655
Q ss_pred c
Q psy7340 100 P 100 (440)
Q Consensus 100 ~ 100 (440)
+
T Consensus 114 ~ 114 (116)
T COG2118 114 R 114 (116)
T ss_pred c
Confidence 4
No 5
>PRK04239 hypothetical protein; Provisional
Probab=99.87 E-value=9.4e-23 Score=176.30 Aligned_cols=95 Identities=35% Similarity=0.485 Sum_probs=83.0
Q ss_pred CCCccccccccCCCCCChHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHhhhccccchhhHHHHHHHHHHHHhcCCccc
Q psy7340 200 RGGGLRGHVMRGGGGASGEQQKAAQERQEQIKDMKNSILSQVLDQSARARLNTIMLCKPEKAQQIENMICQMAQTGQIMN 279 (440)
Q Consensus 200 ~~~~~~~~~~~~~g~~~~e~~~~~~~~~~~~e~~~~~il~qiLt~eAreRL~~i~lvkpe~a~~VE~~LiqLAqsG~It~ 279 (440)
|..+|.++..++||++++ .+++++++++++++|++||++|||||||+||+||+|||||+|+.||++|+++|++|+|+.
T Consensus 7 R~~rl~eLq~q~~~~~~~--~~~~~~~~~~~e~qk~~iL~qiLt~eAreRL~rI~lvkPe~A~~VE~~liqlAq~G~i~~ 84 (110)
T PRK04239 7 RRRKLEELQKQAQEQQQA--QEEQEEAQAQAEAQKQAILRQILTPEARERLNRIKLVKPEFAEQVEQQLIQLAQSGRIQG 84 (110)
T ss_pred HHHHHHHHHHHhccCCcc--hhhHHHHHHHHHHHHHHHHHHHCCHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHcCCCCC
Confidence 456788888777765544 334466667789999999999999999999999999999999999999999999999999
Q ss_pred cccchhHHHHHHHhhhh
Q psy7340 280 KLGENELIGLLEQISNR 296 (440)
Q Consensus 280 KISDe~LkeIL~qis~q 296 (440)
+|+|++|+.||+.++.+
T Consensus 85 ki~e~~L~~lL~~v~~~ 101 (110)
T PRK04239 85 PIDDEQLKEILEQLTPQ 101 (110)
T ss_pred CcCHHHHHHHHHHHhhc
Confidence 99999999999998743
No 6
>KOG3431|consensus
Probab=99.85 E-value=9.1e-22 Score=171.29 Aligned_cols=98 Identities=49% Similarity=0.689 Sum_probs=93.1
Q ss_pred CCCccccccccCCCCCChHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHhhhccccchhhHHHHHHHHHHHHhcCCccc
Q psy7340 200 RGGGLRGHVMRGGGGASGEQQKAAQERQEQIKDMKNSILSQVLDQSARARLNTIMLCKPEKAQQIENMICQMAQTGQIMN 279 (440)
Q Consensus 200 ~~~~~~~~~~~~~g~~~~e~~~~~~~~~~~~e~~~~~il~qiLt~eAreRL~~i~lvkpe~a~~VE~~LiqLAqsG~It~ 279 (440)
|..||.++.++.||+.++..|++.+++.+ .+++|++||+|||++.||+||+||+|||||||..||++|++||+.|+|+-
T Consensus 9 R~qRlaqlqa~~G~~~~~~~q~a~q~~~~-q~e~r~~~lsQvLdqqAr~RLsrlAlvkpekAq~VE~~lirma~~gQvs~ 87 (129)
T KOG3431|consen 9 RAQRLAQLQANSGGANDAAQQQANQEQQE-QEEMRQSMLSQVLDQQARERLSRLALVKPEKAQAVENYLIRMAQTGQVSH 87 (129)
T ss_pred HHHHHHHhhhhcCCCccccccchhhhhhh-HHHHHHhHHHHHhhHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHhCCccc
Confidence 67899999999999999998888888877 99999999999999999999999999999999999999999999999999
Q ss_pred cccchhHHHHHHHhhhhhh
Q psy7340 280 KLGENELIGLLEQISNREE 298 (440)
Q Consensus 280 KISDe~LkeIL~qis~qt~ 298 (440)
|+++..|+.||+.++.++.
T Consensus 88 Kise~~lisiLe~is~Qt~ 106 (129)
T KOG3431|consen 88 KISEAELISILEKISAQTN 106 (129)
T ss_pred cccHHHHHHHHHHHHHhhc
Confidence 9999999999999987763
No 7
>PF01984 dsDNA_bind: Double-stranded DNA-binding domain; InterPro: IPR002836 This protein family is found in archaea and eukaryota. The human TFAR19 (TF-1 cell apoptosis-related protein 19) encodes a protein which shares significant homology to the corresponding proteins of species ranging from yeast to mice. TFAR19 exhibits a ubiquitous expression pattern and its expression is up-regulated in the tumour cells undergoing apoptosis. TFAR19 may play a general role in the apoptotic process []. Also included in this family is a DNA-binding protein from the archaea, Methanobacterium thermoautotrophicum.; GO: 0003677 DNA binding; PDB: 1EIJ_A 2K6B_A 2CRU_A 1YYB_A 2JXN_A 2FH0_A.
Probab=99.84 E-value=1.2e-22 Score=175.51 Aligned_cols=97 Identities=39% Similarity=0.591 Sum_probs=80.1
Q ss_pred CCCccccccccCCCCCChHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHhhhccccchhhHHHHHHHHHHHHhcCCccc
Q psy7340 200 RGGGLRGHVMRGGGGASGEQQKAAQERQEQIKDMKNSILSQVLDQSARARLNTIMLCKPEKAQQIENMICQMAQTGQIMN 279 (440)
Q Consensus 200 ~~~~~~~~~~~~~g~~~~e~~~~~~~~~~~~e~~~~~il~qiLt~eAreRL~~i~lvkpe~a~~VE~~LiqLAqsG~It~ 279 (440)
|..+|.++..++| ++.+.+++++++++++++++++||++|||||||+||+||+|||||+|+.||++|++||++|+|+.
T Consensus 2 R~~rl~Elq~~~~--~~~~~~~~~~~~~~~~ee~~~~iL~qiLt~eAreRL~rI~lvkPe~A~~VE~~Liqlaq~G~l~~ 79 (107)
T PF01984_consen 2 RRRRLAELQQQQG--QQQQKQQQQEEQREQQEEQRRAILRQILTPEARERLNRIKLVKPEKARQVENQLIQLAQSGQLRG 79 (107)
T ss_dssp HHHHHHHCCHTSS---SSSTCHHCHHCHCCCHHCCHHHHHTCB-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCTSSSS
T ss_pred chHHHHHHHhccc--cccchhhhHHHHHHHHHHHHHHHHHHHcCHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHcCCCCC
Confidence 3456777777766 44455555666777889999999999999999999999999999999999999999999999999
Q ss_pred cccchhHHHHHHHhhhhhh
Q psy7340 280 KLGENELIGLLEQISNREE 298 (440)
Q Consensus 280 KISDe~LkeIL~qis~qt~ 298 (440)
+|+|++|+.||+.++.++.
T Consensus 80 kI~d~~L~~iL~~i~~~~~ 98 (107)
T PF01984_consen 80 KIDDEQLKEILEQISEQKQ 98 (107)
T ss_dssp -B-HHHHHHHHHHHCCC-S
T ss_pred CcCHHHHHHHHHHHhhccc
Confidence 9999999999999876543
No 8
>COG2118 DNA-binding protein [General function prediction only]
Probab=99.83 E-value=4.4e-21 Score=165.49 Aligned_cols=95 Identities=37% Similarity=0.512 Sum_probs=79.3
Q ss_pred CCCccccccccCCCCCChHHHHHHHH-HHHHHHHHHHHHHHHhCCHHHHHHhhhccccchhhHHHHHHHHHHHHhcCCcc
Q psy7340 200 RGGGLRGHVMRGGGGASGEQQKAAQE-RQEQIKDMKNSILSQVLDQSARARLNTIMLCKPEKAQQIENMICQMAQTGQIM 278 (440)
Q Consensus 200 ~~~~~~~~~~~~~g~~~~e~~~~~~~-~~~~~e~~~~~il~qiLt~eAreRL~~i~lvkpe~a~~VE~~LiqLAqsG~It 278 (440)
|..+|.++.++. +..++++.+++ ++++.+++|++||++||||+||+||+||+|||||+|++||++|++||++|+|+
T Consensus 10 Rrrkl~eLQrq~---~~~~~~~~q~eq~r~~~eaqkqaiLrqiLtpeAreRL~~irLvRPe~AeavE~qLi~LaqtGri~ 86 (116)
T COG2118 10 RRRKLAELQRQA---KLEEQREAQEEQARQEEEAQKQAILRQILTPEARERLARVRLVRPELAEAVENQLIQLAQTGRIT 86 (116)
T ss_pred HHHHHHHHHHhh---hhHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHhhhcCHHHHHHHHHHHHHHHHcCCCC
Confidence 445566666655 33444444444 34556899999999999999999999999999999999999999999999999
Q ss_pred ccccchhHHHHHHHhhhhh
Q psy7340 279 NKLGENELIGLLEQISNRE 297 (440)
Q Consensus 279 ~KISDe~LkeIL~qis~qt 297 (440)
.+|+|++|+.||..++.++
T Consensus 87 ~~I~e~~lk~IL~~i~~~~ 105 (116)
T COG2118 87 HKIDEEELKEILERISPQT 105 (116)
T ss_pred CCCCHHHHHHHHHHHhHHh
Confidence 9999999999999988654
No 9
>PF13715 DUF4480: Domain of unknown function (DUF4480)
Probab=99.58 E-value=1.7e-14 Score=117.96 Aligned_cols=86 Identities=27% Similarity=0.328 Sum_probs=75.4
Q ss_pred eEEEEEECCC-CCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccceeEEEEEEecCCceEEEEEEeccc
Q psy7340 316 GVQGFVTDDL-GNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDFMVVEQHPTLLNVTLHTSK 394 (440)
Q Consensus 316 ~IsG~V~D~~-GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itItVt~~~~t~lnI~L~~~~ 394 (440)
+|+|+|+|+. |+||++|+|.+.+++.++.||.+|.|.+.+++|.|+|.||++||.+....+.+.......++|.|.++.
T Consensus 1 ti~G~V~d~~t~~pl~~a~V~~~~~~~~~~Td~~G~F~i~~~~g~~~l~is~~Gy~~~~~~i~~~~~~~~~~~i~L~~~~ 80 (88)
T PF13715_consen 1 TISGKVVDSDTGEPLPGATVYLKNTKKGTVTDENGRFSIKLPEGDYTLKISYIGYETKTITISVNSNKNTNLNIYLEPKS 80 (88)
T ss_pred CEEEEEEECCCCCCccCeEEEEeCCcceEEECCCeEEEEEEcCCCeEEEEEEeCEEEEEEEEEecCCCEEEEEEEEeeCc
Confidence 4899999975 999999999999999999999999999889999999999999999999888764433357899999988
Q ss_pred cccccccEE
Q psy7340 395 WLALWTSDL 403 (440)
Q Consensus 395 ~~L~l~evV 403 (440)
..| +||+
T Consensus 81 ~~L--~eVv 87 (88)
T PF13715_consen 81 NQL--DEVV 87 (88)
T ss_pred ccC--CeEE
Confidence 885 5654
No 10
>cd03863 M14_CPD_II The second carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain II. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, while the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally ac
Probab=99.51 E-value=6.9e-14 Score=144.39 Aligned_cols=94 Identities=33% Similarity=0.584 Sum_probs=86.3
Q ss_pred hccchhhhHHHhhhcceeEEEEEECC-CCCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccceeEEEEE
Q psy7340 299 KKSSVKSLVKFLAEAHRGVQGFVTDD-LGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDFM 377 (440)
Q Consensus 299 kKSsv~~l~~~~~q~~~~IsG~V~D~-~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itIt 377 (440)
|..++.+++.++.+++.+|+|+|+|. +|+||+||+|.|+|++.+++||.+|.|++.++||+|+|+||++||+++++.|.
T Consensus 281 w~~n~~all~~~~~~~~gI~G~V~D~~~g~pl~~AtV~V~g~~~~~~Td~~G~f~~~l~pG~ytl~vs~~GY~~~~~~v~ 360 (375)
T cd03863 281 WEQNRRSLLQFMKQVHRGVRGFVLDATDGRGILNATISVADINHPVTTYKDGDYWRLLVPGTYKVTASARGYDPVTKTVE 360 (375)
T ss_pred HHHHHHHHHHHHHHhcCeEEEEEEeCCCCCCCCCeEEEEecCcCceEECCCccEEEccCCeeEEEEEEEcCcccEEEEEE
Confidence 56678899999999999999999996 89999999999999999999999999998899999999999999999999988
Q ss_pred EecCCceEEEEEEec
Q psy7340 378 VVEQHPTLLNVTLHT 392 (440)
Q Consensus 378 Vt~~~~t~lnI~L~~ 392 (440)
|..+....++|.|++
T Consensus 361 V~~~~~~~~~~~L~~ 375 (375)
T cd03863 361 VDSKGAVQVNFTLSR 375 (375)
T ss_pred EcCCCcEEEEEEecC
Confidence 877777788888863
No 11
>cd03864 M14_CPN Peptidase M14 Carboxypeptidase N (CPN, also known as kininase I, creatine kinase conversion factor, plasma carboxypeptidase B, arginine carboxypeptidase, and protaminase; EC 3.4.17.3) is an extracellular glycoprotein synthesized in the liver and released into the blood, where it is present in high concentrations. CPN belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPN plays an important role in protecting the body from excessive buildup of potentially deleterious peptides that normally act as local autocrine or paracrine hormones. It specifically removes C-terminal basic residues. As CPN can cleave lysine more avidly than arginine residues it is also called lysine carboxypeptidase. CPN substrates inclu
Probab=99.41 E-value=7.8e-13 Score=137.31 Aligned_cols=94 Identities=29% Similarity=0.594 Sum_probs=84.6
Q ss_pred hhccchhhhHHHhhhcceeEEEEEECCCCCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccceeEEEEE
Q psy7340 298 EKKSSVKSLVKFLAEAHRGVQGFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDFM 377 (440)
Q Consensus 298 ~kKSsv~~l~~~~~q~~~~IsG~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itIt 377 (440)
.|..++.+++.++.+++.+|+|+|+|.+|+||++|+|.|+|++.+++||.+|.|+..++||+|+|.+|++||.+++.+|+
T Consensus 299 ~w~~n~~all~~~~~~~~gI~G~V~D~~g~pi~~A~V~v~g~~~~~~T~~~G~y~r~l~pG~Y~l~vs~~Gy~~~t~~v~ 378 (392)
T cd03864 299 EWLGNREALISYIEQVHQGIKGMVTDENNNGIANAVISVSGISHDVTSGTLGDYFRLLLPGTYTVTASAPGYQPSTVTVT 378 (392)
T ss_pred HHHHHHHHHHHHHHHhcCeEEEEEECCCCCccCCeEEEEECCccceEECCCCcEEecCCCeeEEEEEEEcCceeEEEEEE
Confidence 35668889999999999999999999999999999999999999999999999976789999999999999999999988
Q ss_pred EecCCceEEEEEEe
Q psy7340 378 VVEQHPTLLNVTLH 391 (440)
Q Consensus 378 Vt~~~~t~lnI~L~ 391 (440)
|..+....++|.|+
T Consensus 379 V~~~~~~~~df~L~ 392 (392)
T cd03864 379 VGPAEATLVNFQLK 392 (392)
T ss_pred EcCCCcEEEeeEeC
Confidence 87665566788763
No 12
>cd06245 M14_CPD_III The third carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain III. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active a
Probab=99.40 E-value=1.4e-12 Score=134.29 Aligned_cols=92 Identities=30% Similarity=0.545 Sum_probs=83.5
Q ss_pred hccchhhhHHHhhhcceeEEEEEECCCCCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccceeEEEEEE
Q psy7340 299 KKSSVKSLVKFLAEAHRGVQGFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDFMV 378 (440)
Q Consensus 299 kKSsv~~l~~~~~q~~~~IsG~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itItV 378 (440)
|..++.+++.++.+++.+|+|+|+|.+|+||+||+|.|+|+. +++||.+|.|++.++||.|+|.++++||.+++.+|+|
T Consensus 271 w~~n~~all~~~~~~~~gI~G~V~d~~g~pi~~A~V~v~g~~-~~~T~~~G~y~~~L~pG~y~v~vs~~Gy~~~~~~V~v 349 (363)
T cd06245 271 WAENKKSLLSMIVEAHKGVHGVVTDKAGKPISGATIVLNGGH-RVYTKEGGYFHVLLAPGQHNINVIAEGYQQEHLPVVV 349 (363)
T ss_pred HHHHHHHHHHHHHHcCcEEEEEEEcCCCCCccceEEEEeCCC-ceEeCCCcEEEEecCCceEEEEEEEeCceeEEEEEEE
Confidence 456778899999999999999999999999999999999986 8999999999888999999999999999999999888
Q ss_pred ecCCceEEEEEEe
Q psy7340 379 VEQHPTLLNVTLH 391 (440)
Q Consensus 379 t~~~~t~lnI~L~ 391 (440)
..+....++|.|.
T Consensus 350 ~~~~~~~~~f~L~ 362 (363)
T cd06245 350 SHDEASSVKIVLD 362 (363)
T ss_pred cCCCeEEEEEEec
Confidence 7666667888875
No 13
>cd03865 M14_CPE_H Peptidase M14 Carboxypeptidase (CP) E (CPE, also known as carboxypeptidase H, and enkephalin convertase; EC 3.4.17.10) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPE is an important enzyme responsible for the proteolytic processing of prohormone intermediates (such as pro-insulin, pro-opiomelanocortin, or pro-gonadotropin-releasing hormone) by specifically removing C-terminal basic residues. In addition, it has been proposed that the regulated secretory pathway (RSP) of the nervous and endocrine systems utilizes membrane-bound CPE as a sorting receptor. A naturally occurring point mutation in CPE reduces the stability of the enzyme and causes its degradation, leading to an accumulation of numerous neuroendocrine pe
Probab=99.40 E-value=1.1e-12 Score=136.57 Aligned_cols=94 Identities=34% Similarity=0.630 Sum_probs=86.0
Q ss_pred hhccchhhhHHHhhhcceeEEEEEECCCCCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccceeEEEEE
Q psy7340 298 EKKSSVKSLVKFLAEAHRGVQGFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDFM 377 (440)
Q Consensus 298 ~kKSsv~~l~~~~~q~~~~IsG~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itIt 377 (440)
.|..++.+++.++.+++.+|+|+|+|.+|+||++|+|.|+|.+.+++||.+|.|++.++||+|+|++++.||+++...|+
T Consensus 309 ~W~~n~~all~~~~q~~~gI~G~V~D~~g~pI~~AtV~V~g~~~~~~T~~~G~Y~~~L~pG~Ytv~vsa~Gy~~~~~~V~ 388 (402)
T cd03865 309 YWEDNKNSLVNYIEQVHRGVKGFVKDLQGNPIANATISVEGIDHDITSAKDGDYWRLLAPGNYKLTASAPGYLAVVKKVA 388 (402)
T ss_pred HHHHHHHHHHHHHHHhccceEEEEECCCCCcCCCeEEEEEcCccccEECCCeeEEECCCCEEEEEEEEecCcccEEEEEE
Confidence 35778889999999999999999999999999999999999999999999999988899999999999999999988888
Q ss_pred EecCCceEEEEEEe
Q psy7340 378 VVEQHPTLLNVTLH 391 (440)
Q Consensus 378 Vt~~~~t~lnI~L~ 391 (440)
|..+....++|.|+
T Consensus 389 V~~~~~~~vdf~Le 402 (402)
T cd03865 389 VPYSPAVRVDFELE 402 (402)
T ss_pred EcCCCcEEEeEEeC
Confidence 87666778888874
No 14
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=99.29 E-value=1.1e-11 Score=99.56 Aligned_cols=76 Identities=34% Similarity=0.510 Sum_probs=59.1
Q ss_pred eEEEEEECCCCCcccccEEEEe----ccceeeeeCCCccEEEE-ecCCceEEEEEEeccceeEE-EEEEecCCceEEEEE
Q psy7340 316 GVQGFVTDDLGNPVEKASLKIK----GRDVGFQTTKYGEFWRI-LLPGIYKLEVYADGYVPREI-DFMVVEQHPTLLNVT 389 (440)
Q Consensus 316 ~IsG~V~D~~GePL~GAtV~Ik----Gt~~gt~TD~~G~F~l~-lp~G~ytL~VS~IGY~~q~i-tItVt~~~~t~lnI~ 389 (440)
+|+|+|+|++|+||++|+|.+. +....+.||.+|.|.+. +++|.|.|.+++.||.+... .+.+..+....++|.
T Consensus 1 tI~G~V~d~~g~pv~~a~V~l~~~~~~~~~~~~Td~~G~f~~~~l~~g~Y~l~v~~~g~~~~~~~~v~v~~~~~~~~~i~ 80 (82)
T PF13620_consen 1 TISGTVTDATGQPVPGATVTLTDQDGGTVYTTTTDSDGRFSFEGLPPGTYTLRVSAPGYQPQTQENVTVTAGQTTTVDIT 80 (82)
T ss_dssp -EEEEEEETTSCBHTT-EEEET--TTTECCEEE--TTSEEEEEEE-SEEEEEEEEBTTEE-EEEEEEEESSSSEEE--EE
T ss_pred CEEEEEEcCCCCCcCCEEEEEEEeeCCCEEEEEECCCceEEEEccCCEeEEEEEEECCcceEEEEEEEEeCCCEEEEEEE
Confidence 4899999999999999999998 34566899999999875 99999999999999999987 587876666678888
Q ss_pred Ee
Q psy7340 390 LH 391 (440)
Q Consensus 390 L~ 391 (440)
|+
T Consensus 81 L~ 82 (82)
T PF13620_consen 81 LE 82 (82)
T ss_dssp EE
T ss_pred EC
Confidence 75
No 15
>cd03867 M14_CPZ Peptidase M14-like domain of carboxypeptidase (CP) Z (CPZ), CPZ belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPZ is a secreted Zn-dependent enzyme whose biological function is largely unknown. Unlike other members of the N/E subfamily, CPZ has a bipartite structure, which consists of an N-terminal cysteine-rich domain (CRD) whose sequence is similar to Wnt-binding proteins, and a C-terminal CP catalytic domain that removes C-terminal Arg residues from substrates. CPZ is enriched in the extracellular matrix and is widely distributed during early embryogenesis. That the CRD of CPZ can bind to Wnt4 suggests that CPZ plays a role in Wnt signaling.
Probab=99.23 E-value=4e-11 Score=124.71 Aligned_cols=91 Identities=29% Similarity=0.593 Sum_probs=78.8
Q ss_pred ccchhhhHHHhhhcceeEEEEEECCCCCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccceeEEEEEEe
Q psy7340 300 KSSVKSLVKFLAEAHRGVQGFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDFMVV 379 (440)
Q Consensus 300 KSsv~~l~~~~~q~~~~IsG~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itItVt 379 (440)
..+..++..++.+++.+|+|+|+|++|+||++|+|.++|++.++.||.+|.|...+++|.|+|.++++||++.+.+|++.
T Consensus 303 ~e~~~~l~~~~~~~~~~i~G~V~D~~g~pi~~A~V~v~g~~~~~~Td~~G~y~~~l~~G~y~l~vs~~Gy~~~~~~v~v~ 382 (395)
T cd03867 303 QENKEALLSFMEMVHRGIKGFVKDKDGNPIKGARISVRGIRHDITTAEDGDYWRLLPPGIHIVSAQAPGYTKVMKRVTLP 382 (395)
T ss_pred HHHHHHHHHHHHhccceeEEEEEcCCCCccCCeEEEEeccccceEECCCceEEEecCCCcEEEEEEecCeeeEEEEEEeC
Confidence 44566788888888999999999999999999999999999999999999998788999999999999999999888775
Q ss_pred c--CCceEEEEEE
Q psy7340 380 E--QHPTLLNVTL 390 (440)
Q Consensus 380 ~--~~~t~lnI~L 390 (440)
. +....++|.|
T Consensus 383 ~~~~~~~~~d~~l 395 (395)
T cd03867 383 ARMKRAGRVDFVL 395 (395)
T ss_pred CcCCCceEeeeEC
Confidence 3 2344566654
No 16
>cd03868 M14_CPD_I The first carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain I. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active at p
Probab=99.20 E-value=6.1e-11 Score=122.21 Aligned_cols=92 Identities=40% Similarity=0.759 Sum_probs=78.6
Q ss_pred hccchhhhHHHhhhcceeEEEEEECCCCCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccceeEEEE-E
Q psy7340 299 KKSSVKSLVKFLAEAHRGVQGFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDF-M 377 (440)
Q Consensus 299 kKSsv~~l~~~~~q~~~~IsG~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itI-t 377 (440)
|..++.++..++.+++.+|+|+|+|.+|+|++||+|.|.++..++.||.+|.|+..+++|+|+|.|++.||++++... .
T Consensus 280 w~~n~~al~~~~~~~~~~i~G~V~d~~g~pv~~A~V~v~~~~~~~~td~~G~y~~~l~~G~Y~l~vs~~Gf~~~~~~~v~ 359 (372)
T cd03868 280 WNNNRESLLAYLEQVHIGVKGFVRDASGNPIEDATIMVAGIDHNVTTAKFGDYWRLLLPGTYTITAVAPGYEPSTVTDVV 359 (372)
T ss_pred HHHhHHHHHHHHHHhCCceEEEEEcCCCCcCCCcEEEEEecccceEeCCCceEEecCCCEEEEEEEEecCCCceEEeeEE
Confidence 345667788889999999999999999999999999999999999999999998778999999999999999988763 4
Q ss_pred EecCCceEEEEEE
Q psy7340 378 VVEQHPTLLNVTL 390 (440)
Q Consensus 378 Vt~~~~t~lnI~L 390 (440)
|..+....++|.|
T Consensus 360 v~~g~~~~~~~~L 372 (372)
T cd03868 360 VKEGEATSVNFTL 372 (372)
T ss_pred EcCCCeEEEeeEC
Confidence 5445555566654
No 17
>KOG2649|consensus
Probab=99.19 E-value=6.2e-11 Score=124.18 Aligned_cols=99 Identities=34% Similarity=0.603 Sum_probs=90.4
Q ss_pred hhccchhhhHHHhhhcceeEEEEEECCCCCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccceeEEEEE
Q psy7340 298 EKKSSVKSLVKFLAEAHRGVQGFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDFM 377 (440)
Q Consensus 298 ~kKSsv~~l~~~~~q~~~~IsG~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itIt 377 (440)
.|..++.+++.|+.|++.+|+|.|+|.+|+||++|+|.|.|.++.++|..+|.|+..++||.|.|++++-||.+.+.+++
T Consensus 361 ~WE~Nr~sLl~f~eqvH~GIkG~V~D~~G~~I~NA~IsV~ginHdv~T~~~GDYWRLL~PG~y~vta~A~Gy~~~tk~v~ 440 (500)
T KOG2649|consen 361 LWEYNRKSLLNFVEQVHRGIKGLVFDDTGNPIANATISVDGINHDVTTAKEGDYWRLLPPGKYIITASAEGYDPVTKTVT 440 (500)
T ss_pred HHHhhHHHHHHHHHHHHhccceeEEcCCCCccCceEEEEecCcCceeecCCCceEEeeCCcceEEEEecCCCcceeeEEE
Confidence 35668889999999999999999999999999999999999999999999999998899999999999999999999998
Q ss_pred EecCCceEEEEEEeccccc
Q psy7340 378 VVEQHPTLLNVTLHTSKWL 396 (440)
Q Consensus 378 Vt~~~~t~lnI~L~~~~~~ 396 (440)
|.......++|+|.+....
T Consensus 441 V~~~~a~~~df~L~~~~~~ 459 (500)
T KOG2649|consen 441 VPPDRAARVNFTLQRSIPQ 459 (500)
T ss_pred eCCCCccceeEEEecCCCc
Confidence 8775566789999976544
No 18
>cd03869 M14_CPX_like Peptidase M14-like domain of carboxypeptidase (CP)-like protein X (CPX), CPX forms a distinct subgroup of the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Proteins belonging to this subgroup include CP-like protein X1 (CPX1), CP-like protein X2 (CPX2), and aortic CP-like protein (ACLP) and its isoform adipocyte enhancer binding protein-1 (AEBP1). AEBP1 is a truncated form of ACLP, which may arise from alternative splicing of the gene. These proteins are inactive towards standard CP substrates because they lack one or more critical active site and substrate-binding residues that are necessary for activity. They may function as binding proteins rather than as active CPs or display catalytic activity toward other substrates. Pro
Probab=99.14 E-value=1.1e-10 Score=121.80 Aligned_cols=92 Identities=32% Similarity=0.608 Sum_probs=80.9
Q ss_pred hccchhhhHHHhhhcceeEEEEEECCCCCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccceeEEEEEE
Q psy7340 299 KKSSVKSLVKFLAEAHRGVQGFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDFMV 378 (440)
Q Consensus 299 kKSsv~~l~~~~~q~~~~IsG~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itItV 378 (440)
|..++.+++.++.+++.+|+|.|+|.+|+||++|+|.|.|-++...|..+|.|+..++||.|+|++++.||.+.+.+++|
T Consensus 313 W~~N~~all~~~~~vh~GikG~V~d~~g~~i~~a~i~v~g~~~~v~t~~~GdywRll~pG~y~v~~~a~gy~~~~~~~~v 392 (405)
T cd03869 313 WENNKESLLVFMEQVHRGIKGVVRDKTGKGIPNAIISVEGINHDIRTASDGDYWRLLNPGEYRVTAHAEGYTSSTKNCEV 392 (405)
T ss_pred HHHHHHHHHHHHHHHhcCceEEEECCCCCcCCCcEEEEecCccceeeCCCCceEEecCCceEEEEEEecCCCcccEEEEE
Confidence 56678899999999999999999999999999999999999999999999999988999999999999999999888777
Q ss_pred ecC-CceEEEEEE
Q psy7340 379 VEQ-HPTLLNVTL 390 (440)
Q Consensus 379 t~~-~~t~lnI~L 390 (440)
... ..+.++|.|
T Consensus 393 ~~~~~~~~~~f~l 405 (405)
T cd03869 393 GYEMGPTQCNFTL 405 (405)
T ss_pred cCCCCceeeccCC
Confidence 543 344555543
No 19
>cd03858 M14_CP_N-E_like Carboxypeptidase (CP) N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. The N/E subfamily includes eight members, of which five (CPN, CPE, CPM, CPD, CPZ) are considered enzymatically active, while the other three are non-active (CPX1, PCX2, ACLP/AEBP1) and lack the critical active site and substrate-binding residues considered necessary for CP activity. These non-active members may function as binding proteins or display catalytic activity towards other substrates. Unlike the A/B CP subfamily, enzymes belonging to the N/E subfamily are not produced as inactive precursors that require proteolysis to produce the active form; rather, they rely on their substrate specificity and subcellular compartmentalization to prevent inappr
Probab=99.14 E-value=2.1e-10 Score=118.04 Aligned_cols=89 Identities=33% Similarity=0.752 Sum_probs=75.2
Q ss_pred chhhhHHHhhhcceeEEEEEECCCCCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccceeEEEEEEec-
Q psy7340 302 SVKSLVKFLAEAHRGVQGFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDFMVVE- 380 (440)
Q Consensus 302 sv~~l~~~~~q~~~~IsG~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itItVt~- 380 (440)
+...+..++.++..+|+|+|+|.+|+|+++|+|.+.|+..++.||.+|.|.+.+++|.|+|.|+++||.+++.++.+..
T Consensus 285 n~~all~l~~~a~~~i~G~V~d~~g~pl~~A~V~i~~~~~~~~Td~~G~f~~~l~~G~y~l~vs~~Gy~~~~~~v~v~~~ 364 (374)
T cd03858 285 NREALLAYIEQVHRGIKGFVRDANGNPIANATISVEGINHDVTTAEDGDYWRLLLPGTYNVTASAPGYEPQTKSVVVPND 364 (374)
T ss_pred HHHHHHHHHhhcCCceEEEEECCCCCccCCeEEEEecceeeeEECCCceEEEecCCEeEEEEEEEcCcceEEEEEEEecC
Confidence 3445566677777799999999999999999999999999999999999988889999999999999999998877655
Q ss_pred CCceEEEEEE
Q psy7340 381 QHPTLLNVTL 390 (440)
Q Consensus 381 ~~~t~lnI~L 390 (440)
+....++|.|
T Consensus 365 g~~~~~~~~l 374 (374)
T cd03858 365 NSAVVVDFTL 374 (374)
T ss_pred CceEEEeeEC
Confidence 4445566654
No 20
>cd03866 M14_CPM Peptidase M14 Carboxypeptidase (CP) M (CPM) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPM is an extracellular glycoprotein, bound to cell membranes via a glycosyl-phosphatidylinositol on the C-terminus of the protein. It specifically removes C-terminal basic residues such as lysine and arginine from peptides and proteins. The highest levels of CPM have been found in human lung and placenta, but significant amounts are present in kidney, blood vessels, intestine, brain, and peripheral nerves. CPM has also been found in soluble form in various body fluids, including amniotic fluid, seminal plasma and urine. Due to its wide distribution in a variety of tissues, it is believed that it plays an important role in the cont
Probab=99.12 E-value=2.5e-10 Score=118.09 Aligned_cols=82 Identities=30% Similarity=0.592 Sum_probs=73.5
Q ss_pred hccchhhhHHHhhhcceeEEEEEECCCCCcccccEEEEeccce--eeeeCCCccEEEEecCCceEEEEEEeccceeEEEE
Q psy7340 299 KKSSVKSLVKFLAEAHRGVQGFVTDDLGNPVEKASLKIKGRDV--GFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDF 376 (440)
Q Consensus 299 kKSsv~~l~~~~~q~~~~IsG~V~D~~GePL~GAtV~IkGt~~--gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itI 376 (440)
|..++.+++.++.+.+.+|+|+|+|.+|+||+||+|.|.|++. .+.||.+|.|++.++||.|+|.|++.||.++...+
T Consensus 279 w~~n~~~ll~~i~q~~~gI~G~V~D~~g~pi~~A~V~v~g~~~~~~~~T~~~G~y~~~l~pG~Y~v~vsa~Gy~~~~~~v 358 (376)
T cd03866 279 WEDNKAALIEYIKQVHLGVKGQVFDSNGNPIPNAIVEVKGRKHICPYRTNVNGEYFLLLLPGKYMINVTAPGFKTVITNV 358 (376)
T ss_pred HHHhHHHHHHHHHHhcCceEEEEECCCCCccCCeEEEEEcCCceeEEEECCCceEEEecCCeeEEEEEEeCCcceEEEEE
Confidence 4567788999999999999999999999999999999999875 45899999998889999999999999999998888
Q ss_pred EEec
Q psy7340 377 MVVE 380 (440)
Q Consensus 377 tVt~ 380 (440)
.+..
T Consensus 359 ~v~~ 362 (376)
T cd03866 359 IIPY 362 (376)
T ss_pred EeCC
Confidence 7753
No 21
>PF08400 phage_tail_N: Prophage tail fibre N-terminal; InterPro: IPR013609 This entry represents the N terminus of phage 933W tail fibre protein. The characteristics of the protein distribution suggest prophage matches.
Probab=97.85 E-value=0.00011 Score=66.44 Aligned_cols=66 Identities=27% Similarity=0.495 Sum_probs=54.9
Q ss_pred eeEEEEEECCCCCcccccEEEEecc----------ceeeeeCCCccEEEEecCCceEEEEEEeccceeEE-EEEEec
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKGR----------DVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREI-DFMVVE 380 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkGt----------~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~i-tItVt~ 380 (440)
..|+|..+|..|+|++|++|.++-. .-...|+.+|.|.+.+.||.|.+.+..-|+.+... .|+|..
T Consensus 3 V~ISGvL~dg~G~pv~g~~I~L~A~~tS~~Vv~~t~as~~t~~~G~Ys~~~epG~Y~V~l~~~g~~~~~vG~I~V~~ 79 (134)
T PF08400_consen 3 VKISGVLKDGAGKPVPGCTITLKARRTSSTVVVGTVASVVTGEAGEYSFDVEPGVYRVTLKVEGRPPVYVGDITVYE 79 (134)
T ss_pred EEEEEEEeCCCCCcCCCCEEEEEEccCchheEEEEEEEEEcCCCceEEEEecCCeEEEEEEECCCCceeEEEEEEec
Confidence 3699999999999999999999843 23358899999999999999999999999987663 455543
No 22
>PRK15036 hydroxyisourate hydrolase; Provisional
Probab=97.16 E-value=0.00069 Score=61.46 Aligned_cols=56 Identities=14% Similarity=0.246 Sum_probs=46.8
Q ss_pred eEEEEEECC-CCCcccccEEEEeccc-------eeeeeCCCccEEE-----EecCCceEEEEEEeccce
Q psy7340 316 GVQGFVTDD-LGNPVEKASLKIKGRD-------VGFQTTKYGEFWR-----ILLPGIYKLEVYADGYVP 371 (440)
Q Consensus 316 ~IsG~V~D~-~GePL~GAtV~IkGt~-------~gt~TD~~G~F~l-----~lp~G~ytL~VS~IGY~~ 371 (440)
.|+|.|+|. +|.|++|+.|.+.... ..+.||.||.|.. .+++|.|.|.|..-+|-.
T Consensus 28 ~Is~HVLDt~~G~PA~gV~V~L~~~~~~~w~~l~~~~Td~dGR~~~l~~~~~~~~G~Y~L~F~t~~Yf~ 96 (137)
T PRK15036 28 ILSVHILNQQTGKPAADVTVTLEKKADNGWLQLNTAKTDKDGRIKALWPEQTATTGDYRVVFKTGDYFK 96 (137)
T ss_pred CeEEEEEeCCCCcCCCCCEEEEEEccCCceEEEEEEEECCCCCCccccCcccCCCeeEEEEEEcchhhh
Confidence 599999995 9999999999998754 5689999999953 256899999998777765
No 23
>PF08308 PEGA: PEGA domain; InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=96.77 E-value=0.0063 Score=48.02 Aligned_cols=59 Identities=29% Similarity=0.343 Sum_probs=47.6
Q ss_pred ccccEEEEeccceeeeeCCCccEEE-EecCCceEEEEEEeccceeEEEEEEecCCceEEEEEEec
Q psy7340 329 VEKASLKIKGRDVGFQTTKYGEFWR-ILLPGIYKLEVYADGYVPREIDFMVVEQHPTLLNVTLHT 392 (440)
Q Consensus 329 L~GAtV~IkGt~~gt~TD~~G~F~l-~lp~G~ytL~VS~IGY~~q~itItVt~~~~t~lnI~L~~ 392 (440)
.+||.|++.|...| .|. +.+ .+++|.|+|.+..-||.+.+..+.+..+....+++.|++
T Consensus 10 p~gA~V~vdg~~~G-~tp----~~~~~l~~G~~~v~v~~~Gy~~~~~~v~v~~~~~~~v~~~L~~ 69 (71)
T PF08308_consen 10 PSGAEVYVDGKYIG-TTP----LTLKDLPPGEHTVTVEKPGYEPYTKTVTVKPGETTTVNVTLEP 69 (71)
T ss_pred CCCCEEEECCEEec-cCc----ceeeecCCccEEEEEEECCCeeEEEEEEECCCCEEEEEEEEEE
Confidence 36899999987776 222 233 388999999999999999999998887777788888876
No 24
>cd00421 intradiol_dioxygenase Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates. This family contains catechol 1,2-dioxygenases and protocatechuate 3,4-dioxygenases which are mononuclear non-heme iron enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings. The members are intradiol-cleaving enzymes which break the catechol C1-C2 bond and utilize Fe3+, as opposed to the extradiol-cleaving enzymes which break the C2-C3 or C1-C6 bond and utilize Fe2+ and Mn+. Catechol 1,2-dioxygenases are mostly homodimers with one catalytic ferric ion per monomer. Protocatechuate 3,4-dioxygenases form more diverse oligomers.
Probab=96.45 E-value=0.012 Score=53.43 Aligned_cols=47 Identities=30% Similarity=0.346 Sum_probs=38.5
Q ss_pred eeEEEEEECCCCCcccccEEEEeccce--------------------eeeeCCCccEEE-EecCCceE
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKGRDV--------------------GFQTTKYGEFWR-ILLPGIYK 361 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkGt~~--------------------gt~TD~~G~F~l-~lp~G~yt 361 (440)
..|+|+|+|.+|.|+++|.|.+-..+. ...||.+|.|.+ .++||.|.
T Consensus 12 l~l~G~V~D~~g~pv~~A~VeiW~~d~~G~Y~~~~~~~~~~~~~~rg~~~Td~~G~y~f~ti~Pg~Y~ 79 (146)
T cd00421 12 LTLTGTVLDGDGCPVPDALVEIWQADADGRYSGQDDSGLDPEFFLRGRQITDADGRYRFRTIKPGPYP 79 (146)
T ss_pred EEEEEEEECCCCCCCCCcEEEEEecCCCCccCCcCccccCCCCCCEEEEEECCCcCEEEEEEcCCCCC
Confidence 469999999999999999998853221 468999999964 58899887
No 25
>PF07210 DUF1416: Protein of unknown function (DUF1416); InterPro: IPR010814 This family consists of several hypothetical bacterial proteins of around 100 residues in length. Members of this family appear to be Actinomycete specific. The function of this family is unknown.
Probab=96.36 E-value=0.031 Score=46.86 Aligned_cols=51 Identities=24% Similarity=0.373 Sum_probs=43.5
Q ss_pred eEEEEEECCCCCcccccEEEEeccc----eeeeeCCCccEEEEecCCceEEEEEEe
Q psy7340 316 GVQGFVTDDLGNPVEKASLKIKGRD----VGFQTTKYGEFWRILLPGIYKLEVYAD 367 (440)
Q Consensus 316 ~IsG~V~D~~GePL~GAtV~IkGt~----~gt~TD~~G~F~l~lp~G~ytL~VS~I 367 (440)
.|+|+|+ .+|+|+++|.|.+.+.. .-+.|+.+|.|.+-..||++++.+-+-
T Consensus 9 VItG~V~-~~G~Pv~gAyVRLLD~sgEFtaEvvts~~G~FRFfaapG~WtvRal~~ 63 (85)
T PF07210_consen 9 VITGRVT-RDGEPVGGAYVRLLDSSGEFTAEVVTSATGDFRFFAAPGSWTVRALSR 63 (85)
T ss_pred EEEEEEe-cCCcCCCCeEEEEEcCCCCeEEEEEecCCccEEEEeCCCceEEEEEcc
Confidence 5999999 89999999999998643 447999999998888999999887543
No 26
>PF02369 Big_1: Bacterial Ig-like domain (group 1); InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=95.75 E-value=0.053 Score=46.17 Aligned_cols=59 Identities=24% Similarity=0.256 Sum_probs=40.7
Q ss_pred eeEEEEEECCCCCcccccEEEE--ecc--c---e--eeeeCCCccEEEEe---cCCceEEEEEEeccceeE
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKI--KGR--D---V--GFQTTKYGEFWRIL---LPGIYKLEVYADGYVPRE 373 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~I--kGt--~---~--gt~TD~~G~F~l~l---p~G~ytL~VS~IGY~~q~ 373 (440)
..++-+|+|.+|+||+|..|.+ ... . . ...||.+|.+...+ .+|.|++.++.-|.....
T Consensus 25 ~tltatV~D~~gnpv~g~~V~f~~~~~~~~l~~~~~~~~Td~~G~a~~tltst~aG~~~VtA~~~~~~~~~ 95 (100)
T PF02369_consen 25 NTLTATVTDANGNPVPGQPVTFSSSSSGGTLSPTNTSATTDSNGIATVTLTSTKAGTYTVTATVDGGSTSV 95 (100)
T ss_dssp EEEEEEEEETTSEB-TS-EEEE--EESSSEES-CEE-EEE-TTSEEEEEEE-SS-EEEEEEEEETTEEEEE
T ss_pred EEEEEEEEcCCCCCCCCCEEEEEEcCCCcEEecCccccEECCCEEEEEEEEecCceEEEEEEEECCcceeE
Confidence 4689999999999999999999 221 1 1 36899999996543 568888888887655443
No 27
>KOG1948|consensus
Probab=95.16 E-value=0.053 Score=61.56 Aligned_cols=58 Identities=19% Similarity=0.195 Sum_probs=46.6
Q ss_pred eeEEEEEECC-CCCcccccEEEEeccceeeeeCCCccEEE-E-ecCCceEEEEEE--eccceeE
Q psy7340 315 RGVQGFVTDD-LGNPVEKASLKIKGRDVGFQTTKYGEFWR-I-LLPGIYKLEVYA--DGYVPRE 373 (440)
Q Consensus 315 ~~IsG~V~D~-~GePL~GAtV~IkGt~~gt~TD~~G~F~l-~-lp~G~ytL~VS~--IGY~~q~ 373 (440)
.+++|+|++. +|.|+++|.|.+.| +.+..||.+|.|.+ + +..|+|+|.+.. +-|.+..
T Consensus 316 fSvtGRVl~g~~g~~l~gvvvlvng-k~~~kTdaqGyykLen~~t~gtytI~a~kehlqFstv~ 378 (1165)
T KOG1948|consen 316 FSVTGRVLVGSKGLPLSGVVVLVNG-KSGGKTDAQGYYKLENLKTDGTYTITAKKEHLQFSTVH 378 (1165)
T ss_pred EEeeeeEEeCCCCCCccceEEEEcC-cccceEcccceEEeeeeeccCcEEEEEeccceeeeeEE
Confidence 4699999995 99999999999976 55789999999977 4 678999998854 4444443
No 28
>PF14686 fn3_3: Polysaccharide lyase family 4, domain II; PDB: 1NKG_A 2XHN_B 3NJX_A 3NJV_A.
Probab=95.12 E-value=0.1 Score=44.52 Aligned_cols=53 Identities=34% Similarity=0.451 Sum_probs=28.7
Q ss_pred eEEEEEECCCC--Cccc--ccEEEEe-------c--cceeeeeCCCccEEE-EecCCceEEEEEEec
Q psy7340 316 GVQGFVTDDLG--NPVE--KASLKIK-------G--RDVGFQTTKYGEFWR-ILLPGIYKLEVYADG 368 (440)
Q Consensus 316 ~IsG~V~D~~G--ePL~--GAtV~Ik-------G--t~~gt~TD~~G~F~l-~lp~G~ytL~VS~IG 368 (440)
+|+|+|...+| .+.. .+.|-+. + ...-+.||.+|+|.+ ++.||+|.|.+-..|
T Consensus 4 ~VsG~l~l~dg~~~~~~~~~~~Vgl~~~~d~~q~~~yqYwt~td~~G~Fti~~V~pGtY~L~ay~~g 70 (95)
T PF14686_consen 4 SVSGRLTLSDGVTNPPAGANAVVGLAPPGDFQQNKGYQYWTRTDSDGNFTIPNVRPGTYRLYAYADG 70 (95)
T ss_dssp EEEEEEE---SS--TT--S-EEEEEE--------SS-EEEEE--TTSEEE---B-SEEEEEEEEE--
T ss_pred EEEEEEEEccCcccCccceeEEEEeeeccccccCCCCcEEEEeCCCCcEEeCCeeCcEeEEEEEEec
Confidence 58999987666 3333 3444443 1 123478999999976 589999999987644
No 29
>cd03458 Catechol_intradiol_dioxygenases Catechol intradiol dioxygenases can be divided into several subgroups according to their substrate specificity for catechol, chlorocatechols and hydroxyquinols. Almost all members of this family are homodimers containing one ferric ion (Fe3+) per monomer. They belong to the intradiol dioxygenase family, a family of mononuclear non-heme iron intradiol-cleaving enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings.
Probab=94.81 E-value=0.12 Score=51.74 Aligned_cols=46 Identities=30% Similarity=0.415 Sum_probs=35.5
Q ss_pred eeEEEEEECCCCCcccccEEEEe-----cc-----------c--eeeeeCCCccEEE-EecCCce
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIK-----GR-----------D--VGFQTTKYGEFWR-ILLPGIY 360 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~Ik-----Gt-----------~--~gt~TD~~G~F~l-~lp~G~y 360 (440)
..++|+|+|.+|+||+||.|-|= |. + -...||.+|.|.+ .+.|+.|
T Consensus 105 l~l~G~V~D~~G~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~lRG~~~Td~~G~y~f~Ti~P~~Y 169 (256)
T cd03458 105 LFVHGTVTDTDGKPLAGATVDVWHADPDGFYSQQDPDQPEFNLRGKFRTDEDGRYRFRTIRPVPY 169 (256)
T ss_pred EEEEEEEEcCCCCCCCCcEEEEEccCCCCCcCCCCCCCCCCCCEEEEEeCCCCCEEEEEECCCCc
Confidence 46999999999999999999883 10 0 1358999999954 5777766
No 30
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=94.79 E-value=0.17 Score=42.08 Aligned_cols=59 Identities=20% Similarity=0.177 Sum_probs=42.7
Q ss_pred eeEEEEEECCCCCcccccEEEEeccc--------eeeeeCCCccEEEEe---cCCceEEEEEEeccceeE
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKGRD--------VGFQTTKYGEFWRIL---LPGIYKLEVYADGYVPRE 373 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkGt~--------~gt~TD~~G~F~l~l---p~G~ytL~VS~IGY~~q~ 373 (440)
..|+-+|+|.+|+|++++.|.+.-.. ....||.+|.....+ .+|.+++.++.-|+.+..
T Consensus 20 ~~i~v~v~D~~Gnpv~~~~V~f~~~~~~~~~~~~~~~~Td~~G~a~~~l~~~~~G~~~vta~~~~~~~~~ 89 (92)
T smart00634 20 ITLTATVTDANGNPVAGQEVTFTTPSGGALTLSKGTATTDANGIATVTLTSTTAGVYTVTASLENGSSSV 89 (92)
T ss_pred EEEEEEEECCCCCCcCCCEEEEEECCCceeeccCCeeeeCCCCEEEEEEECCCCcEEEEEEEECCCccee
Confidence 46889999999999999888776221 224788899885433 457788877777766543
No 31
>cd03462 1,2-CCD chlorocatechol 1,2-dioxygenases (1,2-CCDs) (type II enzymes) are homodimeric intradiol dioxygenases that degrade chlorocatechols via the addition of molecular oxygen and the subsequent cleavage between two adjacent hydroxyl groups. This reaction is part of the modified ortho-cleavage pathway which is a central oxidative bacterial pathway that channels chlorocatechols, derived from the degradation of chlorinated benzoic acids, phenoxyacetic acids, phenols, benzenes, and other aromatics into the energy-generating tricarboxylic acid pathway.
Probab=94.49 E-value=0.16 Score=50.52 Aligned_cols=46 Identities=24% Similarity=0.268 Sum_probs=35.3
Q ss_pred eeEEEEEECCCCCcccccEEEEeccc-----------------e-eeeeCCCccEEE-EecCCce
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKGRD-----------------V-GFQTTKYGEFWR-ILLPGIY 360 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkGt~-----------------~-gt~TD~~G~F~l-~lp~G~y 360 (440)
..++|+|+|.+|+||+||.|-|=-.+ . ...||.+|.|.+ .+.|+.|
T Consensus 100 l~l~G~V~D~~G~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~~RG~~~Td~~G~y~F~Ti~P~~Y 164 (247)
T cd03462 100 LLFRGTVKDLAGAPVAGAVIDVWHSTPDGKYSGFHPNIPEDYYRGKIRTDEDGRYEVRTTVPVPY 164 (247)
T ss_pred EEEEEEEEcCCCCCcCCcEEEEECCCCCCCcCCCCCCCCCCCCEEEEEeCCCCCEEEEEECCCCc
Confidence 46999999999999999999883110 1 358899999954 5677655
No 32
>TIGR02465 chlorocat_1_2 chlorocatechol 1,2-dioxygenase. Members of this protein family are chlorocatechol 1,2-dioxygenase. This protein is closely related to catechol 1,2-dioxygenase, TIGR02439, EC 1.13.11.1. Note that annotated database entries have appeared for the present protein family with the EC number that refers to that of family TIGR02439. This protein acts in pathways of the biodegradation of chlorinated aromatic compounds.
Probab=94.45 E-value=0.16 Score=50.40 Aligned_cols=46 Identities=24% Similarity=0.242 Sum_probs=35.3
Q ss_pred eeEEEEEECCCCCcccccEEEEecc----------------c--eeeeeCCCccEEE-EecCCce
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKGR----------------D--VGFQTTKYGEFWR-ILLPGIY 360 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkGt----------------~--~gt~TD~~G~F~l-~lp~G~y 360 (440)
..|+|+|+|.+|+||+||.|-|=-. + -...||.+|.|.+ .+.|+.|
T Consensus 99 l~v~G~V~D~~G~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~lRG~~~Td~~G~y~F~Ti~P~~Y 163 (246)
T TIGR02465 99 LLIRGTVRDLSGTPVAGAVIDVWHSTPDGKYSGFHDNIPDDYYRGKLVTAADGSYEVRTTMPVPY 163 (246)
T ss_pred EEEEEEEEcCCCCCcCCcEEEEECCCCCCCCCCCCCCCCCCCCeEEEEECCCCCEEEEEECCCCC
Confidence 4699999999999999999988311 0 1358899999954 5667655
No 33
>PF00775 Dioxygenase_C: Dioxygenase; InterPro: IPR000627 This entry represents the C-terminal domain common to several intradiol ring-cleavage dioxygenases. Dioxygenases catalyse the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms. Cleavage of aromatic rings is one of the most important functions of dioxygenases, which play key roles in the degradation of aromatic compounds. The substrates of ring-cleavage dioxygenases can be classified into two groups according to the mode of scission of the aromatic ring. Intradiol enzymes use a non-haem Fe(III) to cleave the aromatic ring between two hydroxyl groups (ortho-cleavage), whereas extradiol enzymes (IPR000486 from INTERPRO) use a non-haem Fe(II) to cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon (meta-cleavage) []. These two subfamilies differ in sequence, structural fold, iron ligands, and the orientation of second sphere active site amino acid residues. Enzymes that belong to the intradiol family include catechol 1,2-dioxygenase (1,2-CTD) (1.13.11.1 from EC); protocatechuate 3,4-dioxygenase (3,4-PCD) (1.13.11.3 from EC); and chlorocatechol 1,2-dioxygenase (1.13.11.1 from EC) [].; GO: 0003824 catalytic activity, 0008199 ferric iron binding, 0006725 cellular aromatic compound metabolic process, 0055114 oxidation-reduction process; PDB: 2BUV_A 2BUX_A 2BUU_A 2BUR_A 1EO9_A 2BUZ_A 2BV0_A 1EO2_A 1EOC_A 1EOA_A ....
Probab=94.44 E-value=0.18 Score=47.89 Aligned_cols=47 Identities=32% Similarity=0.432 Sum_probs=32.4
Q ss_pred eeEEEEEECCCCCcccccEEEEe-----c----c-----------ceeeeeCCCccEEE-EecCCceE
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIK-----G----R-----------DVGFQTTKYGEFWR-ILLPGIYK 361 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~Ik-----G----t-----------~~gt~TD~~G~F~l-~lp~G~yt 361 (440)
..|.|+|+|.+|+||+||.|.|= | . .-...||.+|.|.+ .+.||.|.
T Consensus 30 l~l~G~V~D~~g~Pv~~A~veiWqada~G~Ys~~~~~~~~~~~~~rG~~~Td~~G~y~f~Ti~Pg~Y~ 97 (183)
T PF00775_consen 30 LVLHGRVIDTDGKPVPGALVEIWQADADGRYSGQDPGSDQPDFNLRGRFRTDADGRYSFRTIKPGPYP 97 (183)
T ss_dssp EEEEEEEEETTSSB-TTEEEEEEE--TTS--TTTBTTSSSSTTTTEEEEEECTTSEEEEEEE----EE
T ss_pred EEEEEEEECCCCCCCCCcEEEEEecCCCCccccccccccccCCCcceEEecCCCCEEEEEeeCCCCCC
Confidence 46999999999999999999982 1 0 12358999999954 57777765
No 34
>TIGR02439 catechol_proteo catechol 1,2-dioxygenase, proteobacterial. Members of this family known so far are catechol 1,2-dioxygenases of the Proteobacteria. They are distinct from catechol 1,2-dioxygenases and chlorocatechol 1,2-dioxygenases of the Actinobacteria, which are quite similar to each other and resolved by separate models. This enzyme catalyzes intradiol cleavage in which catechol + O2 becomes cis,cis-muconate. Catechol is an intermediate in the catabolism of many different aromatic compounds, as is the alternative intermediate protocatechuate. In Acinetobacter lwoffii, two isozymes are present with abilities, differing somewhat, to act on catechol analogs 3-methylcatechol, 4-methylcatechol, 4-methoxycatechol, and 4-chlorocatechol.
Probab=94.32 E-value=0.17 Score=51.24 Aligned_cols=46 Identities=26% Similarity=0.364 Sum_probs=35.1
Q ss_pred eeEEEEEECCCCCcccccEEEEe-----cc------------c-eeeeeCCCccEEE-EecCCce
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIK-----GR------------D-VGFQTTKYGEFWR-ILLPGIY 360 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~Ik-----Gt------------~-~gt~TD~~G~F~l-~lp~G~y 360 (440)
..|+|+|+|.+|+||+||.|-|= |. - -...||.+|.|.+ .+.|+.|
T Consensus 129 l~v~G~V~D~~G~PI~gA~VeIWqad~~G~Ys~~~~~~~~~~lRG~~~TD~~G~y~F~TI~P~~Y 193 (285)
T TIGR02439 129 LFLHGQVTDADGKPIAGAKVELWHANTKGNYSHFDKSQSEFNLRRTIITDAEGRYRARSIVPSGY 193 (285)
T ss_pred EEEEEEEECCCCCCcCCcEEEEEccCCCCCcCCCCCCCCCCCceEEEEECCCCCEEEEEECCCCC
Confidence 46999999999999999999882 10 0 1358999999954 5666655
No 35
>cd03463 3,4-PCD_alpha Protocatechuate 3,4-dioxygenase (3,4-PCD) , alpha subunit. 3,4-PCD catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-subunit-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=94.25 E-value=0.19 Score=47.90 Aligned_cols=47 Identities=30% Similarity=0.317 Sum_probs=36.2
Q ss_pred eeEEEEEECCCCCcccccEEEEec------------------cce----eeeeCCCccEEE-EecCCceE
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKG------------------RDV----GFQTTKYGEFWR-ILLPGIYK 361 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkG------------------t~~----gt~TD~~G~F~l-~lp~G~yt 361 (440)
..|+|+|+|.+|+||+||.|.|=- .+. ...||.+|.|.+ .+.||.|.
T Consensus 37 l~l~G~V~D~~g~Pi~gA~VeiWqad~~G~Y~~~~~~~~~~~~~f~~rGr~~TD~~G~y~F~Ti~Pg~Y~ 106 (185)
T cd03463 37 ITLEGRVYDGDGAPVPDAMLEIWQADAAGRYAHPADSRRRLDPGFRGFGRVATDADGRFSFTTVKPGAVP 106 (185)
T ss_pred EEEEEEEECCCCCCCCCCEEEEEcCCCCCccCCcCCcccccCCCCCcEEEEEECCCCCEEEEEEcCCCcC
Confidence 469999999999999999998821 111 257999999954 57777764
No 36
>cd03459 3,4-PCD Protocatechuate 3,4-dioxygenase (3,4-PCD) catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=94.12 E-value=0.23 Score=46.14 Aligned_cols=47 Identities=30% Similarity=0.408 Sum_probs=36.6
Q ss_pred eeEEEEEECCCCCcccccEEEEe-----c----c----------c---e-eeeeCCCccEEE-EecCCceE
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIK-----G----R----------D---V-GFQTTKYGEFWR-ILLPGIYK 361 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~Ik-----G----t----------~---~-gt~TD~~G~F~l-~lp~G~yt 361 (440)
..|+|+|+|.+|.||++|.|.|= | . + . ...||.+|.|.+ .+.||.|.
T Consensus 16 l~l~g~V~D~~g~Pv~~A~veiWqad~~G~Y~~~~~~~~~~~~~~f~~rG~~~Td~~G~~~f~Ti~Pg~Y~ 86 (158)
T cd03459 16 IILEGRVLDGDGRPVPDALVEIWQADAAGRYRHPRDSHRAPLDPNFTGFGRVLTDADGRYRFRTIKPGAYP 86 (158)
T ss_pred EEEEEEEECCCCCCCCCCEEEEEccCCCCccCCccCCcccccCCCCCceeEEEECCCCcEEEEEECCCCcC
Confidence 46999999999999999999882 1 0 0 1 258999999954 57888776
No 37
>cd03460 1,2-CTD Catechol 1,2 dioxygenase (1,2-CTD) catalyzes an intradiol cleavage reaction of catechol to form cis,cis-muconate. 1,2-CTDs is homodimers with one catalytic non-heme ferric ion per monomer. They belong to the aromatic dioxygenase family, a family of mononuclear non-heme iron intradiol-cleaving enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings.
Probab=93.96 E-value=0.22 Score=50.44 Aligned_cols=46 Identities=28% Similarity=0.379 Sum_probs=35.3
Q ss_pred eeEEEEEECCCCCcccccEEEEe-----cc---------c----eeeeeCCCccEEE-EecCCce
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIK-----GR---------D----VGFQTTKYGEFWR-ILLPGIY 360 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~Ik-----Gt---------~----~gt~TD~~G~F~l-~lp~G~y 360 (440)
..++|+|+|.+|+||+||.|-|= |. . -...||.+|.|.+ .+.|+.|
T Consensus 125 l~l~G~V~D~~G~PI~~A~VeiWqad~~G~Ys~~~~~~~~f~~RGr~~TD~~G~y~F~TI~P~~Y 189 (282)
T cd03460 125 LVMHGTVTDTDGKPVPGAKVEVWHANSKGFYSHFDPTQSPFNLRRSIITDADGRYRFRSIMPSGY 189 (282)
T ss_pred EEEEEEEECCCCCCcCCcEEEEECCCCCCCcCCCCCCCCCCCCceEEEeCCCCCEEEEEECCCCC
Confidence 46999999999999999999882 11 0 1358999999954 5667665
No 38
>COG3485 PcaH Protocatechuate 3,4-dioxygenase beta subunit [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=93.56 E-value=0.14 Score=50.26 Aligned_cols=47 Identities=32% Similarity=0.391 Sum_probs=35.4
Q ss_pred ceeEEEEEECCCCCcccccEEEEe-----c-------cc----------eeeeeCCCccEEE-EecCCce
Q psy7340 314 HRGVQGFVTDDLGNPVEKASLKIK-----G-------RD----------VGFQTTKYGEFWR-ILLPGIY 360 (440)
Q Consensus 314 ~~~IsG~V~D~~GePL~GAtV~Ik-----G-------t~----------~gt~TD~~G~F~l-~lp~G~y 360 (440)
...|+|+|+|.+|.|+++|-|.|= | +. -.+.||.+|.|.+ .+.||.|
T Consensus 72 ~i~l~G~VlD~~G~Pv~~A~VEiWQAda~GrY~~~~d~~~~~~~~f~g~Gr~~Td~~G~y~F~Ti~Pg~y 141 (226)
T COG3485 72 RILLEGRVLDGNGRPVPDALVEIWQADADGRYSHPKDSRLAPLPNFNGRGRTITDEDGEYRFRTIKPGPY 141 (226)
T ss_pred eEEEEEEEECCCCCCCCCCEEEEEEcCCCCcccCccccccCcCccccceEEEEeCCCceEEEEEeecccc
Confidence 457999999999999999999881 1 11 1268999999954 4666644
No 39
>TIGR02438 catachol_actin catechol 1,2-dioxygenase, Actinobacterial. Members of this family are catechol 1,2-dioxygenases of the Actinobacteria. They are more closely related to actinobacterial chlorocatechol 1,2-dioxygenases than to proteobacterial catechol 1,2-dioxygenases, and so are built in this separate model. The member from Rhodococcus rhodochrous NCIMB 13259 (GB|AAC33003.1) is described as a homodimer with bound Fe, similarly active on catechol, 3-methylcatechol and 4-methylcatechol.
Probab=93.55 E-value=0.25 Score=49.97 Aligned_cols=47 Identities=26% Similarity=0.239 Sum_probs=35.6
Q ss_pred eeEEEEEECCCCCcccccEEEEe-----ccc------------e-eeeeCCCccEEE-EecCCceE
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIK-----GRD------------V-GFQTTKYGEFWR-ILLPGIYK 361 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~Ik-----Gt~------------~-gt~TD~~G~F~l-~lp~G~yt 361 (440)
..|+|+|+|.+|+||+||.|-|= |.. . ...||.+|.|.+ .+.|+.|-
T Consensus 133 l~v~G~V~D~~G~Pv~gA~VdiWqada~G~Ys~~~~~~~~~~lRGr~~TDadG~y~F~TI~Pg~Yp 198 (281)
T TIGR02438 133 LVFSGQVTDLDGNGLAGAKVELWHADDDGFYSQFAPGIPEWNLRGTIIADDEGRFEITTMQPAPYQ 198 (281)
T ss_pred EEEEEEEEcCCCCCcCCCEEEEEecCCCCCcCCCCCCCCCCCCeEEEEeCCCCCEEEEEECCCCcC
Confidence 46999999999999999999982 210 1 358899999954 56676553
No 40
>TIGR02423 protocat_alph protocatechuate 3,4-dioxygenase, alpha subunit. This model represents the alpha chain of protocatechuate 3,4-dioxygenase. The most closely related family outside this family is that of the beta chain (TIGR02422), typically encoded in an adjacent locus. This enzyme acts in the degradation of aromatic compounds by way of p-hydroxybenzoate to succinate and acetyl-CoA.
Probab=93.38 E-value=0.44 Score=45.69 Aligned_cols=47 Identities=30% Similarity=0.308 Sum_probs=36.1
Q ss_pred eeEEEEEECCCCCcccccEEEEec------------c-------c---e-eeeeCCCccEEE-EecCCceE
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKG------------R-------D---V-GFQTTKYGEFWR-ILLPGIYK 361 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkG------------t-------~---~-gt~TD~~G~F~l-~lp~G~yt 361 (440)
..|+|+|+|.+|+||+||.|-|=- . + . ...||.+|.|.+ .+.||.|.
T Consensus 40 l~l~G~V~D~~g~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~~~~f~grGr~~Td~~G~y~f~TI~Pg~Yp 110 (193)
T TIGR02423 40 IRLEGRVLDGDGHPVPDALIEIWQADAAGRYNSPADLRAPATDPGFRGWGRTGTDESGEFTFETVKPGAVP 110 (193)
T ss_pred EEEEEEEECCCCCCCCCCEEEEEccCCCCccCCccCCcccccCCCCCCeEEEEECCCCCEEEEEEcCCCcC
Confidence 469999999999999999998821 0 0 1 258899999954 57787764
No 41
>cd03464 3,4-PCD_beta Protocatechuate 3,4-dioxygenase (3,4-PCD) , beta subunit. 3,4-PCD catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-subunit-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=93.03 E-value=0.38 Score=47.14 Aligned_cols=46 Identities=28% Similarity=0.337 Sum_probs=35.9
Q ss_pred eeEEEEEECCCCCcccccEEEEecc---------------------c-e-eeeeCCCccEEE-EecCCce
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKGR---------------------D-V-GFQTTKYGEFWR-ILLPGIY 360 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkGt---------------------~-~-gt~TD~~G~F~l-~lp~G~y 360 (440)
..|+|+|+|.+|+||++|.|.|=-. + . ...||.+|.|.+ .+.||.|
T Consensus 66 i~l~G~V~D~~G~PV~~A~VEIWQad~~G~Y~~~~d~~~~~~~~~f~grGr~~TD~~G~y~F~TI~Pg~Y 135 (220)
T cd03464 66 IIVHGRVLDEDGRPVPNTLVEIWQANAAGRYRHKRDQHDAPLDPNFGGAGRTLTDDDGYYRFRTIKPGAY 135 (220)
T ss_pred EEEEEEEECCCCCCCCCCEEEEEecCCCCcccCccCCcccccCCCCCCEEEEEECCCccEEEEEECCCCc
Confidence 5689999999999999999988310 0 1 258899999954 5778877
No 42
>TIGR02422 protocat_beta protocatechuate 3,4-dioxygenase, beta subunit. This model represents the beta chain of protocatechuate 3,4-dioxygenase. The most closely related family outside this family is that of the alpha chain (TIGR02423), typically encoded in an adjacent locus. This enzyme acts in the degradation of aromatic compounds by way of p-hydroxybenzoate to succinate and acetyl-CoA.
Probab=92.99 E-value=0.51 Score=46.22 Aligned_cols=46 Identities=28% Similarity=0.338 Sum_probs=36.0
Q ss_pred eeEEEEEECCCCCcccccEEEEecc----------------------ce-eeeeCCCccEEE-EecCCce
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKGR----------------------DV-GFQTTKYGEFWR-ILLPGIY 360 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkGt----------------------~~-gt~TD~~G~F~l-~lp~G~y 360 (440)
..|+|+|+|.+|+||++|.|.|=-. .. ...||.+|.|.+ .+.||.|
T Consensus 61 i~l~G~V~D~~g~PV~~A~VEIWQada~G~Y~~~~d~~~~~~~~~f~grGr~~TD~~G~y~F~TI~PG~Y 130 (220)
T TIGR02422 61 IIVHGRVLDEDGRPVPNTLVEVWQANAAGRYRHKNDQYLAPLDPNFGGVGRTLTDSDGYYRFRTIKPGPY 130 (220)
T ss_pred EEEEEEEECCCCCCCCCCEEEEEecCCCCcccCccCccccccCCCCCCEEEEEECCCccEEEEEECCCCc
Confidence 5699999999999999999988210 01 257899999954 5788877
No 43
>cd03461 1,2-HQD Hydroxyquinol 1,2-dioxygenase (1,2-HQD) catalyzes the ring cleavage of hydroxyquinol (1,2,4-trihydroxybenzene), a intermediate in the degradation of a large variety of aromatic compounds including some polychloro- and nitroaromatic pollutants, to form 3-hydroxy-cis,cis-muconates. 1,2-HQD blongs to the aromatic dioxygenase family, a family of mononuclear non-heme intradiol-cleaving enzymes.
Probab=92.92 E-value=0.33 Score=49.09 Aligned_cols=47 Identities=30% Similarity=0.348 Sum_probs=35.7
Q ss_pred eeEEEEEECCCCCcccccEEEEecc------------------ceeeeeCCCccEEE-EecCCceE
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKGR------------------DVGFQTTKYGEFWR-ILLPGIYK 361 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkGt------------------~~gt~TD~~G~F~l-~lp~G~yt 361 (440)
..++|+|+|.+|.||+||.|-|=-. .-...||.+|.|.+ .+.||.|-
T Consensus 121 l~v~G~V~D~~G~Pv~gA~VeiWqad~~G~Y~~~~~~~~~~~lRGr~~Td~~G~y~F~Ti~Pg~Yp 186 (277)
T cd03461 121 CFVHGRVTDTDGKPLPGATVDVWQADPNGLYDVQDPDQPEFNLRGKFRTDEDGRYAFRTLRPTPYP 186 (277)
T ss_pred EEEEEEEEcCCCCCcCCcEEEEECcCCCCCcCCCCCCCCCCCCeEEEEeCCCCCEEEEEECCCCcC
Confidence 4699999999999999999988310 01358999999954 56676653
No 44
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=92.61 E-value=0.43 Score=37.12 Aligned_cols=46 Identities=26% Similarity=0.456 Sum_probs=33.5
Q ss_pred ccccEEEEeccc----e--eeeeCCCccEEE-EecCCceEEEEEE--eccceeEE
Q psy7340 329 VEKASLKIKGRD----V--GFQTTKYGEFWR-ILLPGIYKLEVYA--DGYVPREI 374 (440)
Q Consensus 329 L~GAtV~IkGt~----~--gt~TD~~G~F~l-~lp~G~ytL~VS~--IGY~~q~i 374 (440)
|+||.+.+.... . ...||.+|.|.+ ++++|.|.|.... -||.....
T Consensus 1 L~Ga~f~L~~~~~~~~~~~~~~Td~~G~~~f~~L~~G~Y~l~E~~aP~GY~~~~~ 55 (70)
T PF05738_consen 1 LAGATFELYDEDGNEVIEVTVTTDENGKYTFKNLPPGTYTLKETKAPDGYQLDDT 55 (70)
T ss_dssp -STEEEEEEETTSEEEEEEEEEGGTTSEEEEEEEESEEEEEEEEETTTTEEEEEC
T ss_pred CCCeEEEEEECCCCEEEEEEEEECCCCEEEEeecCCeEEEEEEEECCCCCEECCC
Confidence 567777776432 1 268999999965 6999999999876 78876443
No 45
>PF10670 DUF4198: Domain of unknown function (DUF4198)
Probab=91.65 E-value=0.44 Score=44.47 Aligned_cols=51 Identities=18% Similarity=0.152 Sum_probs=41.0
Q ss_pred eEEEEEECCCCCcccccEEEEecc---------ceeeeeCCCccEEEEe-cCCceEEEEEEe
Q psy7340 316 GVQGFVTDDLGNPVEKASLKIKGR---------DVGFQTTKYGEFWRIL-LPGIYKLEVYAD 367 (440)
Q Consensus 316 ~IsG~V~D~~GePL~GAtV~IkGt---------~~gt~TD~~G~F~l~l-p~G~ytL~VS~I 367 (440)
.++.+|+ -+|+|++|+.|.+... .....||.+|.+.+.+ .+|.|.|.+++.
T Consensus 152 ~~~~~vl-~~GkPl~~a~V~~~~~~~~~~~~~~~~~~~TD~~G~~~~~~~~~G~wli~a~~~ 212 (215)
T PF10670_consen 152 PLPFQVL-FDGKPLAGAEVEAFSPGGWYDVEHEAKTLKTDANGRATFTLPRPGLWLIRASHK 212 (215)
T ss_pred EEEEEEE-ECCeEcccEEEEEEECCCccccccceEEEEECCCCEEEEecCCCEEEEEEEEEe
Confidence 4777887 5899999999998643 3458999999998775 579999988764
No 46
>PF09430 DUF2012: Protein of unknown function (DUF2012); InterPro: IPR019008 This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development [].
Probab=91.19 E-value=0.55 Score=41.50 Aligned_cols=51 Identities=24% Similarity=0.257 Sum_probs=38.0
Q ss_pred ccccEEEEeccce---eeeeCCCccEEE-EecCCceEEEEEEeccceeEEEEEEe
Q psy7340 329 VEKASLKIKGRDV---GFQTTKYGEFWR-ILLPGIYKLEVYADGYVPREIDFMVV 379 (440)
Q Consensus 329 L~GAtV~IkGt~~---gt~TD~~G~F~l-~lp~G~ytL~VS~IGY~~q~itItVt 379 (440)
.+...|.+.+... .+....||.|.+ ++++|.|.|.|.+..|.=....|+|.
T Consensus 7 ~~~t~V~L~~g~~~~~~~~v~~dG~F~f~~Vp~GsY~L~V~s~~~~F~~~RVdV~ 61 (123)
T PF09430_consen 7 PSSTRVTLNGGQYRPISAFVRSDGSFVFHNVPPGSYLLEVHSPDYVFPPYRVDVS 61 (123)
T ss_pred CCCEEEEEeCCCccceEEEecCCCEEEeCCCCCceEEEEEECCCccccCEEEEEe
Confidence 4567777876555 789999999966 69999999999986655444444454
No 47
>PF12866 DUF3823: Protein of unknown function (DUF3823); InterPro: IPR024278 This is a family of uncharacterised proteins from Bacteroidetes. These proteins have characteristic DN and DR sequence-motifs but their function is not known.; PDB: 3HN5_B 4EIU_A.
Probab=90.58 E-value=1.4 Score=43.14 Aligned_cols=76 Identities=16% Similarity=0.212 Sum_probs=49.4
Q ss_pred eeEEEEEEC-CCCCccc----ccEEEEec------cceeeeeCCCccE-EEEecCCceEEEE-EEec---cceeEEEEEE
Q psy7340 315 RGVQGFVTD-DLGNPVE----KASLKIKG------RDVGFQTTKYGEF-WRILLPGIYKLEV-YADG---YVPREIDFMV 378 (440)
Q Consensus 315 ~~IsG~V~D-~~GePL~----GAtV~IkG------t~~gt~TD~~G~F-~l~lp~G~ytL~V-S~IG---Y~~q~itItV 378 (440)
.+++|+|.| .+|+||+ |+.|.+.. .........||.| ...+-.|+|.|.+ .--| +...++.+.|
T Consensus 22 s~l~G~iiD~~tgE~i~~~~~gv~i~l~e~gy~~~~~~~~~v~qDGtf~n~~lF~G~Yki~~~~G~fp~~~~~dti~v~i 101 (222)
T PF12866_consen 22 STLTGRIIDVYTGEPIQTDIGGVRIQLYELGYGDNTPQDVYVKQDGTFRNTKLFDGDYKIVPKNGNFPWVVPVDTIEVDI 101 (222)
T ss_dssp EEEEEEEEECCTTEE----STSSEEEEECS-CCG--SEEEEB-TTSEEEEEEE-SEEEEEEE-CTSCSBSCCE--EEEEE
T ss_pred ceEEEEEEEeecCCeeeecCCceEEEEEecccccCCCcceEEccCCceeeeeEeccceEEEEcCCCCcccCCCccEEEEe
Confidence 479999999 5898874 57777653 2345678899999 5667889999999 6566 5556666666
Q ss_pred ecCCceEEEEEEec
Q psy7340 379 VEQHPTLLNVTLHT 392 (440)
Q Consensus 379 t~~~~t~lnI~L~~ 392 (440)
. ..+.++|.+.|
T Consensus 102 ~--G~t~~d~eVtP 113 (222)
T PF12866_consen 102 K--GNTTQDFEVTP 113 (222)
T ss_dssp S--SCEEEEEEE-B
T ss_pred c--CceEEeEEeee
Confidence 4 34566766655
No 48
>TIGR02962 hdxy_isourate hydroxyisourate hydrolase. Members of this family, hydroxyisourate hydrolase, represent a distinct clade of transthyretin-related proteins. Bacterial members typically are encoded next to ureidoglycolate hydrolase and often near either xanthine dehydrogenase or xanthine/uracil permease genes and have been demonstrated to have hydroxyisourate hydrolase activity. In eukaryotes, a clade separate from the transthyretins (a family of thyroid-hormone binding proteins) has also been shown to have HIU hydrolase activity in urate catabolizing organisms. Transthyretin, then, would appear to be the recently diverged paralog of the more ancient HIUH family.
Probab=89.33 E-value=0.63 Score=40.99 Aligned_cols=54 Identities=26% Similarity=0.322 Sum_probs=41.0
Q ss_pred EEEEEEC-CCCCcccccEEEEeccc-------eeeeeCCCccEEE------EecCCceEEEEEEeccc
Q psy7340 317 VQGFVTD-DLGNPVEKASLKIKGRD-------VGFQTTKYGEFWR------ILLPGIYKLEVYADGYV 370 (440)
Q Consensus 317 IsG~V~D-~~GePL~GAtV~IkGt~-------~gt~TD~~G~F~l------~lp~G~ytL~VS~IGY~ 370 (440)
|+-.|+| ..|.|.+|+.|.+.... ....||.||...- .+++|.|.|.|..-.|-
T Consensus 3 lstHVLDt~~G~PAagv~V~L~~~~~~~~~~i~~~~Tn~DGR~~~~l~~~~~~~~G~Y~l~F~~g~Yf 70 (112)
T TIGR02962 3 LSTHVLDTTSGKPAAGVPVTLYRLDGSGWTPLAEGVTNADGRCPDLLPEGETLAAGIYKLRFDTGDYF 70 (112)
T ss_pred ceEEEEeCCCCccCCCCEEEEEEecCCCeEEEEEEEECCCCCCcCcccCcccCCCeeEEEEEEhhhhh
Confidence 6888999 59999999999996421 1468999999841 24679999998655554
No 49
>KOG1948|consensus
Probab=88.69 E-value=1.5 Score=50.49 Aligned_cols=57 Identities=18% Similarity=0.279 Sum_probs=47.1
Q ss_pred eeEEEEEECCCCCcccccEEEEecc---ceeeeeCCCccEE-EEecCCceEEEEEEeccce
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKGR---DVGFQTTKYGEFW-RILLPGIYKLEVYADGYVP 371 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkGt---~~gt~TD~~G~F~-l~lp~G~ytL~VS~IGY~~ 371 (440)
.+|+|+|...-|--.+|+.|.+..+ -..|.|+.+|.|. .++.||+|.+..++.-+.-
T Consensus 119 Fsv~GkVlgaaggGpagV~velrs~e~~iast~T~~~Gky~f~~iiPG~Yev~ashp~w~~ 179 (1165)
T KOG1948|consen 119 FSVRGKVLGAAGGGPAGVLVELRSQEDPIASTKTEDGGKYEFRNIIPGKYEVSASHPAWEC 179 (1165)
T ss_pred eeEeeEEeeccCCCcccceeecccccCcceeeEecCCCeEEEEecCCCceEEeccCcceeE
Confidence 3699999998776678999988766 5679999999995 5699999999998866543
No 50
>PF01190 Pollen_Ole_e_I: Pollen proteins Ole e I like; InterPro: IPR006041 Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms. A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation. The allergens in this family include allergens with the following designations: Ole e 1. A number of plant pollen proteins, whose biological function is not yet known, are structurally related []. These proteins are most probably secreted and consist of about 145 residues. There are six cysteines which are conserved in the sequence of these proteins. They seem to be involved in disulphide bonds.
Probab=88.07 E-value=1.2 Score=37.39 Aligned_cols=33 Identities=21% Similarity=0.291 Sum_probs=26.1
Q ss_pred CCcccccEEEEeccc--------eeeeeCCCccEEEEecCC
Q psy7340 326 GNPVEKASLKIKGRD--------VGFQTTKYGEFWRILLPG 358 (440)
Q Consensus 326 GePL~GAtV~IkGt~--------~gt~TD~~G~F~l~lp~G 358 (440)
..||+||.|.+.-.+ ....||.+|.|.+.++..
T Consensus 18 ~~~l~GA~V~v~C~~~~~~~~~~~~~~Td~~G~F~i~l~~~ 58 (97)
T PF01190_consen 18 AKPLPGAKVSVECKDGNGGVVFSAEAKTDENGYFSIELPSD 58 (97)
T ss_pred CccCCCCEEEEECCCCCCCcEEEEEEEeCCCCEEEEEecCc
Confidence 579999999998433 346999999998877653
No 51
>PF11974 MG1: Alpha-2-macroglobulin MG1 domain; InterPro: IPR021868 This is the N-terminal MG1 domain from alpha-2-macroglobulin [].
Probab=87.49 E-value=1.6 Score=37.25 Aligned_cols=40 Identities=23% Similarity=0.164 Sum_probs=30.7
Q ss_pred EEEEEEC-CCCCcccccEEEEec-cc----eeeeeCCCccEEEEec
Q psy7340 317 VQGFVTD-DLGNPVEKASLKIKG-RD----VGFQTTKYGEFWRILL 356 (440)
Q Consensus 317 IsG~V~D-~~GePL~GAtV~IkG-t~----~gt~TD~~G~F~l~lp 356 (440)
+.-.|.| .+|+|++||.|.+.+ .+ ....||.+|.+.+...
T Consensus 15 ~~v~v~~L~tg~Pv~ga~V~l~~~~~~~~l~~g~TD~~G~a~~~~~ 60 (97)
T PF11974_consen 15 LLVWVTSLSTGKPVAGAEVELYDSRNGQVLASGKTDADGFASFDST 60 (97)
T ss_pred EEEEEeeCCCCCccCCCEEEEEECCCCcEeeeeeeCCCceEEecCC
Confidence 4455777 799999999999987 33 2468999999876543
No 52
>PF12985 DUF3869: Domain of unknown function (DUF3869); InterPro: IPR024620 This functionally uncharacterised domain is found in bacterial sequences, including the N-terminal region of a putative pore-forming toxin (YP_001301288.1) from Bacteroides vulgatus ATCC 8482.; PDB: 3G3L_A 3KOG_A.
Probab=86.51 E-value=3.4 Score=36.14 Aligned_cols=72 Identities=21% Similarity=0.301 Sum_probs=47.4
Q ss_pred cceeEEEEEEC-CCCCccc-ccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccceeEEEEEEec---CCce--E
Q psy7340 313 AHRGVQGFVTD-DLGNPVE-KASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYVPREIDFMVVE---QHPT--L 385 (440)
Q Consensus 313 ~~~~IsG~V~D-~~GePL~-GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~itItVt~---~~~t--~ 385 (440)
....|.|+|.| .+|++|. ++.+.+.+.. +|.+ .++...+..+..||.+...++.+.. ++.. .
T Consensus 20 A~~tI~~tV~da~tG~~vt~~a~vti~~~~-------~~~~----A~~a~~vtas~~~y~~~~~tV~vpal~~Gq~a~~~ 88 (104)
T PF12985_consen 20 AKYTIAGTVYDAETGEDVTTTATVTISAGS-------DGTL----AAKAVTVTASKDGYMTGSVTVNVPALKAGQFAVYP 88 (104)
T ss_dssp -EEEEEEEEEETTTTEE-CGCSEEEETTCC-------CCCE------ECCEEEEEECTCEEEEEEEEE--B-TT-EEEEE
T ss_pred CEEEEEEEEEecCCCCeeeccceEEEccCC-------Cccc----ccccEEEEEEccCCeeeEEEEEehhhcCCceEEEe
Confidence 34579999999 5999998 8999886654 5654 3556678889999999888877653 2222 4
Q ss_pred EEEEEecccc
Q psy7340 386 LNVTLHTSKW 395 (440)
Q Consensus 386 lnI~L~~~~~ 395 (440)
.+|.|.+...
T Consensus 89 v~i~L~~~~~ 98 (104)
T PF12985_consen 89 VNIALQKEAA 98 (104)
T ss_dssp EEEEE-CTCE
T ss_pred eEEEEeeccc
Confidence 5677765443
No 53
>COG5266 CbiK ABC-type Co2+ transport system, periplasmic component [Inorganic ion transport and metabolism]
Probab=84.57 E-value=2.7 Score=42.18 Aligned_cols=49 Identities=27% Similarity=0.362 Sum_probs=35.9
Q ss_pred eEEEEEECCCCCcccccEEEEe-------c----c------ceeeeeCCCccEEE-EecCCceEEEEE
Q psy7340 316 GVQGFVTDDLGNPVEKASLKIK-------G----R------DVGFQTTKYGEFWR-ILLPGIYKLEVY 365 (440)
Q Consensus 316 ~IsG~V~D~~GePL~GAtV~Ik-------G----t------~~gt~TD~~G~F~l-~lp~G~ytL~VS 365 (440)
.++|+|+|. |+||++|+|.+. + + .....||.+|.|.+ .+..|-..+.+-
T Consensus 173 ~f~~~vl~~-GkPv~nA~V~v~~~n~~~~d~~a~~~~~ek~~~~~~TD~kG~~~fip~r~G~W~~~~~ 239 (264)
T COG5266 173 VFRGKVLDN-GKPVPNATVEVEFDNIDTKDNRAKTGNTEKTALVQFTDDKGEVSFIPLRAGVWGFAVE 239 (264)
T ss_pred eEEEEEEEC-CccCCCcEEEEEEecccccccccccCCCCCcceEEEcCCCceEEEEEccCceEEEEee
Confidence 489999997 999999999997 1 1 13458999999954 345565555443
No 54
>PF00576 Transthyretin: HIUase/Transthyretin family; InterPro: IPR023416 This family includes transthyretin that is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. However, most of the sequences listed in this family do not bind thyroid hormones. They are actually enzymes of the purine catabolism that catalyse the conversion of 5-hydroxyisourate (HIU) to OHCU [, ]. HIU hydrolysis is the original function of the family and is conserved from bacteria to mammals; transthyretins arose by gene duplications in the vertebrate lineage [, ]. HIUases are distinguished in the alignment from the conserved C-terminal YRGS sequence. Transthyretin (formerly prealbumin) is one of 3 thyroid hormone-binding proteins found in the blood of vertebrates []. It is produced in the liver and circulates in the bloodstream, where it binds retinol and thyroxine (T4) []. It differs from the other 2 hormone-binding proteins (T4-binding globulin and albumin) in 3 distinct ways: (1) the gene is expressed at a high rate in the brain choroid plexus; (2) it is enriched in cerebrospinal fluid; and (3) no genetically caused absence has been observed, suggesting an essential role in brain function, distinct from that played in the bloodstream []. The protein consists of around 130 amino acids, which assemble as a homotetramer that contains an internal channel in which T4 is bound. Within this complex, T4 appears to be transported across the blood-brain barrier, where, in the choroid plexus, the hormone stimulates further synthesis of transthyretin. The protein then diffuses back into the bloodstream, where it binds T4 for transport back to the brain [].; PDB: 1TFP_B 1KGJ_D 1IE4_C 1GKE_C 1KGI_D 2H0J_B 2H0E_B 2H0F_B 1ZD6_A 3DGD_D ....
Probab=81.28 E-value=1.5 Score=38.57 Aligned_cols=53 Identities=26% Similarity=0.301 Sum_probs=37.4
Q ss_pred eEEEEEEC-CCCCcccccEEEEeccc--------eeeeeCCCccEE--E----EecCCceEEEEEEec
Q psy7340 316 GVQGFVTD-DLGNPVEKASLKIKGRD--------VGFQTTKYGEFW--R----ILLPGIYKLEVYADG 368 (440)
Q Consensus 316 ~IsG~V~D-~~GePL~GAtV~IkGt~--------~gt~TD~~G~F~--l----~lp~G~ytL~VS~IG 368 (440)
.|+-.|+| ..|.|..|+.|.+.... ....||.||... + .+.+|.|.|.|.--.
T Consensus 2 ~iStHVLDt~~G~PA~gv~V~L~~~~~~~~~~~l~~~~Td~DGR~~~~~~~~~~~~~G~Y~l~F~~~~ 69 (112)
T PF00576_consen 2 PISTHVLDTTTGKPAAGVPVTLYRLDSDGSWTLLAEGVTDADGRIKQPLLEGESLEPGIYKLVFDTGD 69 (112)
T ss_dssp SEEEEEEETTTTEE-TT-EEEEEEEETTSCEEEEEEEEBETTSEESSTSSETTTS-SEEEEEEEEHHH
T ss_pred CcEEEEeeCCCCCCccCCEEEEEEecCCCCcEEEEEEEECCCCcccccccccccccceEEEEEEEHHH
Confidence 47889999 59999999999886433 346999999982 1 245788988886433
No 55
>PF06488 L_lac_phage_MSP: Lactococcus lactis bacteriophage major structural protein; InterPro: IPR010517 This family consists of several Lactococcus lactis bacteriophage F4-1 major structural proteins []. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=78.84 E-value=3.6 Score=40.38 Aligned_cols=38 Identities=24% Similarity=0.328 Sum_probs=30.3
Q ss_pred eeeeCCCccEEE--EecCCceEEEEEEeccceeEEEEEEe
Q psy7340 342 GFQTTKYGEFWR--ILLPGIYKLEVYADGYVPREIDFMVV 379 (440)
Q Consensus 342 gt~TD~~G~F~l--~lp~G~ytL~VS~IGY~~q~itItVt 379 (440)
.++||..|+-.. .+.+|.|++++|+-||...+.++.|+
T Consensus 259 sVITnssG~~vTNgqLsaGtYtVTySAsGY~DVTqtlvVT 298 (301)
T PF06488_consen 259 SVITNSSGNVVTNGQLSAGTYTVTYSASGYADVTQTLVVT 298 (301)
T ss_pred EEEEcCCCcEeecCcccCceEEEEEeccccccccceEEEe
Confidence 467788887743 37899999999999999988776654
No 56
>smart00095 TR_THY Transthyretin.
Probab=77.49 E-value=4.3 Score=36.40 Aligned_cols=54 Identities=24% Similarity=0.365 Sum_probs=39.8
Q ss_pred eEEEEEEC-CCCCcccccEEEEeccc--------eeeeeCCCccEE--E---EecCCceEEEEEEecc
Q psy7340 316 GVQGFVTD-DLGNPVEKASLKIKGRD--------VGFQTTKYGEFW--R---ILLPGIYKLEVYADGY 369 (440)
Q Consensus 316 ~IsG~V~D-~~GePL~GAtV~IkGt~--------~gt~TD~~G~F~--l---~lp~G~ytL~VS~IGY 369 (440)
.|+-.|+| ..|.|..|+.|.+.... ....||.||... + .+.+|.|.|.|.--.|
T Consensus 5 plTtHVLDt~~G~PAagv~V~L~~~~~~~~w~~la~~~Tn~DGR~~~ll~~~~~~~G~Y~l~F~tg~Y 72 (121)
T smart00095 5 PLMVKVLDAVRGSPAVNVAVKVFKKTEEGTWEPFASGKTNESGEIHELTTDEKFVEGLYKVEFDTKSY 72 (121)
T ss_pred CeEEEEEECCCCccCCCCEEEEEEeCCCCceEEEEEEecCCCccccCccCcccccceEEEEEEehhHh
Confidence 37889999 59999999999884321 236999999983 2 2457899998854333
No 57
>cd05822 TLP_HIUase HIUase (5-hydroxyisourate hydrolase) catalyzes the second step in a three-step ureide pathway in which 5-hydroxyisourate (HIU), a product of the uricase (urate oxidase) reaction, is hydrolyzed to 2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline (OHCU). HIUase has high sequence similarity with transthyretins and is a member of the transthyretin-like protein (TLP) family. HIUase is distinguished from transthyretins by a conserved signature motif at its C-terminus that forms part of the active site. In HIUase, this motif is YRGS, while transthyretins have a conserved TAVV sequence in the same location. Most HIUases are cytosolic but in plants and slime molds, they are peroxisomal based on the presence of N-terminal periplasmic localization sequences. HIUase forms a homotetramer with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located betw
Probab=76.10 E-value=4.6 Score=35.63 Aligned_cols=54 Identities=24% Similarity=0.289 Sum_probs=40.6
Q ss_pred EEEEEEC-CCCCcccccEEEEeccc-------eeeeeCCCccEEE------EecCCceEEEEEEeccc
Q psy7340 317 VQGFVTD-DLGNPVEKASLKIKGRD-------VGFQTTKYGEFWR------ILLPGIYKLEVYADGYV 370 (440)
Q Consensus 317 IsG~V~D-~~GePL~GAtV~IkGt~-------~gt~TD~~G~F~l------~lp~G~ytL~VS~IGY~ 370 (440)
|+-.|+| ..|.|..|+.|.+.-.. ....||.||.... .+.+|.|.|.|.--.|-
T Consensus 3 lstHVLDt~~G~PAagv~V~L~~~~~~~~~~i~~~~Td~DGR~~~~~~~~~~~~~G~Y~l~F~~~~Yf 70 (112)
T cd05822 3 LSTHVLDTATGKPAAGVAVTLYRLDGNGWTLLATGVTNADGRCDDLLPPGAQLAAGTYKLTFDTGAYF 70 (112)
T ss_pred ceeEEEeCCCCcccCCCEEEEEEecCCCeEEEEEEEECCCCCccCcccccccCCCeeEEEEEEhhhhh
Confidence 6788999 59999999999886421 2368999999831 24579999988655554
No 58
>cd05821 TLP_Transthyretin Transthyretin (TTR) is a 55 kDa protein responsible for the transport of thyroid hormones and retinol in vertebrates. TTR distributes the two thyroid hormones T3 (3,5,3'-triiodo-L-thyronine) and T4 (Thyroxin, or 3,5,3',5'-tetraiodo-L-thyronine), as well as retinol (vitamin A) through the formation of a macromolecular complex that includes each of these as well as retinol-binding protein. Misfolded forms of TTR are implicated in the amyloid diseases familial amyloidotic polyneuropathy and senile systemic amyloidosis. TTR forms a homotetramer with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located between a pair of subunits, which differ in their ligand binding affinity. A negative cooperativity has been observed for the binding of T4 and other TTR ligands. A fraction of plasma TTR is carried in high density lipoproteins by bindi
Probab=75.94 E-value=4.7 Score=36.18 Aligned_cols=55 Identities=25% Similarity=0.351 Sum_probs=41.0
Q ss_pred eEEEEEEC-CCCCcccccEEEEeccc--------eeeeeCCCccEE--E---EecCCceEEEEEEeccc
Q psy7340 316 GVQGFVTD-DLGNPVEKASLKIKGRD--------VGFQTTKYGEFW--R---ILLPGIYKLEVYADGYV 370 (440)
Q Consensus 316 ~IsG~V~D-~~GePL~GAtV~IkGt~--------~gt~TD~~G~F~--l---~lp~G~ytL~VS~IGY~ 370 (440)
.|+-.|+| ..|.|-.|+.|.+.... ....||.||... + .+.+|.|.|.|---.|-
T Consensus 8 ~ittHVLDt~~G~PAaGV~V~L~~~~~~~~w~~l~~~~Tn~DGR~~~ll~~~~~~~G~Y~l~F~tg~Yf 76 (121)
T cd05821 8 PLMVKVLDAVRGSPAANVAVKVFKKTADGSWEPFASGKTTETGEIHGLTTDEQFTEGVYKVEFDTKAYW 76 (121)
T ss_pred CcEEEEEECCCCccCCCCEEEEEEecCCCceEEEEEEEECCCCCCCCccCccccCCeeEEEEEehhHhh
Confidence 47899999 59999999999985321 347999999983 2 24578999998654443
No 59
>cd03457 intradiol_dioxygenase_like Intradiol dioxygenase supgroup. Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates. They break the catechol C1-C2 bond and utilize Fe3+, as opposed to the extradiol-cleaving enzymes which break the C2-C3 or C1-C6 bond and utilize Fe2+ and Mn+. The family contains catechol 1,2-dioxygenases and protocatechuate 3,4-dioxygenases. The specific function of this subgroup is unknown.
Probab=75.58 E-value=6.1 Score=37.76 Aligned_cols=47 Identities=21% Similarity=0.108 Sum_probs=34.5
Q ss_pred eeEEEEEECC-CCCcccccEEEEec---------c---c------------e-eeeeCCCccEEE-EecCCceE
Q psy7340 315 RGVQGFVTDD-LGNPVEKASLKIKG---------R---D------------V-GFQTTKYGEFWR-ILLPGIYK 361 (440)
Q Consensus 315 ~~IsG~V~D~-~GePL~GAtV~IkG---------t---~------------~-gt~TD~~G~F~l-~lp~G~yt 361 (440)
..+.|+|+|. +++||++|.|.|=. . . . ...||.+|.|.+ .+.||-|.
T Consensus 27 l~l~g~V~D~~~c~Pv~~a~VdiWh~da~G~Ys~~~~~~~~~~~~~~~~flRG~~~TD~~G~~~F~TI~PG~Y~ 100 (188)
T cd03457 27 LTLDLQVVDVATCCPPPNAAVDIWHCDATGVYSGYSAGGGGGEDTDDETFLRGVQPTDADGVVTFTTIFPGWYP 100 (188)
T ss_pred EEEEEEEEeCCCCccCCCeEEEEecCCCCCCCCCccCCccccccccCCCcCEEEEEECCCccEEEEEECCCCCC
Confidence 4689999994 78999999998821 0 0 1 248899999954 57788664
No 60
>cd05469 Transthyretin_like Transthyretin_like. This domain is present in the transthyretin-like protein (TLP) family which includes transthyretin (TTR) and a transthyretin-related protein called 5-hydroxyisourate hydrolase (HIUase). TTR and HIUase are homotetrameric proteins with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located between a pair of subunits. TTR transports thyroid hormones and retinol in the blood serum of vertebrates while HIUase catalyzes the second step in a three-step ureide pathway. TTRs are highly conserved and found only in vertebrates while the HIUases are found in a wide range of bacterial, plant, fungal, slime mold and vertebrate organisms.
Probab=74.02 E-value=5.9 Score=35.13 Aligned_cols=53 Identities=23% Similarity=0.373 Sum_probs=39.1
Q ss_pred EEEEEEC-CCCCcccccEEEEeccc--------eeeeeCCCccEEE-----EecCCceEEEEEEecc
Q psy7340 317 VQGFVTD-DLGNPVEKASLKIKGRD--------VGFQTTKYGEFWR-----ILLPGIYKLEVYADGY 369 (440)
Q Consensus 317 IsG~V~D-~~GePL~GAtV~IkGt~--------~gt~TD~~G~F~l-----~lp~G~ytL~VS~IGY 369 (440)
|+-.|+| ..|.|-+|+.|.+.-.. ....||.||...- .+.+|.|.|.|.--.|
T Consensus 3 lStHVLDt~~G~PAagv~V~L~~~~~~~~w~~l~~~~Tn~DGR~~~~l~~~~~~~G~Y~l~F~t~~Y 69 (113)
T cd05469 3 LMVKVLDAVRGSPAANVAIKVFRKTADGSWEIFATGKTNEDGELHGLITEEEFXAGVYRVEFDTKSY 69 (113)
T ss_pred ceEEEEeCCCCccCCCCEEEEEEecCCCceEEEEEEEECCCCCccCccccccccceEEEEEEehHHh
Confidence 6788999 59999999999885321 2368999999831 2456888888864444
No 61
>PF10794 DUF2606: Protein of unknown function (DUF2606); InterPro: IPR019730 This entry represents bacterial proteins with unknown function.
Probab=64.05 E-value=47 Score=30.03 Aligned_cols=51 Identities=20% Similarity=0.350 Sum_probs=39.1
Q ss_pred eEEEEEECCCCCcccccEEEEec---------ccee---eeeCCCccE-EEEecCCceEEEEEE
Q psy7340 316 GVQGFVTDDLGNPVEKASLKIKG---------RDVG---FQTTKYGEF-WRILLPGIYKLEVYA 366 (440)
Q Consensus 316 ~IsG~V~D~~GePL~GAtV~IkG---------t~~g---t~TD~~G~F-~l~lp~G~ytL~VS~ 366 (440)
.|+=.|.|.+|.|+.+++|++-. ...| ..||.+|.+ |.....|+|.+.+.-
T Consensus 43 pVT~hVen~e~~pi~~~ev~lmKa~ds~~qPs~eig~~IGKTD~~Gki~Wk~~~kG~Y~v~l~n 106 (131)
T PF10794_consen 43 PVTFHVENAEGQPIKDFEVTLMKAADSDPQPSKEIGISIGKTDEEGKIIWKNGRKGKYIVFLPN 106 (131)
T ss_pred cEEEEEecCCCCcccceEEEEEeccccCCCCchhhceeecccCCCCcEEEecCCcceEEEEEcC
Confidence 46777899999999999998853 1122 589999999 666778999876643
No 62
>PF13953 PapC_C: PapC C-terminal domain; PDB: 3L48_E 2XET_A 3RFZ_E 2KT6_A.
Probab=63.92 E-value=15 Score=28.94 Aligned_cols=47 Identities=21% Similarity=0.084 Sum_probs=27.7
Q ss_pred EEECCCCCcccc-cEEEEeccceeeeeCCCccEEEEecCCceEEEEEE
Q psy7340 320 FVTDDLGNPVEK-ASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYA 366 (440)
Q Consensus 320 ~V~D~~GePL~G-AtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~ 366 (440)
++++.+|.|||. |.|...+.....+...+|..++.-.+..-+|.|+.
T Consensus 2 ~l~~~~G~~lPfGA~v~~~~g~~~g~Vg~~G~vyl~~~~~~~~L~V~w 49 (68)
T PF13953_consen 2 TLRDADGKPLPFGASVSDEDGNNIGIVGQDGQVYLSGLPPKGTLTVKW 49 (68)
T ss_dssp EEEETTSEE--TT-EEEETTSSEEEEB-GCGEEEEEEE-TCEEEEEES
T ss_pred EEEcCCCCcCCCCcEEEcCCCCEEEEEcCCCEEEEECCCCCcEEEEEE
Confidence 467899999975 77776555566677777877654333344566655
No 63
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=63.73 E-value=13 Score=28.17 Aligned_cols=23 Identities=30% Similarity=0.388 Sum_probs=20.0
Q ss_pred eeEEEEEECCCCCcccccEEEEe
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIK 337 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~Ik 337 (440)
+.++-+|+|.+|+|+|++.+.+.
T Consensus 15 I~ltVt~kda~G~pv~n~~f~l~ 37 (47)
T PF05688_consen 15 IPLTVTVKDANGNPVPNAPFTLT 37 (47)
T ss_pred EEEEEEEECCCCCCcCCceEEEE
Confidence 45778899999999999999985
No 64
>COG1470 Predicted membrane protein [Function unknown]
Probab=62.66 E-value=26 Score=38.24 Aligned_cols=74 Identities=22% Similarity=0.281 Sum_probs=55.6
Q ss_pred EEEEEECCCCCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEec-cceeEEEEEEecCCceEEEEEEec
Q psy7340 317 VQGFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADG-YVPREIDFMVVEQHPTLLNVTLHT 392 (440)
Q Consensus 317 IsG~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IG-Y~~q~itItVt~~~~t~lnI~L~~ 392 (440)
..-.++|++|.+++-+-|... +-+-.|+..|+-.+.+.||.|.+.+..-| |......+.+..+....+.+.+.+
T Consensus 189 ~~l~~vd~~G~gv~~~~v~~g--~e~~ets~~g~~~~e~t~g~y~~~i~~~g~ye~~~~av~l~d~~t~dLkls~~~ 263 (513)
T COG1470 189 SKLRLVDDDGAGVPKALVKDG--NESFETSSKGNLEVEITPGKYVVLIAKKGIYEKKKRAVKLNDGETKDLKLSVTE 263 (513)
T ss_pred EEEEEEccCCCccchheeecC--ceeEEeecccceeEEecCcceEEEeccccceecceEEEEcCCCcccceeEEEEe
Confidence 566788999999999988774 34568889998878889999999999999 777777776654443344444433
No 65
>PF11008 DUF2846: Protein of unknown function (DUF2846); InterPro: IPR022548 Some members in this group of proteins with unknown function are annotated as lipoproteins. However this cannot be confirmed.
Probab=59.58 E-value=50 Score=28.62 Aligned_cols=52 Identities=17% Similarity=0.224 Sum_probs=32.6
Q ss_pred ccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEeccc---eeEEEEEEecC
Q psy7340 329 VEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADGYV---PREIDFMVVEQ 381 (440)
Q Consensus 329 L~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IGY~---~q~itItVt~~ 381 (440)
.....|.|.|...+. +...+.|...++||.|+|..+.-.+. ....++++.++
T Consensus 40 ~~~~~v~vdg~~ig~-l~~g~y~~~~v~pG~h~i~~~~~~~~~~~~~~l~~~~~~G 94 (117)
T PF11008_consen 40 AVKPDVYVDGELIGE-LKNGGYFYVEVPPGKHTISAKSEFSSSPGANSLDVTVEAG 94 (117)
T ss_pred cccceEEECCEEEEE-eCCCeEEEEEECCCcEEEEEecCccCCCCccEEEEEEcCC
Confidence 556678887766554 55555557889999999988544332 24444545444
No 66
>PRK15310 fimbrial outer membrane usher protein TcfC; Provisional
Probab=58.85 E-value=17 Score=42.29 Aligned_cols=46 Identities=22% Similarity=0.003 Sum_probs=35.7
Q ss_pred ceeEEEEEECCCCCcccccEEEEeccceeeeeCCCccEEEEecCCceEE
Q psy7340 314 HRGVQGFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKL 362 (440)
Q Consensus 314 ~~~IsG~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL 362 (440)
..++-|+++|++|.|+.+|.|.- ..++.||.+|.|.+++..-.-+|
T Consensus 785 ~~~yiGr~~~~~G~~l~~a~IlN---~~~~~td~~GgF~~d~d~~~~~L 830 (895)
T PRK15310 785 SRLYVGRVLDKDGRPLLDAQPLN---YPFLSLGPSGRFSLQSEHKESSL 830 (895)
T ss_pred EEEEEEEEECCCCCCcccceeec---CccceecCCCCEEEEcccCCceE
Confidence 34588999999999999998754 34599999999988775433333
No 67
>PF14347 DUF4399: Domain of unknown function (DUF4399)
Probab=58.53 E-value=32 Score=29.05 Aligned_cols=52 Identities=15% Similarity=0.100 Sum_probs=35.7
Q ss_pred EEEEEECCCCCcccccEEEEeccceeeeeCCCccE--EEEecCCceEEEEEEeccce
Q psy7340 317 VQGFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEF--WRILLPGIYKLEVYADGYVP 371 (440)
Q Consensus 317 IsG~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F--~l~lp~G~ytL~VS~IGY~~ 371 (440)
-.=.++|.++.|+++..|...... +.-.+|.= .++++||.|+|++.+-.+..
T Consensus 22 h~hlliD~~~~~~~~~~I~~~~n~---vhy~~Gqte~~I~L~PG~htLtl~~~d~~h 75 (87)
T PF14347_consen 22 HHHLLIDGDGPPLANEPIPFNING---VHYGKGQTELNIELPPGKHTLTLQLGDGDH 75 (87)
T ss_pred cEEEEECCCCCcCCCCeeeecCCe---EEeCCCEEEEEEEeCCCCEEEEEEeCCCCc
Confidence 345578999999999988875322 22245543 46789999999987755443
No 68
>COG2351 Transthyretin-like protein [General function prediction only]
Probab=51.79 E-value=36 Score=30.67 Aligned_cols=54 Identities=22% Similarity=0.273 Sum_probs=40.5
Q ss_pred eEEEEEEC-CCCCcccccEEEEec---c----ceeeeeCCCccEE--E----EecCCceEEEEEEecc
Q psy7340 316 GVQGFVTD-DLGNPVEKASLKIKG---R----DVGFQTTKYGEFW--R----ILLPGIYKLEVYADGY 369 (440)
Q Consensus 316 ~IsG~V~D-~~GePL~GAtV~IkG---t----~~gt~TD~~G~F~--l----~lp~G~ytL~VS~IGY 369 (440)
.++-.|.| ..|.|-+|+.|.+.- . -..+.||.||.-. + .+..|.|.|.|..-.|
T Consensus 10 ~LTTHVLDta~GkPAagv~V~L~rl~~~~~~~l~t~~Tn~DGR~d~pll~g~~~~~G~Y~l~F~~gdY 77 (124)
T COG2351 10 RLTTHVLDTASGKPAAGVKVELYRLEGNQWELLKTVVTNADGRIDAPLLAGETLATGIYELVFHTGDY 77 (124)
T ss_pred eeeeeeeecccCCcCCCCEEEEEEecCCcceeeeEEEecCCCcccccccCccccccceEEEEEEcchh
Confidence 47888999 599999999998752 2 1457999999973 1 2467899998865444
No 69
>PF03785 Peptidase_C25_C: Peptidase family C25, C terminal ig-like domain; InterPro: IPR005536 This domain is found in almost all members of MEROPS peptidase family C25, (clan CD). Peptidase family C25 is a protein family found in the bacteria Porphyromonas gingivalis (Bacteroides gingivalis) a Gram-negative anaerobic bacterial species strongly associated with adult periodontitis. One of its distinguishing characteristics and putative virulence properties is the ability to agglutinate erythrocytes []. It is a highly proteolytic organism which metabolises small peptides and amino acids. Indirect evidence suggests that the proteases produced by this microorganism constitute an important virulence factor []. Protease-encoding genes have been shown to contain multiple copies of repeated nucleotide sequences. These conserved sequences have also been found in haemagglutinin genes [].; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 1CVR_A.
Probab=51.69 E-value=64 Score=27.19 Aligned_cols=46 Identities=20% Similarity=0.132 Sum_probs=26.7
Q ss_pred cccEEEEe--cccee-eeeCCCccEEEEec-----CCceEEEEEEeccceeEEEE
Q psy7340 330 EKASLKIK--GRDVG-FQTTKYGEFWRILL-----PGIYKLEVYADGYVPREIDF 376 (440)
Q Consensus 330 ~GAtV~Ik--Gt~~g-t~TD~~G~F~l~lp-----~G~ytL~VS~IGY~~q~itI 376 (440)
+|+.|.+. |.-.| .++| +|++.++++ ++.|+|++....|.+...++
T Consensus 26 ~gs~ValS~dg~l~G~ai~~-sG~ati~l~~~it~~~~~tlTit~~n~~t~i~~i 79 (81)
T PF03785_consen 26 PGSYVALSQDGDLYGKAIVN-SGNATINLTNPITDEGTLTLTITAFNYVTYIKTI 79 (81)
T ss_dssp TT-EEEEEETTEEEEEEE-B-TTEEEEE-SS--TT-SEEEEEEE-TTB--EEEEE
T ss_pred CCcEEEEecCCEEEEEEEec-CceEEEECCcccCCCceEEEEEEEEccEEEEEEe
Confidence 45555553 33344 4777 999988765 47889998888888776554
No 70
>smart00088 PINT motif in proteasome subunits, Int-6, Nip-1 and TRIP-15. Also called the PCI (Proteasome, COP9, Initiation factor 3) domain. Unknown function.
Probab=50.26 E-value=12 Score=30.27 Aligned_cols=53 Identities=13% Similarity=0.236 Sum_probs=35.4
Q ss_pred HHHHHHHHHHHhcCHHHHHHhhhccccCchHHHHHHHHHHHHHHcCCCCCCCC
Q psy7340 24 IKDMKNSILSQVLDQSARARLNTIMLCKPEKAQQIENMICQMAQTGQIMNKLG 76 (440)
Q Consensus 24 ~e~~~~~~L~qiL~~eAreRL~~i~lvkPe~a~~vE~~li~~aq~G~i~~kv~ 76 (440)
.+.++...|.++..+-..=.++.|+-.=+=-.+.||..|++|+..|.|.++|+
T Consensus 7 ~~~~~~~~l~~l~~~y~~i~~~~i~~~~~l~~~~vE~~i~~~i~~~~l~~~ID 59 (88)
T smart00088 7 QRKIRLTNLLQLSEPYSSISLSDLAKLLGLSVPEVEKLVSKAIRDGEISAKID 59 (88)
T ss_pred HHHHHHHHHHHHhHHhceeeHHHHHHHhCcCHHHHHHHHHHHHHCCCeEEEEc
Confidence 34556667777776665544544442212223369999999999999999876
No 71
>smart00753 PAM PCI/PINT associated module.
Probab=50.26 E-value=12 Score=30.27 Aligned_cols=53 Identities=13% Similarity=0.236 Sum_probs=35.4
Q ss_pred HHHHHHHHHHHhcCHHHHHHhhhccccCchHHHHHHHHHHHHHHcCCCCCCCC
Q psy7340 24 IKDMKNSILSQVLDQSARARLNTIMLCKPEKAQQIENMICQMAQTGQIMNKLG 76 (440)
Q Consensus 24 ~e~~~~~~L~qiL~~eAreRL~~i~lvkPe~a~~vE~~li~~aq~G~i~~kv~ 76 (440)
.+.++...|.++..+-..=.++.|+-.=+=-.+.||..|++|+..|.|.++|+
T Consensus 7 ~~~~~~~~l~~l~~~y~~i~~~~i~~~~~l~~~~vE~~i~~~i~~~~l~~~ID 59 (88)
T smart00753 7 QRKIRLTNLLQLSEPYSSISLSDLAKLLGLSVPEVEKLVSKAIRDGEISAKID 59 (88)
T ss_pred HHHHHHHHHHHHhHHhceeeHHHHHHHhCcCHHHHHHHHHHHHHCCCeEEEEc
Confidence 34556667777776665544544442212223369999999999999999876
No 72
>PF01060 DUF290: Transthyretin-like family; InterPro: IPR001534 This new apparently nematode-specific protein family has been called family 2 []. The proteins show weak similarity to transthyretin (formerly called prealbumin) which transports thyroid hormones. The specific function of this protein is unknown.; GO: 0005615 extracellular space
Probab=50.25 E-value=24 Score=28.71 Aligned_cols=30 Identities=23% Similarity=0.251 Sum_probs=22.7
Q ss_pred CCCcccccEEEEeccc--------eeeeeCCCccEEEE
Q psy7340 325 LGNPVEKASLKIKGRD--------VGFQTTKYGEFWRI 354 (440)
Q Consensus 325 ~GePL~GAtV~IkGt~--------~gt~TD~~G~F~l~ 354 (440)
+|.|++++.|.+-..+ ..+.||.+|+|.+.
T Consensus 7 ~~~P~~~~~V~L~e~d~~~~Ddll~~~~Td~~G~F~l~ 44 (80)
T PF01060_consen 7 GGKPAKNVKVKLWEDDYFDPDDLLDETKTDSDGNFELS 44 (80)
T ss_pred CCccCCCCEEEEEECCCCCCCceeEEEEECCCceEEEE
Confidence 4789999988876433 24799999999764
No 73
>PF07523 Big_3: Bacterial Ig-like domain (group 3); InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=50.10 E-value=26 Score=27.28 Aligned_cols=38 Identities=16% Similarity=0.208 Sum_probs=22.2
Q ss_pred EEEECCCCCcccccEEEEeccceeeeeCCCccEEEEecCCceEEEEEEec
Q psy7340 319 GFVTDDLGNPVEKASLKIKGRDVGFQTTKYGEFWRILLPGIYKLEVYADG 368 (440)
Q Consensus 319 G~V~D~~GePL~GAtV~IkGt~~gt~TD~~G~F~l~lp~G~ytL~VS~IG 368 (440)
-.++|++|.+++...+.+.+ .| -.-.+|.|.|.++|-|
T Consensus 21 v~at~~dG~~~~~~~~~vs~-----------~~-d~~~~G~y~Vt~~y~~ 58 (67)
T PF07523_consen 21 VTATYSDGTSLPLSDVTVSG-----------TV-DTSKAGTYTVTYTYKG 58 (67)
T ss_dssp EEEEETTS-ES-GCCSEEES---------------TTS-CCEEEEEEECT
T ss_pred EEEEEcCCCEeceeeeEEEe-----------ee-ecCCCceEEEEEEECC
Confidence 34567788876555555543 22 1236799999999998
No 74
>KOG3006|consensus
Probab=39.41 E-value=51 Score=29.86 Aligned_cols=50 Identities=22% Similarity=0.300 Sum_probs=36.9
Q ss_pred eEEEEEEC-CCCCcccccEEEEec---c----ce-eeeeCCCccEE-E----EecCCceEEEEE
Q psy7340 316 GVQGFVTD-DLGNPVEKASLKIKG---R----DV-GFQTTKYGEFW-R----ILLPGIYKLEVY 365 (440)
Q Consensus 316 ~IsG~V~D-~~GePL~GAtV~IkG---t----~~-gt~TD~~G~F~-l----~lp~G~ytL~VS 365 (440)
.|+-.|+| ..|.|..|+.|.+-- . .. ...|+.+|.-. . .+.||.|.+.+-
T Consensus 22 ~itahVLd~s~GsPA~gVqV~~f~~~~~~~w~~igs~~T~~nGrv~~~~~~~tl~~GtYr~~~d 85 (132)
T KOG3006|consen 22 PITAHVLDISRGSPAAGVQVHLFILANDDTWTPIGSGFTQDNGRVDWVSPDFTLIPGTYRLVFD 85 (132)
T ss_pred CcEeEEeecccCCcccceEEEEEEecCCCcccCccccccccCceeecccchhhhccceEEEEEe
Confidence 58899999 589999998886642 1 12 35889999872 2 257899998874
No 75
>PF08863 YolD: YolD-like protein; InterPro: IPR014962 These proteins are functionally uncharacterised. However it has been predicted that these proteins are functionally equivalent to the UmuD subunit of polymerase V from Gram-negative bacteria [].
Probab=35.44 E-value=1.1e+02 Score=24.73 Aligned_cols=57 Identities=21% Similarity=0.437 Sum_probs=36.1
Q ss_pred chHHHHHHHHHHHHHHcCCC-CCCCCHHHHHHHHHHhhhhhcccceeEeccccCCCCCeeeee
Q psy7340 52 PEKAQQIENMICQMAQTGQI-MNKLGENELIGLLEQISNREEKKSSVKTPKFWGQGGSYVSIF 113 (440)
Q Consensus 52 Pe~a~~vE~~li~~aq~G~i-~~kv~e~~l~~iL~~i~~~~~~~~~vk~~~~~~~~~~~~~~~ 113 (440)
||..+++..+.. ...++ +.-+||++|-+|...|+..-.....|.+. -|+ +|.|.++.
T Consensus 2 PEH~e~L~~~~~---e~~k~~kp~Lde~~leei~~~l~~a~~~~~~v~it-y~~-~g~~~~~~ 59 (92)
T PF08863_consen 2 PEHKEALRELIK---EQNKVEKPELDEQQLEEINEKLSEAYQENQPVTIT-YYE-DGYYQSVT 59 (92)
T ss_pred ChHHHHHHHHHH---HhcccCCCCCcHHHHHHHHHHHHHHhcCCCEEEEE-EEE-CCeeEEEE
Confidence 455544444332 23444 46679999999999998876555566553 355 67776654
No 76
>PF10642 Tom5: Mitochondrial import receptor subunit or translocase; InterPro: IPR019603 This entry represents a short family of yeast proteins. Tom5 is one of three very small translocases of the mitochondrial outer membrane. Tom5 links mitochondrial preprotein receptors to the general import pore []. Although Tom5 has allegedly been identified in vertebrates this could not be confirmed.
Probab=32.90 E-value=1.1e+02 Score=23.48 Aligned_cols=30 Identities=30% Similarity=0.241 Sum_probs=17.2
Q ss_pred CcccCCCCCCHHHHHHHHHHHHHHHHHHHHHH
Q psy7340 1 MLGVGGGGASGEQQKAAQERQEQIKDMKNSIL 32 (440)
Q Consensus 1 ~~~~~~~~~~~e~~~~~~~~~~~~e~~~~~~L 32 (440)
|.|.+.+.+|.|+.+++|++. ....|+.+.
T Consensus 1 MFgg~~~qpS~eE~k~~e~~A--~~Tvk~a~~ 30 (49)
T PF10642_consen 1 MFGGPPPQPSEEEIKAAEAQA--NFTVKNAAA 30 (49)
T ss_pred CCCCCCCCCCHHHHHHHHHHH--HHHHHHHHH
Confidence 677666667777776665433 334444443
No 77
>PRK14981 DNA-directed RNA polymerase subunit F; Provisional
Probab=32.86 E-value=1.4e+02 Score=26.17 Aligned_cols=58 Identities=12% Similarity=0.231 Sum_probs=42.6
Q ss_pred HHHHHHHHHHHHh-cCHHHHHHhhhccccCchHHHHHHHHHHHHHHcCCCCCCCCHHHHHHHHHHhhh
Q psy7340 23 QIKDMKNSILSQV-LDQSARARLNTIMLCKPEKAQQIENMICQMAQTGQIMNKLGENELIGLLEQISN 89 (440)
Q Consensus 23 ~~e~~~~~~L~qi-L~~eAreRL~~i~lvkPe~a~~vE~~li~~aq~G~i~~kv~e~~l~~iL~~i~~ 89 (440)
...++++.++..- |++..+.+|.||.=.-+|-++.+=- .++.+++|++|.+||+-|+.
T Consensus 51 ~a~elve~L~~~~~l~e~~a~~I~nL~P~~~dElrai~~---------~~~~~~~~e~l~~ILd~l~k 109 (112)
T PRK14981 51 DAEELVEELLELEKMKEKTAVKIADILPETRDELRAIFA---------KERYTLSPEELDEILDIVKK 109 (112)
T ss_pred HHHHHHHHHHHccCCCHHHHHHHHhcCCCCHHHHHHHHH---------HhccCCCHHHHHHHHHHHHH
Confidence 4456667777766 5999999999987666666665532 23568999999999998864
No 78
>PF05751 FixH: FixH; InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=32.38 E-value=1.8e+02 Score=25.67 Aligned_cols=50 Identities=22% Similarity=0.253 Sum_probs=32.2
Q ss_pred eeEEEEEECCCCCcccccEEEEec---------cceeeeeCCCccEEEE---ecCCceEEEE
Q psy7340 315 RGVQGFVTDDLGNPVEKASLKIKG---------RDVGFQTTKYGEFWRI---LLPGIYKLEV 364 (440)
Q Consensus 315 ~~IsG~V~D~~GePL~GAtV~IkG---------t~~gt~TD~~G~F~l~---lp~G~ytL~V 364 (440)
..++-+++|.+|.|+.++.+.+.= .........+|.|... .++|.+.|.+
T Consensus 69 ~~~~i~~~d~~g~~~~~~~~~l~l~rp~~~~~D~~~~l~~~~~g~y~~~~~~~~~G~W~l~l 130 (146)
T PF05751_consen 69 NSLTIRLTDPNGAPVSGAKLTLSLYRPTDAKLDFTLTLTESAPGVYRAPVPLLKKGRWYLRL 130 (146)
T ss_pred CeEEEEEEcCCCCcCcCceEEEEEECCCCccCCeeEEeeECCCceEEEEcCCCCCccEEEEE
Confidence 357777889999999999988841 1122234666777442 2456666666
No 79
>cd00390 Urease_gamma Urease gamma-subunit; Urease is a nickel-dependent metalloenzyme that catalyzes the hydrolysis of urea to form ammonia and carbon dioxide. Nickel-dependent ureases are found in bacteria, archaea, fungi and plants. Their primary role is to allow the use of external and internally-generated urea as a nitrogen source. The enzyme consists of three subunits, alpha, beta and gamma, which can exist as separate proteins or can be fused on a single protein chain. The alpha-beta-gamma heterotrimer forms multimers, mainly trimers. The large alpha subunit is the catalytic domain containing an active site with a bi-nickel center complexed by a carbamylated lysine. The beta and gamma subunits play a role in subunit association to form the higher order trimers.
Probab=30.87 E-value=1.2e+02 Score=26.42 Aligned_cols=71 Identities=25% Similarity=0.368 Sum_probs=51.8
Q ss_pred HHHHHHhhh-ccccCchHHHHHHHHHHHHHHcCCCCCCCCHHHHHHHHHHhhhhhc-------ccceeEeccccCCCCCe
Q psy7340 38 QSARARLNT-IMLCKPEKAQQIENMICQMAQTGQIMNKLGENELIGLLEQISNREE-------KKSSVKTPKFWGQGGSY 109 (440)
Q Consensus 38 ~eAreRL~~-i~lvkPe~a~~vE~~li~~aq~G~i~~kv~e~~l~~iL~~i~~~~~-------~~~~vk~~~~~~~~~~~ 109 (440)
.=|+.|++| ++|-.||-..-|=++++..|+.|+ |=++|.++-..+=...+ --..|-+---|.|+...
T Consensus 15 ~lA~~R~~rGlkLN~pEAvAlIs~~v~E~aRdG~-----svaelm~~g~~~L~~d~Vm~GV~emi~~vqVEatFpDGTkL 89 (96)
T cd00390 15 ELARKRLARGLKLNYPEAVALIADEILEGARDGK-----SVAELMSLGKTVLTRDDVMEGVPEMLHDVQVEATFPDGTKL 89 (96)
T ss_pred HHHHHHHHcCcccCcHHHHHHHHHHHHHHhhcCC-----CHHHHHHHHhhhCCHHhcccCHHHhhcceeEEEEeCCCCEE
Confidence 347777776 999999999999999999999997 67788777666543321 12345555567777777
Q ss_pred eeee
Q psy7340 110 VSIF 113 (440)
Q Consensus 110 ~~~~ 113 (440)
|.+.
T Consensus 90 Vtvh 93 (96)
T cd00390 90 VTVH 93 (96)
T ss_pred EEcc
Confidence 7663
No 80
>KOG4659|consensus
Probab=30.48 E-value=1.8e+02 Score=35.96 Aligned_cols=87 Identities=13% Similarity=0.030 Sum_probs=60.2
Q ss_pred HHhhhcceeEEEEEECCCCCcccccEEEEec-cceeeeeCCCccEEEEecC-CceEEEEEEeccceeEEEEEEecCCceE
Q psy7340 308 KFLAEAHRGVQGFVTDDLGNPVEKASLKIKG-RDVGFQTTKYGEFWRILLP-GIYKLEVYADGYVPREIDFMVVEQHPTL 385 (440)
Q Consensus 308 ~~~~q~~~~IsG~V~D~~GePL~GAtV~IkG-t~~gt~TD~~G~F~l~lp~-G~ytL~VS~IGY~~q~itItVt~~~~t~ 385 (440)
.|.......|.|.|.-..|-||.|+.|...- ...-|.|-.||.|-+.+.- +..+|.|--.-|.++..++-|.-+..-.
T Consensus 45 ~fne~~~~vIrgrvv~~~~~pLVGVrVS~~~~~~yfTlTR~DG~FDL~vnGg~svtLqF~R~pF~~qkr~v~vpwnq~i~ 124 (1899)
T KOG4659|consen 45 QFNENRISVIRGRVVWGGGVPLVGVRVSDAAHPLYFTLTREDGYFDLTVNGGRSVTLQFLRTPFQSQKRSVFVPWNQIIH 124 (1899)
T ss_pred hhccccceEEeccEeecCCcceEEEEeecccccceEEEEecCceEEEEEcccceEEEEEccCCCcccceeEEeChhhEEE
Confidence 3444444569999999999999999987643 2346899999999665443 3677888888899998887664443333
Q ss_pred E-EEEEeccc
Q psy7340 386 L-NVTLHTSK 394 (440)
Q Consensus 386 l-nI~L~~~~ 394 (440)
+ +++|...+
T Consensus 125 id~vvm~~~~ 134 (1899)
T KOG4659|consen 125 IDDVVMYRQE 134 (1899)
T ss_pred EEeEEEEeec
Confidence 3 45555433
No 81
>PF15248 DUF4587: Domain of unknown function (DUF4587)
Probab=29.68 E-value=25 Score=29.21 Aligned_cols=18 Identities=44% Similarity=0.433 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHHHcCCC
Q psy7340 54 KAQQIENMICQMAQTGQI 71 (440)
Q Consensus 54 ~a~~vE~~li~~aq~G~i 71 (440)
|.+.||-|++|.||+-||
T Consensus 6 KEDmvElMlmQNAQMHQi 23 (76)
T PF15248_consen 6 KEDMVELMLMQNAQMHQI 23 (76)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 678999999999999887
No 82
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=29.05 E-value=3.2e+02 Score=22.28 Aligned_cols=53 Identities=25% Similarity=0.258 Sum_probs=30.7
Q ss_pred eeEEEEEECCC--CCcccc--cEEEEeccc------eee-eeCCCccEEE--Ee----cCCceEEEEEEe
Q psy7340 315 RGVQGFVTDDL--GNPVEK--ASLKIKGRD------VGF-QTTKYGEFWR--IL----LPGIYKLEVYAD 367 (440)
Q Consensus 315 ~~IsG~V~D~~--GePL~G--AtV~IkGt~------~gt-~TD~~G~F~l--~l----p~G~ytL~VS~I 367 (440)
..+++.+.|.+ ..+..+ ++|.|.+.+ ... .++..|.|.. .+ +.|.|+|.+..-
T Consensus 17 V~~~~~~~~~~~~~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~ 86 (99)
T PF01835_consen 17 VHFRAIVRDLDNDFKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTD 86 (99)
T ss_dssp EEEEEEEEEECTTCSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEEEEET
T ss_pred EEEEEEEeccccccccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEc
Confidence 34777777755 345544 444554432 223 5688999843 33 358899988773
No 83
>PF13115 YtkA: YtkA-like
Probab=28.68 E-value=1.7e+02 Score=23.38 Aligned_cols=21 Identities=24% Similarity=0.343 Sum_probs=16.5
Q ss_pred EEEEEECCCCCcccccEEEEec
Q psy7340 317 VQGFVTDDLGNPVEKASLKIKG 338 (440)
Q Consensus 317 IsG~V~D~~GePL~GAtV~IkG 338 (440)
+.=+ +|.+|+|+.+|.|.+.-
T Consensus 24 i~v~-~~~~g~pv~~a~V~~~~ 44 (86)
T PF13115_consen 24 ITVT-VDQGGKPVTDADVQFEI 44 (86)
T ss_pred EEEE-ECCCCCCCCCCEEEEEE
Confidence 4434 77899999999998863
No 84
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=28.35 E-value=1.4e+02 Score=34.63 Aligned_cols=60 Identities=23% Similarity=0.321 Sum_probs=44.3
Q ss_pred eEEEEEECC-CCCcccccEEEEeccc---eeeeeCCCccEEEEec--CCc-eEEEEEEeccceeEEE
Q psy7340 316 GVQGFVTDD-LGNPVEKASLKIKGRD---VGFQTTKYGEFWRILL--PGI-YKLEVYADGYVPREID 375 (440)
Q Consensus 316 ~IsG~V~D~-~GePL~GAtV~IkGt~---~gt~TD~~G~F~l~lp--~G~-ytL~VS~IGY~~q~it 375 (440)
.|+-.|.|. +-+||++|.|.|-+.. ..+.|+.+|.-.+.++ .|. .+|+.+.-||.+....
T Consensus 2 ~lKV~V~d~~t~qpl~~A~V~V~~N~t~~~s~~t~~dG~~~l~~~yrlg~~l~v~a~k~gyv~ns~P 68 (807)
T PF10577_consen 2 TLKVQVSDASTRQPLSDASVEVFGNQTLTASGTTGNDGVVLLKFPYRLGTWLIVTASKDGYVPNSVP 68 (807)
T ss_pred eEEEEEecccCcccCCCCeEEEEeceeEeecceecCCceEEEEEeccCCCeEEEEEecCCccCCCCC
Confidence 367788894 7889999999987632 3468999999876655 454 4566677899876554
No 85
>PF07550 DUF1533: Protein of unknown function (DUF1533); InterPro: IPR011432 This domain is found duplicated in proteins of unknown function. The proteins typically also contain leucine-rich repeats.
Probab=27.51 E-value=1.5e+02 Score=23.26 Aligned_cols=20 Identities=20% Similarity=0.330 Sum_probs=16.3
Q ss_pred cCCceEEEEEEeccceeEEE
Q psy7340 356 LPGIYKLEVYADGYVPREID 375 (440)
Q Consensus 356 p~G~ytL~VS~IGY~~q~it 375 (440)
..|.|+|.|.+-||...+++
T Consensus 45 ~~G~~~I~I~A~GY~d~~~~ 64 (65)
T PF07550_consen 45 KDGENTIVIKATGYKDKTFT 64 (65)
T ss_pred cCCceEEEEEeCCccceEEE
Confidence 46899999999999876543
No 86
>COG3422 Uncharacterized conserved protein [Function unknown]
Probab=26.44 E-value=68 Score=25.50 Aligned_cols=31 Identities=19% Similarity=0.273 Sum_probs=26.0
Q ss_pred eeeCCCccEEEEecCCceEEEEEEeccceeE
Q psy7340 343 FQTTKYGEFWRILLPGIYKLEVYADGYVPRE 373 (440)
Q Consensus 343 t~TD~~G~F~l~lp~G~ytL~VS~IGY~~q~ 373 (440)
..-|.+|.|.+.+...+..+.++.-||.+..
T Consensus 7 i~kdk~Ge~rfrlkA~N~eiI~~sEgY~ska 37 (59)
T COG3422 7 IYKDKAGEYRFRLKAANGEIILTSEGYKSKA 37 (59)
T ss_pred EEEcCCCcEEEEEEccCccEEEeecccchhH
Confidence 4568999998888888888999999998754
No 87
>smart00561 MBT Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
Probab=24.33 E-value=37 Score=28.95 Aligned_cols=31 Identities=19% Similarity=0.524 Sum_probs=23.0
Q ss_pred eEecc-ccCCCCCee------eeeeeeeeecccCCCCC
Q psy7340 97 VKTPK-FWGQGGSYV------SIFPIRWAIRHSTPYRP 127 (440)
Q Consensus 97 vk~~~-~~~~~~~~~------~~~~~~~~~~~~~~~~~ 127 (440)
+..++ |||+.-||. .|||+-|+-.+..|-.|
T Consensus 58 l~v~~dg~~~~~D~W~~~~S~~I~PvGwc~~~g~~L~p 95 (96)
T smart00561 58 LLLHFDGWDDKYDFWCDADSPDIHPVGWCEKNGHPLQP 95 (96)
T ss_pred EEEEEccCCCcCCEEEECCCCCcccCchHHhcCCcCCC
Confidence 33444 488876664 59999999998888766
No 88
>COG0042 tRNA-dihydrouridine synthase [Translation, ribosomal structure and biogenesis]
Probab=23.98 E-value=69 Score=32.99 Aligned_cols=32 Identities=13% Similarity=0.170 Sum_probs=27.6
Q ss_pred CHHHHHHHHHHhhhhhc-ccceeEeccccCCCC
Q psy7340 76 GENELIGLLEQISNREE-KKSSVKTPKFWGQGG 107 (440)
Q Consensus 76 ~e~~l~~iL~~i~~~~~-~~~~vk~~~~~~~~~ 107 (440)
+-+-+.+|++.+..... .-.|||+|-||||+.
T Consensus 119 ~p~lv~~iv~a~~~av~~iPVTVKiRlG~d~~~ 151 (323)
T COG0042 119 NPELLAEIVKAMVEAVGDIPVTVKIRLGWDDDD 151 (323)
T ss_pred CHHHHHHHHHHHHHhhCCCCeEEEEecccCccc
Confidence 56788999999988774 789999999998886
No 89
>PHA02545 45 sliding clamp; Provisional
Probab=23.77 E-value=3.7e+02 Score=26.78 Aligned_cols=111 Identities=22% Similarity=0.236 Sum_probs=58.6
Q ss_pred ccchhhHHHHHHHHHHHHhcCCccccccchhHHHHHHHhhhhhhhccchhhhH--HHhhh-cceeEEEEEECCCCCcccc
Q psy7340 255 LCKPEKAQQIENMICQMAQTGQIMNKLGENELIGLLEQISNREEKKSSVKSLV--KFLAE-AHRGVQGFVTDDLGNPVEK 331 (440)
Q Consensus 255 lvkpe~a~~VE~~LiqLAqsG~It~KISDe~LkeIL~qis~qt~kKSsv~~l~--~~~~q-~~~~IsG~V~D~~GePL~G 331 (440)
+|+|++.-. +....+.+.++.+.+.++++..+ +..+. ++... ....++|.-.|.++.+=.-
T Consensus 100 Iv~P~k~i~--------~P~~dv~f~L~aedl~ql~kas~--------~l~l~dl~~~~~~G~ivi~~~~~~~~~~~sn~ 163 (223)
T PHA02545 100 IVKPKKRVT--------FPVADVEFELKAEDLQQLLKASR--------VLGLDDLAITNIDGKIVINLFNEDDKNDTSNS 163 (223)
T ss_pred EecCCcccC--------CCCccEEEEEcHHHHHHHHHHHH--------hcCCCeEEEEeeCCEEEEEEEeccCCCCCCCc
Confidence 456666544 56678889999999988875522 11111 11111 1123444444555544333
Q ss_pred cEEEEeccceeeeeCCCccE--EE-----EecCCceEEEEEEeccceeEEEEEEecCCceEEEEEEeccc
Q psy7340 332 ASLKIKGRDVGFQTTKYGEF--WR-----ILLPGIYKLEVYADGYVPREIDFMVVEQHPTLLNVTLHTSK 394 (440)
Q Consensus 332 AtV~IkGt~~gt~TD~~G~F--~l-----~lp~G~ytL~VS~IGY~~q~itItVt~~~~t~lnI~L~~~~ 394 (440)
.+|.+ .-++.++.| .+ .+++|+|++.++.-|-...+ +....+-|.|++++
T Consensus 164 ys~~v------ge~~g~~~F~f~~~~~Nmk~~~gdY~v~is~~g~a~F~-------~~~~~Y~IalE~~S 220 (223)
T PHA02545 164 YSLEV------GEYDGDNEFNFIIKIENMKLVPGDYKVSISKKGAAKFE-------GENLSYVIALEADS 220 (223)
T ss_pred EEEEe------ecccCCcceEEEEEeceeEecCCCeEEEEEccccceee-------cCCeeEEEEEccCC
Confidence 44443 122233365 22 25789999999988853322 22334556666543
No 90
>PF02820 MBT: mbt repeat; InterPro: IPR004092 The function of the malignant brain tumor (MBT) repeat is unknown, but is found in a number of nuclear proteins involved in transcriptional repression. The repeat contains a completely conserved glutamate at its amino terminus that may be important for function. The crystal structure of the two MBT repeats of human SCM-like 2 protein has been reported. Each repeat consists of an extended "arm" and a globular core. The arm of the first repeat packs against the core of the second repeat and vice versa. The structure of the core-interacting part of each arm consists of an N-terminal alpha-helix and a turn of 310 helix connected by a short beta-strand. The core consists of an Src homology 3-like five-stranded beta-barrel followed by a C-terminal alpha-helix and another short beta-strand. Each arm interacts with its partner core in a similar way, with the orientation of the N-terminal helix relative to the barrel varying slightly. There are also extensive interactions between the two barrels [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 2P0K_A 3F70_B 3CEY_A 2VYT_A 2BIV_A 1OI1_A 3OQ5_A 2RJE_B 2RJD_A 2RI3_A ....
Probab=23.18 E-value=11 Score=30.20 Aligned_cols=26 Identities=27% Similarity=0.799 Sum_probs=18.5
Q ss_pred ccCCCCCee------eeeeeeeeecccCCCCC
Q psy7340 102 FWGQGGSYV------SIFPIRWAIRHSTPYRP 127 (440)
Q Consensus 102 ~~~~~~~~~------~~~~~~~~~~~~~~~~~ 127 (440)
||+++.+|. .|||+-||-+|..+..|
T Consensus 33 g~~~~~d~w~~~~S~~i~PvGw~~~~g~~L~p 64 (73)
T PF02820_consen 33 GWDDDYDFWCHIDSPRIFPVGWCEKNGHPLQP 64 (73)
T ss_dssp TSTGGGEEEEETTSTTEEETTHHHHHT-EEE-
T ss_pred CCCCCccEEEECCCCCeeecchHHhcCCcccC
Confidence 477777664 48999999998876554
No 91
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=22.95 E-value=1.6e+02 Score=22.26 Aligned_cols=28 Identities=18% Similarity=0.208 Sum_probs=19.2
Q ss_pred ceeeeeCCCccEEEEe---cCCceEEEEEEe
Q psy7340 340 DVGFQTTKYGEFWRIL---LPGIYKLEVYAD 367 (440)
Q Consensus 340 ~~gt~TD~~G~F~l~l---p~G~ytL~VS~I 367 (440)
...+..|.+|.|.+.+ ..|.|.+.++..
T Consensus 3 ~~~~t~~~~G~Ws~t~~~~~dG~y~itv~a~ 33 (54)
T PF13754_consen 3 TYTTTVDSDGNWSFTVPALADGTYTITVTAT 33 (54)
T ss_pred EEEEEECCCCcEEEeCCCCCCccEEEEEEEE
Confidence 3456778999997654 347888777653
No 92
>PF07411 DUF1508: Domain of unknown function (DUF1508); InterPro: IPR010879 This domain is found in a family of proteins, which have no known function. Members of this family are often found as tandem repeats and in some cases represent the whole protein.; PDB: 3BID_H 2K49_A 2K8E_A 2K7I_A.
Probab=22.83 E-value=73 Score=23.86 Aligned_cols=27 Identities=22% Similarity=0.325 Sum_probs=19.2
Q ss_pred CCCccEEEEecCCceEEEEEEecccee
Q psy7340 346 TKYGEFWRILLPGIYKLEVYADGYVPR 372 (440)
Q Consensus 346 D~~G~F~l~lp~G~ytL~VS~IGY~~q 372 (440)
|.+|.|.+.+..++..++.+.-+|.+.
T Consensus 1 d~~g~~~f~L~a~ng~viasse~Y~sk 27 (49)
T PF07411_consen 1 DSDGQFRFRLKAGNGEVIASSEGYSSK 27 (49)
T ss_dssp ETTSEEEEEEE-TTS-EEEEBEEBSSH
T ss_pred CCCCCEEEEEEcCCCCEEEecCCcCCH
Confidence 578999777767766777798888764
No 93
>PF01399 PCI: PCI domain; InterPro: IPR000717 A homology domain of unclear function, occurs in the C-terminal region of several regulatory components of the 26S proteasome as well as in other proteins. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) []. Apparently, all of the characterised proteins containing PCI domains are parts of larger multi-protein complexes. Proteins with PCI domains include budding yeast proteasome regulatory components Rpn3(Sun2), Rpn5, Rpn6, Rpn7and Rpn9 []; mammalian proteasome regulatory components p55, p58 and p44.5, and translation initiation factor 3 complex subunits p110 and INT6 [, ]; Arabidopsis COP9 and FUS6/COP11 []; mammalian G-protein pathway suppressor GPS1, and several uncharacterised ORFs from plant, nematodes and mammals. The complete homology domain comprises approx. 200 residues, the highest conservation is found in the C-terminal half. Several of the proteins mentioned above have no detectable homology to the N-terminal half of the domain.; GO: 0005515 protein binding; PDB: 3TXM_A 3TXN_A 1UFM_A 3CHM_A 3T5X_A 3T5V_B.
Probab=21.52 E-value=46 Score=26.96 Aligned_cols=22 Identities=18% Similarity=0.514 Sum_probs=20.5
Q ss_pred HHHHHHHHHHHHcCCCCCCCCH
Q psy7340 56 QQIENMICQMAQTGQIMNKLGE 77 (440)
Q Consensus 56 ~~vE~~li~~aq~G~i~~kv~e 77 (440)
..||..|+.|...|.|.++||.
T Consensus 75 ~~vE~~l~~~I~~~~i~~~ID~ 96 (105)
T PF01399_consen 75 EEVESILIDLISNGLIKAKIDQ 96 (105)
T ss_dssp HHHHHHHHHHHHTTSSEEEEET
T ss_pred HHHHHHHHHHHHCCCEEEEEEC
Confidence 8999999999999999999874
No 94
>cd00390 Urease_gamma Urease gamma-subunit; Urease is a nickel-dependent metalloenzyme that catalyzes the hydrolysis of urea to form ammonia and carbon dioxide. Nickel-dependent ureases are found in bacteria, archaea, fungi and plants. Their primary role is to allow the use of external and internally-generated urea as a nitrogen source. The enzyme consists of three subunits, alpha, beta and gamma, which can exist as separate proteins or can be fused on a single protein chain. The alpha-beta-gamma heterotrimer forms multimers, mainly trimers. The large alpha subunit is the catalytic domain containing an active site with a bi-nickel center complexed by a carbamylated lysine. The beta and gamma subunits play a role in subunit association to form the higher order trimers.
Probab=20.39 E-value=1.3e+02 Score=26.22 Aligned_cols=34 Identities=32% Similarity=0.410 Sum_probs=30.7
Q ss_pred CHHHHHHhhh-ccccchhhHHHHHHHHHHHHhcCC
Q psy7340 243 DQSARARLNT-IMLCKPEKAQQIENMICQMAQTGQ 276 (440)
Q Consensus 243 t~eAreRL~~-i~lvkpe~a~~VE~~LiqLAqsG~ 276 (440)
-.=|+.|++| ++|-.||-..-|-..+.-.|..|.
T Consensus 14 ~~lA~~R~~rGlkLN~pEAvAlIs~~v~E~aRdG~ 48 (96)
T cd00390 14 AELARKRLARGLKLNYPEAVALIADEILEGARDGK 48 (96)
T ss_pred HHHHHHHHHcCcccCcHHHHHHHHHHHHHHhhcCC
Confidence 3458899998 999999999999999999999997
Done!