Query psy14949
Match_columns 102
No_of_seqs 121 out of 1155
Neff 9.4
Searched_HMMs 46136
Date Fri Aug 16 16:34:25 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14949.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14949hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PTZ00405 cytochrome c; Provisi 100.0 3E-31 6.6E-36 157.3 9.6 102 1-102 12-113 (114)
2 COG3474 Cytochrome c2 [Energy 100.0 2.4E-30 5.2E-35 153.8 8.9 100 1-101 29-130 (135)
3 PTZ00048 cytochrome c; Provisi 100.0 7.6E-29 1.6E-33 147.5 10.1 101 1-102 13-113 (115)
4 KOG3453|consensus 99.8 2.7E-19 5.8E-24 103.5 3.8 100 1-100 7-106 (110)
5 TIGR03872 cytochrome_MoxG cyto 99.7 2.7E-17 5.8E-22 100.0 8.0 79 2-102 43-122 (133)
6 TIGR02603 CxxCH_TIGR02603 puta 99.7 3.3E-16 7.2E-21 95.4 7.4 35 1-36 1-35 (133)
7 TIGR03045 PS_II_C550 cytochrom 99.6 6.4E-16 1.4E-20 96.1 7.3 85 1-101 49-147 (159)
8 PF13442 Cytochrome_CBB3: Cyto 99.6 2.6E-16 5.7E-21 85.3 4.4 67 1-98 1-67 (67)
9 PF00034 Cytochrom_C: Cytochro 99.6 2.3E-18 5.1E-23 97.1 -4.0 89 3-102 1-91 (91)
10 PRK13618 psbV cytochrome c-550 99.6 1.7E-15 3.6E-20 94.4 8.1 86 1-101 50-148 (163)
11 CHL00183 petJ cytochrome c553; 99.6 1.2E-15 2.7E-20 89.8 6.9 80 1-101 24-103 (108)
12 CHL00133 psbV photosystem II c 99.6 1.3E-15 2.8E-20 94.9 7.0 81 2-101 51-148 (163)
13 PRK13621 psbV cytochrome c-550 99.6 9.4E-16 2E-20 95.5 6.0 85 2-101 65-154 (170)
14 TIGR03046 PS_II_psbV2 photosys 99.6 1.3E-15 2.8E-20 94.0 6.2 85 2-101 54-143 (155)
15 PRK13617 psbV cytochrome c-550 99.6 1.7E-15 3.7E-20 94.5 6.7 84 2-101 58-155 (170)
16 PRK13697 cytochrome c6; Provis 99.6 5.1E-15 1.1E-19 87.4 6.8 80 1-101 26-105 (111)
17 PRK13620 psbV cytochrome c-550 99.6 1.4E-14 2.9E-19 92.5 7.2 82 2-100 103-199 (215)
18 COG2863 Cytochrome c553 [Energ 99.6 6.1E-15 1.3E-19 87.6 4.6 78 1-102 22-101 (121)
19 TIGR03874 4cys_cytochr c-type 99.5 2.8E-14 6E-19 87.3 7.3 77 4-101 34-115 (143)
20 TIGR00782 ccoP cytochrome c ox 99.5 4.9E-14 1.1E-18 95.4 7.8 79 3-102 203-282 (285)
21 PRK13619 psbV cytochrome c-550 99.5 7E-14 1.5E-18 86.0 6.2 85 1-100 49-146 (160)
22 TIGR00782 ccoP cytochrome c ox 99.5 2.8E-13 6.1E-18 91.7 8.4 80 4-102 108-193 (285)
23 PRK14486 putative bifunctional 99.5 3.6E-13 7.8E-18 91.4 8.0 76 3-102 215-291 (294)
24 PRK14487 cbb3-type cytochrome 99.4 2.8E-13 6E-18 87.3 6.1 98 4-102 52-159 (217)
25 PRK13622 psbV cytochrome c-550 99.3 5.1E-12 1.1E-16 79.7 7.0 85 2-101 61-158 (180)
26 COG3258 Cytochrome c [Energy p 99.3 1E-11 2.3E-16 81.9 7.2 82 2-101 161-251 (293)
27 COG4654 Cytochrome c551/c552 [ 99.3 6.7E-12 1.4E-16 71.8 3.8 83 1-101 21-105 (110)
28 PF14495 Cytochrom_C550: Cytoc 99.3 1.4E-11 3E-16 73.7 5.1 84 2-100 24-120 (135)
29 COG2010 CccA Cytochrome c, mon 99.2 1.2E-11 2.7E-16 76.1 4.2 19 3-21 51-69 (150)
30 TIGR00781 ccoO cytochrome c ox 99.1 2.4E-10 5.2E-15 74.4 5.9 77 4-81 51-135 (232)
31 PRK14486 putative bifunctional 99.1 4.6E-10 9.9E-15 76.4 6.6 75 3-81 51-136 (294)
32 TIGR03791 TTQ_mauG tryptophan 98.9 1.4E-08 3E-13 69.1 7.5 99 3-102 159-279 (291)
33 COG2857 CYT1 Cytochrome c1 [En 98.4 1.3E-06 2.8E-11 58.4 6.5 41 61-101 172-212 (250)
34 PF02433 FixO: Cytochrome C ox 98.3 5.9E-06 1.3E-10 54.1 7.0 74 4-81 51-135 (226)
35 PF09098 Dehyd-heme_bind: Quin 98.1 1.6E-06 3.5E-11 54.0 1.3 19 5-23 2-20 (167)
36 PRK14485 putative bifunctional 98.0 2.8E-05 6E-10 58.7 7.4 94 4-101 538-647 (712)
37 PF06537 DUF1111: Protein of u 98.0 2.8E-06 6.1E-11 61.3 1.2 20 2-21 362-382 (499)
38 PF09086 DUF1924: Domain of un 97.8 2.3E-05 5.1E-10 44.8 3.1 20 2-21 10-38 (98)
39 COG3245 CycB Cytochrome c5 [En 97.8 8.8E-05 1.9E-09 43.9 5.3 17 5-21 49-65 (126)
40 COG2993 CcoO Cbb3-type cytochr 97.7 6.4E-06 1.4E-10 52.8 -0.0 53 49-101 105-159 (227)
41 PF02167 Cytochrom_C1: Cytochr 97.7 2.9E-05 6.3E-10 51.0 2.2 20 2-21 14-33 (219)
42 TIGR03806 chp_HNE_0200 conserv 97.4 0.00059 1.3E-08 47.2 6.3 36 63-101 280-317 (317)
43 PF10643 Cytochrome-c551: Phot 97.2 7.7E-05 1.7E-09 48.3 0.3 20 2-21 168-187 (233)
44 COG1858 MauG Cytochrome c pero 97.0 0.0022 4.8E-08 45.1 5.5 18 4-21 221-240 (364)
45 TIGR02162 torC trimethylamine- 97.0 0.00076 1.7E-08 47.8 3.0 18 4-21 322-339 (386)
46 PF14376 Haem_bd: Haem-binding 96.9 0.0024 5.3E-08 39.1 4.5 20 83-102 118-137 (137)
47 COG3748 Predicted membrane pro 96.8 0.0068 1.5E-07 42.1 6.3 16 85-100 387-402 (407)
48 KOG3052|consensus 96.7 0.00068 1.5E-08 45.3 0.8 18 3-20 95-112 (311)
49 PRK15032 trimethylamine N-oxid 96.6 0.0019 4E-08 46.0 2.7 18 4-21 319-336 (390)
50 TIGR03791 TTQ_mauG tryptophan 96.5 0.00096 2.1E-08 45.7 0.9 20 3-22 6-34 (291)
51 COG3258 Cytochrome c [Energy p 95.9 0.024 5.1E-07 38.3 4.8 14 87-100 121-134 (293)
52 PF03150 CCP_MauG: Di-haem cyt 94.9 0.014 3E-07 36.7 1.4 19 3-21 5-32 (159)
53 PF09626 DHC: Dihaem cytochrom 94.2 0.097 2.1E-06 31.4 3.8 11 10-20 1-11 (120)
54 COG3488 Predicted thiol oxidor 94.1 0.028 6E-07 39.4 1.5 20 2-21 350-370 (481)
55 COG2857 CYT1 Cytochrome c1 [En 91.9 0.024 5.2E-07 38.1 -1.4 21 2-22 42-62 (250)
56 PF07635 PSCyt1: Planctomycete 91.0 0.13 2.7E-06 26.9 1.1 9 14-22 1-9 (59)
57 PF06537 DUF1111: Protein of u 85.1 0.55 1.2E-05 34.7 1.6 17 9-25 66-83 (499)
58 COG1858 MauG Cytochrome c pero 84.8 0.54 1.2E-05 33.4 1.4 20 3-22 62-90 (364)
59 PF09832 DUF2059: Uncharacteri 83.0 1.2 2.7E-05 23.2 2.0 17 84-100 15-31 (64)
60 PF07627 PSCyt3: Protein of un 79.6 0.75 1.6E-05 26.8 0.5 8 13-20 71-78 (101)
61 TIGR03153 cytochr_NrfH cytochr 78.9 0.77 1.7E-05 28.0 0.4 16 5-20 96-111 (135)
62 PF08090 Enterotoxin_HS1: Heat 77.0 0.88 1.9E-05 20.7 0.2 10 12-21 20-29 (36)
63 COG3488 Predicted thiol oxidor 75.0 1.3 2.8E-05 31.4 0.7 19 9-27 93-112 (481)
64 PF07583 PSCyt2: Protein of un 73.9 1.5 3.2E-05 28.8 0.7 7 13-19 176-182 (208)
65 PF11845 DUF3365: Protein of u 71.0 1.2 2.7E-05 28.1 -0.2 8 12-19 147-154 (188)
66 PF15182 OTOS: Otospiralin 64.9 8.9 0.00019 20.4 2.3 23 78-101 15-37 (69)
67 COG2996 Predicted RNA-bindinin 62.4 15 0.00032 25.4 3.6 31 60-95 226-256 (287)
68 PF02304 Phage_B: Scaffold pro 62.3 6.1 0.00013 23.2 1.6 18 2-19 63-82 (117)
69 TIGR01904 GSu_C4xC__C2xCH Geob 61.2 3.2 7E-05 20.2 0.3 6 13-18 37-42 (42)
70 TIGR01905 paired_CXXCH_1 doubl 60.3 3.5 7.6E-05 19.8 0.3 9 13-21 8-16 (41)
71 PHA00003 B internal scaffoldin 58.2 8.2 0.00018 22.7 1.6 18 2-19 66-85 (120)
72 PF09699 Paired_CXXCH_1: Doubl 56.9 3.4 7.4E-05 19.5 -0.1 9 13-21 8-16 (41)
73 PF06943 zf-LSD1: LSD1 zinc fi 54.4 5.7 0.00012 17.1 0.4 8 13-20 18-25 (25)
74 COG3043 NapB Nitrate reductase 53.3 11 0.00023 23.6 1.7 8 13-20 88-95 (155)
75 PF12162 STAT1_TAZ2bind: STAT1 53.2 18 0.0004 15.1 2.1 18 77-98 5-22 (23)
76 PF09722 DUF2384: Protein of u 52.8 11 0.00023 18.9 1.4 41 60-101 10-50 (54)
77 PF13822 ACC_epsilon: Acyl-CoA 52.5 15 0.00033 19.2 2.0 16 86-101 11-26 (62)
78 cd07321 Extradiol_Dioxygenase_ 52.2 7.7 0.00017 21.3 0.8 31 60-96 15-45 (77)
79 COG3445 Acid-induced glycyl ra 52.1 18 0.00039 21.1 2.3 38 58-96 77-120 (127)
80 TIGR02826 RNR_activ_nrdG3 anae 51.2 36 0.00078 21.0 3.8 9 13-21 28-36 (147)
81 PHA02902 putative IMV membrane 50.6 20 0.00043 19.1 2.2 20 81-100 49-68 (70)
82 PF14769 CLAMP: Flagellar C1a 50.3 47 0.001 18.9 4.1 18 83-100 61-78 (101)
83 PF04320 DUF469: Protein with 49.3 25 0.00054 20.5 2.7 18 84-101 68-85 (101)
84 PRK11586 napB nitrate reductas 48.5 13 0.00029 23.1 1.6 6 14-19 83-88 (149)
85 cd07921 PCA_45_Doxase_A_like S 47.7 11 0.00024 22.2 1.0 31 60-96 25-55 (106)
86 TIGR01053 LSD1 zinc finger dom 46.4 6.6 0.00014 17.7 0.0 9 13-21 21-29 (31)
87 PHA02119 hypothetical protein 46.4 14 0.00031 20.0 1.3 10 92-101 57-66 (87)
88 cd06395 PB1_Map2k5 PB1 domain 44.6 21 0.00046 20.0 1.8 15 86-100 57-71 (91)
89 COG3303 NrfA Formate-dependent 44.5 8.2 0.00018 27.9 0.2 14 8-21 327-340 (501)
90 PF14522 Cytochrome_C7: Cytoch 44.1 9.2 0.0002 19.5 0.3 9 12-20 13-21 (65)
91 PF03892 NapB: Nitrate reducta 43.9 12 0.00027 22.9 0.9 7 14-20 72-78 (133)
92 COG3184 Uncharacterized protei 43.1 12 0.00025 24.1 0.7 19 82-100 104-122 (183)
93 PF11256 DUF3055: Protein of u 42.5 30 0.00066 19.3 2.2 15 87-101 67-81 (81)
94 PF07128 DUF1380: Protein of u 41.5 69 0.0015 19.8 3.9 33 57-100 25-57 (139)
95 PHA02681 ORF089 virion membran 41.0 31 0.00067 19.4 2.1 20 81-100 47-66 (92)
96 PF14537 Cytochrom_c3_2: Cytoc 40.9 14 0.0003 19.5 0.7 8 13-20 8-15 (80)
97 PF13435 Cytochrome_C554: Cyto 39.5 13 0.00029 21.1 0.6 9 13-21 50-58 (130)
98 COG3005 TorC Nitrate/TMAO redu 39.3 12 0.00026 24.3 0.4 10 12-21 135-144 (190)
99 PRK11702 hypothetical protein; 38.9 41 0.0009 19.9 2.5 18 84-101 75-92 (108)
100 PF06252 DUF1018: Protein of u 38.9 34 0.00073 20.2 2.2 13 87-99 56-68 (119)
101 CHL00037 petA cytochrome f 38.8 10 0.00023 26.4 0.0 8 14-21 56-63 (320)
102 PRK02693 apocytochrome f; Revi 37.9 11 0.00024 26.1 0.1 8 14-21 49-56 (312)
103 PF07102 DUF1364: Protein of u 37.2 8.4 0.00018 22.2 -0.5 8 13-20 53-60 (94)
104 PF13453 zf-TFIIB: Transcripti 37.0 11 0.00024 17.8 -0.0 8 13-20 21-28 (41)
105 PF13821 DUF4187: Domain of un 36.6 32 0.0007 17.6 1.7 15 87-101 12-26 (55)
106 TIGR02161 napC_nirT periplasmi 36.2 15 0.00032 23.8 0.4 10 12-21 136-145 (185)
107 PF10058 DUF2296: Predicted in 36.2 14 0.0003 18.9 0.3 10 13-22 24-33 (54)
108 smart00386 HAT HAT (Half-A-TPR 36.1 31 0.00066 14.2 1.4 18 1-18 1-18 (33)
109 cd08168 Cytochrom_C3 Heme-bind 35.8 13 0.00029 20.3 0.2 14 8-21 55-68 (85)
110 PF03264 Cytochrom_NNT: NapC/N 35.6 19 0.0004 22.7 0.8 16 6-21 115-134 (173)
111 PF02085 Cytochrom_CIII: Class 35.3 15 0.00032 20.9 0.3 9 13-21 78-86 (102)
112 cd07922 CarBa CarBa is the A s 33.6 22 0.00048 19.8 0.8 31 60-96 16-46 (81)
113 KOG3507|consensus 33.5 17 0.00037 19.0 0.3 9 13-21 22-30 (62)
114 PF08263 LRRNT_2: Leucine rich 33.2 54 0.0012 15.3 2.1 14 88-101 2-15 (43)
115 PF13986 DUF4224: Domain of un 33.1 26 0.00057 17.3 1.0 13 85-97 2-14 (47)
116 PF02335 Cytochrom_C552: Cytoc 32.8 20 0.00044 26.3 0.7 9 11-19 272-280 (434)
117 TIGR03092 SASP_sspI small, aci 32.7 46 0.00099 17.8 1.9 41 60-100 14-62 (65)
118 PF11829 DUF3349: Protein of u 32.4 50 0.0011 19.1 2.1 16 84-99 34-49 (96)
119 KOG1200|consensus 31.7 59 0.0013 21.8 2.6 20 81-100 218-237 (256)
120 PF10955 DUF2757: Protein of u 31.6 13 0.00028 20.5 -0.3 13 86-98 32-44 (76)
121 TIGR03152 cyto_c552_HCOOH form 31.5 18 0.00039 26.6 0.3 11 10-20 273-283 (439)
122 PF06883 RNA_pol_Rpa2_4: RNA p 30.3 46 0.001 17.2 1.6 17 86-102 4-20 (58)
123 cd07923 Gallate_dioxygenase_C 30.2 26 0.00057 20.1 0.8 31 60-96 17-47 (94)
124 PRK10617 cytochrome c-type pro 30.1 20 0.00043 23.5 0.3 10 12-21 145-154 (200)
125 CHL00136 rpl31 ribosomal prote 29.8 21 0.00045 19.2 0.3 8 12-19 36-43 (68)
126 TIGR03631 bact_S13 30S ribosom 29.7 61 0.0013 19.1 2.3 17 84-100 44-60 (113)
127 PF07295 DUF1451: Protein of u 29.4 55 0.0012 20.3 2.1 36 60-100 12-47 (146)
128 COG1579 Zn-ribbon protein, pos 29.0 20 0.00044 24.2 0.2 8 13-20 199-206 (239)
129 COG0099 RpsM Ribosomal protein 28.9 63 0.0014 19.5 2.2 17 84-100 46-62 (121)
130 TIGR03146 cyt_nit_nrfB cytochr 28.8 19 0.00041 22.1 0.0 8 13-20 42-49 (145)
131 CHL00137 rps13 ribosomal prote 28.8 65 0.0014 19.4 2.3 17 84-100 46-62 (122)
132 PRK00019 rpmE 50S ribosomal pr 28.5 22 0.00048 19.3 0.3 8 12-19 36-43 (72)
133 PF14098 SSPI: Small, acid-sol 28.3 68 0.0015 17.1 2.1 41 60-100 15-63 (65)
134 PF07757 AdoMet_MTase: Predict 28.2 36 0.00078 20.2 1.1 13 89-102 38-50 (112)
135 PF05927 Penaeidin: Penaeidin; 27.9 27 0.00059 18.8 0.5 10 12-21 44-53 (73)
136 PF11293 DUF3094: Protein of u 27.6 77 0.0017 16.3 2.1 16 85-100 3-18 (55)
137 cd04860 AE_Prim_S AE_Prim_S: p 27.3 59 0.0013 21.8 2.2 16 85-100 150-165 (232)
138 TIGR03823 FliZ flagellar regul 27.0 19 0.00042 22.7 -0.2 9 12-20 23-31 (168)
139 PF09695 YtfJ_HI0045: Bacteria 26.9 74 0.0016 20.2 2.4 17 84-100 143-159 (160)
140 PF13099 DUF3944: Domain of un 26.7 81 0.0018 14.7 2.0 14 85-98 12-25 (35)
141 PRK05179 rpsM 30S ribosomal pr 26.5 75 0.0016 19.1 2.3 17 84-100 46-62 (122)
142 PF12123 Amidase02_C: N-acetyl 26.5 81 0.0017 15.5 2.0 17 84-100 22-38 (45)
143 PRK11659 cytochrome c nitrite 26.4 22 0.00048 22.8 0.0 9 13-21 74-82 (183)
144 TIGR02251 HIF-SF_euk Dullard-l 26.3 80 0.0017 19.5 2.5 22 81-102 137-158 (162)
145 PRK02955 small acid-soluble sp 26.1 68 0.0015 17.3 1.8 41 60-100 17-65 (68)
146 COG1365 Predicted ATPase (PP-l 25.8 19 0.00041 24.1 -0.3 7 14-20 135-141 (255)
147 PF09601 DUF2459: Protein of u 25.8 69 0.0015 20.4 2.2 15 86-100 98-112 (173)
148 PF12408 DUF3666: Ribose-5-pho 25.7 53 0.0012 16.4 1.3 12 90-101 37-48 (48)
149 cd03271 ABC_UvrA_II The excisi 25.6 29 0.00063 23.5 0.5 20 81-101 199-218 (261)
150 PRK01397 50S ribosomal protein 25.6 27 0.00059 19.3 0.3 8 12-19 35-42 (78)
151 TIGR03629 arch_S13P archaeal r 25.5 79 0.0017 19.6 2.4 17 84-100 50-66 (144)
152 PRK11125 nrfA cytochrome c nit 25.5 27 0.00058 26.1 0.3 10 10-19 311-320 (480)
153 KOG1605|consensus 25.2 1.3E+02 0.0028 20.7 3.5 22 81-102 226-247 (262)
154 PF09343 DUF2460: Conserved hy 25.1 84 0.0018 20.6 2.5 17 85-101 50-66 (198)
155 PRK11032 hypothetical protein; 25.1 74 0.0016 20.1 2.2 17 84-100 41-57 (160)
156 PF09383 NIL: NIL domain; Int 25.0 81 0.0017 16.7 2.1 15 87-101 55-69 (76)
157 PRK11582 flagella biosynthesis 24.6 22 0.00048 22.5 -0.2 9 12-20 23-31 (169)
158 PF13709 DUF4159: Domain of un 24.6 73 0.0016 20.8 2.2 16 85-100 66-81 (207)
159 COG0633 Fdx Ferredoxin [Energy 24.2 27 0.00059 20.1 0.1 8 13-20 44-51 (102)
160 PF15332 LIME1: Lck-interactin 24.0 29 0.00063 22.9 0.2 8 14-21 2-9 (228)
161 PTZ00134 40S ribosomal protein 23.6 87 0.0019 19.7 2.3 17 84-100 59-75 (154)
162 TIGR00778 ahpD_dom alkylhydrop 23.5 82 0.0018 15.0 1.8 11 87-97 38-48 (50)
163 PF14053 DUF4248: Domain of un 23.2 79 0.0017 16.9 1.8 18 81-98 49-66 (69)
164 TIGR00105 L31 ribosomal protei 23.1 32 0.0007 18.4 0.3 8 12-19 36-43 (68)
165 PF15161 Neuropep_like: Neurop 23.0 27 0.00058 18.2 -0.1 16 5-20 5-22 (65)
166 PRK13696 hypothetical protein; 23.0 25 0.00053 18.6 -0.2 19 77-95 41-59 (62)
167 PF04270 Strep_his_triad: Stre 22.9 1E+02 0.0022 15.7 2.1 17 84-100 36-52 (53)
168 PRK00528 rpmE 50S ribosomal pr 22.8 33 0.00071 18.5 0.3 8 12-19 38-45 (71)
169 PF13447 Multi-haem_cyto: Seve 22.8 46 0.00099 22.8 1.0 12 8-19 210-221 (267)
170 PF01197 Ribosomal_L31: Riboso 22.5 34 0.00074 18.3 0.3 9 12-20 37-45 (69)
171 PRK04053 rps13p 30S ribosomal 22.4 1E+02 0.0022 19.3 2.4 17 84-100 54-70 (149)
172 PRK01678 rpmE2 50S ribosomal p 22.3 34 0.00074 19.4 0.3 8 12-19 49-56 (87)
173 PF11116 DUF2624: Protein of u 22.2 1.3E+02 0.0029 16.9 2.6 15 86-100 31-45 (85)
174 PF09286 Pro-kuma_activ: Pro-k 21.8 87 0.0019 18.8 2.0 14 87-100 60-73 (143)
175 cd07347 harmonin_N_like N-term 21.4 98 0.0021 17.1 2.0 17 84-100 14-30 (78)
176 TIGR03507 decahem_SO1788 decah 21.4 37 0.00079 26.4 0.4 10 12-21 294-303 (664)
177 PF14317 YcxB: YcxB-like prote 21.3 95 0.0021 15.1 1.9 15 85-99 48-62 (62)
178 PF06648 DUF1160: Protein of u 20.7 66 0.0014 19.4 1.3 14 86-99 99-112 (122)
179 PF09966 DUF2200: Uncharacteri 20.6 95 0.0021 18.4 1.9 13 87-99 24-36 (111)
180 PF15614 WHIM3: WSTF, HB1, Itc 20.2 1.2E+02 0.0025 15.1 1.9 12 87-98 6-17 (46)
181 PF07746 LigA: Aromatic-ring-o 20.2 31 0.00067 19.5 -0.2 31 60-96 10-40 (88)
182 COG5204 SPT4 Transcription elo 20.1 1.5E+02 0.0032 17.2 2.5 36 60-95 62-97 (112)
183 PF02591 DUF164: Putative zinc 20.1 39 0.00085 17.0 0.2 7 13-19 24-30 (56)
No 1
>PTZ00405 cytochrome c; Provisional
Probab=99.97 E-value=3e-31 Score=157.31 Aligned_cols=102 Identities=53% Similarity=1.045 Sum_probs=95.2
Q ss_pred CcHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCCCCcc
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIPGTKM 80 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~m 80 (102)
||+++|+++|..+|++||.++..+.+.+||+|.++.++..++.++|.|+..+...++.|+.+.|..||.+|..+.|++.|
T Consensus 12 gd~~~G~~lF~~~C~aCH~~~~~~~~~vGP~L~gv~gR~~g~~~~~~YS~al~~~g~~wd~~~L~~~l~~P~~~~pgt~M 91 (114)
T PTZ00405 12 GDAERGEKLFKGRAAQCHTATKGGSNGVGPNLFGIVNRKSGTVEGFAYSKANADSGVIWTPEVLDVYLENPKKFMPGTKM 91 (114)
T ss_pred cCHHHHHHHHHhhhHhhCCCCCCCCCCcCCCccccccCccccccCccccHHHHhccCcCCHHHHHHHHHCHHhhCCCCCC
Confidence 57899999999889999998755567899999999999999999999999999999999999999999999999999999
Q ss_pred CCCCCCCHHHHHHHHHHHhhcC
Q psy14949 81 VFPGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~sl~ 102 (102)
+|.++.+++|+.+|++||++|+
T Consensus 92 ~f~gl~~~~dr~~liaYL~sl~ 113 (114)
T PTZ00405 92 SFAGIKKPQERADVIAYLETLK 113 (114)
T ss_pred CCCCCCCHHHHHHHHHHHHHhc
Confidence 9999989999999999999985
No 2
>COG3474 Cytochrome c2 [Energy production and conversion]
Probab=99.97 E-value=2.4e-30 Score=153.80 Aligned_cols=100 Identities=53% Similarity=1.058 Sum_probs=95.7
Q ss_pred CcHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhc--CccccHHHHHHHHhCCCCCCCCC
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNK--GITWSASTLDIYLQNPKKYIPGT 78 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~~~~~~~~~~~~ 78 (102)
||+..|+.+|++ |.+||+++.++.+.+||.|.++.+|..++.++|.|+..+... ++.|+++.|..||++|+.+.|||
T Consensus 29 ~da~~G~~vFkk-C~~CH~i~~~g~nkvGP~L~gVvGR~ags~egf~YS~Amk~~~~g~vWd~~~L~~fL~~Pkk~vpGT 107 (135)
T COG3474 29 GDAAAGEKVFKK-CQACHSIEKGGPNKVGPHLWGVVGRPAGSVEGFSYSAAMKKAGGGIVWDEDNLDEFLTAPKKYVPGT 107 (135)
T ss_pred ccHHHhHHHHHH-HHHhhccccCCCCCCCCccccccCccccccCCcccCHHHHhccCCcccCHHHHHHHHhChhhhCCCc
Confidence 588999999987 999999998888999999999999999999999999999988 89999999999999999999999
Q ss_pred ccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 79 KMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 79 ~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
.|.|.++-.++|+.+||+||+++
T Consensus 108 kM~faGlkk~~dradlIAYLk~~ 130 (135)
T COG3474 108 KMAFAGLKKDQDRADLIAYLKSL 130 (135)
T ss_pred ceeecCCCCHHHHHHHHHHHHhc
Confidence 99999998999999999999985
No 3
>PTZ00048 cytochrome c; Provisional
Probab=99.96 E-value=7.6e-29 Score=147.48 Aligned_cols=101 Identities=58% Similarity=1.001 Sum_probs=92.4
Q ss_pred CcHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCCCCcc
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIPGTKM 80 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~m 80 (102)
+++++|+++|+++|++||++++.+...+||+|.++.++..+. ++|.|+..+...++.|+++.|..||.+|+...|++.|
T Consensus 13 ~~~~~G~~~f~~~C~~CH~~~~~g~~~~GP~L~Gi~gR~~g~-~~~~ys~~~~~~g~~wt~~~L~~~l~~P~~~~pgt~M 91 (115)
T PTZ00048 13 GDAKKGAKLFKAKCAQCHTINKGGAVKQGPNLHGFYGRKSGS-ADFPYSDANKNSGIVWSDKHLFEYLVNPKLYIPGTKM 91 (115)
T ss_pred ccHHHHHHHHHhhhhhcCCCcCCCCCCcCCcccccccccccC-CCCccchhhhhcccccCHHHHHHHHhCcCccCCCCcc
Confidence 578899999998999999998766678999999999998887 7888888887888999999999999999999999999
Q ss_pred CCCCCCCHHHHHHHHHHHhhcC
Q psy14949 81 VFPGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~sl~ 102 (102)
+|.++.+++|+.+|++||+++.
T Consensus 92 ~~~gl~~~~~~~~liaYL~s~~ 113 (115)
T PTZ00048 92 VFAGIKKEKERADLIAYLKEAS 113 (115)
T ss_pred CcCCCCCHHHHHHHHHHHHHhc
Confidence 9999888999999999999873
No 4
>KOG3453|consensus
Probab=99.77 E-value=2.7e-19 Score=103.49 Aligned_cols=100 Identities=64% Similarity=1.118 Sum_probs=93.0
Q ss_pred CcHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCCCCcc
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIPGTKM 80 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~m 80 (102)
+|++.|+.+|.+.|+.||..+..+....+|++.++.++..+...+|.++.....-++.|.++.|..++.+|..+.||+-|
T Consensus 7 ~d~~~g~~~f~~rc~qch~~~~~~~~k~~p~l~gl~g~~~g~~~~~sy~~a~KnKgV~wgE~tl~eyLenpkkyipGtKm 86 (110)
T KOG3453|consen 7 GDVEKGKKIFPQRCAQCHTVEKGGFHKTGPNLHGLFGRQLGQAAGLSYTDANKNKGVTWGEDTLMEYLENPKKYIPGTKM 86 (110)
T ss_pred cccccccccceeeccccccccCCcccccCCcchhhHHHhhccccCcceeecccCCceEEcchhHHHHHhCCCccccccce
Confidence 57889999999999999999877777899999999999999999999999999999999999999999999999999999
Q ss_pred CCCCCCCHHHHHHHHHHHhh
Q psy14949 81 VFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~s 100 (102)
-|.++-...|..++++||.+
T Consensus 87 ifaGikk~~eraDlIayl~k 106 (110)
T KOG3453|consen 87 IFAGIKKKAERADLIAYLKK 106 (110)
T ss_pred eecccCchHHHHHHHHHHHH
Confidence 99988666999999999975
No 5
>TIGR03872 cytochrome_MoxG cytochrome c(L), periplasmic. This model describes a periplasmic c-type cytochrome that serves as the primary electron acceptor for the quinoprotein methanol dehydrogenase, a PQQ enzyme. The member from Paracoccus denitrificans is also characterized as an electron acceptor for methylamine dehydrogenase, a tryptophan tryptophylquinone enzyme. This protein is called cytochrome c(L) in methylotrophic bacteria such Methylobacterium extorquens, but c551i in Paracoccus denitrificans.
Probab=99.73 E-value=2.7e-17 Score=100.00 Aligned_cols=79 Identities=19% Similarity=0.259 Sum_probs=55.3
Q ss_pred cHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCCCCcc-
Q psy14949 2 SVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIPGTKM- 80 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~m- 80 (102)
.++.|+++|..+|++||+.++.|. .+|+|.+....... ....+.+...+.++. ..+|
T Consensus 43 ~~a~G~~ly~~~CAaCHG~~g~G~--~gP~L~~~~~~~~~----------------~~~~~~l~~~i~~G~----~g~Mp 100 (133)
T TIGR03872 43 ALKKGESLFATACSGCHGHLAEGK--LGPGLNDDYWTYPK----------------NTTDKGLFETIFGGA----NGMMG 100 (133)
T ss_pred HHHHHHHHHHHhhHHhCCCCCCCC--CCCCCcCcccccCC----------------cccHHHHHHHHHcCC----CCCCc
Confidence 357899999999999999987543 46888764321110 012344666666653 2356
Q ss_pred CCCCCCCHHHHHHHHHHHhhcC
Q psy14949 81 VFPGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~sl~ 102 (102)
++...|+++||++|++||++++
T Consensus 101 ~~~~~LsdeeI~aLaaYI~sl~ 122 (133)
T TIGR03872 101 PQYGNLTLDEMLQIMAWIRHLY 122 (133)
T ss_pred ccccCCCHHHHHHHHHHHHHhC
Confidence 5777789999999999999975
No 6
>TIGR02603 CxxCH_TIGR02603 putative heme-binding domain, Pirellula/Verrucomicrobium type. This model represents a domain limited to very few species but expanded into large paralogous families in some species that conain it. We find it in over 20 copies each in Pirellula sp. strain 1 (phylum Planctomycetes) and Verrucomicrobium spinosum DSM 4136 (phylum Verrucomicrobia), and no matches above trusted cutoff an any other species so far. This domain, about 140 amino acids long, contains an absolutely conserved motif CxxCH, the cytochrome c family heme-binding site signature (PS00190).
Probab=99.67 E-value=3.3e-16 Score=95.40 Aligned_cols=35 Identities=37% Similarity=0.732 Sum_probs=30.3
Q ss_pred CcHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCc
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGGANKVGPNLFGVV 36 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~ 36 (102)
||+++|+.+|..+|++||++++.| ...||+|.++.
T Consensus 1 gd~~~G~~~f~~~C~~CH~~~g~g-~~~gP~L~~~~ 35 (133)
T TIGR02603 1 GDAEKGKAVYAQRCYVCHRIGGEG-VDVGPDLTGVG 35 (133)
T ss_pred CCHHHHHHHHHhHHHHhCCCCCCC-CccCCCccccc
Confidence 789999999998899999999754 46799998864
No 7
>TIGR03045 PS_II_C550 cytochrome c-550. Members of this protein family are cytochrome c-550, the PsbV extrinsic protein of photosystem II, from both Cyanobacteria and chloroplasts. A paralog to this protein, PsbV2, is found in some species in addition to PsbV itself.
Probab=99.65 E-value=6.4e-16 Score=96.09 Aligned_cols=85 Identities=19% Similarity=0.251 Sum_probs=56.9
Q ss_pred CcHHhHHHHHHHhcccccCCCCCC-CCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCC-----
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGG-ANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKY----- 74 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g-~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~----- 74 (102)
+++++|+++|+.+|++||...... .+.+++.+..+..... ...+.+.|.+|+++|..+
T Consensus 49 ~~~~~Gk~lF~~~Ca~CH~~G~~~~~p~vgl~l~~L~~A~~----------------~r~~v~~Lv~~iknP~~~dg~~~ 112 (159)
T TIGR03045 49 EQVKRGKRLFNTACGTCHVGGITKTNPNVGLDPEALALATP----------------PRDNVEALVDYMKNPTSYDGEES 112 (159)
T ss_pred HhHHHHHHHHHHHHHHhCCCCCCCCCCCCCCChhhHHhcCC----------------CccCHHHHHHHHhCcccccccch
Confidence 367899999999999999522111 1233444433322110 012578899999999865
Q ss_pred --------CCCCccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 75 --------IPGTKMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 75 --------~~~~~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
.+...||....||++|+++|++||..+
T Consensus 113 ~~~~hp~~k~~~~mP~~~~LsdeEL~avAaYIl~q 147 (159)
T TIGR03045 113 IAELHPSIRSADIFPKMRNLTDEDLRLIAGHILVQ 147 (159)
T ss_pred hhhcccccCcccccCCcCCCCHHHHHHHHHHHHHh
Confidence 345677755557999999999999764
No 8
>PF13442 Cytochrome_CBB3: Cytochrome C oxidase, cbb3-type, subunit III ; PDB: 1KB0_A 2DGE_D 2CE1_A 2CE0_A 2V07_A 1W2L_A 2ZOO_A 2ZBO_G 1DVV_A 2EXV_A ....
Probab=99.64 E-value=2.6e-16 Score=85.26 Aligned_cols=67 Identities=30% Similarity=0.594 Sum_probs=49.8
Q ss_pred CcHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCCCCcc
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIPGTKM 80 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~m 80 (102)
++++.|+.+|.++|++||+.++ .||++.+. .|+.+.+..++.++.. .|.
T Consensus 1 a~~~~G~~ly~~~C~~CH~~~~-----~gp~l~~~----------------------~~~~~~l~~~i~~g~~----~Mp 49 (67)
T PF13442_consen 1 ADAAKGKALYEQNCASCHGPGG-----AGPSLAGK----------------------DWSPEELYNIIRNGRG----GMP 49 (67)
T ss_dssp -HHHHHHHHHHHHTHHHHGTGS-----SSSTSTHH----------------------HHHHHHHHHHHHHTBT----TBS
T ss_pred CcHHHHHHHHHhHhHHhcCCCc-----cCccchhh----------------------hhhHHHHHHHHHhCcC----CCC
Confidence 3688999999999999998332 24655431 3356788889988773 233
Q ss_pred CCCCCCCHHHHHHHHHHH
Q psy14949 81 VFPGLRKEKDREDLIAYL 98 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl 98 (102)
++...||++|+++|++||
T Consensus 50 ~~~~~ls~~e~~~l~~yi 67 (67)
T PF13442_consen 50 PFGGQLSDEEIEALAAYI 67 (67)
T ss_dssp CTTTTSTHHHHHHHHHHH
T ss_pred CCCCCCCHHHHHHHHHHC
Confidence 666678999999999997
No 9
>PF00034 Cytochrom_C: Cytochrome c; InterPro: IPR003088 Cytochromes c (cytC) can be defined as electron-transfer proteins having one or several haem c groups, bound to the protein by one or, more generally, two thioether bonds involving sulphydryl groups of cysteine residues. The fifth haem iron ligand is always provided by a histidine residue. CytC possess a wide range of properties and function in a large number of different redox processes. Ambler [] recognised four classes of cytC. Class I includes the low-spin soluble cytC of mitochondria and bacteria, with the haem-attachment site towards the N terminus, and the sixth ligand provided by a methionine residue about 40 residues further on towards the C terminus. On the basis of sequence similarity, class I cytC were further subdivided into five classes, IA to IE. Class IB includes the eukaryotic mitochondrial cytC and prokaryotic 'short' cyt c2 exemplified by Rhodopila globiformis cyt c2; class IA includes 'long' cyt c2, such as Rhodospirillum rubrum cyt c2 and Aquaspirillum itersonii cyt c-550, which have several extra loops by comparison with class IB cytC.; GO: 0005506 iron ion binding, 0009055 electron carrier activity, 0020037 heme binding; PDB: 1YNR_B 2AI5_A 1AYG_A 3O5C_C 1YEA_A 3CXH_W 1YTC_A 1YEB_A 2YBB_Y 2B4Z_A ....
Probab=99.64 E-value=2.3e-18 Score=97.10 Aligned_cols=89 Identities=26% Similarity=0.415 Sum_probs=49.0
Q ss_pred HHhHHHHHHHhcccccCCCCCCCCCC-CCCccCCcCCcCCCCCCCccchhhhhcCccccHH-HHHHHHhCCCCCCCCCcc
Q psy14949 3 VENGKKIFTRACAQCHTANEGGANKV-GPNLFGVVGRQTGQAPGFDYTDANKNKGITWSAS-TLDIYLQNPKKYIPGTKM 80 (102)
Q Consensus 3 ~~~G~~lf~~~C~~CH~~~~~g~~~~-gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~l~~~~~~~~~~~~~~~m 80 (102)
+++|++||+.+|++||+.++.+.... +|+|.++..+..... ... .+... .+......+........|
T Consensus 1 a~~G~~l~~~~C~~CH~~~~~~~~~~~~p~l~~~~~~~~~~~-----~~~------~~~~~~~~~~~~~~~~~~~~~~~~ 69 (91)
T PF00034_consen 1 AARGKELFQANCAACHGADGNGDGGGPGPDLTGIGKRYSYDW-----IGR------YITNPEAISPPAHMPDAMPMFPMM 69 (91)
T ss_dssp HHHHHHHHHHHTTTTHBTSTTSSSSSTSHBHTTHTHHBTTHH-----THH------HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ChhHHHHHhCcChhcCCCCCcCCccccCccccCccccchHHH-----HHH------HHHHHhhhccccchhhccCcccCC
Confidence 57899999879999999987654444 577877654332100 000 00001 000000000000111223
Q ss_pred CCCCCCCHHHHHHHHHHHhhcC
Q psy14949 81 VFPGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~sl~ 102 (102)
+....||++|+++|++||++||
T Consensus 70 ~~~~~ls~~e~~~l~ayl~slk 91 (91)
T PF00034_consen 70 PMPKILSDEEIADLAAYLRSLK 91 (91)
T ss_dssp HHHHTSSHHHHHHHHHHHHHTS
T ss_pred cccCCCCHHHHHHHHHHHHHhC
Confidence 2222569999999999999986
No 10
>PRK13618 psbV cytochrome c-550; Provisional
Probab=99.64 E-value=1.7e-15 Score=94.45 Aligned_cols=86 Identities=26% Similarity=0.332 Sum_probs=60.4
Q ss_pred CcHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCC------
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKY------ 74 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~------ 74 (102)
+++++|+++|+++|++||. . |.+..+|++.+......+..+ .+.+.+.|.+|+++|..+
T Consensus 50 ~~~~~G~~lF~~~Ca~CH~-~--G~~~~~p~~~l~~~~La~a~p------------~rd~v~~l~~yik~P~~~Dg~~~~ 114 (163)
T PRK13618 50 KQVKEGKRLFNYACAQCHA-G--GVTKTNQNVGLEPEALALATP------------NRDNIEGLVDYMKNPTTYDGEEEI 114 (163)
T ss_pred hhHHHHHHHHHHHHHHhcC-C--CCCCCCCCcCCChhhhccCCC------------CccCHHHHHHHHhCchhccccchh
Confidence 4678999999999999994 3 346667766643221211111 012578899999999986
Q ss_pred -------CCCCccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 75 -------IPGTKMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 75 -------~~~~~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
.+...||-...|||+|+.+|++||..+
T Consensus 115 ~~~h~~ik~~~~mP~~~~Lsd~eL~ava~yll~~ 148 (163)
T PRK13618 115 SEIHPSIKSADIFTAMRNLTDKDLEAIAGHILVQ 148 (163)
T ss_pred cccccccCccccCCCCCCCCHHHHHHHHHHHHhc
Confidence 456677755557999999999999753
No 11
>CHL00183 petJ cytochrome c553; Provisional
Probab=99.63 E-value=1.2e-15 Score=89.82 Aligned_cols=80 Identities=28% Similarity=0.452 Sum_probs=54.0
Q ss_pred CcHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCCCCcc
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIPGTKM 80 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~m 80 (102)
++++.|+.+|..+|++||+.. .+..+|++.... +.+... ...+.+.+..+++++...||
T Consensus 24 a~~~~G~~ly~~~Ca~CHg~g---~~~~~P~~~~~~-------------~~l~~~-~~~~~~~i~~~i~~G~~~MP---- 82 (108)
T CHL00183 24 ADLDNGEQIFSANCAACHAGG---NNVIMPEKTLKK-------------DALEAN-SMNSIEAITYQVTNGKNAMP---- 82 (108)
T ss_pred ccHHHHHHHHHHHHHHHCCCC---CCCCCCCcccCH-------------HHHhhC-cCCCHHHHHHHHHcCccccc----
Confidence 367899999999999999842 344566653211 001000 11246788888988764222
Q ss_pred CCCCCCCHHHHHHHHHHHhhc
Q psy14949 81 VFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~sl 101 (102)
+|.+.||++|+++|++||.++
T Consensus 83 ~f~~~Ls~~ei~~i~aYi~~~ 103 (108)
T CHL00183 83 AFGGRLSDEDIEDVANYVLSQ 103 (108)
T ss_pred cccCCCCHHHHHHHHHHHHHh
Confidence 576778999999999999875
No 12
>CHL00133 psbV photosystem II cytochrome c550; Validated
Probab=99.63 E-value=1.3e-15 Score=94.85 Aligned_cols=81 Identities=20% Similarity=0.299 Sum_probs=55.4
Q ss_pred cHHhHHHHHHHhcccccCCCCCCCCCCCCCc----cCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCC----
Q psy14949 2 SVENGKKIFTRACAQCHTANEGGANKVGPNL----FGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKK---- 73 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~---- 73 (102)
++++|+++|+.+|++||.. |.+..+|.+ ..+.... ....+.+.|.+||++|..
T Consensus 51 ~~~~Gk~lF~~~CaaCH~~---G~~~~~p~vgl~l~~L~~A~----------------~~r~~~~~Lv~~iknP~~ydg~ 111 (163)
T CHL00133 51 QVKRGKRLFNASCGACHVG---GITKTNPNVGLDPEALSLAT----------------PPRDNIEALVDYMKNPTTYDGL 111 (163)
T ss_pred HHHHHHHHHHhhHHHhCCC---CCCCCCCCCCCCHHHHhhcC----------------CCcccHHHHHHHHhCcccccch
Confidence 5789999999999999952 233334433 3222100 012257899999999987
Q ss_pred ---------CCCCCccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 74 ---------YIPGTKMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 74 ---------~~~~~~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
..++..||-...||++|+.+|++||..+
T Consensus 112 ~~i~~~~~~~K~~~~MPa~~~LsdeEL~aVAaYIl~q 148 (163)
T CHL00133 112 ESIAEIHPSIKSADIFPKMRSLTDEDLYAIAGHILLQ 148 (163)
T ss_pred HHHHHhhcccCccccCCCCCCCCHHHHHHHHHHHHhc
Confidence 3345668644667999999999998753
No 13
>PRK13621 psbV cytochrome c-550; Provisional
Probab=99.62 E-value=9.4e-16 Score=95.48 Aligned_cols=85 Identities=24% Similarity=0.446 Sum_probs=57.8
Q ss_pred cHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCC-----
Q psy14949 2 SVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIP----- 76 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~----- 76 (102)
|++.|+++|..+|++||. . |++.+.|..+.......+..+ ..-..+.|.+|+++|+.+.+
T Consensus 65 d~~~G~~lF~~~Ca~CH~-g--G~n~v~p~ktL~~~~L~~a~p------------~rd~I~~LV~~iknPms~kg~~~~~ 129 (170)
T PRK13621 65 QLTDGKQLFDSNCLNCHV-G--GATLPNPNVSLSLKDLRGATP------------PRDNIAALVAYQRDPMSYDGSEESY 129 (170)
T ss_pred HHHhHHHHHHHHHHHhcc-C--CCCCcCCCCCCCHHHHhcCCC------------chHHHHHHHHHhhCCCCCCcccccc
Confidence 678999999999999994 2 345555644332211111100 11245789999999999876
Q ss_pred CCccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 77 GTKMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 77 ~~~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
+..|+....||++|+.+|++||..+
T Consensus 130 ~~~mps~~~LSdeEL~aIAaYLL~q 154 (170)
T PRK13621 130 GCRQVPEDWMTDEELQNLAAFILRA 154 (170)
T ss_pred cccCCccCCCCHHHHHHHHHHHHhh
Confidence 5556555667999999999999764
No 14
>TIGR03046 PS_II_psbV2 photosystem II cytochrome PsbV2. Members of this protein family are PsbV2, a protein closely related cytochrome c-550 (PsbV), a protein important to the water-splitting and oxygen-evolving activity of photosystem II. Mutant studies in Thermosynechococcus elongatus showed PsbV2 can partially replace PsbV, from which it appears to have arisen first by duplication, then by intergenic recombination with a different gene.
Probab=99.62 E-value=1.3e-15 Score=94.04 Aligned_cols=85 Identities=25% Similarity=0.451 Sum_probs=56.2
Q ss_pred cHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCC-----
Q psy14949 2 SVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIP----- 76 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~----- 76 (102)
|+.+|+++|+.+|++||.- |++.+.|..+..........+ .+-..+.|.+|+++|+.+.+
T Consensus 54 d~~~G~~lF~~~Ca~CH~g---G~n~~~p~~~L~~~~L~~atp------------~Rd~I~~Lv~~iknP~s~kG~~~~~ 118 (155)
T TIGR03046 54 QLTDGKNLFESNCLNCHVG---GATLPNPNVSLSLKDLKGATP------------PRDTIQSLVAYQRDPMSYDGSEESY 118 (155)
T ss_pred HHHhHHHHHHHHHHHhccC---CCCCcCCCCCCCHHHHhcCCC------------chHHHHHHHHHhhCCcccCcccccc
Confidence 6789999999999999953 234444433222111111000 01135789999999998866
Q ss_pred CCccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 77 GTKMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 77 ~~~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
+..|+....|+++|+++|++||..+
T Consensus 119 ~~~mp~~~~LsdeEL~aIAaYLl~q 143 (155)
T TIGR03046 119 GCRPVPEDWMDDEEVENLAAFILRA 143 (155)
T ss_pred cccCCcccCCCHHHHHHHHHHHHHh
Confidence 4456655667999999999999764
No 15
>PRK13617 psbV cytochrome c-550; Provisional
Probab=99.62 E-value=1.7e-15 Score=94.53 Aligned_cols=84 Identities=19% Similarity=0.320 Sum_probs=59.8
Q ss_pred cHHhHHHHHHHhcccccCCCCCC-CCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCC-----
Q psy14949 2 SVENGKKIFTRACAQCHTANEGG-ANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYI----- 75 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~~~g-~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~----- 75 (102)
++++|+++|+.+|++||...+-. .+.+||+|..+..... .+.+.+.|.+|+++|..+.
T Consensus 58 ~~~~G~~~F~~~C~~CH~~g~T~~n~~vg~dL~~L~aa~p----------------~r~nv~aLv~yikdP~sydg~~s~ 121 (170)
T PRK13617 58 EIKAGRKVFNTSCGTCHAGGITKTNQNVGLDPETLALATP----------------ARDNVDALVDYLKDPTSYDGEYSI 121 (170)
T ss_pred HHHHHHHHHHcchhhhccCCCcCCCCCcCCCHHHHhccCC----------------CCCCHHHHHHHHhChHhhcchhhc
Confidence 46789999998999999654321 2457888854421110 1235789999999998766
Q ss_pred ----CCC----ccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 76 ----PGT----KMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 76 ----~~~----~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
|+. .||-...||++|+++|++||..+
T Consensus 122 ~e~~P~~~~~~imP~~~~LsdeeL~alAayLl~~ 155 (170)
T PRK13617 122 ADLHPSMRSADLYPAMRDLNDEDLRLMAGYILVA 155 (170)
T ss_pred cccCccccccccCcccCCCCHHHHHHHHHHHHhc
Confidence 553 56644446999999999999876
No 16
>PRK13697 cytochrome c6; Provisional
Probab=99.59 E-value=5.1e-15 Score=87.43 Aligned_cols=80 Identities=24% Similarity=0.428 Sum_probs=52.1
Q ss_pred CcHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCCCCcc
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIPGTKM 80 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~m 80 (102)
+|++.|+.+|+.+|++||+.+. +....+|.+.... +...+ .++.+.+...++++...||
T Consensus 26 ~~~~~G~~ly~~~C~~CHg~g~-~~~~~~p~l~~~~---------------~~~~~-~~~~~~l~~~i~~g~~~Mp---- 84 (111)
T PRK13697 26 ADAANGEQVFSANCASCHAGGK-NLVNAGKTLKKAD---------------LEKYG-MYSLEAITAQVTNGKNAMP---- 84 (111)
T ss_pred cCHHHHHHHHHHHHHHhCCCCC-CCCCCCCCCCHHH---------------HHhcC-CCCHHHHHHHHHcCCCCCC----
Confidence 3567899999999999998532 1112244443211 00000 2346678888888753333
Q ss_pred CCCCCCCHHHHHHHHHHHhhc
Q psy14949 81 VFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~sl 101 (102)
+|...+|++|+++|++||.++
T Consensus 85 ~~~~~ls~~di~~l~~Yi~~~ 105 (111)
T PRK13697 85 AFKDRLSPDQIEDVAAYVLEQ 105 (111)
T ss_pred CCcCCCCHHHHHHHHHHHHHH
Confidence 676778999999999999875
No 17
>PRK13620 psbV cytochrome c-550; Provisional
Probab=99.56 E-value=1.4e-14 Score=92.48 Aligned_cols=82 Identities=24% Similarity=0.254 Sum_probs=59.9
Q ss_pred cHHhHHHHHHHhcccccCCCCCC--CCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCC---
Q psy14949 2 SVENGKKIFTRACAQCHTANEGG--ANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIP--- 76 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~~~g--~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~--- 76 (102)
++++|+++|++.|++|| +.|.. .+.++|+++.+..... ...+.+.|..||++|..+.+
T Consensus 103 q~~~GkqLF~~~Ca~CH-VgG~Tktnp~vgpdLt~LaaAtp----------------pRdn~e~Lv~wLkdP~sydg~~s 165 (215)
T PRK13620 103 QVAEGKQLFAYACGQCH-VGGITKTDPNVGLDPEALALATP----------------PRDSVESLVDYLHNPTTYDGERE 165 (215)
T ss_pred HHHHHHHHHHhhhhhcc-CCCCCCCCCCCCCCHHHHhccCC----------------CCCCHHHHHHHHhCccccCCcch
Confidence 57899999999999999 44311 2355676664321110 02367899999999999887
Q ss_pred ----------CCccCCCCCCCHHHHHHHHHHHhh
Q psy14949 77 ----------GTKMVFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 77 ----------~~~m~~~~~ls~~e~~~l~~yl~s 100 (102)
+..||-...||++|+.+|+.|+.-
T Consensus 166 iae~HPs~~s~d~mP~~r~LtdedL~aIa~~IL~ 199 (215)
T PRK13620 166 ISELHPSTKSTDIFPKMRNLTEDDLVAISGHILL 199 (215)
T ss_pred hhhcCccccccccccccCCCCHHHHHHHHHHHhc
Confidence 778875555699999999999864
No 18
>COG2863 Cytochrome c553 [Energy production and conversion]
Probab=99.55 E-value=6.1e-15 Score=87.55 Aligned_cols=78 Identities=27% Similarity=0.386 Sum_probs=54.8
Q ss_pred CcHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCC-CCCCCCc
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPK-KYIPGTK 79 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~ 79 (102)
+|++.|+++|.+.|++||+.++.+.....|+|.++. .++|...++.-+ ...++..
T Consensus 22 ~~a~~G~~~~~~~Ca~CHG~~g~~~~~~~P~Lagq~------------------------~~yl~~~L~a~k~g~r~~~v 77 (121)
T COG2863 22 ADAALGKALAAQSCAACHGADGNSPAPGYPKLAGQS------------------------EAYLEKQLKAYKDGKRPGPV 77 (121)
T ss_pred hhHHHHHHhhcchhhhccCCCCCCccCCCCCcCCCC------------------------HHHHHHHHHHHHcCCCCcch
Confidence 467899999988899999999876555567776543 344444443222 1123345
Q ss_pred c-CCCCCCCHHHHHHHHHHHhhcC
Q psy14949 80 M-VFPGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 80 m-~~~~~ls~~e~~~l~~yl~sl~ 102 (102)
| .....|||+||.+|++|+.+++
T Consensus 78 M~~~a~~LsD~Di~~lAa~~a~~~ 101 (121)
T COG2863 78 MNAIASGLSDEDIADLAAYYAAQK 101 (121)
T ss_pred HHHHHHhCCHHHHHHHHHHHHhCC
Confidence 6 4566679999999999999874
No 19
>TIGR03874 4cys_cytochr c-type cytochrome, methanol metabolism-related. This family represents a c-type cytochrome related to (but excluding) cytochrome c-555 of Methylococcus capsulatus. Members contain four invariant Cys residues, including two from a heme-binding motif shared with c-555, and two others.
Probab=99.55 E-value=2.8e-14 Score=87.30 Aligned_cols=77 Identities=22% Similarity=0.320 Sum_probs=56.3
Q ss_pred HhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCC---CCcc
Q psy14949 4 ENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIP---GTKM 80 (102)
Q Consensus 4 ~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~---~~~m 80 (102)
-.|+++|..+|++||+.++.|.. ..|+|.+.... .+...|.+.|.++..... ...|
T Consensus 34 ~~G~~lY~~~CAaCHG~dG~G~~-~~P~Lans~v~--------------------~s~~nli~vIl~G~~~~~~~~~~~M 92 (143)
T TIGR03874 34 YSGYRRYHSECHVCHGPDGMGST-YAPALKDSVKR--------------------MSYGDFLGVVANGRQNVSAAQNNVM 92 (143)
T ss_pred ccHHHHHHHHHHHhCCCCCCCCC-CCCCCCCcccc--------------------CCHHHHHHHHHhCCCCCCCCCCCCC
Confidence 36999999999999999986543 57777532211 146678888888775321 1346
Q ss_pred -CCCCCCCHH-HHHHHHHHHhhc
Q psy14949 81 -VFPGLRKEK-DREDLIAYLSTL 101 (102)
Q Consensus 81 -~~~~~ls~~-e~~~l~~yl~sl 101 (102)
+|...|+++ ||.+|++||++.
T Consensus 93 PaF~~~LsD~~eIa~L~~YLR~~ 115 (143)
T TIGR03874 93 PAFGDNPNVMCYLDDLYVYLRAR 115 (143)
T ss_pred CCccccCCcHHHHHHHHHHHHhc
Confidence 588888886 999999999864
No 20
>TIGR00782 ccoP cytochrome c oxidase, cbb3-type, subunit III. This model describes a di-heme subunit of approximately 26 kDa of the cbb3 type copper and heme-containing cytochrome oxidase.
Probab=99.53 E-value=4.9e-14 Score=95.41 Aligned_cols=79 Identities=23% Similarity=0.393 Sum_probs=55.2
Q ss_pred HHhHHHHHHHhcccccCCCCCCCCCC-CCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCCCCccC
Q psy14949 3 VENGKKIFTRACAQCHTANEGGANKV-GPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIPGTKMV 81 (102)
Q Consensus 3 ~~~G~~lf~~~C~~CH~~~~~g~~~~-gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~m~ 81 (102)
+++|+++|.+.|++||+.++.|.... .|+|.+....+. . +...+.+.+++++. +.|.+
T Consensus 203 ~~~G~~lf~~~Ca~CHG~~G~G~~~~gaP~L~~~~~~y~---------~---------~~~~i~~~i~~G~~---g~Mp~ 261 (285)
T TIGR00782 203 AAKGQELFADNCTTCHGEDGKGLQELGAPNLTDDVWLYG---------G---------DLKTITTTITNGRG---GVMPA 261 (285)
T ss_pred HHHHHHHHhccchhhCCCCCCCCCCCCCCCCCcchhhcC---------C---------CHHHHHHHHHhCCC---CCCCC
Confidence 46899999989999999997654333 488876432111 0 23456666776542 22335
Q ss_pred CCCCCCHHHHHHHHHHHhhcC
Q psy14949 82 FPGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 82 ~~~~ls~~e~~~l~~yl~sl~ 102 (102)
|...||++||++|++||.+|+
T Consensus 262 ~~~~Ls~~ei~~La~Yv~sL~ 282 (285)
T TIGR00782 262 WGPRLSEAQIKALAAYVHSLG 282 (285)
T ss_pred ccccCCHHHHHHHHHHHHHhc
Confidence 777789999999999999974
No 21
>PRK13619 psbV cytochrome c-550; Provisional
Probab=99.50 E-value=7e-14 Score=85.96 Aligned_cols=85 Identities=25% Similarity=0.362 Sum_probs=58.8
Q ss_pred CcHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCC-----
Q psy14949 1 MSVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYI----- 75 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~----- 75 (102)
+|+++|+++|+.+|+.||.. |++.++|++...........+ .+...+.|.+|+++|..+.
T Consensus 49 ~d~~~GkklF~~~Ca~CH~g---G~nk~~Pnl~L~~~~L~~atP------------~RdnV~aLVdymk~PtsyDG~~~~ 113 (160)
T PRK13619 49 KQITNGQRLFVQECTQCHLQ---GKTKTNNNVSLGLEDLAGAEP------------PRDNVLALVDYLKHPTSYDGEDDY 113 (160)
T ss_pred HHHHHHHHHHHHHHHHcccC---CCCCcCCCCCcCHHHHHhcCC------------CcccHHHHHHHHhCCcccccchhh
Confidence 46889999999999999975 467778877653222211111 1235778999999999872
Q ss_pred ---C-----CCccCCCCCCCHHHHHHHHHHHhh
Q psy14949 76 ---P-----GTKMVFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 76 ---~-----~~~m~~~~~ls~~e~~~l~~yl~s 100 (102)
. ...||-..-|+|+|+.+|++|+..
T Consensus 114 a~~hpsi~~~di~P~mr~LtdedL~~iAg~IL~ 146 (160)
T PRK13619 114 SELHPNVSRPDIFPELRNFTEDDLYDVAGYMLV 146 (160)
T ss_pred hhhcccccccccccccCCCCHHHHHHHHHHHHh
Confidence 1 123443334799999999999864
No 22
>TIGR00782 ccoP cytochrome c oxidase, cbb3-type, subunit III. This model describes a di-heme subunit of approximately 26 kDa of the cbb3 type copper and heme-containing cytochrome oxidase.
Probab=99.47 E-value=2.8e-13 Score=91.73 Aligned_cols=80 Identities=29% Similarity=0.380 Sum_probs=53.9
Q ss_pred HhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCC----C-CC
Q psy14949 4 ENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYI----P-GT 78 (102)
Q Consensus 4 ~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~----~-~~ 78 (102)
+.|+++|..+|++||+.++.|. ...|+|.+..... . .+.+.+...++++.... . ..
T Consensus 108 ~~G~~lf~~~Ca~CHG~~g~G~-~g~P~L~~~~~~~---------g---------~~~~~i~~~i~~G~~~~~~~~~~~~ 168 (285)
T TIGR00782 108 NAGAAIFRTWCAQCHGSGAGGA-KGFPNLLDNDWLW---------G---------GTLEGIHTTIKHGIRDPDDGDTYVG 168 (285)
T ss_pred HHHHHHHHHHhHHhCCCCCCCC-CCCCCCCCCcccc---------C---------CCHHHHHHHHHhCccCcccCCcCCC
Confidence 6799999999999999987543 2237665421100 0 03556777777665311 0 23
Q ss_pred cc-CCCCCCCHHHHHHHHHHHhhcC
Q psy14949 79 KM-VFPGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 79 ~m-~~~~~ls~~e~~~l~~yl~sl~ 102 (102)
.| +|...||++|+++|++||++++
T Consensus 169 ~Mp~~~~~LsdeeI~aVaaYv~sl~ 193 (285)
T TIGR00782 169 EMPAFGPLLEEADIKDVASYVMSLS 193 (285)
T ss_pred CCCccccccChHHHHHHHHHHHHhc
Confidence 45 5767789999999999999863
No 23
>PRK14486 putative bifunctional cbb3-type cytochrome c oxidase subunit II/cytochrome c; Provisional
Probab=99.46 E-value=3.6e-13 Score=91.38 Aligned_cols=76 Identities=20% Similarity=0.358 Sum_probs=51.3
Q ss_pred HHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCCCCcc-C
Q psy14949 3 VENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIPGTKM-V 81 (102)
Q Consensus 3 ~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~m-~ 81 (102)
+.+|+++|+.+|++||+.++.|. .+|.+...... ....+.+...+.++.. +..| +
T Consensus 215 ~~~G~~ly~~~Ca~CHg~~g~G~--~gp~p~~~~~~-------------------~~~~~~~~~~I~~G~~---~~~MP~ 270 (294)
T PRK14486 215 IAKGKALYDANCAACHGDEAQGQ--EGVALNDIDDG-------------------DLPDAAYFGMIKGGSD---AKGMPG 270 (294)
T ss_pred HHHHHHHHHHHHHHhcCCCCCCC--CCCCccccccC-------------------CCcHHHHHHHHHcCCC---cCCCCc
Confidence 56899999999999999886543 34433322110 0023345666777663 2245 5
Q ss_pred CCCCCCHHHHHHHHHHHhhcC
Q psy14949 82 FPGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 82 ~~~~ls~~e~~~l~~yl~sl~ 102 (102)
|...||++|+++|++||++++
T Consensus 271 f~~~Lsdeei~~LaaYV~sl~ 291 (294)
T PRK14486 271 FGGDLSDDDIWAIVAYIRSQK 291 (294)
T ss_pred ccccCCHHHHHHHHHHHHhcc
Confidence 667689999999999999874
No 24
>PRK14487 cbb3-type cytochrome c oxidase subunit II; Provisional
Probab=99.44 E-value=2.8e-13 Score=87.34 Aligned_cols=98 Identities=21% Similarity=0.250 Sum_probs=60.9
Q ss_pred HhHHHHHHH-hcccccC--CCCCCCC--CCC-CCccCCc--CCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCC
Q psy14949 4 ENGKKIFTR-ACAQCHT--ANEGGAN--KVG-PNLFGVV--GRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYI 75 (102)
Q Consensus 4 ~~G~~lf~~-~C~~CH~--~~~~g~~--~~g-p~l~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 75 (102)
.+|+.+|.+ .|..||+ +.+-... .-| +++.+-. .++. .-..-...|+|.+.|.+.+.+++..+|.+|+...
T Consensus 52 l~Gr~iyi~eGC~~CHsQ~VR~~~~e~~r~G~~S~a~e~~yd~P~-lwGs~RtGPDLt~vG~R~s~~w~~~hl~nP~~v~ 130 (217)
T PRK14487 52 LAGRDIYIREGCYNCHSQMIRPFRAETERYGHYSLAGESVYDHPF-LWGSKRTGPDLARVGGRYSDEWHRNHLINPRSVV 130 (217)
T ss_pred HHHHHHHHhcChhhccCccccCCchhhhhcCcccccchhhccccc-ccCCCCCCcchhhhhccCCHHHHHHHHhCcccCC
Confidence 579999975 6999998 3321000 001 1111100 0000 0001123455666666678899999999999999
Q ss_pred CCCccCCCCCCCHHHHH--HHHHHHhhcC
Q psy14949 76 PGTKMVFPGLRKEKDRE--DLIAYLSTLH 102 (102)
Q Consensus 76 ~~~~m~~~~~ls~~e~~--~l~~yl~sl~ 102 (102)
|++.||-...|++++++ +|++||++|+
T Consensus 131 PgS~MPay~~L~~~~ld~~~~~~~l~~l~ 159 (217)
T PRK14487 131 PESNMPAYPWLAENDLDGTDTAEKMTALR 159 (217)
T ss_pred CCCCCCCCcccccccCCHHHHHHHHHHhh
Confidence 99999744445677766 9999999874
No 25
>PRK13622 psbV cytochrome c-550; Provisional
Probab=99.34 E-value=5.1e-12 Score=79.75 Aligned_cols=85 Identities=25% Similarity=0.416 Sum_probs=51.1
Q ss_pred cHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCC------
Q psy14949 2 SVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYI------ 75 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~------ 75 (102)
++..|+++|..+|++||.. | .+...|..+.......+..+. .-+.+.|..++++|..+.
T Consensus 61 ~~~~G~~lF~~~Ca~CH~~-G--~ni~~P~~tLk~~aL~~a~p~------------rdnv~AIv~yLk~p~tYdg~~~~~ 125 (180)
T PRK13622 61 QLAKGKKLFNRACAQCHVG-G--QTYPNPDVSLKLSDLEGATPP------------RDNVLAIVDYIKNPVTYDGVESLL 125 (180)
T ss_pred HHHHHHHHHHhhhHHhccC-C--CCCcCCCcccCHHHHcCCCCC------------cccHHHHHHHHhcccccCCccchh
Confidence 4578999999999999954 2 333334333211111100000 124567888999988544
Q ss_pred ---CC----CccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 76 ---PG----TKMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 76 ---~~----~~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
|. +.||-...|||+|+.+|++||..+
T Consensus 126 e~~p~~~~~~~~p~~~~LsdeEI~~VA~yIl~q 158 (180)
T PRK13622 126 EYHPNTQLLSEYPRLRNLTDEDLKLIAGYILVQ 158 (180)
T ss_pred hccccchhccccccccCCCHHHHHHHHHHHHhC
Confidence 11 123322457999999999999764
No 26
>COG3258 Cytochrome c [Energy production and conversion]
Probab=99.30 E-value=1e-11 Score=81.87 Aligned_cols=82 Identities=26% Similarity=0.439 Sum_probs=50.1
Q ss_pred cHHhHHHHHHHhcccccCCCCCCCCC--------CCCCccCCcCCcCCCCCCCccchhhhhcCc-cccHHHHHHHHhCCC
Q psy14949 2 SVENGKKIFTRACAQCHTANEGGANK--------VGPNLFGVVGRQTGQAPGFDYTDANKNKGI-TWSASTLDIYLQNPK 72 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~~~g~~~--------~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~~~~~~ 72 (102)
|+.+|+.||.++|+.||+.+|.|... ..|+|.+-. .|. .|. ..-...+..||+.-+
T Consensus 161 dp~rG~kly~eqCa~CHg~~G~G~k~~~~~~~~y~fPpLwG~d--------Sfn-------~GagMari~t~A~Fi~~nM 225 (293)
T COG3258 161 DPVRGKKLYAEQCAACHGADGQGLKNDDEQGAGYLFPPLWGPD--------SFN-------DGAGMARINTLARFIKANM 225 (293)
T ss_pred CchhHHHHHHHHHHHhcCCCCCccccCcCCCcceecCcccCCc--------ccC-------CccchhhHHHHHHHHHhcC
Confidence 77899999999999999999765322 234444321 111 111 112245667776432
Q ss_pred CCCCCCccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 73 KYIPGTKMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 73 ~~~~~~~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
|-..-....+||++|.++|++|+.++
T Consensus 226 ---P~g~~~~~P~Lsd~dA~DiAay~~~~ 251 (293)
T COG3258 226 ---PYGFSGTNPILSDQDAWDIAAYVNSQ 251 (293)
T ss_pred ---CCCcCccCCccChHHHHHHHHHHcCC
Confidence 21111122358999999999999764
No 27
>COG4654 Cytochrome c551/c552 [Energy production and conversion]
Probab=99.26 E-value=6.7e-12 Score=71.77 Aligned_cols=83 Identities=20% Similarity=0.376 Sum_probs=58.4
Q ss_pred CcHHhHHHHHHHh-cccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCC-CCC
Q psy14949 1 MSVENGKKIFTRA-CAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYI-PGT 78 (102)
Q Consensus 1 ~d~~~G~~lf~~~-C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~ 78 (102)
+++++|+.||.++ |.+||.++ ...+||++..+...+.+..+ ....|...++++.... ...
T Consensus 21 ~~a~~~~aif~qkgC~~CHq~~---vktVGPS~kdIAakYag~~~---------------~~~kl~q~i~~g~~g~wg~i 82 (110)
T COG4654 21 ADAEDGKAIFSQKGCVACHQPD---VKTVGPSYKDIAAKYAGKAG---------------ALAKLAQGIKPGGVGVWGPI 82 (110)
T ss_pred cchhhhHHHHHhccchhhcccc---ccccCccHHHHHHHHccchh---------------HHHHHHHhccccCcCccccC
Confidence 3678999999975 99999988 67789999988777665433 2344555555544322 235
Q ss_pred ccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 79 KMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 79 ~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
.||....+++.+...++.||.+.
T Consensus 83 pMppqp~~sd~~a~~~~kwvl~~ 105 (110)
T COG4654 83 PMPPQPAISDADAKTLAKWVLAF 105 (110)
T ss_pred CCCCcccccchHHHHHHHHHHhc
Confidence 67766566788888888887653
No 28
>PF14495 Cytochrom_C550: Cytochrome c-550 domain; PDB: 3ARC_V 1IZL 3A0H_V 3A0B_v 1E29_A 1F1C_B 1S5L_V 4FBY_i 3PRR_V 3PRQ_V ....
Probab=99.25 E-value=1.4e-11 Score=73.65 Aligned_cols=84 Identities=26% Similarity=0.444 Sum_probs=49.1
Q ss_pred cHHhHHHHHHHhcccccCCCCCCCCCCCCCccCCcCCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCCCCC---
Q psy14949 2 SVENGKKIFTRACAQCHTANEGGANKVGPNLFGVVGRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYIPGT--- 78 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~~~g~~~~gp~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~--- 78 (102)
++.+|++||+..|+.||. +|.....|+............+ .+.+.+.|.+|+++|..+..-.
T Consensus 24 q~~~GkrLF~~~C~~CH~---GG~TktNpnV~L~le~L~~AtP------------pRDNi~~LVdYmk~PtsYDG~~~i~ 88 (135)
T PF14495_consen 24 QLKRGKRLFNASCAQCHV---GGITKTNPNVSLSLEDLAGATP------------PRDNIEALVDYMKNPTSYDGEESIS 88 (135)
T ss_dssp HHHHHHHHHHHHTHHHHG---GGCBTTSTTSBSSHHHHHTSSS--------------SSHHHHHHHHHS-B-TTSSSBGT
T ss_pred HHHHHHHHHHHHHHhhcc---CCcccCCCCCCcCHHHHccCCC------------CcccHHHHHHHhhCCCCcCCchhHH
Confidence 467899999999999995 1234445554332211111111 1336788999999999765321
Q ss_pred ----------ccCCCCCCCHHHHHHHHHHHhh
Q psy14949 79 ----------KMVFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 79 ----------~m~~~~~ls~~e~~~l~~yl~s 100 (102)
..|-..-|+++|+.+|++||..
T Consensus 89 e~hp~~~s~di~p~mr~ltdddL~~iAg~IL~ 120 (135)
T PF14495_consen 89 ELHPSIKSADIFPKMRNLTDDDLYAIAGYILR 120 (135)
T ss_dssp TTS-STTCTTTSGGGTS--HHHHHHHHHHHHH
T ss_pred HhCcCcccchhhHhhcCCCHHHHHHHHHHHHh
Confidence 1111123689999999999864
No 29
>COG2010 CccA Cytochrome c, mono- and diheme variants [Energy production and conversion]
Probab=99.23 E-value=1.2e-11 Score=76.10 Aligned_cols=19 Identities=37% Similarity=0.744 Sum_probs=17.4
Q ss_pred HHhHHHHHHHhcccccCCC
Q psy14949 3 VENGKKIFTRACAQCHTAN 21 (102)
Q Consensus 3 ~~~G~~lf~~~C~~CH~~~ 21 (102)
...|+++|..+|+.||+.+
T Consensus 51 ~~~G~~~f~~~C~~CHg~~ 69 (150)
T COG2010 51 RGAGLALFLGNCAACHGPN 69 (150)
T ss_pred HHHHHHHhcccchhccCCC
Confidence 5689999999999999988
No 30
>TIGR00781 ccoO cytochrome c oxidase, cbb3-type, subunit II. This model describes the monoheme subunit of the cbb3-type cytochrome oxidase, found in a subset of Proteobacterial species. Species having this protein also have CcoN (subunit I, containing copper and two heme groups), CcoP (subunit III, containing two hemes), and CcoQ (essential for incorporation of the prosthetic groups).
Probab=99.10 E-value=2.4e-10 Score=74.37 Aligned_cols=77 Identities=21% Similarity=0.258 Sum_probs=44.2
Q ss_pred HhHHHHHHH-hcccccC--CCCCC--CCCCC-CCccCCc--CCcCCCCCCCccchhhhhcCccccHHHHHHHHhCCCCCC
Q psy14949 4 ENGKKIFTR-ACAQCHT--ANEGG--ANKVG-PNLFGVV--GRQTGQAPGFDYTDANKNKGITWSASTLDIYLQNPKKYI 75 (102)
Q Consensus 4 ~~G~~lf~~-~C~~CH~--~~~~g--~~~~g-p~l~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 75 (102)
.+|+.+|.+ .|..||+ +.+.. ...-| +++.+-. .++. .-..-...|+|...|..++.+++..++.+|+...
T Consensus 51 l~Gr~iyi~eGC~~CHsQ~VR~~~~e~~ryG~~S~a~e~~yd~p~-lwGs~RtGPDLt~vG~R~s~~wh~~hl~nPr~v~ 129 (232)
T TIGR00781 51 LAGRDIYIREGCYHCHSQMIRPFRAEVERYGHYSLAGESMYDHPF-QWGSKRTGPDLARVGGRYSDEWHVKHLFDPRSVV 129 (232)
T ss_pred HHHHHHHHhcChhhcccccccCcchhhhhcCcccccchhhccccc-ccCCCCcCcCcccccccCCHHHHHHHHhCccccC
Confidence 579999975 6999997 33210 00000 1111100 0000 0000123445555566667899999999999999
Q ss_pred CCCccC
Q psy14949 76 PGTKMV 81 (102)
Q Consensus 76 ~~~~m~ 81 (102)
|++.||
T Consensus 130 PgSiMP 135 (232)
T TIGR00781 130 PESIMP 135 (232)
T ss_pred CCCCCC
Confidence 999997
No 31
>PRK14486 putative bifunctional cbb3-type cytochrome c oxidase subunit II/cytochrome c; Provisional
Probab=99.07 E-value=4.6e-10 Score=76.37 Aligned_cols=75 Identities=15% Similarity=0.278 Sum_probs=45.9
Q ss_pred HHhHHHHHHH-hcccccCCCCCCCCCCCCCccCCcCCcCC----------CCCCCccchhhhhcCccccHHHHHHHHhCC
Q psy14949 3 VENGKKIFTR-ACAQCHTANEGGANKVGPNLFGVVGRQTG----------QAPGFDYTDANKNKGITWSASTLDIYLQNP 71 (102)
Q Consensus 3 ~~~G~~lf~~-~C~~CH~~~~~g~~~~gp~l~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 71 (102)
..+|+.||.+ .|..||+..- .+....... .++... .-..-...|.|...|..++.+++..+|.+|
T Consensus 51 ~~~G~~~y~~~gC~~CH~q~v---r~~~~~~~~-~g~~s~~~~~~~~~p~~~g~~r~GPDL~~vG~r~~~~w~~~~l~~P 126 (294)
T PRK14486 51 ELAGRDVYQREGCVNCHTQTV---RPLKSEVVR-YGQYSKAGEFAYDHPFLWGSKRTGPDLARIGGKYPDAWHYAHFEDP 126 (294)
T ss_pred HHHHHHHHHHcCchhhcCccc---cCCcccccc-cCCCCcchhhhccccccccCCCCCCchhhhcccCCHHHHHHHHhCc
Confidence 3579999986 6999998420 000111100 000000 001112355666667777899999999999
Q ss_pred CCCCCCCccC
Q psy14949 72 KKYIPGTKMV 81 (102)
Q Consensus 72 ~~~~~~~~m~ 81 (102)
+...|++.||
T Consensus 127 ~~~~p~s~MP 136 (294)
T PRK14486 127 QAVVPRSNMP 136 (294)
T ss_pred ccCCCCCCCC
Confidence 9999999886
No 32
>TIGR03791 TTQ_mauG tryptophan tryptophylquinone biosynthesis enzyme MauG. Members of this protein family are the tryptophan tryptophylquinone biosynthesis (TTQ) enzyme MauG, as found in Methylobacterium extorquens and related species. This protein is required to complete the maturation of the TTQ cofactor in the methylamine dehydrogenase light (beta) chain.
Probab=98.86 E-value=1.4e-08 Score=69.15 Aligned_cols=99 Identities=24% Similarity=0.335 Sum_probs=54.0
Q ss_pred HHhHHHHHH--HhcccccCCCCCC---CCCCC-CC-c-cCC------------cCCcCCCCCCCccchhhhhcCccccHH
Q psy14949 3 VENGKKIFT--RACAQCHTANEGG---ANKVG-PN-L-FGV------------VGRQTGQAPGFDYTDANKNKGITWSAS 62 (102)
Q Consensus 3 ~~~G~~lf~--~~C~~CH~~~~~g---~~~~g-p~-l-~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 62 (102)
..+|..||. .+|+.||...--+ -..+| |. + .+. ....+++..+...++++.+.|...+.+
T Consensus 159 e~~G~~LF~~k~~C~~CH~g~~ftd~~f~~iG~~~~~d~G~~~~~~~~~~~~~~~FrtPsLRnV~~taPY~HdG~~~tL~ 238 (291)
T TIGR03791 159 AKRGFALFKGKAGCAACHSSWRFTDDSFHDIGLKAGLDLGRGAFAPPQVTAMQHAFKTPSLRDLPMEGPFMHDGQLGSLD 238 (291)
T ss_pred HHHHHHHhcCCCCCCCCCCCCCcCCchHHhcCCCCccCCCcccccccccccccCcccCccccccccCCCCCCCCCcCCHH
Confidence 468999998 3699999632100 00112 11 0 011 111223333444444555677777777
Q ss_pred HHHHHHhCCCCCCCC--CccCCCCCCCHHHHHHHHHHHhhcC
Q psy14949 63 TLDIYLQNPKKYIPG--TKMVFPGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 63 ~l~~~~~~~~~~~~~--~~m~~~~~ls~~e~~~l~~yl~sl~ 102 (102)
.+..+........++ ..+. .-.||++|+.+|++||++|.
T Consensus 239 evv~~y~~~g~~~~~~~~~~~-~~~Lt~~E~~dLvaFL~tLt 279 (291)
T TIGR03791 239 AVIDHYEKGGEKRPSISAEMK-PFELSEREREDLIAFIETLD 279 (291)
T ss_pred HHHHHHHccCccccccccccc-cCCCCHHHHHHHHHHHHhcC
Confidence 777776554322221 1111 11369999999999999973
No 33
>COG2857 CYT1 Cytochrome c1 [Energy production and conversion]
Probab=98.39 E-value=1.3e-06 Score=58.40 Aligned_cols=41 Identities=32% Similarity=0.454 Sum_probs=34.0
Q ss_pred HHHHHHHHhCCCCCCCCCccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 61 ASTLDIYLQNPKKYIPGTKMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 61 ~~~l~~~~~~~~~~~~~~~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
...+..+|++|....+++.|+-...++|+++++|++||+.+
T Consensus 172 ~~~~~~~i~~p~~~k~~~~m~~~~~~tdq~~~dlvaYL~~~ 212 (250)
T COG2857 172 EGELGIFIADPLKDKPGTYMPGNPALTDQEVKDLVAYLKWA 212 (250)
T ss_pred hhhHhhhccCccccCCcCCCCCChhhHHHHHHHHHHHHHHc
Confidence 34488999999999999988744445899999999999875
No 34
>PF02433 FixO: Cytochrome C oxidase, mono-heme subunit/FixO; InterPro: IPR003468 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO) []. Cytochrome cbb3 oxidases are required both to support symbiotic nitrogen fixation, whilst ensuring that the oxygen-labile nitrogenase is not compromised. Cytochrome cbb3 oxidases consist of four subunits: FixN (or CcoN), FixO (or CcoO), FixP (or CcoP) and FixQ (or CcoQ). The catalytic core is comprised of subunits FixN, FixO and FixP, where FixN acts as the catalytic subunit, and Fix O and FixP are membrane-bound mono- and di-haem cytochromes c, respectively. The FixQ subunit protects the core complex in the presence of oxygen from proteolytic degradation []. This entry represents the mono-haem FixO subunit.
Probab=98.26 E-value=5.9e-06 Score=54.06 Aligned_cols=74 Identities=22% Similarity=0.263 Sum_probs=46.2
Q ss_pred HhHHHHHHH-hcccccCCCCCCCCCCCCCccCCcCCcCC----------CCCCCccchhhhhcCccccHHHHHHHHhCCC
Q psy14949 4 ENGKKIFTR-ACAQCHTANEGGANKVGPNLFGVVGRQTG----------QAPGFDYTDANKNKGITWSASTLDIYLQNPK 72 (102)
Q Consensus 4 ~~G~~lf~~-~C~~CH~~~~~g~~~~gp~l~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 72 (102)
.+|+.+|.+ .|..||+-.- .++...... ++++.. .-+.-...|+|...|...+.++-..-+.+|+
T Consensus 51 l~GR~iYi~eGC~~CHSQ~V---Rp~~~e~~R-yG~yS~a~e~~yd~P~lwGSkRtGPDLarvG~r~s~~Wh~~Hl~~Pr 126 (226)
T PF02433_consen 51 LAGRDIYIREGCYYCHSQMV---RPFRAEVER-YGRYSVAGEYVYDHPFLWGSKRTGPDLARVGGRYSDDWHLAHLYNPR 126 (226)
T ss_pred HhHHHHHHHcCchhcccccC---CCchhhhhh-cCCCCchhhhhccCccccCCCCcCccHHHHhccCChHHHHHHhhChH
Confidence 479999996 5999998531 111111100 111110 0112234566666777778888888899999
Q ss_pred CCCCCCccC
Q psy14949 73 KYIPGTKMV 81 (102)
Q Consensus 73 ~~~~~~~m~ 81 (102)
...|++.||
T Consensus 127 ~v~p~SiMP 135 (226)
T PF02433_consen 127 SVVPGSIMP 135 (226)
T ss_pred hhCCCCCCC
Confidence 999999996
No 35
>PF09098 Dehyd-heme_bind: Quinohemoprotein amine dehydrogenase A, alpha subunit, haem binding; InterPro: IPR015182 Quinohemoprotein amine dehydrogenases (QHNDH) 1.4.99 from EC) are enzymes produced in the periplasmic space of certain Gram-negative bacteria, such as Paracoccus denitrificans and Pseudomonas putida, in response to primary amines, including n-butylamine and benzylamine. QHNDH catalyses the oxidative deamination of a wide range of aliphatic and aromatic amines through formation of a Schiff-base intermediate involving one of the quinone O atoms []. Catalysis requires the presence of a novel redox cofactor, cysteine tryptophylquinone (CTQ). CTQ is derived from the post-translational modification of specific residues, which involves the oxidation of the indole ring of a tryptophan residue to form tryptophylquinone, followed by covalent cross-linking with a cysteine residue []. There is one CTQ per subunit in QHNDH. In addition to CTQ, two haem c cofactors are present in QHNDH that mediate the transfer of the substrate-derived electrons from CTQ to an external electron acceptor, cytochrome c-550 [, ]. QHNDH is a heterotrimer of alpha, beta and gamma subunits. The alpha and beta subunits contain signal peptides necessary for the translocation of QHNDH to the periplasm. The alpha subunit is composed of four domains - domain 1 forming a dihaem cytochrome, and domains 2-4 forming antiparallel beta-barrel structures; the beta subunit is a 7-bladed beta-propeller that provides part of the active site; and the small, catalytic gamma subunit contains the novel cross-linked CTQ cofactor, in addition to additional thioester cross-links between Cys and Asp/Glu residues that encage CTQ. The gamma subunit assumes a globular secondary structure with two short alpha-helices having many turns and bends []. This entry represents the dihaem cytochrome c domain of the QHNDH alpha subunit. The domain contain two cysteine residues that are involved in thioether linkages to haem []. ; PDB: 1PBY_A 1JJU_A 1JMZ_A 1JMX_A.
Probab=98.06 E-value=1.6e-06 Score=54.02 Aligned_cols=19 Identities=32% Similarity=0.847 Sum_probs=15.1
Q ss_pred hHHHHHHHhcccccCCCCC
Q psy14949 5 NGKKIFTRACAQCHTANEG 23 (102)
Q Consensus 5 ~G~~lf~~~C~~CH~~~~~ 23 (102)
.|++|.+++|+.||+.+..
T Consensus 2 ~G~~Lv~~kC~~CHs~~~~ 20 (167)
T PF09098_consen 2 DGEQLVQEKCAGCHSADYD 20 (167)
T ss_dssp -HHHHHHHCHCCTC-EECT
T ss_pred cHHHHHHHHHHHhcCcccc
Confidence 6999999999999987643
No 36
>PRK14485 putative bifunctional cbb3-type cytochrome c oxidase subunit I/II; Provisional
Probab=98.03 E-value=2.8e-05 Score=58.70 Aligned_cols=94 Identities=17% Similarity=0.229 Sum_probs=56.9
Q ss_pred HhHHHHHHH-hcccccCCCCCCCCCCCCCccCCcCCcCC----------CCCCCccchhhhhcCccccHHHHHHHHhCCC
Q psy14949 4 ENGKKIFTR-ACAQCHTANEGGANKVGPNLFGVVGRQTG----------QAPGFDYTDANKNKGITWSASTLDIYLQNPK 72 (102)
Q Consensus 4 ~~G~~lf~~-~C~~CH~~~~~g~~~~gp~l~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 72 (102)
.+|+.+|.+ .|..||+-.- .+..++... .+++.. .-..-...|++...|.+.+.++-..-+.||+
T Consensus 538 ~~Gr~iyi~egC~~CHsq~v---r~~~~e~~r-~G~~s~~~e~~~d~p~~~Gs~rtgpdl~~~g~~~~~~wh~~hl~~p~ 613 (712)
T PRK14485 538 LEGRDLYIREGCYNCHSQMI---RPFRSEVER-YGEYSKAGEFVYDHPFLWGSKRTGPDLAREGGKYPDSWHYNHMEDPQ 613 (712)
T ss_pred HhhHHHHHHcCccccccccC---CCCchhHHh-cCCCCchhhhhccCCcccCCCCcCcchhhhcCCCChHHHHHHhcCch
Confidence 579999996 5999997531 111111100 111110 0111234566666777777788888889999
Q ss_pred CCCCCCccC-C----CCCCCHHHHHHHHHHHhhc
Q psy14949 73 KYIPGTKMV-F----PGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 73 ~~~~~~~m~-~----~~~ls~~e~~~l~~yl~sl 101 (102)
...|++.|| | ...++.+++.+-+..++.+
T Consensus 614 ~~~p~s~mp~y~~l~~~~~~~~~~~~~~~~~~~~ 647 (712)
T PRK14485 614 STSPGSIMPAYPWLLENELDISDTPAKIKAMQTL 647 (712)
T ss_pred hcCCCCCCCCChhhhhCCCChHHHHHHHHHHHhc
Confidence 999999886 3 3445666666666666553
No 37
>PF06537 DUF1111: Protein of unknown function (DUF1111); InterPro: IPR010538 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=97.96 E-value=2.8e-06 Score=61.35 Aligned_cols=20 Identities=45% Similarity=0.959 Sum_probs=16.6
Q ss_pred cHHhHHHHHHH-hcccccCCC
Q psy14949 2 SVENGKKIFTR-ACAQCHTAN 21 (102)
Q Consensus 2 d~~~G~~lf~~-~C~~CH~~~ 21 (102)
++.+|++||.+ .|++||...
T Consensus 362 ~v~~G~~lF~~~GCa~CH~p~ 382 (499)
T PF06537_consen 362 QVLRGKQLFYQIGCASCHTPS 382 (499)
T ss_pred HHHHHHHHHHhcCCcccCCCc
Confidence 45789999997 599999754
No 38
>PF09086 DUF1924: Domain of unknown function (DUF1924); InterPro: IPR015170 This entry is found in a set of bacterial proteins, including Cytochrome c-type protein. It is functionally uncharacterised. ; PDB: 1DW2_C 1DW1_A 1DW3_C 1DW0_A 1OAE_A 1GU2_B 1E8E_A.
Probab=97.82 E-value=2.3e-05 Score=44.81 Aligned_cols=20 Identities=50% Similarity=0.969 Sum_probs=15.0
Q ss_pred cHHhHHHHHHH-h--------cccccCCC
Q psy14949 2 SVENGKKIFTR-A--------CAQCHTAN 21 (102)
Q Consensus 2 d~~~G~~lf~~-~--------C~~CH~~~ 21 (102)
++++|+++|.. + |++||+.+
T Consensus 10 sa~rG~~~f~~~~~~~g~~~sCasCH~~~ 38 (98)
T PF09086_consen 10 SAARGEAFFTSKHTGNGKEWSCASCHTAD 38 (98)
T ss_dssp -HHHHHHHHH--ECCTTCECSCHHHH-SS
T ss_pred CHHHHHHHHHccCCCCCCCCCcccccCCC
Confidence 67899999974 2 99999876
No 39
>COG3245 CycB Cytochrome c5 [Energy production and conversion]
Probab=97.80 E-value=8.8e-05 Score=43.90 Aligned_cols=17 Identities=41% Similarity=1.038 Sum_probs=14.5
Q ss_pred hHHHHHHHhcccccCCC
Q psy14949 5 NGKKIFTRACAQCHTAN 21 (102)
Q Consensus 5 ~G~~lf~~~C~~CH~~~ 21 (102)
.|+.+|...|..||+..
T Consensus 49 ~Gk~vy~~tC~~CHa~~ 65 (126)
T COG3245 49 EGKKVYGATCQACHAAG 65 (126)
T ss_pred ccchhHhhhhhHhccCC
Confidence 38999999999999743
No 40
>COG2993 CcoO Cbb3-type cytochrome oxidase, cytochrome c subunit [Energy production and conversion]
Probab=97.74 E-value=6.4e-06 Score=52.76 Aligned_cols=53 Identities=25% Similarity=0.281 Sum_probs=33.9
Q ss_pred chhhhhcCccccHHHHHHHHhCCCCCCCCCccC-CCCCCC-HHHHHHHHHHHhhc
Q psy14949 49 TDANKNKGITWSASTLDIYLQNPKKYIPGTKMV-FPGLRK-EKDREDLIAYLSTL 101 (102)
Q Consensus 49 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~m~-~~~~ls-~~e~~~l~~yl~sl 101 (102)
.|++-..|-.++.++=..-+.+|+.+.|.+.|| |..+.+ +-|++++.+=++++
T Consensus 105 GPDLaRVG~ryS~dWH~~Hl~~PR~vVPeSiMPsY~~L~~~~ld~~d~~~~~~~~ 159 (227)
T COG2993 105 GPDLARVGGRYSDDWHRAHLLDPRSVVPESIMPSYPWLFDNKLDVDDIGAELKAL 159 (227)
T ss_pred CcchhhhccccccHHHHHHccCchhcCccccCcccHHHhcCCCchHHHHHHHHhh
Confidence 344444555567777777788999999999886 443322 23566666555543
No 41
>PF02167 Cytochrom_C1: Cytochrome C1 family; InterPro: IPR002326 Cytochrome bc1 complex (ubiquinol:ferricytochrome c oxidoreductase) is found in mitochondria, photosynthetic bacteria and other prokaryotes. It is minimally composed of three subunits: cytochrome b, carrying a low- and a high-potential haem group; cytochrome c1 (cyt c1); and a high-potential Rieske iron-sulphur protein. The general function of the complex is electron transfer between two mobile redox carriers, ubiquinol and cytochrome c; the electron transfer is coupled with proton translocation across the membrane, thus generating proton-motive force in the form of an electrochemical potential that can drive ATP synthesis. In its structure and functions, the cytochrome bc1 complex bears extensive analogy to the cytochrome b6f complex of chloroplasts and cyanobacteria; cyt c1 plays an analogous role to cytochrome f, in spite of their different structures [].; GO: 0005506 iron ion binding, 0009055 electron carrier activity, 0020037 heme binding; PDB: 1P84_D 2IBZ_D 1EZV_D 3CX5_O 3CXH_O 1KB9_D 1KYO_D 1ZRT_Q 2CA4_B 2C9X_B ....
Probab=97.67 E-value=2.9e-05 Score=50.97 Aligned_cols=20 Identities=30% Similarity=0.908 Sum_probs=17.7
Q ss_pred cHHhHHHHHHHhcccccCCC
Q psy14949 2 SVENGKKIFTRACAQCHTAN 21 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~ 21 (102)
.+.+|.++|.+.|++||+.+
T Consensus 14 slqRG~qvy~~~C~~CHsl~ 33 (219)
T PF02167_consen 14 SLQRGAQVYMEVCASCHSLK 33 (219)
T ss_dssp HHHHHHHHHHHTGGGTSBCT
T ss_pred HHHHHHHHHHHHHhhccccc
Confidence 36789999999999999975
No 42
>TIGR03806 chp_HNE_0200 conserved hypothetical protein, HNE_0200 family. The model TIGR03805 describes an uncharacterized protein family that contains repeats associated with the formation of a right-handed helical stack of parallel beta strands, homologous to those found in a number of carbohydrate-binding proteins and sugar hydrolases. This model describes another uncharacterized protein family, found in the same species as TIGR03805 member proteins, usually as the adjacent gene or in a fusion protein. An example is HNE_0200 from Hyphomonas neptunium ATCC 15444. Sometimes two members of this family are with a single member of TIGR03805. The function is unknown.
Probab=97.44 E-value=0.00059 Score=47.22 Aligned_cols=36 Identities=19% Similarity=0.280 Sum_probs=22.6
Q ss_pred HHHHHHhCCCCCCCCCccCCCC--CCCHHHHHHHHHHHhhc
Q psy14949 63 TLDIYLQNPKKYIPGTKMVFPG--LRKEKDREDLIAYLSTL 101 (102)
Q Consensus 63 ~l~~~~~~~~~~~~~~~m~~~~--~ls~~e~~~l~~yl~sl 101 (102)
.|...+.. ..++.+||..+ +..++++.-|..||.+|
T Consensus 280 iL~~R~~~---~d~~~~MPp~G~~l~~~e~vaLI~~WI~~l 317 (317)
T TIGR03806 280 ILLYRMHS---TEPGVRMPELGRSLVHQEGLALIRQWIASL 317 (317)
T ss_pred hHHHHHhc---CCccccCCcCCCCCCCHHHHHHHHHHHHhC
Confidence 35555542 22456777543 33457899999999875
No 43
>PF10643 Cytochrome-c551: Photosystem P840 reaction-centre cytochrome c-551; InterPro: IPR019604 A photosynthetic reaction-centre complex is found in certain green sulphur bacteria such as Chlorobium vibrioforme, which are anaerobic photo-auto-trophic organisms. The primary electron donor is P840, a probable B-Chl a dimer, and the primary electron acceptor is a B-Chl monomer. Also on the donor side c-type cytochromes are known to function as electron donors to photo-oxidised P840. This family is thus the secondary endogenous donor of the photosynthetic reaction-centre complex and is a membrane-bound cytochrome containing a single haem group. ; PDB: 3A9F_A.
Probab=97.25 E-value=7.7e-05 Score=48.25 Aligned_cols=20 Identities=35% Similarity=0.833 Sum_probs=15.1
Q ss_pred cHHhHHHHHHHhcccccCCC
Q psy14949 2 SVENGKKIFTRACAQCHTAN 21 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~ 21 (102)
|...-+.+|+.+|..||+.+
T Consensus 168 df~AAk~L~~~KCNkCHTl~ 187 (233)
T PF10643_consen 168 DFAAAKALFDRKCNKCHTLK 187 (233)
T ss_dssp -HHHHHHHHHHHTTSSS-SH
T ss_pred hHHHHHHHHHhhccccccHH
Confidence 34567899999999999865
No 44
>COG1858 MauG Cytochrome c peroxidase [Inorganic ion transport and metabolism]
Probab=97.01 E-value=0.0022 Score=45.09 Aligned_cols=18 Identities=33% Similarity=0.875 Sum_probs=14.7
Q ss_pred HhHHHHHH-H-hcccccCCC
Q psy14949 4 ENGKKIFT-R-ACAQCHTAN 21 (102)
Q Consensus 4 ~~G~~lf~-~-~C~~CH~~~ 21 (102)
.+|.++|. + +|++||...
T Consensus 221 ~rGl~LF~~k~~C~aCH~g~ 240 (364)
T COG1858 221 KRGLALFKGKANCAACHNGI 240 (364)
T ss_pred HHHHHHHccCCCchhhccCc
Confidence 47999999 5 599999753
No 45
>TIGR02162 torC trimethylamine-N-oxide reductase c-type cytochrome TorC. This family includes consists of TorC, a pentahemic c-type cytochrome subunit of periplasmic reductases for trimethylamine-N-oxide (TMAO). The N-terminal half is closely related to tetrahemic NapC (or NirT) subunits of periplasmic nitrate (or nitrite) reductases; some species have both TMAO and nitrate reductase complexes.
Probab=96.96 E-value=0.00076 Score=47.82 Aligned_cols=18 Identities=28% Similarity=0.739 Sum_probs=15.4
Q ss_pred HhHHHHHHHhcccccCCC
Q psy14949 4 ENGKKIFTRACAQCHTAN 21 (102)
Q Consensus 4 ~~G~~lf~~~C~~CH~~~ 21 (102)
..|+.+|..+|+.||+.-
T Consensus 322 ~~a~~ly~~~Cs~CH~~~ 339 (386)
T TIGR02162 322 AYAKSMYNGACSMCHVQP 339 (386)
T ss_pred HHHHHHHhcchhhhcCCC
Confidence 369999999999999754
No 46
>PF14376 Haem_bd: Haem-binding domain
Probab=96.89 E-value=0.0024 Score=39.08 Aligned_cols=20 Identities=10% Similarity=0.332 Sum_probs=17.0
Q ss_pred CCCCCHHHHHHHHHHHhhcC
Q psy14949 83 PGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 83 ~~~ls~~e~~~l~~yl~sl~ 102 (102)
...||++|+..|++|+++.+
T Consensus 118 ~a~Ls~~ek~~Ll~Wi~~~R 137 (137)
T PF14376_consen 118 EAKLSEEEKQALLNWIKEQR 137 (137)
T ss_pred CCCCCHHHHHHHHHHHHHcC
Confidence 45679999999999998764
No 47
>COG3748 Predicted membrane protein [Function unknown]
Probab=96.77 E-value=0.0068 Score=42.09 Aligned_cols=16 Identities=19% Similarity=0.366 Sum_probs=13.9
Q ss_pred CCCHHHHHHHHHHHhh
Q psy14949 85 LRKEKDREDLIAYLST 100 (102)
Q Consensus 85 ~ls~~e~~~l~~yl~s 100 (102)
.+||||+.-|.+|+++
T Consensus 387 ~mT~eER~ll~aW~e~ 402 (407)
T COG3748 387 QMTDEERALLAAWFES 402 (407)
T ss_pred cCCHHHHHHHHHHHHh
Confidence 3589999999999976
No 48
>KOG3052|consensus
Probab=96.66 E-value=0.00068 Score=45.32 Aligned_cols=18 Identities=28% Similarity=0.957 Sum_probs=16.2
Q ss_pred HHhHHHHHHHhcccccCC
Q psy14949 3 VENGKKIFTRACAQCHTA 20 (102)
Q Consensus 3 ~~~G~~lf~~~C~~CH~~ 20 (102)
+.+|-++|++.|++||+.
T Consensus 95 iRRGyqVYkqVCaaCHSm 112 (311)
T KOG3052|consen 95 IRRGYQVYKQVCAACHSM 112 (311)
T ss_pred HhhhHHHHHHHHHHhhhh
Confidence 568999999999999984
No 49
>PRK15032 trimethylamine N-oxide reductase cytochrome c-type subunit; Provisional
Probab=96.60 E-value=0.0019 Score=45.96 Aligned_cols=18 Identities=28% Similarity=0.732 Sum_probs=15.3
Q ss_pred HhHHHHHHHhcccccCCC
Q psy14949 4 ENGKKIFTRACAQCHTAN 21 (102)
Q Consensus 4 ~~G~~lf~~~C~~CH~~~ 21 (102)
..|+++|+.+|+.||+.-
T Consensus 319 ~ya~~ly~~~Cs~CHa~~ 336 (390)
T PRK15032 319 AYADSLYNGTCNQCHGAP 336 (390)
T ss_pred HHHHHHHhccchhhcCCC
Confidence 358999999999999754
No 50
>TIGR03791 TTQ_mauG tryptophan tryptophylquinone biosynthesis enzyme MauG. Members of this protein family are the tryptophan tryptophylquinone biosynthesis (TTQ) enzyme MauG, as found in Methylobacterium extorquens and related species. This protein is required to complete the maturation of the TTQ cofactor in the methylamine dehydrogenase light (beta) chain.
Probab=96.53 E-value=0.00096 Score=45.70 Aligned_cols=20 Identities=35% Similarity=0.745 Sum_probs=15.4
Q ss_pred HHhHHHHHHH---------hcccccCCCC
Q psy14949 3 VENGKKIFTR---------ACAQCHTANE 22 (102)
Q Consensus 3 ~~~G~~lf~~---------~C~~CH~~~~ 22 (102)
++-|+.||.. .|++||..+.
T Consensus 6 ~~LGk~LFfD~~LS~~~~~SCasCH~p~~ 34 (291)
T TIGR03791 6 AALGKALFFDPRLSRDGSMSCATCHNPGL 34 (291)
T ss_pred HHHHHHHhcCcccCCCCCcCchhcCCccc
Confidence 4679999952 4999998763
No 51
>COG3258 Cytochrome c [Energy production and conversion]
Probab=95.86 E-value=0.024 Score=38.31 Aligned_cols=14 Identities=14% Similarity=0.411 Sum_probs=11.4
Q ss_pred CHHHHHHHHHHHhh
Q psy14949 87 KEKDREDLIAYLST 100 (102)
Q Consensus 87 s~~e~~~l~~yl~s 100 (102)
+..|+++|++|++=
T Consensus 121 DspEmkAmlaY~kW 134 (293)
T COG3258 121 DSPEMKAMLAYMKW 134 (293)
T ss_pred CCHHHHHHHHHHHH
Confidence 45799999999864
No 52
>PF03150 CCP_MauG: Di-haem cytochrome c peroxidase; InterPro: IPR004852 This is a group of distinct cytochrome c peroxidases (CCPs) that contain two haem groups. Similar to other cytochrome c peroxidases, they reduce hydrogen peroxide to water using c-type haem as an oxidizable substrate. However, since they possess two, instead of one, haem prosthetic groups, bacterial CCPs reduce hydrogen peroxide without the need to generate semi-stable free radicals. The two haem groups have significantly different redox potentials. The high potential (+320 mV) haem feeds electrons from electron shuttle proteins to the low potential (-330 mV) haem, where peroxide is reduced (indeed, the low potential site is known as the peroxidatic site) []. The CCP protein itself is structured into two domains, each containing one c-type haem group, with a calcium-binding site at the domain interface. This family also includes MauG proteins, whose similarity to di-haem CCP was previously recognised [].; GO: 0016491 oxidoreductase activity, 0055114 oxidation-reduction process; PDB: 1IQC_A 2VHD_B 1EB7_A 3RN0_A 3SVW_B 3RMZ_A 3SJL_B 3PXW_A 3SLE_B 3PXS_A ....
Probab=94.95 E-value=0.014 Score=36.66 Aligned_cols=19 Identities=42% Similarity=0.899 Sum_probs=13.7
Q ss_pred HHhHHHHHHH---------hcccccCCC
Q psy14949 3 VENGKKIFTR---------ACAQCHTAN 21 (102)
Q Consensus 3 ~~~G~~lf~~---------~C~~CH~~~ 21 (102)
++.|+.||.. .|++||..+
T Consensus 5 ~~LGk~LF~D~~LS~~~~~SCasCH~~~ 32 (159)
T PF03150_consen 5 AALGKKLFFDPRLSGDGTVSCASCHDPE 32 (159)
T ss_dssp HHHHHHHHT-GGGSTTSS--HHHHS-TT
T ss_pred HHHHHHHhCCCccCCCcCcCchhhCCCc
Confidence 4679999964 499999765
No 53
>PF09626 DHC: Dihaem cytochrome c; InterPro: IPR018588 Dihaem cytochrome c (DHC) is a soluble c-type cytochrome that folds into two distinct domains, each binding a single haem group and connected by a small linker region. Despite little sequence similarity, the N-terminal domain (residues 12-75) is a class I type cytochrome c, that binds one of the haems, but the domain surrounding the other haem is structurally unique. DHC binds electrostatically to an oxygen-binding protein, sphaeroides haem protein (SHP), as a component of a conserved electron transfer pathway. DHC acts as the physiological electron donor for SHP during phototrophic growth []. In certain species DHC is found upstream of IPR011577 from INTERPRO. ; PDB: 2FWT_A 2FW5_A.
Probab=94.24 E-value=0.097 Score=31.41 Aligned_cols=11 Identities=36% Similarity=1.123 Sum_probs=7.4
Q ss_pred HHHhcccccCC
Q psy14949 10 FTRACAQCHTA 20 (102)
Q Consensus 10 f~~~C~~CH~~ 20 (102)
|.+.|++||-+
T Consensus 1 Y~~eCgsCH~a 11 (120)
T PF09626_consen 1 YKEECGSCHMA 11 (120)
T ss_dssp -HHHTTSSS--
T ss_pred CccchhhccCc
Confidence 67889999953
No 54
>COG3488 Predicted thiol oxidoreductase [Energy production and conversion]
Probab=94.14 E-value=0.028 Score=39.43 Aligned_cols=20 Identities=45% Similarity=0.973 Sum_probs=16.3
Q ss_pred cHHhHHHHHHH-hcccccCCC
Q psy14949 2 SVENGKKIFTR-ACAQCHTAN 21 (102)
Q Consensus 2 d~~~G~~lf~~-~C~~CH~~~ 21 (102)
++..||+||.+ .|.+||...
T Consensus 350 ~vl~GkkLF~~agC~aCH~pk 370 (481)
T COG3488 350 QVLAGKKLFAQAGCVACHTPK 370 (481)
T ss_pred hhhhhhHHHHhcCchhccCCc
Confidence 45689999987 699999753
No 55
>COG2857 CYT1 Cytochrome c1 [Energy production and conversion]
Probab=91.89 E-value=0.024 Score=38.11 Aligned_cols=21 Identities=24% Similarity=0.763 Sum_probs=17.6
Q ss_pred cHHhHHHHHHHhcccccCCCC
Q psy14949 2 SVENGKKIFTRACAQCHTANE 22 (102)
Q Consensus 2 d~~~G~~lf~~~C~~CH~~~~ 22 (102)
.+.+|.++|.+.|..||+...
T Consensus 42 ~lq~g~~~~~~~c~~chs~~~ 62 (250)
T COG2857 42 SLQRGAQLYKEYCSACHSLKL 62 (250)
T ss_pred HhhhceeeeecCChhhccccc
Confidence 456899999999999998763
No 56
>PF07635 PSCyt1: Planctomycete cytochrome C; InterPro: IPR011429 These proteins share a region of homology at their N terminus that contains the C-{CPWHF}-{CPWR}-C-H-{CFYW} motif typical of cytochrome c.
Probab=91.05 E-value=0.13 Score=26.85 Aligned_cols=9 Identities=33% Similarity=1.091 Sum_probs=7.5
Q ss_pred cccccCCCC
Q psy14949 14 CAQCHTANE 22 (102)
Q Consensus 14 C~~CH~~~~ 22 (102)
|..||+.+.
T Consensus 1 C~~CHg~~~ 9 (59)
T PF07635_consen 1 CFSCHGPDK 9 (59)
T ss_pred CcCCCCCCC
Confidence 999998764
No 57
>PF06537 DUF1111: Protein of unknown function (DUF1111); InterPro: IPR010538 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=85.07 E-value=0.55 Score=34.69 Aligned_cols=17 Identities=29% Similarity=0.882 Sum_probs=12.6
Q ss_pred HHHH-hcccccCCCCCCC
Q psy14949 9 IFTR-ACAQCHTANEGGA 25 (102)
Q Consensus 9 lf~~-~C~~CH~~~~~g~ 25 (102)
+|+. .|.+||--+|-|.
T Consensus 66 LfN~~SC~sCH~~dGRG~ 83 (499)
T PF06537_consen 66 LFNARSCQSCHIKDGRGH 83 (499)
T ss_pred hhhhhhHhhcccCCCCCC
Confidence 6775 5999998776443
No 58
>COG1858 MauG Cytochrome c peroxidase [Inorganic ion transport and metabolism]
Probab=84.78 E-value=0.54 Score=33.40 Aligned_cols=20 Identities=40% Similarity=0.821 Sum_probs=15.6
Q ss_pred HHhHHHHHHH---------hcccccCCCC
Q psy14949 3 VENGKKIFTR---------ACAQCHTANE 22 (102)
Q Consensus 3 ~~~G~~lf~~---------~C~~CH~~~~ 22 (102)
++.|+.||-. .|++||...-
T Consensus 62 ~aLGk~LffDprLS~sg~~SC~sCH~~~~ 90 (364)
T COG1858 62 AALGKKLFFDPRLSASGTISCATCHNLAR 90 (364)
T ss_pred HHHHHHhhcCcccCCCCCcCchhhcCccc
Confidence 5679999953 4999998764
No 59
>PF09832 DUF2059: Uncharacterized protein conserved in bacteria (DUF2059); InterPro: IPR018637 This entry contains proteins that have no known function. ; PDB: 2X3O_B 3OAO_A.
Probab=82.99 E-value=1.2 Score=23.18 Aligned_cols=17 Identities=12% Similarity=0.329 Sum_probs=13.6
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
..+|++|+.+|++|..|
T Consensus 15 ~~ft~~El~~i~~FY~S 31 (64)
T PF09832_consen 15 EHFTEEELDAILAFYES 31 (64)
T ss_dssp HHS-HHHHHHHHHHHHS
T ss_pred HHCCHHHHHHHHHHHCC
Confidence 34689999999999976
No 60
>PF07627 PSCyt3: Protein of unknown function (DUF1588); InterPro: IPR013039 A region of similarity shared by several Rhodopirellula baltica cytochrome-like proteins that are predicted to be secreted. These proteins also contain IPR011478 from INTERPRO, IPR013036 from INTERPRO, IPR013042 from INTERPRO and IPR013043 from INTERPRO.
Probab=79.63 E-value=0.75 Score=26.77 Aligned_cols=8 Identities=63% Similarity=1.389 Sum_probs=6.5
Q ss_pred hcccccCC
Q psy14949 13 ACAQCHTA 20 (102)
Q Consensus 13 ~C~~CH~~ 20 (102)
.|++||..
T Consensus 71 ~Ca~CH~~ 78 (101)
T PF07627_consen 71 ACASCHRK 78 (101)
T ss_pred cHHHHhhh
Confidence 49999973
No 61
>TIGR03153 cytochr_NrfH cytochrome c nitrate reductase, small subunit. Members of this protein family are NrfH, a tetraheme cytochrome c. NrfH is the cytochrome c nitrate reductase small subunit, and forms a heterodimer with NrfA, the catalytic subunit. While NrfA can act as a monomer, NrfH can bind to and anchor NrfA in the membrane and enables electron transfer to NrfA from quinones.
Probab=78.93 E-value=0.77 Score=27.98 Aligned_cols=16 Identities=25% Similarity=0.885 Sum_probs=12.3
Q ss_pred hHHHHHHHhcccccCC
Q psy14949 5 NGKKIFTRACAQCHTA 20 (102)
Q Consensus 5 ~G~~lf~~~C~~CH~~ 20 (102)
.++++.+.+|.+||+.
T Consensus 96 ~~~~~~~~nC~~CH~~ 111 (135)
T TIGR03153 96 HSRKVVQENCVRCHAG 111 (135)
T ss_pred HHhHhhhcchHHHHhH
Confidence 3466777899999974
No 62
>PF08090 Enterotoxin_HS1: Heat stable E.coli enterotoxin 1; InterPro: IPR012557 Heat-stable toxin 1 of entero-aggregative Escherichia coli (EAST1) is a small toxin. It is not, however, solely associated with entero-aggregative E. coli but also with many other diarrhoeic E. coli families. Some studies have established the role of EAST1 in some human outbreaks of diarrhoea. Isolates from farm animals have been shown to carry the astA gene coding for EAST1. However, the relation between the presence of EAST1 and disease is not conclusive [].
Probab=77.01 E-value=0.88 Score=20.70 Aligned_cols=10 Identities=40% Similarity=1.022 Sum_probs=7.6
Q ss_pred HhcccccCCC
Q psy14949 12 RACAQCHTAN 21 (102)
Q Consensus 12 ~~C~~CH~~~ 21 (102)
..|++||+..
T Consensus 20 tac~s~~grt 29 (36)
T PF08090_consen 20 TACGSCHGRT 29 (36)
T ss_pred hhhccCCCCc
Confidence 3699999754
No 63
>COG3488 Predicted thiol oxidoreductase [Energy production and conversion]
Probab=74.97 E-value=1.3 Score=31.38 Aligned_cols=19 Identities=32% Similarity=0.815 Sum_probs=13.2
Q ss_pred HHHH-hcccccCCCCCCCCC
Q psy14949 9 IFTR-ACAQCHTANEGGANK 27 (102)
Q Consensus 9 lf~~-~C~~CH~~~~~g~~~ 27 (102)
+|+. .|..||-.+|-|.++
T Consensus 93 lfN~raCqnCHvkDGRGrPP 112 (481)
T COG3488 93 LFNTRACQNCHVKDGRGRPP 112 (481)
T ss_pred ccccccccccccccCCCCCC
Confidence 5664 599999888655443
No 64
>PF07583 PSCyt2: Protein of unknown function (DUF1549); InterPro: IPR011444 The function is not known. It is found associated with IPR022655 from INTERPRO. It is also found associated with the Planctomycete cytochrome C domain IPR011429 from INTERPRO.
Probab=73.90 E-value=1.5 Score=28.81 Aligned_cols=7 Identities=71% Similarity=1.611 Sum_probs=6.0
Q ss_pred hcccccC
Q psy14949 13 ACAQCHT 19 (102)
Q Consensus 13 ~C~~CH~ 19 (102)
.|+.||-
T Consensus 176 ~CAqCHd 182 (208)
T PF07583_consen 176 QCAQCHD 182 (208)
T ss_pred chhhccC
Confidence 5999994
No 65
>PF11845 DUF3365: Protein of unknown function (DUF3365); InterPro: IPR021796 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 198 to 657 amino acids in length.
Probab=70.98 E-value=1.2 Score=28.13 Aligned_cols=8 Identities=38% Similarity=1.182 Sum_probs=6.9
Q ss_pred HhcccccC
Q psy14949 12 RACAQCHT 19 (102)
Q Consensus 12 ~~C~~CH~ 19 (102)
..|..||+
T Consensus 147 ~~CL~CHg 154 (188)
T PF11845_consen 147 ESCLSCHG 154 (188)
T ss_pred hHHHHccC
Confidence 46999998
No 66
>PF15182 OTOS: Otospiralin
Probab=64.95 E-value=8.9 Score=20.39 Aligned_cols=23 Identities=22% Similarity=0.312 Sum_probs=19.0
Q ss_pred CccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 78 TKMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 78 ~~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
..||++.. |..|.|.-+.|+++|
T Consensus 15 pa~PYWPf-stsDFW~YveyFrtl 37 (69)
T PF15182_consen 15 PAMPYWPF-STSDFWNYVEYFRTL 37 (69)
T ss_pred CcCCcCCc-cchHHHHHHHHHHHh
Confidence 46788877 678999999999986
No 67
>COG2996 Predicted RNA-bindining protein (contains S1 and HTH domains) [General function prediction only]
Probab=62.35 E-value=15 Score=25.37 Aligned_cols=31 Identities=23% Similarity=0.286 Sum_probs=23.9
Q ss_pred cHHHHHHHHhCCCCCCCCCccCCCCCCCHHHHHHHH
Q psy14949 60 SASTLDIYLQNPKKYIPGTKMVFPGLRKEKDREDLI 95 (102)
Q Consensus 60 ~~~~l~~~~~~~~~~~~~~~m~~~~~ls~~e~~~l~ 95 (102)
+.+.+..+++. .+..|||+...|+++|++..
T Consensus 226 daq~Il~yL~~-----~gG~mpf~DKSsPEdIk~~F 256 (287)
T COG2996 226 DAQMILTYLES-----NGGFMPFNDKSSPEDIKATF 256 (287)
T ss_pred hHHHHHHHHHH-----cCCccccCCCCCHHHHHHHh
Confidence 45556677764 47789999998999999864
No 68
>PF02304 Phage_B: Scaffold protein B; InterPro: IPR003513 This is a family of proteins from single-stranded DNA bacteriophages. Scaffold proteins B and D are required for procapsid formation. Sixty copies of the internal scaffold protein B are found in the procapsid.; GO: 0019069 viral capsid assembly, 0046729 viral procapsid; PDB: 1CD3_B 1AL0_B.
Probab=62.35 E-value=6.1 Score=23.22 Aligned_cols=18 Identities=39% Similarity=0.837 Sum_probs=13.0
Q ss_pred cHHhHHHHHHHh--cccccC
Q psy14949 2 SVENGKKIFTRA--CAQCHT 19 (102)
Q Consensus 2 d~~~G~~lf~~~--C~~CH~ 19 (102)
+++.|+.+|.++ |+.|--
T Consensus 63 ~iE~~ka~~~R~FG~A~~~d 82 (117)
T PF02304_consen 63 DIEAGKAACARRFGCATCDD 82 (117)
T ss_dssp HHHHHHHHHHHHHHSS--SC
T ss_pred HHHHHHHHHHHHcCCCCcch
Confidence 578899999975 999973
No 69
>TIGR01904 GSu_C4xC__C2xCH Geobacter sulfurreducens CxxxxCH...CXXCH domain. This domain occurs from three to eight times in eight different proteins of Geobacter sulfurreducens. The final CXXCH motif matches ProSite motif PS00190, the cytochrome c family heme-binding site signature, suggesting
Probab=61.17 E-value=3.2 Score=20.17 Aligned_cols=6 Identities=50% Similarity=1.564 Sum_probs=5.3
Q ss_pred hccccc
Q psy14949 13 ACAQCH 18 (102)
Q Consensus 13 ~C~~CH 18 (102)
.|.+||
T Consensus 37 ~C~~CH 42 (42)
T TIGR01904 37 GCNGCH 42 (42)
T ss_pred cCcccC
Confidence 599999
No 70
>TIGR01905 paired_CXXCH_1 doubled CXXCH domain. This model represents a domain of about 41 amino acids that contains, among other motifs, two copies of the motif CXXCH associated with heme binding. Almost every member of this family has at least three copies of this domain (at least six copies of CXXCH) is predicted to be a high molecular weight c-type cytochrome. Members are found mostly in species of Shewanella, Geobacter, and Vibrio.
Probab=60.35 E-value=3.5 Score=19.83 Aligned_cols=9 Identities=33% Similarity=1.147 Sum_probs=7.3
Q ss_pred hcccccCCC
Q psy14949 13 ACAQCHTAN 21 (102)
Q Consensus 13 ~C~~CH~~~ 21 (102)
.|.+||..-
T Consensus 8 ~C~~CH~pH 16 (41)
T TIGR01905 8 DCTSCHDPH 16 (41)
T ss_pred Ccccccccc
Confidence 599999754
No 71
>PHA00003 B internal scaffolding protein
Probab=58.20 E-value=8.2 Score=22.74 Aligned_cols=18 Identities=39% Similarity=0.676 Sum_probs=15.0
Q ss_pred cHHhHHHHHHHh--cccccC
Q psy14949 2 SVENGKKIFTRA--CAQCHT 19 (102)
Q Consensus 2 d~~~G~~lf~~~--C~~CH~ 19 (102)
+++.|+.+|.++ |+.|--
T Consensus 66 ~iEagk~~c~RrFGgAtcdd 85 (120)
T PHA00003 66 DIEAGKAICARRFGGATCDD 85 (120)
T ss_pred HHHHHHHHHHHHcCCCCcch
Confidence 578899999974 999974
No 72
>PF09699 Paired_CXXCH_1: Doubled CXXCH motif (Paired_CXXCH_1)
Probab=56.92 E-value=3.4 Score=19.52 Aligned_cols=9 Identities=33% Similarity=1.150 Sum_probs=7.2
Q ss_pred hcccccCCC
Q psy14949 13 ACAQCHTAN 21 (102)
Q Consensus 13 ~C~~CH~~~ 21 (102)
.|.+||..-
T Consensus 8 ~C~~CH~~H 16 (41)
T PF09699_consen 8 QCTSCHDPH 16 (41)
T ss_pred ChhHhcccc
Confidence 499999755
No 73
>PF06943 zf-LSD1: LSD1 zinc finger; InterPro: IPR005735 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This model describes a putative zinc finger domain found in three closely spaced copies in Arabidopsis protein LSD1 and in two copies in other proteins from the same species. The motif resembles CxxCRxxLMYxxGASxVxCxxC []. This domain may play a role in the regulation of transcription, via either repression of a prodeath pathway or activation of an antideath pathway, in response to signals emanating from cells undergoing pathogen-induced hypersensitive cell death. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].
Probab=54.36 E-value=5.7 Score=17.07 Aligned_cols=8 Identities=63% Similarity=1.290 Sum_probs=6.1
Q ss_pred hcccccCC
Q psy14949 13 ACAQCHTA 20 (102)
Q Consensus 13 ~C~~CH~~ 20 (102)
.|+.||.+
T Consensus 18 rCa~C~~V 25 (25)
T PF06943_consen 18 RCACCHTV 25 (25)
T ss_pred ECCccCcC
Confidence 49999963
No 74
>COG3043 NapB Nitrate reductase cytochrome c-type subunit [Energy production and conversion]
Probab=53.33 E-value=11 Score=23.56 Aligned_cols=8 Identities=38% Similarity=1.207 Sum_probs=5.6
Q ss_pred hcccccCC
Q psy14949 13 ACAQCHTA 20 (102)
Q Consensus 13 ~C~~CH~~ 20 (102)
.|.+||+.
T Consensus 88 ~CLsCH~~ 95 (155)
T COG3043 88 RCLSCHSV 95 (155)
T ss_pred hhhhccCH
Confidence 48888864
No 75
>PF12162 STAT1_TAZ2bind: STAT1 TAZ2 binding domain; InterPro: IPR022752 This entry represents the C-terminal domain of STAT1, which selectively binds the TAZ2 domain of CRB (CREB-binding protein) []. This group of eukaryotic proteins is approximately 20 amino acids in length, and is found in association with PF02865 from PFAM, PF00017 from PFAM, PF01017 from PFAM, PF02864 from PFAM. By binding to CRB, it becomes a transcriptional activator and can initiate transcription of certain genes. ; GO: 0003700 sequence-specific DNA binding transcription factor activity; PDB: 2KA6_B.
Probab=53.20 E-value=18 Score=15.12 Aligned_cols=18 Identities=17% Similarity=0.200 Sum_probs=10.0
Q ss_pred CCccCCCCCCCHHHHHHHHHHH
Q psy14949 77 GTKMVFPGLRKEKDREDLIAYL 98 (102)
Q Consensus 77 ~~~m~~~~~ls~~e~~~l~~yl 98 (102)
.++|| ||++|..+|-.++
T Consensus 5 dnmmP----MSPddy~~l~~~V 22 (23)
T PF12162_consen 5 DNMMP----MSPDDYDELERMV 22 (23)
T ss_dssp TS-------S-HHHHHHHHHHH
T ss_pred hcccC----CCHHHHHHHHHhh
Confidence 35566 5788888887664
No 76
>PF09722 DUF2384: Protein of unknown function (DUF2384); InterPro: IPR024467 This domain is found predominantly in proteobacterial proteins. Its function in unknown.
Probab=52.81 E-value=11 Score=18.86 Aligned_cols=41 Identities=22% Similarity=0.401 Sum_probs=26.7
Q ss_pred cHHHHHHHHhCCCCCCCCCccCCCCCCCHHHHHHHHHHHhhc
Q psy14949 60 SASTLDIYLQNPKKYIPGTKMVFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 60 ~~~~l~~~~~~~~~~~~~~~m~~~~~ls~~e~~~l~~yl~sl 101 (102)
+.+....|+..|.....+- -|..-+.|+..+..|..||..+
T Consensus 10 d~~~a~~Wl~~p~~~l~g~-~Plel~~t~~G~~~V~~~L~~~ 50 (54)
T PF09722_consen 10 DEDKARRWLRTPNPALGGR-TPLELLRTEAGAERVLDYLDRI 50 (54)
T ss_pred CHHHHHHHHHChHHHhCCC-CHHHHHcChHHHHHHHHHHHHH
Confidence 4677889999888654331 1111122578888999998764
No 77
>PF13822 ACC_epsilon: Acyl-CoA carboxylase epsilon subunit
Probab=52.55 E-value=15 Score=19.21 Aligned_cols=16 Identities=19% Similarity=0.241 Sum_probs=13.5
Q ss_pred CCHHHHHHHHHHHhhc
Q psy14949 86 RKEKDREDLIAYLSTL 101 (102)
Q Consensus 86 ls~~e~~~l~~yl~sl 101 (102)
-|++|+.+|++-|.++
T Consensus 11 Pt~eElAAL~aVlaa~ 26 (62)
T PF13822_consen 11 PTDEELAALTAVLAAR 26 (62)
T ss_pred CCHHHHHHHHHHHHHH
Confidence 4899999999988764
No 78
>cd07321 Extradiol_Dioxygenase_3A_like Subunit A of Class III extradiol dioxygenases. Extradiol dioxygenases catalyze the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms, resulting in the cleavage of aromatic rings. There are two major groups of dioxygenases according to the cleavage site of the aromatic ring. Intradiol enzymes cleave the aromatic ring between two hydroxyl groups, whereas extradiol enzymes cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon. Extradiol dioxygenases can be divided into three classes. Class I and II enzymes are evolutionary related and show sequence similarity, with the two domain class II enzymes evolving from the class I enzyme through gene duplication. Class III enzymes are different in sequence and structure and usually have two subunits, designated A and B, which form a tetramer composed of two copies of each subunit. This model represents subunit A of c
Probab=52.22 E-value=7.7 Score=21.30 Aligned_cols=31 Identities=13% Similarity=0.310 Sum_probs=21.7
Q ss_pred cHHHHHHHHhCCCCCCCCCccCCCCCCCHHHHHHHHH
Q psy14949 60 SASTLDIYLQNPKKYIPGTKMVFPGLRKEKDREDLIA 96 (102)
Q Consensus 60 ~~~~l~~~~~~~~~~~~~~~m~~~~~ls~~e~~~l~~ 96 (102)
+++...+|+.+|.++.. . .+ ||++|+.+|.+
T Consensus 15 ~~~~re~f~~dp~a~~~----~-~~-Lt~eE~~al~~ 45 (77)
T cd07321 15 KPEVKERFKADPEAVLA----E-YG-LTPEEKAALLA 45 (77)
T ss_pred CHHHHHHHHhCHHHHHH----H-cC-CCHHHHHHHHc
Confidence 46678888888776543 1 23 48999998864
No 79
>COG3445 Acid-induced glycyl radical enzyme [General function prediction only]
Probab=52.09 E-value=18 Score=21.09 Aligned_cols=38 Identities=29% Similarity=0.477 Sum_probs=27.9
Q ss_pred cccHHHHHHHHhCCCCCCC------CCccCCCCCCCHHHHHHHHH
Q psy14949 58 TWSASTLDIYLQNPKKYIP------GTKMVFPGLRKEKDREDLIA 96 (102)
Q Consensus 58 ~~~~~~l~~~~~~~~~~~~------~~~m~~~~~ls~~e~~~l~~ 96 (102)
....+.|.+.+++|.++.. |....|+.+ ++++-+++++
T Consensus 77 vl~retledav~~pekypqltirvsgyavrfnsl-tpeqqrdvi~ 120 (127)
T COG3445 77 VLRRETLEDAVKHPEKYPQLTIRVSGYAVRFNSL-TPEQQRDVIA 120 (127)
T ss_pred eeehhhHHHHhhCcccCCceEEEEeeEEEEeccC-CHHHhhhHHH
Confidence 3567889999999997653 344457766 7888888876
No 80
>TIGR02826 RNR_activ_nrdG3 anaerobic ribonucleoside-triphosphate reductase activating protein. Members of this family represent a set of proteins related to, yet architecturally different from, the activating protein for the glycine radical-containing, oxygen-sensitive ribonucleoside-triphosphate reductase (RNR) as described in model TIGR02491. Members of this family are found paired with members of a similarly divergent set of anaerobic ribonucleoside-triphosphate reductases. Identification of this protein as an RNR activitating protein is partly from pairing with a candidate RNR. It is further supported by our finding that upstream of these operons are examples of a conserved regulatory element (described Rodionov and Gelfand) that is found in nearly all bacteria and that occurs specifically upstream of operons for all three classes of RNR genes.
Probab=51.18 E-value=36 Score=20.97 Aligned_cols=9 Identities=33% Similarity=1.139 Sum_probs=7.1
Q ss_pred hcccccCCC
Q psy14949 13 ACAQCHTAN 21 (102)
Q Consensus 13 ~C~~CH~~~ 21 (102)
.|..||...
T Consensus 28 rC~~C~n~~ 36 (147)
T TIGR02826 28 GCKGCHSPE 36 (147)
T ss_pred CCCCCCChH
Confidence 599999754
No 81
>PHA02902 putative IMV membrane protein; Provisional
Probab=50.61 E-value=20 Score=19.13 Aligned_cols=20 Identities=15% Similarity=0.125 Sum_probs=15.7
Q ss_pred CCCCCCCHHHHHHHHHHHhh
Q psy14949 81 VFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~s 100 (102)
.|...|+++|++++=.++-+
T Consensus 49 ~F~D~lTpDQirAlHrlvT~ 68 (70)
T PHA02902 49 LFKDSLTPDQIKALHRLVSL 68 (70)
T ss_pred hhhccCCHHHHHHHHHHHhc
Confidence 47778899999998877643
No 82
>PF14769 CLAMP: Flagellar C1a complex subunit C1a-32
Probab=50.30 E-value=47 Score=18.94 Aligned_cols=18 Identities=17% Similarity=0.348 Sum_probs=15.1
Q ss_pred CCCCCHHHHHHHHHHHhh
Q psy14949 83 PGLRKEKDREDLIAYLST 100 (102)
Q Consensus 83 ~~~ls~~e~~~l~~yl~s 100 (102)
.++.|.+|+++|++|+.+
T Consensus 61 ~~iFs~~~~~~i~~y~~~ 78 (101)
T PF14769_consen 61 IGIFSVDQVKAIIDYFHN 78 (101)
T ss_pred cCcCCHHHHHHHHHHHHH
Confidence 456799999999999864
No 83
>PF04320 DUF469: Protein with unknown function (DUF469); InterPro: IPR007416 This entry represents a family of uncharacterised proteins which are predicted to function as phosphotransferases.
Probab=49.27 E-value=25 Score=20.52 Aligned_cols=18 Identities=33% Similarity=0.453 Sum_probs=15.1
Q ss_pred CCCCHHHHHHHHHHHhhc
Q psy14949 84 GLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~sl 101 (102)
+..|++++.+|.+||++.
T Consensus 68 gs~tee~R~~v~~WL~~~ 85 (101)
T PF04320_consen 68 GSCTEEDRAAVEAWLKAR 85 (101)
T ss_pred CCCCHHHHHHHHHHHHhC
Confidence 345899999999999874
No 84
>PRK11586 napB nitrate reductase cytochrome C550 subunit; Provisional
Probab=48.51 E-value=13 Score=23.09 Aligned_cols=6 Identities=67% Similarity=1.630 Sum_probs=3.1
Q ss_pred cccccC
Q psy14949 14 CAQCHT 19 (102)
Q Consensus 14 C~~CH~ 19 (102)
|.+||.
T Consensus 83 CL~CH~ 88 (149)
T PRK11586 83 CLQCHG 88 (149)
T ss_pred CccCCC
Confidence 555554
No 85
>cd07921 PCA_45_Doxase_A_like Subunit A of the Class III Extradiol dioxygenase, Protocatechuate 4,5-dioxygenase, and similar enzymes. This subfamily includes the A subunit of protocatechuate (PCA) 4,5-dioxygenase (LigAB) and two subfamilies of unknown function. The A subunit is the smaller, non-catalytic subunit of LigAB. PCA 4,5-dioxygenase catalyzes the oxidization and subsequent ring-opening of PCA (or 3,4-dihydroxybenzoic acid), which is an intermediate in the breakdown of lignin and other compounds. PCA 4,5-dioxygenase is one of the aromatic ring opening dioxygenases which play key roles in the degradation of aromatic compounds. As members of the Class III extradiol dioxygenase family, the enzymes use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. LigAB-like class III enzymes are usually composed of two subunits, designated A and B, which form a tetramer composed of two copies of each subunit.
Probab=47.67 E-value=11 Score=22.18 Aligned_cols=31 Identities=13% Similarity=0.295 Sum_probs=21.9
Q ss_pred cHHHHHHHHhCCCCCCCCCccCCCCCCCHHHHHHHHH
Q psy14949 60 SASTLDIYLQNPKKYIPGTKMVFPGLRKEKDREDLIA 96 (102)
Q Consensus 60 ~~~~l~~~~~~~~~~~~~~~m~~~~~ls~~e~~~l~~ 96 (102)
..+...+|+.+|.++.. . .+ ||+||+.+|.+
T Consensus 25 ~a~~Re~F~aD~eAy~~----~-~g-LTeEe~~AV~~ 55 (106)
T cd07921 25 KAENREAFKADEEAYCD----K-FG-LTEEQKQAVLD 55 (106)
T ss_pred CHHHHHHHHhCHHHHHH----H-cC-CCHHHHHHHHh
Confidence 56778888888876543 1 23 48999998864
No 86
>TIGR01053 LSD1 zinc finger domain, LSD1 subclass. This model describes a putative zinc finger domain found in three closely spaced copies in Arabidopsis protein LSD1 and in two copies in other proteins from the same species. The motif resembles CxxCRxxLMYxxGASxVxCxxC
Probab=46.45 E-value=6.6 Score=17.72 Aligned_cols=9 Identities=56% Similarity=1.139 Sum_probs=7.0
Q ss_pred hcccccCCC
Q psy14949 13 ACAQCHTAN 21 (102)
Q Consensus 13 ~C~~CH~~~ 21 (102)
.|+.||.++
T Consensus 21 rCs~C~~vt 29 (31)
T TIGR01053 21 RCALCQTVN 29 (31)
T ss_pred ECCCCCeEe
Confidence 599999764
No 87
>PHA02119 hypothetical protein
Probab=46.40 E-value=14 Score=20.02 Aligned_cols=10 Identities=40% Similarity=0.953 Sum_probs=8.2
Q ss_pred HHHHHHHhhc
Q psy14949 92 EDLIAYLSTL 101 (102)
Q Consensus 92 ~~l~~yl~sl 101 (102)
++|+.||++|
T Consensus 57 ~divdylr~l 66 (87)
T PHA02119 57 KDIVDYLRSL 66 (87)
T ss_pred HHHHHHHHHc
Confidence 4789999986
No 88
>cd06395 PB1_Map2k5 PB1 domain is essential part of the mitogen-activated protein kinase kinase 5 (Map2k5, alias MEK5) one of the key member of the signaling kinases cascade which involved in angiogenesis and early cardiovascular development. The PB1 domain of Map2k5 interacts with the PB1 domain of another members of kinase cascade MEKK2 (or MEKK3). A canonical PB1-PB1 interaction, involving heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The Map2k5 protein contains a type I PB1 domain.
Probab=44.57 E-value=21 Score=20.01 Aligned_cols=15 Identities=13% Similarity=0.594 Sum_probs=12.5
Q ss_pred CCHHHHHHHHHHHhh
Q psy14949 86 RKEKDREDLIAYLST 100 (102)
Q Consensus 86 ls~~e~~~l~~yl~s 100 (102)
-||+|++++++|.-+
T Consensus 57 RSDeEm~AMlsyy~~ 71 (91)
T cd06395 57 RSDEEMKAMLSYYCS 71 (91)
T ss_pred cchHHHHHHHHHHHH
Confidence 379999999999644
No 89
>COG3303 NrfA Formate-dependent nitrite reductase, periplasmic cytochrome c552 subunit [Inorganic ion transport and metabolism]
Probab=44.53 E-value=8.2 Score=27.94 Aligned_cols=14 Identities=43% Similarity=0.975 Sum_probs=10.1
Q ss_pred HHHHHhcccccCCC
Q psy14949 8 KIFTRACAQCHTAN 21 (102)
Q Consensus 8 ~lf~~~C~~CH~~~ 21 (102)
.-|.+-|+.||.-+
T Consensus 327 ~~f~stCanCH~Qs 340 (501)
T COG3303 327 DNFASTCANCHTQS 340 (501)
T ss_pred HHHHHHHhhhhhhh
Confidence 34667799999643
No 90
>PF14522 Cytochrome_C7: Cytochrome c7; PDB: 3OV0_A 3OUQ_A 3H4N_A 3BXU_B 1OS6_A 2LDO_A 3OUE_A 3H33_A 1RWJ_A 1LM2_A ....
Probab=44.07 E-value=9.2 Score=19.51 Aligned_cols=9 Identities=44% Similarity=1.128 Sum_probs=7.2
Q ss_pred HhcccccCC
Q psy14949 12 RACAQCHTA 20 (102)
Q Consensus 12 ~~C~~CH~~ 20 (102)
..|..||+.
T Consensus 13 ~~C~~CH~~ 21 (65)
T PF14522_consen 13 VNCASCHSD 21 (65)
T ss_dssp TCGCCTSTT
T ss_pred CChhhhCCC
Confidence 369999974
No 91
>PF03892 NapB: Nitrate reductase cytochrome c-type subunit (NapB); InterPro: IPR005591 The napB gene encodes a dihaem cytochrome c, the small subunit of a heterodimeric periplasmic nitrate reductase [].; PDB: 3O5A_B 3ML1_B 1OGY_L 1JNI_A.
Probab=43.94 E-value=12 Score=22.87 Aligned_cols=7 Identities=43% Similarity=1.417 Sum_probs=3.7
Q ss_pred cccccCC
Q psy14949 14 CAQCHTA 20 (102)
Q Consensus 14 C~~CH~~ 20 (102)
|.+||..
T Consensus 72 Cl~CH~~ 78 (133)
T PF03892_consen 72 CLSCHSP 78 (133)
T ss_dssp CHHHHST
T ss_pred CcccCCc
Confidence 5555543
No 92
>COG3184 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.12 E-value=12 Score=24.14 Aligned_cols=19 Identities=11% Similarity=0.378 Sum_probs=16.1
Q ss_pred CCCCCCHHHHHHHHHHHhh
Q psy14949 82 FPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 82 ~~~~ls~~e~~~l~~yl~s 100 (102)
|....|++|+.+|.+|..|
T Consensus 104 Ya~~FT~eEl~ai~aFY~S 122 (183)
T COG3184 104 YAKIFTEEELNAITAFYGS 122 (183)
T ss_pred HHHhcCHHHHHHHHHHHcC
Confidence 5566799999999999865
No 93
>PF11256 DUF3055: Protein of unknown function (DUF3055); InterPro: IPR021415 This family of proteins with unknown function appear to be restricted to Firmicutes.
Probab=42.52 E-value=30 Score=19.31 Aligned_cols=15 Identities=33% Similarity=0.578 Sum_probs=13.4
Q ss_pred CHHHHHHHHHHHhhc
Q psy14949 87 KEKDREDLIAYLSTL 101 (102)
Q Consensus 87 s~~e~~~l~~yl~sl 101 (102)
+++|..+|.+||..+
T Consensus 67 ~~eea~eL~~fl~~~ 81 (81)
T PF11256_consen 67 SEEEAEELREFLYEL 81 (81)
T ss_pred CHHHHHHHHHHHhhC
Confidence 899999999999764
No 94
>PF07128 DUF1380: Protein of unknown function (DUF1380); InterPro: IPR009811 This family consists of several hypothetical bacterial proteins of around 140 residues in length. Members of this family seem to be specific to Enterobacteria. The function of this family is unknown.
Probab=41.49 E-value=69 Score=19.83 Aligned_cols=33 Identities=9% Similarity=0.333 Sum_probs=21.8
Q ss_pred ccccHHHHHHHHhCCCCCCCCCccCCCCCCCHHHHHHHHHHHhh
Q psy14949 57 ITWSASTLDIYLQNPKKYIPGTKMVFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 57 ~~~~~~~l~~~~~~~~~~~~~~~m~~~~~ls~~e~~~l~~yl~s 100 (102)
+.|+.+.+..... +.... +|++|+..|..+|-.
T Consensus 25 lIWT~eDV~~~a~-gme~~----------lTd~E~~aVL~~I~~ 57 (139)
T PF07128_consen 25 LIWTREDVRALAD-GMEYN----------LTDDEARAVLARIGD 57 (139)
T ss_pred EEecHHHHHHHHh-cCCCC----------CCHHHHHHHHHHHhc
Confidence 4688888776644 22111 378899998888754
No 95
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=41.04 E-value=31 Score=19.39 Aligned_cols=20 Identities=10% Similarity=0.187 Sum_probs=15.8
Q ss_pred CCCCCCCHHHHHHHHHHHhh
Q psy14949 81 VFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~s 100 (102)
.|...||++|++++=.++-+
T Consensus 47 ~F~D~lTpDQVrAlHRlvTs 66 (92)
T PHA02681 47 SFEDKMTDDQVRAFHALVTS 66 (92)
T ss_pred hhhccCCHHHHHHHHHHHhC
Confidence 47778899999998877643
No 96
>PF14537 Cytochrom_c3_2: Cytochrome c3; PDB: 1D4C_A 1D4E_A 1D4D_A 2K3V_A 1QO8_D 2P0B_A 2OZY_A 1M64_A 1JRX_A 1QJD_A ....
Probab=40.95 E-value=14 Score=19.54 Aligned_cols=8 Identities=38% Similarity=1.203 Sum_probs=6.5
Q ss_pred hcccccCC
Q psy14949 13 ACAQCHTA 20 (102)
Q Consensus 13 ~C~~CH~~ 20 (102)
.|..||+.
T Consensus 8 ~C~~CH~~ 15 (80)
T PF14537_consen 8 NCVDCHGP 15 (80)
T ss_dssp TGGGTSSS
T ss_pred ChhHhCCC
Confidence 69999963
No 97
>PF13435 Cytochrome_C554: Cytochrome c554 and c-prime; PDB: 1BVB_A 1FT5_A 1FT6_A 1SP3_A.
Probab=39.51 E-value=13 Score=21.11 Aligned_cols=9 Identities=44% Similarity=1.261 Sum_probs=7.4
Q ss_pred hcccccCCC
Q psy14949 13 ACAQCHTAN 21 (102)
Q Consensus 13 ~C~~CH~~~ 21 (102)
.|..||...
T Consensus 50 ~C~~CH~~~ 58 (130)
T PF13435_consen 50 SCTSCHTPG 58 (130)
T ss_dssp CCGGGSCCT
T ss_pred cccccCCCc
Confidence 599999755
No 98
>COG3005 TorC Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit [Energy production and conversion]
Probab=39.28 E-value=12 Score=24.30 Aligned_cols=10 Identities=30% Similarity=0.962 Sum_probs=8.1
Q ss_pred HhcccccCCC
Q psy14949 12 RACAQCHTAN 21 (102)
Q Consensus 12 ~~C~~CH~~~ 21 (102)
..|..||..+
T Consensus 135 ~eCr~CH~~~ 144 (190)
T COG3005 135 AECRNCHNFD 144 (190)
T ss_pred hhhhhccchh
Confidence 4599999876
No 99
>PRK11702 hypothetical protein; Provisional
Probab=38.92 E-value=41 Score=19.86 Aligned_cols=18 Identities=22% Similarity=0.235 Sum_probs=15.0
Q ss_pred CCCCHHHHHHHHHHHhhc
Q psy14949 84 GLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~sl 101 (102)
+..|++++.+|.+||++.
T Consensus 75 gs~tEe~R~~V~~WL~~r 92 (108)
T PRK11702 75 GKCTEEHRALVKKWLEGR 92 (108)
T ss_pred CCCCHHHHHHHHHHHHhC
Confidence 345899999999999863
No 100
>PF06252 DUF1018: Protein of unknown function (DUF1018); InterPro: IPR009363 This family consists of several bacterial and phage proteins, related to Gp16 of phage Mu, of unknown function.
Probab=38.90 E-value=34 Score=20.15 Aligned_cols=13 Identities=8% Similarity=0.046 Sum_probs=7.5
Q ss_pred CHHHHHHHHHHHh
Q psy14949 87 KEKDREDLIAYLS 99 (102)
Q Consensus 87 s~~e~~~l~~yl~ 99 (102)
+..++..|.+-..
T Consensus 56 ~~~q~~KI~aLw~ 68 (119)
T PF06252_consen 56 TSAQLRKIRALWK 68 (119)
T ss_pred chHHHHHHHHHHH
Confidence 4566666655544
No 101
>CHL00037 petA cytochrome f
Probab=38.79 E-value=10 Score=26.38 Aligned_cols=8 Identities=75% Similarity=1.539 Sum_probs=6.6
Q ss_pred cccccCCC
Q psy14949 14 CAQCHTAN 21 (102)
Q Consensus 14 C~~CH~~~ 21 (102)
|+.||-..
T Consensus 56 CANCHLA~ 63 (320)
T CHL00037 56 CANCHLAN 63 (320)
T ss_pred eecccccc
Confidence 99999654
No 102
>PRK02693 apocytochrome f; Reviewed
Probab=37.87 E-value=11 Score=26.05 Aligned_cols=8 Identities=63% Similarity=1.410 Sum_probs=6.6
Q ss_pred cccccCCC
Q psy14949 14 CAQCHTAN 21 (102)
Q Consensus 14 C~~CH~~~ 21 (102)
|+.||-..
T Consensus 49 CANCHLA~ 56 (312)
T PRK02693 49 CANCHLAQ 56 (312)
T ss_pred eecccccc
Confidence 99999654
No 103
>PF07102 DUF1364: Protein of unknown function (DUF1364); InterPro: IPR010774 This entry is represented by Bacteriophage 82, YbcO. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several bacterial and phage proteins of around 95 residues in length. The function of this family is unknown.; PDB: 3G27_A.
Probab=37.19 E-value=8.4 Score=22.16 Aligned_cols=8 Identities=50% Similarity=1.344 Sum_probs=5.5
Q ss_pred hcccccCC
Q psy14949 13 ACAQCHTA 20 (102)
Q Consensus 13 ~C~~CH~~ 20 (102)
.|++||..
T Consensus 53 aCs~CHd~ 60 (94)
T PF07102_consen 53 ACSACHDE 60 (94)
T ss_dssp E-HHHHHH
T ss_pred hHHHHHHH
Confidence 49999964
No 104
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=36.98 E-value=11 Score=17.77 Aligned_cols=8 Identities=25% Similarity=0.845 Sum_probs=6.2
Q ss_pred hcccccCC
Q psy14949 13 ACAQCHTA 20 (102)
Q Consensus 13 ~C~~CH~~ 20 (102)
.|.+|||+
T Consensus 21 ~C~~C~G~ 28 (41)
T PF13453_consen 21 VCPSCGGI 28 (41)
T ss_pred ECCCCCeE
Confidence 59999964
No 105
>PF13821 DUF4187: Domain of unknown function (DUF4187)
Probab=36.61 E-value=32 Score=17.58 Aligned_cols=15 Identities=27% Similarity=0.479 Sum_probs=12.4
Q ss_pred CHHHHHHHHHHHhhc
Q psy14949 87 KEKDREDLIAYLSTL 101 (102)
Q Consensus 87 s~~e~~~l~~yl~sl 101 (102)
..+.+..|+.|||+.
T Consensus 12 ~~e~L~~l~~YLR~~ 26 (55)
T PF13821_consen 12 PEERLDKLLSYLREE 26 (55)
T ss_pred HHHHHHHHHHHHHhh
Confidence 478899999999873
No 106
>TIGR02161 napC_nirT periplasmic nitrate (or nitrite) reductase c-type cytochrome, NapC/NirT family. Nearly every member of this subfamily is NapC, a predicted membrane-anchored four-heme c-type cytochrome that forms one component of the periplasmic nitrate reductase along with NapA, NapB, NapD, NapE, and NapF subunits. A single known exception at this time is NirT, which is instead a component of a nitrite reductase. This family excludes TorC subunits of trimethylamine N-oxide (TMAO) reductases.
Probab=36.16 E-value=15 Score=23.76 Aligned_cols=10 Identities=30% Similarity=0.866 Sum_probs=7.8
Q ss_pred HhcccccCCC
Q psy14949 12 RACAQCHTAN 21 (102)
Q Consensus 12 ~~C~~CH~~~ 21 (102)
+.|..||+.+
T Consensus 136 ~~C~~CH~~~ 145 (185)
T TIGR02161 136 LECRNCHNFE 145 (185)
T ss_pred hhchhhcCcc
Confidence 4799999844
No 107
>PF10058 DUF2296: Predicted integral membrane metal-binding protein (DUF2296); InterPro: IPR019273 This domain, found mainly in the eukaryotic lunapark proteins, has no known function [].
Probab=36.16 E-value=14 Score=18.88 Aligned_cols=10 Identities=30% Similarity=0.946 Sum_probs=8.3
Q ss_pred hcccccCCCC
Q psy14949 13 ACAQCHTANE 22 (102)
Q Consensus 13 ~C~~CH~~~~ 22 (102)
.|..||..+|
T Consensus 24 IC~~C~~hNG 33 (54)
T PF10058_consen 24 ICSKCFSHNG 33 (54)
T ss_pred ECcccchhhc
Confidence 5999998774
No 108
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=36.11 E-value=31 Score=14.20 Aligned_cols=18 Identities=28% Similarity=0.456 Sum_probs=13.4
Q ss_pred CcHHhHHHHHHHhccccc
Q psy14949 1 MSVENGKKIFTRACAQCH 18 (102)
Q Consensus 1 ~d~~~G~~lf~~~C~~CH 18 (102)
++.+..+.+|.+.+..|+
T Consensus 1 ~~~~~~r~i~e~~l~~~~ 18 (33)
T smart00386 1 GDIERARKIYERALEKFP 18 (33)
T ss_pred CcHHHHHHHHHHHHHHCC
Confidence 466778889987766665
No 109
>cd08168 Cytochrom_C3 Heme-binding domain of the class III cytochrome C family and related proteins. This alignment models heme binding core motifs as encountered in the cytochrome C3 family and related proteins. Cytochrome C3 is a tetraheme protein found in sulfate-reducing bacteria which use either thiosulfate or sulfate as the ultimate electron acceptors. C3 is an integral part of a complex electron transfer chain. The model also contains triheme cytochromes C7 which function in electron transfer during Fe(III) respiration by Geobacter sulfurreducens (PpcA, PpcB, PpcC, PpcD, and PpcE) and four repeated core motifs as found in the 16-heme cytochrome C HmcA of Desulfovibrio vulgaris Hildenborough which plays a role in electron transfer through the membrane following periplasmic oxidation of hydrogen (resulting in sulfate reduction in the cytoplasm).
Probab=35.76 E-value=13 Score=20.31 Aligned_cols=14 Identities=29% Similarity=0.783 Sum_probs=9.7
Q ss_pred HHHHHhcccccCCC
Q psy14949 8 KIFTRACAQCHTAN 21 (102)
Q Consensus 8 ~lf~~~C~~CH~~~ 21 (102)
..|-..|..||...
T Consensus 55 ~a~H~~C~~CH~~~ 68 (85)
T cd08168 55 DAFHNSCIGCHKEM 68 (85)
T ss_pred HHHHHhHHHHHHHH
Confidence 34445799999754
No 110
>PF03264 Cytochrom_NNT: NapC/NirT cytochrome c family, N-terminal region; InterPro: IPR005126 Within the NapC/NirT family of cytochrome c proteins, some members, such as NapC P33932 from SWISSPROT and NirT P24038 from SWISSPROT, bind four haem groups, while others, such as TorC P33226 from SWISSPROT, bind five haems. This family aligns the common N-terminal region that contains four haem-binding C-X(2)-CH motifs.; PDB: 2VR0_F 2J7A_C.
Probab=35.56 E-value=19 Score=22.67 Aligned_cols=16 Identities=31% Similarity=0.638 Sum_probs=10.6
Q ss_pred HHHHHHHh----cccccCCC
Q psy14949 6 GKKIFTRA----CAQCHTAN 21 (102)
Q Consensus 6 G~~lf~~~----C~~CH~~~ 21 (102)
....+..+ |..||...
T Consensus 115 ~~~~~~~~~~~~C~~CH~~~ 134 (173)
T PF03264_consen 115 SWKRFKANDDSTCLNCHSDL 134 (173)
T ss_dssp HHHHHHHH---HHHHHCHHH
T ss_pred HHHHHHhhhcccCcccCCCc
Confidence 44555544 99999754
No 111
>PF02085 Cytochrom_CIII: Class III cytochrome C family; InterPro: IPR020942 Cytochromes c (cytC) can be defined as electron-transfer proteins having one or several haem c groups, bound to the protein by one or, more generally, two thioether bonds involving sulphydryl groups of cysteine residues. The fifth haem iron ligand is always provided by a histidine residue. CytC possess a wide range of properties and function in a large number of different redox processes []. Ambler [] recognised four classes of cytC. Class III comprises the low redox potential multiple haem cytochromes: cyt C7 (trihaem), C3 (tetrahaem), and high-molecular-weight cytC, HMC (hexadecahaem), with only 30-40 residues per haem group. The haem c groups, all bis-histidinyl coordinated, are structurally and functionally nonequivalent and present different redox potentials in the range 0 to -400 mV []. The 3D structures of a number of cyt C3 proteins have been determined. The proteins consist of 4-5 alpha-helices and 2 beta-strands wrapped around a compact core of four non-parallel haems, which present a relatively high degree of exposure to the solvent. The overall protein architecture, haem plane orientations and iron-iron distances are highly conserved [].; GO: 0009055 electron carrier activity, 0020037 heme binding; PDB: 1AQE_A 1CZJ_A 1H29_B 1GWS_A 2CVC_A 2KMY_A 1DUW_A 1UPD_A 1I77_A 1GMB_A ....
Probab=35.29 E-value=15 Score=20.91 Aligned_cols=9 Identities=33% Similarity=0.973 Sum_probs=7.1
Q ss_pred hcccccCCC
Q psy14949 13 ACAQCHTAN 21 (102)
Q Consensus 13 ~C~~CH~~~ 21 (102)
.|.+||...
T Consensus 78 ~C~~CH~~~ 86 (102)
T PF02085_consen 78 SCMGCHKEM 86 (102)
T ss_dssp HHHHHHHHH
T ss_pred hHHHHHHHH
Confidence 799999643
No 112
>cd07922 CarBa CarBa is the A subunit of 2-aminophenol 1,6-dioxygenase, which catalyzes the oxidization and subsequent ring-opening of 2-aminophenyl-2,3-diol. CarBa is the A subunit of 2-aminophenol 1,6-dioxygenase, which catalyzes the oxidization and subsequent ring-opening of 2-aminophenyl-2,3-diol. 2-aminophenol 1,6-dioxygenase is a key enzyme in the carbazole degradation pathway isolated from bacterial strains with carbazole degradation ability. The enzyme is a heterotetramer composed of two A and two B subunits. CarB belongs to the class III extradiol dioxygenase family, composed of enzymes which use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. Although the enzyme was originally isolated as a meta-cleavage enzyme for 2'-aminobiphenyl-2,3-diol involved in carbazole degradation, the enzyme has also shown high specificity for 2,3-dihydroxybiphenyl.
Probab=33.61 E-value=22 Score=19.81 Aligned_cols=31 Identities=13% Similarity=0.234 Sum_probs=20.8
Q ss_pred cHHHHHHHHhCCCCCCCCCccCCCCCCCHHHHHHHHH
Q psy14949 60 SASTLDIYLQNPKKYIPGTKMVFPGLRKEKDREDLIA 96 (102)
Q Consensus 60 ~~~~l~~~~~~~~~~~~~~~m~~~~~ls~~e~~~l~~ 96 (102)
.++...+|+.+|.+... -.+ ||++|+.+|..
T Consensus 16 dp~~rerF~~DPea~~~-----~~g-Lt~eE~~aL~~ 46 (81)
T cd07922 16 DPGLIERFQDDPSAVFE-----EYG-LTPAERAALRE 46 (81)
T ss_pred CHHHHHHHHHCHHHHHH-----HcC-CCHHHHHHHHc
Confidence 56677888888775432 123 48999888753
No 113
>KOG3507|consensus
Probab=33.52 E-value=17 Score=19.02 Aligned_cols=9 Identities=33% Similarity=0.962 Sum_probs=7.4
Q ss_pred hcccccCCC
Q psy14949 13 ACAQCHTAN 21 (102)
Q Consensus 13 ~C~~CH~~~ 21 (102)
.|+.||+.+
T Consensus 22 iCgdC~~en 30 (62)
T KOG3507|consen 22 ICGDCGQEN 30 (62)
T ss_pred Eeccccccc
Confidence 499999865
No 114
>PF08263 LRRNT_2: Leucine rich repeat N-terminal domain; InterPro: IPR013210 Leucine-rich repeats (LRR) consist of 2-45 motifs of 20-30 amino acids in length that generally folds into an arc or horseshoe shape []. LRRs occur in proteins ranging from viruses to eukaryotes, and appear to provide a structural framework for the formation of protein-protein interactions [, ].Proteins containing LRRs include tyrosine kinase receptors, cell-adhesion molecules, virulence factors, and extracellular matrix-binding glycoproteins, and are involved in a variety of biological processes, including signal transduction, cell adhesion, DNA repair, recombination, transcription, RNA processing, disease resistance, apoptosis, and the immune response []. Sequence analyses of LRR proteins suggested the existence of several different subfamilies of LRRs. The significance of this classification is that repeats from different subfamilies never occur simultaneously and have most probably evolved independently. It is, however, now clear that all major classes of LRR have curved horseshoe structures with a parallel beta sheet on the concave side and mostly helical elements on the convex side. At least six families of LRR proteins, characterised by different lengths and consensus sequences of the repeats, have been identified. Eleven-residue segments of the LRRs (LxxLxLxxN/CxL), corresponding to the beta-strand and adjacent loop regions, are conserved in LRR proteins, whereas the remaining parts of the repeats (herein termed variable) may be very different. Despite the differences, each of the variable parts contains two half-turns at both ends and a "linear" segment (as the chain follows a linear path overall), usually formed by a helix, in the middle. The concave face and the adjacent loops are the most common protein interaction surfaces on LRR proteins. 3D structure of some LRR proteins-ligand complexes show that the concave surface of LRR domain is ideal for interaction with alpha-helix, thus supporting earlier conclusions that the elongated and curved LRR structure provides an outstanding framework for achieving diverse protein-protein interactions []. Molecular modeling suggests that the conserved pattern LxxLxL, which is shorter than the previously proposed LxxLxLxxN/CxL is sufficient to impart the characteristic horseshoe curvature to proteins with 20- to 30-residue repeats []. This domain is often found at the N terminus of tandem leucine rich repeats.; PDB: 3RGZ_A 3RJ0_A 3RIZ_A 3RGX_A 1OGQ_A.
Probab=33.17 E-value=54 Score=15.30 Aligned_cols=14 Identities=36% Similarity=0.738 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHhhc
Q psy14949 88 EKDREDLIAYLSTL 101 (102)
Q Consensus 88 ~~e~~~l~~yl~sl 101 (102)
++|+.+|++|-+++
T Consensus 2 ~~d~~aLl~~k~~l 15 (43)
T PF08263_consen 2 NQDRQALLAFKKSL 15 (43)
T ss_dssp HHHHHHHHHHHHCT
T ss_pred cHHHHHHHHHHHhc
Confidence 57888999887765
No 115
>PF13986 DUF4224: Domain of unknown function (DUF4224)
Probab=33.11 E-value=26 Score=17.25 Aligned_cols=13 Identities=15% Similarity=0.524 Sum_probs=9.8
Q ss_pred CCCHHHHHHHHHH
Q psy14949 85 LRKEKDREDLIAY 97 (102)
Q Consensus 85 ~ls~~e~~~l~~y 97 (102)
.||++|+..|-.|
T Consensus 2 fLT~~El~elTG~ 14 (47)
T PF13986_consen 2 FLTDEELQELTGY 14 (47)
T ss_pred CCCHHHHHHHHCC
Confidence 4688898888655
No 116
>PF02335 Cytochrom_C552: Cytochrome c552; InterPro: IPR003321 The enzyme cytochrome c nitrite reductase (c552) catalyses the six-electron reduction of nitrite to ammonia as one of the key steps in the biological nitrogen cycle, where it participates in the anaerobic energy metabolism of dissimilatory nitrate ammonification. Cytochrome c nitrite reductase from Sulfurospirillum deleyianum is a functional dimer, with 10 close-packed haem groups of type c and an unusual lysine-coordinated high-spin haem at the active site [].; GO: 0006807 nitrogen compound metabolic process, 0055114 oxidation-reduction process; PDB: 1QDB_B 3TTB_B 3BNJ_A 3BNG_A 1FS8_A 2E80_A 3BNH_A 1FS7_A 2E81_A 3BNF_A ....
Probab=32.77 E-value=20 Score=26.30 Aligned_cols=9 Identities=33% Similarity=1.169 Sum_probs=7.3
Q ss_pred HHhcccccC
Q psy14949 11 TRACAQCHT 19 (102)
Q Consensus 11 ~~~C~~CH~ 19 (102)
.+.|..||.
T Consensus 272 ~~sC~~CH~ 280 (434)
T PF02335_consen 272 ENSCQTCHS 280 (434)
T ss_dssp HHCTTTTST
T ss_pred hhHHhhhcC
Confidence 357999997
No 117
>TIGR03092 SASP_sspI small, acid-soluble spore protein I. This protein family is restricted to a subset of endospore-forming bacteria such as Bacillus subtilis, all of which are in the Firmicutes (low-GC Gram-positive) lineage. It is a minor SASP (small, acid-soluble spore protein) designated SspI. The gene in Bacillus subtilis previously was designated ysfA.
Probab=32.65 E-value=46 Score=17.78 Aligned_cols=41 Identities=15% Similarity=0.356 Sum_probs=24.8
Q ss_pred cHHHHHHHHhCCC-----CCCCC--Ccc-CCCCCCCHHHHHHHHHHHhh
Q psy14949 60 SASTLDIYLQNPK-----KYIPG--TKM-VFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 60 ~~~~l~~~~~~~~-----~~~~~--~~m-~~~~~ls~~e~~~l~~yl~s 100 (102)
+.+.+..+|.+.. ...|| ..+ .++.-++++|...++..|..
T Consensus 14 s~~elk~~I~daI~sgEEk~LPGLGVlFE~~W~~~~~~ek~~m~~~l~~ 62 (65)
T TIGR03092 14 TKEQLEATIVDAIQSGEEKMLPGLGVLFEAIWKHANEQEKDEMLETLEQ 62 (65)
T ss_pred CHHHHHHHHHHHHhccchhcCCccHHHHHHHHHhcCHHHHHHHHHHHHH
Confidence 4566666665433 23444 222 23444589999999998865
No 118
>PF11829 DUF3349: Protein of unknown function (DUF3349); InterPro: IPR021784 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 99 to 124 amino acids in length. ; PDB: 2KVC_A 3OL3_B 3OL4_A 2LKY_A.
Probab=32.37 E-value=50 Score=19.08 Aligned_cols=16 Identities=13% Similarity=0.150 Sum_probs=12.3
Q ss_pred CCCCHHHHHHHHHHHh
Q psy14949 84 GLRKEKDREDLIAYLS 99 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~ 99 (102)
..||++|+.+|+.-|.
T Consensus 34 r~Ltd~ev~~Va~~L~ 49 (96)
T PF11829_consen 34 RRLTDDEVAEVAAELA 49 (96)
T ss_dssp TTS-HHHHHHHHHHHH
T ss_pred ccCCHHHHHHHHHHHH
Confidence 3479999999998874
No 119
>KOG1200|consensus
Probab=31.69 E-value=59 Score=21.78 Aligned_cols=20 Identities=15% Similarity=0.386 Sum_probs=15.2
Q ss_pred CCCCCCCHHHHHHHHHHHhh
Q psy14949 81 VFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~s 100 (102)
|...+-..||+.+++.||.+
T Consensus 218 Pmgr~G~~EevA~~V~fLAS 237 (256)
T KOG1200|consen 218 PMGRLGEAEEVANLVLFLAS 237 (256)
T ss_pred CccccCCHHHHHHHHHHHhc
Confidence 33344468999999999986
No 120
>PF10955 DUF2757: Protein of unknown function (DUF2757); InterPro: IPR020115 This entry contains proteins with no known function.
Probab=31.61 E-value=13 Score=20.50 Aligned_cols=13 Identities=31% Similarity=0.618 Sum_probs=10.6
Q ss_pred CCHHHHHHHHHHH
Q psy14949 86 RKEKDREDLIAYL 98 (102)
Q Consensus 86 ls~~e~~~l~~yl 98 (102)
||++|+++|+.|=
T Consensus 32 Lt~eEr~dmI~~~ 44 (76)
T PF10955_consen 32 LTPEERQDMISYD 44 (76)
T ss_pred CCHHHHhhheEEc
Confidence 5899999998763
No 121
>TIGR03152 cyto_c552_HCOOH formate-dependent cytochrome c nitrite reductase, c552 subunit. Members of this protein family are cytochrome c552, a component of cytochrome c nitrite reductase, which is known more formally as nitrite reductase (cytochrome; ammonia-forming) (EC 1.7.2.2). Nitrate can be reduced by several enzymes. EC 1.7.2.2 reduces nitrite all the way to ammonia, rather than to ammonium hydroxide (nitrite reductase (NAD(P)H), EC 1.7.1.4) or nitric oxide (nitrite reductase (NO-forming), EC 1.7.2.1). Some examples of EC 1.7.2.2 occur in a seven gene system that enables formate-dependent nitrite reduction, but is also found in simpler contexts. Members of this protein family, however, belong to the formate-dependent system.
Probab=31.51 E-value=18 Score=26.65 Aligned_cols=11 Identities=55% Similarity=1.262 Sum_probs=8.2
Q ss_pred HHHhcccccCC
Q psy14949 10 FTRACAQCHTA 20 (102)
Q Consensus 10 f~~~C~~CH~~ 20 (102)
+...|.+||+.
T Consensus 273 ~~~aC~~CH~~ 283 (439)
T TIGR03152 273 FDDTCANCHTQ 283 (439)
T ss_pred HHHHHHhCCCC
Confidence 44579999974
No 122
>PF06883 RNA_pol_Rpa2_4: RNA polymerase I, Rpa2 specific domain ; InterPro: IPR009674 This domain is found between domain 3 and domain 5, but shows no homology to domain 4 of Rpb2. The external domains in multisubunit RNA polymerase (those most distant from the active site) are known to demonstrate more sequence variability [].; GO: 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent, 0005634 nucleus
Probab=30.28 E-value=46 Score=17.23 Aligned_cols=17 Identities=18% Similarity=0.235 Sum_probs=13.2
Q ss_pred CCHHHHHHHHHHHhhcC
Q psy14949 86 RKEKDREDLIAYLSTLH 102 (102)
Q Consensus 86 ls~~e~~~l~~yl~sl~ 102 (102)
.++++...+++.|+.+|
T Consensus 4 ~~~~~a~~~~~~LR~~K 20 (58)
T PF06883_consen 4 VSPEEAEQIADQLRYLK 20 (58)
T ss_pred ecHHHHHHHHHHHHHHH
Confidence 46788889999888654
No 123
>cd07923 Gallate_dioxygenase_C The C-terminal domain of Gallate Dioxygenase, which catalyzes the oxidization and subsequent ring-opening of gallate. Gallate Dioxygenase catalyzes the oxidization and subsequent ring-opening of gallate, an intermediate in the degradation of the aromatic compound, syringate. The reaction product of gallate dioxygenase is 4-oxalomesaconate. The amino acid sequence of the N-terminal and C-terminal regions of gallate dioxygenase exhibits homology with the sequence of the PCA 4,5-dioxygenase B (catalytic) and A subunits, respectively. This model represents the C-terminal domain, which is similar to the A subunit of PCA 4,5-dioxygenase (or LigAB). The enzyme is estimated to be a homodimer according to the Escherichia coli enzyme. Since enzymes in this subfamily have fused A and B subunits, the dimer interface may resemble the tetramer interface of classical LigAB enzymes. This enzyme belongs to the class III extradiol dioxygenase family, composed of enzymes whi
Probab=30.23 E-value=26 Score=20.13 Aligned_cols=31 Identities=16% Similarity=0.326 Sum_probs=20.9
Q ss_pred cHHHHHHHHhCCCCCCCCCccCCCCCCCHHHHHHHHH
Q psy14949 60 SASTLDIYLQNPKKYIPGTKMVFPGLRKEKDREDLIA 96 (102)
Q Consensus 60 ~~~~l~~~~~~~~~~~~~~~m~~~~~ls~~e~~~l~~ 96 (102)
..+...+|+.+|.+++. -.+ ||+||+.+|.+
T Consensus 17 ~a~~RerF~~D~ea~~~-----e~g-Lt~Ee~~av~~ 47 (94)
T cd07923 17 EPAHRERFLEDPEALFD-----EAG-LTEEERTLIRN 47 (94)
T ss_pred CHHHHHHHHhCHHHHHH-----HcC-CCHHHHHHHHc
Confidence 45677788888776543 123 48999988864
No 124
>PRK10617 cytochrome c-type protein NapC; Provisional
Probab=30.09 E-value=20 Score=23.50 Aligned_cols=10 Identities=30% Similarity=1.032 Sum_probs=7.7
Q ss_pred HhcccccCCC
Q psy14949 12 RACAQCHTAN 21 (102)
Q Consensus 12 ~~C~~CH~~~ 21 (102)
+.|.+||+.+
T Consensus 145 ~~Cr~CH~~~ 154 (200)
T PRK10617 145 QECRNCHNFD 154 (200)
T ss_pred ccchhhcCcc
Confidence 4699999843
No 125
>CHL00136 rpl31 ribosomal protein L31; Validated
Probab=29.79 E-value=21 Score=19.24 Aligned_cols=8 Identities=13% Similarity=0.098 Sum_probs=6.3
Q ss_pred HhcccccC
Q psy14949 12 RACAQCHT 19 (102)
Q Consensus 12 ~~C~~CH~ 19 (102)
..|+.||-
T Consensus 36 dv~s~~HP 43 (68)
T CHL00136 36 DIWSGNHP 43 (68)
T ss_pred EeCCCCCc
Confidence 35999994
No 126
>TIGR03631 bact_S13 30S ribosomal protein S13. This model describes bacterial ribosomal protein S13, to the exclusion of the homologous archaeal S13P and eukaryotic ribosomal protein S18. This model identifies some (but not all) instances of chloroplast and mitochondrial S13, which is of bacterial type.
Probab=29.73 E-value=61 Score=19.15 Aligned_cols=17 Identities=6% Similarity=0.003 Sum_probs=14.1
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
+.|+++|+..|..+|.+
T Consensus 44 ~~L~~~qi~~l~~~l~~ 60 (113)
T TIGR03631 44 KDLTEEELNAIREEIEA 60 (113)
T ss_pred ccCCHHHHHHHHHHHHh
Confidence 45699999999999953
No 127
>PF07295 DUF1451: Protein of unknown function (DUF1451); InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=29.37 E-value=55 Score=20.34 Aligned_cols=36 Identities=28% Similarity=0.366 Sum_probs=22.2
Q ss_pred cHHHHHHHHhCCCCCCCCCccCCCCCCCHHHHHHHHHHHhh
Q psy14949 60 SASTLDIYLQNPKKYIPGTKMVFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 60 ~~~~l~~~~~~~~~~~~~~~m~~~~~ls~~e~~~l~~yl~s 100 (102)
+...|..++...+... .-.+-||.+|+..|.+||+.
T Consensus 12 ~~~~L~~~le~a~e~~-----~~~~elT~eEl~lv~~ylkR 47 (146)
T PF07295_consen 12 SEEELQEALEKAKEYL-----VAAGELTREELALVSAYLKR 47 (146)
T ss_pred CHHHHHHHHHHHHHHH-----HHHhhcCHHHHHHHHHHHHH
Confidence 3455666665433221 11234589999999999975
No 128
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=28.97 E-value=20 Score=24.19 Aligned_cols=8 Identities=38% Similarity=1.149 Sum_probs=6.4
Q ss_pred hcccccCC
Q psy14949 13 ACAQCHTA 20 (102)
Q Consensus 13 ~C~~CH~~ 20 (102)
.|++||-.
T Consensus 199 ~C~GC~m~ 206 (239)
T COG1579 199 VCGGCHMK 206 (239)
T ss_pred cccCCeee
Confidence 49999953
No 129
>COG0099 RpsM Ribosomal protein S13 [Translation, ribosomal structure and biogenesis]
Probab=28.90 E-value=63 Score=19.49 Aligned_cols=17 Identities=24% Similarity=0.251 Sum_probs=14.1
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
+-||++|+..|..++.+
T Consensus 46 ~eLteeei~~ir~~i~~ 62 (121)
T COG0099 46 GELTEEEIERLRDAIQN 62 (121)
T ss_pred ccCCHHHHHHHHHHHHh
Confidence 34699999999999874
No 130
>TIGR03146 cyt_nit_nrfB cytochrome c nitrite reductase, pentaheme subunit. Members of this protein family contain five copies of the CXXCH heme-binding motif, and are the NrfB component of the multisubunit enzyme, cytochrome c nitrite reductase.
Probab=28.84 E-value=19 Score=22.07 Aligned_cols=8 Identities=38% Similarity=1.207 Sum_probs=6.9
Q ss_pred hcccccCC
Q psy14949 13 ACAQCHTA 20 (102)
Q Consensus 13 ~C~~CH~~ 20 (102)
.|+.||+.
T Consensus 42 ~C~~CH~~ 49 (145)
T TIGR03146 42 TCVNCHGN 49 (145)
T ss_pred chhhcCCc
Confidence 69999984
No 131
>CHL00137 rps13 ribosomal protein S13; Validated
Probab=28.84 E-value=65 Score=19.36 Aligned_cols=17 Identities=6% Similarity=-0.019 Sum_probs=14.1
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
+.|+++|+..|..+|..
T Consensus 46 ~~Lt~~qi~~l~~~i~~ 62 (122)
T CHL00137 46 KDLTDEQISALREIIEE 62 (122)
T ss_pred ccCCHHHHHHHHHHHHH
Confidence 34699999999999963
No 132
>PRK00019 rpmE 50S ribosomal protein L31; Reviewed
Probab=28.55 E-value=22 Score=19.34 Aligned_cols=8 Identities=38% Similarity=1.310 Sum_probs=6.3
Q ss_pred HhcccccC
Q psy14949 12 RACAQCHT 19 (102)
Q Consensus 12 ~~C~~CH~ 19 (102)
..|+.||-
T Consensus 36 di~s~~HP 43 (72)
T PRK00019 36 DVCSKCHP 43 (72)
T ss_pred EeCCCCCC
Confidence 45999994
No 133
>PF14098 SSPI: Small, acid-soluble spore protein I
Probab=28.29 E-value=68 Score=17.14 Aligned_cols=41 Identities=20% Similarity=0.328 Sum_probs=24.2
Q ss_pred cHHHHHHHHhCCC-----CCCCC--Ccc-CCCCCCCHHHHHHHHHHHhh
Q psy14949 60 SASTLDIYLQNPK-----KYIPG--TKM-VFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 60 ~~~~l~~~~~~~~-----~~~~~--~~m-~~~~~ls~~e~~~l~~yl~s 100 (102)
+.+.+..+|.+.. ...|| ..+ -++.-++++|...++.+|..
T Consensus 15 s~~el~~~I~daI~sgEE~~LPGLGVlFE~~W~~~~~~ek~~m~~~l~~ 63 (65)
T PF14098_consen 15 SKEELKDTIEDAIQSGEEKALPGLGVLFEVIWKNSDESEKQEMVNTLEQ 63 (65)
T ss_pred CHHHHHHHHHHHHhccchhcCCchHHHHHHHHHhCCHHHHHHHHHHHHh
Confidence 4566666665433 23444 122 12334489999999999875
No 134
>PF07757 AdoMet_MTase: Predicted AdoMet-dependent methyltransferase; InterPro: IPR011671 tRNA (uracil-O(2)-)-methyltransferase catalyses the formation of O(2)-methyl-uracil at position 44 (m2U44) in tRNA(Ser) [].; GO: 0008168 methyltransferase activity
Probab=28.21 E-value=36 Score=20.24 Aligned_cols=13 Identities=38% Similarity=0.537 Sum_probs=10.2
Q ss_pred HHHHHHHHHHhhcC
Q psy14949 89 KDREDLIAYLSTLH 102 (102)
Q Consensus 89 ~e~~~l~~yl~sl~ 102 (102)
||+ +|++||-.||
T Consensus 38 EDl-aIAAyLi~LW 50 (112)
T PF07757_consen 38 EDL-AIAAYLIELW 50 (112)
T ss_pred HHH-HHHHHHHHHH
Confidence 565 8999998764
No 135
>PF05927 Penaeidin: Penaeidin; InterPro: IPR009226 This family consists of several isoforms of the penaeidin protein, which is specific to shrimps. Penaeidins, a unique family of antimicrobial peptides (AMPs) with both proline and cysteine-rich domains, were initially identified in the hemolymph of the Pacific white shrimp, Penaeus vannamei [].; GO: 0008061 chitin binding, 0005737 cytoplasm; PDB: 1UEO_A 1XV3_A.
Probab=27.90 E-value=27 Score=18.82 Aligned_cols=10 Identities=20% Similarity=0.660 Sum_probs=7.2
Q ss_pred HhcccccCCC
Q psy14949 12 RACAQCHTAN 21 (102)
Q Consensus 12 ~~C~~CH~~~ 21 (102)
..|.+||++.
T Consensus 44 ~~C~sC~~i~ 53 (73)
T PF05927_consen 44 PVCNSCYRIS 53 (73)
T ss_dssp SSTTTTTTS-
T ss_pred cccccccccC
Confidence 3599999764
No 136
>PF11293 DUF3094: Protein of unknown function (DUF3094); InterPro: IPR021444 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=27.64 E-value=77 Score=16.30 Aligned_cols=16 Identities=19% Similarity=0.320 Sum_probs=13.5
Q ss_pred CCCHHHHHHHHHHHhh
Q psy14949 85 LRKEKDREDLIAYLST 100 (102)
Q Consensus 85 ~ls~~e~~~l~~yl~s 100 (102)
.|++||-+.|-+||.+
T Consensus 3 rL~pEDQ~~Vd~yL~a 18 (55)
T PF11293_consen 3 RLNPEDQQRVDEYLQA 18 (55)
T ss_pred CCCHHHHHHHHHHHhC
Confidence 4689999999999976
No 137
>cd04860 AE_Prim_S AE_Prim_S: primase domain similar to that found in the small subunit of archaeal and eukaryotic (A/E) DNA primases. Primases are DNA-dependent RNA polymerases which synthesis the short RNA primers required for DNA replication. In addition to its catalytic role in replication, DNA primase may play a role in coupling replication to DNA damage repair and in checkpoint control during S phase. In eukaryotes, this small catalytically active primase subunit (p50) and a larger primase subunit (p60), referred to jointly as the core primase, associate with the B subunit and the DNA polymerase alpha subunit in a complex, called Pol alpha-pri. The function of the larger primase subunit is unclear. Included in this group are Pfu41 and Pfu46, these two proteins comprise the primase complex of the archaea Pyrococcus furiosus; Pfu41 and Pfu46 have sequence identity to the eukaryotic p50 and p60 primase proteins respectively. Pfu41 preferentially uses dNTPs as substrate. Pfu46 regulat
Probab=27.33 E-value=59 Score=21.76 Aligned_cols=16 Identities=19% Similarity=0.513 Sum_probs=13.9
Q ss_pred CCCHHHHHHHHHHHhh
Q psy14949 85 LRKEKDREDLIAYLST 100 (102)
Q Consensus 85 ~ls~~e~~~l~~yl~s 100 (102)
.|+.+++.+|+.||.-
T Consensus 150 ~L~~~~R~~Iv~Yl~~ 165 (232)
T cd04860 150 KLDSDERREIVDYLNG 165 (232)
T ss_pred hCCHHHHHHHHHHHHH
Confidence 4589999999999974
No 138
>TIGR03823 FliZ flagellar regulatory protein FliZ. FliZ is involved in the regulation of flagellar assembly and possibly also the down-regulation of the motile phenotype. FliZ interacts with the flagellar translational activator FlhCD complex.
Probab=26.96 E-value=19 Score=22.72 Aligned_cols=9 Identities=33% Similarity=1.080 Sum_probs=7.2
Q ss_pred HhcccccCC
Q psy14949 12 RACAQCHTA 20 (102)
Q Consensus 12 ~~C~~CH~~ 20 (102)
.+|+.||..
T Consensus 23 thCshC~K~ 31 (168)
T TIGR03823 23 THCSHCHKL 31 (168)
T ss_pred chhhhhcch
Confidence 379999964
No 139
>PF09695 YtfJ_HI0045: Bacterial protein of unknown function (YtfJ_HI0045); InterPro: IPR006513 These are sequences from gammaproteobacteria that are related to the Escherichia coli protein, YtfJ.
Probab=26.93 E-value=74 Score=20.18 Aligned_cols=17 Identities=24% Similarity=0.356 Sum_probs=14.5
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
+.||++|+..+++-|+.
T Consensus 143 G~Ls~~Ev~qVi~Ll~~ 159 (160)
T PF09695_consen 143 GALSPAEVQQVIALLKK 159 (160)
T ss_pred CCCCHHHHHHHHHHHhc
Confidence 56799999999998864
No 140
>PF13099 DUF3944: Domain of unknown function (DUF3944)
Probab=26.67 E-value=81 Score=14.66 Aligned_cols=14 Identities=29% Similarity=0.375 Sum_probs=11.2
Q ss_pred CCCHHHHHHHHHHH
Q psy14949 85 LRKEKDREDLIAYL 98 (102)
Q Consensus 85 ~ls~~e~~~l~~yl 98 (102)
..|.+|+..|+..|
T Consensus 12 ~cs~edL~~L~~~L 25 (35)
T PF13099_consen 12 ECSNEDLKDLVDIL 25 (35)
T ss_pred HCCHHHHHHHHHHH
Confidence 35789999998876
No 141
>PRK05179 rpsM 30S ribosomal protein S13; Validated
Probab=26.53 E-value=75 Score=19.07 Aligned_cols=17 Identities=0% Similarity=0.032 Sum_probs=14.2
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
+.|+++|+..|..+|..
T Consensus 46 ~~L~~~qi~~l~~~i~~ 62 (122)
T PRK05179 46 KDLTDEELDKIREEIDK 62 (122)
T ss_pred ccCCHHHHHHHHHHHHh
Confidence 35699999999999964
No 142
>PF12123 Amidase02_C: N-acetylmuramoyl-l-alanine amidase; InterPro: IPR021976 This domain is found in bacteria and viruses. This domain is about 50 amino acids in length. This domain is classified with the enzyme classification code 3.5.1.28 from EC. This domain is the C-terminal of the enzyme which hydrolyses the link between N-acetylmuramoyl residues and L-amino acid residues in certain cell-wall glycopeptides. ; PDB: 2L48_B.
Probab=26.51 E-value=81 Score=15.51 Aligned_cols=17 Identities=12% Similarity=0.276 Sum_probs=11.6
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
+-+|+.|++++-+||..
T Consensus 22 ~~~s~~~L~k~~~wld~ 38 (45)
T PF12123_consen 22 DPLSDAELDKFTAWLDE 38 (45)
T ss_dssp ----HHHHHHHHHHHHH
T ss_pred CCCCHHHHHHHHHHHHh
Confidence 34589999999999865
No 143
>PRK11659 cytochrome c nitrite reductase pentaheme subunit; Provisional
Probab=26.45 E-value=22 Score=22.82 Aligned_cols=9 Identities=33% Similarity=0.988 Sum_probs=7.2
Q ss_pred hcccccCCC
Q psy14949 13 ACAQCHTAN 21 (102)
Q Consensus 13 ~C~~CH~~~ 21 (102)
.|..||+.-
T Consensus 74 ~C~~CH~~~ 82 (183)
T PRK11659 74 TCTNCHGNI 82 (183)
T ss_pred chhhhCccc
Confidence 599999843
No 144
>TIGR02251 HIF-SF_euk Dullard-like phosphatase domain. This domain is related to domains found in FCP1-like phosphatases (TIGR02250), and together both are detected by the Pfam model pfam03031.
Probab=26.34 E-value=80 Score=19.54 Aligned_cols=22 Identities=23% Similarity=0.382 Sum_probs=18.1
Q ss_pred CCCCCCCHHHHHHHHHHHhhcC
Q psy14949 81 VFPGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~sl~ 102 (102)
+|.+-.+|.++..|+.||.+|.
T Consensus 137 ~f~~~~~D~~L~~l~~~L~~l~ 158 (162)
T TIGR02251 137 SWFGDPNDTELLNLIPFLEGLR 158 (162)
T ss_pred CCCCCCCHHHHHHHHHHHHHHh
Confidence 5666668999999999998863
No 145
>PRK02955 small acid-soluble spore protein SspI; Provisional
Probab=26.06 E-value=68 Score=17.29 Aligned_cols=41 Identities=20% Similarity=0.340 Sum_probs=24.6
Q ss_pred cHHHHHHHHhCCC-----CCCCC--Ccc-CCCCCCCHHHHHHHHHHHhh
Q psy14949 60 SASTLDIYLQNPK-----KYIPG--TKM-VFPGLRKEKDREDLIAYLST 100 (102)
Q Consensus 60 ~~~~l~~~~~~~~-----~~~~~--~~m-~~~~~ls~~e~~~l~~yl~s 100 (102)
+.+.+..+|.+.. ...|| ..+ .++.-++++|...++..|..
T Consensus 17 s~eel~~~I~daIqsgEEk~LPGLGVlFE~~W~~~~~~ek~~m~~~l~~ 65 (68)
T PRK02955 17 SKEELEGTIVDAIQSGEEKMLPGLGVLFEVIWKNADENEKDEMLETLEQ 65 (68)
T ss_pred CHHHHHHHHHHHHhccchhcCCcchhHHHHHHHhcCHHHHHHHHHHHHH
Confidence 4566666665433 23444 222 23444589999999998865
No 146
>COG1365 Predicted ATPase (PP-loop superfamily) [General function prediction only]
Probab=25.78 E-value=19 Score=24.15 Aligned_cols=7 Identities=43% Similarity=1.512 Sum_probs=6.1
Q ss_pred cccccCC
Q psy14949 14 CAQCHTA 20 (102)
Q Consensus 14 C~~CH~~ 20 (102)
|..||+.
T Consensus 135 CGRCh~~ 141 (255)
T COG1365 135 CGRCHSM 141 (255)
T ss_pred cchHHHH
Confidence 9999973
No 147
>PF09601 DUF2459: Protein of unknown function (DUF2459); InterPro: IPR011727 This conserved hypothetical protein of unknown function is predominantly found in proteobacteria. Its function is unknown and its genome context is not well-conserved. It is found amid urease genes in at least one species.
Probab=25.76 E-value=69 Score=20.40 Aligned_cols=15 Identities=20% Similarity=0.403 Sum_probs=13.7
Q ss_pred CCHHHHHHHHHHHhh
Q psy14949 86 RKEKDREDLIAYLST 100 (102)
Q Consensus 86 ls~~e~~~l~~yl~s 100 (102)
+|+++.+.|++||++
T Consensus 98 ls~~~y~~L~~~I~~ 112 (173)
T PF09601_consen 98 LSEAQYRRLVAFIRA 112 (173)
T ss_pred cCHHHHHHHHHHHHH
Confidence 589999999999976
No 148
>PF12408 DUF3666: Ribose-5-phosphate isomerase ; InterPro: IPR022133 This domain family is found in bacteria, and is approximately 50 amino acids in length. The family is found in association with PF02502 from PFAM. There are two completely conserved residues (D and F) that may be functionally important. ; PDB: 3ONO_A 3C5Y_N 2PPW_A.
Probab=25.73 E-value=53 Score=16.43 Aligned_cols=12 Identities=33% Similarity=0.692 Sum_probs=6.9
Q ss_pred HHHHHHHHHhhc
Q psy14949 90 DREDLIAYLSTL 101 (102)
Q Consensus 90 e~~~l~~yl~sl 101 (102)
+-..|++|++++
T Consensus 37 qd~eI~~yvk~l 48 (48)
T PF12408_consen 37 QDEEIAAYVKEL 48 (48)
T ss_dssp --HHHHHHHHCC
T ss_pred CcHHHHHHHHcC
Confidence 445677887764
No 149
>cd03271 ABC_UvrA_II The excision repair protein UvrA domain II; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins and UvrB having one ATP binding site that is structurally related to that of helicases.
Probab=25.59 E-value=29 Score=23.53 Aligned_cols=20 Identities=30% Similarity=0.370 Sum_probs=13.1
Q ss_pred CCCCCCCHHHHHHHHHHHhhc
Q psy14949 81 VFPGLRKEKDREDLIAYLSTL 101 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~sl 101 (102)
|+.++ +..++..|...|+.+
T Consensus 199 PtsgL-D~~~~~~l~~~L~~l 218 (261)
T cd03271 199 PTTGL-HFHDVKKLLEVLQRL 218 (261)
T ss_pred CCCCC-CHHHHHHHHHHHHHH
Confidence 34444 777888777777653
No 150
>PRK01397 50S ribosomal protein L31; Provisional
Probab=25.57 E-value=27 Score=19.33 Aligned_cols=8 Identities=13% Similarity=-0.035 Sum_probs=6.4
Q ss_pred HhcccccC
Q psy14949 12 RACAQCHT 19 (102)
Q Consensus 12 ~~C~~CH~ 19 (102)
..|+.||-
T Consensus 35 di~s~~HP 42 (78)
T PRK01397 35 DVDFRKHP 42 (78)
T ss_pred EeCCCCCC
Confidence 45999994
No 151
>TIGR03629 arch_S13P archaeal ribosomal protein S13P. This model describes exclusively the archaeal ribosomal protein S13P. It excludes the homologous eukaryotic 40S ribosomal protein S18 and bacterial 30S ribosomal protein S13.
Probab=25.53 E-value=79 Score=19.58 Aligned_cols=17 Identities=18% Similarity=0.292 Sum_probs=14.4
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
+.||++|+..|..+|..
T Consensus 50 ~~Lt~~qi~~l~~~i~~ 66 (144)
T TIGR03629 50 GYLDDEEIEKLEEAVEN 66 (144)
T ss_pred ccCCHHHHHHHHHHHHh
Confidence 34699999999999975
No 152
>PRK11125 nrfA cytochrome c nitrite reductase subunit c552; Provisional
Probab=25.46 E-value=27 Score=26.06 Aligned_cols=10 Identities=60% Similarity=1.454 Sum_probs=7.8
Q ss_pred HHHhcccccC
Q psy14949 10 FTRACAQCHT 19 (102)
Q Consensus 10 f~~~C~~CH~ 19 (102)
+.+.|..||.
T Consensus 311 ~~~~C~~CH~ 320 (480)
T PRK11125 311 FDQTCANCHT 320 (480)
T ss_pred HHHHHHhCCC
Confidence 3457999996
No 153
>KOG1605|consensus
Probab=25.25 E-value=1.3e+02 Score=20.68 Aligned_cols=22 Identities=14% Similarity=0.234 Sum_probs=17.1
Q ss_pred CCCCCCCHHHHHHHHHHHhhcC
Q psy14949 81 VFPGLRKEKDREDLIAYLSTLH 102 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl~sl~ 102 (102)
+|..-.+|.|+-.|+-||++|.
T Consensus 226 sw~~d~~D~eLL~LlpfLe~L~ 247 (262)
T KOG1605|consen 226 SWFDDPTDTELLKLLPFLEALA 247 (262)
T ss_pred ccccCCChHHHHHHHHHHHHhc
Confidence 4444457999999999999863
No 154
>PF09343 DUF2460: Conserved hypothetical protein 2217 (DUF2460); InterPro: IPR011740 The entry represents a number of conserved hypothetical proteins. Their genes are often, though not always, encoded in apparent phage-derived regions of bacterial chromosomes. The Rhodobacter capsulatus sequence is apparently part of the gene transfer agent [see Fig.1, in ].
Probab=25.15 E-value=84 Score=20.58 Aligned_cols=17 Identities=35% Similarity=0.528 Sum_probs=14.5
Q ss_pred CCCHHHHHHHHHHHhhc
Q psy14949 85 LRKEKDREDLIAYLSTL 101 (102)
Q Consensus 85 ~ls~~e~~~l~~yl~sl 101 (102)
+++.+|+.+|++|++..
T Consensus 50 ~~s~~~l~~l~~FF~ar 66 (198)
T PF09343_consen 50 VRSLEDLEALIAFFEAR 66 (198)
T ss_pred CCCHHHHHHHHHHHHHh
Confidence 45899999999999864
No 155
>PRK11032 hypothetical protein; Provisional
Probab=25.11 E-value=74 Score=20.14 Aligned_cols=17 Identities=6% Similarity=0.057 Sum_probs=14.1
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
+-||.+|+..|.+||+.
T Consensus 41 ~elT~dEl~lv~~ylkR 57 (160)
T PRK11032 41 GELTRDEVDLITRAVRR 57 (160)
T ss_pred HhcCHHHHHHHHHHHHH
Confidence 34589999999999974
No 156
>PF09383 NIL: NIL domain; InterPro: IPR018449 This domain is found at the C terminus of ABC transporter proteins involved in D-methionine transport as well as a number of ferredoxin-like proteins. This domain is likely to act as a substrate binding domain. The domain has been named after a conserved sequence in some members of the family. ; PDB: 2QRR_A 3CED_A 2QSW_A 3TUZ_D 3TUJ_D 3DHX_B 3TUI_H 3DHW_D.
Probab=25.00 E-value=81 Score=16.67 Aligned_cols=15 Identities=33% Similarity=0.532 Sum_probs=12.3
Q ss_pred CHHHHHHHHHHHhhc
Q psy14949 87 KEKDREDLIAYLSTL 101 (102)
Q Consensus 87 s~~e~~~l~~yl~sl 101 (102)
+++++.+.++||++.
T Consensus 55 ~~~~~~~a~~~L~~~ 69 (76)
T PF09383_consen 55 DDEEIEKAIAYLREQ 69 (76)
T ss_dssp -HHHHHHHHHHHHHT
T ss_pred CHHHHHHHHHHHHHC
Confidence 578899999999874
No 157
>PRK11582 flagella biosynthesis protein FliZ; Provisional
Probab=24.62 E-value=22 Score=22.48 Aligned_cols=9 Identities=33% Similarity=0.929 Sum_probs=7.1
Q ss_pred HhcccccCC
Q psy14949 12 RACAQCHTA 20 (102)
Q Consensus 12 ~~C~~CH~~ 20 (102)
.+|+.||..
T Consensus 23 thCshC~K~ 31 (169)
T PRK11582 23 THCAHCRKL 31 (169)
T ss_pred cchhhhccc
Confidence 379999964
No 158
>PF13709 DUF4159: Domain of unknown function (DUF4159)
Probab=24.57 E-value=73 Score=20.82 Aligned_cols=16 Identities=19% Similarity=0.324 Sum_probs=13.9
Q ss_pred CCCHHHHHHHHHHHhh
Q psy14949 85 LRKEKDREDLIAYLST 100 (102)
Q Consensus 85 ~ls~~e~~~l~~yl~s 100 (102)
.+|++|++.|-+||+.
T Consensus 66 ~~s~~e~~~Lr~Yl~~ 81 (207)
T PF13709_consen 66 PLSDEEIANLRRYLEN 81 (207)
T ss_pred CCCHHHHHHHHHHHHc
Confidence 4689999999999974
No 159
>COG0633 Fdx Ferredoxin [Energy production and conversion]
Probab=24.25 E-value=27 Score=20.06 Aligned_cols=8 Identities=38% Similarity=1.086 Sum_probs=6.5
Q ss_pred hcccccCC
Q psy14949 13 ACAQCHTA 20 (102)
Q Consensus 13 ~C~~CH~~ 20 (102)
.|+.||..
T Consensus 44 ~C~TC~v~ 51 (102)
T COG0633 44 ACGTCRVK 51 (102)
T ss_pred ccCccEEE
Confidence 59999963
No 160
>PF15332 LIME1: Lck-interacting transmembrane adapter 1
Probab=24.04 E-value=29 Score=22.86 Aligned_cols=8 Identities=38% Similarity=1.319 Sum_probs=6.7
Q ss_pred cccccCCC
Q psy14949 14 CAQCHTAN 21 (102)
Q Consensus 14 C~~CH~~~ 21 (102)
|..||..+
T Consensus 2 CtaC~R~d 9 (228)
T PF15332_consen 2 CTACHRPD 9 (228)
T ss_pred cccccCch
Confidence 99999865
No 161
>PTZ00134 40S ribosomal protein S18; Provisional
Probab=23.60 E-value=87 Score=19.70 Aligned_cols=17 Identities=12% Similarity=0.333 Sum_probs=14.4
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
+.||++|+..|..+|..
T Consensus 59 ~~Lt~~qi~~l~~~i~~ 75 (154)
T PTZ00134 59 GELTAEEIEKIVEIIAN 75 (154)
T ss_pred ccCCHHHHHHHHHHHhc
Confidence 45699999999999974
No 162
>TIGR00778 ahpD_dom alkylhydroperoxidase AhpD family core domain. Members of the family include the alkylhydroperoxidase AhpD of Mycobacterium tuberculosis, a macrophage infectivity potentiator peptide of Legionella pneumophila, and an uncharacterized peptide in the tetrachloroethene reductive dehalogenase operon of Dehalospirillum multivorans. We suggest that many peptides containing this domain may have alkylhydroperoxidase or related antioxidant activity.
Probab=23.52 E-value=82 Score=14.96 Aligned_cols=11 Identities=9% Similarity=0.383 Sum_probs=9.1
Q ss_pred CHHHHHHHHHH
Q psy14949 87 KEKDREDLIAY 97 (102)
Q Consensus 87 s~~e~~~l~~y 97 (102)
+++|+.+++.|
T Consensus 38 ~~~ei~~v~~~ 48 (50)
T TIGR00778 38 TAEELAEALAW 48 (50)
T ss_pred CHHHHHHHHHh
Confidence 78889888876
No 163
>PF14053 DUF4248: Domain of unknown function (DUF4248)
Probab=23.23 E-value=79 Score=16.94 Aligned_cols=18 Identities=11% Similarity=0.062 Sum_probs=14.6
Q ss_pred CCCCCCCHHHHHHHHHHH
Q psy14949 81 VFPGLRKEKDREDLIAYL 98 (102)
Q Consensus 81 ~~~~~ls~~e~~~l~~yl 98 (102)
+....+|+.|+.-|+.||
T Consensus 49 ~~~r~~TP~QV~lIv~~L 66 (69)
T PF14053_consen 49 PRQRSFTPRQVRLIVRYL 66 (69)
T ss_pred CCCEecCHHHHHHHHHHc
Confidence 344567999999999997
No 164
>TIGR00105 L31 ribosomal protein L31. This family consists exclusively of bacterial (and organellar) 50S ribosomal protein L31. In some species, such as Bacillus subtilis, this protein exists in two forms (RpmE and YtiA), one of which (RpmE) contains a pair of motifs, CXC and CXXC, for binding zinc.
Probab=23.07 E-value=32 Score=18.39 Aligned_cols=8 Identities=38% Similarity=1.273 Sum_probs=6.2
Q ss_pred HhcccccC
Q psy14949 12 RACAQCHT 19 (102)
Q Consensus 12 ~~C~~CH~ 19 (102)
..|+.||-
T Consensus 36 di~s~~HP 43 (68)
T TIGR00105 36 DICSKCHP 43 (68)
T ss_pred EECCCCcc
Confidence 35999994
No 165
>PF15161 Neuropep_like: Neuropeptide-like
Probab=22.98 E-value=27 Score=18.18 Aligned_cols=16 Identities=31% Similarity=0.744 Sum_probs=9.7
Q ss_pred hHHHHHHH--hcccccCC
Q psy14949 5 NGKKIFTR--ACAQCHTA 20 (102)
Q Consensus 5 ~G~~lf~~--~C~~CH~~ 20 (102)
.|..+-.+ -|..||..
T Consensus 5 ~gs~iPaesRPCVDCHAF 22 (65)
T PF15161_consen 5 SGSVIPAESRPCVDCHAF 22 (65)
T ss_pred CCCcccCCCCCchhhHHH
Confidence 34444443 49999963
No 166
>PRK13696 hypothetical protein; Provisional
Probab=22.98 E-value=25 Score=18.64 Aligned_cols=19 Identities=26% Similarity=0.300 Sum_probs=14.0
Q ss_pred CCccCCCCCCCHHHHHHHH
Q psy14949 77 GTKMVFPGLRKEKDREDLI 95 (102)
Q Consensus 77 ~~~m~~~~~ls~~e~~~l~ 95 (102)
...|-+.+.++++|...+.
T Consensus 41 ~~l~~~~Gil~dee~~e~~ 59 (62)
T PRK13696 41 DKLMKAFGILSEEEAEELK 59 (62)
T ss_pred HHHHHHHCCCCHHHHHHHH
Confidence 3456677788999988775
No 167
>PF04270 Strep_his_triad: Streptococcal histidine triad protein ; InterPro: IPR006270 This entry represents a repeated sequence region that includes a His-X-X-His-X-His (histidine triad) motif, which is found in family of Streptococcal proteins. Members of the family are suggested to cleave human complement component 3, and family member PhpA has been shown in vaccine studies to be a protective antigen in mice []. ; PDB: 2CS7_C.
Probab=22.92 E-value=1e+02 Score=15.72 Aligned_cols=17 Identities=18% Similarity=0.137 Sum_probs=12.4
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
..||+.|+++--+||+.
T Consensus 36 ~dLs~~E~~aA~~~~~~ 52 (53)
T PF04270_consen 36 SDLSASELKAAQAYLAG 52 (53)
T ss_dssp GGS-HHHHHHHHHHHH-
T ss_pred hhCCHHHHHHHHHHHhc
Confidence 34589999999999874
No 168
>PRK00528 rpmE 50S ribosomal protein L31; Reviewed
Probab=22.82 E-value=33 Score=18.55 Aligned_cols=8 Identities=13% Similarity=0.007 Sum_probs=6.4
Q ss_pred HhcccccC
Q psy14949 12 RACAQCHT 19 (102)
Q Consensus 12 ~~C~~CH~ 19 (102)
..|..||-
T Consensus 38 dv~s~~HP 45 (71)
T PRK00528 38 DIDSGNHP 45 (71)
T ss_pred EECCCCCc
Confidence 45999994
No 169
>PF13447 Multi-haem_cyto: Seven times multi-haem cytochrome CxxCH; PDB: 1FGJ_B.
Probab=22.76 E-value=46 Score=22.81 Aligned_cols=12 Identities=33% Similarity=1.032 Sum_probs=8.1
Q ss_pred HHHHHhcccccC
Q psy14949 8 KIFTRACAQCHT 19 (102)
Q Consensus 8 ~lf~~~C~~CH~ 19 (102)
.-....|..||+
T Consensus 210 ~~m~~vC~~CHS 221 (267)
T PF13447_consen 210 DKMKKVCSQCHS 221 (267)
T ss_dssp HHHHHHHTTTS-
T ss_pred HHHHHhhhccCC
Confidence 344557999996
No 170
>PF01197 Ribosomal_L31: Ribosomal protein L31; InterPro: IPR002150 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein L31 is one of the proteins from the large ribosomal subunit. L31 is a protein of 66 to 97 amino-acid residues which has only been found so far in bacteria and in some plant and algal chloroplasts.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3D5D_4 3PYO_1 3D5B_4 3PYV_1 3PYT_1 3MRZ_1 3MS1_1 3PYR_1 3F1F_4 3F1H_4 ....
Probab=22.49 E-value=34 Score=18.33 Aligned_cols=9 Identities=33% Similarity=1.017 Sum_probs=6.7
Q ss_pred HhcccccCC
Q psy14949 12 RACAQCHTA 20 (102)
Q Consensus 12 ~~C~~CH~~ 20 (102)
..|+.||-.
T Consensus 37 di~s~~HPf 45 (69)
T PF01197_consen 37 DICSNCHPF 45 (69)
T ss_dssp CSCSSSSCT
T ss_pred eecCCCCEE
Confidence 359999943
No 171
>PRK04053 rps13p 30S ribosomal protein S13P; Reviewed
Probab=22.40 E-value=1e+02 Score=19.27 Aligned_cols=17 Identities=18% Similarity=0.294 Sum_probs=14.1
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
+.||++|+..|..||.+
T Consensus 54 ~~Lt~~qi~~l~~~i~~ 70 (149)
T PRK04053 54 GYLSDEEIEKIEEALED 70 (149)
T ss_pred CcCCHHHHHHHHHHHHh
Confidence 35699999999999963
No 172
>PRK01678 rpmE2 50S ribosomal protein L31 type B; Reviewed
Probab=22.29 E-value=34 Score=19.37 Aligned_cols=8 Identities=13% Similarity=0.285 Sum_probs=6.4
Q ss_pred HhcccccC
Q psy14949 12 RACAQCHT 19 (102)
Q Consensus 12 ~~C~~CH~ 19 (102)
..|+.||-
T Consensus 49 di~S~~HP 56 (87)
T PRK01678 49 EISSASHP 56 (87)
T ss_pred EeCCCCCC
Confidence 46999994
No 173
>PF11116 DUF2624: Protein of unknown function (DUF2624); InterPro: IPR020277 This entry contains proteins with no known function.
Probab=22.22 E-value=1.3e+02 Score=16.92 Aligned_cols=15 Identities=20% Similarity=0.237 Sum_probs=11.9
Q ss_pred CCHHHHHHHHHHHhh
Q psy14949 86 RKEKDREDLIAYLST 100 (102)
Q Consensus 86 ls~~e~~~l~~yl~s 100 (102)
+|+++.+.|+.+|+.
T Consensus 31 it~~QA~~I~~~lr~ 45 (85)
T PF11116_consen 31 ITKKQAEQIANILRG 45 (85)
T ss_pred CCHHHHHHHHHHHhc
Confidence 478888888888875
No 174
>PF09286 Pro-kuma_activ: Pro-kumamolisin, activation domain ; InterPro: IPR015366 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found at the N terminus of peptidases belonging to MEROPS peptidase family S53 (sedolisin, clan SB). The domain adopts a ferredoxin-like fold, with an alpha+beta sandwich. Cleavage of the domain results in activation of the peptidase []. ; GO: 0008236 serine-type peptidase activity; PDB: 1T1E_A 3EDY_A 3EE6_A.
Probab=21.83 E-value=87 Score=18.82 Aligned_cols=14 Identities=14% Similarity=0.328 Sum_probs=11.6
Q ss_pred CHHHHHHHHHHHhh
Q psy14949 87 KEKDREDLIAYLST 100 (102)
Q Consensus 87 s~~e~~~l~~yl~s 100 (102)
+++++.+|.+||++
T Consensus 60 ~~~~v~~V~~wL~~ 73 (143)
T PF09286_consen 60 SPEDVAAVKSWLKS 73 (143)
T ss_dssp -HHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHH
Confidence 58889999999986
No 175
>cd07347 harmonin_N_like N-terminal protein-binding module of harmonin and similar domains. This domain is found in harmonin, and similar proteins such as delphilin, and whirlin. These are postsynaptic density-95/discs-large/ZO-1 (PDZ) domain-containing scaffold proteins. Harmonin and whirlin are organizers of the Usher protein network of the inner ear and the retina, delphilin is found at the cerebellar parallel fiber-Purkinje cell synapses. This harmonin_N_like domain is found in either one or two copies. Harmonin contains a single copy, which is found at its N-terminus and binds specifically to a short internal peptide fragment of the cadherin 23 cytoplasmic domain; cadherin 23 is a component of the Usher protein network. Whirlin contains two copies of the harmonin_N_like domain; the first of these has been assayed for interaction with the cytoplasmic domain of cadherin 23 and no interaction could be detected.
Probab=21.44 E-value=98 Score=17.11 Aligned_cols=17 Identities=24% Similarity=0.337 Sum_probs=13.9
Q ss_pred CCCCHHHHHHHHHHHhh
Q psy14949 84 GLRKEKDREDLIAYLST 100 (102)
Q Consensus 84 ~~ls~~e~~~l~~yl~s 100 (102)
.+|++.|...+..||+.
T Consensus 14 ~LL~~~Er~~~~~~L~~ 30 (78)
T cd07347 14 HLLTDAEREQVTRALER 30 (78)
T ss_pred HHCCHHHHHHHHHHHHH
Confidence 35689999999998875
No 176
>TIGR03507 decahem_SO1788 decaheme c-type cytochrome, OmcA/MtrC family. The protein SO_1778 (MtrC) of Shewanella oneidensis MR-1, and its paralog SO_1779 (OmcA), with which it intteracts, are large decaheme proteins, about 900 amino acids in length, involved in the use of manganese [Mn(III/IV)] and iron [Fe(III)] as terminal electron acceptors. This model represents these and similar decaheme proteins, found also in Rhodoferax ferrireducens DSM 15236, Aeromonas hydrophila ATCC7966, and a few other bacterial species.
Probab=21.37 E-value=37 Score=26.40 Aligned_cols=10 Identities=50% Similarity=1.135 Sum_probs=7.8
Q ss_pred HhcccccCCC
Q psy14949 12 RACAQCHTAN 21 (102)
Q Consensus 12 ~~C~~CH~~~ 21 (102)
..|++||+..
T Consensus 294 ~~C~~CH~~~ 303 (664)
T TIGR03507 294 TQCAACHNAG 303 (664)
T ss_pred CchhhccCCC
Confidence 4699999754
No 177
>PF14317 YcxB: YcxB-like protein
Probab=21.25 E-value=95 Score=15.08 Aligned_cols=15 Identities=20% Similarity=0.468 Sum_probs=11.9
Q ss_pred CCCHHHHHHHHHHHh
Q psy14949 85 LRKEKDREDLIAYLS 99 (102)
Q Consensus 85 ~ls~~e~~~l~~yl~ 99 (102)
.++++|..++.+||+
T Consensus 48 ~f~~~e~~~f~~~lk 62 (62)
T PF14317_consen 48 AFSEEEKEEFREFLK 62 (62)
T ss_pred HCCHhHHHHHHHHhC
Confidence 456889999998875
No 178
>PF06648 DUF1160: Protein of unknown function (DUF1160); InterPro: IPR010594 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf75; it is a family of uncharacterised viral proteins.
Probab=20.70 E-value=66 Score=19.42 Aligned_cols=14 Identities=21% Similarity=0.266 Sum_probs=12.3
Q ss_pred CCHHHHHHHHHHHh
Q psy14949 86 RKEKDREDLIAYLS 99 (102)
Q Consensus 86 ls~~e~~~l~~yl~ 99 (102)
++++|+..|+.||-
T Consensus 99 l~dddi~~ls~FLV 112 (122)
T PF06648_consen 99 LTDDDISYLSEFLV 112 (122)
T ss_pred CCcccHHHHHHHHH
Confidence 58999999999984
No 179
>PF09966 DUF2200: Uncharacterized protein conserved in bacteria (DUF2200); InterPro: IPR014580 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.; PDB: 3C9P_A.
Probab=20.61 E-value=95 Score=18.37 Aligned_cols=13 Identities=15% Similarity=0.555 Sum_probs=11.0
Q ss_pred CHHHHHHHHHHHh
Q psy14949 87 KEKDREDLIAYLS 99 (102)
Q Consensus 87 s~~e~~~l~~yl~ 99 (102)
|.+|+..|+.||-
T Consensus 24 t~~Evd~vi~WLT 36 (111)
T PF09966_consen 24 TKEEVDQVIRWLT 36 (111)
T ss_dssp -HHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHh
Confidence 7899999999984
No 180
>PF15614 WHIM3: WSTF, HB1, Itc1p, MBD9 motif 3
Probab=20.24 E-value=1.2e+02 Score=15.05 Aligned_cols=12 Identities=17% Similarity=0.426 Sum_probs=10.4
Q ss_pred CHHHHHHHHHHH
Q psy14949 87 KEKDREDLIAYL 98 (102)
Q Consensus 87 s~~e~~~l~~yl 98 (102)
+++|+.+|+.+|
T Consensus 6 ~~e~ld~L~~aL 17 (46)
T PF15614_consen 6 DPEELDELLKAL 17 (46)
T ss_pred CHHHHHHHHHHH
Confidence 688999999888
No 181
>PF07746 LigA: Aromatic-ring-opening dioxygenase LigAB, LigA subunit; InterPro: IPR011986 Dioxygenases catalyse the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms. Cleavage of aromatic rings is one of the most important functions of dioxygenases, which play key roles in the degradation of aromatic compounds. The substrates of ring-cleavage dioxygenases can be classified into two groups according to the mode of scission of the aromatic ring. Intradiol enzymes (IPR000627 from INTERPRO) use a non-haem Fe(III) to cleave the aromatic ring between two hydroxyl groups (ortho-cleavage), whereas extradiol enzymes use a non-haem Fe(II) to cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon (meta-cleavage) [, ]. These two subfamilies differ in sequence, structural fold, iron ligands, and the orientation of second sphere active site amino acid residues. Extradiol dioxygenases are usually homo-multimeric, bind one atom of ferrous ion per subunit and have a subunit size of about 33 kDa. Extradiol dioxygenases can be divided into three classes. Class I and II enzymes (IPR000486 from INTERPRO) show sequence similarity, with the two-domain class II enzymes having evolved from a class I enzyme through gene duplication. Class III enzymes are different in sequence and structure, but they do share several common active-site characteristics with the class II enzymes, in particular the coordination sphere and the disposition of the putative catalytic base are very similar. Class III enzymes usually have two subunits, designated A (IPR004183 from INTERPRO) and B (IPR004183 from INTERPRO). LigAB is a protocatechuate 4,5-dioxygenase (1.13.11.8 from EC) that belongs to the extradiol class III enzyme family. The LigA subunit of this enzyme is multi-helical, containing a compact array of 6 short helices [].; PDB: 1BOU_A 1B4U_A.
Probab=20.17 E-value=31 Score=19.50 Aligned_cols=31 Identities=19% Similarity=0.500 Sum_probs=18.8
Q ss_pred cHHHHHHHHhCCCCCCCCCccCCCCCCCHHHHHHHHH
Q psy14949 60 SASTLDIYLQNPKKYIPGTKMVFPGLRKEKDREDLIA 96 (102)
Q Consensus 60 ~~~~l~~~~~~~~~~~~~~~m~~~~~ls~~e~~~l~~ 96 (102)
+.+...+|+.+|.++.. . .+ ||++|+.+|.+
T Consensus 10 ~~~~r~~F~~D~~a~~~----~-~~-Lt~eer~av~~ 40 (88)
T PF07746_consen 10 DPENRERFLADPEAYLD----E-YG-LTEEERQAVLD 40 (88)
T ss_dssp SHHHHHHHHH-HHHHHH----C-CT---HHHHHHHHC
T ss_pred CHHHHHHHHHCHHHHHH----H-cC-CCHHHHHHHHc
Confidence 46677888888775443 1 23 48999988753
No 182
>COG5204 SPT4 Transcription elongation factor SPT4 [Transcription]
Probab=20.12 E-value=1.5e+02 Score=17.25 Aligned_cols=36 Identities=22% Similarity=0.048 Sum_probs=19.0
Q ss_pred cHHHHHHHHhCCCCCCCCCccCCCCCCCHHHHHHHH
Q psy14949 60 SASTLDIYLQNPKKYIPGTKMVFPGLRKEKDREDLI 95 (102)
Q Consensus 60 ~~~~l~~~~~~~~~~~~~~~m~~~~~ls~~e~~~l~ 95 (102)
+..++.+|.+=......-..|...+.|+++=+++|-
T Consensus 62 t~SWVakWqrid~f~~GlYA~~V~G~L~edvve~L~ 97 (112)
T COG5204 62 TNSWVAKWQRIDEFRKGLYAMVVEGALSEDVVEDLE 97 (112)
T ss_pred cHHHHHHHhhhcccccceeEEEEcccCCHHHHHHHH
Confidence 456677776533222222345556777766555553
No 183
>PF02591 DUF164: Putative zinc ribbon domain; InterPro: IPR003743 This entry describes proteins of unknown function.
Probab=20.11 E-value=39 Score=16.99 Aligned_cols=7 Identities=43% Similarity=1.383 Sum_probs=5.8
Q ss_pred hcccccC
Q psy14949 13 ACAQCHT 19 (102)
Q Consensus 13 ~C~~CH~ 19 (102)
.|.+||-
T Consensus 24 ~C~gC~~ 30 (56)
T PF02591_consen 24 TCSGCHM 30 (56)
T ss_pred ccCCCCE
Confidence 5999994
Done!