Query psy4018
Match_columns 136
No_of_seqs 102 out of 414
Neff 5.4
Searched_HMMs 46136
Date Fri Aug 16 23:17:43 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4018.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4018hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03392 OS-D: Insect pheromon 100.0 1.7E-47 3.6E-52 272.1 9.2 95 27-121 1-95 (95)
2 PF04030 ALO: D-arabinono-1,4- 89.4 0.41 8.9E-06 38.3 3.3 28 96-123 228-257 (259)
3 PF06530 Phage_antitermQ: Phag 80.4 2.6 5.6E-05 30.9 3.7 65 46-111 8-75 (125)
4 TIGR01676 GLDHase galactonolac 76.9 2.8 6.1E-05 38.2 3.5 40 86-125 499-539 (541)
5 TIGR01679 bact_FAD_ox FAD-link 76.6 2.2 4.8E-05 37.0 2.7 32 94-125 384-417 (419)
6 PHA02939 hypothetical protein; 75.6 26 0.00057 26.4 7.8 80 45-127 40-130 (144)
7 PF04818 CTD_bind: RNA polymer 75.3 8.6 0.00019 24.4 4.6 38 61-98 22-59 (64)
8 PLN02465 L-galactono-1,4-lacto 74.6 2 4.3E-05 39.4 1.9 45 82-126 525-570 (573)
9 TIGR01677 pln_FAD_oxido plant- 73.8 2.4 5.1E-05 38.6 2.2 33 95-127 476-510 (557)
10 PF01805 Surp: Surp module; I 73.7 16 0.00034 22.8 5.5 27 85-111 1-27 (55)
11 PF08031 BBE: Berberine and be 71.0 2.5 5.3E-05 25.9 1.2 16 103-118 26-41 (47)
12 PLN00107 FAD-dependent oxidore 67.7 3.8 8.3E-05 34.1 2.0 36 93-128 168-205 (257)
13 TIGR02765 crypto_DASH cryptoch 67.5 2.5 5.5E-05 36.3 1.0 16 107-122 411-426 (429)
14 TIGR02766 crypt_chrom_pln cryp 66.4 2.7 5.8E-05 36.8 0.9 20 108-127 409-428 (475)
15 TIGR03556 photolyase_8HDF deox 63.3 3.8 8.3E-05 36.1 1.3 23 106-128 408-430 (471)
16 PRK10674 deoxyribodipyrimidine 63.0 3.6 7.7E-05 36.3 1.1 22 107-128 405-426 (472)
17 PF11919 DUF3437: Domain of un 61.6 3.3 7.1E-05 29.1 0.5 46 69-114 27-74 (90)
18 PF04245 NA37: 37-kD nucleoid- 59.9 51 0.0011 26.6 7.3 86 48-134 174-264 (325)
19 PF08910 Aida_N: Aida N-termin 57.3 24 0.00052 25.8 4.4 36 62-97 27-63 (106)
20 smart00582 RPR domain present 55.5 29 0.00062 24.3 4.5 37 62-98 71-107 (121)
21 PF14658 EF-hand_9: EF-hand do 55.4 25 0.00053 23.5 3.9 37 89-125 20-62 (66)
22 PF13956 Ibs_toxin: Toxin Ibs, 53.2 6.1 0.00013 20.3 0.5 11 5-15 1-11 (19)
23 PF03441 FAD_binding_7: FAD bi 52.1 6.2 0.00013 32.3 0.7 21 107-127 207-227 (277)
24 cd08806 CARD_CARD14_CARMA2 Cas 41.9 65 0.0014 22.7 4.4 56 30-107 30-85 (86)
25 PF07172 GRP: Glycine rich pro 40.9 15 0.00034 25.9 1.2 9 4-12 1-9 (95)
26 KOG0133|consensus 40.7 21 0.00045 32.7 2.2 27 105-131 422-448 (531)
27 PF12066 DUF3546: Domain of un 39.6 23 0.00049 25.4 1.9 25 89-115 31-55 (110)
28 PRK15370 E3 ubiquitin-protein 38.5 44 0.00094 31.7 4.1 37 94-130 671-710 (754)
29 COG0415 PhrB Deoxyribodipyrimi 38.4 16 0.00034 32.9 1.1 21 108-128 399-419 (461)
30 COG3636 Predicted transcriptio 37.1 31 0.00068 25.0 2.3 31 28-58 5-35 (100)
31 PF10731 Anophelin: Thrombin i 36.5 27 0.00059 23.2 1.8 8 4-11 1-8 (65)
32 PF14853 Fis1_TPR_C: Fis1 C-te 35.5 52 0.0011 20.7 2.9 35 42-76 16-50 (53)
33 PF00996 GDI: GDP dissociation 35.1 21 0.00045 31.7 1.4 34 72-107 125-161 (438)
34 KOG3481|consensus 34.4 49 0.0011 23.3 2.9 23 54-76 37-59 (87)
35 TIGR01678 FAD_lactone_ox sugar 32.7 27 0.00058 30.7 1.7 24 95-118 411-436 (438)
36 PF02950 Conotoxin: Conotoxin; 32.6 15 0.00032 23.9 0.0 10 50-59 48-57 (75)
37 PF03564 DUF1759: Protein of u 32.5 79 0.0017 22.6 3.9 39 89-127 42-83 (145)
38 PF14194 Cys_rich_VLP: Cystein 31.4 48 0.001 21.6 2.2 20 82-101 1-20 (56)
39 KOG0210|consensus 30.6 38 0.00083 32.8 2.3 23 75-97 758-780 (1051)
40 PF01726 LexA_DNA_bind: LexA D 30.4 1.6E+02 0.0034 19.1 6.2 48 82-129 5-54 (65)
41 TIGR00952 S15_bact ribosomal p 29.6 79 0.0017 22.0 3.3 24 89-112 60-83 (86)
42 PF15335 CAAP1: Caspase activi 29.0 1.2E+02 0.0026 20.2 3.9 35 35-76 4-38 (64)
43 PF12399 BCA_ABC_TP_C: Branche 28.9 50 0.0011 17.6 1.7 15 36-50 8-22 (23)
44 PF06576 DUF1133: Protein of u 28.7 1.3E+02 0.0029 23.8 4.7 52 77-128 92-143 (176)
45 PRK05626 rpsO 30S ribosomal pr 28.6 84 0.0018 22.0 3.3 23 90-112 64-86 (89)
46 PRK15387 E3 ubiquitin-protein 28.1 57 0.0012 31.2 3.1 37 94-130 705-743 (788)
47 CHL00027 rps15 ribosomal prote 28.0 88 0.0019 22.1 3.3 24 89-112 60-83 (90)
48 TIGR03826 YvyF flagellar opero 27.7 1.3E+02 0.0029 22.6 4.5 61 58-122 5-67 (137)
49 PF04959 ARS2: Arsenite-resist 27.1 41 0.0009 27.2 1.7 41 78-123 80-120 (214)
50 smart00648 SWAP Suppressor-of- 27.1 1.5E+02 0.0034 18.0 4.5 25 87-111 2-26 (54)
51 PF11417 Inhibitor_G39P: Loade 27.1 71 0.0015 21.4 2.6 41 86-127 3-58 (71)
52 PF02771 Acyl-CoA_dh_N: Acyl-C 26.9 1.2E+02 0.0027 20.1 3.9 44 82-129 1-44 (113)
53 PF03790 KNOX1: KNOX1 domain ; 26.9 41 0.00089 20.9 1.3 16 43-58 14-29 (45)
54 PLN02150 terpene synthase/cycl 26.8 1.3E+02 0.0028 21.0 4.1 50 46-95 6-60 (96)
55 PF14060 DUF4252: Domain of un 26.4 78 0.0017 23.0 3.0 47 62-112 60-106 (155)
56 PLN03079 Uncharacterized prote 26.2 75 0.0016 22.6 2.7 33 44-76 20-64 (91)
57 PHA00447 lysozyme 25.8 57 0.0012 24.3 2.2 21 82-102 95-115 (142)
58 PF11090 DUF2833: Protein of u 25.5 39 0.00084 23.8 1.1 30 77-106 29-62 (86)
59 cd08807 CARD_CARD10_CARMA3 Cas 25.4 1.2E+02 0.0026 21.4 3.5 56 30-107 30-85 (86)
60 PRK09706 transcriptional repre 25.3 1.1E+02 0.0024 22.0 3.6 33 79-111 97-129 (135)
61 PRK08557 hypothetical protein; 25.3 68 0.0015 28.3 2.9 32 76-111 349-383 (417)
62 KOG3355|consensus 24.9 2.2E+02 0.0047 22.6 5.3 69 56-124 63-140 (177)
63 PF05927 Penaeidin: Penaeidin; 24.9 27 0.00059 23.8 0.3 26 6-31 1-26 (73)
64 PRK10410 hypothetical protein; 24.7 70 0.0015 22.9 2.4 21 75-95 56-81 (100)
65 PF10589 NADH_4Fe-4S: NADH-ubi 24.7 44 0.00095 20.3 1.1 12 73-84 11-22 (46)
66 PF02364 Glucan_synthase: 1,3- 24.5 57 0.0012 31.5 2.4 28 89-116 63-90 (817)
67 PLN03190 aminophospholipid tra 24.4 83 0.0018 31.4 3.5 27 68-94 837-867 (1178)
68 PF05823 Gp-FAR-1: Nematode fa 23.9 39 0.00084 25.7 1.0 47 64-110 4-63 (154)
69 PF00312 Ribosomal_S15: Riboso 23.0 1.2E+02 0.0026 20.5 3.2 24 88-111 57-80 (83)
70 PF15197 Leukemia_assc_2: Leuk 22.8 94 0.002 19.8 2.4 32 79-111 6-37 (53)
71 PF10441 Urb2: Urb2/Npa2 famil 22.8 1.6E+02 0.0036 23.0 4.4 67 44-117 155-221 (223)
72 cd05030 calgranulins Calgranul 22.0 1.5E+02 0.0033 19.8 3.5 25 101-125 50-74 (88)
73 PF06286 Coleoptericin: Coleop 21.7 26 0.00056 26.6 -0.4 10 5-14 1-10 (143)
74 PRK09993 C-lysozyme inhibitor; 21.6 1.9E+02 0.0041 22.4 4.3 20 34-53 28-47 (153)
75 KOG4730|consensus 21.1 1.4E+02 0.0029 27.4 3.9 36 89-126 477-512 (518)
76 cd08809 CARD_CARD9 Caspase act 21.0 2.3E+02 0.0049 20.0 4.3 50 30-107 30-85 (86)
77 PF15393 DUF4615: Domain of un 20.6 1.8E+02 0.0039 21.7 3.9 48 80-127 22-69 (124)
78 PRK12423 LexA repressor; Provi 20.1 3.2E+02 0.007 21.1 5.5 31 82-112 5-36 (202)
79 PF03489 SapB_2: Saposin-like 20.1 34 0.00074 19.0 -0.0 22 94-115 6-27 (35)
No 1
>PF03392 OS-D: Insect pheromone-binding family, A10/OS-D; InterPro: IPR005055 A class of small (14-20 Kd) water-soluble proteins, called odorant binding proteins (OBPs), first discovered in the insect sensillar lymph but also in the mucus of vertebrates, is postulated to mediate the solubilisation of hydrophobic odorant molecules, and thereby to facilitate their transport to the receptor neurons. The product of a gene expressed in the olfactory system of Drosophila melanogaster (Fruit fly), OS-D, shares features common to vertebrate odorant-binding proteins, but has a primary structure unlike odorant-binding proteins []. OS-D derivatives have subsequently been found in chemosensory organs of phylogenetically distinct insects, including cockroaches, phasmids and moths, suggesting that OS-D-like proteins seem to be conserved in the insect phylum.; PDB: 1KX9_A 1N8U_A 1KX8_A 1K19_A 1N8V_A 2GVS_A 2JNT_A.
Probab=100.00 E-value=1.7e-47 Score=272.08 Aligned_cols=95 Identities=57% Similarity=1.158 Sum_probs=90.4
Q ss_pred cCCcCccCCHHHHhcCHHHHHHHhhhhcCCCCCChhHHHHHhhhHHHHHhcCccCCHHHHhhHHHHHHHHHhcChHHHHH
Q psy4018 27 YTTKYDNVDLDQILKNDRLFNNYYNCLLDKGKCSPDGQELKNKLPDAIATECKSCSEKQKEGSKKIFKYLIDNKPEQWAE 106 (136)
Q Consensus 27 Y~~~~D~iDid~iL~N~rl~~~y~~ClLdkgpCt~~gk~lK~~lpeal~t~C~KCT~kQK~~~~kvi~~l~~~~P~~w~~ 106 (136)
|+++||||||++||+|+|++++|++||||+|||||+|++||++|||||+|+|+||||+||+++++|++||++++|++|++
T Consensus 1 Y~~~yD~iDvd~iL~N~rl~~~y~~ClldkgpCt~~~~~lK~~ipeal~t~C~KCt~kQK~~~~kv~~~l~~~~P~~w~~ 80 (95)
T PF03392_consen 1 YTTKYDNIDVDEILKNDRLLKSYIDCLLDKGPCTPEGKELKKVIPEALKTNCAKCTPKQKENARKVIKFLKKNYPDEWEE 80 (95)
T ss_dssp -HCTTTSSCHHHHHH-HHHHHHHHHHHTSSSTSHHHHHHHHHHHHHHHHCTTTTS-HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCccccccCHHHHHcCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHccccCCCHHHHHHHHHHHHHHHHcCHHHHHH
Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhhCCChhhHHHH
Q psy4018 107 LEKKFDPNGTYKAKY 121 (136)
Q Consensus 107 l~~KyDP~~~y~~~~ 121 (136)
|++||||+|+|+++|
T Consensus 81 l~~KyDp~~~y~~ky 95 (95)
T PF03392_consen 81 LVKKYDPEGKYRKKY 95 (95)
T ss_dssp HHHHHTTT-TTHHHH
T ss_pred HHHHHCCCcchhhcC
Confidence 999999999999987
No 2
>PF04030 ALO: D-arabinono-1,4-lactone oxidase ; InterPro: IPR007173 This domain is specific to D-arabinono-1,4-lactone oxidase 1.1.3.37 from EC, which is involved in the final step of the D-erythroascorbic acid biosynthesis pathway [].; GO: 0003885 D-arabinono-1,4-lactone oxidase activity, 0055114 oxidation-reduction process, 0016020 membrane; PDB: 2VFU_A 2VFV_A 2VFT_A 2VFS_A 2VFR_A.
Probab=89.36 E-value=0.41 Score=38.30 Aligned_cols=28 Identities=29% Similarity=0.630 Sum_probs=20.5
Q ss_pred HHhcCh--HHHHHHHHhhCCChhhHHHHHH
Q psy4018 96 LIDNKP--EQWAELEKKFDPNGTYKAKYQK 123 (136)
Q Consensus 96 l~~~~P--~~w~~l~~KyDP~~~y~~~~~~ 123 (136)
+.+.|| ++|.++.+++||+|.+...|.+
T Consensus 228 l~~~Yp~~~~F~~~r~~~DP~g~F~n~~~~ 257 (259)
T PF04030_consen 228 LRKLYPRLDDFLAVRKKLDPQGVFLNDYLR 257 (259)
T ss_dssp HHHT-TTHHHHHHHHHHH-TT-TT--HHHH
T ss_pred HHHHCcCHHHHHHHHHHhCCCCCCCCHhhh
Confidence 778888 8999999999999999998864
No 3
>PF06530 Phage_antitermQ: Phage antitermination protein Q; InterPro: IPR010534 This entry is represented by Bacteriophage 933W, GpQ. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several phage antitermination protein Q and related bacterial sequences. Phage 993W gene Q encodes a phage-specific positive regulator of late gene expression, thought, by analogy to the corresponding gene of phage lambda, to be a transcription antiterminator. GpQ positively regulates expression of the phage late gene operons. Bacterial host RNA polymerase modified by antitermination proteins transcribes through termination sites that otherwise prevent expression of the regulated genes [, ].; GO: 0003677 DNA binding, 0060567 negative regulation of transcription termination, DNA-dependent
Probab=80.39 E-value=2.6 Score=30.85 Aligned_cols=65 Identities=18% Similarity=0.238 Sum_probs=50.6
Q ss_pred HHHHhhhhcCC---CCCChhHHHHHhhhHHHHHhcCccCCHHHHhhHHHHHHHHHhcChHHHHHHHHhh
Q psy4018 46 FNNYYNCLLDK---GKCSPDGQELKNKLPDAIATECKSCSEKQKEGSKKIFKYLIDNKPEQWAELEKKF 111 (136)
Q Consensus 46 ~~~y~~ClLdk---gpCt~~gk~lK~~lpeal~t~C~KCT~kQK~~~~kvi~~l~~~~P~~w~~l~~Ky 111 (136)
+..+=....+. -..+|....|+.++|..-.... -||+..=-.+...+..|+++.|+.++-|...|
T Consensus 8 Le~WG~W~~~~~~~~~~s~i~a~f~~~~p~~~~~r~-~csDDdgl~Id~~v~~L~~~~~~~~~ll~~~Y 75 (125)
T PF06530_consen 8 LERWGAWVRSGRSGVDYSPIAAGFKGVLPSSSKSRP-SCSDDDGLLIDRCVARLKKRDPEEYDLLILYY 75 (125)
T ss_pred HHHhhHHHhCCCCCCCcccHHHHHhcCCCCCCCCCc-eechHHHHHHHHHHHHHHccCHHHHHHHHHHH
Confidence 34444444432 2467788888999997554444 49999999999999999999999999999988
No 4
>TIGR01676 GLDHase galactonolactone dehydrogenase. This model represents L-Galactono-gamma-lactone dehydrogenase (EC 1.3.2.3). This enzyme catalyzes the final step in ascorbic acid biosynthesis in higher plants. This protein is homologous to ascorbic acid biosynthesis enzymes of other species: L-gulono-gamma-lactone oxidase in rat and L-galactono-gamma-lactone oxidase in yeast. All three covalently bind the cofactor FAD.
Probab=76.90 E-value=2.8 Score=38.17 Aligned_cols=40 Identities=20% Similarity=0.248 Sum_probs=30.0
Q ss_pred HhhHHHHHHHHHhcCh-HHHHHHHHhhCCChhhHHHHHHHH
Q psy4018 86 KEGSKKIFKYLIDNKP-EQWAELEKKFDPNGTYKAKYQKEL 125 (136)
Q Consensus 86 K~~~~kvi~~l~~~~P-~~w~~l~~KyDP~~~y~~~~~~~l 125 (136)
.+....+-..+.+.|| ++|.++++++||+|.+...|.+.+
T Consensus 499 ~~~l~~~yprl~~~YP~d~F~~~R~~lDP~g~F~N~yL~~l 539 (541)
T TIGR01676 499 KDELAALQARLKKKFPVDASNKARKALDPNKILSNNKLEKL 539 (541)
T ss_pred HHHHHHHHHHHHhhCCHHHHHHHHHHhCCCCccccHHHHHh
Confidence 3444555556777888 567788899999999999997654
No 5
>TIGR01679 bact_FAD_ox FAD-linked oxidoreductase. This model represents a family of bacterial oxidoreductases with covalently linked FAD, closely related to two different eukaryotic oxidases, L-gulonolactone oxidase (EC 1.1.3.8) from rat and D-arabinono-1,4-lactone oxidase (EC 1.1.3.37) from Saccharomyces cerevisiae.
Probab=76.57 E-value=2.2 Score=36.98 Aligned_cols=32 Identities=22% Similarity=0.539 Sum_probs=25.4
Q ss_pred HHHHhcCh--HHHHHHHHhhCCChhhHHHHHHHH
Q psy4018 94 KYLIDNKP--EQWAELEKKFDPNGTYKAKYQKEL 125 (136)
Q Consensus 94 ~~l~~~~P--~~w~~l~~KyDP~~~y~~~~~~~l 125 (136)
.-+.+.|| ++|.+++++|||+|.+...|.+-+
T Consensus 384 ~~l~~~YP~~~~F~~~r~~~DP~g~F~n~~~~rl 417 (419)
T TIGR01679 384 ATLRERYPRWDDFAAVRDDLDPDRRFLNPYTRGL 417 (419)
T ss_pred HHHHHHCcCHHHHHHHHHHhCCCCccCCHHHHHh
Confidence 44566677 578899999999999999886644
No 6
>PHA02939 hypothetical protein; Provisional
Probab=75.63 E-value=26 Score=26.45 Aligned_cols=80 Identities=21% Similarity=0.436 Sum_probs=60.8
Q ss_pred HHHHHhhhhcCCCCCChh---HHHHHhhhHHHH--------HhcCccCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhCC
Q psy4018 45 LFNNYYNCLLDKGKCSPD---GQELKNKLPDAI--------ATECKSCSEKQKEGSKKIFKYLIDNKPEQWAELEKKFDP 113 (136)
Q Consensus 45 l~~~y~~ClLdkgpCt~~---gk~lK~~lpeal--------~t~C~KCT~kQK~~~~kvi~~l~~~~P~~w~~l~~KyDP 113 (136)
.+..|..|- |+| |... .-.+++.+||.+ +-.=+.-|..|++.++.-+.-|.+..|..+++|.. |--
T Consensus 40 if~d~~~~~-de~-c~~~kwE~df~rD~ipe~iW~vRkaV~~l~DP~st~~qrEyae~~l~~lle~~~r~f~~Ll~-~ci 116 (144)
T PHA02939 40 IFSDFARCD-DEG-CPRSKWESDFLRDLIPEEIWYVRKAVKELYDPESTDDQREYAEHRLKGLLELLERNFNQLLE-FCI 116 (144)
T ss_pred HHHHHhhcc-ccc-cchHhhHHHHHHHHhhHHHHHHHHHHHHHhCCccchhHHHHHHhhHHHHHHHHHHHHHHHHH-HHh
Confidence 566677775 555 7654 455788898865 33457789999999999999999999999999988 656
Q ss_pred ChhhHHHHHHHHhh
Q psy4018 114 NGTYKAKYQKELSD 127 (136)
Q Consensus 114 ~~~y~~~~~~~l~~ 127 (136)
+.+-+++|.+.+..
T Consensus 117 ~ekRreKYLkLiEE 130 (144)
T PHA02939 117 DEKRREKYLKLIEE 130 (144)
T ss_pred hHHHHHHHHHHHHH
Confidence 66777777755443
No 7
>PF04818 CTD_bind: RNA polymerase II-binding domain.; InterPro: IPR006903 This entry represents a conserved region found in a number of uncharacterised eukaryotic proteins.; PDB: 2L0I_A 2KM4_A 3D9I_B 3D9N_B 3D9O_A 3D9P_B 3D9K_A 3D9M_A 3D9J_A 3D9L_A ....
Probab=75.33 E-value=8.6 Score=24.41 Aligned_cols=38 Identities=16% Similarity=0.266 Sum_probs=31.7
Q ss_pred hhHHHHHhhhHHHHHhcCccCCHHHHhhHHHHHHHHHh
Q psy4018 61 PDGQELKNKLPDAIATECKSCSEKQKEGSKKIFKYLID 98 (136)
Q Consensus 61 ~~gk~lK~~lpeal~t~C~KCT~kQK~~~~kvi~~l~~ 98 (136)
.-...|...||+++...+..-.++.++.+.++++.-.+
T Consensus 22 ~f~~~F~~~l~~~~~~~~~~~~~~~~~kv~rll~iW~~ 59 (64)
T PF04818_consen 22 EFAPAFSPVLPDAFAHAYKNVDPEVRKKVQRLLNIWEE 59 (64)
T ss_dssp CHHHHHHCCHHHHHHHHCCCS-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhhC
Confidence 35789999999999998887799999999999987444
No 8
>PLN02465 L-galactono-1,4-lactone dehydrogenase
Probab=74.61 E-value=2 Score=39.41 Aligned_cols=45 Identities=22% Similarity=0.352 Sum_probs=33.8
Q ss_pred CHHHHhhHHHHHHHHHhcCh-HHHHHHHHhhCCChhhHHHHHHHHh
Q psy4018 82 SEKQKEGSKKIFKYLIDNKP-EQWAELEKKFDPNGTYKAKYQKELS 126 (136)
Q Consensus 82 T~kQK~~~~kvi~~l~~~~P-~~w~~l~~KyDP~~~y~~~~~~~l~ 126 (136)
+|+-++.....-..|.+.|| ++|.++++.+||+|.+...|.+.+-
T Consensus 525 ~~~~~~~~~~~~~~L~~~YP~d~F~~~R~~lDP~g~f~N~~L~~lf 570 (573)
T PLN02465 525 VPKDKEELEALRERLRKRFPVDAFNKARKELDPKGILSNNLLEKLF 570 (573)
T ss_pred cccccchhhcCHHHHHhhCCHHHHHHHHHHhCCCCccCCHHHHHhh
Confidence 34455566667788899999 3445677899999999999987653
No 9
>TIGR01677 pln_FAD_oxido plant-specific FAD-dependent oxidoreductase. This model represents an uncharacterized plant-specific family of FAD-dependent oxidoreductases. At least seven distinct members are found in Arabidopsis thaliana. The family shows considerable sequence similarity to three different enzymes of ascorbic acid biosynthesis: L-galactono-1,4-lactone dehydrogenase (EC 1.3.2.3) from higher plants, D-arabinono-1,4-lactone oxidase (EC 1.1.3.37 from Saccharomyces cerevisiae, and L-gulonolactone oxidase (EC 1.1.3.8) from mouse, as well as to a bacterial sorbitol oxidase. The class of compound acted on by members of this family is unknown.
Probab=73.75 E-value=2.4 Score=38.64 Aligned_cols=33 Identities=18% Similarity=0.601 Sum_probs=27.0
Q ss_pred HHHhcCh--HHHHHHHHhhCCChhhHHHHHHHHhh
Q psy4018 95 YLIDNKP--EQWAELEKKFDPNGTYKAKYQKELSD 127 (136)
Q Consensus 95 ~l~~~~P--~~w~~l~~KyDP~~~y~~~~~~~l~~ 127 (136)
-+.+.|| ++|.++++++||+|.+...|.+.+-.
T Consensus 476 ~l~~~YP~~~dF~alR~~~DP~g~F~N~yl~~l~~ 510 (557)
T TIGR01677 476 GVIRKYPNADKFLKVKDSYDPKGLFSSEWSDEILG 510 (557)
T ss_pred HHHHhCCCHHHHHHHHHhcCCCCccCCHHHHHHhC
Confidence 4667788 67788899999999999999876644
No 10
>PF01805 Surp: Surp module; InterPro: IPR000061 SWAP is derived from the Suppressor-of-White-APricot splicing regulator from Drosophila melanogaster. The domain is found in regulators responsible for pervasive, nonsex-specific alternative pre-mRNA splicing characteristics and has been found in splicing regulatory proteins []. These ancient, conserved SWAP proteins share a colinearly arrayed series of novel sequence motifs [].; GO: 0003723 RNA binding, 0006396 RNA processing; PDB: 2E5Z_A 2DT7_B 2DT6_A 1UG0_A 1X4P_A 2E60_A 1X4O_A 4DGW_B.
Probab=73.72 E-value=16 Score=22.78 Aligned_cols=27 Identities=22% Similarity=0.244 Sum_probs=25.5
Q ss_pred HHhhHHHHHHHHHhcChHHHHHHHHhh
Q psy4018 85 QKEGSKKIFKYLIDNKPEQWAELEKKF 111 (136)
Q Consensus 85 QK~~~~kvi~~l~~~~P~~w~~l~~Ky 111 (136)
|++.++++..++.++=|+.+..+.++-
T Consensus 1 ~~~~I~~tA~~Va~~G~~fE~~l~~~~ 27 (55)
T PF01805_consen 1 LREIIDKTAEFVAKNGPEFEEKLRERE 27 (55)
T ss_dssp HHHHHHHHHHHHHHCSHHHHHHHHHHT
T ss_pred ChHHHHHHHHHHHhcCHHHHHHHHHhc
Confidence 688999999999999999999999987
No 11
>PF08031 BBE: Berberine and berberine like ; InterPro: IPR012951 This domain is found in the berberine bridge and berberine bridge-like enzymes, which are involved in the biosynthesis of numerous isoquinoline alkaloids. They catalyse the transformation of the N-methyl group of (S)-reticuline into the C-8 berberine bridge carbon of (S)-scoulerine [].; GO: 0016491 oxidoreductase activity, 0050660 flavin adenine dinucleotide binding, 0055114 oxidation-reduction process; PDB: 2IPI_A 2Y3S_A 2Y3R_D 2Y08_B 2Y4G_A 3D2H_A 3FW9_A 3FW8_A 3FWA_A 3D2J_A ....
Probab=70.97 E-value=2.5 Score=25.91 Aligned_cols=16 Identities=19% Similarity=0.789 Sum_probs=10.2
Q ss_pred HHHHHHHhhCCChhhH
Q psy4018 103 QWAELEKKFDPNGTYK 118 (136)
Q Consensus 103 ~w~~l~~KyDP~~~y~ 118 (136)
.-.++.++|||+|-+.
T Consensus 26 rL~~iK~~yDP~n~F~ 41 (47)
T PF08031_consen 26 RLRAIKRKYDPDNVFR 41 (47)
T ss_dssp HHHHHHHHH-TT-TS-
T ss_pred HHHHHHHHhCccceeC
Confidence 3467789999999764
No 12
>PLN00107 FAD-dependent oxidoreductase; Provisional
Probab=67.69 E-value=3.8 Score=34.13 Aligned_cols=36 Identities=17% Similarity=0.504 Sum_probs=28.9
Q ss_pred HHHHHhcCh--HHHHHHHHhhCCChhhHHHHHHHHhhc
Q psy4018 93 FKYLIDNKP--EQWAELEKKFDPNGTYKAKYQKELSDL 128 (136)
Q Consensus 93 i~~l~~~~P--~~w~~l~~KyDP~~~y~~~~~~~l~~~ 128 (136)
...+.+.|| ++|.++.+.+||+|.+...|.+.+-..
T Consensus 168 ~~~l~~lYPr~~dFlavR~~lDP~G~F~N~yl~rllg~ 205 (257)
T PLN00107 168 FDGAIAKYKKAGEFLKVKERLDPEGLFSSEWSDKILGL 205 (257)
T ss_pred HHHHHHHCcCHHHHHHHHHHhCCCCccCCHHHHHHhCc
Confidence 445677777 578899999999999999998766543
No 13
>TIGR02765 crypto_DASH cryptochrome, DASH family. Photolyases and cryptochromes are related flavoproteins. Photolyases harness the energy of blue light to repair DNA damage by removing pyrimidine dimers. Cryptochromes do not repair DNA and are presumed to act instead in some other (possibly unknown) process such as entraining circadian rhythms. This model describes the cryptochrome DASH subfamily, one of at least five major subfamilies, which is found in plants, animals, marine bacteria, etc. Members of this family bind both folate and FAD. They may show weak photolyase activity in vitro but have not been shown to affect DNA repair in vivo. Rather, DASH family cryptochromes have been shown to bind RNA (Vibrio cholerae VC1814), or DNA, and seem likely to act in light-responsive regulatory processes.
Probab=67.54 E-value=2.5 Score=36.33 Aligned_cols=16 Identities=31% Similarity=0.785 Sum_probs=11.9
Q ss_pred HHHhhCCChhhHHHHH
Q psy4018 107 LEKKFDPNGTYKAKYQ 122 (136)
Q Consensus 107 l~~KyDP~~~y~~~~~ 122 (136)
-.+||||+|+|.++|-
T Consensus 411 q~~k~Dp~g~yir~wv 426 (429)
T TIGR02765 411 QAQDYDPDGEYVATWV 426 (429)
T ss_pred HHHhcCCCCCcHHHhc
Confidence 3678888888888773
No 14
>TIGR02766 crypt_chrom_pln cryptochrome, plant family. At least five major families of cryptochomes and photolyases share FAD cofactor binding, sequence homology, and the ability to react to short wavelengths of visible light. Photolysases are responsible for light-dependent DNA repair by removal of two types of uv-induced DNA dimerizations. Cryptochromes have other functions, often regulatory and often largely unknown, which may include circadian clock entrainment and control of development. Members of this subfamily are known so far only in plants; they may show some photolyase activity in vitro but appear mostly to be regulatory proteins that respond to blue light.
Probab=66.35 E-value=2.7 Score=36.85 Aligned_cols=20 Identities=45% Similarity=0.806 Sum_probs=17.7
Q ss_pred HHhhCCChhhHHHHHHHHhh
Q psy4018 108 EKKFDPNGTYKAKYQKELSD 127 (136)
Q Consensus 108 ~~KyDP~~~y~~~~~~~l~~ 127 (136)
.+||||+|+|.++|--+|+.
T Consensus 409 ~~~~Dp~g~yir~wvPeL~~ 428 (475)
T TIGR02766 409 GYKFDPNGEYVRRWLPELAR 428 (475)
T ss_pred HhhcCCCcccHHHhChhhcc
Confidence 78999999999999888765
No 15
>TIGR03556 photolyase_8HDF deoxyribodipyrimidine photo-lyase, 8-HDF type. This model describes a narrow clade of cyanobacterial deoxyribodipyrimidine photo-lyase. This group, in contrast to several closely related proteins, uses a chromophore that, in other lineages is modified further to become coenzyme F420. This chromophore is called 8-HDF in most articles on the DNA photolyase and FO in most literature on coenzyme F420.
Probab=63.28 E-value=3.8 Score=36.09 Aligned_cols=23 Identities=30% Similarity=0.628 Sum_probs=19.4
Q ss_pred HHHHhhCCChhhHHHHHHHHhhc
Q psy4018 106 ELEKKFDPNGTYKAKYQKELSDL 128 (136)
Q Consensus 106 ~l~~KyDP~~~y~~~~~~~l~~~ 128 (136)
.-.+||||+|+|.++|.-+|+..
T Consensus 408 ~q~~k~Dp~G~yIr~w~PeL~~~ 430 (471)
T TIGR03556 408 SQAQKFDPEAEYIRRWLPELRSV 430 (471)
T ss_pred HHHHHhCCCCchHHHhCHhhccC
Confidence 34789999999999999888763
No 16
>PRK10674 deoxyribodipyrimidine photolyase; Provisional
Probab=62.97 E-value=3.6 Score=36.29 Aligned_cols=22 Identities=27% Similarity=0.635 Sum_probs=19.1
Q ss_pred HHHhhCCChhhHHHHHHHHhhc
Q psy4018 107 LEKKFDPNGTYKAKYQKELSDL 128 (136)
Q Consensus 107 l~~KyDP~~~y~~~~~~~l~~~ 128 (136)
..+||||+|+|.++|.-+|+..
T Consensus 405 q~~k~Dp~g~yIr~w~PeL~~~ 426 (472)
T PRK10674 405 QGERFDRDGEFIRRWLPELRDV 426 (472)
T ss_pred HHHHhCCCCChHHHhChhhccC
Confidence 3789999999999999888764
No 17
>PF11919 DUF3437: Domain of unknown function (DUF3437); InterPro: IPR021843 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is typically between 142 to 163 amino acids in length. ; PDB: 3L5Q_6 1VSY_5.
Probab=61.64 E-value=3.3 Score=29.09 Aligned_cols=46 Identities=20% Similarity=0.429 Sum_probs=29.4
Q ss_pred hhHHHHHhcCccCCHHH--HhhHHHHHHHHHhcChHHHHHHHHhhCCC
Q psy4018 69 KLPDAIATECKSCSEKQ--KEGSKKIFKYLIDNKPEQWAELEKKFDPN 114 (136)
Q Consensus 69 ~lpeal~t~C~KCT~kQ--K~~~~kvi~~l~~~~P~~w~~l~~KyDP~ 114 (136)
.+|++|..=|...++.| +..+++.+.-.++.|.|.|.+-..+|.++
T Consensus 27 w~P~~l~~La~~~~~~~~I~~tvk~tl~eFkrtH~D~W~~~~~~Ft~~ 74 (90)
T PF11919_consen 27 WMPEVLEELARHANDPQPIRTTVKKTLSEFKRTHQDTWHEHKKKFTED 74 (90)
T ss_dssp HHHHHHHHHHTTSSS-SSHHHHTHHHHHHHHHHTSTTHHHHGGG--SS
T ss_pred cHHHHHHHHHHHhCCCchHHHHHHHHHHHHHHhCcccHHHHHHhCCHH
Confidence 44555554444444333 35567778888889999998888888776
No 18
>PF04245 NA37: 37-kD nucleoid-associated bacterial protein; InterPro: IPR007358 The Escherichia coli nucleoid contains DNA in a condensed but functional form. Analysis of proteins released from isolated spermidine nucleoids after treatment with DNase I revealed significant amounts of two proteins not previously detected in wild-type E. coli. Partial amino-terminal sequencing has identified them as the products of rdgC and yejK. These proteins are strongly conserved in Gram-negative bacteria, suggesting that they have important cellular roles []. This entry represents the product of gene yejK, nucleoid associated protein NdpA. ; GO: 0009295 nucleoid
Probab=59.95 E-value=51 Score=26.61 Aligned_cols=86 Identities=20% Similarity=0.287 Sum_probs=59.5
Q ss_pred HHhhhhcCCCCCChhHHHHHhhhHHHHHhcCccC--CHHHHhh-HHHHHHHHHhcChHHHHHHHHhhCCC--hhhHHHHH
Q psy4018 48 NYYNCLLDKGKCSPDGQELKNKLPDAIATECKSC--SEKQKEG-SKKIFKYLIDNKPEQWAELEKKFDPN--GTYKAKYQ 122 (136)
Q Consensus 48 ~y~~ClLdkgpCt~~gk~lK~~lpeal~t~C~KC--T~kQK~~-~~kvi~~l~~~~P~~w~~l~~KyDP~--~~y~~~~~ 122 (136)
.|+.=||+=.+-. ..+..-+.+-++++.-+..+ .+.++.. .+++..++.++.+=..+++...+.|+ -..++.|.
T Consensus 174 ~~~~~fL~~~~~~-~~~~~tk~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~v~~~e~~~~v~~~~~~~~~~~f~ 252 (325)
T PF04245_consen 174 YYFEDFLGCSEKI-SSKEKTKKLLKAVKKFIKEHNLDPEEKAKLKNKVVEYLEENEEVNIEEIAEEVFEDEDPEVKEEFE 252 (325)
T ss_pred HHHHHhcCCccCC-ChHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHhcCCCcCHHHHHHHHHHhhhHHHHHHHH
Confidence 3333344433322 23444555777777777777 4555444 45888999999999999999999855 47889999
Q ss_pred HHHhhcCCCCCC
Q psy4018 123 KELSDLKAGKPV 134 (136)
Q Consensus 123 ~~l~~~~~~~~~ 134 (136)
++++..+..+|.
T Consensus 253 ~~~~~~~~~~~~ 264 (325)
T PF04245_consen 253 EFLEENEYGLPE 264 (325)
T ss_pred HHHHhcccCCCC
Confidence 999988666664
No 19
>PF08910 Aida_N: Aida N-terminus; InterPro: IPR015006 This entry represents the axin interactor, dorsalization-associated protein family AIDA [].; PDB: 1UG7_A.
Probab=57.28 E-value=24 Score=25.76 Aligned_cols=36 Identities=25% Similarity=0.264 Sum_probs=26.9
Q ss_pred hHHHHHhhhHHHHHh-cCccCCHHHHhhHHHHHHHHH
Q psy4018 62 DGQELKNKLPDAIAT-ECKSCSEKQKEGSKKIFKYLI 97 (136)
Q Consensus 62 ~gk~lK~~lpeal~t-~C~KCT~kQK~~~~kvi~~l~ 97 (136)
+++.|.+.+..++.. ++..+|+.||+.+.||..-|.
T Consensus 27 eY~~La~~l~k~~~~~~~~~fte~qkk~i~Kia~cL~ 63 (106)
T PF08910_consen 27 EYQRLARQLKKEVQSHQDSDFTEDQKKTIGKIATCLE 63 (106)
T ss_dssp HHHHHHHHHHHHHT-SS--SS-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhcccCCCCCHHHHHHHHHHHHHHH
Confidence 567777778888887 999999999999999987764
No 20
>smart00582 RPR domain present in proteins, which are involved in regulation of nuclear pre-mRNA.
Probab=55.49 E-value=29 Score=24.30 Aligned_cols=37 Identities=16% Similarity=0.281 Sum_probs=31.6
Q ss_pred hHHHHHhhhHHHHHhcCccCCHHHHhhHHHHHHHHHh
Q psy4018 62 DGQELKNKLPDAIATECKSCSEKQKEGSKKIFKYLID 98 (136)
Q Consensus 62 ~gk~lK~~lpeal~t~C~KCT~kQK~~~~kvi~~l~~ 98 (136)
....|..++++++......+.+..|+.+.++++.-.+
T Consensus 71 f~~~~~~~~~~~~~~~~~~~~~~~~~ki~kll~iW~~ 107 (121)
T smart00582 71 FGDELGPVFQDALRDVLGAANDETKKKIRRLLNIWEE 107 (121)
T ss_pred HHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHhc
Confidence 3678888999999999999999999999999987554
No 21
>PF14658 EF-hand_9: EF-hand domain
Probab=55.36 E-value=25 Score=23.51 Aligned_cols=37 Identities=16% Similarity=0.314 Sum_probs=25.5
Q ss_pred HHHHHHHHHhcChHHH--HHHHHhhCCChh----hHHHHHHHH
Q psy4018 89 SKKIFKYLIDNKPEQW--AELEKKFDPNGT----YKAKYQKEL 125 (136)
Q Consensus 89 ~~kvi~~l~~~~P~~w--~~l~~KyDP~~~----y~~~~~~~l 125 (136)
+..+++++-.+.|+++ +.|.+-+||+|. +++.|..-+
T Consensus 20 l~~~Lra~~~~~p~e~~Lq~l~~elDP~g~~~~v~~d~F~~iM 62 (66)
T PF14658_consen 20 LITYLRAVTGRSPEESELQDLINELDPEGRDGSVNFDTFLAIM 62 (66)
T ss_pred HHHHHHHHcCCCCcHHHHHHHHHHhCCCCCCceEeHHHHHHHH
Confidence 4456666666688777 457777799999 666665544
No 22
>PF13956 Ibs_toxin: Toxin Ibs, type I toxin-antitoxin system
Probab=53.20 E-value=6.1 Score=20.33 Aligned_cols=11 Identities=9% Similarity=0.108 Sum_probs=7.0
Q ss_pred hHHHHHHHHHH
Q psy4018 5 CSAAVGLFLAV 15 (136)
Q Consensus 5 ~MK~~i~l~~~ 15 (136)
|||.++++.++
T Consensus 1 MMk~vIIlvvL 11 (19)
T PF13956_consen 1 MMKLVIILVVL 11 (19)
T ss_pred CceehHHHHHH
Confidence 57877665544
No 23
>PF03441 FAD_binding_7: FAD binding domain of DNA photolyase from Prosite.; InterPro: IPR005101 This entry represents a multi-helical domain composed of two all-alpha subdomains that is found as the C-terminal domain in cryptochrome proteins, as well as at the N-terminal of DNA photolyase where it acts as a FAD-binding domain (the N-terminal of DNA photolyase binds a light-harvesting cofactor). Photolyases and cryptochromes are related flavoproteins that bind FAD. Photolyases harness the energy of blue light to repair DNA damage by removing pyrimidine dimers. Cryptochromes (CRY1 and CRY2) are blue light photoreceptors that mediate blue light-induced gene expression [, ]. DNA photolyases are DNA repair enzymes that repair mismatched pyrimidine dimers induced by exposure to ultra-violet light. They bind to UV-damaged DNA containing pyrimidine dimers and, upon absorbing a near-UV photon (300 to 500 nm), they catalyse dimer splitting, breaking the cyclobutane ring joining the two pyrimidines of the dimer so as to split them into the constituent monomers; this process is called photoreactivation. DNA photolyases require two choromophore-cofactors for their activity. All monomers contain a reduced FAD moiety, and, in addition, either a reduced pterin or 8-hydroxy-5-diazaflavin as a second chromophore. Either chromophore may act as the primary photon acceptor, peak absorptions occurring in the blue region of the spectrum and in the UV-B region, at a wavelength around 290nm [, ].; GO: 0003913 DNA photolyase activity, 0006281 DNA repair; PDB: 3UMV_A 3ZXS_A 1DNP_A 2XRZ_B 2XRY_A 2VTB_A 2J4D_B 2IJG_X 3TVS_A 2E0I_D ....
Probab=52.06 E-value=6.2 Score=32.33 Aligned_cols=21 Identities=48% Similarity=0.892 Sum_probs=17.4
Q ss_pred HHHhhCCChhhHHHHHHHHhh
Q psy4018 107 LEKKFDPNGTYKAKYQKELSD 127 (136)
Q Consensus 107 l~~KyDP~~~y~~~~~~~l~~ 127 (136)
..++|||+|+|.+++--+|+.
T Consensus 207 q~~~~Dp~g~~ir~w~PeL~~ 227 (277)
T PF03441_consen 207 QSKKFDPDGEYIRRWVPELAD 227 (277)
T ss_dssp HHHHHSTTSHHHHHHSGGGTT
T ss_pred HHHhhCcHHHHHHHHHHHHhc
Confidence 357999999999999766655
No 24
>cd08806 CARD_CARD14_CARMA2 Caspase activation and recruitment domain of CARD14-like proteins. Caspase activation and recruitment domain (CARD) similar to that found in CARD14, also known as BIMP2 or CARMA2 (caspase recruitment domain-containing membrane-associated guanylate kinase protein 2). CARD14 has been identified as a novel member of the MAGUK (membrane-associated guanylate kinase) family that functions as upstream activators of BCL10 (B-cell lymphoma 10) and NF-kB signaling. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways
Probab=41.94 E-value=65 Score=22.70 Aligned_cols=56 Identities=23% Similarity=0.361 Sum_probs=37.6
Q ss_pred cCccCCHHHHhcCHHHHHHHhhhhcCCCCCChhHHHHHhhhHHHHHhcCccCCHHHHhhHHHHHHHHHhcChHHHHHH
Q psy4018 30 KYDNVDLDQILKNDRLFNNYYNCLLDKGKCSPDGQELKNKLPDAIATECKSCSEKQKEGSKKIFKYLIDNKPEQWAEL 107 (136)
Q Consensus 30 ~~D~iDid~iL~N~rl~~~y~~ClLdkgpCt~~gk~lK~~lpeal~t~C~KCT~kQK~~~~kvi~~l~~~~P~~w~~l 107 (136)
-+|.-|.++|+++..+.++-.+ -+.+=+.|+|. =++++..|++.|+..||+-+.++
T Consensus 30 IL~~~deeeIls~~t~~~r~~k---------------~g~LLDIL~tr-------G~~g~~aFLeSLe~~yP~ly~~i 85 (86)
T cd08806 30 VLDQLDEEEVLHSPRLTNRAMR---------------VGHLLDLLKTR-------GKNGAIAFLESLKFHNPDVYTLV 85 (86)
T ss_pred CCChhhHHHHHccchHHHHHHH---------------HHHHHHHHHhc-------CchHHHHHHHHHHHHCHHHHHhc
Confidence 4688999999999887665433 12222333321 13578889999999999988764
No 25
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=40.92 E-value=15 Score=25.93 Aligned_cols=9 Identities=33% Similarity=0.342 Sum_probs=4.2
Q ss_pred hhHHHHHHH
Q psy4018 4 LCSAAVGLF 12 (136)
Q Consensus 4 ~~MK~~i~l 12 (136)
|..|.+++|
T Consensus 1 MaSK~~llL 9 (95)
T PF07172_consen 1 MASKAFLLL 9 (95)
T ss_pred CchhHHHHH
Confidence 345555443
No 26
>KOG0133|consensus
Probab=40.69 E-value=21 Score=32.73 Aligned_cols=27 Identities=33% Similarity=0.504 Sum_probs=23.2
Q ss_pred HHHHHhhCCChhhHHHHHHHHhhcCCC
Q psy4018 105 AELEKKFDPNGTYKAKYQKELSDLKAG 131 (136)
Q Consensus 105 ~~l~~KyDP~~~y~~~~~~~l~~~~~~ 131 (136)
..+-++|||+|.|.+.+...++..+..
T Consensus 422 ~~~~kk~dP~g~yir~~lp~l~~~p~~ 448 (531)
T KOG0133|consen 422 VALGKKLDPDGLYIRQWLPELRSGPMH 448 (531)
T ss_pred HHHhCcCCcchhhHHHHhHHHhcCCcc
Confidence 467889999999999999999886655
No 27
>PF12066 DUF3546: Domain of unknown function (DUF3546); InterPro: IPR021933 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is typically between 93 to 114 amino acids in length. This domain has two completely conserved Y residues that may be functionally important. ; PDB: 3AX1_A.
Probab=39.62 E-value=23 Score=25.41 Aligned_cols=25 Identities=24% Similarity=0.696 Sum_probs=18.9
Q ss_pred HHHHHHHHHhcChHHHHHHHHhhCCCh
Q psy4018 89 SKKIFKYLIDNKPEQWAELEKKFDPNG 115 (136)
Q Consensus 89 ~~kvi~~l~~~~P~~w~~l~~KyDP~~ 115 (136)
.+.+-.++..|+.++| |..||+|+.
T Consensus 31 ~~q~~~FF~~HK~eeW--FreKY~P~~ 55 (110)
T PF12066_consen 31 RKQLRAFFEQHKDEEW--FREKYHPEE 55 (110)
T ss_dssp HHHHHHHHHHHTTSHH--HHHHH-HHH
T ss_pred HHHHHHHHHHhcCCHH--HHHhcCcHh
Confidence 3445567889999999 789999984
No 28
>PRK15370 E3 ubiquitin-protein ligase SlrP; Provisional
Probab=38.48 E-value=44 Score=31.68 Aligned_cols=37 Identities=30% Similarity=0.436 Sum_probs=31.0
Q ss_pred HHHHhcChHHHHHHHHh-h-CCC-hhhHHHHHHHHhhcCC
Q psy4018 94 KYLIDNKPEQWAELEKK-F-DPN-GTYKAKYQKELSDLKA 130 (136)
Q Consensus 94 ~~l~~~~P~~w~~l~~K-y-DP~-~~y~~~~~~~l~~~~~ 130 (136)
.||+..+|+.|.++..| | +=+ ++|..++.+.|+..|.
T Consensus 671 ~~l~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 710 (754)
T PRK15370 671 KVLERIAPEEWREMMAKRAECIETDEYQSRVNAELEALGI 710 (754)
T ss_pred HHHHHhChHHHHHHHHHHHHHhchhhHHHHHHHHHHhcCC
Confidence 35778999999998765 4 778 9999999999998774
No 29
>COG0415 PhrB Deoxyribodipyrimidine photolyase [DNA replication, recombination, and repair]
Probab=38.36 E-value=16 Score=32.89 Aligned_cols=21 Identities=38% Similarity=0.842 Sum_probs=17.8
Q ss_pred HHhhCCChhhHHHHHHHHhhc
Q psy4018 108 EKKFDPNGTYKAKYQKELSDL 128 (136)
Q Consensus 108 ~~KyDP~~~y~~~~~~~l~~~ 128 (136)
.+||||+|.|.++|--+|+..
T Consensus 399 ~~kfDp~g~fIr~wvPeL~~~ 419 (461)
T COG0415 399 AEKFDPDGEFIRRWVPELRNL 419 (461)
T ss_pred HhhcCCCcccHHhhCHHhhCC
Confidence 468999999999998888763
No 30
>COG3636 Predicted transcriptional regulator [Transcription]
Probab=37.09 E-value=31 Score=24.95 Aligned_cols=31 Identities=26% Similarity=0.564 Sum_probs=28.1
Q ss_pred CCcCccCCHHHHhcCHHHHHHHhhhhcCCCC
Q psy4018 28 TTKYDNVDLDQILKNDRLFNNYYNCLLDKGK 58 (136)
Q Consensus 28 ~~~~D~iDid~iL~N~rl~~~y~~ClLdkgp 58 (136)
+..++++|+.+.|.|++....|++=.|++|.
T Consensus 5 ~~~~~~~d~ae~l~~ee~ia~yL~~~le~~d 35 (100)
T COG3636 5 TRLLDDFDVAELLTDEEAIAAYLNAALEEGD 35 (100)
T ss_pred hhhcccccHHHHhCCHHHHHHHHHHHHHcCC
Confidence 3467899999999999999999999999986
No 31
>PF10731 Anophelin: Thrombin inhibitor from mosquito; InterPro: IPR018932 Members of this family are all inhibitors of thrombin, the peptidase that is at the end of the blood coagulation cascade and which creates the clot by cleaving fibrinogen. The interaction between thrombin and fibrinogen involves two different areas of contact - via the thrombin active site and via a second substrate-binding site known as an exosite. The inhibitor acts by blocking the exosite, rather than by interacting with the active site. The inhibitors are from mosquitoes that feed on human blood and which, by inhibiting thrombin, prevent the blood from clotting and keep it flowing.
Probab=36.53 E-value=27 Score=23.24 Aligned_cols=8 Identities=13% Similarity=0.272 Sum_probs=3.8
Q ss_pred hhHHHHHH
Q psy4018 4 LCSAAVGL 11 (136)
Q Consensus 4 ~~MK~~i~ 11 (136)
|.-|++++
T Consensus 1 MA~Kl~vi 8 (65)
T PF10731_consen 1 MASKLIVI 8 (65)
T ss_pred CcchhhHH
Confidence 34455544
No 32
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=35.53 E-value=52 Score=20.68 Aligned_cols=35 Identities=23% Similarity=0.283 Sum_probs=28.4
Q ss_pred CHHHHHHHhhhhcCCCCCChhHHHHHhhhHHHHHh
Q psy4018 42 NDRLFNNYYNCLLDKGKCSPDGQELKNKLPDAIAT 76 (136)
Q Consensus 42 N~rl~~~y~~ClLdkgpCt~~gk~lK~~lpeal~t 76 (136)
|=.--.+|++-+|...|=++.++.|+..|-+.++.
T Consensus 16 ~Y~~A~~~~~~lL~~eP~N~Qa~~L~~~i~~~i~k 50 (53)
T PF14853_consen 16 EYEKARRYCDALLEIEPDNRQAQSLKELIEDKIQK 50 (53)
T ss_dssp -HHHHHHHHHHHHHHTTS-HHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHHhc
Confidence 34455789999999999999999999999988764
No 33
>PF00996 GDI: GDP dissociation inhibitor; InterPro: IPR018203 Rab proteins constitute a family of small GTPases that serve a regulatory role in vesicular membrane traffic [, ]; C-terminal geranylgeranylation is crucial for their membrane association and function. This post-translational modification is catalysed by Rab geranylgeranyl transferase (Rab-GGTase), a multi-subunit enzyme that contains a catalytic heterodimer and an accessory component, termed Rab escort protein (REP)-1 []. REP-1 presents newly- synthesised Rab proteins to the catalytic component, and forms a stable complex with the prenylated proteins following the transfer reaction. The mechanism of REP-1-mediated membrane association of Rab5 is similar to that mediated by Rab GDP dissociation inhibitor (GDI). REP-1 and Rab GDI also share other functional properties, including the ability to inhibit the release of GDP and to remove Rab proteins from membranes. The crystal structure of the bovine alpha-isoform of Rab GDI has been determined to a resolution of 1.81A []. The protein is composed of two main structural units: a large complex multi-sheet domain I, and a smaller alpha-helical domain II. The structural organisation of domain I is closely related to FAD-containing monooxygenases and oxidases []. Conserved regions common to GDI and the choroideraemia gene product, which delivers Rab to catalytic subunits of Rab geranylgeranyltransferase II, are clustered on one face of the domain []. The two most conserved regions form a compact structure at the apex of the molecule; site-directed mutagenesis has shown these regions to play a critical role in the binding of Rab proteins [].; PDB: 1VG9_C 1VG0_A 1LTX_R 3P1W_A 3CPH_H 3CPJ_G 3CPI_H 1UKV_G 2BCG_G 1GND_A ....
Probab=35.10 E-value=21 Score=31.70 Aligned_cols=34 Identities=24% Similarity=0.331 Sum_probs=24.4
Q ss_pred HHHHhcCccCCHHHHhhHHHHHHHHHh---cChHHHHHH
Q psy4018 72 DAIATECKSCSEKQKEGSKKIFKYLID---NKPEQWAEL 107 (136)
Q Consensus 72 eal~t~C~KCT~kQK~~~~kvi~~l~~---~~P~~w~~l 107 (136)
|++.+ .--+-.+|+.+.|+++|+.+ +.|+.|+.+
T Consensus 125 dvf~s--~~lsl~eKR~lmkFl~~v~~~~~~~~~~~~~~ 161 (438)
T PF00996_consen 125 DVFKS--KLLSLFEKRRLMKFLKFVANYEEDDPSTHKGL 161 (438)
T ss_dssp HHHC---TTS-HHHHHHHHHHHHHHHHGCTTBGGGSTTG
T ss_pred HhhcC--CCccHHHHHHHHHHHHHHhhcccCCcchhhcc
Confidence 44443 66899999999999999985 356677654
No 34
>KOG3481|consensus
Probab=34.35 E-value=49 Score=23.35 Aligned_cols=23 Identities=22% Similarity=0.141 Sum_probs=19.8
Q ss_pred cCCCCCChhHHHHHhhhHHHHHh
Q psy4018 54 LDKGKCSPDGQELKNKLPDAIAT 76 (136)
Q Consensus 54 LdkgpCt~~gk~lK~~lpeal~t 76 (136)
.+..||+...+.++.-+-+||.|
T Consensus 37 ~~~~pC~~l~k~Y~~Cv~kal~t 59 (87)
T KOG3481|consen 37 SSGEPCSRLFKVYKQCVQKALKT 59 (87)
T ss_pred cccCcHHHHHHHHHHHHHHHHhh
Confidence 34569999999999999999976
No 35
>TIGR01678 FAD_lactone_ox sugar 1,4-lactone oxidases. This model represents a family of at least two different sugar 1,4 lactone oxidases, both involved in synthesizing ascorbic acid or a derivative. These include L-gulonolactone oxidase (EC 1.1.3.8) from rat and D-arabinono-1,4-lactone oxidase (EC 1.1.3.37) from Saccharomyces cerevisiae. Members are proposed to have the cofactor FAD covalently bound at a site specified by Prosite motif PS00862; OX2_COVAL_FAD; 1.
Probab=32.70 E-value=27 Score=30.69 Aligned_cols=24 Identities=25% Similarity=0.628 Sum_probs=18.4
Q ss_pred HHHhcCh--HHHHHHHHhhCCChhhH
Q psy4018 95 YLIDNKP--EQWAELEKKFDPNGTYK 118 (136)
Q Consensus 95 ~l~~~~P--~~w~~l~~KyDP~~~y~ 118 (136)
-|.+.|| ++|.++.+++||+|.+.
T Consensus 411 ~l~~~YP~~~~F~~vr~~~DP~g~F~ 436 (438)
T TIGR01678 411 DFEEMYPTLHKFCDIRKKLDPTGVFL 436 (438)
T ss_pred HHHHHCcCHHHHHHHHHhhCcccccC
Confidence 3445566 57889999999999875
No 36
>PF02950 Conotoxin: Conotoxin; InterPro: IPR004214 Cone snail toxins, conotoxins, are small neurotoxic peptides with disulphide connectivity that target ion-channels or G-protein coupled receptors. Based on the number and pattern of disulphide bonds and biological activities, conotoxins can be classified into several families []. Omega, delta and kappa families of conotoxins have a knottin or inhibitor cysteine knot scaffold. The knottin scaffold is a very special disulphide-through-disulphide knot, in which the III-VI disulphide bond crosses the macrocycle formed by two other disulphide bonds (I-IV and II-V) and the interconnecting backbone segments, where I-VI indicates the six cysteine residues starting from the N terminus. The disulphide bonding network, as well as specific amino acids in inter-cysteine loops, provide the specificity of conotoxins []. The cysteine arrangements are the same for omega, delta and kappa families, even though omega conotoxins are calcium channel blockers, whereas delta conotoxins delay the inactivation of sodium channels, and kappa conotoxins are potassium channel blockers []. Mu conotoxins have two types of cysteine arrangements, but the knottin scaffold is not observed. Mu conotoxins target the voltage-gated sodium channels [], and are useful probes for investigating voltage-dependent sodium channels of excitable tissues []. Alpha conotoxins have two types of cysteine arrangements [], and are competitive nicotinic acetylcholine receptor antagonists. ; GO: 0008200 ion channel inhibitor activity, 0009405 pathogenesis, 0005576 extracellular region; PDB: 2EFZ_A 1FYG_A 1RMK_A 1DG0_A 1DFY_A 1DFZ_A 2JQC_A 2YYF_A 2JQB_A 1F3K_A ....
Probab=32.63 E-value=15 Score=23.87 Aligned_cols=10 Identities=20% Similarity=0.481 Sum_probs=3.5
Q ss_pred hhhhcCCCCC
Q psy4018 50 YNCLLDKGKC 59 (136)
Q Consensus 50 ~~ClLdkgpC 59 (136)
-.|.-.-++|
T Consensus 48 ~~C~~~g~~C 57 (75)
T PF02950_consen 48 RRCTPPGSYC 57 (75)
T ss_dssp --EB-TTSB-
T ss_pred cccCCCCCcC
Confidence 3465555566
No 37
>PF03564 DUF1759: Protein of unknown function (DUF1759); InterPro: IPR005312 This is a small family of proteins of unknown function.
Probab=32.50 E-value=79 Score=22.62 Aligned_cols=39 Identities=23% Similarity=0.407 Sum_probs=27.9
Q ss_pred HHHHHHHH---HhcChHHHHHHHHhhCCChhhHHHHHHHHhh
Q psy4018 89 SKKIFKYL---IDNKPEQWAELEKKFDPNGTYKAKYQKELSD 127 (136)
Q Consensus 89 ~~kvi~~l---~~~~P~~w~~l~~KyDP~~~y~~~~~~~l~~ 127 (136)
+...+.++ ..+|+..|+.|.++|+....-...+.+.+..
T Consensus 42 A~~~i~~~~~~~~~Y~~a~~~L~~~yg~~~~i~~~~~~~l~~ 83 (145)
T PF03564_consen 42 AKELIRGLPLSEENYEEAWELLEERYGNPRRIIQALLEELRN 83 (145)
T ss_pred HHHHHHcccccchhhHHHHHHHHHHhCCchHHHHHHHHHHhc
Confidence 34455555 4899999999999997766666666655544
No 38
>PF14194 Cys_rich_VLP: Cysteine-rich VLP
Probab=31.42 E-value=48 Score=21.63 Aligned_cols=20 Identities=15% Similarity=0.174 Sum_probs=16.9
Q ss_pred CHHHHhhHHHHHHHHHhcCh
Q psy4018 82 SEKQKEGSKKIFKYLIDNKP 101 (136)
Q Consensus 82 T~kQK~~~~kvi~~l~~~~P 101 (136)
|++|++.++++++..--||-
T Consensus 1 T~~q~r~~~~LV~~~C~Nyd 20 (56)
T PF14194_consen 1 TPRQRRRIRKLVRRECCNYD 20 (56)
T ss_pred CHHHHHHHHHHHHHHcccCC
Confidence 78999999999998777764
No 39
>KOG0210|consensus
Probab=30.57 E-value=38 Score=32.76 Aligned_cols=23 Identities=22% Similarity=0.407 Sum_probs=17.8
Q ss_pred HhcCccCCHHHHhhHHHHHHHHH
Q psy4018 75 ATECKSCSEKQKEGSKKIFKYLI 97 (136)
Q Consensus 75 ~t~C~KCT~kQK~~~~kvi~~l~ 97 (136)
.--|-+|||.||..+-..++-..
T Consensus 758 aVv~CRctPtQKA~v~~llq~~t 780 (1051)
T KOG0210|consen 758 AVVCCRCTPTQKAQVVRLLQKKT 780 (1051)
T ss_pred cEEEEecChhHHHHHHHHHHHhh
Confidence 34677999999999888776443
No 40
>PF01726 LexA_DNA_bind: LexA DNA binding domain; InterPro: IPR006199 This is the DNA binding domain of the LexA SOS regulon repressor which prevents expression of DNA repair proteins in bacteria. The aligned region contains a variant form of the helix-turn-helix DNA binding motif []. This domain usually at the N terminus is found associated with IPR006198 from INTERPRO the auto-proteolytic domain of LexA 3.4.21.88 from EC.; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1LEA_A 1JHH_A 3JSP_A 1JHF_A 3JSO_B 1LEB_A 3K2Z_A.
Probab=30.38 E-value=1.6e+02 Score=19.11 Aligned_cols=48 Identities=21% Similarity=0.223 Sum_probs=32.4
Q ss_pred CHHHHhhHHHHHHHHHh-cChHHHHHHHHhhCCC-hhhHHHHHHHHhhcC
Q psy4018 82 SEKQKEGSKKIFKYLID-NKPEQWAELEKKFDPN-GTYKAKYQKELSDLK 129 (136)
Q Consensus 82 T~kQK~~~~kvi~~l~~-~~P~~w~~l~~KyDP~-~~y~~~~~~~l~~~~ 129 (136)
|++|++...-+.+|+.+ -+|....+|.+.+.=. -.=...+.+.|...|
T Consensus 5 T~rQ~~vL~~I~~~~~~~G~~Pt~rEIa~~~g~~S~~tv~~~L~~Le~kG 54 (65)
T PF01726_consen 5 TERQKEVLEFIREYIEENGYPPTVREIAEALGLKSTSTVQRHLKALERKG 54 (65)
T ss_dssp -HHHHHHHHHHHHHHHHHSS---HHHHHHHHTSSSHHHHHHHHHHHHHTT
T ss_pred CHHHHHHHHHHHHHHHHcCCCCCHHHHHHHhCCCChHHHHHHHHHHHHCc
Confidence 89999999999999885 5899999999998643 333344555555554
No 41
>TIGR00952 S15_bact ribosomal protein S15, bacterial/organelle. This model is built to recognize specifically bacterial, chloroplast, and mitochondrial ribosomal protein S15. The homologous proteins of Archaea and Eukarya are designated S13.
Probab=29.59 E-value=79 Score=22.00 Aligned_cols=24 Identities=25% Similarity=0.555 Sum_probs=19.3
Q ss_pred HHHHHHHHHhcChHHHHHHHHhhC
Q psy4018 89 SKKIFKYLIDNKPEQWAELEKKFD 112 (136)
Q Consensus 89 ~~kvi~~l~~~~P~~w~~l~~KyD 112 (136)
=+++++||.++.++.|..++++.+
T Consensus 60 RkrlL~YL~~~d~~~Y~~li~~Lg 83 (86)
T TIGR00952 60 RRRLLKYLKRTDVERYRSLIKRLG 83 (86)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHhC
Confidence 367888999999999988887653
No 42
>PF15335 CAAP1: Caspase activity and apoptosis inhibitor 1
Probab=28.98 E-value=1.2e+02 Score=20.19 Aligned_cols=35 Identities=14% Similarity=0.268 Sum_probs=26.1
Q ss_pred CHHHHhcCHHHHHHHhhhhcCCCCCChhHHHHHhhhHHHHHh
Q psy4018 35 DLDQILKNDRLFNNYYNCLLDKGKCSPDGQELKNKLPDAIAT 76 (136)
Q Consensus 35 Did~iL~N~rl~~~y~~ClLdkgpCt~~gk~lK~~lpeal~t 76 (136)
.|...++|++.+-.+.=|.+. .++++.++|+.++.
T Consensus 4 pis~Yi~Dr~Eml~Q~F~~l~-------~kkl~~MlPd~LK~ 38 (64)
T PF15335_consen 4 PISFYIDDRKEMLRQCFSVLK-------EKKLQKMLPDILKG 38 (64)
T ss_pred cHHHHHccHHHHHHHHHHHcC-------HHHHHHHCHHHHhc
Confidence 466778888887776555443 48999999999974
No 43
>PF12399 BCA_ABC_TP_C: Branched-chain amino acid ATP-binding cassette transporter
Probab=28.90 E-value=50 Score=17.57 Aligned_cols=15 Identities=20% Similarity=0.313 Sum_probs=13.1
Q ss_pred HHHHhcCHHHHHHHh
Q psy4018 36 LDQILKNDRLFNNYY 50 (136)
Q Consensus 36 id~iL~N~rl~~~y~ 50 (136)
.++|.+|++.+..|+
T Consensus 8 p~~i~~n~~V~~aYL 22 (23)
T PF12399_consen 8 PEEIRANPEVREAYL 22 (23)
T ss_pred HHHHhcCHHHHHhhC
Confidence 578999999999886
No 44
>PF06576 DUF1133: Protein of unknown function (DUF1133); InterPro: IPR010557 This family consists of a number of hypothetical proteins from Escherichia coli O157:H7 and Salmonella typhi. The function of this family is unknown.
Probab=28.68 E-value=1.3e+02 Score=23.80 Aligned_cols=52 Identities=15% Similarity=0.251 Sum_probs=45.2
Q ss_pred cCccCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhCCChhhHHHHHHHHhhc
Q psy4018 77 ECKSCSEKQKEGSKKIFKYLIDNKPEQWAELEKKFDPNGTYKAKYQKELSDL 128 (136)
Q Consensus 77 ~C~KCT~kQK~~~~kvi~~l~~~~P~~w~~l~~KyDP~~~y~~~~~~~l~~~ 128 (136)
.=+-||+...-.+++||.-+-.++|.-.+-+.+.|==-|++..+....|...
T Consensus 92 ~La~ctD~Eal~iDrVI~~vL~~~~gl~~Vl~qrY~~RgkSk~~MA~eL~~~ 143 (176)
T PF06576_consen 92 WLAFCTDDEALFIDRVIGEVLAEHPGLINVLRQRYCGRGKSKRKMAEELNEK 143 (176)
T ss_pred ccceecchHHHHHHHHHHHHHHhCcchHHHHHHHHHcccccHHHHHHHHhcc
Confidence 3567888889999999999999999999999999988889988888777654
No 45
>PRK05626 rpsO 30S ribosomal protein S15; Reviewed
Probab=28.57 E-value=84 Score=22.02 Aligned_cols=23 Identities=22% Similarity=0.534 Sum_probs=18.7
Q ss_pred HHHHHHHHhcChHHHHHHHHhhC
Q psy4018 90 KKIFKYLIDNKPEQWAELEKKFD 112 (136)
Q Consensus 90 ~kvi~~l~~~~P~~w~~l~~KyD 112 (136)
+++++||.++.++.|..++++.+
T Consensus 64 krlL~YL~~~d~~~Y~~li~~Lg 86 (89)
T PRK05626 64 RKLLDYLKKKDVERYRALIERLG 86 (89)
T ss_pred HHHHHHHHhcCHHHHHHHHHHhC
Confidence 57888999999988888887653
No 46
>PRK15387 E3 ubiquitin-protein ligase SspH2; Provisional
Probab=28.14 E-value=57 Score=31.22 Aligned_cols=37 Identities=27% Similarity=0.293 Sum_probs=31.4
Q ss_pred HHHHhcChHHHHHHHHhh--CCChhhHHHHHHHHhhcCC
Q psy4018 94 KYLIDNKPEQWAELEKKF--DPNGTYKAKYQKELSDLKA 130 (136)
Q Consensus 94 ~~l~~~~P~~w~~l~~Ky--DP~~~y~~~~~~~l~~~~~ 130 (136)
.+|+..+|+.|..+.+|= |=+++|..++.+.|+..|.
T Consensus 705 ~~L~~~~~~~f~~~~e~~~~~~e~~~~~~~~~~l~~~g~ 743 (788)
T PRK15387 705 RVLERKAPERVNALREKQISDYEETYRMLSDTELRPSGL 743 (788)
T ss_pred HHHHHhCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC
Confidence 367889999999998764 8889999999999998774
No 47
>CHL00027 rps15 ribosomal protein S15
Probab=27.99 E-value=88 Score=22.11 Aligned_cols=24 Identities=13% Similarity=0.405 Sum_probs=20.3
Q ss_pred HHHHHHHHHhcChHHHHHHHHhhC
Q psy4018 89 SKKIFKYLIDNKPEQWAELEKKFD 112 (136)
Q Consensus 89 ~~kvi~~l~~~~P~~w~~l~~KyD 112 (136)
=+++++||.++.++.|..++.+.+
T Consensus 60 RkrLL~YL~r~d~~~Y~~li~~Lg 83 (90)
T CHL00027 60 RQRLLAYLSKKNRVRYKKLISQLG 83 (90)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHhC
Confidence 367899999999999999988764
No 48
>TIGR03826 YvyF flagellar operon protein TIGR03826. This gene is found in flagellar operons of Bacillus-related organisms. Its function has not been determined and an official gene symbol has not been assigned, although the gene is designated yvyF in B. subtilus. A tentative assignment as a regulator is suggested in the NCBI record GI:16080597.
Probab=27.67 E-value=1.3e+02 Score=22.62 Aligned_cols=61 Identities=18% Similarity=0.356 Sum_probs=45.4
Q ss_pred CCChhHHHHHhhhHHHHHhcCccCCHHHHhhHHHHHHHHHhcCh--HHHHHHHHhhCCChhhHHHHH
Q psy4018 58 KCSPDGQELKNKLPDAIATECKSCSEKQKEGSKKIFKYLIDNKP--EQWAELEKKFDPNGTYKAKYQ 122 (136)
Q Consensus 58 pCt~~gk~lK~~lpeal~t~C~KCT~kQK~~~~kvi~~l~~~~P--~~w~~l~~KyDP~~~y~~~~~ 122 (136)
-|..=|+-|-.. .+.-|..|..+.-+.+.+|-+||.+|-. -.-.++..--+=+-+...+|-
T Consensus 5 nC~~CgklF~~~----~~~iCp~C~~~~e~~f~kV~~yLr~~p~~~ati~eV~e~tgVs~~~I~~~I 67 (137)
T TIGR03826 5 NCPKCGRLFVKT----GRDVCPSCYEEEEREFEKVYKFLRKHENRQATVSEIVEETGVSEKLILKFI 67 (137)
T ss_pred cccccchhhhhc----CCccCHHHhHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCcCHHHHHHHH
Confidence 344445555443 5567999999999999999999997755 677888877777777776664
No 49
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=27.12 E-value=41 Score=27.19 Aligned_cols=41 Identities=27% Similarity=0.518 Sum_probs=26.2
Q ss_pred CccCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhCCChhhHHHHHH
Q psy4018 78 CKSCSEKQKEGSKKIFKYLIDNKPEQWAELEKKFDPNGTYKAKYQK 123 (136)
Q Consensus 78 C~KCT~kQK~~~~kvi~~l~~~~P~~w~~l~~KyDP~~~y~~~~~~ 123 (136)
|.-|+ |-=+..+=|.+||..+|||..+++.+.. .|+..|..
T Consensus 80 C~lc~-KlFkg~eFV~KHI~nKH~e~ve~~~~ev----~~fnnY~~ 120 (214)
T PF04959_consen 80 CPLCG-KLFKGPEFVRKHIFNKHPEKVEEVKKEV----EYFNNYLL 120 (214)
T ss_dssp E-SSS--EESSHHHHHHHHHHH-HHHHHHHHHHH----HHHHHH--
T ss_pred CCCCC-cccCChHHHHHHHhhcCHHHHHHHHHHH----HHHHHHhc
Confidence 44454 2223467788999999999999998765 36666653
No 50
>smart00648 SWAP Suppressor-of-White-APricot splicing regulator. domain present in regulators which are responsible for pre-mRNA splicing processes
Probab=27.10 E-value=1.5e+02 Score=18.03 Aligned_cols=25 Identities=24% Similarity=0.230 Sum_probs=21.6
Q ss_pred hhHHHHHHHHHhcChHHHHHHHHhh
Q psy4018 87 EGSKKIFKYLIDNKPEQWAELEKKF 111 (136)
Q Consensus 87 ~~~~kvi~~l~~~~P~~w~~l~~Ky 111 (136)
+.++++..++.++=|+-...|.++.
T Consensus 2 ~iI~~tA~~Va~~G~~fe~~l~~~~ 26 (54)
T smart00648 2 DIIDKTAQFVARNGPEFEAKLMERE 26 (54)
T ss_pred cHHHHHHHHHHHhhHHHHHHHHHhc
Confidence 5688999999999999888888776
No 51
>PF11417 Inhibitor_G39P: Loader and inhibitor of phage G40P; InterPro: IPR024424 G39P inhibits the initiation of DNA replication by blocking G40P replicative helicase. G39P has a bipartite stricture consisting of a folded N-terminal domain and an unfolded C-terminal domain. The C-terminal is essential for helicase interaction [].; PDB: 1NO1_B.
Probab=27.07 E-value=71 Score=21.39 Aligned_cols=41 Identities=22% Similarity=0.457 Sum_probs=27.3
Q ss_pred HhhHHHHHHHHHhcCh---------------HHHHHHHHhhCCChhhHHHHHHHHhh
Q psy4018 86 KEGSKKIFKYLIDNKP---------------EQWAELEKKFDPNGTYKAKYQKELSD 127 (136)
Q Consensus 86 K~~~~kvi~~l~~~~P---------------~~w~~l~~KyDP~~~y~~~~~~~l~~ 127 (136)
|+-+-+++..+..-|| +.|..+++.|| =+.=..++.+.+++
T Consensus 3 k~E~~~ll~~I~~aYP~~~~~f~~~~~k~~v~~W~~~L~d~~-ye~v~~al~~~i~~ 58 (71)
T PF11417_consen 3 KEETAKLLKLIKAAYPQWAGNFKPTDSKETVDLWYDMLKDYD-YEIVMKALKKHIAT 58 (71)
T ss_dssp HHHHHHHHHHHHHHST---TT---STHHHHHHHHHHHHTTS--HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHCCcchhccchhhHHHHHHHHHHHHHhCC-HHHHHHHHHHHHHh
Confidence 5667789999999999 67888886654 33344445444543
No 52
>PF02771 Acyl-CoA_dh_N: Acyl-CoA dehydrogenase, N-terminal domain; InterPro: IPR006092 Mammalian Co-A dehydrogenases (1.3.99.3 from EC) are enzymes that catalyse the first step in each cycle of beta-oxidation in mitochondion. Acyl-CoA dehydrogenases [, , ] catalyze the alpha,beta-dehydrogenation of acyl-CoA thioesters to the corresponding trans 2,3-enoyl CoA-products with concommitant reduction of enzyme-bound FAD. Reoxidation of the flavin involves transfer of electrons to ETF (electron transfering flavoprotein). These enzymes are homodimers containing one molecule of FAD. The monomeric enzyme is folded into three domains of approximately equal size. The N-terminal and the C-terminal are mainly alpha-helices packed together, and the middle domain consists of two orthogonal beta-sheets. The flavin ring is buried in the crevise between two alpha-helical domains and the beta-sheet of one subunit, and the adenosine pyrophosphate moiety is stretched into the subunit junction with one formed by two C-terminal domains []. The N-terminal domain of Acyl-CoA dehydrogenase is an all-alpha domain, on dimerisation, the N-terminal of one molecule extends into the other dimer and lies on the surface of the molecule.; GO: 0003995 acyl-CoA dehydrogenase activity, 0055114 oxidation-reduction process; PDB: 2WBI_B 1SIQ_A 1SIR_A 2R0N_A 2R0M_A 2DVL_A 1UKW_B 3MDD_B 1UDY_C 3MDE_B ....
Probab=26.95 E-value=1.2e+02 Score=20.10 Aligned_cols=44 Identities=11% Similarity=0.268 Sum_probs=28.9
Q ss_pred CHHHHhhHHHHHHHHHhcChHHHHHHHHhhCCChhhHHHHHHHHhhcC
Q psy4018 82 SEKQKEGSKKIFKYLIDNKPEQWAELEKKFDPNGTYKAKYQKELSDLK 129 (136)
Q Consensus 82 T~kQK~~~~kvi~~l~~~~P~~w~~l~~KyDP~~~y~~~~~~~l~~~~ 129 (136)
||.|+.....+-+|+.++-++. ...+|-++.+-..+-+.+...|
T Consensus 1 t~~~~~l~~~~~~~~~~~~~~~----~~~~d~~~~~p~~~~~~l~~~G 44 (113)
T PF02771_consen 1 TEEQQALREEAREFAEEEIAPH----AAEWDEDGRFPREVWRALGEAG 44 (113)
T ss_dssp SHHHHHHHHHHHHHHHHHTHHH----HHHHHHHTSCHHHHHHHHHHTT
T ss_pred CHHHHHHHHHHHHHHHHHchHH----HHHHHHhCCCCHHHHHHHHHHH
Confidence 7889999999999998876554 3344444455555555555544
No 53
>PF03790 KNOX1: KNOX1 domain ; InterPro: IPR005540 The MEINOX region is comprised of two domains, KNOX1 and KNOX2. KNOX1 plays a role in suppressing target gene expression. KNOX2, essential for function, is thought to be necessary for homo-dimerization [].; GO: 0003677 DNA binding, 0005634 nucleus
Probab=26.93 E-value=41 Score=20.93 Aligned_cols=16 Identities=25% Similarity=0.468 Sum_probs=12.1
Q ss_pred HHHHHHHhhhhcCCCC
Q psy4018 43 DRLFNNYYNCLLDKGK 58 (136)
Q Consensus 43 ~rl~~~y~~ClLdkgp 58 (136)
++|+..|++|.==-.|
T Consensus 14 ~~Ll~Ayi~C~KVGAP 29 (45)
T PF03790_consen 14 PRLLAAYIDCQKVGAP 29 (45)
T ss_pred HHHHHHHHHHHhcCCC
Confidence 6899999999853333
No 54
>PLN02150 terpene synthase/cyclase family protein
Probab=26.81 E-value=1.3e+02 Score=20.97 Aligned_cols=50 Identities=16% Similarity=0.224 Sum_probs=35.3
Q ss_pred HHHHhhhhcCCCCCChh--HHHHHhhhHHHHH---hcCccCCHHHHhhHHHHHHH
Q psy4018 46 FNNYYNCLLDKGKCSPD--GQELKNKLPDAIA---TECKSCSEKQKEGSKKIFKY 95 (136)
Q Consensus 46 ~~~y~~ClLdkgpCt~~--gk~lK~~lpeal~---t~C~KCT~kQK~~~~kvi~~ 95 (136)
.-+.+.|.|.+-.+|.+ -+.+++.+-++-+ ..|-+.++--+..++.+++.
T Consensus 6 vaSsIeCYMke~g~seeeA~~~i~~li~~~WK~iN~e~l~~~~~p~~~~~~~~Nl 60 (96)
T PLN02150 6 VANGVNCYMKQHGVTKEEAVSELKKMIRDNYKIVMEEFLTIKDVPRPVLVRCLNL 60 (96)
T ss_pred chHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHH
Confidence 45789999999889987 5677888877744 35766655555555555553
No 55
>PF14060 DUF4252: Domain of unknown function (DUF4252)
Probab=26.36 E-value=78 Score=23.03 Aligned_cols=47 Identities=26% Similarity=0.295 Sum_probs=28.1
Q ss_pred hHHHHHhhhHHHHHhcCccCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhC
Q psy4018 62 DGQELKNKLPDAIATECKSCSEKQKEGSKKIFKYLIDNKPEQWAELEKKFD 112 (136)
Q Consensus 62 ~gk~lK~~lpeal~t~C~KCT~kQK~~~~kvi~~l~~~~P~~w~~l~~KyD 112 (136)
+.+.+-+.+-..=--.+.+|+ .++.+.+-++-+.+ . +.|++|...=|
T Consensus 60 ~~~~~~~~i~~i~vl~~~~~~--~~~~~~~~~~~~~~-~-~~ye~l~~~~~ 106 (155)
T PF14060_consen 60 EIKELLKKIDSIRVLSLEDCS--NKQEFRKEFNKLKK-L-SGYEELMSVKD 106 (155)
T ss_pred HHHHHHHhCCceEEEEecCcc--hHHHHHHHHHHHHc-c-hhhHHHhhccC
Confidence 444443344333334677788 55566666666666 3 59999987666
No 56
>PLN03079 Uncharacterized protein At4g33100; Provisional
Probab=26.21 E-value=75 Score=22.64 Aligned_cols=33 Identities=24% Similarity=0.501 Sum_probs=26.3
Q ss_pred HHHHHHhhhhc------------CCCCCChhHHHHHhhhHHHHHh
Q psy4018 44 RLFNNYYNCLL------------DKGKCSPDGQELKNKLPDAIAT 76 (136)
Q Consensus 44 rl~~~y~~ClL------------dkgpCt~~gk~lK~~lpeal~t 76 (136)
.+...|-+||. ++.+|+++.+.++.=|-++|+.
T Consensus 20 elK~~YD~CFN~WYsEkFLKG~~~~~eC~~~w~~Yq~Cv~~aL~e 64 (91)
T PLN03079 20 ELRTAYHNCFNRWYSEKFVKGQWDKEDCVAEWHKYRACLSEHLED 64 (91)
T ss_pred HHHHHHHHHHHHHHHHhhhcCCcccchHHHHHHHHHHHHHHHHHH
Confidence 45667777776 4568999999999999888874
No 57
>PHA00447 lysozyme
Probab=25.76 E-value=57 Score=24.35 Aligned_cols=21 Identities=19% Similarity=0.216 Sum_probs=19.6
Q ss_pred CHHHHhhHHHHHHHHHhcChH
Q psy4018 82 SEKQKEGSKKIFKYLIDNKPE 102 (136)
Q Consensus 82 T~kQK~~~~kvi~~l~~~~P~ 102 (136)
|+.|++.+..++.+|.+.+|.
T Consensus 95 T~aQ~~al~~Ll~~L~~~~~~ 115 (142)
T PHA00447 95 TPAQMQSLKSLLVTLKAKYPG 115 (142)
T ss_pred CHHHHHHHHHHHHHHHHHCCC
Confidence 899999999999999999875
No 58
>PF11090 DUF2833: Protein of unknown function (DUF2833); InterPro: IPR020335 This entry contains proteins with no known function.
Probab=25.50 E-value=39 Score=23.80 Aligned_cols=30 Identities=23% Similarity=0.465 Sum_probs=20.0
Q ss_pred cCccCCHHHHhhHHHHHH----HHHhcChHHHHH
Q psy4018 77 ECKSCSEKQKEGSKKIFK----YLIDNKPEQWAE 106 (136)
Q Consensus 77 ~C~KCT~kQK~~~~kvi~----~l~~~~P~~w~~ 106 (136)
.+.++|++||..+.+.++ -+.+.||.-||-
T Consensus 29 ~v~~~~~~~~~eF~k~i~~~~d~~l~~Y~~l~N~ 62 (86)
T PF11090_consen 29 KVKSLTKKERREFRKLIKEYLDKMLKQYPVLWNF 62 (86)
T ss_pred HHhhcCHhhhHHHHHHHHHHHHHHHHHhhheeEE
Confidence 346889999877777765 444556666653
No 59
>cd08807 CARD_CARD10_CARMA3 Caspase activation and recruitment domain of CARD10-like proteins. Caspase activation and recruitment domain (CARD) similar to that found in CARD10, also known as CARMA3 (caspase recruitment domain-containing membrane-associated guanylate kinase protein 3) or BIMP1. The CARMA3-BCL10-MALT1 signalosome plays a role in the GPCR-induced NF-kB activation. CARMA3 is more widely expressed than CARMA1, which is found only in hematopoietic cells. In endothelial and smooth muscle cells, CARMA3-mediated NF-kB activation induces pro-inflammatory signals within the vasculature and is a key factor in atherogenesis. In bronchial epithelial cells, CARMA3-mediated NF-kB signaling is important for the development of allergic airway inflammation. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains fo
Probab=25.38 E-value=1.2e+02 Score=21.42 Aligned_cols=56 Identities=21% Similarity=0.269 Sum_probs=33.9
Q ss_pred cCccCCHHHHhcCHHHHHHHhhhhcCCCCCChhHHHHHhhhHHHHHhcCccCCHHHHhhHHHHHHHHHhcChHHHHHH
Q psy4018 30 KYDNVDLDQILKNDRLFNNYYNCLLDKGKCSPDGQELKNKLPDAIATECKSCSEKQKEGSKKIFKYLIDNKPEQWAEL 107 (136)
Q Consensus 30 ~~D~iDid~iL~N~rl~~~y~~ClLdkgpCt~~gk~lK~~lpeal~t~C~KCT~kQK~~~~kvi~~l~~~~P~~w~~l 107 (136)
-++.-|.++|++..++..+--+ -..+=+.|+|. =++++..|++.|...||+-|..+
T Consensus 30 VL~~~deEEI~~~~~~~~~~~k---------------~~~LLDIL~tr-------G~~gf~aFLeSLE~~yp~~y~~~ 85 (86)
T cd08807 30 VIDEQDEEEVLNSYRFPCRINR---------------TGRLMDILRGR-------GKRGYEAFLEALEFYYPEHFTLL 85 (86)
T ss_pred CCChhhHHHHHhccchhhHHHH---------------HHHHHHHHHhc-------CchHHHHHHHHHHhhhHHHhhcc
Confidence 3577899999998885322110 11122222221 13567788888888899888654
No 60
>PRK09706 transcriptional repressor DicA; Reviewed
Probab=25.34 E-value=1.1e+02 Score=22.01 Aligned_cols=33 Identities=15% Similarity=0.194 Sum_probs=28.7
Q ss_pred ccCCHHHHhhHHHHHHHHHhcChHHHHHHHHhh
Q psy4018 79 KSCSEKQKEGSKKIFKYLIDNKPEQWAELEKKF 111 (136)
Q Consensus 79 ~KCT~kQK~~~~kvi~~l~~~~P~~w~~l~~Ky 111 (136)
..|++.|++.+-.+++...+++-+.++++.++=
T Consensus 97 ~~L~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~ 129 (135)
T PRK09706 97 DALPESEQDAQLSEMRARVENFNKLFEELLKAR 129 (135)
T ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 569999999999999999999988888887753
No 61
>PRK08557 hypothetical protein; Provisional
Probab=25.31 E-value=68 Score=28.31 Aligned_cols=32 Identities=22% Similarity=0.382 Sum_probs=24.9
Q ss_pred hcCccCCHHHHhhHHHHHHHHHhcChHHHHH---HHHhh
Q psy4018 76 TECKSCSEKQKEGSKKIFKYLIDNKPEQWAE---LEKKF 111 (136)
Q Consensus 76 t~C~KCT~kQK~~~~kvi~~l~~~~P~~w~~---l~~Ky 111 (136)
-.|.-|+-.+. .-+..+++.||+.|+. ++++|
T Consensus 349 iGC~~Cp~~~~----~e~~~l~~~~Pe~~~k~~~~l~~~ 383 (417)
T PRK08557 349 IGCYLCPSALN----SEFLRVKELYPELFNRWVKYLKKY 383 (417)
T ss_pred CCccCCCCccH----HHHHHHHHHCHHHHHHHHHHHHHh
Confidence 47999998765 3556779999999888 66666
No 62
>KOG3355|consensus
Probab=24.94 E-value=2.2e+02 Score=22.63 Aligned_cols=69 Identities=16% Similarity=0.225 Sum_probs=50.3
Q ss_pred CCCCChhHHHHHhh----hHHHHHhcCccCCHHHHhhHHHHHHHHHhcCh-----HHHHHHHHhhCCChhhHHHHHHH
Q psy4018 56 KGKCSPDGQELKNK----LPDAIATECKSCSEKQKEGSKKIFKYLIDNKP-----EQWAELEKKFDPNGTYKAKYQKE 124 (136)
Q Consensus 56 kgpCt~~gk~lK~~----lpeal~t~C~KCT~kQK~~~~kvi~~l~~~~P-----~~w~~l~~KyDP~~~y~~~~~~~ 124 (136)
+-+|.++-.+|=+. |=-...+-=.+=|+.||..++.+|.-+-+-|| ++..+++.+|-|+=.=++.|-.-
T Consensus 63 ~~~~~~~~eeLGRstWtllHT~aA~yP~~PT~eqk~~~~sFi~~~s~~yPC~eCa~dl~K~l~~nppq~~SRe~~~~W 140 (177)
T KOG3355|consen 63 RKGDPPDKEELGRSTWTLLHTLAANYPDRPTPEQKDDMRSFIHLFSKFYPCGECAKDLRKILRKNPPQTSSREAFTQW 140 (177)
T ss_pred hcCCCchHHHHhHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHhhhhcchHHHHHHHHHHHHhCCCCcchHHHHHHH
Confidence 44588776655443 33333344455599999999999999999998 66788888998888777777643
No 63
>PF05927 Penaeidin: Penaeidin; InterPro: IPR009226 This family consists of several isoforms of the penaeidin protein, which is specific to shrimps. Penaeidins, a unique family of antimicrobial peptides (AMPs) with both proline and cysteine-rich domains, were initially identified in the hemolymph of the Pacific white shrimp, Penaeus vannamei [].; GO: 0008061 chitin binding, 0005737 cytoplasm; PDB: 1UEO_A 1XV3_A.
Probab=24.94 E-value=27 Score=23.76 Aligned_cols=26 Identities=19% Similarity=0.133 Sum_probs=0.8
Q ss_pred HHHHHHHHHHHHHhhccCCCccCCcC
Q psy4018 6 SAAVGLFLAVQLVAGKPAGEKYTTKY 31 (136)
Q Consensus 6 MK~~i~l~~~~~~~~~~a~~~Y~~~~ 31 (136)
|++++.|..+++++.|-..+.|.+.|
T Consensus 1 MRLVVCLVFLaSFALVCQG~gykggy 26 (73)
T PF05927_consen 1 MRLVVCLVFLASFALVCQGQGYKGGY 26 (73)
T ss_dssp -----------------------S-S
T ss_pred CeEEEehHHHHHHHHhccCccccCCc
Confidence 66665544443444443334344433
No 64
>PRK10410 hypothetical protein; Provisional
Probab=24.73 E-value=70 Score=22.92 Aligned_cols=21 Identities=24% Similarity=0.756 Sum_probs=18.4
Q ss_pred HhcCccC-----CHHHHhhHHHHHHH
Q psy4018 75 ATECKSC-----SEKQKEGSKKIFKY 95 (136)
Q Consensus 75 ~t~C~KC-----T~kQK~~~~kvi~~ 95 (136)
++.|++| .|.+++.+++|++|
T Consensus 56 Kp~C~~C~ihCY~p~~r~ki~~VMr~ 81 (100)
T PRK10410 56 KPTCKKCPVHCYKPAKREKMKQVMRF 81 (100)
T ss_pred CCCCCCCCCcCCCHHHHHHHHHHHHH
Confidence 5689999 58999999999998
No 65
>PF10589 NADH_4Fe-4S: NADH-ubiquinone oxidoreductase-F iron-sulfur binding region; InterPro: IPR019575 NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This entry describes the F subunit of complexes that resemble NADH-quinone oxidoreductases. The electron acceptor is a quinone, ubiquinone, in mitochondria and most bacteria, including Escherichia coli, where the recommended gene symbol is nuoF. This family does not have any members in chloroplast or cyanobacteria, where the quinone may be plastoquinone and NADH may be replaced by NADPH, nor in Methanosarcina, where NADH is replaced by F420H2. This entry represents the iron-sulphur binding domain of the F subunit.; GO: 0055114 oxidation-reduction process; PDB: 3IAS_S 2FUG_A 3I9V_A 3M9S_1 3IAM_A 2YBB_1.
Probab=24.69 E-value=44 Score=20.34 Aligned_cols=12 Identities=17% Similarity=0.457 Sum_probs=5.3
Q ss_pred HHHhcCccCCHH
Q psy4018 73 AIATECKSCSEK 84 (136)
Q Consensus 73 al~t~C~KCT~k 84 (136)
.-...|+||+|=
T Consensus 11 ~~~ESCGkC~PC 22 (46)
T PF10589_consen 11 FAHESCGKCTPC 22 (46)
T ss_dssp HHHH--S--HHH
T ss_pred HHHcCCCCCCCc
Confidence 345799999883
No 66
>PF02364 Glucan_synthase: 1,3-beta-glucan synthase component ; InterPro: IPR003440 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This is the glycosyltransferase 48 family GT48 from CAZY, which consists of various 1,3-beta-glucan synthase components including Gls1, Gls2 and Gls3 from yeast. 1,3-beta-glucan synthase (2.4.1.34 from EC) also known as callose synthase catalyses the formation of a beta-1,3-glucan polymer that is a major component of the fungal cell wall []. The reaction catalysed is:- UDP-glucose + {1,3-beta-D-glucosyl}(N) = UDP + {1,3-beta-D-glucosyl}(N+1).; GO: 0003843 1,3-beta-D-glucan synthase activity, 0006075 1,3-beta-D-glucan biosynthetic process, 0000148 1,3-beta-D-glucan synthase complex, 0016020 membrane
Probab=24.46 E-value=57 Score=31.48 Aligned_cols=28 Identities=14% Similarity=0.432 Sum_probs=22.8
Q ss_pred HHHHHHHHHhcChHHHHHHHHhhCCChh
Q psy4018 89 SKKIFKYLIDNKPEQWAELEKKFDPNGT 116 (136)
Q Consensus 89 ~~kvi~~l~~~~P~~w~~l~~KyDP~~~ 116 (136)
--.++.||++-||++|+.+++.-+-.+.
T Consensus 63 ~vtlL~YLk~l~p~EW~nF~er~k~~~~ 90 (817)
T PF02364_consen 63 GVTLLEYLKQLYPDEWENFVERTKRLAE 90 (817)
T ss_pred CccHHHHHHHHChHHHHHHHHHhcccch
Confidence 3468999999999999999998755443
No 67
>PLN03190 aminophospholipid translocase; Provisional
Probab=24.39 E-value=83 Score=31.39 Aligned_cols=27 Identities=22% Similarity=0.268 Sum_probs=17.8
Q ss_pred hhhHHHHHh----cCccCCHHHHhhHHHHHH
Q psy4018 68 NKLPDAIAT----ECKSCSEKQKEGSKKIFK 94 (136)
Q Consensus 68 ~~lpeal~t----~C~KCT~kQK~~~~kvi~ 94 (136)
+.+.+.... -|.+|||.||..+-+.++
T Consensus 837 ~~f~~l~~~~~~VI~cR~sP~QKa~IV~~vk 867 (1178)
T PLN03190 837 EQLFQLASKCSVVLCCRVAPLQKAGIVALVK 867 (1178)
T ss_pred HHHHHHHHhCCEEEEecCCHHHHHHHHHHHH
Confidence 344444432 388999999977666554
No 68
>PF05823 Gp-FAR-1: Nematode fatty acid retinoid binding protein (Gp-FAR-1); InterPro: IPR008632 Parasitic nematodes produce at least two structurally novel classes of small helix-rich retinol- and fatty-acid-binding proteins that have no counterparts in their plant or animal hosts and thus represent potential targets for new nematicides. Gp-FAR-1 is a member of the nematode-specific fatty-acid- and retinol-binding (FAR) family of proteins but localises to the surface of the organism, placing it in a strategic position for interaction with the host. Gp-FAR-1 functions as a broad-spectrum retinol- and fatty-acid-binding protein, and it is thought that it is involved in the evasion of primary host plant defence systems [].; GO: 0008289 lipid binding; PDB: 2W9Y_A.
Probab=23.95 E-value=39 Score=25.67 Aligned_cols=47 Identities=21% Similarity=0.352 Sum_probs=27.8
Q ss_pred HHHHhhhHHHHHhcCccCCHHHHhhHHH-------------HHHHHHhcChHHHHHHHHh
Q psy4018 64 QELKNKLPDAIATECKSCSEKQKEGSKK-------------IFKYLIDNKPEQWAELEKK 110 (136)
Q Consensus 64 k~lK~~lpeal~t~C~KCT~kQK~~~~k-------------vi~~l~~~~P~~w~~l~~K 110 (136)
+.+|+.||.-+.+.=..=|+.||..++. ++..|+++.|.-+..+...
T Consensus 4 ~~~k~~iP~ev~~~~~~Lt~eeK~~lkev~~~~~~~~~~de~i~~LK~ksP~L~~k~~~l 63 (154)
T PF05823_consen 4 EEYKELIPSEVVEFYKNLTPEEKAELKEVAKNYAKFKNEDEMIAALKEKSPSLYEKAEKL 63 (154)
T ss_dssp HHHHTT--HHHHHHHHH--TTTHHHHHHHHTT-------TTHHHHHHHH-HHHHHHHHHH
T ss_pred HHHHHhCcHHHHHHHHcCCHHHHHHHHHHHHHccccCCHHHHHHHHHHhCHHHHHHHHHH
Confidence 4566777766666666666666666654 4566788888888775543
No 69
>PF00312 Ribosomal_S15: Ribosomal protein S15; InterPro: IPR000589 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein S15 is one of the proteins from the small ribosomal subunit. In Escherichia coli, this protein binds to 16S ribosomal RNA and functions at early steps in ribosome assembly. It belongs to a family of ribosomal proteins which, on the basis of sequence similarities [, ,], groups bacterial and plant chloroplast S15; archaeal Haloarcula marismortui HmaS15 (HS11); yeast mitochondrial S28; and mammalian, yeast, Brugia pahangi and Wuchereria bancrofti S13. S15 is a protein of 80 to 250 amino-acid residues.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 1YSH_E 3U5C_N 3O30_G 3IZB_O 3O2Z_G 3U5G_N 1S1H_O 3MR8_O 3D5C_O 3MS0_O ....
Probab=22.98 E-value=1.2e+02 Score=20.55 Aligned_cols=24 Identities=25% Similarity=0.432 Sum_probs=19.6
Q ss_pred hHHHHHHHHHhcChHHHHHHHHhh
Q psy4018 88 GSKKIFKYLIDNKPEQWAELEKKF 111 (136)
Q Consensus 88 ~~~kvi~~l~~~~P~~w~~l~~Ky 111 (136)
.-+++++||+.+.++.|..+.++.
T Consensus 57 kRrrlL~YLrr~~~~~Y~~~~~~L 80 (83)
T PF00312_consen 57 KRRRLLKYLRRKDFERYEWVLKKL 80 (83)
T ss_dssp HHHHHHHHHHHHTHHHHHHHHHHH
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHh
Confidence 457889999999999888887764
No 70
>PF15197 Leukemia_assc_2: Leukemia-associated protein 2
Probab=22.80 E-value=94 Score=19.75 Aligned_cols=32 Identities=22% Similarity=0.362 Sum_probs=26.2
Q ss_pred ccCCHHHHhhHHHHHHHHHhcChHHHHHHHHhh
Q psy4018 79 KSCSEKQKEGSKKIFKYLIDNKPEQWAELEKKF 111 (136)
Q Consensus 79 ~KCT~kQK~~~~kvi~~l~~~~P~~w~~l~~Ky 111 (136)
++||.+-|-.+.....- ..||-..|+.+.-|+
T Consensus 6 erctarnkmfvnsaftk-vdnyct~~kkfy~k~ 37 (53)
T PF15197_consen 6 ERCTARNKMFVNSAFTK-VDNYCTFWKKFYFKC 37 (53)
T ss_pred HHhhhhhhHHHhhHHHH-HhHHHHHHHHHHHHH
Confidence 57999999998887766 788889998876655
No 71
>PF10441 Urb2: Urb2/Npa2 family; InterPro: IPR018849 This entry represents a conserved domain found towards the C terminus of proteins involved in ribosome biogenesis, such as the Urb2 protein from yeast [].
Probab=22.80 E-value=1.6e+02 Score=23.05 Aligned_cols=67 Identities=18% Similarity=0.209 Sum_probs=48.9
Q ss_pred HHHHHHhhhhcCCCCCChhHHHHHhhhHHHHHhcCccCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhCCChhh
Q psy4018 44 RLFNNYYNCLLDKGKCSPDGQELKNKLPDAIATECKSCSEKQKEGSKKIFKYLIDNKPEQWAELEKKFDPNGTY 117 (136)
Q Consensus 44 rl~~~y~~ClLdkgpCt~~gk~lK~~lpeal~t~C~KCT~kQK~~~~kvi~~l~~~~P~~w~~l~~KyDP~~~y 117 (136)
-++.+|+.+.++-+ ...+.|+.|-..+=.-|.-|++...+.+...++.=- ...|+.|.+-|.=-|+|
T Consensus 155 ~lL~~Yi~~~~~~~----l~~~vr~~L~pGiy~l~d~~s~~e~~~l~a~Ld~~g---r~~fk~Ly~dY~k~~K~ 221 (223)
T PF10441_consen 155 YLLANYISLQLKYT----LPPEVREALMPGIYALFDVCSQHELQQLNASLDASG---RAVFKSLYKDYKKFGKW 221 (223)
T ss_pred HHHHHHHHHHhcCC----CCHHHHHHHHHHHHHHHHhCCHHHHHHHHHHcChHH---HHHHHHHHHHHHHhCCC
Confidence 46788999999887 778889999999999999999888776665554322 45566666655544444
No 72
>cd05030 calgranulins Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12 . Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes, and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggested that calgranulins may exist in multiple structural forms, homodimers, as well as hetero-oligomers. For example, the S100A8/S100A9 complex called calprotectin plays important roles in the regulation of inflammatory processes, wound repair, and regulating zinc-dependent enzymes as well as microbial growth.
Probab=21.96 E-value=1.5e+02 Score=19.83 Aligned_cols=25 Identities=12% Similarity=0.197 Sum_probs=19.1
Q ss_pred hHHHHHHHHhhCCChhhHHHHHHHH
Q psy4018 101 PEQWAELEKKFDPNGTYKAKYQKEL 125 (136)
Q Consensus 101 P~~w~~l~~KyDP~~~y~~~~~~~l 125 (136)
+++|+.+...+|+++.=.=.|++|+
T Consensus 50 ~~~v~~i~~~~D~d~dG~I~f~eF~ 74 (88)
T cd05030 50 QKAIDKIFEDLDTNQDGQLSFEEFL 74 (88)
T ss_pred HHHHHHHHHHcCCCCCCcCcHHHHH
Confidence 6889999999999886555555554
No 73
>PF06286 Coleoptericin: Coleoptericin; InterPro: IPR009382 This family consists of several insect coleoptericin, acaloleptin, holotricin and rhinocerosin proteins which are all known to be antibacterial proteins []. These all appear to be short, glycine-rich molecules, inducible by infection.; GO: 0042742 defense response to bacterium, 0005576 extracellular region
Probab=21.75 E-value=26 Score=26.61 Aligned_cols=10 Identities=10% Similarity=-0.174 Sum_probs=6.8
Q ss_pred hHHHHHHHHH
Q psy4018 5 CSAAVGLFLA 14 (136)
Q Consensus 5 ~MK~~i~l~~ 14 (136)
|||+.|++.+
T Consensus 1 mmkl~i~~~l 10 (143)
T PF06286_consen 1 MMKLYIIFGL 10 (143)
T ss_pred CceEeeehhH
Confidence 5888766554
No 74
>PRK09993 C-lysozyme inhibitor; Provisional
Probab=21.63 E-value=1.9e+02 Score=22.44 Aligned_cols=20 Identities=5% Similarity=0.220 Sum_probs=16.2
Q ss_pred CCHHHHhcCHHHHHHHhhhh
Q psy4018 34 VDLDQILKNDRLFNNYYNCL 53 (136)
Q Consensus 34 iDid~iL~N~rl~~~y~~Cl 53 (136)
.-+.+++.++.++..+-+=+
T Consensus 28 ~t~seLl~~p~yk~aw~~mv 47 (153)
T PRK09993 28 LTISSLAKGETTKAAFNQMV 47 (153)
T ss_pred ccHHHHhcCHHHHHHHHHHH
Confidence 67889999999998887643
No 75
>KOG4730|consensus
Probab=21.09 E-value=1.4e+02 Score=27.42 Aligned_cols=36 Identities=11% Similarity=0.469 Sum_probs=26.4
Q ss_pred HHHHHHHHHhcChHHHHHHHHhhCCChhhHHHHHHHHh
Q psy4018 89 SKKIFKYLIDNKPEQWAELEKKFDPNGTYKAKYQKELS 126 (136)
Q Consensus 89 ~~kvi~~l~~~~P~~w~~l~~KyDP~~~y~~~~~~~l~ 126 (136)
.+-+++++ ..-+.|.++++.+||+|.+....-+.+-
T Consensus 477 fe~vi~ky--~n~~~flkvr~~lDP~~lFssew~d~il 512 (518)
T KOG4730|consen 477 FEGVIRKY--KNLDKFLKVRKELDPKGLFSSEWTDQIL 512 (518)
T ss_pred HHHHHHhc--cChHHHHHHHHhcCccchhhhhhHHHHh
Confidence 44455555 4446799999999999999877766553
No 76
>cd08809 CARD_CARD9 Caspase activation and recruitment domain of CARD9-like proteins. Caspase activation and recruitment domain (CARD) similar to that found in CARD9. CARD9 is a central regulator of innate immunity and is highly expressed in dendritic cells and macrophages. Together with BCL10 (B-cell lymphoma 10) and Malt1 (mucosa-associated lymphoid tissue-lymphoma-translocation gene 1), it forms the M-CBM signalosome (the CBM complex in myeloid immune cells), which mediates activation of NF-kB and MAPK by ITAM-coupled receptors expressed on immune cells. CARD9 associates with BCL10 via a CARD-CARD interaction. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating wi
Probab=20.98 E-value=2.3e+02 Score=19.99 Aligned_cols=50 Identities=18% Similarity=0.290 Sum_probs=33.0
Q ss_pred cCccCCHHHHhcCHHH------HHHHhhhhcCCCCCChhHHHHHhhhHHHHHhcCccCCHHHHhhHHHHHHHHHhcChHH
Q psy4018 30 KYDNVDLDQILKNDRL------FNNYYNCLLDKGKCSPDGQELKNKLPDAIATECKSCSEKQKEGSKKIFKYLIDNKPEQ 103 (136)
Q Consensus 30 ~~D~iDid~iL~N~rl------~~~y~~ClLdkgpCt~~gk~lK~~lpeal~t~C~KCT~kQK~~~~kvi~~l~~~~P~~ 103 (136)
-++.-|.++|++.+++ ...-+|=+-.+|| +++..|+..|...||+-
T Consensus 30 Vl~~~deEeI~~~~~~~~~~~kvg~LLDIL~~rG~----------------------------~~f~aFLeSLE~~yP~l 81 (86)
T cd08809 30 VLNSDDEEQVLNDPSLVIRKRKVGVLLDILQRTGL----------------------------KGYEAFLESLELYYPQL 81 (86)
T ss_pred CCChhhHHHHHcCcccchHHHHHHHHHHHHHhcCc----------------------------hHHHHHHHHHHHHHHHH
Confidence 3567899999999554 3333444444453 56777788888888877
Q ss_pred HHHH
Q psy4018 104 WAEL 107 (136)
Q Consensus 104 w~~l 107 (136)
++.+
T Consensus 82 ~~~~ 85 (86)
T cd08809 82 YKKI 85 (86)
T ss_pred Hhhc
Confidence 7653
No 77
>PF15393 DUF4615: Domain of unknown function (DUF4615)
Probab=20.57 E-value=1.8e+02 Score=21.71 Aligned_cols=48 Identities=25% Similarity=0.238 Sum_probs=30.0
Q ss_pred cCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhCCChhhHHHHHHHHhh
Q psy4018 80 SCSEKQKEGSKKIFKYLIDNKPEQWAELEKKFDPNGTYKAKYQKELSD 127 (136)
Q Consensus 80 KCT~kQK~~~~kvi~~l~~~~P~~w~~l~~KyDP~~~y~~~~~~~l~~ 127 (136)
|=|++|.+-+.++|+-|+.+.-..=+.=.-----=|.|+++.+++..+
T Consensus 22 k~t~kq~ee~~k~i~~L~S~k~plvkKRqlMr~~fGdYR~kM~~e~~k 69 (124)
T PF15393_consen 22 KLTPKQIEEASKAIRTLKSNKAPLVKKRQLMRSTFGDYRAKMKEEERK 69 (124)
T ss_pred CCCHHHHHHHHHHHHHhhCCCCccHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 338999999999999998776433322111112225588887766544
No 78
>PRK12423 LexA repressor; Provisional
Probab=20.13 E-value=3.2e+02 Score=21.06 Aligned_cols=31 Identities=19% Similarity=0.248 Sum_probs=26.9
Q ss_pred CHHHHhhHHHHHHHHHhc-ChHHHHHHHHhhC
Q psy4018 82 SEKQKEGSKKIFKYLIDN-KPEQWAELEKKFD 112 (136)
Q Consensus 82 T~kQK~~~~kvi~~l~~~-~P~~w~~l~~KyD 112 (136)
|++|++....+.+++.+. +|..-.+|.+.++
T Consensus 5 t~~q~~il~~l~~~i~~~g~~Ps~~eia~~~g 36 (202)
T PRK12423 5 TPKRAAILAFIRERIAQAGQPPSLAEIAQAFG 36 (202)
T ss_pred CHHHHHHHHHHHHHHHHcCCCCCHHHHHHHhC
Confidence 899999999999987775 4778899999998
No 79
>PF03489 SapB_2: Saposin-like type B, region 2; InterPro: IPR008138 Saposins are small lysosomal proteins that serve as activators of various lysosomal lipid-degrading enzymes []. They probably act by isolating the lipid substrate from the membrane surroundings, thus making it more accessible to the soluble degradative enzymes. All mammalian saposins are synthesized as a single precursor molecule (prosaposin) which contains four Saposin-B domains, yielding the active saposins after proteolytic cleavage, and two Saposin-A domains that are removed in the activation reaction. The Saposin-B domains also occur in other proteins, many of them active in the lysis of membranes [, ].; PDB: 3BQQ_A 2RB3_B 2R0R_A 3BQP_A 2R1Q_A 1NKL_A 1L9L_A 1QDM_C 3RFI_A 4DDJ_A ....
Probab=20.13 E-value=34 Score=19.03 Aligned_cols=22 Identities=27% Similarity=0.474 Sum_probs=17.3
Q ss_pred HHHHhcChHHHHHHHHhhCCCh
Q psy4018 94 KYLIDNKPEQWAELEKKFDPNG 115 (136)
Q Consensus 94 ~~l~~~~P~~w~~l~~KyDP~~ 115 (136)
+++.+..|..|+.|.+..||+.
T Consensus 6 ~~V~~y~~~ii~~l~~~~~p~~ 27 (35)
T PF03489_consen 6 NFVDQYGPQIIQLLEKQLDPQQ 27 (35)
T ss_dssp HHHHHHHHHHHHHHHTTSTHHH
T ss_pred HHHHHHHHHHHHHHHhcCChHH
Confidence 4556667999999999988875
Done!