Query psy13420
Match_columns 169
No_of_seqs 209 out of 691
Neff 5.4
Searched_HMMs 46136
Date Fri Aug 16 19:00:39 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13420.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13420hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03392 OS-D: Insect pheromon 100.0 7.9E-47 1.7E-51 278.6 9.4 95 68-162 1-95 (95)
2 PF03392 OS-D: Insect pheromon 99.5 2E-14 4.3E-19 106.1 4.0 51 2-52 7-68 (95)
3 PF04030 ALO: D-arabinono-1,4- 85.2 0.94 2E-05 37.8 3.3 31 135-165 226-258 (259)
4 TIGR02766 crypt_chrom_pln cryp 74.7 1.4 3.1E-05 40.2 1.1 20 149-168 409-428 (475)
5 PF01805 Surp: Surp module; I 73.8 13 0.00029 24.1 5.3 27 126-152 1-27 (55)
6 PRK10674 deoxyribodipyrimidine 72.6 1.8 3.9E-05 39.8 1.2 22 148-169 405-426 (472)
7 TIGR03556 photolyase_8HDF deox 72.2 2 4.3E-05 39.5 1.4 23 147-169 408-430 (471)
8 PF06530 Phage_antitermQ: Phag 70.3 5.7 0.00012 30.2 3.3 53 99-152 23-75 (125)
9 TIGR02765 crypto_DASH cryptoch 68.4 2.4 5.2E-05 38.0 1.1 17 147-163 410-426 (429)
10 PF08031 BBE: Berberine and be 68.4 3.8 8.3E-05 26.0 1.7 17 143-159 25-41 (47)
11 TIGR01679 bact_FAD_ox FAD-link 68.1 4.5 9.9E-05 36.6 2.8 32 135-166 384-417 (419)
12 PHA02939 hypothetical protein; 66.1 52 0.0011 25.8 7.8 79 86-167 40-129 (144)
13 PF03441 FAD_binding_7: FAD bi 65.1 2.6 5.6E-05 36.0 0.6 21 148-168 207-227 (277)
14 TIGR01676 GLDHase galactonolac 64.6 7.5 0.00016 36.9 3.6 38 129-166 501-539 (541)
15 TIGR01677 pln_FAD_oxido plant- 64.1 5 0.00011 38.1 2.3 31 136-166 476-508 (557)
16 PF11919 DUF3437: Domain of un 61.6 3.9 8.6E-05 29.8 0.9 46 110-155 27-74 (90)
17 PLN02465 L-galactono-1,4-lacto 61.5 5.4 0.00012 38.1 2.0 42 125-166 527-569 (573)
18 PF04818 CTD_bind: RNA polymer 59.4 24 0.00052 23.2 4.4 38 102-139 22-59 (64)
19 PLN00107 FAD-dependent oxidore 59.1 6.8 0.00015 34.0 2.1 33 134-166 168-202 (257)
20 COG0415 PhrB Deoxyribodipyrimi 51.3 7.8 0.00017 36.2 1.3 21 149-169 399-419 (461)
21 PF12066 DUF3546: Domain of un 43.0 17 0.00036 27.2 1.7 24 130-155 31-54 (110)
22 PF08910 Aida_N: Aida N-termin 42.4 61 0.0013 24.6 4.7 36 103-138 27-63 (106)
23 PF13956 Ibs_toxin: Toxin Ibs, 41.3 7.9 0.00017 20.6 -0.2 11 46-56 1-11 (19)
24 PF03564 DUF1759: Protein of u 41.0 44 0.00095 25.0 3.8 39 130-168 42-83 (145)
25 smart00582 RPR domain present 40.8 44 0.00096 24.3 3.7 37 103-139 71-107 (121)
26 PF14658 EF-hand_9: EF-hand do 35.6 81 0.0018 21.9 4.1 37 130-166 20-62 (66)
27 PF06286 Coleoptericin: Coleop 35.5 9.5 0.00021 30.0 -0.6 24 46-69 1-25 (143)
28 PF14060 DUF4252: Domain of un 33.6 62 0.0013 24.6 3.7 48 102-153 59-106 (155)
29 KOG0133|consensus 32.2 30 0.00066 32.9 2.0 23 146-168 422-444 (531)
30 smart00648 SWAP Suppressor-of- 31.3 1.4E+02 0.0031 19.0 4.7 25 128-152 2-26 (54)
31 PF12399 BCA_ABC_TP_C: Branche 30.8 35 0.00076 18.9 1.3 16 5-20 8-23 (23)
32 COG3636 Predicted transcriptio 30.1 40 0.00087 25.3 2.0 31 69-99 5-35 (100)
33 TIGR01678 FAD_lactone_ox sugar 28.3 35 0.00077 31.2 1.7 24 136-159 411-436 (438)
34 PF04959 ARS2: Arsenite-resist 27.3 41 0.00089 28.3 1.8 41 119-164 80-120 (214)
35 PF09494 Slx4: Slx4 endonuclea 26.4 72 0.0016 21.4 2.6 20 134-154 5-24 (64)
36 PF06576 DUF1133: Protein of u 24.5 1.4E+02 0.0031 24.5 4.4 48 120-167 94-141 (176)
37 PF08139 LPAM_1: Prokaryotic m 24.1 1.1E+02 0.0024 17.4 2.6 13 45-57 5-17 (25)
38 TIGR00952 S15_bact ribosomal p 24.1 1.1E+02 0.0023 22.2 3.2 24 130-153 60-83 (86)
39 CHL00027 rps15 ribosomal prote 24.0 1.1E+02 0.0023 22.5 3.2 24 130-153 60-83 (90)
40 PRK05626 rpsO 30S ribosomal pr 23.5 1.1E+02 0.0024 22.3 3.2 23 131-153 64-86 (89)
41 PF14853 Fis1_TPR_C: Fis1 C-te 23.4 1.1E+02 0.0023 20.0 2.9 35 83-117 16-50 (53)
42 PF03790 KNOX1: KNOX1 domain ; 22.6 47 0.001 21.5 1.0 13 84-96 14-26 (45)
43 PF14194 Cys_rich_VLP: Cystein 22.6 83 0.0018 21.3 2.2 19 123-141 1-19 (56)
44 KOG3355|consensus 22.4 1.9E+02 0.0042 23.8 4.8 68 97-164 63-139 (177)
45 PRK08557 hypothetical protein; 21.6 86 0.0019 28.8 2.9 32 117-152 349-383 (417)
46 PF15393 DUF4615: Domain of un 21.3 1.2E+02 0.0026 23.6 3.2 44 121-167 22-68 (124)
47 PRK10410 hypothetical protein; 21.0 95 0.0021 23.1 2.5 22 115-136 55-81 (100)
48 COG4075 Uncharacterized conser 20.9 22 0.00047 27.0 -0.9 33 128-160 14-59 (110)
49 PF14072 DndB: DNA-sulfur modi 20.5 1.5E+02 0.0032 25.7 4.0 59 104-163 207-267 (346)
50 COG5510 Predicted small secret 20.2 88 0.0019 20.2 1.9 7 46-52 2-8 (44)
51 PRK09993 C-lysozyme inhibitor; 20.1 2.5E+02 0.0053 22.7 4.8 41 45-92 5-45 (153)
52 KOG4623|consensus 20.1 84 0.0018 30.2 2.5 42 117-165 102-143 (611)
No 1
>PF03392 OS-D: Insect pheromone-binding family, A10/OS-D; InterPro: IPR005055 A class of small (14-20 Kd) water-soluble proteins, called odorant binding proteins (OBPs), first discovered in the insect sensillar lymph but also in the mucus of vertebrates, is postulated to mediate the solubilisation of hydrophobic odorant molecules, and thereby to facilitate their transport to the receptor neurons. The product of a gene expressed in the olfactory system of Drosophila melanogaster (Fruit fly), OS-D, shares features common to vertebrate odorant-binding proteins, but has a primary structure unlike odorant-binding proteins []. OS-D derivatives have subsequently been found in chemosensory organs of phylogenetically distinct insects, including cockroaches, phasmids and moths, suggesting that OS-D-like proteins seem to be conserved in the insect phylum.; PDB: 1KX9_A 1N8U_A 1KX8_A 1K19_A 1N8V_A 2GVS_A 2JNT_A.
Probab=100.00 E-value=7.9e-47 Score=278.59 Aligned_cols=95 Identities=57% Similarity=1.139 Sum_probs=90.5
Q ss_pred cCCcCccCCHHHHhcCHHHHHHHhhccccCCCCChhHHHHHHHHHHHHhhcCCCCCHHHHhhHHHHHHHHHhcChHHHHH
Q psy13420 68 YTTKYDNVDLDEIIANDRLFTNYYKCLMDTGACSPDGAELKKVLPDAIATECGGCSDKQKEGAKKIFKFLIEKKPEEWKA 147 (169)
Q Consensus 68 Y~~~yDniDid~iL~N~rl~~~y~~Clld~gpCt~~g~~lK~~lPeal~t~C~KCt~kQK~~~~kvi~~l~~~~P~~w~~ 147 (169)
|+++||||||++||+|+|++++|++||||+|||||+|++||++|||||+|+|+||||+||+++++|++||++++|++|++
T Consensus 1 Y~~~yD~iDvd~iL~N~rl~~~y~~ClldkgpCt~~~~~lK~~ipeal~t~C~KCt~kQK~~~~kv~~~l~~~~P~~w~~ 80 (95)
T PF03392_consen 1 YTTKYDNIDVDEILKNDRLLKSYIDCLLDKGPCTPEGKELKKVIPEALKTNCAKCTPKQKENARKVIKFLKKNYPDEWEE 80 (95)
T ss_dssp -HCTTTSSCHHHHHH-HHHHHHHHHHHTSSSTSHHHHHHHHHHHHHHHHCTTTTS-HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCccccccCHHHHHcCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHccccCCCHHHHHHHHHHHHHHHHcCHHHHHH
Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhhCCCchhHHHH
Q psy13420 148 LEGKYDPSGSYKAKY 162 (169)
Q Consensus 148 l~~KyDP~~~y~~~~ 162 (169)
|++||||+|+|+++|
T Consensus 81 l~~KyDp~~~y~~ky 95 (95)
T PF03392_consen 81 LVKKYDPEGKYRKKY 95 (95)
T ss_dssp HHHHHTTT-TTHHHH
T ss_pred HHHHHCCCcchhhcC
Confidence 999999999999998
No 2
>PF03392 OS-D: Insect pheromone-binding family, A10/OS-D; InterPro: IPR005055 A class of small (14-20 Kd) water-soluble proteins, called odorant binding proteins (OBPs), first discovered in the insect sensillar lymph but also in the mucus of vertebrates, is postulated to mediate the solubilisation of hydrophobic odorant molecules, and thereby to facilitate their transport to the receptor neurons. The product of a gene expressed in the olfactory system of Drosophila melanogaster (Fruit fly), OS-D, shares features common to vertebrate odorant-binding proteins, but has a primary structure unlike odorant-binding proteins []. OS-D derivatives have subsequently been found in chemosensory organs of phylogenetically distinct insects, including cockroaches, phasmids and moths, suggesting that OS-D-like proteins seem to be conserved in the insect phylum.; PDB: 1KX9_A 1N8U_A 1KX8_A 1K19_A 1N8V_A 2GVS_A 2JNT_A.
Probab=99.49 E-value=2e-14 Score=106.13 Aligned_cols=51 Identities=37% Similarity=0.719 Sum_probs=45.5
Q ss_pred CCCHHHHhcCHHhHHHHhccccCCCCCCcchHhhhhc-----------cCchhHHHHHHHHH
Q psy13420 2 RVCVDEIIANDRLFSNYYKCLMDTGACSPDGAELKMN-----------NSSNTIIMFKSLVL 52 (169)
Q Consensus 2 ~idv~~Il~n~rl~~~y~~C~l~~gpCt~e~~~lk~~-----------~~s~~~~~MK~~~~ 52 (169)
+|||++||+|+|++++|++|||++|||||+|++||++ ||+.|..+++.++-
T Consensus 7 ~iDvd~iL~N~rl~~~y~~ClldkgpCt~~~~~lK~~ipeal~t~C~KCt~kQK~~~~kv~~ 68 (95)
T PF03392_consen 7 NIDVDEILKNDRLLKSYIDCLLDKGPCTPEGKELKKVIPEALKTNCAKCTPKQKENARKVIK 68 (95)
T ss_dssp SSCHHHHHH-HHHHHHHHHHHTSSSTSHHHHHHHHHHHHHHHHCTTTTS-HHHHHHHHHHHH
T ss_pred ccCHHHHHcCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHccccCCCHHHHHHHHHHHH
Confidence 6999999999999999999999999999999999998 88888888877754
No 3
>PF04030 ALO: D-arabinono-1,4-lactone oxidase ; InterPro: IPR007173 This domain is specific to D-arabinono-1,4-lactone oxidase 1.1.3.37 from EC, which is involved in the final step of the D-erythroascorbic acid biosynthesis pathway [].; GO: 0003885 D-arabinono-1,4-lactone oxidase activity, 0055114 oxidation-reduction process, 0016020 membrane; PDB: 2VFU_A 2VFV_A 2VFT_A 2VFS_A 2VFR_A.
Probab=85.22 E-value=0.94 Score=37.75 Aligned_cols=31 Identities=26% Similarity=0.478 Sum_probs=21.2
Q ss_pred HHHHhcCh--HHHHHHHHhhCCCchhHHHHHHH
Q psy13420 135 KFLIEKKP--EEWKALEGKYDPSGSYKAKYDAQ 165 (169)
Q Consensus 135 ~~l~~~~P--~~w~~l~~KyDP~~~y~~~~~~~ 165 (169)
..+.+.|| ++|.++++++||+|.+...|.+.
T Consensus 226 ~~l~~~Yp~~~~F~~~r~~~DP~g~F~n~~~~r 258 (259)
T PF04030_consen 226 EQLRKLYPRLDDFLAVRKKLDPQGVFLNDYLRR 258 (259)
T ss_dssp HHHHHT-TTHHHHHHHHHHH-TT-TT--HHHH-
T ss_pred HHHHHHCcCHHHHHHHHHHhCCCCCCCCHhhhc
Confidence 33778888 89999999999999999988653
No 4
>TIGR02766 crypt_chrom_pln cryptochrome, plant family. At least five major families of cryptochomes and photolyases share FAD cofactor binding, sequence homology, and the ability to react to short wavelengths of visible light. Photolysases are responsible for light-dependent DNA repair by removal of two types of uv-induced DNA dimerizations. Cryptochromes have other functions, often regulatory and often largely unknown, which may include circadian clock entrainment and control of development. Members of this subfamily are known so far only in plants; they may show some photolyase activity in vitro but appear mostly to be regulatory proteins that respond to blue light.
Probab=74.73 E-value=1.4 Score=40.17 Aligned_cols=20 Identities=30% Similarity=0.614 Sum_probs=17.4
Q ss_pred HHhhCCCchhHHHHHHHHhh
Q psy13420 149 EGKYDPSGSYKAKYDAQLKA 168 (169)
Q Consensus 149 ~~KyDP~~~y~~~~~~~~~~ 168 (169)
.+||||+|+|.++|--+++.
T Consensus 409 ~~~~Dp~g~yir~wvPeL~~ 428 (475)
T TIGR02766 409 GYKFDPNGEYVRRWLPELAR 428 (475)
T ss_pred HhhcCCCcccHHHhChhhcc
Confidence 78999999999999877764
No 5
>PF01805 Surp: Surp module; InterPro: IPR000061 SWAP is derived from the Suppressor-of-White-APricot splicing regulator from Drosophila melanogaster. The domain is found in regulators responsible for pervasive, nonsex-specific alternative pre-mRNA splicing characteristics and has been found in splicing regulatory proteins []. These ancient, conserved SWAP proteins share a colinearly arrayed series of novel sequence motifs [].; GO: 0003723 RNA binding, 0006396 RNA processing; PDB: 2E5Z_A 2DT7_B 2DT6_A 1UG0_A 1X4P_A 2E60_A 1X4O_A 4DGW_B.
Probab=73.84 E-value=13 Score=24.08 Aligned_cols=27 Identities=22% Similarity=0.220 Sum_probs=25.6
Q ss_pred HHhhHHHHHHHHHhcChHHHHHHHHhh
Q psy13420 126 QKEGAKKIFKFLIEKKPEEWKALEGKY 152 (169)
Q Consensus 126 QK~~~~kvi~~l~~~~P~~w~~l~~Ky 152 (169)
|++.++++..++.++=|+-+..+.++-
T Consensus 1 ~~~~I~~tA~~Va~~G~~fE~~l~~~~ 27 (55)
T PF01805_consen 1 LREIIDKTAEFVAKNGPEFEEKLRERE 27 (55)
T ss_dssp HHHHHHHHHHHHHHCSHHHHHHHHHHT
T ss_pred ChHHHHHHHHHHHhcCHHHHHHHHHhc
Confidence 688999999999999999999999987
No 6
>PRK10674 deoxyribodipyrimidine photolyase; Provisional
Probab=72.61 E-value=1.8 Score=39.77 Aligned_cols=22 Identities=14% Similarity=0.472 Sum_probs=18.7
Q ss_pred HHHhhCCCchhHHHHHHHHhhC
Q psy13420 148 LEGKYDPSGSYKAKYDAQLKAL 169 (169)
Q Consensus 148 l~~KyDP~~~y~~~~~~~~~~~ 169 (169)
..+||||+|+|.++|.-+++.+
T Consensus 405 q~~k~Dp~g~yIr~w~PeL~~~ 426 (472)
T PRK10674 405 QGERFDRDGEFIRRWLPELRDV 426 (472)
T ss_pred HHHHhCCCCChHHHhChhhccC
Confidence 3789999999999998887653
No 7
>TIGR03556 photolyase_8HDF deoxyribodipyrimidine photo-lyase, 8-HDF type. This model describes a narrow clade of cyanobacterial deoxyribodipyrimidine photo-lyase. This group, in contrast to several closely related proteins, uses a chromophore that, in other lineages is modified further to become coenzyme F420. This chromophore is called 8-HDF in most articles on the DNA photolyase and FO in most literature on coenzyme F420.
Probab=72.16 E-value=2 Score=39.45 Aligned_cols=23 Identities=22% Similarity=0.573 Sum_probs=18.9
Q ss_pred HHHHhhCCCchhHHHHHHHHhhC
Q psy13420 147 ALEGKYDPSGSYKAKYDAQLKAL 169 (169)
Q Consensus 147 ~l~~KyDP~~~y~~~~~~~~~~~ 169 (169)
...+||||+|+|.++|.-+++.+
T Consensus 408 ~q~~k~Dp~G~yIr~w~PeL~~~ 430 (471)
T TIGR03556 408 SQAQKFDPEAEYIRRWLPELRSV 430 (471)
T ss_pred HHHHHhCCCCchHHHhCHhhccC
Confidence 34689999999999998887643
No 8
>PF06530 Phage_antitermQ: Phage antitermination protein Q; InterPro: IPR010534 This entry is represented by Bacteriophage 933W, GpQ. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several phage antitermination protein Q and related bacterial sequences. Phage 993W gene Q encodes a phage-specific positive regulator of late gene expression, thought, by analogy to the corresponding gene of phage lambda, to be a transcription antiterminator. GpQ positively regulates expression of the phage late gene operons. Bacterial host RNA polymerase modified by antitermination proteins transcribes through termination sites that otherwise prevent expression of the regulated genes [, ].; GO: 0003677 DNA binding, 0060567 negative regulation of transcription termination, DNA-dependent
Probab=70.29 E-value=5.7 Score=30.22 Aligned_cols=53 Identities=30% Similarity=0.387 Sum_probs=45.3
Q ss_pred CCChhHHHHHHHHHHHHhhcCCCCCHHHHhhHHHHHHHHHhcChHHHHHHHHhh
Q psy13420 99 ACSPDGAELKKVLPDAIATECGGCSDKQKEGAKKIFKFLIEKKPEEWKALEGKY 152 (169)
Q Consensus 99 pCt~~g~~lK~~lPeal~t~C~KCt~kQK~~~~kvi~~l~~~~P~~w~~l~~Ky 152 (169)
..+|....|+.++|..-.... -||+-.=-.+..++..|+++.|++++-|...|
T Consensus 23 ~~s~i~a~f~~~~p~~~~~r~-~csDDdgl~Id~~v~~L~~~~~~~~~ll~~~Y 75 (125)
T PF06530_consen 23 DYSPIAAGFKGVLPSSSKSRP-SCSDDDGLLIDRCVARLKKRDPEEYDLLILYY 75 (125)
T ss_pred CcccHHHHHhcCCCCCCCCCc-eechHHHHHHHHHHHHHHccCHHHHHHHHHHH
Confidence 367788888999997554444 49999999999999999999999999999887
No 9
>TIGR02765 crypto_DASH cryptochrome, DASH family. Photolyases and cryptochromes are related flavoproteins. Photolyases harness the energy of blue light to repair DNA damage by removing pyrimidine dimers. Cryptochromes do not repair DNA and are presumed to act instead in some other (possibly unknown) process such as entraining circadian rhythms. This model describes the cryptochrome DASH subfamily, one of at least five major subfamilies, which is found in plants, animals, marine bacteria, etc. Members of this family bind both folate and FAD. They may show weak photolyase activity in vitro but have not been shown to affect DNA repair in vivo. Rather, DASH family cryptochromes have been shown to bind RNA (Vibrio cholerae VC1814), or DNA, and seem likely to act in light-responsive regulatory processes.
Probab=68.43 E-value=2.4 Score=38.03 Aligned_cols=17 Identities=35% Similarity=0.681 Sum_probs=14.3
Q ss_pred HHHHhhCCCchhHHHHH
Q psy13420 147 ALEGKYDPSGSYKAKYD 163 (169)
Q Consensus 147 ~l~~KyDP~~~y~~~~~ 163 (169)
.-.+||||+|+|.++|-
T Consensus 410 ~q~~k~Dp~g~yir~wv 426 (429)
T TIGR02765 410 KQAQDYDPDGEYVATWV 426 (429)
T ss_pred HHHHhcCCCCCcHHHhc
Confidence 34689999999999984
No 10
>PF08031 BBE: Berberine and berberine like ; InterPro: IPR012951 This domain is found in the berberine bridge and berberine bridge-like enzymes, which are involved in the biosynthesis of numerous isoquinoline alkaloids. They catalyse the transformation of the N-methyl group of (S)-reticuline into the C-8 berberine bridge carbon of (S)-scoulerine [].; GO: 0016491 oxidoreductase activity, 0050660 flavin adenine dinucleotide binding, 0055114 oxidation-reduction process; PDB: 2IPI_A 2Y3S_A 2Y3R_D 2Y08_B 2Y4G_A 3D2H_A 3FW9_A 3FW8_A 3FWA_A 3D2J_A ....
Probab=68.38 E-value=3.8 Score=26.03 Aligned_cols=17 Identities=29% Similarity=0.814 Sum_probs=10.9
Q ss_pred HHHHHHHHhhCCCchhH
Q psy13420 143 EEWKALEGKYDPSGSYK 159 (169)
Q Consensus 143 ~~w~~l~~KyDP~~~y~ 159 (169)
+.-.+++++|||+|-+.
T Consensus 25 ~rL~~iK~~yDP~n~F~ 41 (47)
T PF08031_consen 25 DRLRAIKRKYDPDNVFR 41 (47)
T ss_dssp HHHHHHHHHH-TT-TS-
T ss_pred HHHHHHHHHhCccceeC
Confidence 34567889999999875
No 11
>TIGR01679 bact_FAD_ox FAD-linked oxidoreductase. This model represents a family of bacterial oxidoreductases with covalently linked FAD, closely related to two different eukaryotic oxidases, L-gulonolactone oxidase (EC 1.1.3.8) from rat and D-arabinono-1,4-lactone oxidase (EC 1.1.3.37) from Saccharomyces cerevisiae.
Probab=68.10 E-value=4.5 Score=36.56 Aligned_cols=32 Identities=25% Similarity=0.506 Sum_probs=25.3
Q ss_pred HHHHhcCh--HHHHHHHHhhCCCchhHHHHHHHH
Q psy13420 135 KFLIEKKP--EEWKALEGKYDPSGSYKAKYDAQL 166 (169)
Q Consensus 135 ~~l~~~~P--~~w~~l~~KyDP~~~y~~~~~~~~ 166 (169)
.-+.+.|| ++|.+++++|||+|.+...|.+.+
T Consensus 384 ~~l~~~YP~~~~F~~~r~~~DP~g~F~n~~~~rl 417 (419)
T TIGR01679 384 ATLRERYPRWDDFAAVRDDLDPDRRFLNPYTRGL 417 (419)
T ss_pred HHHHHHCcCHHHHHHHHHHhCCCCccCCHHHHHh
Confidence 34556677 578899999999999998887654
No 12
>PHA02939 hypothetical protein; Provisional
Probab=66.06 E-value=52 Score=25.77 Aligned_cols=79 Identities=23% Similarity=0.407 Sum_probs=58.9
Q ss_pred HHHHHhhccccCCCCChh---HHHHHHHHHHHH--------hhcCCCCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhCC
Q psy13420 86 LFTNYYKCLMDTGACSPD---GAELKKVLPDAI--------ATECGGCSDKQKEGAKKIFKFLIEKKPEEWKALEGKYDP 154 (169)
Q Consensus 86 l~~~y~~Clld~gpCt~~---g~~lK~~lPeal--------~t~C~KCt~kQK~~~~kvi~~l~~~~P~~w~~l~~KyDP 154 (169)
.+..|.-|- |+| |... ..-+++.+||.| +-.=+.-|+.||+.++.-+..|.+..|..+++|.. |=-
T Consensus 40 if~d~~~~~-de~-c~~~kwE~df~rD~ipe~iW~vRkaV~~l~DP~st~~qrEyae~~l~~lle~~~r~f~~Ll~-~ci 116 (144)
T PHA02939 40 IFSDFARCD-DEG-CPRSKWESDFLRDLIPEEIWYVRKAVKELYDPESTDDQREYAEHRLKGLLELLERNFNQLLE-FCI 116 (144)
T ss_pred HHHHHhhcc-ccc-cchHhhHHHHHHHHhhHHHHHHHHHHHHHhCCccchhHHHHHHhhHHHHHHHHHHHHHHHHH-HHh
Confidence 566677775 554 7544 334678788865 33457789999999999999999999999999986 555
Q ss_pred CchhHHHHHHHHh
Q psy13420 155 SGSYKAKYDAQLK 167 (169)
Q Consensus 155 ~~~y~~~~~~~~~ 167 (169)
+.+-+++|.+.++
T Consensus 117 ~ekRreKYLkLiE 129 (144)
T PHA02939 117 DEKRREKYLKLIE 129 (144)
T ss_pred hHHHHHHHHHHHH
Confidence 6677777765443
No 13
>PF03441 FAD_binding_7: FAD binding domain of DNA photolyase from Prosite.; InterPro: IPR005101 This entry represents a multi-helical domain composed of two all-alpha subdomains that is found as the C-terminal domain in cryptochrome proteins, as well as at the N-terminal of DNA photolyase where it acts as a FAD-binding domain (the N-terminal of DNA photolyase binds a light-harvesting cofactor). Photolyases and cryptochromes are related flavoproteins that bind FAD. Photolyases harness the energy of blue light to repair DNA damage by removing pyrimidine dimers. Cryptochromes (CRY1 and CRY2) are blue light photoreceptors that mediate blue light-induced gene expression [, ]. DNA photolyases are DNA repair enzymes that repair mismatched pyrimidine dimers induced by exposure to ultra-violet light. They bind to UV-damaged DNA containing pyrimidine dimers and, upon absorbing a near-UV photon (300 to 500 nm), they catalyse dimer splitting, breaking the cyclobutane ring joining the two pyrimidines of the dimer so as to split them into the constituent monomers; this process is called photoreactivation. DNA photolyases require two choromophore-cofactors for their activity. All monomers contain a reduced FAD moiety, and, in addition, either a reduced pterin or 8-hydroxy-5-diazaflavin as a second chromophore. Either chromophore may act as the primary photon acceptor, peak absorptions occurring in the blue region of the spectrum and in the UV-B region, at a wavelength around 290nm [, ].; GO: 0003913 DNA photolyase activity, 0006281 DNA repair; PDB: 3UMV_A 3ZXS_A 1DNP_A 2XRZ_B 2XRY_A 2VTB_A 2J4D_B 2IJG_X 3TVS_A 2E0I_D ....
Probab=65.06 E-value=2.6 Score=36.03 Aligned_cols=21 Identities=29% Similarity=0.631 Sum_probs=17.4
Q ss_pred HHHhhCCCchhHHHHHHHHhh
Q psy13420 148 LEGKYDPSGSYKAKYDAQLKA 168 (169)
Q Consensus 148 l~~KyDP~~~y~~~~~~~~~~ 168 (169)
...+|||+|+|.++|--+|+.
T Consensus 207 q~~~~Dp~g~~ir~w~PeL~~ 227 (277)
T PF03441_consen 207 QSKKFDPDGEYIRRWVPELAD 227 (277)
T ss_dssp HHHHHSTTSHHHHHHSGGGTT
T ss_pred HHHhhCcHHHHHHHHHHHHhc
Confidence 357999999999999876654
No 14
>TIGR01676 GLDHase galactonolactone dehydrogenase. This model represents L-Galactono-gamma-lactone dehydrogenase (EC 1.3.2.3). This enzyme catalyzes the final step in ascorbic acid biosynthesis in higher plants. This protein is homologous to ascorbic acid biosynthesis enzymes of other species: L-gulono-gamma-lactone oxidase in rat and L-galactono-gamma-lactone oxidase in yeast. All three covalently bind the cofactor FAD.
Probab=64.61 E-value=7.5 Score=36.91 Aligned_cols=38 Identities=16% Similarity=0.109 Sum_probs=27.9
Q ss_pred hHHHHHHHHHhcCh-HHHHHHHHhhCCCchhHHHHHHHH
Q psy13420 129 GAKKIFKFLIEKKP-EEWKALEGKYDPSGSYKAKYDAQL 166 (169)
Q Consensus 129 ~~~kvi~~l~~~~P-~~w~~l~~KyDP~~~y~~~~~~~~ 166 (169)
....+--.+.+.|| ++|.++++++||+|.+...|.+.+
T Consensus 501 ~l~~~yprl~~~YP~d~F~~~R~~lDP~g~F~N~yL~~l 539 (541)
T TIGR01676 501 ELAALQARLKKKFPVDASNKARKALDPNKILSNNKLEKL 539 (541)
T ss_pred HHHHHHHHHHhhCCHHHHHHHHHHhCCCCccccHHHHHh
Confidence 33444445677777 557778899999999999997654
No 15
>TIGR01677 pln_FAD_oxido plant-specific FAD-dependent oxidoreductase. This model represents an uncharacterized plant-specific family of FAD-dependent oxidoreductases. At least seven distinct members are found in Arabidopsis thaliana. The family shows considerable sequence similarity to three different enzymes of ascorbic acid biosynthesis: L-galactono-1,4-lactone dehydrogenase (EC 1.3.2.3) from higher plants, D-arabinono-1,4-lactone oxidase (EC 1.1.3.37 from Saccharomyces cerevisiae, and L-gulonolactone oxidase (EC 1.1.3.8) from mouse, as well as to a bacterial sorbitol oxidase. The class of compound acted on by members of this family is unknown.
Probab=64.11 E-value=5 Score=38.06 Aligned_cols=31 Identities=23% Similarity=0.634 Sum_probs=25.8
Q ss_pred HHHhcCh--HHHHHHHHhhCCCchhHHHHHHHH
Q psy13420 136 FLIEKKP--EEWKALEGKYDPSGSYKAKYDAQL 166 (169)
Q Consensus 136 ~l~~~~P--~~w~~l~~KyDP~~~y~~~~~~~~ 166 (169)
-|.+.|| ++|.++++++||+|.|...|.+.+
T Consensus 476 ~l~~~YP~~~dF~alR~~~DP~g~F~N~yl~~l 508 (557)
T TIGR01677 476 GVIRKYPNADKFLKVKDSYDPKGLFSSEWSDEI 508 (557)
T ss_pred HHHHhCCCHHHHHHHHHhcCCCCccCCHHHHHH
Confidence 4677788 677888999999999999998765
No 16
>PF11919 DUF3437: Domain of unknown function (DUF3437); InterPro: IPR021843 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is typically between 142 to 163 amino acids in length. ; PDB: 3L5Q_6 1VSY_5.
Probab=61.60 E-value=3.9 Score=29.81 Aligned_cols=46 Identities=15% Similarity=0.343 Sum_probs=31.7
Q ss_pred HHHHHHhhcCCCCCHHH--HhhHHHHHHHHHhcChHHHHHHHHhhCCC
Q psy13420 110 VLPDAIATECGGCSDKQ--KEGAKKIFKFLIEKKPEEWKALEGKYDPS 155 (169)
Q Consensus 110 ~lPeal~t~C~KCt~kQ--K~~~~kvi~~l~~~~P~~w~~l~~KyDP~ 155 (169)
.||++|..=|...++.| +..+++.+.-.++.|.|.|..-..+|.++
T Consensus 27 w~P~~l~~La~~~~~~~~I~~tvk~tl~eFkrtH~D~W~~~~~~Ft~~ 74 (90)
T PF11919_consen 27 WMPEVLEELARHANDPQPIRTTVKKTLSEFKRTHQDTWHEHKKKFTED 74 (90)
T ss_dssp HHHHHHHHHHTTSSS-SSHHHHTHHHHHHHHHHTSTTHHHHGGG--SS
T ss_pred cHHHHHHHHHHHhCCCchHHHHHHHHHHHHHHhCcccHHHHHHhCCHH
Confidence 56666665555555433 35677888888999999999999888776
No 17
>PLN02465 L-galactono-1,4-lactone dehydrogenase
Probab=61.54 E-value=5.4 Score=38.12 Aligned_cols=42 Identities=21% Similarity=0.338 Sum_probs=30.6
Q ss_pred HHHhhHHHHHHHHHhcCh-HHHHHHHHhhCCCchhHHHHHHHH
Q psy13420 125 KQKEGAKKIFKFLIEKKP-EEWKALEGKYDPSGSYKAKYDAQL 166 (169)
Q Consensus 125 kQK~~~~kvi~~l~~~~P-~~w~~l~~KyDP~~~y~~~~~~~~ 166 (169)
+-++.....-..|.+.|| ++|.++++.+||+|.+...|.+.+
T Consensus 527 ~~~~~~~~~~~~L~~~YP~d~F~~~R~~lDP~g~f~N~~L~~l 569 (573)
T PLN02465 527 KDKEELEALRERLRKRFPVDAFNKARKELDPKGILSNNLLEKL 569 (573)
T ss_pred cccchhhcCHHHHHhhCCHHHHHHHHHHhCCCCccCCHHHHHh
Confidence 344444556667888888 334567789999999999997765
No 18
>PF04818 CTD_bind: RNA polymerase II-binding domain.; InterPro: IPR006903 This entry represents a conserved region found in a number of uncharacterised eukaryotic proteins.; PDB: 2L0I_A 2KM4_A 3D9I_B 3D9N_B 3D9O_A 3D9P_B 3D9K_A 3D9M_A 3D9J_A 3D9L_A ....
Probab=59.45 E-value=24 Score=23.22 Aligned_cols=38 Identities=18% Similarity=0.306 Sum_probs=31.4
Q ss_pred hhHHHHHHHHHHHHhhcCCCCCHHHHhhHHHHHHHHHh
Q psy13420 102 PDGAELKKVLPDAIATECGGCSDKQKEGAKKIFKFLIE 139 (169)
Q Consensus 102 ~~g~~lK~~lPeal~t~C~KCt~kQK~~~~kvi~~l~~ 139 (169)
.-...|...||+++...+..-.++.++.+.++++.-.+
T Consensus 22 ~f~~~F~~~l~~~~~~~~~~~~~~~~~kv~rll~iW~~ 59 (64)
T PF04818_consen 22 EFAPAFSPVLPDAFAHAYKNVDPEVRKKVQRLLNIWEE 59 (64)
T ss_dssp CHHHHHHCCHHHHHHHHCCCS-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhhC
Confidence 45688999999999998887799999999999987443
No 19
>PLN00107 FAD-dependent oxidoreductase; Provisional
Probab=59.06 E-value=6.8 Score=33.95 Aligned_cols=33 Identities=21% Similarity=0.481 Sum_probs=26.8
Q ss_pred HHHHHhcCh--HHHHHHHHhhCCCchhHHHHHHHH
Q psy13420 134 FKFLIEKKP--EEWKALEGKYDPSGSYKAKYDAQL 166 (169)
Q Consensus 134 i~~l~~~~P--~~w~~l~~KyDP~~~y~~~~~~~~ 166 (169)
...+.+.|| ++|.++++++||+|++...|.+.+
T Consensus 168 ~~~l~~lYPr~~dFlavR~~lDP~G~F~N~yl~rl 202 (257)
T PLN00107 168 FDGAIAKYKKAGEFLKVKERLDPEGLFSSEWSDKI 202 (257)
T ss_pred HHHHHHHCcCHHHHHHHHHHhCCCCccCCHHHHHH
Confidence 455677777 578999999999999999987754
No 20
>COG0415 PhrB Deoxyribodipyrimidine photolyase [DNA replication, recombination, and repair]
Probab=51.34 E-value=7.8 Score=36.16 Aligned_cols=21 Identities=29% Similarity=0.727 Sum_probs=17.0
Q ss_pred HHhhCCCchhHHHHHHHHhhC
Q psy13420 149 EGKYDPSGSYKAKYDAQLKAL 169 (169)
Q Consensus 149 ~~KyDP~~~y~~~~~~~~~~~ 169 (169)
-+||||+|.|.++|--+++.+
T Consensus 399 ~~kfDp~g~fIr~wvPeL~~~ 419 (461)
T COG0415 399 AEKFDPDGEFIRRWVPELRNL 419 (461)
T ss_pred HhhcCCCcccHHhhCHHhhCC
Confidence 368999999999998877653
No 21
>PF12066 DUF3546: Domain of unknown function (DUF3546); InterPro: IPR021933 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is typically between 93 to 114 amino acids in length. This domain has two completely conserved Y residues that may be functionally important. ; PDB: 3AX1_A.
Probab=43.00 E-value=17 Score=27.17 Aligned_cols=24 Identities=38% Similarity=0.786 Sum_probs=18.6
Q ss_pred HHHHHHHHHhcChHHHHHHHHhhCCC
Q psy13420 130 AKKIFKFLIEKKPEEWKALEGKYDPS 155 (169)
Q Consensus 130 ~~kvi~~l~~~~P~~w~~l~~KyDP~ 155 (169)
.+.+-.|+..|+.++| |+.||+|+
T Consensus 31 ~~q~~~FF~~HK~eeW--FreKY~P~ 54 (110)
T PF12066_consen 31 RKQLRAFFEQHKDEEW--FREKYHPE 54 (110)
T ss_dssp HHHHHHHHHHHTTSHH--HHHHH-HH
T ss_pred HHHHHHHHHHhcCCHH--HHHhcCcH
Confidence 3445668889999999 78999998
No 22
>PF08910 Aida_N: Aida N-terminus; InterPro: IPR015006 This entry represents the axin interactor, dorsalization-associated protein family AIDA [].; PDB: 1UG7_A.
Probab=42.45 E-value=61 Score=24.60 Aligned_cols=36 Identities=19% Similarity=0.229 Sum_probs=26.2
Q ss_pred hHHHHHHHHHHHHhh-cCCCCCHHHHhhHHHHHHHHH
Q psy13420 103 DGAELKKVLPDAIAT-ECGGCSDKQKEGAKKIFKFLI 138 (169)
Q Consensus 103 ~g~~lK~~lPeal~t-~C~KCt~kQK~~~~kvi~~l~ 138 (169)
+++.|-+.|-.++.. ++..+|+.||+.+.||.--|.
T Consensus 27 eY~~La~~l~k~~~~~~~~~fte~qkk~i~Kia~cL~ 63 (106)
T PF08910_consen 27 EYQRLARQLKKEVQSHQDSDFTEDQKKTIGKIATCLE 63 (106)
T ss_dssp HHHHHHHHHHHHHT-SS--SS-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhcccCCCCCHHHHHHHHHHHHHHH
Confidence 456677778888887 999999999999999987653
No 23
>PF13956 Ibs_toxin: Toxin Ibs, type I toxin-antitoxin system
Probab=41.33 E-value=7.9 Score=20.63 Aligned_cols=11 Identities=27% Similarity=0.625 Sum_probs=6.8
Q ss_pred HHHHHHHHHHH
Q psy13420 46 MFKSLVLVACL 56 (169)
Q Consensus 46 ~MK~~~~l~~~ 56 (169)
|||.++++.++
T Consensus 1 MMk~vIIlvvL 11 (19)
T PF13956_consen 1 MMKLVIILVVL 11 (19)
T ss_pred CceehHHHHHH
Confidence 58887665443
No 24
>PF03564 DUF1759: Protein of unknown function (DUF1759); InterPro: IPR005312 This is a small family of proteins of unknown function.
Probab=41.00 E-value=44 Score=25.02 Aligned_cols=39 Identities=26% Similarity=0.389 Sum_probs=28.0
Q ss_pred HHHHHHHH---HhcChHHHHHHHHhhCCCchhHHHHHHHHhh
Q psy13420 130 AKKIFKFL---IEKKPEEWKALEGKYDPSGSYKAKYDAQLKA 168 (169)
Q Consensus 130 ~~kvi~~l---~~~~P~~w~~l~~KyDP~~~y~~~~~~~~~~ 168 (169)
+..+|.++ ..+|+..|+.|.++|+....-...+.+.+..
T Consensus 42 A~~~i~~~~~~~~~Y~~a~~~L~~~yg~~~~i~~~~~~~l~~ 83 (145)
T PF03564_consen 42 AKELIRGLPLSEENYEEAWELLEERYGNPRRIIQALLEELRN 83 (145)
T ss_pred HHHHHHcccccchhhHHHHHHHHHHhCCchHHHHHHHHHHhc
Confidence 45555666 4899999999999997766666666655543
No 25
>smart00582 RPR domain present in proteins, which are involved in regulation of nuclear pre-mRNA.
Probab=40.75 E-value=44 Score=24.29 Aligned_cols=37 Identities=27% Similarity=0.433 Sum_probs=31.0
Q ss_pred hHHHHHHHHHHHHhhcCCCCCHHHHhhHHHHHHHHHh
Q psy13420 103 DGAELKKVLPDAIATECGGCSDKQKEGAKKIFKFLIE 139 (169)
Q Consensus 103 ~g~~lK~~lPeal~t~C~KCt~kQK~~~~kvi~~l~~ 139 (169)
-...|...+++++......+.+..|+.+.++++.-.+
T Consensus 71 f~~~~~~~~~~~~~~~~~~~~~~~~~ki~kll~iW~~ 107 (121)
T smart00582 71 FGDELGPVFQDALRDVLGAANDETKKKIRRLLNIWEE 107 (121)
T ss_pred HHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHhc
Confidence 3567888999999999988999999999999876444
No 26
>PF14658 EF-hand_9: EF-hand domain
Probab=35.61 E-value=81 Score=21.85 Aligned_cols=37 Identities=22% Similarity=0.323 Sum_probs=24.7
Q ss_pred HHHHHHHHHhcChHHH--HHHHHhhCCCch----hHHHHHHHH
Q psy13420 130 AKKIFKFLIEKKPEEW--KALEGKYDPSGS----YKAKYDAQL 166 (169)
Q Consensus 130 ~~kvi~~l~~~~P~~w--~~l~~KyDP~~~----y~~~~~~~~ 166 (169)
+..+++++-.+-|+++ +.|.+-+||+|. +++-|..-+
T Consensus 20 l~~~Lra~~~~~p~e~~Lq~l~~elDP~g~~~~v~~d~F~~iM 62 (66)
T PF14658_consen 20 LITYLRAVTGRSPEESELQDLINELDPEGRDGSVNFDTFLAIM 62 (66)
T ss_pred HHHHHHHHcCCCCcHHHHHHHHHHhCCCCCCceEeHHHHHHHH
Confidence 3455666666688777 467777899999 555554433
No 27
>PF06286 Coleoptericin: Coleoptericin; InterPro: IPR009382 This family consists of several insect coleoptericin, acaloleptin, holotricin and rhinocerosin proteins which are all known to be antibacterial proteins []. These all appear to be short, glycine-rich molecules, inducible by infection.; GO: 0042742 defense response to bacterium, 0005576 extracellular region
Probab=35.51 E-value=9.5 Score=29.99 Aligned_cols=24 Identities=17% Similarity=0.275 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHH-HhhcCCccC
Q psy13420 46 MFKSLVLVACLACIG-SVLTSPAYT 69 (169)
Q Consensus 46 ~MK~~~~l~~~~~~~-~~~a~e~Y~ 69 (169)
|||+.+++.++++++ .+..++.|.
T Consensus 1 mmkl~i~~~lia~saay~vP~~yy~ 25 (143)
T PF06286_consen 1 MMKLYIIFGLIALSAAYAVPEEYYE 25 (143)
T ss_pred CceEeeehhHHHHHHhhcccHhhcC
Confidence 689988776665433 333443333
No 28
>PF14060 DUF4252: Domain of unknown function (DUF4252)
Probab=33.61 E-value=62 Score=24.58 Aligned_cols=48 Identities=25% Similarity=0.290 Sum_probs=28.6
Q ss_pred hhHHHHHHHHHHHHhhcCCCCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhC
Q psy13420 102 PDGAELKKVLPDAIATECGGCSDKQKEGAKKIFKFLIEKKPEEWKALEGKYD 153 (169)
Q Consensus 102 ~~g~~lK~~lPeal~t~C~KCt~kQK~~~~kvi~~l~~~~P~~w~~l~~KyD 153 (169)
++.+++-+.|-..=--.+.+|+ .++...+-++-+.+ . +.|++|...=|
T Consensus 59 ~~~~~~~~~i~~i~vl~~~~~~--~~~~~~~~~~~~~~-~-~~ye~l~~~~~ 106 (155)
T PF14060_consen 59 PEIKELLKKIDSIRVLSLEDCS--NKQEFRKEFNKLKK-L-SGYEELMSVKD 106 (155)
T ss_pred HHHHHHHHhCCceEEEEecCcc--hHHHHHHHHHHHHc-c-hhhHHHhhccC
Confidence 3444443344333334677788 55566666666666 4 59999987666
No 29
>KOG0133|consensus
Probab=32.16 E-value=30 Score=32.95 Aligned_cols=23 Identities=35% Similarity=0.577 Sum_probs=19.6
Q ss_pred HHHHHhhCCCchhHHHHHHHHhh
Q psy13420 146 KALEGKYDPSGSYKAKYDAQLKA 168 (169)
Q Consensus 146 ~~l~~KyDP~~~y~~~~~~~~~~ 168 (169)
..+.+||||+|.|.+.|...+..
T Consensus 422 ~~~~kk~dP~g~yir~~lp~l~~ 444 (531)
T KOG0133|consen 422 VALGKKLDPDGLYIRQWLPELRS 444 (531)
T ss_pred HHHhCcCCcchhhHHHHhHHHhc
Confidence 46788999999999999887764
No 30
>smart00648 SWAP Suppressor-of-White-APricot splicing regulator. domain present in regulators which are responsible for pre-mRNA splicing processes
Probab=31.34 E-value=1.4e+02 Score=18.96 Aligned_cols=25 Identities=20% Similarity=0.151 Sum_probs=22.3
Q ss_pred hhHHHHHHHHHhcChHHHHHHHHhh
Q psy13420 128 EGAKKIFKFLIEKKPEEWKALEGKY 152 (169)
Q Consensus 128 ~~~~kvi~~l~~~~P~~w~~l~~Ky 152 (169)
+.++++..++.++=|+-.+.|+++.
T Consensus 2 ~iI~~tA~~Va~~G~~fe~~l~~~~ 26 (54)
T smart00648 2 DIIDKTAQFVARNGPEFEAKLMERE 26 (54)
T ss_pred cHHHHHHHHHHHhhHHHHHHHHHhc
Confidence 5789999999999999999998776
No 31
>PF12399 BCA_ABC_TP_C: Branched-chain amino acid ATP-binding cassette transporter
Probab=30.83 E-value=35 Score=18.89 Aligned_cols=16 Identities=31% Similarity=0.360 Sum_probs=13.8
Q ss_pred HHHHhcCHHhHHHHhc
Q psy13420 5 VDEIIANDRLFSNYYK 20 (169)
Q Consensus 5 v~~Il~n~rl~~~y~~ 20 (169)
.++|.+|++.+..|+.
T Consensus 8 p~~i~~n~~V~~aYLG 23 (23)
T PF12399_consen 8 PEEIRANPEVREAYLG 23 (23)
T ss_pred HHHHhcCHHHHHhhCc
Confidence 5789999999999963
No 32
>COG3636 Predicted transcriptional regulator [Transcription]
Probab=30.15 E-value=40 Score=25.34 Aligned_cols=31 Identities=19% Similarity=0.519 Sum_probs=28.2
Q ss_pred CCcCccCCHHHHhcCHHHHHHHhhccccCCC
Q psy13420 69 TTKYDNVDLDEIIANDRLFTNYYKCLMDTGA 99 (169)
Q Consensus 69 ~~~yDniDid~iL~N~rl~~~y~~Clld~gp 99 (169)
+.+++++|+.+.|.+++....|++=.|++|.
T Consensus 5 ~~~~~~~d~ae~l~~ee~ia~yL~~~le~~d 35 (100)
T COG3636 5 TRLLDDFDVAELLTDEEAIAAYLNAALEEGD 35 (100)
T ss_pred hhhcccccHHHHhCCHHHHHHHHHHHHHcCC
Confidence 3467899999999999999999999999987
No 33
>TIGR01678 FAD_lactone_ox sugar 1,4-lactone oxidases. This model represents a family of at least two different sugar 1,4 lactone oxidases, both involved in synthesizing ascorbic acid or a derivative. These include L-gulonolactone oxidase (EC 1.1.3.8) from rat and D-arabinono-1,4-lactone oxidase (EC 1.1.3.37) from Saccharomyces cerevisiae. Members are proposed to have the cofactor FAD covalently bound at a site specified by Prosite motif PS00862; OX2_COVAL_FAD; 1.
Probab=28.32 E-value=35 Score=31.21 Aligned_cols=24 Identities=25% Similarity=0.450 Sum_probs=18.7
Q ss_pred HHHhcCh--HHHHHHHHhhCCCchhH
Q psy13420 136 FLIEKKP--EEWKALEGKYDPSGSYK 159 (169)
Q Consensus 136 ~l~~~~P--~~w~~l~~KyDP~~~y~ 159 (169)
-|.+-|| ++|.++++++||+|.+.
T Consensus 411 ~l~~~YP~~~~F~~vr~~~DP~g~F~ 436 (438)
T TIGR01678 411 DFEEMYPTLHKFCDIRKKLDPTGVFL 436 (438)
T ss_pred HHHHHCcCHHHHHHHHHhhCcccccC
Confidence 3445566 57899999999999874
No 34
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=27.26 E-value=41 Score=28.32 Aligned_cols=41 Identities=24% Similarity=0.440 Sum_probs=26.2
Q ss_pred CCCCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhCCCchhHHHHHH
Q psy13420 119 CGGCSDKQKEGAKKIFKFLIEKKPEEWKALEGKYDPSGSYKAKYDA 164 (169)
Q Consensus 119 C~KCt~kQK~~~~kvi~~l~~~~P~~w~~l~~KyDP~~~y~~~~~~ 164 (169)
|.-|+ |-=+...=|.+||..+|||..+++.+.. .|+..|..
T Consensus 80 C~lc~-KlFkg~eFV~KHI~nKH~e~ve~~~~ev----~~fnnY~~ 120 (214)
T PF04959_consen 80 CPLCG-KLFKGPEFVRKHIFNKHPEKVEEVKKEV----EYFNNYLL 120 (214)
T ss_dssp E-SSS--EESSHHHHHHHHHHH-HHHHHHHHHHH----HHHHHH--
T ss_pred CCCCC-cccCChHHHHHHHhhcCHHHHHHHHHHH----HHHHHHhc
Confidence 44444 2223566788999999999999998764 37666653
No 35
>PF09494 Slx4: Slx4 endonuclease; InterPro: IPR018574 The Slx4 protein is a heteromeric structure-specific endonuclease found in fungi. Slx4 with Slx1 acts as a nuclease on branched DNA substrates, particularly simple-Y, 5'-flap, or replication fork structures by cleaving the strand bearing the 5' non-homologous arm at the branch junction and thus generating ligatable nicked products from 5'-flap or replication fork substrates [].
Probab=26.38 E-value=72 Score=21.36 Aligned_cols=20 Identities=30% Similarity=0.630 Sum_probs=15.6
Q ss_pred HHHHHhcChHHHHHHHHhhCC
Q psy13420 134 FKFLIEKKPEEWKALEGKYDP 154 (169)
Q Consensus 134 i~~l~~~~P~~w~~l~~KyDP 154 (169)
+..+.+..|+.|+.+. .|+|
T Consensus 5 lt~~I~~~p~l~ekIL-~YeP 24 (64)
T PF09494_consen 5 LTKLIRSDPELYEKIL-MYEP 24 (64)
T ss_pred HHHHHHcCHHHHHHHH-cCCC
Confidence 4455569999999988 7888
No 36
>PF06576 DUF1133: Protein of unknown function (DUF1133); InterPro: IPR010557 This family consists of a number of hypothetical proteins from Escherichia coli O157:H7 and Salmonella typhi. The function of this family is unknown.
Probab=24.53 E-value=1.4e+02 Score=24.52 Aligned_cols=48 Identities=19% Similarity=0.302 Sum_probs=39.1
Q ss_pred CCCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhCCCchhHHHHHHHHh
Q psy13420 120 GGCSDKQKEGAKKIFKFLIEKKPEEWKALEGKYDPSGSYKAKYDAQLK 167 (169)
Q Consensus 120 ~KCt~kQK~~~~kvi~~l~~~~P~~w~~l~~KyDP~~~y~~~~~~~~~ 167 (169)
+-||+...-.+++||.-+-.++|-...-+++.|==-|++..+..+.+.
T Consensus 94 a~ctD~Eal~iDrVI~~vL~~~~gl~~Vl~qrY~~RgkSk~~MA~eL~ 141 (176)
T PF06576_consen 94 AFCTDDEALFIDRVIGEVLAEHPGLINVLRQRYCGRGKSKRKMAEELN 141 (176)
T ss_pred ceecchHHHHHHHHHHHHHHhCcchHHHHHHHHHcccccHHHHHHHHh
Confidence 567777777888888888888999999999999888888888776654
No 37
>PF08139 LPAM_1: Prokaryotic membrane lipoprotein lipid attachment site; InterPro: IPR012640 In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognises a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached [,]. This lipid attachment site is found in homologues of the VirB proteins of type IV secretion systems (T4SS). Conjugal transfer across the cell envelope of Gram-negative bacteria is mediated by a supramolecular structure termed mating pair formation (Mpf) complex. Collectively, secretion pathways ancestrally related to bacterial conjugation systems are now known as T4SS. T4SS are involved in the delivery of effector molecules to eukaryotic target cells; each of these systems exports distinct DNA or protein substrates to effect a myriad of changes in host cell physiology during infection [].
Probab=24.12 E-value=1.1e+02 Score=17.40 Aligned_cols=13 Identities=15% Similarity=0.184 Sum_probs=7.6
Q ss_pred HHHHHHHHHHHHH
Q psy13420 45 IMFKSLVLVACLA 57 (169)
Q Consensus 45 ~~MK~~~~l~~~~ 57 (169)
.|||-++++++++
T Consensus 5 ~mmKkil~~l~a~ 17 (25)
T PF08139_consen 5 SMMKKILFPLLAL 17 (25)
T ss_pred HHHHHHHHHHHHH
Confidence 4677776554433
No 38
>TIGR00952 S15_bact ribosomal protein S15, bacterial/organelle. This model is built to recognize specifically bacterial, chloroplast, and mitochondrial ribosomal protein S15. The homologous proteins of Archaea and Eukarya are designated S13.
Probab=24.10 E-value=1.1e+02 Score=22.21 Aligned_cols=24 Identities=17% Similarity=0.472 Sum_probs=20.3
Q ss_pred HHHHHHHHHhcChHHHHHHHHhhC
Q psy13420 130 AKKIFKFLIEKKPEEWKALEGKYD 153 (169)
Q Consensus 130 ~~kvi~~l~~~~P~~w~~l~~KyD 153 (169)
=+++++||.++.++.|..+.++.+
T Consensus 60 RkrlL~YL~~~d~~~Y~~li~~Lg 83 (86)
T TIGR00952 60 RRRLLKYLKRTDVERYRSLIKRLG 83 (86)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHhC
Confidence 367899999999999999987753
No 39
>CHL00027 rps15 ribosomal protein S15
Probab=24.02 E-value=1.1e+02 Score=22.54 Aligned_cols=24 Identities=17% Similarity=0.394 Sum_probs=21.0
Q ss_pred HHHHHHHHHhcChHHHHHHHHhhC
Q psy13420 130 AKKIFKFLIEKKPEEWKALEGKYD 153 (169)
Q Consensus 130 ~~kvi~~l~~~~P~~w~~l~~KyD 153 (169)
=+++++||.++.++.|..+.++.+
T Consensus 60 RkrLL~YL~r~d~~~Y~~li~~Lg 83 (90)
T CHL00027 60 RQRLLAYLSKKNRVRYKKLISQLG 83 (90)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHhC
Confidence 368899999999999999998764
No 40
>PRK05626 rpsO 30S ribosomal protein S15; Reviewed
Probab=23.46 E-value=1.1e+02 Score=22.27 Aligned_cols=23 Identities=26% Similarity=0.548 Sum_probs=19.8
Q ss_pred HHHHHHHHhcChHHHHHHHHhhC
Q psy13420 131 KKIFKFLIEKKPEEWKALEGKYD 153 (169)
Q Consensus 131 ~kvi~~l~~~~P~~w~~l~~KyD 153 (169)
+++++||.++.++.|..+.++.+
T Consensus 64 krlL~YL~~~d~~~Y~~li~~Lg 86 (89)
T PRK05626 64 RKLLDYLKKKDVERYRALIERLG 86 (89)
T ss_pred HHHHHHHHhcCHHHHHHHHHHhC
Confidence 58899999999999999987753
No 41
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=23.36 E-value=1.1e+02 Score=20.01 Aligned_cols=35 Identities=17% Similarity=0.285 Sum_probs=28.3
Q ss_pred CHHHHHHHhhccccCCCCChhHHHHHHHHHHHHhh
Q psy13420 83 NDRLFTNYYKCLMDTGACSPDGAELKKVLPDAIAT 117 (169)
Q Consensus 83 N~rl~~~y~~Clld~gpCt~~g~~lK~~lPeal~t 117 (169)
|=.--.+|++-+|...|=.+.++.|+..|-+.++.
T Consensus 16 ~Y~~A~~~~~~lL~~eP~N~Qa~~L~~~i~~~i~k 50 (53)
T PF14853_consen 16 EYEKARRYCDALLEIEPDNRQAQSLKELIEDKIQK 50 (53)
T ss_dssp -HHHHHHHHHHHHHHTTS-HHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHHhc
Confidence 33445689999999999999999999999988864
No 42
>PF03790 KNOX1: KNOX1 domain ; InterPro: IPR005540 The MEINOX region is comprised of two domains, KNOX1 and KNOX2. KNOX1 plays a role in suppressing target gene expression. KNOX2, essential for function, is thought to be necessary for homo-dimerization [].; GO: 0003677 DNA binding, 0005634 nucleus
Probab=22.63 E-value=47 Score=21.48 Aligned_cols=13 Identities=31% Similarity=0.603 Sum_probs=11.1
Q ss_pred HHHHHHHhhcccc
Q psy13420 84 DRLFTNYYKCLMD 96 (169)
Q Consensus 84 ~rl~~~y~~Clld 96 (169)
++|+..|++|.==
T Consensus 14 ~~Ll~Ayi~C~KV 26 (45)
T PF03790_consen 14 PRLLAAYIDCQKV 26 (45)
T ss_pred HHHHHHHHHHHhc
Confidence 6899999999854
No 43
>PF14194 Cys_rich_VLP: Cysteine-rich VLP
Probab=22.59 E-value=83 Score=21.30 Aligned_cols=19 Identities=11% Similarity=0.135 Sum_probs=15.8
Q ss_pred CHHHHhhHHHHHHHHHhcC
Q psy13420 123 SDKQKEGAKKIFKFLIEKK 141 (169)
Q Consensus 123 t~kQK~~~~kvi~~l~~~~ 141 (169)
|++|++.++++++..--|+
T Consensus 1 T~~q~r~~~~LV~~~C~Ny 19 (56)
T PF14194_consen 1 TPRQRRRIRKLVRRECCNY 19 (56)
T ss_pred CHHHHHHHHHHHHHHcccC
Confidence 7899999999998776654
No 44
>KOG3355|consensus
Probab=22.37 E-value=1.9e+02 Score=23.80 Aligned_cols=68 Identities=16% Similarity=0.217 Sum_probs=50.3
Q ss_pred CCCCChhHHHHHHHHHHHHhh----cCCCCCHHHHhhHHHHHHHHHhcCh-----HHHHHHHHhhCCCchhHHHHHH
Q psy13420 97 TGACSPDGAELKKVLPDAIAT----ECGGCSDKQKEGAKKIFKFLIEKKP-----EEWKALEGKYDPSGSYKAKYDA 164 (169)
Q Consensus 97 ~gpCt~~g~~lK~~lPeal~t----~C~KCt~kQK~~~~kvi~~l~~~~P-----~~w~~l~~KyDP~~~y~~~~~~ 164 (169)
+-+|.++-.+|=+.-=-.|-| -=..=|+.||+.++.+|.-+-+-|| ++.+.++.+|=|.-.-++.|-.
T Consensus 63 ~~~~~~~~eeLGRstWtllHT~aA~yP~~PT~eqk~~~~sFi~~~s~~yPC~eCa~dl~K~l~~nppq~~SRe~~~~ 139 (177)
T KOG3355|consen 63 RKGDPPDKEELGRSTWTLLHTLAANYPDRPTPEQKDDMRSFIHLFSKFYPCGECAKDLRKILRKNPPQTSSREAFTQ 139 (177)
T ss_pred hcCCCchHHHHhHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHhhhhcchHHHHHHHHHHHHhCCCCcchHHHHHH
Confidence 445888776666543333333 3445599999999999999999998 5668888889888877777653
No 45
>PRK08557 hypothetical protein; Provisional
Probab=21.65 E-value=86 Score=28.81 Aligned_cols=32 Identities=25% Similarity=0.318 Sum_probs=24.6
Q ss_pred hcCCCCCHHHHhhHHHHHHHHHhcChHHHHH---HHHhh
Q psy13420 117 TECGGCSDKQKEGAKKIFKFLIEKKPEEWKA---LEGKY 152 (169)
Q Consensus 117 t~C~KCt~kQK~~~~kvi~~l~~~~P~~w~~---l~~Ky 152 (169)
-+|.-|+-.+. .-+..+++.||+.|+. +.++|
T Consensus 349 iGC~~Cp~~~~----~e~~~l~~~~Pe~~~k~~~~l~~~ 383 (417)
T PRK08557 349 IGCYLCPSALN----SEFLRVKELYPELFNRWVKYLKKY 383 (417)
T ss_pred CCccCCCCccH----HHHHHHHHHCHHHHHHHHHHHHHh
Confidence 58999998755 3556779999999888 66666
No 46
>PF15393 DUF4615: Domain of unknown function (DUF4615)
Probab=21.26 E-value=1.2e+02 Score=23.55 Aligned_cols=44 Identities=25% Similarity=0.387 Sum_probs=30.2
Q ss_pred CCCHHHHhhHHHHHHHHHhcChH---HHHHHHHhhCCCchhHHHHHHHHh
Q psy13420 121 GCSDKQKEGAKKIFKFLIEKKPE---EWKALEGKYDPSGSYKAKYDAQLK 167 (169)
Q Consensus 121 KCt~kQK~~~~kvi~~l~~~~P~---~w~~l~~KyDP~~~y~~~~~~~~~ 167 (169)
|=|++|.+-+.++|+-|+.+.-- -=+....-| |.|+++.+++..
T Consensus 22 k~t~kq~ee~~k~i~~L~S~k~plvkKRqlMr~~f---GdYR~kM~~e~~ 68 (124)
T PF15393_consen 22 KLTPKQIEEASKAIRTLKSNKAPLVKKRQLMRSTF---GDYRAKMKEEER 68 (124)
T ss_pred CCCHHHHHHHHHHHHHhhCCCCccHHHHHHHHHHH---HhHHHHHHHHHH
Confidence 33899999999999999876532 222333333 668888887654
No 47
>PRK10410 hypothetical protein; Provisional
Probab=21.04 E-value=95 Score=23.13 Aligned_cols=22 Identities=23% Similarity=0.634 Sum_probs=18.9
Q ss_pred HhhcCCCC-----CHHHHhhHHHHHHH
Q psy13420 115 IATECGGC-----SDKQKEGAKKIFKF 136 (169)
Q Consensus 115 l~t~C~KC-----t~kQK~~~~kvi~~ 136 (169)
-++.|++| .+.+++.+++|++|
T Consensus 55 ~Kp~C~~C~ihCY~p~~r~ki~~VMr~ 81 (100)
T PRK10410 55 EKPTCKKCPVHCYKPAKREKMKQVMRF 81 (100)
T ss_pred CCCCCCCCCCcCCCHHHHHHHHHHHHH
Confidence 35689999 58999999999998
No 48
>COG4075 Uncharacterized conserved protein, homolog of nitrogen regulatory protein PII [Function unknown]
Probab=20.94 E-value=22 Score=26.96 Aligned_cols=33 Identities=27% Similarity=0.599 Sum_probs=24.8
Q ss_pred hhHHHHHHHHHh-------------cChHHHHHHHHhhCCCchhHH
Q psy13420 128 EGAKKIFKFLIE-------------KKPEEWKALEGKYDPSGSYKA 160 (169)
Q Consensus 128 ~~~~kvi~~l~~-------------~~P~~w~~l~~KyDP~~~y~~ 160 (169)
+++.|+++.+.. --|+.|+-|..+-||+..|..
T Consensus 14 eNvGkaiN~mad~GiTGFfl~eYrGvsPd~wkgf~~~EDpE~aik~ 59 (110)
T COG4075 14 ENVGKAINIMADAGITGFFLHEYRGVSPDKWKGFSKEEDPESAIKA 59 (110)
T ss_pred HHHHHHHHHHHhcCcceEEEEEecCcChhHhcCcccccCHHHHHHH
Confidence 455566665543 359999999999999988754
No 49
>PF14072 DndB: DNA-sulfur modification-associated
Probab=20.52 E-value=1.5e+02 Score=25.66 Aligned_cols=59 Identities=17% Similarity=0.171 Sum_probs=41.1
Q ss_pred HHHHHHHHHHHHh--hcCCCCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhCCCchhHHHHH
Q psy13420 104 GAELKKVLPDAIA--TECGGCSDKQKEGAKKIFKFLIEKKPEEWKALEGKYDPSGSYKAKYD 163 (169)
Q Consensus 104 g~~lK~~lPeal~--t~C~KCt~kQK~~~~kvi~~l~~~~P~~w~~l~~KyDP~~~y~~~~~ 163 (169)
-..|.+.+-+.+. ..-..=.+.+.+.+..+.+.+.+..|+ |+.+..+--..+++++.|.
T Consensus 207 ~~~l~~~i~~~~~~~~~~~~~~~~~~~~l~~f~~~v~~~~~~-w~~~~~~~~~~~~~r~~~~ 267 (346)
T PF14072_consen 207 LSALVDAIKRLIKKDRPSEEDLEEIYEILNNFWDAVAEVFPE-WQKLFEKENSPVDFREKSI 267 (346)
T ss_pred HHHHHHHHHHHHhccCCCcccHHHHHHHHHHHHHHHHHHChH-HHHhhcccCchhhHhhHhh
Confidence 4566666666662 223333588899999999999999999 9988766543356655543
No 50
>COG5510 Predicted small secreted protein [Function unknown]
Probab=20.16 E-value=88 Score=20.19 Aligned_cols=7 Identities=43% Similarity=0.681 Sum_probs=2.9
Q ss_pred HHHHHHH
Q psy13420 46 MFKSLVL 52 (169)
Q Consensus 46 ~MK~~~~ 52 (169)
|+|++++
T Consensus 2 mk~t~l~ 8 (44)
T COG5510 2 MKKTILL 8 (44)
T ss_pred chHHHHH
Confidence 3443433
No 51
>PRK09993 C-lysozyme inhibitor; Provisional
Probab=20.12 E-value=2.5e+02 Score=22.69 Aligned_cols=41 Identities=20% Similarity=0.378 Sum_probs=25.5
Q ss_pred HHHHHHHHHHHHHHHHHhhcCCccCCcCccCCHHHHhcCHHHHHHHhh
Q psy13420 45 IMFKSLVLVACLACIGSVLTSPAYTTKYDNVDLDEIIANDRLFTNYYK 92 (169)
Q Consensus 45 ~~MK~~~~l~~~~~~~~~~a~e~Y~~~yDniDid~iL~N~rl~~~y~~ 92 (169)
+|.|.++.++++++++++.|+. ..-+.+++.++.+...+-+
T Consensus 5 ~~~~~l~aa~lL~~s~~a~Aq~-------d~t~seLl~~p~yk~aw~~ 45 (153)
T PRK09993 5 MMFKALTTVAALVIATSAMAQD-------DLTISSLAKGETTKAAFNQ 45 (153)
T ss_pred hHHHHHHHHHHHHHHHHHHhcC-------CccHHHHhcCHHHHHHHHH
Confidence 4455555444444444444543 1668899999998888765
No 52
>KOG4623|consensus
Probab=20.10 E-value=84 Score=30.19 Aligned_cols=42 Identities=14% Similarity=0.266 Sum_probs=29.1
Q ss_pred hcCCCCCHHHHhhHHHHHHHHHhcChHHHHHHHHhhCCCchhHHHHHHH
Q psy13420 117 TECGGCSDKQKEGAKKIFKFLIEKKPEEWKALEGKYDPSGSYKAKYDAQ 165 (169)
Q Consensus 117 t~C~KCt~kQK~~~~kvi~~l~~~~P~~w~~l~~KyDP~~~y~~~~~~~ 165 (169)
.-|.+|...|.-.++|+..|.=.++ +.|++=. ..|++++|+-
T Consensus 102 glC~~CN~~Q~ik~~kLA~FeP~de-~rydeeL------evYR~~LE~m 143 (611)
T KOG4623|consen 102 GLCEKCNMNQLIKNRKLADFEPPDE-QRYDEEL------EVYRKSLEEM 143 (611)
T ss_pred hhhhhhhhhHHHHHHHHhhcCCCch-hhHHHHH------HHHHHHHHHH
Confidence 4799999999999999998875332 3344333 3566666653
Done!