Query psy7032
Match_columns 136
No_of_seqs 116 out of 200
Neff 4.9
Searched_HMMs 46136
Date Sat Aug 17 01:01:37 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7032.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7032hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 TIGR00870 trp transient-recept 99.9 2.2E-27 4.8E-32 213.6 6.2 112 5-116 621-743 (743)
2 KOG3609|consensus 99.9 3.5E-27 7.5E-32 216.4 7.3 125 5-129 617-753 (822)
3 KOG3614|consensus 93.2 0.45 9.7E-06 47.6 8.4 38 5-43 1048-1085(1381)
4 COG5466 Predicted small metal- 76.8 5.4 0.00012 26.6 4.0 34 75-112 25-58 (59)
5 PF13624 SurA_N_3: SurA N-term 75.8 14 0.0003 26.9 6.4 59 72-130 74-139 (154)
6 KOG4807|consensus 73.6 3.8 8.3E-05 37.1 3.4 29 98-126 516-544 (593)
7 PRK00059 prsA peptidylprolyl i 71.1 26 0.00056 29.1 7.7 56 73-128 85-148 (336)
8 PF14394 DUF4423: Domain of un 62.0 11 0.00024 29.3 3.5 29 96-124 127-155 (171)
9 PF09388 SpoOE-like: Spo0E lik 61.4 13 0.00028 22.6 3.1 27 105-131 1-27 (45)
10 COG4224 Uncharacterized protei 60.8 25 0.00053 24.6 4.7 41 78-123 2-42 (77)
11 PRK06835 DNA replication prote 55.0 69 0.0015 27.3 7.5 57 73-129 37-93 (329)
12 PF08880 QLQ: QLQ; InterPro: 52.7 20 0.00044 21.5 2.8 19 97-115 2-20 (37)
13 PF10845 DUF2576: Protein of u 52.7 24 0.00051 22.6 3.2 28 101-128 11-40 (48)
14 PF03210 Paramyx_P_V_C: Paramy 51.9 11 0.00023 29.1 1.9 39 72-110 111-154 (155)
15 PF03564 DUF1759: Protein of u 49.4 96 0.0021 22.3 6.9 59 70-129 52-117 (145)
16 PF10146 zf-C4H2: Zinc finger- 47.4 41 0.00088 27.6 4.7 13 73-85 36-48 (230)
17 PF09312 SurA_N: SurA N-termin 47.2 71 0.0015 22.8 5.5 55 74-128 43-101 (118)
18 PF08105 Antimicrobial10: Metc 46.4 9.7 0.00021 24.7 0.8 9 26-34 35-43 (52)
19 cd08807 CARD_CARD10_CARMA3 Cas 45.7 25 0.00055 25.1 2.9 32 95-126 28-64 (86)
20 cd08806 CARD_CARD14_CARMA2 Cas 45.2 29 0.00063 24.8 3.1 32 95-126 28-64 (86)
21 TIGR03720 exospor_lead exospor 43.9 13 0.00027 21.0 0.9 11 25-35 7-19 (26)
22 PF15344 FAM217: FAM217 family 43.2 11 0.00024 31.3 0.8 10 26-35 50-59 (233)
23 PF09366 DUF1997: Protein of u 42.7 35 0.00076 25.7 3.5 14 25-38 110-123 (158)
24 PF10073 DUF2312: Uncharacteri 42.5 92 0.002 21.6 5.2 40 76-130 4-43 (74)
25 PF11176 DUF2962: Protein of u 42.4 41 0.00089 25.8 3.9 14 75-88 68-81 (155)
26 KOG3609|consensus 41.6 5 0.00011 38.6 -1.5 43 70-112 734-776 (822)
27 TIGR02147 Fsuc_second hypothet 40.4 38 0.00081 28.4 3.6 29 96-124 225-253 (271)
28 TIGR01219 Pmev_kin_ERG8 phosph 40.2 2.4E+02 0.0051 25.5 8.8 28 101-128 352-379 (454)
29 PF08776 VASP_tetra: VASP tetr 39.0 58 0.0013 20.1 3.4 21 102-122 12-32 (40)
30 PF13789 DUF4181: Domain of un 38.8 3.6 7.9E-05 29.6 -2.3 14 2-15 72-85 (110)
31 cd08809 CARD_CARD9 Caspase act 37.5 45 0.00098 23.8 3.1 33 95-127 28-65 (86)
32 PRK10788 periplasmic folding c 33.0 78 0.0017 28.8 4.7 55 74-128 88-149 (623)
33 PF13801 Metal_resist: Heavy-m 31.4 94 0.002 20.8 3.9 28 95-122 81-108 (125)
34 PRK10770 peptidyl-prolyl cis-t 30.7 2.4E+02 0.0052 24.0 7.1 55 73-127 48-106 (413)
35 PF05121 GvpK: Gas vesicle pro 29.8 2.1E+02 0.0046 20.4 6.2 53 76-131 23-75 (88)
36 PRK01631 hypothetical protein; 29.7 1.6E+02 0.0034 20.6 4.7 40 78-122 1-40 (76)
37 TIGR00496 frr ribosome recycli 29.4 2.8E+02 0.006 21.7 7.3 48 72-119 113-160 (176)
38 PRK07857 hypothetical protein; 29.2 86 0.0019 23.0 3.5 30 95-124 22-51 (106)
39 PF05291 Bystin: Bystin; Inte 28.1 60 0.0013 28.0 2.9 34 11-45 50-83 (301)
40 PRK10132 hypothetical protein; 27.9 87 0.0019 22.8 3.4 26 100-125 11-36 (108)
41 PF08172 CASP_C: CASP C termin 27.6 74 0.0016 26.3 3.3 25 100-124 99-123 (248)
42 PF04728 LPP: Lipoprotein leuc 27.5 1.1E+02 0.0024 20.1 3.5 21 101-121 17-37 (56)
43 PRK13723 conjugal transfer pil 27.4 2.6E+02 0.0055 25.4 6.9 51 73-123 358-413 (451)
44 PF09860 DUF2087: Uncharacteri 26.4 2.1E+02 0.0045 19.3 5.1 25 95-119 27-55 (71)
45 PF07889 DUF1664: Protein of u 26.0 68 0.0015 24.2 2.6 24 98-121 86-109 (126)
46 TIGR02698 CopY_TcrY copper tra 26.0 2.6E+02 0.0057 20.4 5.7 40 72-111 71-115 (130)
47 PF09851 SHOCT: Short C-termin 25.7 1.2E+02 0.0025 17.1 3.0 18 94-111 13-30 (31)
48 TIGR03575 selen_PSTK_euk L-ser 25.6 2.6E+02 0.0057 24.1 6.5 47 75-125 259-305 (340)
49 PF04353 Rsd_AlgQ: Regulator o 25.0 1.9E+02 0.004 22.6 4.9 38 8-46 20-59 (153)
50 PF14838 INTS5_C: Integrator c 24.8 86 0.0019 30.0 3.5 81 7-87 597-694 (696)
51 PRK12807 flagellin; Provisiona 24.2 3.5E+02 0.0075 22.3 6.7 32 93-124 97-128 (287)
52 KOG3870|consensus 23.9 1.4E+02 0.003 27.1 4.5 21 99-119 59-79 (434)
53 KOG0289|consensus 22.6 43 0.00093 30.7 1.1 30 78-107 24-54 (506)
54 PF15397 DUF4618: Domain of un 22.4 1.2E+02 0.0026 25.5 3.7 19 101-119 200-218 (258)
55 cd00520 RRF Ribosome recycling 22.1 3.8E+02 0.0083 20.8 7.2 49 71-119 117-165 (179)
56 PF14389 Lzipper-MIP1: Leucine 22.0 1.1E+02 0.0023 21.3 2.9 20 100-119 67-86 (88)
57 TIGR01801 CM_A chorismate muta 21.4 1.3E+02 0.0028 21.5 3.2 26 99-124 2-27 (102)
58 PF01988 VIT1: VIT family; In 21.1 93 0.002 24.5 2.7 16 97-112 93-108 (213)
59 PF14164 YqzH: YqzH-like prote 21.0 2.7E+02 0.006 18.7 4.7 18 94-111 22-39 (64)
60 PRK09269 chorismate mutase; Pr 20.8 4.3E+02 0.0092 21.1 6.4 45 79-123 101-145 (193)
61 PHA02554 13 neck protein; Prov 20.6 2E+02 0.0043 25.0 4.7 42 79-120 246-302 (311)
62 PF01152 Bac_globin: Bacterial 20.4 2.5E+02 0.0053 19.5 4.5 43 77-119 77-119 (120)
63 PF06757 Ins_allergen_rp: Inse 20.2 95 0.0021 23.8 2.5 53 78-130 20-75 (179)
No 1
>TIGR00870 trp transient-receptor-potential calcium channel protein. after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Probab=99.94 E-value=2.2e-27 Score=213.61 Aligned_cols=112 Identities=30% Similarity=0.551 Sum_probs=88.9
Q ss_pred CCccccchhhhhHHHHHhcCCCCCCCCcccccChHHHHHHHhhhhhhcccccc-HHH-HH--------HHHH-hhhhhhh
Q psy7032 5 RQIDKMRKDSIKLWLSYFDDSPTLPPPFNIFPTTKMFLKMFGLRQIDKMRKDS-IKR-KQ--------QARQ-EKERDFR 73 (136)
Q Consensus 5 ~~~~eWKFaRtkLwmsyf~~~~tlPpPFNliPspk~~~~l~~~~~~~~~~~~s-~~r-~~--------~~~~-~~~~~~~ 73 (136)
+++.||||+|+++||+|++.+.|+||||||||+||+++|++.+..+..+.... ..+ .. .... ..+.+.+
T Consensus 621 ~~~~~wk~~r~~~~~~y~~~~~~~p~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 700 (743)
T TIGR00870 621 DADEEWKFQRAKLWMSYEREGGTCPPPFNIIPGPKSFVGLFKRIEKHDGKKRQRWCRRVEEVNWTTWERKAETLIEDGLH 700 (743)
T ss_pred hhHHHHHHHHHHHHHHHHhcCCCCCCCceeccchHHHHHHHhhhhcccCcchHhHhhhhhhhccchhhhhhhhccccchh
Confidence 56899999999999999999999999999999999999988665433222110 001 00 0001 1122367
Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHH
Q psy7032 74 YAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYE 116 (136)
Q Consensus 74 y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyE 116 (136)
|+++|++||+|||++++++.+++++||||++||||||||||||
T Consensus 701 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~ 743 (743)
T TIGR00870 701 YQRVMKRLIKRYVLAEQRPRDDEGTTEEETKELKQDISSLRFE 743 (743)
T ss_pred HHHHHHhhhhhhhhhhccccccCCCChhhHHHHHHHhhhccCC
Confidence 9999999999999999999999999999999999999999997
No 2
>KOG3609|consensus
Probab=99.94 E-value=3.5e-27 Score=216.39 Aligned_cols=125 Identities=22% Similarity=0.277 Sum_probs=99.1
Q ss_pred CCccccchhhhhHHHHHhcCCCCCCCCcccccChHHHHHHHhhhhhhccccc---cHHHHHHH--------HHhhhhhhh
Q psy7032 5 RQIDKMRKDSIKLWLSYFDDSPTLPPPFNIFPTTKMFLKMFGLRQIDKMRKD---SIKRKQQA--------RQEKERDFR 73 (136)
Q Consensus 5 ~~~~eWKFaRtkLwmsyf~~~~tlPpPFNliPspk~~~~l~~~~~~~~~~~~---s~~r~~~~--------~~~~~~~~~ 73 (136)
++|+||||||||||||||++|.||||||||||+||+++|+..+.+++.+... +.++..+. .....+...
T Consensus 617 ~ADvEWKFARAKLw~syF~eg~tLPpPFNlipspks~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~ 696 (822)
T KOG3609|consen 617 DADVEWKFARAKLWMSYFDEGMALPPPFNLIPSPKSFVYRNLRLRASIWKEKPANDDVEDQAADMLLVVFAAFVARKEEK 696 (822)
T ss_pred cchhHHHHHHHHHHHHHhcCCCCCCCCcccccchHHHHHHHhhhhcchhccCcccccchhhhHhHHHHHhhhhccchhhc
Confidence 6899999999999999999999999999999999999998866554432222 11111111 122345679
Q ss_pred HHHHHHHHHHHHHHHHHHhhc-CCCCChhHHHHHHhhHHHHHHHHHHHHHhcCCCCC
Q psy7032 74 YAAVMRALVWRYICLQHRQAD-TNPVTEDDVNEVKGEITAMRYELMNVLEKNGMDTS 129 (136)
Q Consensus 74 y~~VMr~LVkRYi~~~qr~~e-~~~vtE~dl~EiKQDISslRyEl~e~l~~~gm~~~ 129 (136)
|+.+|+..+.||..+..+... ..+.++...++++||++++|+|+++.++...+.+.
T Consensus 697 ~~~~~~~~~~r~~~~~~~~~~~t~g~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~ 753 (822)
T KOG3609|consen 697 YKSIWRSEFHREEKQPPRVTFLTTGAHSFNMDNTFEGFKMRRSETMEALGGKEYTTV 753 (822)
T ss_pred chhhhhhHHHHHHHHHHHHHhhccchhhhhhhhhhhccchhHHHHHHHhcCceeehh
Confidence 999999999999999888765 77999999999999999999999999975444333
No 3
>KOG3614|consensus
Probab=93.16 E-value=0.45 Score=47.64 Aligned_cols=38 Identities=29% Similarity=0.543 Sum_probs=29.9
Q ss_pred CCccccchhhhhHHHHHhcCCCCCCCCcccccChHHHHH
Q psy7032 5 RQIDKMRKDSIKLWLSYFDDSPTLPPPFNIFPTTKMFLK 43 (136)
Q Consensus 5 ~~~~eWKFaRtkLwmsyf~~~~tlPpPFNliPspk~~~~ 43 (136)
+.+.=|||-|=+|-|.|=+ -..|||||=||-..-.+..
T Consensus 1048 ~sd~iWkFQRY~limeyh~-rP~LPPPfiilsHi~l~~~ 1085 (1381)
T KOG3614|consen 1048 NSDQIWKFQRYSLIMEYHS-RPALPPPFIILSHIYLLLK 1085 (1381)
T ss_pred hHHHHHHHHHHHHHHHhhc-CCCCCCCcHHHHHHHHHHH
Confidence 3456799999999999876 5589999988876655443
No 4
>COG5466 Predicted small metal-binding protein [Function unknown]
Probab=76.77 E-value=5.4 Score=26.62 Aligned_cols=34 Identities=29% Similarity=0.404 Sum_probs=24.7
Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHH
Q psy7032 75 AAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITA 112 (136)
Q Consensus 75 ~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISs 112 (136)
++||+++|- .+...-..+.+.||+++-|||-|+.
T Consensus 25 ~Ev~~~iv~----H~k~~Hg~t~I~ed~in~Ik~rIk~ 58 (59)
T COG5466 25 AEVMRRIVE----HAKEAHGETEIREDMINKIKSRIKK 58 (59)
T ss_pred HHHHHHHHH----HHHHhcCCccccHHHHHHHHHHhhc
Confidence 567777763 3333334788999999999999874
No 5
>PF13624 SurA_N_3: SurA N-terminal domain; PDB: 3NRK_A.
Probab=75.81 E-value=14 Score=26.88 Aligned_cols=59 Identities=19% Similarity=0.260 Sum_probs=31.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHH-------HHHHHHHHHhcCCCCCc
Q psy7032 72 FRYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAM-------RYELMNVLEKNGMDTSS 130 (136)
Q Consensus 72 ~~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISsl-------RyEl~e~l~~~gm~~~~ 130 (136)
.-.+.++..||.+.+....-++..-.|++++|.+.=+.|-.| +-.+.+.|+++||+...
T Consensus 74 ~~~~~~l~~lI~~~ll~q~A~~~gi~vsd~ev~~~i~~~~~f~~~g~~~~~~f~~~L~~~g~t~~~ 139 (154)
T PF13624_consen 74 QLKQQVLDQLIDQKLLLQEAKKLGISVSDAEVDDAIKQIPAFQENGKFDKEAFEEFLKQQGMTEEE 139 (154)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHTT----HHHHHHHHHH--HHHHH----HHHHHHHHH--------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHhhccccc
Confidence 456779999999999886666666777888887655554444 23456678888886543
No 6
>KOG4807|consensus
Probab=73.60 E-value=3.8 Score=37.07 Aligned_cols=29 Identities=28% Similarity=0.460 Sum_probs=25.5
Q ss_pred CChhHHHHHHhhHHHHHHHHHHHHHhcCC
Q psy7032 98 VTEDDVNEVKGEITAMRYELMNVLEKNGM 126 (136)
Q Consensus 98 vtE~dl~EiKQDISslRyEl~e~l~~~gm 126 (136)
|-|.+|.=|||+|||||-||.-.+++..+
T Consensus 516 VKEsEiQYLKqEissLkDELQtalrDKky 544 (593)
T KOG4807|consen 516 VKESEIQYLKQEISSLKDELQTALRDKKY 544 (593)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhc
Confidence 45899999999999999999999987654
No 7
>PRK00059 prsA peptidylprolyl isomerase; Provisional
Probab=71.11 E-value=26 Score=29.06 Aligned_cols=56 Identities=18% Similarity=0.321 Sum_probs=40.4
Q ss_pred hHHHHHHHHHHHHHHHHHHhhcCCCCChhHHH-HHHhhHHHHHH-------HHHHHHHhcCCCC
Q psy7032 73 RYAAVMRALVWRYICLQHRQADTNPVTEDDVN-EVKGEITAMRY-------ELMNVLEKNGMDT 128 (136)
Q Consensus 73 ~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~-EiKQDISslRy-------El~e~l~~~gm~~ 128 (136)
.+..|+.+||.+.+....-+...-.|++++|. +|.+.|..|.- .+...|...||+.
T Consensus 85 ~~~~vL~~LI~~~ll~q~a~~~gi~vsd~ei~~~i~~~~~~~~~~~~~~~~~~~~~L~~~g~t~ 148 (336)
T PRK00059 85 QKEQILDSLITEKVLLQKAKELKLIPSEEELNKEVDKKINEIKKQFNNDEEQFEEALKATGFTE 148 (336)
T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCcCHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHcCCCH
Confidence 46789999999999886665556667888874 57777776643 4666777778753
No 8
>PF14394 DUF4423: Domain of unknown function (DUF4423)
Probab=61.98 E-value=11 Score=29.28 Aligned_cols=29 Identities=24% Similarity=0.504 Sum_probs=26.3
Q ss_pred CCCChhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032 96 NPVTEDDVNEVKGEITAMRYELMNVLEKN 124 (136)
Q Consensus 96 ~~vtE~dl~EiKQDISslRyEl~e~l~~~ 124 (136)
-+|+++++.+|++-|-.||.++..+..+.
T Consensus 127 ~~vs~~~~~ki~~~i~~fRk~i~~i~~~~ 155 (171)
T PF14394_consen 127 MSVSREDYEKIKKEIREFRKKIIAIAEED 155 (171)
T ss_pred EEeCHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 46789999999999999999999998874
No 9
>PF09388 SpoOE-like: Spo0E like sporulation regulatory protein; InterPro: IPR018540 Spore formation is an extreme response to starvation and can also be a component of disease transmission. Sporulation is controlled by an expanded two-component system where starvation signals result in sensor kinase activation and phosphorylation of the master sporulation response regulator Spo0A. Phosphatases such as Spo0E dephosphorylate Spo0A thereby inhibiting sporulation. This is a family of Spo0E-like phosphatases. The structure of a Bacillus anthracis member of this family has revealed an anti-parallel alpha-helical structure []. ; PDB: 2BZB_B 2C0S_A.
Probab=61.36 E-value=13 Score=22.65 Aligned_cols=27 Identities=33% Similarity=0.500 Sum_probs=23.4
Q ss_pred HHHhhHHHHHHHHHHHHHhcCCCCCcc
Q psy7032 105 EVKGEITAMRYELMNVLEKNGMDTSSA 131 (136)
Q Consensus 105 EiKQDISslRyEl~e~l~~~gm~~~~~ 131 (136)
+|.+.|..+|.+|.+.....|+.-|.+
T Consensus 1 ~L~~~Ie~~R~~L~~~~~~~~l~~~~v 27 (45)
T PF09388_consen 1 ELLEEIEELRQELNELAEKKGLTDPEV 27 (45)
T ss_dssp HHHHHHHHHHHHHHHHHHHCCTTCHHH
T ss_pred CHHHHHHHHHHHHHHHHHHcCCCCHHH
Confidence 578899999999999999988876654
No 10
>COG4224 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=60.79 E-value=25 Score=24.65 Aligned_cols=41 Identities=17% Similarity=0.255 Sum_probs=26.2
Q ss_pred HHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHh
Q psy7032 78 MRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEK 123 (136)
Q Consensus 78 Mr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~ 123 (136)
|..++-| |-..-.++...++||++.. +=..||.|-++.++.
T Consensus 2 ~~~~i~r-INeLAkk~K~~gLTeeEk~----eQ~~LR~eYl~~fr~ 42 (77)
T COG4224 2 SKKKIAR-INELAKKKKEEGLTEEEKK----EQAKLRREYLESFRG 42 (77)
T ss_pred cHHHHHH-HHHHHHHhcccCCCHHHHH----HHHHHHHHHHHHHHH
Confidence 3455556 5444556678999999854 445666666666653
No 11
>PRK06835 DNA replication protein DnaC; Validated
Probab=55.05 E-value=69 Score=27.32 Aligned_cols=57 Identities=21% Similarity=0.230 Sum_probs=40.2
Q ss_pred hHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHhcCCCCC
Q psy7032 73 RYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEKNGMDTS 129 (136)
Q Consensus 73 ~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~~gm~~~ 129 (136)
+..++-..+...=|..++.--..+.-.+..+.++|+.|..|+.+--++|..+|.+..
T Consensus 37 ~~~~id~~i~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~l~~~~~~lL~~~g~~~d 93 (329)
T PRK06835 37 EIAEIDDEIAKLGIKLSRAILKNPDKKEETLKELKEKITDLRVKKAELLVSNGYPPD 93 (329)
T ss_pred cHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHHHHHHHHHHHHHHHHHHcCCChh
Confidence 444555555555555444433322223789999999999999999999999998744
No 12
>PF08880 QLQ: QLQ; InterPro: IPR014978 QLQ is named after the conserved Gln, Leu, Gln motif. QLQ is found at the N terminus of SWI2/SNF2 protein, which has been shown to be involved in protein-protein interactions. QLQ has been postulated to be involved in mediating protein interactions []. ; GO: 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=52.71 E-value=20 Score=21.47 Aligned_cols=19 Identities=32% Similarity=0.676 Sum_probs=17.4
Q ss_pred CCChhHHHHHHhhHHHHHH
Q psy7032 97 PVTEDDVNEVKGEITAMRY 115 (136)
Q Consensus 97 ~vtE~dl~EiKQDISslRy 115 (136)
..|.+-+++||..|..+||
T Consensus 2 ~FT~~Ql~~L~~Qi~ayK~ 20 (37)
T PF08880_consen 2 PFTPAQLQELRAQILAYKY 20 (37)
T ss_pred CCCHHHHHHHHHHHHHHHH
Confidence 4688999999999999999
No 13
>PF10845 DUF2576: Protein of unknown function (DUF2576); InterPro: IPR022556 The function of this viral family of proteins is unknown. The entry contains Orf5 from Autographa californica nuclear polyhedrosis virus (AcMNPV).
Probab=52.67 E-value=24 Score=22.56 Aligned_cols=28 Identities=18% Similarity=0.447 Sum_probs=22.3
Q ss_pred hHHHHHHhhHHHHHHHHHHHHHhc--CCCC
Q psy7032 101 DDVNEVKGEITAMRYELMNVLEKN--GMDT 128 (136)
Q Consensus 101 ~dl~EiKQDISslRyEl~e~l~~~--gm~~ 128 (136)
=|-++||+|+.|||.-|.|+-..+ |++.
T Consensus 11 ydreqlrrelnsLR~~vhelctRs~t~fDc 40 (48)
T PF10845_consen 11 YDREQLRRELNSLRRSVHELCTRSTTGFDC 40 (48)
T ss_pred cCHHHHHHHHHHHHHHHHHHHHhcCCCcch
Confidence 356789999999999999887765 4543
No 14
>PF03210 Paramyx_P_V_C: Paramyxovirus P/V phosphoprotein C-terminal; InterPro: IPR004897 Paramyxoviral P genes are able to generate more than one product, using alternative reading frames and RNA editing. The P gene encodes the structural phosphoprotein P. In addition, it encodes several non-structural proteins present in the infected cell but not in the virus particle. This family includes phosphoprotein P and the non-structural phosphoprotein V from different paramyxoviruses. Phosphoprotein P is a modular protein organised into two moieties that are both functionally and structurally distinct: a well-conserved C-terminal moiety that contains all the regions required for transcription, and a poorly conserved, intrinsically unstructured N-terminal moiety that provides several additional functions required for replication. The N-terminal moiety is responsible for binding to newly synthesized free N(0) (nucleoprotein that has not yet bound RNA), in order to prevent the binding of N(0) to cellular RNA. The C-terminal moiety consists of an oligomerisation domain, an N-RNA (nucleoprotein-RNA)-binding domain and an L polymerase-binding domain [, ]. Phosphoprotein P is essential for the activity of the RNA polymerase complex which it forms with another subunit, L IPR001016 from INTERPRO. Although all the catalytic activities of the polymerase are associated with the L subunit, its function requires specific interactions with phosphoprotein P []. The P and V phosphoproteins are amino co-terminal, but diverge at their C-termini. This difference is generated by an RNA-editing mechanism in which one or two non-templated G residues are inserted into P-gene-derived mRNA. In Measles virus and Sendai virus, one G residue is inserted and the edited transcript encodes the V protein. In Mumps virus, Simian virus 5 and Newcastle disease virus, two G residues are inserted, and the edited transcript codes for the P protein []. Being phosphoproteins, both P and V are rich in serine and threonine residues over their whole lengths. In addition, the V proteins are rich in cysteine residues at the C-termini [].; PDB: 1T6O_A 2K9D_A 1OKS_A 3BBZ_B.
Probab=51.93 E-value=11 Score=29.13 Aligned_cols=39 Identities=26% Similarity=0.280 Sum_probs=27.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHhhc-----CCCCChhHHHHHHhhH
Q psy7032 72 FRYAAVMRALVWRYICLQHRQAD-----TNPVTEDDVNEVKGEI 110 (136)
Q Consensus 72 ~~y~~VMr~LVkRYi~~~qr~~e-----~~~vtE~dl~EiKQDI 110 (136)
.-|..+++.|++-.+.......+ +.--|++||++||+.|
T Consensus 111 s~~k~~v~aLI~~~i~~~~~r~kl~~~l~~a~t~~dl~~ikr~I 154 (155)
T PF03210_consen 111 SGEKDTVRALIKSCIKDPDKRAKLLKKLNQAKTEEDLKRIKRMI 154 (155)
T ss_dssp --HHHHHHHHHHHSTTSHHHHHHHHHHHHC--SHHHHHHHHHHH
T ss_pred chHHHHHHHHHHHcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHh
Confidence 45678999999999886544332 4556899999999987
No 15
>PF03564 DUF1759: Protein of unknown function (DUF1759); InterPro: IPR005312 This is a small family of proteins of unknown function.
Probab=49.44 E-value=96 Score=22.28 Aligned_cols=59 Identities=15% Similarity=0.237 Sum_probs=42.9
Q ss_pred hhhhHHHHHHHHHHHHHHHH-----HHh-hcCCC-CChhHHHHHHhhHHHHHHHHHHHHHhcCCCCC
Q psy7032 70 RDFRYAAVMRALVWRYICLQ-----HRQ-ADTNP-VTEDDVNEVKGEITAMRYELMNVLEKNGMDTS 129 (136)
Q Consensus 70 ~~~~y~~VMr~LVkRYi~~~-----qr~-~e~~~-vtE~dl~EiKQDISslRyEl~e~l~~~gm~~~ 129 (136)
.+..|+..+..|-+||=... ... -.+-+ +.++|..+|+..+..+..-+ ..|+..|.++.
T Consensus 52 ~~~~Y~~a~~~L~~~yg~~~~i~~~~~~~l~~l~~~~~~d~~~L~~~~~~v~~~i-~~L~~lg~~~~ 117 (145)
T PF03564_consen 52 SEENYEEAWELLEERYGNPRRIIQALLEELRNLPPISNDDPEALRSLVDKVNNCI-RALKALGVNVD 117 (145)
T ss_pred cchhhHHHHHHHHHHhCCchHHHHHHHHHHhccccccchhHHHHHHHHHHHHHHH-HHHHHcCCCCC
Confidence 45789999999999995432 222 12333 78899999999998888766 57777776654
No 16
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=47.41 E-value=41 Score=27.64 Aligned_cols=13 Identities=23% Similarity=0.301 Sum_probs=5.8
Q ss_pred hHHHHHHHHHHHH
Q psy7032 73 RYAAVMRALVWRY 85 (136)
Q Consensus 73 ~y~~VMr~LVkRY 85 (136)
.|..-|..|..-+
T Consensus 36 e~~kE~~~L~~Er 48 (230)
T PF10146_consen 36 EYRKEMEELLQER 48 (230)
T ss_pred HHHHHHHHHHHHH
Confidence 4444444444433
No 17
>PF09312 SurA_N: SurA N-terminal domain; InterPro: IPR015391 The correct folding of outer membrane proteins in Gram negative bacteria is facilitated by the survival protein SurA []. This entry represents the domain found at the N terminus of the chaperone SurA. It is a helical domain of unknown function. The C terminus of the SurA protein folds back and forms part of this domain also but is not included in the current alignment. ; PDB: 3RGC_B 2PV3_B 1M5Y_A.
Probab=47.21 E-value=71 Score=22.80 Aligned_cols=55 Identities=22% Similarity=0.280 Sum_probs=36.1
Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHH----HHHHHHHHhcCCCC
Q psy7032 74 YAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMR----YELMNVLEKNGMDT 128 (136)
Q Consensus 74 y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslR----yEl~e~l~~~gm~~ 128 (136)
...|+..||-..+....-+..+=.||+++|++.=++|.+-. -++...|...|++.
T Consensus 43 ~~qvLd~LI~e~L~~q~ak~~gI~vsd~evd~~i~~ia~~n~ls~~ql~~~L~~~G~s~ 101 (118)
T PF09312_consen 43 RKQVLDQLIDEKLQLQEAKRLGIKVSDEEVDEAIANIAKQNNLSVEQLRQQLEQQGISY 101 (118)
T ss_dssp HHHHHHHHHHHHHHHHHHHHCT----HHHHHHHHHHHHHHTT--HHHHHHHCHHCT--H
T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHcCCCHHHHHHHHHHcCCCH
Confidence 46799999999998766666677889999988777776543 26666666777653
No 18
>PF08105 Antimicrobial10: Metchnikowin family; InterPro: IPR012513 This family consists of the metchnikowin family of antimicrobial peptides from Drosophila. metchnikowin is a proline-rich peptide whose expression is immune-inducible. Induction of the metchnikowin gene expression can be mediated either by the TOLL pathway or by the imd gene product. The metchnikowin peptide is unique among the Drosophila antimicrobial peptides in that it is active against both bacteria and fungi [].
Probab=46.39 E-value=9.7 Score=24.71 Aligned_cols=9 Identities=56% Similarity=0.866 Sum_probs=7.4
Q ss_pred CCCCCCccc
Q psy7032 26 PTLPPPFNI 34 (136)
Q Consensus 26 ~tlPpPFNl 34 (136)
.|-|.|||=
T Consensus 35 DTRPSPFNP 43 (52)
T PF08105_consen 35 DTRPSPFNP 43 (52)
T ss_pred CCCCCCCCC
Confidence 389999994
No 19
>cd08807 CARD_CARD10_CARMA3 Caspase activation and recruitment domain of CARD10-like proteins. Caspase activation and recruitment domain (CARD) similar to that found in CARD10, also known as CARMA3 (caspase recruitment domain-containing membrane-associated guanylate kinase protein 3) or BIMP1. The CARMA3-BCL10-MALT1 signalosome plays a role in the GPCR-induced NF-kB activation. CARMA3 is more widely expressed than CARMA1, which is found only in hematopoietic cells. In endothelial and smooth muscle cells, CARMA3-mediated NF-kB activation induces pro-inflammatory signals within the vasculature and is a key factor in atherogenesis. In bronchial epithelial cells, CARMA3-mediated NF-kB signaling is important for the development of allergic airway inflammation. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains fo
Probab=45.69 E-value=25 Score=25.07 Aligned_cols=32 Identities=25% Similarity=0.388 Sum_probs=25.5
Q ss_pred CCCCChhHHHHHHhhHH---HHHH--HHHHHHHhcCC
Q psy7032 95 TNPVTEDDVNEVKGEIT---AMRY--ELMNVLEKNGM 126 (136)
Q Consensus 95 ~~~vtE~dl~EiKQDIS---slRy--El~e~l~~~gm 126 (136)
+.-+||.|..||..+.+ +-+. -|+|+|...|-
T Consensus 28 ~~VL~~~deEEI~~~~~~~~~~~k~~~LLDIL~trG~ 64 (86)
T cd08807 28 CRVIDEQDEEEVLNSYRFPCRINRTGRLMDILRGRGK 64 (86)
T ss_pred hCCCChhhHHHHHhccchhhHHHHHHHHHHHHHhcCc
Confidence 45569999999999988 5554 47999998883
No 20
>cd08806 CARD_CARD14_CARMA2 Caspase activation and recruitment domain of CARD14-like proteins. Caspase activation and recruitment domain (CARD) similar to that found in CARD14, also known as BIMP2 or CARMA2 (caspase recruitment domain-containing membrane-associated guanylate kinase protein 2). CARD14 has been identified as a novel member of the MAGUK (membrane-associated guanylate kinase) family that functions as upstream activators of BCL10 (B-cell lymphoma 10) and NF-kB signaling. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways
Probab=45.21 E-value=29 Score=24.76 Aligned_cols=32 Identities=19% Similarity=0.231 Sum_probs=25.0
Q ss_pred CCCCChhHHHHHHhhHHHHHHH-----HHHHHHhcCC
Q psy7032 95 TNPVTEDDVNEVKGEITAMRYE-----LMNVLEKNGM 126 (136)
Q Consensus 95 ~~~vtE~dl~EiKQDISslRyE-----l~e~l~~~gm 126 (136)
+.-+|++|..||..+.+.-... |+|+|...|=
T Consensus 28 ~~IL~~~deeeIls~~t~~~r~~k~g~LLDIL~trG~ 64 (86)
T cd08806 28 AKVLDQLDEEEVLHSPRLTNRAMRVGHLLDLLKTRGK 64 (86)
T ss_pred cCCCChhhHHHHHccchHHHHHHHHHHHHHHHHhcCc
Confidence 4456999999999998865553 6899998874
No 21
>TIGR03720 exospor_lead exosporium leader peptide. This domain is found as a leader peptide in at least two proteins targeted to the exosporium, a structure that occurs as the outermost layer of Bacillus anthracis, B. cereus, and B. thuringiensis spores. The exosporium consists of a basal layer and a nap of hair-like filaments. BclA, the major protein of the nap filaments, is targeted there by this leader peptide.
Probab=43.92 E-value=13 Score=21.05 Aligned_cols=11 Identities=64% Similarity=1.325 Sum_probs=8.3
Q ss_pred CCCCCC--Ccccc
Q psy7032 25 SPTLPP--PFNIF 35 (136)
Q Consensus 25 ~~tlPp--PFNli 35 (136)
|.|||| ||.+.
T Consensus 7 GPTlPpippft~P 19 (26)
T TIGR03720 7 GPTLPPIPPFTLP 19 (26)
T ss_pred CCCCCCCCCcccc
Confidence 779987 77764
No 22
>PF15344 FAM217: FAM217 family
Probab=43.17 E-value=11 Score=31.34 Aligned_cols=10 Identities=60% Similarity=1.215 Sum_probs=8.3
Q ss_pred CCCCCCcccc
Q psy7032 26 PTLPPPFNIF 35 (136)
Q Consensus 26 ~tlPpPFNli 35 (136)
..||||||-.
T Consensus 50 DFLPPPFNSw 59 (233)
T PF15344_consen 50 DFLPPPFNSW 59 (233)
T ss_pred CCCCCCCcch
Confidence 4799999965
No 23
>PF09366 DUF1997: Protein of unknown function (DUF1997); InterPro: IPR018971 This family of proteins are functionally uncharacterised.
Probab=42.70 E-value=35 Score=25.74 Aligned_cols=14 Identities=50% Similarity=1.146 Sum_probs=11.5
Q ss_pred CCCCCCCcccccCh
Q psy7032 25 SPTLPPPFNIFPTT 38 (136)
Q Consensus 25 ~~tlPpPFNliPsp 38 (136)
+..+||||.++|.+
T Consensus 110 ~v~~P~~~~~~P~~ 123 (158)
T PF09366_consen 110 SVELPPPFRLLPES 123 (158)
T ss_pred EEEcChhHHhCCHH
Confidence 34689999999985
No 24
>PF10073 DUF2312: Uncharacterized protein conserved in bacteria (DUF2312); InterPro: IPR018753 This entry is represented by Azospirillum phage Cd, Gp10. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Members of this family of hypothetical bacterial proteins have no known function.
Probab=42.52 E-value=92 Score=21.59 Aligned_cols=40 Identities=23% Similarity=0.412 Sum_probs=26.9
Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHhcCCCCCc
Q psy7032 76 AVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEKNGMDTSS 130 (136)
Q Consensus 76 ~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~~gm~~~~ 130 (136)
.-++..|.|...- |+|...|..||. ++....+.+|+|+.-
T Consensus 4 ~~Lr~~ieRiErL-----------EeEk~~i~~dik----dVyaEAK~~GfD~K~ 43 (74)
T PF10073_consen 4 EQLRQFIERIERL-----------EEEKKAISDDIK----DVYAEAKGNGFDTKA 43 (74)
T ss_pred HHHHHHHHHHHHH-----------HHHHHHHHHHHH----HHHHHHHhCCCCHHH
Confidence 3466677776655 677666777775 566667888888753
No 25
>PF11176 DUF2962: Protein of unknown function (DUF2962); InterPro: IPR021346 This eukaryotic family of proteins has no known function. ; PDB: 2KKM_A.
Probab=42.35 E-value=41 Score=25.83 Aligned_cols=14 Identities=21% Similarity=0.349 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHH
Q psy7032 75 AAVMRALVWRYICL 88 (136)
Q Consensus 75 ~~VMr~LVkRYi~~ 88 (136)
..-|+.||.+||..
T Consensus 68 ~~e~~~lI~~yl~R 81 (155)
T PF11176_consen 68 LEEIHELIERYLHR 81 (155)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhc
Confidence 45678888888865
No 26
>KOG3609|consensus
Probab=41.58 E-value=5 Score=38.65 Aligned_cols=43 Identities=14% Similarity=0.026 Sum_probs=39.2
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHH
Q psy7032 70 RDFRYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITA 112 (136)
Q Consensus 70 ~~~~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISs 112 (136)
....++++|.+|..+|....|-..+.+.+++....+.+||+++
T Consensus 734 ~~~~~~e~~~~~~~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~ 776 (822)
T KOG3609|consen 734 FKMRRSETMEALGGKEYTTVQYYVPKDKTPGPAAAADAGTKVA 776 (822)
T ss_pred cchhHHHHHHHhcCceeehhhhccccCcCCCccchhhhccccc
Confidence 3468999999999999999999999999999999999999993
No 27
>TIGR02147 Fsuc_second hypothetical protein, TIGR02147. This family consists of the 40 members of a paralogous protein family in the rumen anaerobe Fibrobacter succinogenes S85. Member proteins are about 270 residues long and appear to lack signal sequences and transmembrane helices. The only perfectly conserved residue is a glycine in an otherwise poorly conserved region, suggesting members are not enzymes. The family is not characterized.
Probab=40.40 E-value=38 Score=28.37 Aligned_cols=29 Identities=21% Similarity=0.392 Sum_probs=25.4
Q ss_pred CCCChhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032 96 NPVTEDDVNEVKGEITAMRYELMNVLEKN 124 (136)
Q Consensus 96 ~~vtE~dl~EiKQDISslRyEl~e~l~~~ 124 (136)
-+++++++.+|++-|-.||-+++.+..+.
T Consensus 225 ~~i~~~~~~~i~~~i~~fRk~i~~i~~~~ 253 (271)
T TIGR02147 225 FGISEEAYKEIVKKIQEFRKEVLAIATKD 253 (271)
T ss_pred EecCHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 46679999999999999999999877654
No 28
>TIGR01219 Pmev_kin_ERG8 phosphomevalonate kinase, ERG8-type, eukaryotic branch. This enzyme is part of the mevalonate pathway, one of two alternative pathways for the biosynthesis of IPP. In an example of nonorthologous gene displacement, two different types of phosphomevalonate kinase are found - the animal type and this ERG8 type. This model represents plant and fungal forms of the ERG8 type of phosphomevalonate kinase.
Probab=40.25 E-value=2.4e+02 Score=25.49 Aligned_cols=28 Identities=14% Similarity=0.325 Sum_probs=24.0
Q ss_pred hHHHHHHhhHHHHHHHHHHHHHhcCCCC
Q psy7032 101 DDVNEVKGEITAMRYELMNVLEKNGMDT 128 (136)
Q Consensus 101 ~dl~EiKQDISslRyEl~e~l~~~gm~~ 128 (136)
.|+.+|++-|...|..|.++=...|.++
T Consensus 352 ~~~~~i~~~i~~~R~~Lr~~~~~sgv~I 379 (454)
T TIGR01219 352 EELLEAREAMLRIRRLMRQITEEASVDI 379 (454)
T ss_pred ccHHHHHHHHHHHHHHHHHhhHhcCCcc
Confidence 3899999999999999988888887665
No 29
>PF08776 VASP_tetra: VASP tetramerisation domain; InterPro: IPR014885 Vasodilator-stimulated phosphoprotein (VASP) is an actin cytoskeletal regulatory protein. This region corresponds to the tetramerisation domain which forms a right handed alpha helical coiled coil structure []. ; PDB: 1USE_A 1USD_A.
Probab=38.97 E-value=58 Score=20.13 Aligned_cols=21 Identities=19% Similarity=0.449 Sum_probs=11.9
Q ss_pred HHHHHHhhHHHHHHHHHHHHH
Q psy7032 102 DVNEVKGEITAMRYELMNVLE 122 (136)
Q Consensus 102 dl~EiKQDISslRyEl~e~l~ 122 (136)
=|.|.+..+..++.|+++.++
T Consensus 12 IL~EvrkEl~K~K~EIIeA~~ 32 (40)
T PF08776_consen 12 ILEEVRKELQKVKEEIIEAIR 32 (40)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 344555555666666666554
No 30
>PF13789 DUF4181: Domain of unknown function (DUF4181)
Probab=38.83 E-value=3.6 Score=29.62 Aligned_cols=14 Identities=21% Similarity=0.290 Sum_probs=11.3
Q ss_pred CCCCCccccchhhh
Q psy7032 2 FGLRQIDKMRKDSI 15 (136)
Q Consensus 2 ~~~~~~~eWKFaRt 15 (136)
+++++-+||||+|-
T Consensus 72 ~~~ra~mEWKy~re 85 (110)
T PF13789_consen 72 FCFRAFMEWKYDRE 85 (110)
T ss_pred HHHHHHHHHHhccc
Confidence 46788899999873
No 31
>cd08809 CARD_CARD9 Caspase activation and recruitment domain of CARD9-like proteins. Caspase activation and recruitment domain (CARD) similar to that found in CARD9. CARD9 is a central regulator of innate immunity and is highly expressed in dendritic cells and macrophages. Together with BCL10 (B-cell lymphoma 10) and Malt1 (mucosa-associated lymphoid tissue-lymphoma-translocation gene 1), it forms the M-CBM signalosome (the CBM complex in myeloid immune cells), which mediates activation of NF-kB and MAPK by ITAM-coupled receptors expressed on immune cells. CARD9 associates with BCL10 via a CARD-CARD interaction. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating wi
Probab=37.51 E-value=45 Score=23.80 Aligned_cols=33 Identities=21% Similarity=0.482 Sum_probs=24.3
Q ss_pred CCCCChhHHHHHHhh---HHHHHH--HHHHHHHhcCCC
Q psy7032 95 TNPVTEDDVNEVKGE---ITAMRY--ELMNVLEKNGMD 127 (136)
Q Consensus 95 ~~~vtE~dl~EiKQD---ISslRy--El~e~l~~~gm~ 127 (136)
+.-++++|++||..+ .++-+. -|+|+|...|-.
T Consensus 28 ~~Vl~~~deEeI~~~~~~~~~~~kvg~LLDIL~~rG~~ 65 (86)
T cd08809 28 CKVLNSDDEEQVLNDPSLVIRKRKVGVLLDILQRTGLK 65 (86)
T ss_pred hCCCChhhHHHHHcCcccchHHHHHHHHHHHHHhcCch
Confidence 445699999999999 334333 478999988843
No 32
>PRK10788 periplasmic folding chaperone; Provisional
Probab=33.04 E-value=78 Score=28.79 Aligned_cols=55 Identities=15% Similarity=0.172 Sum_probs=42.1
Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHH-------HHHHHHHhcCCCC
Q psy7032 74 YAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRY-------ELMNVLEKNGMDT 128 (136)
Q Consensus 74 y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRy-------El~e~l~~~gm~~ 128 (136)
.+.|+..||.+-+....-++-.-.|++++|.+.=+.+..|.- .+...|+++||+.
T Consensus 88 ~~qvl~~LI~~~Ll~q~A~~lgi~vsd~ev~~~I~~~p~Fq~~G~Fd~~~y~~~L~~~g~t~ 149 (623)
T PRK10788 88 RQQVLNRLIDEALLDQYARELGLGISDEQVKQAIFATPAFQTDGKFDNNKYLAILNQMGMTA 149 (623)
T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCcCHHHHHHHHHhCcccccCCCcCHHHHHHHHHHcCCCH
Confidence 478999999999998777777788899998876666655532 2567888889853
No 33
>PF13801 Metal_resist: Heavy-metal resistance; PDB: 3EPV_C 2Y3D_A 2Y3H_D 2Y3G_B 2Y3B_A 2Y39_A 3LAY_H.
Probab=31.41 E-value=94 Score=20.79 Aligned_cols=28 Identities=21% Similarity=0.205 Sum_probs=20.3
Q ss_pred CCCCChhHHHHHHhhHHHHHHHHHHHHH
Q psy7032 95 TNPVTEDDVNEVKGEITAMRYELMNVLE 122 (136)
Q Consensus 95 ~~~vtE~dl~EiKQDISslRyEl~e~l~ 122 (136)
.+.++++.|..+-++|+..+.++.....
T Consensus 81 ~~~~D~~~i~a~~~~~~~~~~~l~~~~~ 108 (125)
T PF13801_consen 81 APPPDEAAIEALLEEIREAQAELRQERL 108 (125)
T ss_dssp CSSS-HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5677888888888888888887765543
No 34
>PRK10770 peptidyl-prolyl cis-trans isomerase SurA; Provisional
Probab=30.74 E-value=2.4e+02 Score=24.02 Aligned_cols=55 Identities=16% Similarity=0.260 Sum_probs=35.5
Q ss_pred hHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHH----HHHHHHHHHhcCCC
Q psy7032 73 RYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAM----RYELMNVLEKNGMD 127 (136)
Q Consensus 73 ~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISsl----RyEl~e~l~~~gm~ 127 (136)
-.+.++.+||.+.+....-+...-.|++++|++-=+.+..- .-++...|..+|++
T Consensus 48 l~~~~l~~Li~~~Ll~q~A~~~gi~vsd~ev~~~i~~~~~~~~~~~~~~~~~L~~~g~~ 106 (413)
T PRK10770 48 LRHQILERLIMDNIILQMAQKMGVKISDEQLDQAIANIAAQNNMTLDQMRSRLAYDGLN 106 (413)
T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCcCHHHHHHHHHHHHHHCCCCHHHHHHHHHHcCCC
Confidence 34678999999999987777777788888886633333220 11344555566653
No 35
>PF05121 GvpK: Gas vesicle protein K ; InterPro: IPR007805 Gas vesicles are intracellular, protein-coated, and hollow organelles found in cyanobacteria and halophilic archaea. They are permeable to ambient gases by diffusion and provide buoyancy, enabling cells to move upwards in liquid to access oxygen and/or light. Proteins containing this domain are involved in the formation of gas vesicles [].; GO: 0031412 gas vesicle organization
Probab=29.85 E-value=2.1e+02 Score=20.45 Aligned_cols=53 Identities=13% Similarity=0.328 Sum_probs=41.4
Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHhcCCCCCcc
Q psy7032 76 AVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEKNGMDTSSA 131 (136)
Q Consensus 76 ~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~~gm~~~~~ 131 (136)
+++++|..| ++.|..+...+|++++..+=..+-.+.-.+-++...-|.+..+.
T Consensus 23 ELLRqlmEr---QAiRRme~G~Lse~qiErlG~tLm~Le~~~~~l~~~~gl~~~dL 75 (88)
T PF05121_consen 23 ELLRQLMER---QAIRRMEAGSLSEEQIERLGETLMKLEEAMEELCERFGLTPEDL 75 (88)
T ss_pred HHHHHHHHH---HHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHh
Confidence 455666655 34666788889999999999999999999998888877765543
No 36
>PRK01631 hypothetical protein; Provisional
Probab=29.73 E-value=1.6e+02 Score=20.56 Aligned_cols=40 Identities=18% Similarity=0.351 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHH
Q psy7032 78 MRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLE 122 (136)
Q Consensus 78 Mr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~ 122 (136)
|+.++.|-=.- -+++.+.|+|+++..|-+ .||.|-++.++
T Consensus 1 m~~ii~RINeL-akK~K~~gLT~eE~~Eq~----~LR~eYl~~fR 40 (76)
T PRK01631 1 MKNILFRINEL-SKKEKATGLTVDEKQEQQ----MLRQNYTQTFR 40 (76)
T ss_pred ChhHHHHHHHH-HHHhcccCCCHHHHHHHH----HHHHHHHHHHH
Confidence 45566664333 234447899998866543 34444444444
No 37
>TIGR00496 frr ribosome recycling factor. This model finds only eubacterial proteins. Mitochondrial and/or chloroplast forms might be expected but are not currently known. This protein was previously called ribosome releasing factor. By releasing ribosomes from mRNA at the end of protein biosynthesis, it prevents inappropriate translation from 3-prime regions of the mRNA and frees the ribosome for new rounds of translation. EGAD|53116|YHR038W is part of the frr superfamily.
Probab=29.39 E-value=2.8e+02 Score=21.69 Aligned_cols=48 Identities=13% Similarity=0.145 Sum_probs=33.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHH
Q psy7032 72 FRYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMN 119 (136)
Q Consensus 72 ~~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e 119 (136)
+.+..-+|++=+.+.........+.+++|||+..+..+|..+=-+-..
T Consensus 113 E~aKv~iRniRr~~~~~iKk~~k~~~iseD~~k~~~~~iQkltd~~i~ 160 (176)
T TIGR00496 113 EQAKVAVRNVRRDANDKVKKLEKDKEISEDEERRLQEEIQKLTDEYIK 160 (176)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCCChhHHHHHHHHHHHHHHHHHH
Confidence 455566666666666555544456779999999999999988665443
No 38
>PRK07857 hypothetical protein; Provisional
Probab=29.18 E-value=86 Score=23.00 Aligned_cols=30 Identities=17% Similarity=0.609 Sum_probs=25.2
Q ss_pred CCCCChhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032 95 TNPVTEDDVNEVKGEITAMRYELMNVLEKN 124 (136)
Q Consensus 95 ~~~vtE~dl~EiKQDISslRyEl~e~l~~~ 124 (136)
.++.+..+|.++++.|..+=.||++.|..+
T Consensus 22 ~~p~~~~~L~~lR~eID~ID~eIl~LL~eR 51 (106)
T PRK07857 22 DDPLSDAEIDELREEIDRLDAEILALVKRR 51 (106)
T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 356678999999999999999999888653
No 39
>PF05291 Bystin: Bystin; InterPro: IPR007955 Trophinin and tastin form a cell adhesion molecule complex that potentially mediates an initial attachment of the blastocyst to uterine epithelial cells at the time of implantation. Trophinin and tastin bind to an intermediary cytoplasmic protein called bystin. Bystin may be involved in implantation and trophoblast invasion because bystin is found with trophinin and tastin in the cells at human implantation sites and also in the intermediate trophoblasts at invasion front in the placenta from early pregnancy []. This family also includes the Saccharomyces cerevisiae protein ENP1. ENP1 is an essential protein in S. cerevisiae and is localised in the nucleus []. It is thought that ENP1 plays a direct role in the early steps of rRNA processing as enp1 defective S. cerevisiae cannot synthesise 20S pre-rRNA and hence 18S rRNA, which leads to reduced formation of 40S ribosomal subunits [].
Probab=28.08 E-value=60 Score=28.00 Aligned_cols=34 Identities=21% Similarity=0.377 Sum_probs=27.1
Q ss_pred chhhhhHHHHHhcCCCCCCCCcccccChHHHHHHH
Q psy7032 11 RKDSIKLWLSYFDDSPTLPPPFNIFPTTKMFLKMF 45 (136)
Q Consensus 11 KFaRtkLwmsyf~~~~tlPpPFNliPspk~~~~l~ 45 (136)
-|.....+||-+.- +-||-||.|||+-+.+-.++
T Consensus 50 vY~~vG~~Ls~YrS-GkLPKafKiiP~l~nWEeiL 83 (301)
T PF05291_consen 50 VYTKVGELLSRYRS-GKLPKAFKIIPSLPNWEEIL 83 (301)
T ss_pred HHHHHHHHHhhcCC-CCCCcCeecccCchhHHHHH
Confidence 46667788888884 48999999999988876555
No 40
>PRK10132 hypothetical protein; Provisional
Probab=27.94 E-value=87 Score=22.83 Aligned_cols=26 Identities=23% Similarity=0.509 Sum_probs=21.5
Q ss_pred hhHHHHHHhhHHHHHHHHHHHHHhcC
Q psy7032 100 EDDVNEVKGEITAMRYELMNVLEKNG 125 (136)
Q Consensus 100 E~dl~EiKQDISslRyEl~e~l~~~g 125 (136)
|+++..|++||.+|--++=++|+..|
T Consensus 11 ~~q~e~L~~Dl~~L~~~le~ll~~~~ 36 (108)
T PRK10132 11 DDGVQDIQNDVNQLADSLESVLKSWG 36 (108)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 57788999999999998888887655
No 41
>PF08172 CASP_C: CASP C terminal; InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=27.62 E-value=74 Score=26.34 Aligned_cols=25 Identities=20% Similarity=0.424 Sum_probs=16.0
Q ss_pred hhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032 100 EDDVNEVKGEITAMRYELMNVLEKN 124 (136)
Q Consensus 100 E~dl~EiKQDISslRyEl~e~l~~~ 124 (136)
|+++...+|.|++||.|+-....+|
T Consensus 99 E~elr~~~~~~~~L~~Ev~~L~~DN 123 (248)
T PF08172_consen 99 EEELRKQQQTISSLRREVESLRADN 123 (248)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5667777777777777765544443
No 42
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=27.53 E-value=1.1e+02 Score=20.09 Aligned_cols=21 Identities=14% Similarity=0.411 Sum_probs=16.2
Q ss_pred hHHHHHHhhHHHHHHHHHHHH
Q psy7032 101 DDVNEVKGEITAMRYELMNVL 121 (136)
Q Consensus 101 ~dl~EiKQDISslRyEl~e~l 121 (136)
..+..|.+||.++|-++....
T Consensus 17 ~kvdqLs~dv~~lr~~v~~ak 37 (56)
T PF04728_consen 17 SKVDQLSSDVNALRADVQAAK 37 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 567788888888888886554
No 43
>PRK13723 conjugal transfer pilus assembly protein TraH; Provisional
Probab=27.43 E-value=2.6e+02 Score=25.43 Aligned_cols=51 Identities=8% Similarity=0.147 Sum_probs=33.8
Q ss_pred hHHHHH-HHHHHHHHHHHH----HhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHh
Q psy7032 73 RYAAVM-RALVWRYICLQH----RQADTNPVTEDDVNEVKGEITAMRYELMNVLEK 123 (136)
Q Consensus 73 ~y~~VM-r~LVkRYi~~~q----r~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~ 123 (136)
.|.+++ -.++-.|+...- ....+...+++++.+++++|...|.++.++..+
T Consensus 358 ~~a~~IA~dll~~yL~~~l~~~~~al~~~~~~~~~~~~~~~~l~~a~~~~~~~~~~ 413 (451)
T PRK13723 358 QLTDYIGYDILLQYIQELIQQARAMLATGNYPEAVMDHLRENLNQAQRQIAAFQSQ 413 (451)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344433 345555555443 333355678999999999999999988766543
No 44
>PF09860 DUF2087: Uncharacterized protein conserved in bacteria (DUF2087); InterPro: IPR018656 This domain, found in various hypothetical prokaryotic proteins and transcriptional activators, has no known function.
Probab=26.37 E-value=2.1e+02 Score=19.25 Aligned_cols=25 Identities=24% Similarity=0.425 Sum_probs=19.1
Q ss_pred CCCCChhHHHHHH----hhHHHHHHHHHH
Q psy7032 95 TNPVTEDDVNEVK----GEITAMRYELMN 119 (136)
Q Consensus 95 ~~~vtE~dl~EiK----QDISslRyEl~e 119 (136)
....||.|||++= .|-+.||-+|.|
T Consensus 27 g~~y~E~EVN~~L~~~~~D~a~LRR~LVd 55 (71)
T PF09860_consen 27 GREYSEKEVNEILKRFFDDYATLRRYLVD 55 (71)
T ss_pred CCccCHHHHHHHHHHHcccHHHHHHHHHH
Confidence 4556888888764 477899999876
No 45
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=26.03 E-value=68 Score=24.15 Aligned_cols=24 Identities=21% Similarity=0.408 Sum_probs=19.6
Q ss_pred CChhHHHHHHhhHHHHHHHHHHHH
Q psy7032 98 VTEDDVNEVKGEITAMRYELMNVL 121 (136)
Q Consensus 98 vtE~dl~EiKQDISslRyEl~e~l 121 (136)
-+.+|+.+++.|++.+++++-.+-
T Consensus 86 ~i~~eV~~v~~dv~~i~~dv~~v~ 109 (126)
T PF07889_consen 86 QIKDEVTEVREDVSQIGDDVDSVQ 109 (126)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHH
Confidence 357899999999999999886554
No 46
>TIGR02698 CopY_TcrY copper transport repressor, CopY/TcrY family. This family includes metal-fist type transcriptional repressors of copper transport systems such as copYZAB of Enterococcus hirae and tcrYAZB (transferble copper resistance) of an Enterocuccus faecium plasmid. High levels of copper can displace zinc and prevent binding by the repressor, activating efflux by copper resistance transporters. The most closely related proteins excluded by this model are antibiotic resistance regulators including the methicillin resistance regulatory protein MecI.
Probab=25.97 E-value=2.6e+02 Score=20.42 Aligned_cols=40 Identities=10% Similarity=0.165 Sum_probs=24.9
Q ss_pred hhHHHHHHHHHHHHHHH----HH-HhhcCCCCChhHHHHHHhhHH
Q psy7032 72 FRYAAVMRALVWRYICL----QH-RQADTNPVTEDDVNEVKGEIT 111 (136)
Q Consensus 72 ~~y~~VMr~LVkRYi~~----~q-r~~e~~~vtE~dl~EiKQDIS 111 (136)
...+..++.++.++.-- .- .=.+++.++++|+.+|+|=|.
T Consensus 71 e~~~~~~~~~~~~~f~gs~~~ll~~l~~~~~ls~eele~L~~li~ 115 (130)
T TIGR02698 71 EAVENAAQELFSRICSRKVGAVIADLIEESPLSQTDIEKLEKLLS 115 (130)
T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHhcCCCCHHHHHHHHHHHH
Confidence 34466777777766321 11 112357799999999998443
No 47
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=25.72 E-value=1.2e+02 Score=17.10 Aligned_cols=18 Identities=17% Similarity=0.440 Sum_probs=14.5
Q ss_pred cCCCCChhHHHHHHhhHH
Q psy7032 94 DTNPVTEDDVNEVKGEIT 111 (136)
Q Consensus 94 e~~~vtE~dl~EiKQDIS 111 (136)
+...+|++|..+.|+.|-
T Consensus 13 ~~G~IseeEy~~~k~~ll 30 (31)
T PF09851_consen 13 DKGEISEEEYEQKKARLL 30 (31)
T ss_pred HcCCCCHHHHHHHHHHHh
Confidence 456789999999998773
No 48
>TIGR03575 selen_PSTK_euk L-seryl-tRNA(Sec) kinase, eukaryotic. Members of this protein are L-seryl-tRNA(Sec) kinase. This enzyme is part of a two-step pathway in Eukaryota and Archaea for performing selenocysteine biosynthesis by changing serine misacylated on selenocysteine-tRNA to selenocysteine. This enzyme performs the first step, phosphorylation of the OH group of the serine side chain. This family represents eukaryotic proteins with this activity.
Probab=25.61 E-value=2.6e+02 Score=24.11 Aligned_cols=47 Identities=6% Similarity=0.217 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHhcC
Q psy7032 75 AAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEKNG 125 (136)
Q Consensus 75 ~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~~g 125 (136)
..+.+..|...+.+.+. +.++..+++.+-+.++.+|.|+|+.++...
T Consensus 259 D~~tr~ii~~i~~~~k~----~~~~~~~~~~~~~~l~~~rk~~L~~~r~~~ 305 (340)
T TIGR03575 259 DQTLRRIISQTMREAKD----EQASAYNLKLLAEELNKLKADFLEDLRQGN 305 (340)
T ss_pred HHHHHHHHHHHHHHHHh----ccCChHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 34666777776654332 236788999999999999999999998654
No 49
>PF04353 Rsd_AlgQ: Regulator of RNA polymerase sigma(70) subunit, Rsd/AlgQ; InterPro: IPR007448 This family includes bacterial transcriptional regulators that are thought to act through an interaction with the conserved region 4 of the sigma(70) subunit of RNA polymerase. The Pseudomonas aeruginosa homologue, AlgQ, positively regulates virulence gene expression and is associated with the mucoid phenotype observed in P. aeruginosa isolates from cystic fibrosis patients.; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 2P7V_A.
Probab=25.04 E-value=1.9e+02 Score=22.55 Aligned_cols=38 Identities=16% Similarity=0.275 Sum_probs=21.9
Q ss_pred cccchhhhhHHHHHhcCCCCCCCC--cccccChHHHHHHHh
Q psy7032 8 DKMRKDSIKLWLSYFDDSPTLPPP--FNIFPTTKMFLKMFG 46 (136)
Q Consensus 8 ~eWKFaRtkLwmsyf~~~~tlPpP--FNliPspk~~~~l~~ 46 (136)
.-|--+|-+|+..|..-. -+||. =+-.|+++.+..+|.
T Consensus 20 D~WL~~Rq~Llv~Yc~L~-gl~p~~~~~~~~~~~~l~~FCq 59 (153)
T PF04353_consen 20 DRWLEERQQLLVAYCKLA-GLKPYKESLTLPSEEALQNFCQ 59 (153)
T ss_dssp HHHHHHHHHHHHHHHHHH-S-------------HHHHHHHH
T ss_pred HHHHHHHHHHHHHHHhHh-cCCCCccccCCCCHHHHHHHHH
Confidence 358899999999999743 24443 467899999988885
No 50
>PF14838 INTS5_C: Integrator complex subunit 5 C-terminus
Probab=24.77 E-value=86 Score=30.02 Aligned_cols=81 Identities=15% Similarity=0.181 Sum_probs=48.7
Q ss_pred ccccchhhhhHHHHHhcCCCCCCCCcccc------cChHHHHHHHhhhhhhcccc--ccHHHH--HH-----HH--Hhhh
Q psy7032 7 IDKMRKDSIKLWLSYFDDSPTLPPPFNIF------PTTKMFLKMFGLRQIDKMRK--DSIKRK--QQ-----AR--QEKE 69 (136)
Q Consensus 7 ~~eWKFaRtkLwmsyf~~~~tlPpPFNli------Pspk~~~~l~~~~~~~~~~~--~s~~r~--~~-----~~--~~~~ 69 (136)
+--|-+..|..|+.-+..|..||||++.+ -+|.-+++++.-......+. ++...- +. .+ ....
T Consensus 597 ~~~~~l~~t~~Lv~lm~~gqlLPppL~~~~el~~~ltp~Ei~~lL~~cIW~y~kdh~Psp~~f~~~~~~g~~~RD~~~~~ 676 (696)
T PF14838_consen 597 SSPWHLEATCKLVELMAKGQLLPPPLSYIHELFPYLTPHEIYLLLLSCIWNYMKDHVPSPALFVFNDETGLFWRDFSREK 676 (696)
T ss_pred CcHHHHHHHHHHHHHHHhcCcCCchHHHHHHHHHhcCHHHHHHHHHHHHHHHHhhCCCCHHHHhcCcccCccccCccccc
Confidence 34577788999999999999999999976 45666666553211111110 111000 00 00 0111
Q ss_pred hhhhHHHHHHHHHHHHHH
Q psy7032 70 RDFRYAAVMRALVWRYIC 87 (136)
Q Consensus 70 ~~~~y~~VMr~LVkRYi~ 87 (136)
....|.++.+.++.|-|.
T Consensus 677 ~~~~y~~~lr~v~h~nI~ 694 (696)
T PF14838_consen 677 PPEKYTETLRSVLHRNID 694 (696)
T ss_pred ChhHHHHHHHHHHHhhhh
Confidence 457899999999988774
No 51
>PRK12807 flagellin; Provisional
Probab=24.16 E-value=3.5e+02 Score=22.28 Aligned_cols=32 Identities=6% Similarity=0.217 Sum_probs=27.4
Q ss_pred hcCCCCChhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032 93 ADTNPVTEDDVNEVKGEITAMRYELMNVLEKN 124 (136)
Q Consensus 93 ~e~~~vtE~dl~EiKQDISslRyEl~e~l~~~ 124 (136)
.-+...+.+|..-|.++|..|+-+|..+..+.
T Consensus 97 a~ngt~s~~dr~ai~~Ei~~l~~~i~~~a~~t 128 (287)
T PRK12807 97 SSNGTNTAENQSALQKEFAELQEQIDYIAKNT 128 (287)
T ss_pred HccCCCCHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 44566789999999999999999999998643
No 52
>KOG3870|consensus
Probab=23.86 E-value=1.4e+02 Score=27.05 Aligned_cols=21 Identities=19% Similarity=0.428 Sum_probs=18.3
Q ss_pred ChhHHHHHHhhHHHHHHHHHH
Q psy7032 99 TEDDVNEVKGEITAMRYELMN 119 (136)
Q Consensus 99 tE~dl~EiKQDISslRyEl~e 119 (136)
-+.|+++|...++.||||++.
T Consensus 59 ~~~d~k~ii~~l~~Lr~ei~~ 79 (434)
T KOG3870|consen 59 VVEDGKQIIKLLSKLRYEIQT 79 (434)
T ss_pred HHHHHHHHHHHHHHHHHHHhc
Confidence 368899999999999999964
No 53
>KOG0289|consensus
Probab=22.58 E-value=43 Score=30.65 Aligned_cols=30 Identities=20% Similarity=0.413 Sum_probs=21.8
Q ss_pred HHHHHHHHHHHHHHhhc-CCCCChhHHHHHH
Q psy7032 78 MRALVWRYICLQHRQAD-TNPVTEDDVNEVK 107 (136)
Q Consensus 78 Mr~LVkRYi~~~qr~~e-~~~vtE~dl~EiK 107 (136)
=|+|+.+||...-.+-= +++++++||-+||
T Consensus 24 EkrLIEqyI~e~G~DPIt~~pLs~eelV~Ik 54 (506)
T KOG0289|consen 24 EKRLIEQYIAETGKDPITNEPLSIEELVEIK 54 (506)
T ss_pred HHHHHHHHHHHcCCCCCCCCcCCHHHeeecc
Confidence 37899999977555432 6677888887765
No 54
>PF15397 DUF4618: Domain of unknown function (DUF4618)
Probab=22.40 E-value=1.2e+02 Score=25.50 Aligned_cols=19 Identities=26% Similarity=0.587 Sum_probs=12.9
Q ss_pred hHHHHHHhhHHHHHHHHHH
Q psy7032 101 DDVNEVKGEITAMRYELMN 119 (136)
Q Consensus 101 ~dl~EiKQDISslRyEl~e 119 (136)
+++.+++.+|+-||.||-.
T Consensus 200 e~i~el~e~I~~L~~eV~~ 218 (258)
T PF15397_consen 200 EEIDELEEEIPQLRAEVEQ 218 (258)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 5667777777777777633
No 55
>cd00520 RRF Ribosome recycling factor (RRF). Ribosome recycling factor dissociates the posttermination complex, composed of the ribosome, deacylated tRNA, and mRNA, after termination of translation. Thus ribosomes are "recycled" and ready for another round of protein synthesis. RRF is believed to bind the ribosome at the A-site in a manner that mimics tRNA, but the specific mechanisms remain unclear. RRF is essential for bacterial growth. It is not necessary for cell growth in archaea or eukaryotes, but is found in mitochondria or chloroplasts of some eukaryotic species.
Probab=22.13 E-value=3.8e+02 Score=20.81 Aligned_cols=49 Identities=10% Similarity=0.159 Sum_probs=34.8
Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHH
Q psy7032 71 DFRYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMN 119 (136)
Q Consensus 71 ~~~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e 119 (136)
-+.+..-+|++=+.+...........+++|||+.-...+|..+=-+-..
T Consensus 117 ~E~~Kv~iRniR~~~~~~lKk~~k~~~iseD~~k~~~~~iqkltd~~i~ 165 (179)
T cd00520 117 AEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTDEYIK 165 (179)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCCchHHHHHHHHHHHHHHHHHH
Confidence 3456666777766666665555556678999999999999887665443
No 56
>PF14389 Lzipper-MIP1: Leucine-zipper of ternary complex factor MIP1
Probab=22.04 E-value=1.1e+02 Score=21.32 Aligned_cols=20 Identities=15% Similarity=0.395 Sum_probs=16.8
Q ss_pred hhHHHHHHhhHHHHHHHHHH
Q psy7032 100 EDDVNEVKGEITAMRYELMN 119 (136)
Q Consensus 100 E~dl~EiKQDISslRyEl~e 119 (136)
|.||.-+.|.+++|++.+.+
T Consensus 67 E~eV~~LE~~v~~L~~~l~~ 86 (88)
T PF14389_consen 67 EAEVAKLEQKVLSLYRQLFQ 86 (88)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 78888888999999888765
No 57
>TIGR01801 CM_A chorismate mutase domain of gram positive AroA protein. This model represents a small clade of chorismate mutase domains N-terminally fused to the first enzyme in the chorismate pathway, 2-dehydro-3-deoxyphosphoheptanoate aldolase (DAHP synthetase, AroA) which are found in some gram positive species and Deinococcus. Only in Deinococcus, where this domain is the sole CM domain in the genome can a trusted assignment of function be made. In the other species there is at least one other trusted CM domain present. The similarity between the Deinococcus gene and the others in this clade is sufficiently strong (~44% identity), that the whole clade can be trusted to be functional. The possibility exists, however, that in the gram positive species the fusion to the first enzyme in the pathway has evolved a separate, regulatory role.
Probab=21.41 E-value=1.3e+02 Score=21.47 Aligned_cols=26 Identities=4% Similarity=0.473 Sum_probs=20.7
Q ss_pred ChhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032 99 TEDDVNEVKGEITAMRYELMNVLEKN 124 (136)
Q Consensus 99 tE~dl~EiKQDISslRyEl~e~l~~~ 124 (136)
|+.++.++++.|..+=.||++.|..+
T Consensus 2 ~~~~L~~lR~~ID~ID~eIl~LL~eR 27 (102)
T TIGR01801 2 SNQSLEDLRAEVDQLNRQILALISRR 27 (102)
T ss_pred CcchHHHHHHHHHHHHHHHHHHHHHH
Confidence 45578899999999998888877643
No 58
>PF01988 VIT1: VIT family; InterPro: IPR008217 Proteins containing this entry have no known function and are predicted to be integral membrane proteins. They include the Ccc1 protein from Saccharomyces cerevisiae (Baker's yeast) (P47818 from SWISSPROT) that may have a role in regulating calcium levels [].
Probab=21.10 E-value=93 Score=24.52 Aligned_cols=16 Identities=25% Similarity=0.557 Sum_probs=9.9
Q ss_pred CCChhHHHHHHhhHHH
Q psy7032 97 PVTEDDVNEVKGEITA 112 (136)
Q Consensus 97 ~vtE~dl~EiKQDISs 112 (136)
|++++|..++-.++++
T Consensus 93 Gl~~~~a~~i~~~l~~ 108 (213)
T PF01988_consen 93 GLSEEDAEEIAEELSK 108 (213)
T ss_pred CCCHHHHHHHHHHHHh
Confidence 5666666666666555
No 59
>PF14164 YqzH: YqzH-like protein
Probab=21.02 E-value=2.7e+02 Score=18.74 Aligned_cols=18 Identities=17% Similarity=0.431 Sum_probs=7.9
Q ss_pred cCCCCChhHHHHHHhhHH
Q psy7032 94 DTNPVTEDDVNEVKGEIT 111 (136)
Q Consensus 94 e~~~vtE~dl~EiKQDIS 111 (136)
+..+++.+|.++|.+-|.
T Consensus 22 ~~~pls~~E~~~L~~~i~ 39 (64)
T PF14164_consen 22 ECMPLSDEEWEELCKHIQ 39 (64)
T ss_pred cCCCCCHHHHHHHHHHHH
Confidence 344444444444444443
No 60
>PRK09269 chorismate mutase; Provisional
Probab=20.80 E-value=4.3e+02 Score=21.11 Aligned_cols=45 Identities=11% Similarity=0.290 Sum_probs=32.3
Q ss_pred HHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHh
Q psy7032 79 RALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEK 123 (136)
Q Consensus 79 r~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~ 123 (136)
+.+-.+|+..-....+....+-.||.+++..|..|--+|+..+..
T Consensus 101 k~iQ~~~~a~W~~~~~~~~~~~~dL~~~Rp~l~~L~~~il~~l~~ 145 (193)
T PRK09269 101 KLVQYALLARWRLAGAAPPGPRPDLASIRPRLDRLQQELLDALAD 145 (193)
T ss_pred HHHHHHHHHHHHhCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 344456665555443434446789999999999999999988875
No 61
>PHA02554 13 neck protein; Provisional
Probab=20.59 E-value=2e+02 Score=25.02 Aligned_cols=42 Identities=21% Similarity=0.220 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHHhh-----------c-CCCCCh---hHHHHHHhhHHHHHHHHHHH
Q psy7032 79 RALVWRYICLQHRQA-----------D-TNPVTE---DDVNEVKGEITAMRYELMNV 120 (136)
Q Consensus 79 r~LVkRYi~~~qr~~-----------e-~~~vtE---~dl~EiKQDISslRyEl~e~ 120 (136)
-+.||+|.++.-++. + -.|||- .=+.|-.|||..||.||+..
T Consensus 246 nrWvKdYaTAL~Ke~wGq~L~K~qGmqLPGGITldG~~i~eeA~~eie~Lr~eL~~~ 302 (311)
T PHA02554 246 NRWVKDYATALVKEQWGQILAKHQGMQLPGGVTLDGRRLIEEARQEKEKLREELLLL 302 (311)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHccCcccCCceeechHhHHHHHHHHHHHHHHHHHhc
Confidence 468888888753321 1 245653 44678899999999999754
No 62
>PF01152 Bac_globin: Bacterial-like globin; InterPro: IPR001486 Globins are haem-containing proteins involved in binding and/or transporting oxygen. They belong to a very large and well studied family that is widely distributed in many organisms []. Globins have evolved from a common ancestor and can be divided into three groups: single-domain globins, and two types of chimeric globins, flavohaemoglobins and globin-coupled sensors. Bacteria have all three types of globins, while archaea lack flavohaemoglobins, and eukaryotes lack globin-coupled sensors []. Several functionally different haemoglobins can coexist in the same species. The major types of globins include: Haemoglobin (Hb): trimer of two alpha and two beta chains, although embryonic and foetal forms can substitute the alpha or beta chain for ones with higher oxygen affinity, such as gamma, delta, epsilon or zeta chains. Hb transports oxygen from lungs to other tissues in vertebrates []. Hb proteins are also present in unicellular organisms where they act as enzymes or sensors []. Myoglobin (Mb): monomeric protein responsible for oxygen storage in vertebrate muscle []. Neuroglobin: a myoglobin-like haemprotein expressed in vertebrate brain and retina, where it is involved in neuroprotection from damage due to hypoxia or ischemia []. Neuroglobin belongs to a branch of the globin family that diverged early in evolution. Cytoglobin: an oxygen sensor expressed in multiple tissues. Related to neuroglobin []. Erythrocruorin: highly cooperative extracellular respiratory proteins found in annelids and arthropods that are assembled from as many as 180 subunit into hexagonal bilayers []. Leghaemoglobin (legHb or symbiotic Hb): occurs in the root nodules of leguminous plants, where it facilitates the diffusion of oxygen to symbiotic bacteriods in order to promote nitrogen fixation. Non-symbiotic haemoglobin (NsHb): occurs in non-leguminous plants, and can be over-expressed in stressed plants []. Flavohaemoglobins (FHb): chimeric, with an N-terminal globin domain and a C-terminal ferredoxin reductase-like NAD/FAD-binding domain. FHb provides protection against nitric oxide via its C-terminal domain, which transfers electrons to haem in the globin []. Globin-coupled sensors: chimeric, with an N-terminal myoglobin-like domain and a C-terminal domain that resembles the cytoplasmic signalling domain of bacterial chemoreceptors. They bind oxygen, and act to initiate an aerotactic response or regulate gene expression [, ]. Protoglobin: a single domain globin found in archaea that is related to the N-terminal domain of globin-coupled sensors []. Truncated 2/2 globin: lack the first helix, giving them a 2-over-2 instead of the canonical 3-over-3 alpha-helical sandwich fold. Can be divided into three main groups (I, II and II) based on structural features []. This entry represents a group of haemoglobin-like proteins found in eubacteria, cyanobacteria, protozoa, algae and plants, but not in animals or yeast. These proteins have a truncated 2-over-2 rather than the canonical 3-over-3 alpha-helical sandwich fold []. This entry includes: HbN (or GlbN): a truncated haemoglobin-like protein that binds oxygen cooperatively with a very high affinity and a slow dissociation rate, which may exclude it from oxygen transport. It appears to be involved in bacterial nitric oxide detoxification and in nitrosative stress []. Cyanoglobin (or GlbN): a truncated haemoprotein found in cyanobacteria that has high oxygen affinity, and which appears to serve as part of a terminal oxidase, rather than as a respiratory pigment []. HbO (or GlbO): a truncated haemoglobin-like protein with a lower oxygen affinity than HbN. HbO associates with the bacterial cell membrane, where it significantly increases oxygen uptake over membranes lacking this protein. HbO appears to interact with a terminal oxidase, and could participate in an oxygen/electron-transfer process that facilitates oxygen transfer during aerobic metabolism []. Glb3: a nuclear-encoded truncated haemoglobin from plants that appears more closely related to HbO than HbN. Glb3 from Arabidopsis thaliana (Mouse-ear cress) exhibits an unusual concentration-independent binding of oxygen and carbon dioxide []. ; GO: 0019825 oxygen binding, 0015671 oxygen transport; PDB: 2BKM_B 1UVY_A 1DLW_A 2XYK_B 2IG3_A 2GKM_B 1S61_A 1S56_B 1RTE_B 2GLN_A ....
Probab=20.37 E-value=2.5e+02 Score=19.51 Aligned_cols=43 Identities=19% Similarity=0.329 Sum_probs=32.9
Q ss_pred HHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHH
Q psy7032 77 VMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMN 119 (136)
Q Consensus 77 VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e 119 (136)
|......+++...+..-++.+|.+.+++++-+-+.++|..+..
T Consensus 77 it~~~f~~~~~~~~~al~~~~v~~~~~~~~~~~~~~~~~~i~n 119 (120)
T PF01152_consen 77 ITEEHFDRWLELLKQALDELGVPEELIDELLARLESLRDDIVN 119 (120)
T ss_dssp BBHHHHHHHHHHHHHHHHHTTCTHHHHHHHHHHHHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHhcC
Confidence 3344455555566667778899999999999999999998864
No 63
>PF06757 Ins_allergen_rp: Insect allergen related repeat, nitrile-specifier detoxification; InterPro: IPR010629 This entry represents several insect specific allergen repeats. These repeats are commonly found in various proteins from cockroaches, fruit flies and mosquitos. It has been suggested that the repeat sequences have evolved by duplication of an ancestral amino acid domain, which may have arisen from the mitochondrial energy transfer proteins []. This family exemplifies a case of novel gene evolution. The case in point is the arms-race between plants and their infective insective herbivores in the area of the glucosinolate-myrosinase system. Brassicas have developed the glucosinolate-myrosinase system as chemical defence mechanism against the insects, and consequently the insects have adapted to produce a detoxifying molecule, nitrile-specifier protein (NSP). NSP is present in the Pieris rapae (Cabbage white butterfly). NSP is structurally different from and has no amino acid homology to any known detoxifying enzymes, and it appears to have arisen by a process of domain and gene duplication of a sequence of unknown function that is widespread in insect species and referred to as insect-allergen-repeat protein. Thus this family is found either as a single domain or as a multiple repeat-domain [].
Probab=20.16 E-value=95 Score=23.83 Aligned_cols=53 Identities=21% Similarity=0.322 Sum_probs=34.2
Q ss_pred HHHHHHHHHHH-HHHhhcCCCCChhHHHHHHhhHHHHHH--HHHHHHHhcCCCCCc
Q psy7032 78 MRALVWRYICL-QHRQADTNPVTEDDVNEVKGEITAMRY--ELMNVLEKNGMDTSS 130 (136)
Q Consensus 78 Mr~LVkRYi~~-~qr~~e~~~vtE~dl~EiKQDISslRy--El~e~l~~~gm~~~~ 130 (136)
+..|+.+|+.. .+-.+--+=++-.+++++.+++.++-- ++++.|+++|.++..
T Consensus 20 i~~i~~~Y~~~D~efq~~~~yl~s~~f~~l~~~l~~~pE~~~l~~yL~~~gldv~~ 75 (179)
T PF06757_consen 20 IQDIVQRYYLEDAEFQAAVRYLNSSEFKQLWQQLEALPEVKALLDYLESAGLDVYY 75 (179)
T ss_pred HHHHHHHHHHcCHHHHHHHHHHcChHHHHHHHHHHcCHHHHHHHHHHHHCCCCHHH
Confidence 67788888542 222211122234578899988887542 788888999988765
Done!