Query 048151
Match_columns 164
No_of_seqs 115 out of 1108
Neff 9.0
Searched_HMMs 46136
Date Fri Mar 29 06:46:27 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/048151.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/048151hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd05381 SCP_PR-1_like SCP_PR-1 100.0 1.5E-41 3.3E-46 236.7 16.1 133 30-164 2-136 (136)
2 cd05384 SCP_PRY1_like SCP_PRY1 100.0 1E-37 2.2E-42 215.6 14.4 128 27-160 1-129 (129)
3 cd05382 SCP_GAPR-1_like SCP_GA 100.0 5.4E-37 1.2E-41 212.5 12.0 128 27-158 1-132 (132)
4 cd05383 SCP_CRISP SCP_CRISP: S 100.0 7.4E-35 1.6E-39 203.5 13.3 122 28-155 2-138 (138)
5 smart00198 SCP SCP / Tpx-1 / A 100.0 1.3E-34 2.9E-39 203.2 12.9 122 28-155 2-144 (144)
6 cd05385 SCP_GLIPR-1_like SCP_G 100.0 2.6E-34 5.7E-39 202.1 13.2 121 27-155 1-144 (144)
7 cd00168 SCP SCP: SCP-like extr 100.0 5.4E-33 1.2E-37 190.1 12.5 116 30-153 2-122 (122)
8 cd05559 SCP_HrTT-1 SCP_HrTT-1: 100.0 6.6E-33 1.4E-37 193.2 11.9 119 29-153 1-136 (136)
9 KOG3017 Defense-related protei 100.0 4.5E-33 9.8E-38 209.0 9.4 132 27-164 40-198 (225)
10 cd05380 SCP_euk SCP_euk: SCP-l 100.0 1.8E-31 4E-36 186.7 11.0 119 29-153 1-144 (144)
11 PF00188 CAP: Cysteine-rich se 99.9 2.5E-21 5.4E-26 129.7 11.1 114 33-152 1-124 (124)
12 TIGR02909 spore_YkwD uncharact 99.8 1.5E-19 3.3E-24 124.5 12.1 106 27-152 3-125 (127)
13 cd05379 SCP_bacterial SCP_bact 99.7 6.9E-16 1.5E-20 104.6 10.7 103 30-152 2-121 (122)
14 COG2340 Uncharacterized protei 99.3 1.5E-11 3.3E-16 91.3 8.8 98 26-141 78-192 (207)
15 PF11054 Surface_antigen: Spor 92.5 1.2 2.5E-05 33.8 8.0 129 29-164 35-218 (254)
16 PF15240 Pro-rich: Proline-ric 52.9 8.5 0.00018 28.0 1.4 18 7-24 2-19 (179)
17 KOG0286 G-protein beta subunit 52.7 9.3 0.0002 30.1 1.6 34 123-158 78-111 (343)
18 PF04202 Mfp-3: Foot protein 3 49.9 18 0.00038 21.8 2.2 24 1-24 1-24 (71)
19 PF11254 DUF3053: Protein of u 45.9 66 0.0014 24.4 5.2 50 1-53 1-53 (229)
20 PF08105 Antimicrobial10: Metc 45.2 28 0.0006 19.6 2.4 24 1-24 1-24 (52)
21 PF04648 MF_alpha: Yeast matin 44.5 12 0.00027 14.9 0.7 6 159-164 8-13 (13)
22 PF02402 Lysis_col: Lysis prot 43.4 12 0.00026 20.6 0.7 23 1-23 1-23 (46)
23 PHA03066 Hypothetical protein; 26.0 1.7E+02 0.0037 19.5 4.1 22 1-22 1-22 (110)
24 PF08138 Sex_peptide: Sex pept 24.8 24 0.00053 20.2 0.0 14 1-14 1-14 (56)
25 PF15284 PAGK: Phage-encoded v 24.8 74 0.0016 18.8 2.0 17 1-17 1-17 (61)
26 KOG4228 Protein tyrosine phosp 24.6 83 0.0018 29.4 3.2 38 117-155 619-656 (1087)
27 TIGR03044 PS_II_psb27 photosys 24.1 2.4E+02 0.0052 19.6 4.7 21 26-46 35-55 (135)
28 KOG4326 Mitochondrial F1F0-ATP 23.3 1.3E+02 0.0029 18.4 3.0 45 20-66 23-67 (81)
29 PF03295 Pox_TAA1: Poxvirus tr 22.5 79 0.0017 18.7 1.8 19 27-45 24-42 (63)
30 COG3026 RseB Negative regulato 21.6 1.4E+02 0.0031 23.6 3.6 37 7-44 6-42 (320)
31 PF08194 DIM: DIM protein; In 21.3 99 0.0021 16.2 1.9 6 1-6 1-6 (36)
32 PF14276 DUF4363: Domain of un 20.3 2.7E+02 0.0058 18.3 5.1 46 7-56 4-49 (121)
33 KOG4439 RNA polymerase II tran 20.0 1.1E+02 0.0024 27.5 2.9 40 27-66 783-843 (901)
No 1
>cd05381 SCP_PR-1_like SCP_PR-1_like: SCP-like extracellular protein domain, PR-1 like subfamily. The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), which accumulates after infections with pathogens, and may act as an anti-fungal agent or be involved in cell wall loosening. It also includes CRISPs, mammalian cysteine-rich secretory proteins, and allergen 5 from vespid venom. It has been proposed that SCP domains may function as endopeptidases.
Probab=100.00 E-value=1.5e-41 Score=236.73 Aligned_cols=133 Identities=51% Similarity=1.046 Sum_probs=121.8
Q ss_pred HHHHHHHHHHHhhcCCCCCccccHHHHHHHHHHHHhhhccCccccCCCCccceEEeecCC-CCHHHHHHHHHhccccCcC
Q 048151 30 QRYVHLHNEARRNVGIGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVSHYGENLAWADYD-FTVDHIVKMWVDEKQFYDY 108 (164)
Q Consensus 30 ~~il~~hN~~R~~~~~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~~~gen~~~~~~~-~~~~~~v~~W~~e~~~y~~ 108 (164)
+.||+.||.+|++++++ +|+||++|++.||.+|++|+..|...|+...+|||+++..+. ..++++|+.|++|...|++
T Consensus 2 ~~il~~hN~~R~~~~~~-~L~Wd~~La~~A~~~a~~~~~~c~~~~~~~~~GeNi~~~~~~~~~~~~~v~~W~~e~~~y~~ 80 (136)
T cd05381 2 QDFLDAHNAARAAVGVP-PLKWDDTLAAYAQRYANQRRGDCALVHSNGPYGENLFWGSGGNWSAADAVASWVSEKKYYDY 80 (136)
T ss_pred hHHHHHHHHHHHhcCCC-cceECHHHHHHHHHHHHHhcCCCCcccCCCCCCceEEEecCCCCCHHHHHHHHHhccccCCC
Confidence 68999999999999999 999999999999999998888899988877899999987643 5789999999999999999
Q ss_pred CCCCCCCCCcchHHHHHHHHcCCeeeEEEEEeCC-CCEEEEEEEecCCCCCCCCCCC
Q 048151 109 NSNTCAPNQMCGHYTQVVWRKSVRLGCAKERCNN-NHQFIAICNYDPPGNAAGERPF 164 (164)
Q Consensus 109 ~~~~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~~-~~~~~~vC~Y~p~gn~~g~~~Y 164 (164)
..+.+..+..++||+||||+++++||||++.|.+ ++.++ ||+|+|+||+.|++||
T Consensus 81 ~~~~~~~~~~~~hftq~vw~~t~~vGCa~~~c~~~~~~~v-vC~Y~p~gn~~g~~~Y 136 (136)
T cd05381 81 DSNTCAAGKMCGHYTQVVWRNTTRVGCARVTCDNGGGVFI-ICNYDPPGNYIGQRPY 136 (136)
T ss_pred CCCCcCCCccchHHHHHHHHhcCEeceEEEEeCCCCcEEE-EEEeeCCCCCCCCCCC
Confidence 8887777778999999999999999999999987 45777 9999999999999998
No 2
>cd05384 SCP_PRY1_like SCP_PRY1_like: SCP-like extracellular protein domain, PRY1-like sub-family restricted to fungi. The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, and allergen 5 from vespid venom. It has been proposed that SCP domains may function as endopeptidases. PRY1 is a yeast protein that is up-regulated in core ESCRT mutants. This PRY1-like group also contains fruiting body proteins SC7/14 from Schizophyllum commune.
Probab=100.00 E-value=1e-37 Score=215.57 Aligned_cols=128 Identities=34% Similarity=0.672 Sum_probs=112.9
Q ss_pred HHHHHHHHHHHHHHhhcCCCCCccccHHHHHHHHHHHHhhhccCccccCCCCccceEEeecCCCCHHHHHHHHHhccccC
Q 048151 27 ATQQRYVHLHNEARRNVGIGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVSHYGENLAWADYDFTVDHIVKMWVDEKQFY 106 (164)
Q Consensus 27 ~~~~~il~~hN~~R~~~~~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~~~gen~~~~~~~~~~~~~v~~W~~e~~~y 106 (164)
++++.||+.||.+|+.++++ +|+||++|+..||.||++|+..|.+.|++..+|||++...+ .++++|+.|++|.+.|
T Consensus 1 ~~~~~iL~~hN~~R~~~g~~-~L~w~~~La~~A~~~a~~c~~~~~~~~~~~~~geNi~~~~~--~~~~~v~~W~~e~~~y 77 (129)
T cd05384 1 SFASSILDAHNSKRALHGVQ-PLTWNNTLAEYAQDYANSYDCSGNLAHSGGPYGENLAAGYP--SGTSAVDAWYDEIEDY 77 (129)
T ss_pred CHHHHHHHHHHHHHHHcCCC-cCccCHHHHHHHHHHHHHhccCCceecCCCCCCcEEEEecC--CHHHHHHHHHhhhhhC
Confidence 36899999999999999999 99999999999999999666566688888889999987653 6889999999999999
Q ss_pred cCCCCCCCCCCcchHHHHHHHHcCCeeeEEEEEeCCC-CEEEEEEEecCCCCCCC
Q 048151 107 DYNSNTCAPNQMCGHYTQVVWRKSVRLGCAKERCNNN-HQFIAICNYDPPGNAAG 160 (164)
Q Consensus 107 ~~~~~~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~~~-~~~~~vC~Y~p~gn~~g 160 (164)
+++.+.+ +..++||+||||+++++||||++.|+.+ ..++ ||+|+|+||+.|
T Consensus 78 ~~~~~~~--~~~~~h~tqmvw~~t~~vGCa~~~c~~~~~~~~-vC~Y~p~Gn~~g 129 (129)
T cd05384 78 DYSNPGF--SEATGHFTQLVWKSTTQVGCAYKDCGGAWGWYI-VCEYDPAGNVIG 129 (129)
T ss_pred CCCCCCC--CCcccchhhhhhhccceeeeEEEEeCCCCeEEE-EEEEECCCCCCc
Confidence 9977543 4569999999999999999999999873 4667 999999999876
No 3
>cd05382 SCP_GAPR-1_like SCP_GAPR-1_like: SCP-like extracellular protein domain, golgi-associated plant pathogenesis related protein (GAPR)-like sub-family. The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, which combine SCP with a C-terminal cysteine rich domain, and allergen 5 from vespid venom. It has been proposed that SCP domains may function as endopeptidases. The human GAPR-1 protein has been reported to dimerize, and such a dimer may form an active site containing a catalytic triad. GAPR-1 and GLIPR-2 appear to be synonyms.
Probab=100.00 E-value=5.4e-37 Score=212.48 Aligned_cols=128 Identities=34% Similarity=0.596 Sum_probs=113.4
Q ss_pred HHHHHHHHHHHHHHhhcCCCCCccccHHHHHHHHHHHHhhhccCccccCCC-CccceEEeecC---CCCHHHHHHHHHhc
Q 048151 27 ATQQRYVHLHNEARRNVGIGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS-HYGENLAWADY---DFTVDHIVKMWVDE 102 (164)
Q Consensus 27 ~~~~~il~~hN~~R~~~~~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~-~~gen~~~~~~---~~~~~~~v~~W~~e 102 (164)
++++.||+.||.+|+.++++ +|+||++|+..||.||++|+..+.+.|++. .+|||+++..+ ...++++|+.|++|
T Consensus 1 ~~~~~iL~~hN~~R~~~g~~-~L~wd~~La~~A~~~a~~c~~~~~~~h~~~~~~GeN~~~~~~~~~~~~~~~~v~~W~~e 79 (132)
T cd05382 1 DFQKECLDAHNEYRALHGAP-PLKLDKELAKEAQKWAEKLASSGKLQHSSPSGYGENLAYASGSGPDLTGEEAVDSWYNE 79 (132)
T ss_pred CHHHHHHHHHHHHHHHcCCC-cCeeCHHHHHHHHHHHHHhhhcCceeCCCCCCCCceeEEecCCCCCCCHHHHHHHHHhc
Confidence 47899999999999999999 999999999999999997776666788776 59999998763 56889999999999
Q ss_pred cccCcCCCCCCCCCCcchHHHHHHHHcCCeeeEEEEEeCCCCEEEEEEEecCCCCC
Q 048151 103 KQFYDYNSNTCAPNQMCGHYTQVVWRKSVRLGCAKERCNNNHQFIAICNYDPPGNA 158 (164)
Q Consensus 103 ~~~y~~~~~~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~~~~~~~~vC~Y~p~gn~ 158 (164)
...|++..+.. +..++||+||||+++++||||++.|..+..++ ||+|+|+||+
T Consensus 80 ~~~y~~~~~~~--~~~~gh~tqmvw~~t~~vGCa~~~~~~~~~~~-vC~Y~p~Gn~ 132 (132)
T cd05382 80 IKKYDFNKPGF--SSKTGHFTQVVWKSSTELGVGVAKSKKGCVYV-VARYRPAGNV 132 (132)
T ss_pred cccCCCCCCCC--CCCCCCeEEeEecCCCceeeEEEEcCCCCEEE-EEEEeCCCCC
Confidence 99999875443 45699999999999999999999998776778 9999999995
No 4
>cd05383 SCP_CRISP SCP_CRISP: SCP-like extracellular protein domain, CRISP-like sub-family. The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, which combine SCP with a C-terminal cysteine rich domain, and allergen 5 from vespid venom. Involvement of CRISP in response to pathogens, fertilization, and sperm maturation has been proposed. One member, Tex31 from the venom duct of Conus textile, has been shown to possess proteolytic activity sensitive to serine protease inhibitors. SCP has also been proposed to be a Ca++ chelating serine protease. The Ca++-chelating function would fit with various signaling processes that members of this family, such as the CRISPs, are involved in, and is supported by sequence and structural evidence of a conserved pocket containing two histidines and a glutamate. It also may explain how helothermine, a toxic peptide secreted by the beaded lizard, blocks Ca++ t
Probab=100.00 E-value=7.4e-35 Score=203.55 Aligned_cols=122 Identities=35% Similarity=0.723 Sum_probs=107.5
Q ss_pred HHHHHHHHHHHHHhhcC-----CCCCccccHHHHHHHHHHHHhhhccCccccCCC--------CccceEEeecCCCCHHH
Q 048151 28 TQQRYVHLHNEARRNVG-----IGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS--------HYGENLAWADYDFTVDH 94 (164)
Q Consensus 28 ~~~~il~~hN~~R~~~~-----~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~--------~~gen~~~~~~~~~~~~ 94 (164)
.|+.||+.||.+|+.+. |. +|+||++||..||.||+ +|...|++. .+|||++..+.....++
T Consensus 2 ~~~~il~~HN~~R~~~~p~a~~M~-~l~Wd~~La~~A~~~a~----~C~~~~~~~~~~~~~~~~~GeNl~~~~~~~~~~~ 76 (138)
T cd05383 2 VQKEIVDLHNELRRSVNPTASNML-KMEWNEEAAQNAKKWAN----TCNLTHSPPNGRTIGGITCGENIFMSSYPRSWSD 76 (138)
T ss_pred HHHHHHHHHHHHhccCCCCcccCc-ccEeCHHHHHHHHHHHh----cCCCcCCchhhcccCCCCcceeeeccCCCCCHHH
Confidence 47899999999999975 34 79999999999999999 999888753 47999998776667899
Q ss_pred HHHHHHhccccCcCCCCCCCCCCcchHHHHHHHHcCCeeeEEEEEeCCC--CEEEEEEEecCC
Q 048151 95 IVKMWVDEKQFYDYNSNTCAPNQMCGHYTQVVWRKSVRLGCAKERCNNN--HQFIAICNYDPP 155 (164)
Q Consensus 95 ~v~~W~~e~~~y~~~~~~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~~~--~~~~~vC~Y~p~ 155 (164)
+|+.||+|...|+++.+.+..+..++||+||||+++++||||++.|.++ +.++ ||+|+|+
T Consensus 77 av~~W~~e~~~y~~~~~~~~~~~~~~hftqmvw~~t~~vGCa~~~c~~~~~~~~~-vC~Y~P~ 138 (138)
T cd05383 77 VIQAWYDEYKDFKYGVGATPPGAVVGHYTQIVWYKSYLVGCAVAYCPNSKYKYFY-VCHYCPA 138 (138)
T ss_pred HHHHHHHHHHhCCCCCCCCCCCCchhhHHHHHHHhccccceEEEECCCCCcCEEE-EEecCCC
Confidence 9999999999999988776667889999999999999999999999875 5677 9999985
No 5
>smart00198 SCP SCP / Tpx-1 / Ag5 / PR-1 / Sc7 family of extracellular domains. Human glioma pathogenesis-related protein GliPR and the plant pathogenesis-related protein represent functional links between plant defense systems and human immune system. This family has no known function.
Probab=100.00 E-value=1.3e-34 Score=203.24 Aligned_cols=122 Identities=43% Similarity=0.815 Sum_probs=108.8
Q ss_pred HHHHHHHHHHHHHhhcC-----------CCCCccccHHHHHHHHHHHHhhhccCccccCCC-CccceEEeecC-----CC
Q 048151 28 TQQRYVHLHNEARRNVG-----------IGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS-HYGENLAWADY-----DF 90 (164)
Q Consensus 28 ~~~~il~~hN~~R~~~~-----------~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~-~~gen~~~~~~-----~~ 90 (164)
.|+.||+.||.+|++++ |+ +|+||++||..||.+|+ +|...|+.. .+|||+++.++ ..
T Consensus 2 ~~~~iL~~HN~~R~~~a~G~~~~p~a~~m~-~l~Wd~~La~~A~~~a~----~C~~~~~~~~~~GeNi~~~~~~~~~~~~ 76 (144)
T smart00198 2 QQQEILDAHNKLRSQVAKGLLANPAASNML-KLTWDCELASSAQNWAN----QCPFGHSTPRGYGENLAWWSSSTDLPIT 76 (144)
T ss_pred HHHHHHHHHHHHHHHHhcCCCCCCcccccc-cccCCHHHHHHHHHHHH----hCCCcCCCcCCcCcceEEecccCcccch
Confidence 57899999999999999 99 99999999999999999 999888765 78999987653 34
Q ss_pred CHHHHHHHHHhccccCcCCCCCCCC-CCcchHHHHHHHHcCCeeeEEEEEeCCCC---EEEEEEEecCC
Q 048151 91 TVDHIVKMWVDEKQFYDYNSNTCAP-NQMCGHYTQVVWRKSVRLGCAKERCNNNH---QFIAICNYDPP 155 (164)
Q Consensus 91 ~~~~~v~~W~~e~~~y~~~~~~~~~-~~~~~~f~qmvw~~~~~vGCa~~~c~~~~---~~~~vC~Y~p~ 155 (164)
.++++|+.|++|...|++..+.+.. +..++||+||||+++++||||++.|.++. .++ ||+|+|+
T Consensus 77 ~~~~av~~W~~e~~~y~~~~~~~~~~~~~~~hftqmvw~~s~~vGCa~~~c~~~~~~~~~~-vC~Y~P~ 144 (144)
T smart00198 77 YASAAVQLWYDEFQDYGYSSNTCKDTNGKIGHYTQVVWAKTYKVGCGVSNCPDGTKKKTVV-VCNYDPP 144 (144)
T ss_pred hHHHHHHHHHHHHHHcCCCCCccccCccchhHHHHHHHHhcCCcceEEEECCCCCcceEEE-EEecCCC
Confidence 7889999999999999998877665 67799999999999999999999998764 577 9999985
No 6
>cd05385 SCP_GLIPR-1_like SCP_GLIPR-1_like: SCP-like extracellular protein domain, glioma pathogenesis-related protein (GLIPR)-like sub-family. The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, and allergen 5 from vespid venom. It has been proposed that SCP domains may function as endopeptidases.
Probab=100.00 E-value=2.6e-34 Score=202.09 Aligned_cols=121 Identities=39% Similarity=0.836 Sum_probs=104.6
Q ss_pred HHHHHHHHHHHHHHhhcC-----CCCCccccHHHHHHHHHHHHhhhccCccccCCC------------CccceEEeec-C
Q 048151 27 ATQQRYVHLHNEARRNVG-----IGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS------------HYGENLAWAD-Y 88 (164)
Q Consensus 27 ~~~~~il~~hN~~R~~~~-----~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~------------~~gen~~~~~-~ 88 (164)
+++++||+.||.+|+++. |+ +|+||++|+..||.||+ +|.+.|++. .+|||+++.. +
T Consensus 1 ~f~~~~L~~HN~~R~~~~p~a~~m~-~l~Wd~~La~~Aq~~a~----~C~~~~~~~~~~~~~~~~~~~~~GeNi~~~~~~ 75 (144)
T cd05385 1 EFIDECVRIHNELRSKVSPPAANMR-YMTWDAALAKTARAWAK----KCKFKHNIYLGKRYKCHPKFTSVGENIWLGSIY 75 (144)
T ss_pred CHHHHHHHHHHHHHhhCCCCcccCc-ccccCHHHHHHHHHHHh----cCCCCCCchhhcccccccccCcccceeeecccC
Confidence 468899999999999994 67 99999999999999999 998877542 4899998765 3
Q ss_pred CCCHHHHHHHHHhccccCcCCCCCCCCCCcchHHHHHHHHcCCeeeEEEEEeCCCC-----EEEEEEEecCC
Q 048151 89 DFTVDHIVKMWVDEKQFYDYNSNTCAPNQMCGHYTQVVWRKSVRLGCAKERCNNNH-----QFIAICNYDPP 155 (164)
Q Consensus 89 ~~~~~~~v~~W~~e~~~y~~~~~~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~~~~-----~~~~vC~Y~p~ 155 (164)
.+.++++|+.||+|...|+++.+.+. ..++||+||||+++++||||++.|.++. .++ ||+|+|+
T Consensus 76 ~~~~~~av~~W~~e~~~y~~~~~~~~--~~~ghftqmvw~~t~~vGCa~~~c~~~~~~~~~~~v-VC~Y~p~ 144 (144)
T cd05385 76 IFSPKNAVTSWYNEGKFYDFDTNSCS--RVCGHYTQVVWATSYKVGCAVAFCPNLGGIPNAAIF-VCNYAPA 144 (144)
T ss_pred CCCHHHHHHHHHHHHHhCCCCCCCCC--CcccCHHHHHHhhccccceEEEECCCCCCccccEEE-EEeCCCC
Confidence 45889999999999999999876654 4699999999999999999999998752 567 9999984
No 7
>cd00168 SCP SCP: SCP-like extracellular protein domain, found in eukaryotes and prokaryotes. This family includes plant pathogenesis-related protein 1 (PR-1), which accumulates after infections with pathogens, and may act as an anti-fungal agent or be involved in cell wall loosening. This family also includes CRISPs, mammalian cysteine-rich secretory proteins, which combine SCP with a C-terminal cysteine rich domain, and allergen 5 from vespid venom. Roles for CRISP, in response to pathogens, fertilization, and sperm maturation have been proposed. One member, Tex31 from the venom duct of Conus textile, has been shown to possess proteolytic activity sensitive to serine protease inhibitors. The human GAPR-1 protein has been reported to dimerize, and such a dimer may form an active site containing a catalytic triad. SCP has also been proposed to be a Ca++ chelating serine protease. The Ca++-chelating function would fit with various signaling processes that members of this family, such as
Probab=100.00 E-value=5.4e-33 Score=190.08 Aligned_cols=116 Identities=40% Similarity=0.774 Sum_probs=104.4
Q ss_pred HHHHHHHHHHHhhc-CCCCCccccHHHHHHHHHHHHhhhccCccccCCC----CccceEEeecCCCCHHHHHHHHHhccc
Q 048151 30 QRYVHLHNEARRNV-GIGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS----HYGENLAWADYDFTVDHIVKMWVDEKQ 104 (164)
Q Consensus 30 ~~il~~hN~~R~~~-~~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~----~~gen~~~~~~~~~~~~~v~~W~~e~~ 104 (164)
++||+.||.+|+++ +++ +|+||++|+..||.+|+ +|.+.|++. .+|||+++......++++++.|++|..
T Consensus 2 ~~il~~hN~~R~~~a~~~-~L~wd~~La~~A~~~a~----~c~~~h~~~~~~~~~geNi~~~~~~~~~~~~v~~W~~e~~ 76 (122)
T cd00168 2 QEVVRLHNSYRAKVNGML-PMSWDAELAKTAQNYAN----RCIFKHSGEDGRGFVGENLAAGSYDMTGPAAVQAWYNEIK 76 (122)
T ss_pred cHHHHHHHHHHHhcCCCC-CCccCHHHHHHHHHHHh----hccccCCCcccCCCCCceeEEecCCCCHHHHHHHHHHHHH
Confidence 67999999999999 999 99999999999999999 999888765 589999988755689999999999999
Q ss_pred cCcCCCCCCCCCCcchHHHHHHHHcCCeeeEEEEEeCCCCEEEEEEEec
Q 048151 105 FYDYNSNTCAPNQMCGHYTQVVWRKSVRLGCAKERCNNNHQFIAICNYD 153 (164)
Q Consensus 105 ~y~~~~~~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~~~~~~~~vC~Y~ 153 (164)
.|++.+.. .+..++||+||||+++++||||++.|+++..++ ||+|+
T Consensus 77 ~y~~~~~~--~~~~~~h~~qmvw~~s~~vGca~~~~~~~~~~~-vC~Y~ 122 (122)
T cd00168 77 NYNFGQPG--FSSGTGHYTQVVWKNTTKIGCGVAFCGSNSYYV-VCNYG 122 (122)
T ss_pred hCCCCCCC--CCCCccchhhhhcccCCeeeeEEEEcCCCCEEE-EEeCc
Confidence 99998543 345699999999999999999999998777888 99995
No 8
>cd05559 SCP_HrTT-1 SCP_HrTT-1: SCP-like extracellular protein domain in HrTT-1, a tail-tip epidermis marker in ascidians. The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, and allergen 5 from vespid venom. It has been proposed that SCP domains may function as endopeptidases.
Probab=100.00 E-value=6.6e-33 Score=193.23 Aligned_cols=119 Identities=43% Similarity=0.846 Sum_probs=104.3
Q ss_pred HHHHHHHHHHHHhhcC-----CCCCccccHHHHHHHHHHHHhhhccCccccCCC----CccceEEeecC-CCCHHHHHHH
Q 048151 29 QQRYVHLHNEARRNVG-----IGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS----HYGENLAWADY-DFTVDHIVKM 98 (164)
Q Consensus 29 ~~~il~~hN~~R~~~~-----~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~----~~gen~~~~~~-~~~~~~~v~~ 98 (164)
|+.||+.||.+|+.++ |. +|+||++||..||.||+ +|.+.|++. .+|||++...+ ...+.++|+.
T Consensus 1 r~~il~~HN~~R~~~~p~a~~m~-~L~Wd~~La~~A~~~a~----~C~~~~~~~~~~~~~GeNl~~~~~~~~~~~~~v~~ 75 (136)
T cd05559 1 RLNLVDLHNQYRSQVSPPAANML-KMTWDEELAALAEAYAR----KCIWDHNPDRGHLRVGENLFISTGPPFDATKAVED 75 (136)
T ss_pred CcHHHHHHHHHHhhCCCccccCc-ccccCHHHHHHHHHHHH----hccccCCCcccCCCceeeeeecCCCCCCHHHHHHH
Confidence 5789999999999986 44 79999999999999999 999888664 58999987764 4678999999
Q ss_pred HHhccccCcCCCCCCCCCCcchHHHHHHHHcCCeeeEEEEEeCCC-------CEEEEEEEec
Q 048151 99 WVDEKQFYDYNSNTCAPNQMCGHYTQVVWRKSVRLGCAKERCNNN-------HQFIAICNYD 153 (164)
Q Consensus 99 W~~e~~~y~~~~~~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~~~-------~~~~~vC~Y~ 153 (164)
|++|...|+++.+.+..+..++||+||||+++++||||++.|.++ ..++ ||+|+
T Consensus 76 W~~e~~~y~~~~~~~~~~~~~~hftqmvw~~t~~vGCa~~~c~~~~~~~~~~~~~~-vC~Y~ 136 (136)
T cd05559 76 WNNEKLDYNYNTNTCAPNKMCGHYTQVVWANTFKIGCGSYFCETLEVLRWENATLL-VCNYG 136 (136)
T ss_pred HHHHHHhcCCCCCCCCCCCcccchHHHHHhccCccceEEEECCCCCCCCcccCEEE-EecCC
Confidence 999999999998887777889999999999999999999999652 2456 99995
No 9
>KOG3017 consensus Defense-related protein containing SCP domain [Function unknown]
Probab=100.00 E-value=4.5e-33 Score=208.98 Aligned_cols=132 Identities=41% Similarity=0.780 Sum_probs=116.3
Q ss_pred HHHHHHHHHHHHHHhhcC-----CCCCccccHHHHHHHHHHHHhhhccCccccCC------CCccceEEeecCC------
Q 048151 27 ATQQRYVHLHNEARRNVG-----IGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSV------SHYGENLAWADYD------ 89 (164)
Q Consensus 27 ~~~~~il~~hN~~R~~~~-----~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~------~~~gen~~~~~~~------ 89 (164)
+.+++|++.||.+|..+. |+ +|+||++||..||.||+ +|.+.|+. ..+|||+++.++.
T Consensus 40 ~~~~~~~~~hn~~r~~~~~~as~m~-~m~Wd~~La~~Aq~~a~----~c~~~~~~~~~~~~~~~GeNl~~~~~~~~~~~~ 114 (225)
T KOG3017|consen 40 NLRSEILNGHNVARGAVGPPASNMM-KLKWDDELAALAQNWAN----TCPFGHDKCVHTSFGPYGENLAWGWSSNPPLSL 114 (225)
T ss_pred HHHHHHHhhhHHhcCccCCchHhCc-cccCCHHHHHHHHHHHh----hCCcccCccccccCCCCcccceeeccCCCCccc
Confidence 788999999999999999 99 99999999999999999 88887763 4679999987753
Q ss_pred -CCHHHHHHHHHhccccCcCCCCCCCC---CCcchHHHHHHHHcCCeeeEEEEEeCCC-----CEEEEEEEecCCCCCCC
Q 048151 90 -FTVDHIVKMWVDEKQFYDYNSNTCAP---NQMCGHYTQVVWRKSVRLGCAKERCNNN-----HQFIAICNYDPPGNAAG 160 (164)
Q Consensus 90 -~~~~~~v~~W~~e~~~y~~~~~~~~~---~~~~~~f~qmvw~~~~~vGCa~~~c~~~-----~~~~~vC~Y~p~gn~~g 160 (164)
.....+++.|+.|...|++.++.+.. +..++|||||||+++++||||++.|+++ ..++ ||+|+|+||+.+
T Consensus 115 ~~~~~~a~~~w~~e~~~~~~~~~~~~~~~~~~~~gHyTQ~vw~~s~~vGCgv~~c~~~~~~~~~~~~-vC~Y~p~g~~~~ 193 (225)
T KOG3017|consen 115 DTSGALAVEAWESEFQEYDWSSNTCSSADFGEGIGHYTQMVWAKSTKVGCGVVRCGNGSNGYNTVAV-VCNYDPPGNNIN 193 (225)
T ss_pred cccHHHHHHHHHHHHHHccCcccccCcccCCCcceEEEEEEEeCCceeceeeccCCCCCCCcceEEE-EEEeecCCCCcC
Confidence 46778999999999999999988875 7889999999999999999999999887 4567 999999955544
Q ss_pred -CCCC
Q 048151 161 -ERPF 164 (164)
Q Consensus 161 -~~~Y 164 (164)
+.||
T Consensus 194 ~~~~y 198 (225)
T KOG3017|consen 194 GEIPY 198 (225)
T ss_pred CCCcC
Confidence 6766
No 10
>cd05380 SCP_euk SCP_euk: SCP-like extracellular protein domain, as found mainly in eukaryotes. This family includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, and allergen 5 from vespid venom. It has been proposed that SCP domains may function as endopeptidases.
Probab=99.97 E-value=1.8e-31 Score=186.75 Aligned_cols=119 Identities=36% Similarity=0.700 Sum_probs=103.3
Q ss_pred HHHHHHHHHHHHhhc------------CCCCCccccHHHHHHHHHHHHhhhccCccccCCC----CccceEEeecCC---
Q 048151 29 QQRYVHLHNEARRNV------------GIGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS----HYGENLAWADYD--- 89 (164)
Q Consensus 29 ~~~il~~hN~~R~~~------------~~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~----~~gen~~~~~~~--- 89 (164)
|+.||+.||.+|+++ .|+ +|+||++||..|+.+|+ +|...|+.. .+|||++.....
T Consensus 1 ~~~il~~HN~~R~~~a~g~~~~~p~a~~m~-~l~Wd~~La~~A~~~a~----~C~~~~~~~~~~~~~GeNl~~~~~~~~~ 75 (144)
T cd05380 1 RQAILDAHNELRSKVAKGTYSLLPPASNMP-KLKWDDELAALAQNWAK----TCVFEHSPCRNTGGVGQNLAAGSSTGST 75 (144)
T ss_pred CcHHHHHHHHHHHHhhcCCCCCCCchhcCC-cceeCHHHHHHHHHHHh----cCCCcCCcccCCCCCCcEEEEeccCCCC
Confidence 468999999999999 678 99999999999999999 998888765 689999987642
Q ss_pred --CCHHHHHHHHHhccccCcCCCC-CCCCCCcchHHHHHHHHcCCeeeEEEEEeCC---CCEEEEEEEec
Q 048151 90 --FTVDHIVKMWVDEKQFYDYNSN-TCAPNQMCGHYTQVVWRKSVRLGCAKERCNN---NHQFIAICNYD 153 (164)
Q Consensus 90 --~~~~~~v~~W~~e~~~y~~~~~-~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~~---~~~~~~vC~Y~ 153 (164)
..++++|+.|++|...|++... .+..+..++||+||||+++++||||++.|.. ...++ ||+|+
T Consensus 76 ~~~~~~~~v~~W~~e~~~~~~~~~~~~~~~~~~~hftq~vw~~t~~vGCa~~~~~~~~~~~~~~-vC~Y~ 144 (144)
T cd05380 76 VEELAEDAVNAWYNELKDYGFGSNPTNNFNSGIGHFTQMVWAKTTKVGCAVARCGKDGGNKTVV-VCNYS 144 (144)
T ss_pred HHHHHHHHHHHHHHHHHHcCCCcCcccccccchhHHHHHHHHhcCccceEEEEeecCCceEEEE-EecCC
Confidence 3688999999999999999875 4445677999999999999999999999975 35667 99996
No 11
>PF00188 CAP: Cysteine-rich secretory protein family; InterPro: IPR014044 The cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins (CAP) superfamily proteins are found in a wide range of organisms, including prokaryotes and non-vertebrate eukaryotes. The nine subfamilies of the mammalian CAP superfamily include: the human glioma pathogenesis-related 1 (GLIPR1), Golgi associated pathogenesis related-1 (GAPR1) proteins, peptidase inhibitor 15 (PI15), peptidase inhibitor 16 (PI16), cysteine-rich secretory proteins (CRISPs), CRISP LCCL domain containing 1 (CRISPLD1), CRISP LCCL domain containing 2 (CRISPLD2), mannose receptor like and the R3H domain containing like proteins. Members are most often secreted and have an extracellular endocrine or paracrine function and are involved in processes including the regulation of extracellular matrix and branching morphogenesis, potentially as either proteases or protease inhibitors; in ion channel regulation in fertility; as tumour suppressor or pro-oncogenic genes in tissues including the prostate; and in cell-cell adhesion during fertilisation. The overall protein structural conservation within the CAP superfamily results in fundamentally similar functions for the CAP domain in all members, yet the diversity outside of this core region dramatically alters the target specificity and, thus, the biological consequences. The Ca++-chelating function would fit with the various signalling processes (e.g. the CRISP proteins) that members of this family are involved in, and also the sequence and structural evidence of a conserved pocket containing two histidines and a glutamate. It also may explain how Q91055 from SWISSPROT blocks the Ca++ transporting ryanodine receptors. This entry represents the CAP domain common to all members of the CAP superfamily. The CAP domain forms a unique 3 layer alpha-beta-alpha fold with some, though not all, of the structural elements found in proteases.; PDB: 3U3N_C 3U3U_C 3U3L_C 1U53_A 1RC9_A 1SMB_A 3NT8_B 1QNX_A 1WVR_A 3Q2U_A ....
Probab=99.87 E-value=2.5e-21 Score=129.67 Aligned_cols=114 Identities=30% Similarity=0.568 Sum_probs=83.0
Q ss_pred HHHHHHHH-hhcCCCCCccccHHHHHHHHHHHHhhhccCccccCCC-CccceEEeecCCCCHHHH----HHHHHhccccC
Q 048151 33 VHLHNEAR-RNVGIGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS-HYGENLAWADYDFTVDHI----VKMWVDEKQFY 106 (164)
Q Consensus 33 l~~hN~~R-~~~~~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~-~~gen~~~~~~~~~~~~~----v~~W~~e~~~y 106 (164)
|+.||++| ...+++ +|+||++|+..|+.+|+ .|...+... ..|++............. ++.|+.+...+
T Consensus 1 L~~~N~~R~~~~~~~-~L~~d~~L~~~A~~~a~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (124)
T PF00188_consen 1 LDLHNEYRSAANGLP-PLKWDPELAKAAQAHAK----YCANSNSLSHDSGENGSQSSRFGSYSDAQVTAVENWYSESKNY 75 (124)
T ss_dssp HHHHHHHHHBSSTBB---EE-HHHHHHHHHHHT----TTCSSEETTEESEEEEEEESSTTSHHHHHHHHHHHHHGGGGGE
T ss_pred CHHHHHHHHHhCCCC-CCeeCHHHHHHHHHhhH----HhhhhcccccccCCCCccccccccccchhhHHHHHHHhccccc
Confidence 78999999 888899 99999999999999999 776633222 467777766532222222 89999999988
Q ss_pred cCCC--CCCCCCCcchHHHHHHHHcCCeeeEEEEEeCCCC--EEEEEEEe
Q 048151 107 DYNS--NTCAPNQMCGHYTQVVWRKSVRLGCAKERCNNNH--QFIAICNY 152 (164)
Q Consensus 107 ~~~~--~~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~~~~--~~~~vC~Y 152 (164)
+... ........++||++|+|+++++||||++.|..+. .++ ||.|
T Consensus 76 ~~~~~~~~~~~~~~~~h~~~ll~~~~~~iGca~~~~~~~~~~~~~-vc~y 124 (124)
T PF00188_consen 76 NFQNQSIFNSWMNSPGHFTNLLWPNTTRIGCAVANCPNGKNNYYW-VCNY 124 (124)
T ss_dssp ETTCSTEESSTTSTCHHHHHHT-TT--EEEEEEEEETTSSSEEEE-EEEE
T ss_pred ccccchhhhccCCchhhhhhhhcCCCCEEEEEEEEeCCCCeeEEE-EEEC
Confidence 8762 1222246689999999999999999999998875 777 9998
No 12
>TIGR02909 spore_YkwD uncharacterized protein, YkwD family. Members of this protein family represent a subset of those belonging to Pfam family pfam00188 (SCP-like extracellular protein). Based on current cutoffs for this model, all member proteins are found in Bacteria capable of endospore formation. Members include a named but uncharacterized protein, YkwD of Bacillus subtilis. Only the C-terminal region is well-conserved and is included in the seed alignment for this model. Three members of this family have an N-terminal domain homologous to the spore coat assembly protein SafA.
Probab=99.83 E-value=1.5e-19 Score=124.45 Aligned_cols=106 Identities=20% Similarity=0.336 Sum_probs=91.8
Q ss_pred HHHHHHHHHHHHHHhhcCCCCCccccHHHHHHHHHHHHhhhccCccccCCC-----------------CccceEEeecCC
Q 048151 27 ATQQRYVHLHNEARRNVGIGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS-----------------HYGENLAWADYD 89 (164)
Q Consensus 27 ~~~~~il~~hN~~R~~~~~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~-----------------~~gen~~~~~~~ 89 (164)
+.++++++.||.+|.+++++ +|+||+.|++.|+.||++|+..+.+.|... .+|||++.+.
T Consensus 3 ~~e~~~l~~iN~~R~~~Gl~-pL~~~~~L~~~A~~hA~~ma~~~~~~H~~~~~~~~~~r~~~~g~~~~~~gENi~~g~-- 79 (127)
T TIGR02909 3 AEEKRVVELVNAERAKNGLK-PLKADPELSKVARLKSEDMRDKNYFSHTSPTYGSPFDMMKKFGISYRMAGENIAYGN-- 79 (127)
T ss_pred HHHHHHHHHHHHHHHHcCCC-CCccCHHHHHHHHHHHHHHHhCCcccccCCCCCCHHHHHHHcCCCcccceeeeeccC--
Confidence 56889999999999999999 999999999999999999998888888642 3589998654
Q ss_pred CCHHHHHHHHHhccccCcCCCCCCCCCCcchHHHHHHHHcCCeeeEEEEEeCCCCEEEEEEEe
Q 048151 90 FTVDHIVKMWVDEKQFYDYNSNTCAPNQMCGHYTQVVWRKSVRLGCAKERCNNNHQFIAICNY 152 (164)
Q Consensus 90 ~~~~~~v~~W~~e~~~y~~~~~~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~~~~~~~~vC~Y 152 (164)
..++.+++.|+++ .+|+++|+|++.+++|||++.+++++.|+ |=.|
T Consensus 80 ~~~~~~v~~W~~S----------------~gH~~nil~~~~~~~Gvg~~~~~~g~~y~-~q~F 125 (127)
T TIGR02909 80 STVEAVHNAWMNS----------------PGHRANILNPNYTEIGVGYVEGGSGGIYW-TQMF 125 (127)
T ss_pred CCHHHHHHHHHcC----------------HhHHHHHcCCCcCeEeEEEEeCCCCCeEE-EEEe
Confidence 3678999999865 68999999999999999999988876766 5444
No 13
>cd05379 SCP_bacterial SCP_bacterial: SCP-like extracellular protein domain, as found in bacteria and archaea. The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, and allergen 5 from vespid venom. It has been proposed that SCP domains may function as endopeptidases. Little is known about the biological roles of the bacterial and archaeal SCP domains.
Probab=99.68 E-value=6.9e-16 Score=104.55 Aligned_cols=103 Identities=21% Similarity=0.373 Sum_probs=86.9
Q ss_pred HHHHHHHHHHHhhcCCCCCccccHHHHHHHHHHHHhhhccCccccCCC-----------------CccceEEeecCCCCH
Q 048151 30 QRYVHLHNEARRNVGIGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS-----------------HYGENLAWADYDFTV 92 (164)
Q Consensus 30 ~~il~~hN~~R~~~~~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~-----------------~~gen~~~~~~~~~~ 92 (164)
+.+++.+|.+|..++++ +|+||.+|+..|+.+|.+|+....+.|.+. ..|||++.... .+
T Consensus 2 ~~~~~~iN~~R~~~gl~-pl~~~~~l~~~A~~~a~~~~~~~~~~h~~~~~~~~~~~~~~~g~~~~~~~eni~~~~~--~~ 78 (122)
T cd05379 2 QEALELINAYRAQNGLP-PLTWDPALAAAAQAHARDMAANGYFSHTGPDGSSPFDRARAAGYPYSSAGENIAYGYS--TA 78 (122)
T ss_pred hHHHHHHHHHHHHcCCC-CCccChHHHHHHHHHHHHHHhcCccCCcCCCCCCHHHHHHHcCCCcCccchhhcccCC--CH
Confidence 57899999999999999 999999999999999999987666666432 13889876643 68
Q ss_pred HHHHHHHHhccccCcCCCCCCCCCCcchHHHHHHHHcCCeeeEEEEEeCCCCEEEEEEEe
Q 048151 93 DHIVKMWVDEKQFYDYNSNTCAPNQMCGHYTQVVWRKSVRLGCAKERCNNNHQFIAICNY 152 (164)
Q Consensus 93 ~~~v~~W~~e~~~y~~~~~~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~~~~~~~~vC~Y 152 (164)
.++++.|+++ .+|+.+|+++..+++|||+....++..++ |..|
T Consensus 79 ~~~~~~w~~~----------------~~H~~~ll~~~~~~~Gvg~~~~~~~~~y~-~~~f 121 (122)
T cd05379 79 EAAVDGWMNS----------------PGHRANILNPDYTEVGVGVAYGGDGGYYW-VQVF 121 (122)
T ss_pred HHHHHHHhCC----------------HhHHHHHcCCCcceeeEEEEeCCCCCeEE-EEec
Confidence 9999999865 78999999999999999999987776666 6554
No 14
>COG2340 Uncharacterized protein with SCP/PR1 domains [Function unknown]
Probab=99.30 E-value=1.5e-11 Score=91.28 Aligned_cols=98 Identities=20% Similarity=0.345 Sum_probs=84.4
Q ss_pred hHHHHHHHHHHHHHHhhcCCCCCccccHHHHHHHHHHHHhhhccCccccCCC-----------------CccceEEeecC
Q 048151 26 NATQQRYVHLHNEARRNVGIGIGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS-----------------HYGENLAWADY 88 (164)
Q Consensus 26 ~~~~~~il~~hN~~R~~~~~~~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~-----------------~~gen~~~~~~ 88 (164)
+++.+++++.+|.+|..++++ +|.||.+|+..|+.++.+|++...+.|..+ .+||||+.++.
T Consensus 78 ~~~~~~~~~~~N~~R~~~~l~-~L~~n~~L~~~A~~~a~~m~~~g~~sH~~~~g~~~~~r~~~~g~~~~~agENIa~g~~ 156 (207)
T COG2340 78 AQFEKAVVAETNQERAKHGLP-PLAWNATLAKAARNHARDMAKNGYFSHTSPTGETPADRLKKYGISGATAGENIAYGSN 156 (207)
T ss_pred chhHHHHHHHHHHHHhhcCCC-CcccCHHHHHHHHHHHHHHHHcCCccccCCCCCCHHHHHHhCCcccccccceeecCCC
Confidence 467889999999999999999 999999999999999999998888888542 48999988753
Q ss_pred CCCHHHHHHHHHhccccCcCCCCCCCCCCcchHHHHHHHHcCCeeeEEEEEeC
Q 048151 89 DFTVDHIVKMWVDEKQFYDYNSNTCAPNQMCGHYTQVVWRKSVRLGCAKERCN 141 (164)
Q Consensus 89 ~~~~~~~v~~W~~e~~~y~~~~~~~~~~~~~~~f~qmvw~~~~~vGCa~~~c~ 141 (164)
+. .+.+++.|.+. .||-.+|+.+..+.+|.|+..-.
T Consensus 157 ~~-~~~~v~~Wl~S----------------~gH~~nll~~~~~~~Gv~~~~~~ 192 (207)
T COG2340 157 DP-PEAAVDGWLNS----------------PGHRKNLLNPAYTEIGVGVAYDA 192 (207)
T ss_pred Cc-hHHHHHHhcCC----------------hhhhhhccCcchhheeEEEEecC
Confidence 21 27999999765 58999999999999999998743
No 15
>PF11054 Surface_antigen: Sporozoite TA4 surface antigen; InterPro: IPR021288 This family of proteins is a eukaryotic family of surface antigens. One of the better characterised members of the family is the sporulated TA4 antigen. The TA4 gene encodes a single polypeptide of 25 kDa which contains a 17 kDa and an 8 kDa polypeptide.
Probab=92.55 E-value=1.2 Score=33.85 Aligned_cols=129 Identities=16% Similarity=0.217 Sum_probs=78.3
Q ss_pred HHHHHHHHHHHHhhcCCC-----------CCccccHHHHHHHHHHHHhhhccCccccCCC-------------CccceEE
Q 048151 29 QQRYVHLHNEARRNVGIG-----------IGMTWDKTLEDHAHSYAQKLKVDCIIEHSVS-------------HYGENLA 84 (164)
Q Consensus 29 ~~~il~~hN~~R~~~~~~-----------~~L~Wd~~La~~A~~~a~~~~~~C~~~~~~~-------------~~gen~~ 84 (164)
.-++|+..|..|...|++ ++=-=+++| .....|-. -|...-+.. .-| ..+
T Consensus 35 ~~~CL~E~NaaReAAGL~~F~~A~~~~~~Lp~~~~~e~-~~~t~W~~----iC~~l~pt~~~~~~~~~~~~pf~~G-TyA 108 (254)
T PF11054_consen 35 SVECLSEMNAAREAAGLANFTEATSDDQKLPEPGSEEL-TDDTLWKK----ICEHLIPTQAEPAAEASKLNPFKDG-TYA 108 (254)
T ss_pred chhHHHHHHHHHHhcCchhhHhhcCCcccCCCCCchhc-cchhhHHH----HHHHhcCCCCcchhhccccCcCCCC-ceE
Confidence 567999999999999966 111113334 44455665 665332211 112 222
Q ss_pred eec---CCCCHHHHHHHHHhccccCcCCCCC------CCCCCcchHHHHHHHHcCCe-eeEEEEEeCCC-----------
Q 048151 85 WAD---YDFTVDHIVKMWVDEKQFYDYNSNT------CAPNQMCGHYTQVVWRKSVR-LGCAKERCNNN----------- 143 (164)
Q Consensus 85 ~~~---~~~~~~~~v~~W~~e~~~y~~~~~~------~~~~~~~~~f~qmvw~~~~~-vGCa~~~c~~~----------- 143 (164)
..+ +..+..+.|+.|-.-.++++--.+. .+.+...-.|.-|.+++... .-|.+..|...
T Consensus 109 f~~lt~~~~dCk~aVdYWKaafknF~glPPs~~~~~~lYndqdnVSFVALYNPs~~atAdC~vvTCt~tt~~~~~~~~~~ 188 (254)
T PF11054_consen 109 FKSLTDEKPDCKEAVDYWKAAFKNFTGLPPSKTAANKLYNDQDNVSFVALYNPSSSATADCRVVTCTQTTSNTAGGSRLQ 188 (254)
T ss_pred eeeccCCCCChHHHHHHHHHHHhhcCCCCCChhhccccccCCcceeEEEEeCCCCCCcceeEEEeCCCCCccCCCccccc
Confidence 222 4568999999998887777642222 12233344566677777754 67999999541
Q ss_pred ---------CEEEEEEEecCCCC-CCCCCCC
Q 048151 144 ---------HQFIAICNYDPPGN-AAGERPF 164 (164)
Q Consensus 144 ---------~~~~~vC~Y~p~gn-~~g~~~Y 164 (164)
++-+ +|.-.|..= ..|+.||
T Consensus 189 ~d~~~~~~~gyAl-iCkT~P~Al~~~~saPF 218 (254)
T PF11054_consen 189 GDSDSESKTGYAL-ICKTMPAALASDGSAPF 218 (254)
T ss_pred CCCcccccceEEE-EEecCchhhcCCCCCCC
Confidence 3455 899999764 5666664
No 16
>PF15240 Pro-rich: Proline-rich
Probab=52.92 E-value=8.5 Score=27.95 Aligned_cols=18 Identities=28% Similarity=0.191 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHhhcccc
Q 048151 7 LAIFHLVVLAARIHLSSA 24 (164)
Q Consensus 7 ~~~~~l~~~~~~~~~~~a 24 (164)
|+|||-|+||+++++..+
T Consensus 2 LlVLLSvALLALSSAQ~~ 19 (179)
T PF15240_consen 2 LLVLLSVALLALSSAQST 19 (179)
T ss_pred hhHHHHHHHHHhhhcccc
Confidence 344455556666655544
No 17
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=52.68 E-value=9.3 Score=30.12 Aligned_cols=34 Identities=21% Similarity=0.407 Sum_probs=22.1
Q ss_pred HHHHHHcCCeeeEEEEEeCCCCEEEEEEEecCCCCC
Q 048151 123 TQVVWRKSVRLGCAKERCNNNHQFIAICNYDPPGNA 158 (164)
Q Consensus 123 ~qmvw~~~~~vGCa~~~c~~~~~~~~vC~Y~p~gn~ 158 (164)
.-+||++.|.---....-... -++ .|.|+|.||.
T Consensus 78 klIvWDs~TtnK~haipl~s~-WVM-tCA~sPSg~~ 111 (343)
T KOG0286|consen 78 KLIVWDSFTTNKVHAIPLPSS-WVM-TCAYSPSGNF 111 (343)
T ss_pred eEEEEEcccccceeEEecCce-eEE-EEEECCCCCe
Confidence 367898887765444433222 445 8999998875
No 18
>PF04202 Mfp-3: Foot protein 3; InterPro: IPR007328 Mytilus foot protein-3 (Mfp-3) is a highly polymorphic protein family located in the byssal adhesive plaques of blue mussels.
Probab=49.88 E-value=18 Score=21.76 Aligned_cols=24 Identities=21% Similarity=0.282 Sum_probs=13.6
Q ss_pred CCchhHHHHHHHHHHHHHhhcccc
Q 048151 1 MSPINSLAIFHLVVLAARIHLSSA 24 (164)
Q Consensus 1 m~~~~~~~~~~l~~~~~~~~~~~a 24 (164)
|.-....++++|+++.++++.|.+
T Consensus 1 mnn~Si~VLlaLvLIg~fAVqSda 24 (71)
T PF04202_consen 1 MNNLSIAVLLALVLIGSFAVQSDA 24 (71)
T ss_pred CCchhHHHHHHHHHHhhheeeecC
Confidence 444555566666666666555544
No 19
>PF11254 DUF3053: Protein of unknown function (DUF3053); InterPro: IPR021413 Some members in this family of proteins are annotated as the membrane protein YiaF. No function is currently known.
Probab=45.90 E-value=66 Score=24.41 Aligned_cols=50 Identities=14% Similarity=0.159 Sum_probs=30.8
Q ss_pred CCchhHHHHHHHHHHHHHhhccccChHHHHHHHHHHHH--HHh-hcCCCCCccccH
Q 048151 1 MSPINSLAIFHLVVLAARIHLSSANNATQQRYVHLHNE--ARR-NVGIGIGMTWDK 53 (164)
Q Consensus 1 m~~~~~~~~~~l~~~~~~~~~~~a~~~~~~~il~~hN~--~R~-~~~~~~~L~Wd~ 53 (164)
|+...|++.+++++.|+-|...- .+.|+.+++.... .|+ .+.+| +|+-+.
T Consensus 1 ~r~~~p~~al~~~l~LagCgdKE--peQR~AFi~fLQ~~i~~~~g~~vp-~Lte~q 53 (229)
T PF11254_consen 1 SRWFRPLLALLMVLQLAGCGDKE--PEQRKAFIDFLQNRIMRSPGVRVP-TLTEDQ 53 (229)
T ss_pred CchHHHHHHHHHHHHHHhcCCCC--HHHHHHHHHHHHHHHHHhcCCCCC-CCCHHH
Confidence 34555654444444455443333 5788888888777 777 56788 776443
No 20
>PF08105 Antimicrobial10: Metchnikowin family; InterPro: IPR012513 This family consists of the metchnikowin family of antimicrobial peptides from Drosophila. Metchnikowin is a proline-rich peptide whose expression is immune-inducible. Induction of metchnikowin gene expression can be mediated either by the TOLL pathway or by the imd gene product. The metchnikowin peptide is unique among the Drosophila antimicrobial peptides in that it is active against both bacteria and fungi.
Probab=45.25 E-value=28 Score=19.58 Aligned_cols=24 Identities=17% Similarity=-0.057 Sum_probs=13.7
Q ss_pred CCchhHHHHHHHHHHHHHhhcccc
Q 048151 1 MSPINSLAIFHLVVLAARIHLSSA 24 (164)
Q Consensus 1 m~~~~~~~~~~l~~~~~~~~~~~a 24 (164)
|..++...++.|+.+++......+
T Consensus 1 Mqlnlg~i~l~lL~ll~~~~~~~~ 24 (52)
T PF08105_consen 1 MQLNLGAIFLALLGLLALAGSVLT 24 (52)
T ss_pred CcccHHHHHHHHHHHHHhcccccc
Confidence 666666665555555555544444
No 21
>PF04648 MF_alpha: Yeast mating factor alpha hormone; InterPro: IPR006742 This repeated sequence, WHWLQLKPGQPMY, characterises the mating factor alpha-1 or alpha-1 mating pheromone [contains: Mating factor alpha]. The hormone is excreted into the culture medium by haploid cells of the alpha mating type and acts on cells of the opposite mating type (type A) by binding to a cognate G-protein coupled receptor which is coupled to a downstream signal transduction pathway. It inhibits DNA synthesis in type A cells, synchronising them with type alpha, and so mediates the conjugation process.; GO: 0000772 mating pheromone activity, 0019953 sexual reproduction, 0005576 extracellular region
Probab=44.52 E-value=12 Score=14.93 Aligned_cols=6 Identities=17% Similarity=0.578 Sum_probs=3.8
Q ss_pred CCCCCC
Q 048151 159 AGERPF 164 (164)
Q Consensus 159 ~g~~~Y 164 (164)
+||++|
T Consensus 8 ~GqP~Y 13 (13)
T PF04648_consen 8 PGQPMY 13 (13)
T ss_pred CCCcCC
Confidence 467766
No 22
>PF02402 Lysis_col: Lysis protein; InterPro: IPR003059 The DNA sequence of the entire colicin E2 operon has been determined. The operon comprises the colicin activity gene (ceaB), the colicin immunity gene (ceiB) and the lysis gene (celB), which is essential for colicin release from producing cells. A putative LexA binding site is located upstream from ceaB, and a rho-independent terminator structure is located downstream from celB. Comparison of the amino acid sequences of colicin E2 and cloacin DF13 reveals extensive similarity. These colicins have different modes of action and recognise different cell surface receptors; the two major regions of heterology at the C terminus, and in the C-terminal end of the central region are thought to correspond to the catalytic and receptor-recognition domains, respectively. Sequence similarities between colicins E2, A and E1 are less striking. The colicin E2 (pyocin) immunity protein does not share similarity with either the colicin E3 or cloacin DF13 immunity proteins. By contrast, the lysis proteins of the ColE2, ColE1 and CloDF13 plasmids are almost identical except in the N-terminal regions, which themselves are similar to lipoprotein signal peptides. Processing of the ColE2 prolysis protein to the mature form is prevented by globomycin, a specific inhibitor of the lipoprotein signal peptidase. The mature ColE2 lysis protein is located in the cell envelope.; GO: 0009405 pathogenesis, 0019835 cytolysis, 0019867 outer membrane
Probab=43.40 E-value=12 Score=20.57 Aligned_cols=23 Identities=30% Similarity=0.330 Sum_probs=13.7
Q ss_pred CCchhHHHHHHHHHHHHHhhccc
Q 048151 1 MSPINSLAIFHLVVLAARIHLSS 23 (164)
Q Consensus 1 m~~~~~~~~~~l~~~~~~~~~~~ 23 (164)
|+.+..+.|+.+.++++.+...+
T Consensus 1 MkKi~~~~i~~~~~~L~aCQaN~ 23 (46)
T PF02402_consen 1 MKKIIFIGIFLLTMLLAACQANY 23 (46)
T ss_pred CcEEEEeHHHHHHHHHHHhhhcc
Confidence 66566666666666666555433
No 23
>PHA03066 Hypothetical protein; Provisional
Probab=25.99 E-value=1.7e+02 Score=19.48 Aligned_cols=22 Identities=14% Similarity=0.069 Sum_probs=12.4
Q ss_pred CCchhHHHHHHHHHHHHHhhcc
Q 048151 1 MSPINSLAIFHLVVLAARIHLS 22 (164)
Q Consensus 1 m~~~~~~~~~~l~~~~~~~~~~ 22 (164)
|++.+-+++|.++++++=.-.-
T Consensus 1 ~~~~~~l~fFi~Fl~~~Y~~n~ 22 (110)
T PHA03066 1 ASSLLYLLFFIIFLCISYYFNY 22 (110)
T ss_pred CchHHHHHHHHHHHHHHHHHhh
Confidence 6666666666555555543333
No 24
>PF08138 Sex_peptide: Sex peptide (SP) family; InterPro: IPR012608 This family consists of Sex Peptides (SP) that are found in Drosophila. On mating, a Drosophila female decreases her remating rate and increases her egg-laying rate due, in part, to the transfer of SP from the male to the female. SP are found in seminal fluids transferred from the male to the female during mating. The male seminal fluid proteins are referred to as accessory gland proteins (Acps). The SP is one of the most interesting Acps and plays an important role in reproduction.; GO: 0005179 hormone activity, 0046008 regulation of female receptivity, post-mating, 0005576 extracellular region; PDB: 2LAQ_A.
Probab=24.80 E-value=24 Score=20.20 Aligned_cols=14 Identities=21% Similarity=0.228 Sum_probs=0.0
Q ss_pred CCchhHHHHHHHHH
Q 048151 1 MSPINSLAIFHLVV 14 (164)
Q Consensus 1 m~~~~~~~~~~l~~ 14 (164)
|+++.+|.++.+++
T Consensus 1 Mk~p~~llllvlll 14 (56)
T PF08138_consen 1 MKTPIFLLLLVLLL 14 (56)
T ss_dssp --------------
T ss_pred CcchHHHHHHHHHH
Confidence 66666664444433
No 25
>PF15284 PAGK: Phage-encoded virulence factor
Probab=24.77 E-value=74 Score=18.80 Aligned_cols=17 Identities=24% Similarity=0.382 Sum_probs=7.2
Q ss_pred CCchhHHHHHHHHHHHH
Q 048151 1 MSPINSLAIFHLVVLAA 17 (164)
Q Consensus 1 m~~~~~~~~~~l~~~~~ 17 (164)
|+....+.+.+++++++
T Consensus 1 Mkk~ksifL~l~~~LsA 17 (61)
T PF15284_consen 1 MKKFKSIFLALVFILSA 17 (61)
T ss_pred ChHHHHHHHHHHHHHHH
Confidence 55444444433443333
No 26
>KOG4228 consensus Protein tyrosine phosphatase [Signal transduction mechanisms]
Probab=24.61 E-value=83 Score=29.39 Aligned_cols=38 Identities=13% Similarity=0.230 Sum_probs=29.9
Q ss_pred CcchHHHHHHHHcCCeeeEEEEEeCCCCEEEEEEEecCC
Q 048151 117 QMCGHYTQVVWRKSVRLGCAKERCNNNHQFIAICNYDPP 155 (164)
Q Consensus 117 ~~~~~f~qmvw~~~~~vGCa~~~c~~~~~~~~vC~Y~p~ 155 (164)
...++|-.|||.+-+..=..+.++.+.+..- .+.|+|.
T Consensus 619 eTv~DFWRMVWEq~S~~IVMvTnl~E~~r~k-C~qYWP~ 656 (1087)
T KOG4228|consen 619 ETVGDFWRMVWEQKSAGIVMVTNLEEFSRVK-CAQYWPE 656 (1087)
T ss_pred cchHHHHHHheeccCCcEEEEeccccccccc-ccccCCC
Confidence 5588999999999988777777777765555 5679983
No 27
>TIGR03044 PS_II_psb27 photosystem II protein Psb27. Members of this family are the Psb27 protein of the cyanobacterial photosynthetic supracomplex, photosystem II. Although most protein components of both cyanobacterial and chloroplast versions of photosystem II are closely related and described together by single model families, this family is strictly bacterial. Some uncharacterized proteins with highly divergent sequences, from Arabidopsis, score between trusted and noise cutoffs for this model but are not at this time assigned as functionally equivalent photosystem II proteins.
Probab=24.11 E-value=2.4e+02 Score=19.58 Aligned_cols=21 Identities=5% Similarity=0.088 Sum_probs=18.8
Q ss_pred hHHHHHHHHHHHHHHhhcCCC
Q 048151 26 NATQQRYVHLHNEARRNVGIG 46 (164)
Q Consensus 26 ~~~~~~il~~hN~~R~~~~~~ 46 (164)
.+..++-+...+.+|....++
T Consensus 35 g~Y~~DT~~Vi~tlr~~i~lp 55 (135)
T TIGR03044 35 GDYVEDTLAVIQTLREAIDLP 55 (135)
T ss_pred chHHHHHHHHHHHHHHHHcCC
Confidence 378899999999999999988
No 28
>KOG4326 consensus Mitochondrial F1F0-ATP synthase, subunit e [Energy production and conversion]
Probab=23.25 E-value=1.3e+02 Score=18.36 Aligned_cols=45 Identities=9% Similarity=0.037 Sum_probs=30.0
Q ss_pred hccccChHHHHHHHHHHHHHHhhcCCCCCccccHHHHHHHHHHHHhh
Q 048151 20 HLSSANNATQQRYVHLHNEARRNVGIGIGMTWDKTLEDHAHSYAQKL 66 (164)
Q Consensus 20 ~~~~a~~~~~~~il~~hN~~R~~~~~~~~L~Wd~~La~~A~~~a~~~ 66 (164)
..++. .....+|-..|-+.|.--+-. +-+=|.+++..-++||...
T Consensus 23 GvaYG-a~r~~~l~~~~e~~Rei~a~e-Kav~da~~a~ekKr~a~~e 67 (81)
T KOG4326|consen 23 GVAYG-AFRLRQLREYHEDIREIDAHE-KAVADAEEAAEKKRWAKDE 67 (81)
T ss_pred HHHHh-HHHHHHHhHHHHHHHHHHHHH-HHHHhHHHHHHHHhhHHHH
Confidence 33444 344567788888999877666 5566777777777776543
No 29
>PF03295 Pox_TAA1: Poxvirus trans-activator protein A1 C-terminal; InterPro: IPR004975 Late transcription factor VLTF-2 acts with RNA polymerase to initiate transcription from late gene promoters.
Probab=22.45 E-value=79 Score=18.67 Aligned_cols=19 Identities=21% Similarity=0.448 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHHHhhcCC
Q 048151 27 ATQQRYVHLHNEARRNVGI 45 (164)
Q Consensus 27 ~~~~~il~~hN~~R~~~~~ 45 (164)
+..+++++..|.+|.+-|.
T Consensus 24 ~~Pe~Vi~iIN~lR~keGv 42 (63)
T PF03295_consen 24 EDPEEVINIINELRNKEGV 42 (63)
T ss_pred cCHHHHHHHHHHhhhccCc
Confidence 4678899999999998874
No 30
>COG3026 RseB Negative regulator of sigma E activity [Signal transduction mechanisms]
Probab=21.62 E-value=1.4e+02 Score=23.62 Aligned_cols=37 Identities=16% Similarity=0.214 Sum_probs=23.4
Q ss_pred HHHHHHHHHHHHhhccccChHHHHHHHHHHHHHHhhcC
Q 048151 7 LAIFHLVVLAARIHLSSANNATQQRYVHLHNEARRNVG 44 (164)
Q Consensus 7 ~~~~~l~~~~~~~~~~~a~~~~~~~il~~hN~~R~~~~ 44 (164)
+++++|++.++++..+.+ ++...++|...|..+.+..
T Consensus 6 ~s~~ll~~sl~~s~~a~a-e~~s~~~L~km~~A~~~ln 42 (320)
T COG3026 6 FSLLLLLGSLLLSAAASA-ESASAAWLQKMNEASQSLN 42 (320)
T ss_pred HHHHHHHHHHhhhhhhhc-cCccHHHHHHHHHHHHhcC
Confidence 344444444555555555 4444489999999998877
No 31
>PF08194 DIM: DIM protein; InterPro: IPR013172 Drosophila immune-induced molecules (DIMs) are short proteins induced during the immune response of Drosophila. This entry includes DIMs 1 to 4 and DIM23.
Probab=21.34 E-value=99 Score=16.23 Aligned_cols=6 Identities=17% Similarity=0.163 Sum_probs=3.1
Q ss_pred CCchhH
Q 048151 1 MSPINS 6 (164)
Q Consensus 1 m~~~~~ 6 (164)
|+....
T Consensus 1 Mk~l~~ 6 (36)
T PF08194_consen 1 MKCLSL 6 (36)
T ss_pred CceeHH
Confidence 555544
No 32
>PF14276 DUF4363: Domain of unknown function (DUF4363)
Probab=20.33 E-value=2.7e+02 Score=18.33 Aligned_cols=46 Identities=13% Similarity=0.261 Sum_probs=28.0
Q ss_pred HHHHHHHHHHHHhhccccChHHHHHHHHHHHHHHhhcCCCCCccccHHHH
Q 048151 7 LAIFHLVVLAARIHLSSANNATQQRYVHLHNEARRNVGIGIGMTWDKTLE 56 (164)
Q Consensus 7 ~~~~~l~~~~~~~~~~~a~~~~~~~il~~hN~~R~~~~~~~~L~Wd~~La 56 (164)
++++.+++++......+- ....+.+.+..+.....+.-+ .|+..-.
T Consensus 4 ~~i~~lii~~~~~~~~~l-~~~~~~i~~~l~~i~~~i~~~---dW~~A~~ 49 (121)
T PF14276_consen 4 IIIFILIIALSIFSNNYL-NNSTDSIEEQLEQIEEAIENE---DWEKAYK 49 (121)
T ss_pred HHHHHHHHHHHHHHHhhh-hhHHHHHHHHHHHHHHHHHhC---CHHHHHH
Confidence 444445555555555444 455677888888888877766 5655433
No 33
>KOG4439 consensus RNA polymerase II transcription termination factor TTF2/lodestar, DEAD-box superfamily [Transcription; Replication, recombination and repair]
Probab=20.02 E-value=1.1e+02 Score=27.54 Aligned_cols=40 Identities=20% Similarity=0.463 Sum_probs=31.6
Q ss_pred HHHHHHHHHHHHHHhhcC----------CC-----------CCccccHHHHHHHHHHHHhh
Q 048151 27 ATQQRYVHLHNEARRNVG----------IG-----------IGMTWDKTLEDHAHSYAQKL 66 (164)
Q Consensus 27 ~~~~~il~~hN~~R~~~~----------~~-----------~~L~Wd~~La~~A~~~a~~~ 66 (164)
..|+.+++..|.-+.... .+ +.|-|+..|+++|+...-.|
T Consensus 783 K~Rq~iv~~FN~~k~~~rVmLlSLtAGGVGLNL~GaNHlilvDlHWNPaLEqQAcDRIYR~ 843 (901)
T KOG4439|consen 783 KDRQEIVDEFNQEKGGARVMLLSLTAGGVGLNLIGANHLILVDLHWNPALEQQACDRIYRM 843 (901)
T ss_pred hHHHHHHHHHHhccCCceEEEEEEccCcceeeecccceEEEEecccCHHHHHHHHHHHHHh
Confidence 678999999999887333 11 57889999999999988744
Done!