Query psy8304
Match_columns 593
No_of_seqs 221 out of 616
Neff 5.8
Searched_HMMs 46136
Date Sat Aug 17 00:17:57 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8304.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8304hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00241 ZP Zona pellucida ( 100.0 3.7E-36 8E-41 303.8 25.9 235 168-421 1-246 (253)
2 PF00100 Zona_pellucida: Zona 100.0 1.3E-29 2.9E-34 252.8 22.5 234 168-416 1-254 (265)
3 KOG1924|consensus 98.8 8.3E-09 1.8E-13 115.9 8.4 22 108-129 567-588 (1102)
4 KOG1924|consensus 98.3 2.9E-06 6.2E-11 96.1 10.1 15 186-202 632-646 (1102)
5 KOG3671|consensus 91.8 0.62 1.4E-05 51.8 8.3 10 122-131 427-436 (569)
6 KOG3671|consensus 90.2 0.76 1.6E-05 51.2 7.2 10 68-77 373-382 (569)
7 KOG1830|consensus 82.0 13 0.00028 41.0 11.0 19 15-33 286-304 (518)
8 PF12273 RCR: Chitin synthesis 78.9 1.6 3.5E-05 40.4 2.7 18 550-567 16-33 (130)
9 PRK14094 psbM photosystem II r 71.6 6.5 0.00014 30.4 3.8 16 544-559 15-30 (50)
10 PRK15319 AIDA autotransporter- 68.2 7.6 0.00016 49.8 5.5 10 123-132 1723-1732(2039)
11 PF05454 DAG1: Dystroglycan (D 63.5 2.3 5.1E-05 44.9 0.0 24 537-560 152-175 (290)
12 PF07172 GRP: Glycine rich pro 62.6 8.5 0.00018 34.1 3.4 23 530-552 3-25 (95)
13 PF15102 TMEM154: TMEM154 prot 62.0 6.9 0.00015 37.4 2.8 22 539-560 66-87 (146)
14 KOG0162|consensus 60.4 38 0.00082 40.1 8.7 12 23-34 921-932 (1106)
15 PF14991 MLANA: Protein melan- 60.4 1.6 3.5E-05 39.8 -1.6 27 534-560 26-52 (118)
16 PF02009 Rifin_STEVOR: Rifin/s 58.9 4.8 0.0001 42.8 1.4 37 525-562 249-285 (299)
17 KOG2675|consensus 57.3 6.1 0.00013 43.7 1.8 12 22-33 154-165 (480)
18 PF05568 ASFV_J13L: African sw 56.1 10 0.00022 36.2 2.8 37 542-579 39-75 (189)
19 PF01102 Glycophorin_A: Glycop 51.3 14 0.00031 34.2 3.0 27 534-561 68-94 (122)
20 KOG1922|consensus 49.9 41 0.00089 40.5 7.4 8 529-536 765-772 (833)
21 KOG0132|consensus 49.6 47 0.001 39.5 7.4 13 2-14 478-490 (894)
22 PTZ00046 rifin; Provisional 48.5 9.1 0.0002 41.6 1.4 30 535-564 317-346 (358)
23 COG5178 PRP8 U5 snRNP spliceos 48.5 14 0.00031 45.3 3.1 17 163-180 139-155 (2365)
24 PF11980 DUF3481: Domain of un 46.9 27 0.00058 30.4 3.7 35 530-564 14-49 (87)
25 PF11770 GAPT: GRB2-binding ad 46.9 11 0.00023 36.2 1.4 13 574-586 41-53 (158)
26 PF01299 Lamp: Lysosome-associ 46.5 16 0.00035 38.6 2.9 20 543-562 282-301 (306)
27 TIGR01477 RIFIN variant surfac 46.4 10 0.00022 41.1 1.4 30 535-564 312-341 (353)
28 PRK09752 adhesin; Provisional 45.9 22 0.00048 44.1 4.2 6 9-14 851-856 (1250)
29 PF15330 SIT: SHP2-interacting 45.7 43 0.00092 30.4 5.1 26 537-562 6-31 (107)
30 PF14575 EphA2_TM: Ephrin type 43.2 27 0.00059 29.5 3.2 24 536-559 4-27 (75)
31 PRK15319 AIDA autotransporter- 41.7 37 0.00081 44.0 5.3 6 389-394 1947-1952(2039)
32 PF06809 NPDC1: Neural prolife 40.6 21 0.00045 38.2 2.6 31 543-573 209-239 (341)
33 PLN02983 biotin carboxyl carri 40.2 98 0.0021 32.6 7.3 40 148-189 214-253 (274)
34 PF05308 Mito_fiss_reg: Mitoch 39.5 62 0.0014 33.7 5.8 12 22-33 97-108 (253)
35 PF05337 CSF-1: Macrophage col 38.6 10 0.00022 39.7 0.0 22 542-563 234-255 (285)
36 KOG2675|consensus 38.4 22 0.00048 39.5 2.5 7 149-155 331-337 (480)
37 PF01034 Syndecan: Syndecan do 38.4 13 0.00029 30.6 0.6 24 542-565 20-43 (64)
38 KOG2199|consensus 37.1 68 0.0015 35.5 5.8 10 150-159 450-459 (462)
39 PF05518 Totivirus_coat: Totiv 37.1 1.5E+02 0.0034 35.3 9.0 15 109-123 711-725 (759)
40 PRK09752 adhesin; Provisional 35.2 37 0.0008 42.3 3.8 6 127-132 954-959 (1250)
41 KOG1925|consensus 35.0 44 0.00096 37.9 4.1 10 27-36 164-173 (817)
42 PF00943 Alpha_E2_glycop: Alph 34.0 28 0.0006 38.3 2.3 28 534-561 354-381 (403)
43 PHA02902 putative IMV membrane 33.6 72 0.0016 26.5 4.0 27 536-562 5-31 (70)
44 KOG2236|consensus 33.4 72 0.0016 35.9 5.4 59 68-129 418-480 (483)
45 KOG2893|consensus 32.6 2.3E+02 0.005 29.4 8.4 9 149-157 216-224 (341)
46 KOG1922|consensus 32.0 88 0.0019 37.7 6.4 7 111-117 347-353 (833)
47 PF00558 Vpu: Vpu protein; In 31.8 51 0.0011 28.6 3.1 24 535-558 7-30 (81)
48 PF12273 RCR: Chitin synthesis 31.3 35 0.00075 31.5 2.2 19 542-560 11-29 (130)
49 PF02439 Adeno_E3_CR2: Adenovi 30.6 73 0.0016 23.8 3.3 13 548-560 21-33 (38)
50 KOG4368|consensus 29.5 2.2E+02 0.0047 33.2 8.3 23 69-91 291-316 (757)
51 KOG3895|consensus 29.3 2.1E+02 0.0044 31.5 7.7 9 26-34 369-377 (488)
52 KOG0608|consensus 29.3 1.7E+02 0.0038 34.7 7.6 10 126-135 270-279 (1034)
53 PHA03211 serine/threonine kina 29.2 62 0.0013 36.2 4.2 16 47-62 14-29 (461)
54 PF15179 Myc_target_1: Myc tar 29.0 1.2E+02 0.0027 30.1 5.6 8 585-592 77-84 (197)
55 PF12191 stn_TNFRSF12A: Tumour 28.3 19 0.00042 33.5 0.0 22 542-563 87-108 (129)
56 PF02009 Rifin_STEVOR: Rifin/s 27.8 46 0.001 35.5 2.7 38 525-563 252-289 (299)
57 KOG1830|consensus 27.5 1.3E+02 0.0028 33.7 5.9 10 109-118 399-408 (518)
58 PF05506 DUF756: Domain of unk 26.3 3.1E+02 0.0067 23.3 7.2 45 300-344 33-77 (89)
59 KOG3441|consensus 26.1 2.8E+02 0.006 26.0 7.0 65 295-371 53-122 (149)
60 PF04881 Adeno_GP19K: Adenovir 24.6 73 0.0016 29.9 3.0 31 532-562 100-130 (139)
61 PRK14758 hypothetical protein; 24.4 88 0.0019 21.5 2.5 17 539-555 9-25 (27)
62 PF07174 FAP: Fibronectin-atta 24.3 2.5E+02 0.0054 29.8 7.1 6 67-72 36-41 (297)
63 PF15145 DUF4577: Domain of un 24.3 95 0.0021 28.5 3.6 31 529-559 59-89 (128)
64 KOG2546|consensus 23.6 2E+02 0.0043 32.3 6.5 7 146-152 411-417 (483)
65 PF14141 YqzM: YqzM-like prote 23.4 95 0.0021 23.7 2.9 25 529-553 17-41 (43)
66 PF06365 CD34_antigen: CD34/Po 22.8 49 0.0011 33.3 1.7 50 533-582 101-157 (202)
67 PF05283 MGC-24: Multi-glycosy 22.7 78 0.0017 31.6 3.0 25 531-557 161-185 (186)
68 PF13388 DUF4106: Protein of u 22.7 2.2E+02 0.0048 30.4 6.4 11 69-79 171-181 (422)
69 KOG1984|consensus 22.2 3.5E+02 0.0075 33.1 8.5 11 361-371 600-610 (1007)
70 KOG1785|consensus 21.9 4.5E+02 0.0098 29.4 8.7 10 57-66 467-476 (563)
71 PF11884 DUF3404: Domain of un 21.3 63 0.0014 33.9 2.1 40 523-562 217-258 (262)
72 PTZ00208 65 kDa invariant surf 20.8 98 0.0021 34.3 3.6 26 535-560 389-414 (436)
73 PF15330 SIT: SHP2-interacting 20.4 74 0.0016 28.9 2.2 32 536-567 2-33 (107)
74 PF01690 PLRV_ORF5: Potato lea 20.1 76 0.0016 35.8 2.6 10 259-268 181-190 (465)
75 PF10717 ODV-E18: Occlusion-de 20.1 1E+02 0.0023 26.8 2.9 12 572-583 61-72 (85)
76 KOG3397|consensus 20.1 91 0.002 31.0 2.8 10 47-56 126-135 (225)
77 PF01213 CAP_N: Adenylate cycl 20.0 35 0.00076 36.6 0.0 18 106-123 228-245 (312)
No 1
>smart00241 ZP Zona pellucida (ZP) domain. ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).
Probab=100.00 E-value=3.7e-36 Score=303.77 Aligned_cols=235 Identities=22% Similarity=0.405 Sum_probs=186.0
Q ss_pred EecCCeEEEEEEeCCCccceeeecCCCC-CCCceeccCCCCceEEEEEEeeccCCCCCccCCCccccccCCCCceEEEEE
Q psy8304 168 KCEKNSMKVFISFDKPFFGIVFSKGHYS-NVNCVHLPAGLGRTSANFEIGIHACGTSGNTENGLYGYGADAGSGTYFENI 246 (593)
Q Consensus 168 ~C~~d~M~V~V~~~~pF~G~Iyskg~~~-dp~C~~v~~g~g~t~~~F~I~l~~CGt~~~~~~~~~~y~~~n~~g~~ysNt 246 (593)
+|.++.|+|.|+++.+|.|.||+++.+. |+.|+......+.+++.|++++++||++++. ++++++|+|+
T Consensus 1 ~C~~~~m~v~v~~~~~~~g~i~~~~l~l~d~~C~~~~~~~~~~~~~f~~~l~~CGt~~~~----------~~~~~~ysn~ 70 (253)
T smart00241 1 QCGEDQMVVSVSTDLLFPGGIYVKGLYLGDPSCRPVFTDSTSAFVSFEVPLNGCGTRRQV----------NPDGIVYSNT 70 (253)
T ss_pred CCCCCEEEEEEEecCCCCCeEEEeeEEeCCCCCCCccccCCCcEEEEEeccccCCCeEEE----------CCCeEEEEEE
Confidence 5999999999999999999999998765 8999995334567899999999999999853 3568899999
Q ss_pred EEEe-eCCcceeeecc-eEEEEeeecCCccceeeeccccccccceeee-eccCCCceEEEEEEeCCCCCcccccc--eee
Q psy8304 247 IVIQ-YDPQVQEVWDQ-ARKLRCTWHDQYEKSVTFRPFPVDMLDVVRA-DFAGDNVGCWMQIQVGKGPWASEVSG--LVK 321 (593)
Q Consensus 247 VVVQ-~hp~I~T~~D~-~~~V~C~y~~~~eksVt~~p~~V~m~~v~~~-~fsg~~p~~~M~I~~g~GP~~~~Vs~--~V~ 321 (593)
|+++ .|+.++|..|+ .++|+|.|.... + ++. .+.+.......+ ....+.+.|+|+|+.++ .|+.++.. .++
T Consensus 71 v~~~~~~~~~itr~~~~~~~~~C~y~~~~-~-~~~-~~~~~~~~~~~~~~~~~g~~~~~m~l~~~~-~~~~~~~~~~~~~ 146 (253)
T smart00241 71 LVVSPFHPGFITRDDRAAYHFQCFYPENE-K-VSL-NLDVSTIPPTELSSVSEGPPTCSYRLYKDD-SFGSPYQSADYPV 146 (253)
T ss_pred EEEccCCCCceEecCceEEEEEEEEeCCC-c-eEE-EEEecCCCCCCcccccCCCcEEEEEeccCC-CCCCcccCCCCcc
Confidence 9999 79999999988 899999999853 3 322 123322111111 11235679999999875 57776633 238
Q ss_pred cCCEEEEEEEEecC-CCcceEEEeEEEEecCC----CCcEEeeccCCcccccccccceeeeccCCCccceeEEEEEEeeE
Q psy8304 322 IGQTMTMVLAIKDD-DSKFDMLVRNCMAHDGK----RAPIQLVDQRGCVTRSKLMSRFTKIKNFGASASVLSYAHFQAFK 396 (593)
Q Consensus 322 IGd~L~l~v~i~~~-~~~~~l~V~sC~Atdg~----g~~~~LIDenGCp~D~~L~~~f~~~~d~g~s~slla~a~f~AFK 396 (593)
|||+|+++|++... ++.|+|+|++|||+|+. +.++.||| +||++|+.++....+..+. +..++|.|+|||
T Consensus 147 lg~~l~~~~~~~~~~~~~~~l~v~~C~at~~~~~~~~~~~~lI~-~GC~~d~~~~~~~~~~~~~----~~~~~f~~~aF~ 221 (253)
T smart00241 147 LGDPVYHEWSCDGADDPPLGLLVDNCYATPGSDPSSGPKYFIID-NGCPVDGYLDSTIPYNSSP----NHYARFSVKVFK 221 (253)
T ss_pred cCCeEEEEEEEccCCCCCeEEEEeeEEEcCCCCCCCCCcEEEEE-CccCCCCccccceecCCCC----cceEEEEEEEEE
Confidence 99999999999875 68999999999999987 36899999 9999998877766665541 224699999999
Q ss_pred eCCCCeEEEEEEEEEccCCCCCcCC
Q psy8304 397 FPDSMEVHFQCTIQICRYQCPDQCS 421 (593)
Q Consensus 397 Fpds~~Vyf~C~V~vC~~~CP~~C~ 421 (593)
|+++..|||||+|++|....+..|.
T Consensus 222 F~~~~~v~~hC~v~vC~~~~~~~C~ 246 (253)
T smart00241 222 FADRSLVYFHCQIRLCDKSDGSSCD 246 (253)
T ss_pred ecCCCcEEEEEEEEEECCCCCCCCC
Confidence 9999999999999999985433444
No 2
>PF00100 Zona_pellucida: Zona pellucida-like domain; InterPro: IPR001507 A large domain, containing around 260 amino acids, has been recognised in a variety of receptor-like eukaryotic glycoproteins []. All of these proteins are mosaic proteins composed of various domains and that all have a large extracellular region followed by either a transmembrane region and a very short cytoplasmic region or by a GPI-anchor. The domain common to all these proteins is located in the C-terminal portion of the extracellular region, and contains 8 conserved Cys residues, which are probably involved in disulphide bond formation. This entry includes a domain found in the zona pellucida proteins. The zona pellucida is a specialised extracellular matrix of mammailian eggs, which plays a crucial role in fertilisation by directly mediating species-restricted recongition between gametes. In mammals, up to four zona pellucida subnits are known, named ZP1-4. The full-length crystal structure of murine ZP3 has been solved []. This entry is also found in other glycoproteins, such as CD105 (also called endoglin) which is the regulatory component of the TGF-beta receptor complex. It is a modulator of cellular responses to TGF-beta 1. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 3D4G_G 3EF7_A 3D4C_A 3NK4_B 3NK3_B 3QW9_A.
Probab=99.97 E-value=1.3e-29 Score=252.82 Aligned_cols=234 Identities=24% Similarity=0.425 Sum_probs=156.7
Q ss_pred EecCCeEEEEEEeCCCccceeee-cCCCCCCCceeccCCCCceEEEEEEeecc--CCCCCccCCCccccccCCCCceEEE
Q psy8304 168 KCEKNSMKVFISFDKPFFGIVFS-KGHYSNVNCVHLPAGLGRTSANFEIGIHA--CGTSGNTENGLYGYGADAGSGTYFE 244 (593)
Q Consensus 168 ~C~~d~M~V~V~~~~pF~G~Iys-kg~~~dp~C~~v~~g~g~t~~~F~I~l~~--CGt~~~~~~~~~~y~~~n~~g~~ys 244 (593)
+|..+.|+|.|+++.+|.|.||. ++++.++.|++.. ..+.+++.|++++.. ||+...... +.++.|+
T Consensus 1 ~C~~~~~~v~i~~~~~~~~~i~~~~~~~~~~~C~~~~-~~~~~~~~~~~~~~~~~Cg~~~~~~~---------~~~~~y~ 70 (265)
T PF00100_consen 1 TCGSDSMTVTIKKDYLFTGNIYVSDLHLNDPSCRARS-DTNSTHITFSIPFNSPSCGTTRSIEN---------GDSIVYS 70 (265)
T ss_dssp EE-SSEEEEEEESSTTSSS-B-GGGEEETTTTB--EE-ECTTTEEEEEEECCC--CTSEEEE-----------SSEEEEE
T ss_pred CcCCCEEEEEEEeccccCCCEEeeeeEECCCCCCCcc-cCCCcEEEEEEecCCCCCcccccccc---------cCcceee
Confidence 69999999999999999998877 6777889999921 245678999999999 999976431 2347899
Q ss_pred EEEEEeeCC--------cceeeecceEEEEeee-cCCccceeeeccccccccceeeeeccCCCceEEEEEEeCCCCCcc-
Q psy8304 245 NIIVIQYDP--------QVQEVWDQARKLRCTW-HDQYEKSVTFRPFPVDMLDVVRADFAGDNVGCWMQIQVGKGPWAS- 314 (593)
Q Consensus 245 NtVVVQ~hp--------~I~T~~D~~~~V~C~y-~~~~eksVt~~p~~V~m~~v~~~~fsg~~p~~~M~I~~g~GP~~~- 314 (593)
|+|+++.+. .|++..|..++|+|.| ... .+++......................++|+++.++ .+..
T Consensus 71 n~i~~~~~~~~~~~~~~~i~~~~~~~~~~~C~y~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~l~~~~-~~~~~ 147 (265)
T PF00100_consen 71 NTIVSSQDVSFNDSPGGNITRDSDRRFNFTCTYSSSQ--YSVSSGSPNISSVTIHISQESSGSFSFSMRLYKDD-SFSSP 147 (265)
T ss_dssp EEEEEE--SSSS---STTECSSS-EEEEEEEEE-EC--------------B-TTT---STT--EEEEEEEESTT-TSSCB
T ss_pred eeeeeeccccccccccCceEEecccccccEEEECCcc--ceeccccccceeccccceeccCCccceEEEeccCc-ccccc
Confidence 999987653 4566789999999999 432 23322111111111111111234467899999864 3432
Q ss_pred --cccceeecCCEEEEEEEEecCC-CcceEEEeEEEEecC----CCCcEEeeccCCcccccccccceeeeccCCCcccee
Q psy8304 315 --EVSGLVKIGQTMTMVLAIKDDD-SKFDMLVRNCMAHDG----KRAPIQLVDQRGCVTRSKLMSRFTKIKNFGASASVL 387 (593)
Q Consensus 315 --~Vs~~V~IGd~L~l~v~i~~~~-~~~~l~V~sC~Atdg----~g~~~~LIDenGCp~D~~L~~~f~~~~d~g~s~sll 387 (593)
.-...+.+||.|+++|+++... ..+.|+|++|||+++ .+..+.|||++||++|...+..+.+....+.+ . .
T Consensus 148 ~~~~~~~~~vg~~v~~~v~~~~~~~~~~~~~v~~C~a~~~~~~~~~~~~~lId~~GC~~~~~~~~~~~~~~~~~~~-~-~ 225 (265)
T PF00100_consen 148 YSQPPATVEVGDRVYVEVSVSPDSSDPLGLFVDSCWATPSSDPDSSPSYTLIDENGCPVDSDDFVTFEYSGPNGDS-S-R 225 (265)
T ss_dssp ---SSEEE-TT-EEEEEEEEEECCC-TEEEEEEEEEEESSSSTTSSSCEEEEBCTTEEGGGCTTTEEEEETTTCCC-C-E
T ss_pred ccCCceeEEeeeeEEEEEEeccccceeeEEEeCEEEEeCCCCcccccccceeccccccccccceeEEEeccccccc-c-E
Confidence 1123689999999999986543 128899999999999 34689999999999999888777765111221 2 4
Q ss_pred EEEEEEeeEeCCCCeEEEEEEEEEccCCC
Q psy8304 388 SYAHFQAFKFPDSMEVHFQCTIQICRYQC 416 (593)
Q Consensus 388 a~a~f~AFKFpds~~Vyf~C~V~vC~~~C 416 (593)
++|+|+||||+++..|||||+|++|...|
T Consensus 226 ~~f~f~aF~f~~~~~v~i~C~v~~C~~~~ 254 (265)
T PF00100_consen 226 ARFSFRAFRFPDSSSVYIHCSVRVCDKSD 254 (265)
T ss_dssp EEEEEE-EEETSSSEEEEEEEEEEEETTS
T ss_pred EEEEeeEEEecCCCeEEEEEEEEEECCCC
Confidence 68999999999999999999999999875
No 3
>KOG1924|consensus
Probab=98.82 E-value=8.3e-09 Score=115.91 Aligned_cols=22 Identities=55% Similarity=1.242 Sum_probs=13.0
Q ss_pred CCCCCCCCCCCCCCCCCCCCcc
Q psy8304 108 YHGPPPPPPPLSAAKPPPVQPE 129 (593)
Q Consensus 108 ~~~p~~~~~~~~~~~~~~~~~~ 129 (593)
..|||||||||.-.++||||++
T Consensus 567 aG~PPpPppppg~~gppPPPpp 588 (1102)
T KOG1924|consen 567 AGGPPPPPPPPGGGGPPPPPPP 588 (1102)
T ss_pred cCCCCccCCCCCCCCCCCcCCC
Confidence 4456666666665666655555
No 4
>KOG1924|consensus
Probab=98.28 E-value=2.9e-06 Score=96.06 Aligned_cols=15 Identities=13% Similarity=0.344 Sum_probs=7.1
Q ss_pred ceeeecCCCCCCCceec
Q psy8304 186 GIVFSKGHYSNVNCVHL 202 (593)
Q Consensus 186 G~Iyskg~~~dp~C~~v 202 (593)
+.|-...-. .+|.++
T Consensus 632 ~kI~p~d~s--~~cFWv 646 (1102)
T KOG1924|consen 632 SKIVPRDLS--ENCFWV 646 (1102)
T ss_pred cccCccccC--ccceee
Confidence 444444333 347665
No 5
>KOG3671|consensus
Probab=91.78 E-value=0.62 Score=51.85 Aligned_cols=10 Identities=50% Similarity=1.062 Sum_probs=4.6
Q ss_pred CCCCCCcccc
Q psy8304 122 KPPPVQPEAM 131 (593)
Q Consensus 122 ~~~~~~~~~~ 131 (593)
++++++++-|
T Consensus 427 pPppPPPPPm 436 (569)
T KOG3671|consen 427 PPPPPPPPPM 436 (569)
T ss_pred CCCCCcCCCC
Confidence 3444444455
No 6
>KOG3671|consensus
Probab=90.20 E-value=0.76 Score=51.20 Aligned_cols=10 Identities=30% Similarity=0.587 Sum_probs=4.2
Q ss_pred CCCCCCCCCC
Q psy8304 68 GYVDRPPPPP 77 (593)
Q Consensus 68 ~~~~~~~~~~ 77 (593)
+-.+|||+++
T Consensus 373 ar~PPPpp~s 382 (569)
T KOG3671|consen 373 ARPPPPPPRS 382 (569)
T ss_pred CCCCcCCCCC
Confidence 3344444443
No 7
>KOG1830|consensus
Probab=81.97 E-value=13 Score=41.03 Aligned_cols=19 Identities=21% Similarity=0.307 Sum_probs=8.0
Q ss_pred ecccccccccccCCccccc
Q psy8304 15 TGSQFAEKTEVPNVSKVEE 33 (593)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~ 33 (593)
+|..-|++-...++-.+++
T Consensus 286 ~g~g~a~~r~~~n~~rPt~ 304 (518)
T KOG1830|consen 286 RGGGLASARPSSNVVRPTQ 304 (518)
T ss_pred cCCcccCcCCCcCCCCCCC
Confidence 3444444444444444433
No 8
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=78.90 E-value=1.6 Score=40.41 Aligned_cols=18 Identities=11% Similarity=0.052 Sum_probs=7.7
Q ss_pred HHHhhheeecccccCCCC
Q psy8304 550 VSACLCIRLRPFSNKTSQ 567 (593)
Q Consensus 550 va~~L~~r~r~~~~~~~~ 567 (593)
+++++|+++||++++..+
T Consensus 16 ~~~~~~~~rRR~r~G~~P 33 (130)
T PF12273_consen 16 LFLFYCHNRRRRRRGLQP 33 (130)
T ss_pred HHHHHHHHHHHhhcCCCC
Confidence 334444444444433333
No 9
>PRK14094 psbM photosystem II reaction center protein M; Provisional
Probab=71.57 E-value=6.5 Score=30.42 Aligned_cols=16 Identities=19% Similarity=0.148 Sum_probs=8.4
Q ss_pred HHHHHHHHHhhheeec
Q psy8304 544 LVVSCLVSACLCIRLR 559 (593)
Q Consensus 544 llvs~~va~~L~~r~r 559 (593)
++|-+++..+||++..
T Consensus 15 i~iPT~FLlilYVkT~ 30 (50)
T PRK14094 15 VGVPTIFLIGLFISTQ 30 (50)
T ss_pred HHHHHHHhhheeEEec
Confidence 3345555555655543
No 10
>PRK15319 AIDA autotransporter-like protein ShdA; Provisional
Probab=68.24 E-value=7.6 Score=49.80 Aligned_cols=10 Identities=10% Similarity=0.145 Sum_probs=5.6
Q ss_pred CCCCCccccc
Q psy8304 123 PPPVQPEAMD 132 (593)
Q Consensus 123 ~~~~~~~~~~ 132 (593)
++|.++|+..
T Consensus 1723 ~~P~YRPEVg 1732 (2039)
T PRK15319 1723 VAPQYRADIG 1732 (2039)
T ss_pred CcccccCchh
Confidence 4456666554
No 11
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=63.49 E-value=2.3 Score=44.90 Aligned_cols=24 Identities=21% Similarity=0.350 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHhhheeecc
Q psy8304 537 LFVLLGILVVSCLVSACLCIRLRP 560 (593)
Q Consensus 537 ~v~ll~~llvs~~va~~L~~r~r~ 560 (593)
+|+++++||||+++++++|.|+|.
T Consensus 152 aVVI~~iLLIA~iIa~icyrrkR~ 175 (290)
T PF05454_consen 152 AVVIAAILLIAGIIACICYRRKRK 175 (290)
T ss_dssp ------------------------
T ss_pred HHHHHHHHHHHHHHHHHhhhhhhc
Confidence 334445555666666665554443
No 12
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=62.62 E-value=8.5 Score=34.14 Aligned_cols=23 Identities=26% Similarity=0.336 Sum_probs=14.1
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHH
Q psy8304 530 TIGFSATLFVLLGILVVSCLVSA 552 (593)
Q Consensus 530 ~~~f~~~~v~ll~~llvs~~va~ 552 (593)
...|.++.++|+++||||+.|++
T Consensus 3 SK~~llL~l~LA~lLlisSevaa 25 (95)
T PF07172_consen 3 SKAFLLLGLLLAALLLISSEVAA 25 (95)
T ss_pred hhHHHHHHHHHHHHHHHHhhhhh
Confidence 34555555555566777766665
No 13
>PF15102 TMEM154: TMEM154 protein family
Probab=61.96 E-value=6.9 Score=37.36 Aligned_cols=22 Identities=32% Similarity=0.368 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHHhhheeecc
Q psy8304 539 VLLGILVVSCLVSACLCIRLRP 560 (593)
Q Consensus 539 ~ll~~llvs~~va~~L~~r~r~ 560 (593)
+||++||+++|+.++.+.|+|.
T Consensus 66 VLLvlLLl~vV~lv~~~kRkr~ 87 (146)
T PF15102_consen 66 VLLVLLLLSVVCLVIYYKRKRT 87 (146)
T ss_pred HHHHHHHHHHHHheeEEeeccc
Confidence 3334444444443433333333
No 14
>KOG0162|consensus
Probab=60.39 E-value=38 Score=40.09 Aligned_cols=12 Identities=33% Similarity=0.382 Sum_probs=6.6
Q ss_pred ccccCCcccccc
Q psy8304 23 TEVPNVSKVEEQ 34 (593)
Q Consensus 23 ~~~~~~~~~~~~ 34 (593)
+..|..+++.-+
T Consensus 921 ~GlP~~Skps~k 932 (1106)
T KOG0162|consen 921 TGLPPNSKPSRK 932 (1106)
T ss_pred CCCCCCCCcCCc
Confidence 445666665433
No 15
>PF14991 MLANA: Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=60.36 E-value=1.6 Score=39.76 Aligned_cols=27 Identities=26% Similarity=0.346 Sum_probs=1.6
Q ss_pred HHHHHHHHHHHHHHHHHHHhhheeecc
Q psy8304 534 SATLFVLLGILVVSCLVSACLCIRLRP 560 (593)
Q Consensus 534 ~~~~v~ll~~llvs~~va~~L~~r~r~ 560 (593)
|+++.||.+||.+..++.||.|.|+.-
T Consensus 26 AaGIGiL~VILgiLLliGCWYckRRSG 52 (118)
T PF14991_consen 26 AAGIGILIVILGILLLIGCWYCKRRSG 52 (118)
T ss_dssp --SSS----------------------
T ss_pred hccceeHHHHHHHHHHHhheeeeecch
Confidence 345555556666666788898877653
No 16
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=58.87 E-value=4.8 Score=42.78 Aligned_cols=37 Identities=14% Similarity=0.095 Sum_probs=21.9
Q ss_pred ceeechhHHHHHHHHHHHHHHHHHHHHHhhheeecccc
Q psy8304 525 MICMTTIGFSATLFVLLGILVVSCLVSACLCIRLRPFS 562 (593)
Q Consensus 525 ~iC~S~~~f~~~~v~ll~~llvs~~va~~L~~r~r~~~ 562 (593)
..+++-...+++.+++ ++++|.++|.++|..|+||..
T Consensus 249 ~~~~~~~t~I~aSiia-IliIVLIMvIIYLILRYRRKK 285 (299)
T PF02009_consen 249 STYASLTTAIIASIIA-ILIIVLIMVIIYLILRYRRKK 285 (299)
T ss_pred chhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh
Confidence 4566655444444443 444456777788888877743
No 17
>KOG2675|consensus
Probab=57.26 E-value=6.1 Score=43.69 Aligned_cols=12 Identities=25% Similarity=0.457 Sum_probs=7.3
Q ss_pred cccccCCccccc
Q psy8304 22 KTEVPNVSKVEE 33 (593)
Q Consensus 22 ~~~~~~~~~~~~ 33 (593)
.|++|||.+--+
T Consensus 154 ~tP~p~vke~~d 165 (480)
T KOG2675|consen 154 PTPAPYVKEFKD 165 (480)
T ss_pred CCCchHHHHHHH
Confidence 477777765433
No 18
>PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=56.07 E-value=10 Score=36.19 Aligned_cols=37 Identities=16% Similarity=0.336 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHhhheeecccccCCCCCccccccCCCcc
Q psy8304 542 GILVVSCLVSACLCIRLRPFSNKTSQKTMSVSYTNPIV 579 (593)
Q Consensus 542 ~~llvs~~va~~L~~r~r~~~~~~~~~~~~~~~~~~~~ 579 (593)
+++++-++|.++||-+++++.. .+--+...-|.||..
T Consensus 39 vVliiiiivli~lcssRKkKaa-AAi~eediQfinpyq 75 (189)
T PF05568_consen 39 VVLIIIIIVLIYLCSSRKKKAA-AAIEEEDIQFINPYQ 75 (189)
T ss_pred HHHHHHHHHHHHHHhhhhHHHH-hhhhhhcccccCccc
Confidence 4444555666677766555443 222234556777654
No 19
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=51.34 E-value=14 Score=34.24 Aligned_cols=27 Identities=22% Similarity=0.262 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhheeeccc
Q psy8304 534 SATLFVLLGILVVSCLVSACLCIRLRPF 561 (593)
Q Consensus 534 ~~~~v~ll~~llvs~~va~~L~~r~r~~ 561 (593)
.|.|.++++++++ +++.+|++.|+|++
T Consensus 68 ~Ii~gv~aGvIg~-Illi~y~irR~~Kk 94 (122)
T PF01102_consen 68 GIIFGVMAGVIGI-ILLISYCIRRLRKK 94 (122)
T ss_dssp HHHHHHHHHHHHH-HHHHHHHHHHHS--
T ss_pred ehhHHHHHHHHHH-HHHHHHHHHHHhcc
Confidence 3444333333332 33444444444443
No 20
>KOG1922|consensus
Probab=49.88 E-value=41 Score=40.49 Aligned_cols=8 Identities=25% Similarity=0.081 Sum_probs=3.0
Q ss_pred chhHHHHH
Q psy8304 529 TTIGFSAT 536 (593)
Q Consensus 529 S~~~f~~~ 536 (593)
....|.+.
T Consensus 765 ~~~~f~~~ 772 (833)
T KOG1922|consen 765 PEQVFSIL 772 (833)
T ss_pred HHHHHHHH
Confidence 33333333
No 21
>KOG0132|consensus
Probab=49.61 E-value=47 Score=39.46 Aligned_cols=13 Identities=15% Similarity=0.396 Sum_probs=5.7
Q ss_pred ceeeEEeeeEEEe
Q psy8304 2 KTYKIVSTQLQIT 14 (593)
Q Consensus 2 ~~~~~~~~~~~~~ 14 (593)
+.||+..+.+.|-
T Consensus 478 ~n~kv~~k~Iki~ 490 (894)
T KOG0132|consen 478 SNVKVADKTIKIA 490 (894)
T ss_pred hcccccceeeEEe
Confidence 3445444444333
No 22
>PTZ00046 rifin; Provisional
Probab=48.54 E-value=9.1 Score=41.60 Aligned_cols=30 Identities=17% Similarity=0.174 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHHHHHHHHhhheeecccccC
Q psy8304 535 ATLFVLLGILVVSCLVSACLCIRLRPFSNK 564 (593)
Q Consensus 535 ~~~v~ll~~llvs~~va~~L~~r~r~~~~~ 564 (593)
|.+.++.++++|.++|.++|..|.||...-
T Consensus 317 IiaSiiAIvVIVLIMvIIYLILRYRRKKKM 346 (358)
T PTZ00046 317 IIASIVAIVVIVLIMVIIYLILRYRRKKKM 346 (358)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcchh
Confidence 334444444555678889999999987653
No 23
>COG5178 PRP8 U5 snRNP spliceosome subunit [RNA processing and modification]
Probab=48.50 E-value=14 Score=45.32 Aligned_cols=17 Identities=12% Similarity=0.286 Sum_probs=8.4
Q ss_pred cceeEEecCCeEEEEEEe
Q psy8304 163 VSLDVKCEKNSMKVFISF 180 (593)
Q Consensus 163 ~~v~V~C~~d~M~V~V~~ 180 (593)
+.+.+.|. +.|++..+.
T Consensus 139 vkvlyh~~-Gaitfvne~ 155 (2365)
T COG5178 139 VKVLYHCH-GAITFVNEV 155 (2365)
T ss_pred eeEEeecc-ceEEEeecC
Confidence 34444555 445555544
No 24
>PF11980 DUF3481: Domain of unknown function (DUF3481); InterPro: IPR022579 This domain of unknown function is located in the C terminus of the eukaryotic neuropilin receptor family of proteins. It is found in association with PF00754 from PFAM, PF00431 from PFAM and PF00629 from PFAM. There are two completely conserved residues (Y and E) that may be functionally important.
Probab=46.90 E-value=27 Score=30.41 Aligned_cols=35 Identities=14% Similarity=0.046 Sum_probs=17.4
Q ss_pred hhHHHHHHHHHHHHHHHHHH-HHHhhheeecccccC
Q psy8304 530 TIGFSATLFVLLGILVVSCL-VSACLCIRLRPFSNK 564 (593)
Q Consensus 530 ~~~f~~~~v~ll~~llvs~~-va~~L~~r~r~~~~~ 564 (593)
...+++.+...+++||+++. +..+.|.|.++-..+
T Consensus 14 ~~~yyiiA~gga~llL~~v~l~vvL~C~r~~~a~kk 49 (87)
T PF11980_consen 14 PYWYYIIAMGGALLLLVAVCLGVVLYCHRFHWAAKK 49 (87)
T ss_pred ceeeHHHhhccHHHHHHHHHHHHHHhhhhhcccccc
Confidence 34444444444444555555 444555666664433
No 25
>PF11770 GAPT: GRB2-binding adapter (GAPT); InterPro: IPR021082 This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells []. In contrast to other transmembrane adaptor proteins, GAPT, which this entry represents, is not phosphorylated upon BCR ligation. It associates with GRB2 constitutively through its proline-rich region [].
Probab=46.90 E-value=11 Score=36.21 Aligned_cols=13 Identities=31% Similarity=0.485 Sum_probs=5.7
Q ss_pred cCCCccccccccc
Q psy8304 574 YTNPIVSVKKTSK 586 (593)
Q Consensus 574 ~~~~~~~~~~~~~ 586 (593)
|+=|.|-+.+.+|
T Consensus 41 ftLPkflqRRssk 53 (158)
T PF11770_consen 41 FTLPKFLQRRSSK 53 (158)
T ss_pred cchHHHHHhhhhh
Confidence 4444444444433
No 26
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=46.54 E-value=16 Score=38.65 Aligned_cols=20 Identities=25% Similarity=0.076 Sum_probs=12.9
Q ss_pred HHHHHHHHHHhhheeecccc
Q psy8304 543 ILVVSCLVSACLCIRLRPFS 562 (593)
Q Consensus 543 ~llvs~~va~~L~~r~r~~~ 562 (593)
..||.++++|+|+.|+|+..
T Consensus 282 a~lvlivLiaYli~Rrr~~~ 301 (306)
T PF01299_consen 282 AGLVLIVLIAYLIGRRRSRA 301 (306)
T ss_pred HHHHHHHHHhheeEeccccc
Confidence 33344566778888877664
No 27
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=46.44 E-value=10 Score=41.12 Aligned_cols=30 Identities=17% Similarity=0.193 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHHHHHHHHhhheeecccccC
Q psy8304 535 ATLFVLLGILVVSCLVSACLCIRLRPFSNK 564 (593)
Q Consensus 535 ~~~v~ll~~llvs~~va~~L~~r~r~~~~~ 564 (593)
|.+.++..+++|.++|.++|..|.||...-
T Consensus 312 IiaSiIAIvvIVLIMvIIYLILRYRRKKKM 341 (353)
T TIGR01477 312 IIASIIAILIIVLIMVIIYLILRYRRKKKM 341 (353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcchh
Confidence 344444444555678889999999987643
No 28
>PRK09752 adhesin; Provisional
Probab=45.92 E-value=22 Score=44.10 Aligned_cols=6 Identities=17% Similarity=0.357 Sum_probs=3.0
Q ss_pred eeEEEe
Q psy8304 9 TQLQIT 14 (593)
Q Consensus 9 ~~~~~~ 14 (593)
|.|.|+
T Consensus 851 T~V~V~ 856 (1250)
T PRK09752 851 TTVVVN 856 (1250)
T ss_pred EEEEEE
Confidence 445554
No 29
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=45.68 E-value=43 Score=30.41 Aligned_cols=26 Identities=15% Similarity=0.124 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHHHHHhhheeecccc
Q psy8304 537 LFVLLGILVVSCLVSACLCIRLRPFS 562 (593)
Q Consensus 537 ~v~ll~~llvs~~va~~L~~r~r~~~ 562 (593)
|+.||++|++++-+.+|-+.|+|...
T Consensus 6 il~llLll~l~asl~~wr~~~rq~k~ 31 (107)
T PF15330_consen 6 ILALLLLLSLAASLLAWRMKQRQKKA 31 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhccc
Confidence 33344444445555555555555443
No 30
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=43.25 E-value=27 Score=29.55 Aligned_cols=24 Identities=13% Similarity=0.297 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHHHhhheeec
Q psy8304 536 TLFVLLGILVVSCLVSACLCIRLR 559 (593)
Q Consensus 536 ~~v~ll~~llvs~~va~~L~~r~r 559 (593)
+.+++++++++.+++..++|.|++
T Consensus 4 ~~~~~g~~~ll~~v~~~~~~~rr~ 27 (75)
T PF14575_consen 4 ASIIVGVLLLLVLVIIVIVCFRRC 27 (75)
T ss_dssp HHHHHHHHHHHHHHHHHHCCCTT-
T ss_pred ehHHHHHHHHHHhheeEEEEEeeE
Confidence 333443444443444444444443
No 31
>PRK15319 AIDA autotransporter-like protein ShdA; Provisional
Probab=41.71 E-value=37 Score=44.00 Aligned_cols=6 Identities=0% Similarity=-0.042 Sum_probs=2.2
Q ss_pred EEEEEe
Q psy8304 389 YAHFQA 394 (593)
Q Consensus 389 ~a~f~A 394 (593)
+.-+++
T Consensus 1947 RLGLR~ 1952 (2039)
T PRK15319 1947 RLGLRV 1952 (2039)
T ss_pred EEEEEE
Confidence 333333
No 32
>PF06809 NPDC1: Neural proliferation differentiation control-1 protein (NPDC1); InterPro: IPR009635 This family consists of several neural proliferation differentiation control-1 (NPDC1) proteins. NPDC1 plays a role in the control of neural cell proliferation and differentiation. It has been suggested that NPDC1 may be involved in the development of several secretion glands. This family also contains the C-terminal region of the Caenorhabditis elegans protein CAB-1 (Q93249 from SWISSPROT) which is known to interact with AEX-3 [].; GO: 0016021 integral to membrane
Probab=40.64 E-value=21 Score=38.18 Aligned_cols=31 Identities=10% Similarity=0.033 Sum_probs=21.9
Q ss_pred HHHHHHHHHHhhheeecccccCCCCCccccc
Q psy8304 543 ILVVSCLVSACLCIRLRPFSNKTSQKTMSVS 573 (593)
Q Consensus 543 ~llvs~~va~~L~~r~r~~~~~~~~~~~~~~ 573 (593)
+.+++++++++.|+|+++-.+-+.+..|+..
T Consensus 209 aG~aAliva~~cW~Rlqr~~rlaqk~DY~a~ 239 (341)
T PF06809_consen 209 AGAAALIVAGYCWYRLQREIRLAQKVDYPAT 239 (341)
T ss_pred HHHHHHHHhhheEEEeccccccccccccccc
Confidence 3445777888888999988776666666443
No 33
>PLN02983 biotin carboxyl carrier protein of acetyl-CoA carboxylase
Probab=40.16 E-value=98 Score=32.58 Aligned_cols=40 Identities=20% Similarity=0.248 Sum_probs=25.8
Q ss_pred CCCCCCCCCCCCCCccceeEEecCCeEEEEEEeCCCccceee
Q psy8304 148 APDPWPNAVADTPKIVSLDVKCEKNSMKVFISFDKPFFGIVF 189 (593)
Q Consensus 148 ~~~~wp~~~~d~~~i~~v~V~C~~d~M~V~V~~~~pF~G~Iy 189 (593)
.++.|-....|. +..=++.|.-+.|.+.++...++.|.|-
T Consensus 214 pge~w~VkvGDs--VkkGQvLavIEAMKmeieV~AP~sGtV~ 253 (274)
T PLN02983 214 PGEPPFVKVGDK--VQKGQVVCIIEAMKLMNEIEADQSGTIV 253 (274)
T ss_pred CCCcceeCCCCE--ecCCCEEEEEEeeceeeEEecCCCeEEE
Confidence 345676665543 3444567777777777777788999764
No 34
>PF05308 Mito_fiss_reg: Mitochondrial fission regulator; InterPro: IPR007972 This family consists of several uncharacterised eukaryotic proteins of unknown function.
Probab=39.55 E-value=62 Score=33.70 Aligned_cols=12 Identities=25% Similarity=0.432 Sum_probs=6.0
Q ss_pred cccccCCccccc
Q psy8304 22 KTEVPNVSKVEE 33 (593)
Q Consensus 22 ~~~~~~~~~~~~ 33 (593)
...+|+...-++
T Consensus 97 ~~s~P~l~~~~~ 108 (253)
T PF05308_consen 97 NPSSPNLRQNEP 108 (253)
T ss_pred cCcCCCcCCCcc
Confidence 345555555444
No 35
>PF05337 CSF-1: Macrophage colony stimulating factor-1 (CSF-1); InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=38.65 E-value=10 Score=39.71 Aligned_cols=22 Identities=32% Similarity=0.295 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHhhheeeccccc
Q psy8304 542 GILVVSCLVSACLCIRLRPFSN 563 (593)
Q Consensus 542 ~~llvs~~va~~L~~r~r~~~~ 563 (593)
+||||...|..+||||+|+++.
T Consensus 234 SiILVLLaVGGLLfYr~rrRs~ 255 (285)
T PF05337_consen 234 SIILVLLAVGGLLFYRRRRRSH 255 (285)
T ss_dssp ----------------------
T ss_pred chhhhhhhccceeeeccccccc
Confidence 5666778899999999999875
No 36
>KOG2675|consensus
Probab=38.43 E-value=22 Score=39.51 Aligned_cols=7 Identities=14% Similarity=0.449 Sum_probs=4.0
Q ss_pred CCCCCCC
Q psy8304 149 PDPWPNA 155 (593)
Q Consensus 149 ~~~wp~~ 155 (593)
+..|-..
T Consensus 331 GkkW~VE 337 (480)
T KOG2675|consen 331 GKKWRVE 337 (480)
T ss_pred cceEEEe
Confidence 5556654
No 37
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=38.42 E-value=13 Score=30.63 Aligned_cols=24 Identities=21% Similarity=0.174 Sum_probs=0.9
Q ss_pred HHHHHHHHHHHhhheeecccccCC
Q psy8304 542 GILVVSCLVSACLCIRLRPFSNKT 565 (593)
Q Consensus 542 ~~llvs~~va~~L~~r~r~~~~~~ 565 (593)
+.+++++++..|+++|.|+++-+.
T Consensus 20 vgll~ailLIlf~iyR~rkkdEGS 43 (64)
T PF01034_consen 20 VGLLFAILLILFLIYRMRKKDEGS 43 (64)
T ss_dssp -----------------S------
T ss_pred HHHHHHHHHHHHHHHHHHhcCCCC
Confidence 444456667778888888877643
No 38
>KOG2199|consensus
Probab=37.13 E-value=68 Score=35.50 Aligned_cols=10 Identities=30% Similarity=0.351 Sum_probs=6.2
Q ss_pred CCCCCCCCCC
Q psy8304 150 DPWPNAVADT 159 (593)
Q Consensus 150 ~~wp~~~~d~ 159 (593)
..||.+..++
T Consensus 450 q~~pa~~a~~ 459 (462)
T KOG2199|consen 450 QQPPAPTADP 459 (462)
T ss_pred CCCCCCCCCc
Confidence 6677766543
No 39
>PF05518 Totivirus_coat: Totivirus coat protein; InterPro: IPR008871 This family of proteins contain the coat proteins of the Totiviruses.
Probab=37.12 E-value=1.5e+02 Score=35.34 Aligned_cols=15 Identities=47% Similarity=1.139 Sum_probs=6.2
Q ss_pred CCCCCCCCCCCCCCC
Q psy8304 109 HGPPPPPPPLSAAKP 123 (593)
Q Consensus 109 ~~p~~~~~~~~~~~~ 123 (593)
..||||++++..++.
T Consensus 711 ~~~ppp~~~~g~~~~ 725 (759)
T PF05518_consen 711 AAPPPPAPPPGPPPP 725 (759)
T ss_pred CCCCCCCCCCCCCCC
Confidence 344444444443333
No 40
>PRK09752 adhesin; Provisional
Probab=35.18 E-value=37 Score=42.26 Aligned_cols=6 Identities=0% Similarity=-0.064 Sum_probs=2.4
Q ss_pred Cccccc
Q psy8304 127 QPEAMD 132 (593)
Q Consensus 127 ~~~~~~ 132 (593)
.+++..
T Consensus 954 lRPEvG 959 (1250)
T PRK09752 954 LNAKVG 959 (1250)
T ss_pred cccccc
Confidence 344433
No 41
>KOG1925|consensus
Probab=35.02 E-value=44 Score=37.88 Aligned_cols=10 Identities=30% Similarity=0.471 Sum_probs=4.3
Q ss_pred CCccccccch
Q psy8304 27 NVSKVEEQPK 36 (593)
Q Consensus 27 ~~~~~~~~~~ 36 (593)
++.+-|-|-|
T Consensus 164 ~~~~AE~~~~ 173 (817)
T KOG1925|consen 164 NVAAAETEKQ 173 (817)
T ss_pred hhHHHHHHHH
Confidence 3444444443
No 42
>PF00943 Alpha_E2_glycop: Alphavirus E2 glycoprotein; InterPro: IPR000936 Alphaviruses are enveloped RNA viruses that use arthropods such as mosquitoes for transmission to their vertebrate hosts, and include Semliki Forest and Sindbis viruses []. Alphaviruses consist of three structural proteins: the core nucleocapsid protein C, and the envelope proteins P62 and E1 (IPR002548 from INTERPRO) that associate as a heterodimer. The viral membrane-anchored surface glycoproteins are responsible for receptor recognition and entry into target cells through membrane fusion. The proteolytic maturation of P62 into E2 and E3 (IPR002533 from INTERPRO) causes a change in the viral surface. Together the E1, E2, and sometimes E3 glycoprotein "spikes" form an E1/E2 dimer or an E1/E2/E3 trimer, where E2 extends from the centre to the vertices, E1 fills the space between the vertices, and E3, if present, is at the distal end of the spike [, ]. Upon exposure of the virus to the acidity of the endosome, E1 dissociates from E2 to form an E1 homotrimer, which is necessary for the fusion step to drive the cellular and viral membranes together []. This entry represents the alphaviral E2 glycoprotein. The E2 glycoprotein functions to interact with the nucleocapsid through its cytoplasmic domain, while its ectodomain is responsible for binding a cellular receptor.; GO: 0005198 structural molecule activity, 0019028 viral capsid; PDB: 2XFC_B 3N42_B 3N41_B 2XFB_E 3N40_P 3N43_B 3N44_B 3MUU_B 1Z8Y_P 3J0C_H ....
Probab=33.98 E-value=28 Score=38.29 Aligned_cols=28 Identities=29% Similarity=0.306 Sum_probs=22.6
Q ss_pred HHHHHHHHHHHHHHHHHHHhhheeeccc
Q psy8304 534 SATLFVLLGILVVSCLVSACLCIRLRPF 561 (593)
Q Consensus 534 ~~~~v~ll~~llvs~~va~~L~~r~r~~ 561 (593)
+++++.+.+++++++.+++|||.|.|+.
T Consensus 354 i~~~~~~~~~~~i~~~~s~~~~~~aR~~ 381 (403)
T PF00943_consen 354 IVVLSGAAAIILISLAASAWLCCKARRK 381 (403)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3566667778888999999999999864
No 43
>PHA02902 putative IMV membrane protein; Provisional
Probab=33.59 E-value=72 Score=26.48 Aligned_cols=27 Identities=22% Similarity=0.397 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHHHHHhhheeecccc
Q psy8304 536 TLFVLLGILVVSCLVSACLCIRLRPFS 562 (593)
Q Consensus 536 ~~v~ll~~llvs~~va~~L~~r~r~~~ 562 (593)
++||+++++++.|++...+|.|.+-+.
T Consensus 5 tfvi~~v~v~Ivclliya~YrR~kci~ 31 (70)
T PHA02902 5 TFVILAVIVIIFCLLIYAAYKRYKCIP 31 (70)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCCC
Confidence 456666666565666666665554433
No 44
>KOG2236|consensus
Probab=33.37 E-value=72 Score=35.86 Aligned_cols=59 Identities=25% Similarity=0.400 Sum_probs=0.0
Q ss_pred CCCCC----CCCCCCCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCCCCCCCCcc
Q psy8304 68 GYVDR----PPPPPAPIVAPPRPHPNGRHPSHGKPQFNHKLGGVYHGPPPPPPPLSAAKPPPVQPE 129 (593)
Q Consensus 68 ~~~~~----~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 129 (593)
||++| ++-++||+-.++++.-....|.-|-++.+ -+.+..||||+||++-.+.-+-+.+
T Consensus 418 Nyppp~p~f~m~~~hP~~~~p~~~~g~~~P~~~mpp~~---P~~~~pppP~~pp~p~~~~~q~q~~ 480 (483)
T KOG2236|consen 418 NYPPPSPSFPMFQPHPPESNPPANFGQANPFNQMPPAY---PHQQSPPPPPPPPPPNSPMNQMQNP 480 (483)
T ss_pred CCCCCCCCCCccCCCCCCCCCcccccccCccccCCCCC---ccccCCCCCCCCCCCCChhhcccCC
No 45
>KOG2893|consensus
Probab=32.65 E-value=2.3e+02 Score=29.41 Aligned_cols=9 Identities=22% Similarity=0.564 Sum_probs=5.2
Q ss_pred CCCCCCCCC
Q psy8304 149 PDPWPNAVA 157 (593)
Q Consensus 149 ~~~wp~~~~ 157 (593)
.+.|+.+.-
T Consensus 216 ~~r~~~p~~ 224 (341)
T KOG2893|consen 216 GDRWGPPMR 224 (341)
T ss_pred CCCCCCCcC
Confidence 466776643
No 46
>KOG1922|consensus
Probab=32.00 E-value=88 Score=37.73 Aligned_cols=7 Identities=100% Similarity=2.527 Sum_probs=2.6
Q ss_pred CCCCCCC
Q psy8304 111 PPPPPPP 117 (593)
Q Consensus 111 p~~~~~~ 117 (593)
|+|||||
T Consensus 347 p~ppppp 353 (833)
T KOG1922|consen 347 PPPPPPP 353 (833)
T ss_pred CCCCCCC
Confidence 3333333
No 47
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=31.82 E-value=51 Score=28.55 Aligned_cols=24 Identities=17% Similarity=0.325 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHHHhhheee
Q psy8304 535 ATLFVLLGILVVSCLVSACLCIRL 558 (593)
Q Consensus 535 ~~~v~ll~~llvs~~va~~L~~r~ 558 (593)
++++.+.+.++++++|..+.++..
T Consensus 7 ~~iialiv~~iiaIvvW~iv~ieY 30 (81)
T PF00558_consen 7 LAIIALIVALIIAIVVWTIVYIEY 30 (81)
T ss_dssp -HHHHHHHHHHHHHHHHHHH----
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444445455555666666655433
No 48
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=31.32 E-value=35 Score=31.54 Aligned_cols=19 Identities=26% Similarity=0.135 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHhhheeecc
Q psy8304 542 GILVVSCLVSACLCIRLRP 560 (593)
Q Consensus 542 ~~llvs~~va~~L~~r~r~ 560 (593)
++||+.++++++--.|+|+
T Consensus 11 ~i~l~~~~~~~~~rRR~r~ 29 (130)
T PF12273_consen 11 AILLFLFLFYCHNRRRRRR 29 (130)
T ss_pred HHHHHHHHHHHHHHHHhhc
Confidence 4444456666777777775
No 49
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=30.62 E-value=73 Score=23.79 Aligned_cols=13 Identities=8% Similarity=0.013 Sum_probs=5.8
Q ss_pred HHHHHhhheeecc
Q psy8304 548 CLVSACLCIRLRP 560 (593)
Q Consensus 548 ~~va~~L~~r~r~ 560 (593)
..+..+.|.++|.
T Consensus 21 i~~~~YaCcykk~ 33 (38)
T PF02439_consen 21 ICMFYYACCYKKH 33 (38)
T ss_pred HHHHHHHHHHccc
Confidence 3444444544443
No 50
>KOG4368|consensus
Probab=29.52 E-value=2.2e+02 Score=33.22 Aligned_cols=23 Identities=17% Similarity=0.167 Sum_probs=10.9
Q ss_pred CCCCCCCCCCCC---CCCCCCCCCCC
Q psy8304 69 YVDRPPPPPAPI---VAPPRPHPNGR 91 (593)
Q Consensus 69 ~~~~~~~~~~p~---~~~~~~~~~~~ 91 (593)
..+||-+-|+++ +++.++-++||
T Consensus 291 ~~~~p~~GPgdH~h~~~~~p~dq~hp 316 (757)
T KOG4368|consen 291 TPPPPAPGPGPHDQIPPNKPFDQPHP 316 (757)
T ss_pred cCCCCCCCCCcccccCCCCCCCCCCC
Confidence 344444444444 45555544444
No 51
>KOG3895|consensus
Probab=29.29 E-value=2.1e+02 Score=31.54 Aligned_cols=9 Identities=22% Similarity=0.530 Sum_probs=3.4
Q ss_pred cCCcccccc
Q psy8304 26 PNVSKVEEQ 34 (593)
Q Consensus 26 ~~~~~~~~~ 34 (593)
|+.-+-.||
T Consensus 369 pliGeh~ee 377 (488)
T KOG3895|consen 369 PLIGEHQEE 377 (488)
T ss_pred cccccchhH
Confidence 344333333
No 52
>KOG0608|consensus
Probab=29.26 E-value=1.7e+02 Score=34.72 Aligned_cols=10 Identities=40% Similarity=0.285 Sum_probs=4.2
Q ss_pred CCccccccCC
Q psy8304 126 VQPEAMDKSG 135 (593)
Q Consensus 126 ~~~~~~~~~~ 135 (593)
++..+-++..
T Consensus 270 ~q~s~y~ssP 279 (1034)
T KOG0608|consen 270 PQQSAYDSSP 279 (1034)
T ss_pred CCccccccCC
Confidence 3333444443
No 53
>PHA03211 serine/threonine kinase US3; Provisional
Probab=29.20 E-value=62 Score=36.24 Aligned_cols=16 Identities=19% Similarity=0.370 Sum_probs=9.2
Q ss_pred chhhhcccCccccccC
Q psy8304 47 TDQVASETSSDQESQQ 62 (593)
Q Consensus 47 ~~~~~~~~~~~~~~~~ 62 (593)
||.+-.|+..+.+..+
T Consensus 14 ~~~~~~~~~~~~~~~~ 29 (461)
T PHA03211 14 TEDAGREEAVEPETTT 29 (461)
T ss_pred CccccccccCCCCCCC
Confidence 4555556666666643
No 54
>PF15179 Myc_target_1: Myc target protein 1
Probab=28.97 E-value=1.2e+02 Score=30.15 Aligned_cols=8 Identities=38% Similarity=0.928 Sum_probs=4.2
Q ss_pred cccccccC
Q psy8304 585 SKSSFYTS 592 (593)
Q Consensus 585 ~~~~~~~~ 592 (593)
.+.-||.+
T Consensus 77 nr~GfyR~ 84 (197)
T PF15179_consen 77 NRTGFYRS 84 (197)
T ss_pred Cccceecc
Confidence 45556654
No 55
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=28.26 E-value=19 Score=33.54 Aligned_cols=22 Identities=23% Similarity=0.240 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHhhheeeccccc
Q psy8304 542 GILVVSCLVSACLCIRLRPFSN 563 (593)
Q Consensus 542 ~~llvs~~va~~L~~r~r~~~~ 563 (593)
+++||.++++.+|++|++++..
T Consensus 87 ~v~lVl~llsg~lv~rrcrrr~ 108 (129)
T PF12191_consen 87 SVVLVLALLSGFLVWRRCRRRE 108 (129)
T ss_dssp ----------------------
T ss_pred HHHHHHHHHHHHHHHhhhhccc
Confidence 4445567777888877776654
No 56
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=27.83 E-value=46 Score=35.50 Aligned_cols=38 Identities=13% Similarity=0.114 Sum_probs=25.3
Q ss_pred ceeechhHHHHHHHHHHHHHHHHHHHHHhhheeeccccc
Q psy8304 525 MICMTTIGFSATLFVLLGILVVSCLVSACLCIRLRPFSN 563 (593)
Q Consensus 525 ~iC~S~~~f~~~~v~ll~~llvs~~va~~L~~r~r~~~~ 563 (593)
..|-+....++.+++++ +|+|.+|-.+|.|.|.+....
T Consensus 252 ~~~~t~I~aSiiaIliI-VLIMvIIYLILRYRRKKKmkK 289 (299)
T PF02009_consen 252 ASLTTAIIASIIAILII-VLIMVIIYLILRYRRKKKMKK 289 (299)
T ss_pred hhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhhhH
Confidence 35666666665555553 334558889999999887654
No 57
>KOG1830|consensus
Probab=27.55 E-value=1.3e+02 Score=33.69 Aligned_cols=10 Identities=70% Similarity=1.660 Sum_probs=4.4
Q ss_pred CCCCCCCCCC
Q psy8304 109 HGPPPPPPPL 118 (593)
Q Consensus 109 ~~p~~~~~~~ 118 (593)
..||||||||
T Consensus 399 ~sppPppppp 408 (518)
T KOG1830|consen 399 GSPPPPPPPP 408 (518)
T ss_pred CCCCCCCCCC
Confidence 3444444444
No 58
>PF05506 DUF756: Domain of unknown function (DUF756); InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=26.35 E-value=3.1e+02 Score=23.32 Aligned_cols=45 Identities=13% Similarity=0.114 Sum_probs=30.5
Q ss_pred eEEEEEEeCCCCCcccccceeecCCEEEEEEEEecCCCcceEEEe
Q psy8304 300 GCWMQIQVGKGPWASEVSGLVKIGQTMTMVLAIKDDDSKFDMLVR 344 (593)
Q Consensus 300 ~~~M~I~~g~GP~~~~Vs~~V~IGd~L~l~v~i~~~~~~~~l~V~ 344 (593)
.+.++|+......+.+....|.-|+.+.+.|.+...++-|++.|+
T Consensus 33 ~~~~~v~~~~y~~~~~~~~~v~ag~~~~~~w~l~~s~gwYDl~v~ 77 (89)
T PF05506_consen 33 AVTFTVYDNAYGGGGPWTYTVAAGQTVSLTWPLAASGGWYDLTVT 77 (89)
T ss_pred cEEEEEEeCCcCCCCCEEEEECCCCEEEEEEeecCCCCcEEEEEE
Confidence 456677752211122334578889999999999766788998775
No 59
>KOG3441|consensus
Probab=26.09 E-value=2.8e+02 Score=25.95 Aligned_cols=65 Identities=20% Similarity=0.347 Sum_probs=42.3
Q ss_pred cCCCceEEEEEEeCCCCCcccccceeecCCEEEEEEEEecCCCcceEEEeEEEEecCC-----CCcEEeeccCCcccccc
Q psy8304 295 AGDNVGCWMQIQVGKGPWASEVSGLVKIGQTMTMVLAIKDDDSKFDMLVRNCMAHDGK-----RAPIQLVDQRGCVTRSK 369 (593)
Q Consensus 295 sg~~p~~~M~I~~g~GP~~~~Vs~~V~IGd~L~l~v~i~~~~~~~~l~V~sC~Atdg~-----g~~~~LIDenGCp~D~~ 369 (593)
+|..|.| ..+|+-+| ..++||.+-+ .|+. +-.-.+.|..|.-..-. .+.+.|||+||-|+...
T Consensus 53 ~gr~PrC-IHVYkkrg--------vg~~GDkiLv--AIkG-QmkKa~vVGh~~~~k~~~P~fDsNniVLiddnGnPlGtR 120 (149)
T KOG3441|consen 53 TGRLPRC-IHVYKKRG--------VGELGDKILV--AIKG-QMKKAYVVGHVHYRKHGVPVFDSNNIVLIDDNGNPLGTR 120 (149)
T ss_pred cCCCCce-EEEEeccc--------ccccccEEEE--EEec-ceeeeEEEEeeccCCCCCcccCCCcEEEECCCCCcccce
Confidence 3444555 67776543 5689999754 4543 23345677777655431 25799999999999865
Q ss_pred cc
Q psy8304 370 LM 371 (593)
Q Consensus 370 L~ 371 (593)
|.
T Consensus 121 I~ 122 (149)
T KOG3441|consen 121 IT 122 (149)
T ss_pred Ee
Confidence 53
No 60
>PF04881 Adeno_GP19K: Adenovirus GP19K; InterPro: IPR006965 This 19 kDa glycoprotein binds the major histocompatibility (MHC) class I antigens in the endoplasmic reticulum (ER). The ER retention signal at the C terminus of Gp19K causes retention of the complex in the ER, preventing lysis of the cell by cytotoxic T-lymphocytes [].; GO: 0005537 mannose binding, 0050690 regulation of defense response to virus by virus
Probab=24.58 E-value=73 Score=29.94 Aligned_cols=31 Identities=23% Similarity=0.401 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhheeecccc
Q psy8304 532 GFSATLFVLLGILVVSCLVSACLCIRLRPFS 562 (593)
Q Consensus 532 ~f~~~~v~ll~~llvs~~va~~L~~r~r~~~ 562 (593)
+|.+++++...++-..+++++.|+++.|++.
T Consensus 100 ~fS~af~~~aclit~l~~~~i~~~i~~kpR~ 130 (139)
T PF04881_consen 100 GFSIAFCICACLITALLCVCIHLLIKIKPRN 130 (139)
T ss_pred eeeHHHHHHHHHHHHHHHHHHhhheeecccc
Confidence 4556666554333333455666667666554
No 61
>PRK14758 hypothetical protein; Provisional
Probab=24.43 E-value=88 Score=21.47 Aligned_cols=17 Identities=29% Similarity=0.507 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHHHHhhh
Q psy8304 539 VLLGILVVSCLVSACLC 555 (593)
Q Consensus 539 ~ll~~llvs~~va~~L~ 555 (593)
++|+++++++++++-++
T Consensus 9 liLivlIlCalia~~fy 25 (27)
T PRK14758 9 FILIILILCALIAARFY 25 (27)
T ss_pred HHHHHHHHHHHHHHHhc
Confidence 34455556666666554
No 62
>PF07174 FAP: Fibronectin-attachment protein (FAP); InterPro: IPR010801 This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix [].; GO: 0050840 extracellular matrix binding, 0005576 extracellular region
Probab=24.33 E-value=2.5e+02 Score=29.75 Aligned_cols=6 Identities=17% Similarity=0.135 Sum_probs=2.2
Q ss_pred CCCCCC
Q psy8304 67 HGYVDR 72 (593)
Q Consensus 67 ~~~~~~ 72 (593)
+++..|
T Consensus 36 ~A~AdP 41 (297)
T PF07174_consen 36 TAQADP 41 (297)
T ss_pred cccCCC
Confidence 333333
No 63
>PF15145 DUF4577: Domain of unknown function (DUF4577)
Probab=24.31 E-value=95 Score=28.53 Aligned_cols=31 Identities=16% Similarity=0.110 Sum_probs=21.6
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhheeec
Q psy8304 529 TTIGFSATLFVLLGILVVSCLVSACLCIRLR 559 (593)
Q Consensus 529 S~~~f~~~~v~ll~~llvs~~va~~L~~r~r 559 (593)
+..-|.++++++|.+-|+.+.+++||.+-.+
T Consensus 59 ~~~lffvglii~LivSLaLVsFvIFLiiQTg 89 (128)
T PF15145_consen 59 SRSLFFVGLIIVLIVSLALVSFVIFLIIQTG 89 (128)
T ss_pred ceeehHHHHHHHHHHHHHHHHHHHHheeecc
Confidence 3445778888877777777777788876543
No 64
>KOG2546|consensus
Probab=23.63 E-value=2e+02 Score=32.28 Aligned_cols=7 Identities=43% Similarity=1.203 Sum_probs=3.2
Q ss_pred CCCCCCC
Q psy8304 146 APAPDPW 152 (593)
Q Consensus 146 ~~~~~~w 152 (593)
+..+..|
T Consensus 411 ap~sp~w 417 (483)
T KOG2546|consen 411 APSSPAW 417 (483)
T ss_pred CCCCccc
Confidence 3444455
No 65
>PF14141 YqzM: YqzM-like protein
Probab=23.36 E-value=95 Score=23.72 Aligned_cols=25 Identities=12% Similarity=0.386 Sum_probs=19.7
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHh
Q psy8304 529 TTIGFSATLFVLLGILVVSCLVSAC 553 (593)
Q Consensus 529 S~~~f~~~~v~ll~~llvs~~va~~ 553 (593)
|..+|++++.++.+++++++++-.+
T Consensus 17 sgvGF~vsFgFF~~iFiIa~ii~~~ 41 (43)
T PF14141_consen 17 SGVGFIVSFGFFATIFIIATIIKFI 41 (43)
T ss_pred ccchhhhhHHHHHHHHHHHHHHHHh
Confidence 5678999998888888888877543
No 66
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=22.84 E-value=49 Score=33.34 Aligned_cols=50 Identities=24% Similarity=0.235 Sum_probs=22.8
Q ss_pred HHHHHHHHH-HHHHHHHHHHHhhheeecccccCCCCCc---cccc---cCCCccccc
Q psy8304 533 FSATLFVLL-GILVVSCLVSACLCIRLRPFSNKTSQKT---MSVS---YTNPIVSVK 582 (593)
Q Consensus 533 f~~~~v~ll-~~llvs~~va~~L~~r~r~~~~~~~~~~---~~~~---~~~~~~~~~ 582 (593)
..|+||+++ .+||++++++++.|+.+|.-+.+...+. +..+ -.||.++.+
T Consensus 101 ~lI~lv~~g~~lLla~~~~~~Y~~~~Rrs~~~~~~rl~Ee~~~vEng~h~n~~l~v~ 157 (202)
T PF06365_consen 101 TLIALVTSGSFLLLAILLGAGYCCHQRRSWSKKGQRLGEELYTVENGYHDNPTLSVA 157 (202)
T ss_pred EEEehHHhhHHHHHHHHHHHHHHhhhhccCCcchhhhccCceecccCccCCcccccC
Confidence 344555554 3444455555555544444333333333 1111 247777554
No 67
>PF05283 MGC-24: Multi-glycosylated core protein 24 (MGC-24); InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein []. Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution []. CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments.
Probab=22.73 E-value=78 Score=31.57 Aligned_cols=25 Identities=16% Similarity=0.287 Sum_probs=16.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhhee
Q psy8304 531 IGFSATLFVLLGILVVSCLVSACLCIR 557 (593)
Q Consensus 531 ~~f~~~~v~ll~~llvs~~va~~L~~r 557 (593)
.+|+.+||+.|++|. +++..|=|+|
T Consensus 161 ~SFiGGIVL~LGv~a--I~ff~~KF~k 185 (186)
T PF05283_consen 161 ASFIGGIVLTLGVLA--IIFFLYKFCK 185 (186)
T ss_pred hhhhhHHHHHHHHHH--HHHHHhhhcc
Confidence 789999998887774 4444444443
No 68
>PF13388 DUF4106: Protein of unknown function (DUF4106)
Probab=22.72 E-value=2.2e+02 Score=30.42 Aligned_cols=11 Identities=45% Similarity=0.975 Sum_probs=6.5
Q ss_pred CCCCCCCCCCC
Q psy8304 69 YVDRPPPPPAP 79 (593)
Q Consensus 69 ~~~~~~~~~~p 79 (593)
|-||-||+-+|
T Consensus 171 YIPPNPPrEAP 181 (422)
T PF13388_consen 171 YIPPNPPREAP 181 (422)
T ss_pred ccCCCCccccc
Confidence 44665666665
No 69
>KOG1984|consensus
Probab=22.20 E-value=3.5e+02 Score=33.12 Aligned_cols=11 Identities=27% Similarity=0.495 Sum_probs=7.1
Q ss_pred cCCcccccccc
Q psy8304 361 QRGCVTRSKLM 371 (593)
Q Consensus 361 enGCp~D~~L~ 371 (593)
+.||.+|-.++
T Consensus 600 ~~g~svDlF~t 610 (1007)
T KOG1984|consen 600 ESGCSVDLFLT 610 (1007)
T ss_pred HhCceEEEEEc
Confidence 46777776554
No 70
>KOG1785|consensus
Probab=21.90 E-value=4.5e+02 Score=29.36 Aligned_cols=10 Identities=20% Similarity=0.461 Sum_probs=4.9
Q ss_pred cccccCCCCC
Q psy8304 57 DQESQQSAPK 66 (593)
Q Consensus 57 ~~~~~~~~~~ 66 (593)
.+++.++++.
T Consensus 467 a~~~~p~~p~ 476 (563)
T KOG1785|consen 467 AHDASPSIPS 476 (563)
T ss_pred hhhccCCCCc
Confidence 4555554443
No 71
>PF11884 DUF3404: Domain of unknown function (DUF3404); InterPro: IPR021821 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 260 amino acids in length. This domain is found associated with PF02518 from PFAM, PF00512 from PFAM.
Probab=21.26 E-value=63 Score=33.87 Aligned_cols=40 Identities=18% Similarity=0.143 Sum_probs=21.8
Q ss_pred CCceeechhHHH--HHHHHHHHHHHHHHHHHHhhheeecccc
Q psy8304 523 SGMICMTTIGFS--ATLFVLLGILVVSCLVSACLCIRLRPFS 562 (593)
Q Consensus 523 ~~~iC~S~~~f~--~~~v~ll~~llvs~~va~~L~~r~r~~~ 562 (593)
.++||++...=. +.++++.++++..+++..|..+|++...
T Consensus 217 ~gNlCw~~~~~s~l~~~~~i~L~~~~i~l~~gw~~y~~~~kr 258 (262)
T PF11884_consen 217 YGNLCWSEEDHSHLLRISMIALVLANILLVLGWSLYRWNQKR 258 (262)
T ss_pred ecceeeccCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 468999875443 2223333333344556667777766543
No 72
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=20.80 E-value=98 Score=34.33 Aligned_cols=26 Identities=12% Similarity=0.180 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHhhheeecc
Q psy8304 535 ATLFVLLGILVVSCLVSACLCIRLRP 560 (593)
Q Consensus 535 ~~~v~ll~~llvs~~va~~L~~r~r~ 560 (593)
+..|++-+++|+-+.++.|.|+|+||
T Consensus 389 ~~avl~p~~il~~~~~~~~~~v~rrr 414 (436)
T PTZ00208 389 ILAVLVPAIILAIIAVAFFIMVKRRR 414 (436)
T ss_pred HHHHHHHHHHHHHHHHHhheeeeecc
Confidence 33343334554434444455566655
No 73
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=20.37 E-value=74 Score=28.87 Aligned_cols=32 Identities=25% Similarity=0.410 Sum_probs=22.8
Q ss_pred HHHHHHHHHHHHHHHHHhhheeecccccCCCC
Q psy8304 536 TLFVLLGILVVSCLVSACLCIRLRPFSNKTSQ 567 (593)
Q Consensus 536 ~~v~ll~~llvs~~va~~L~~r~r~~~~~~~~ 567 (593)
+|.+++++||+..+++.++..|..++..+.+.
T Consensus 2 ~Ll~il~llLll~l~asl~~wr~~~rq~k~~~ 33 (107)
T PF15330_consen 2 LLLGILALLLLLSLAASLLAWRMKQRQKKAGQ 33 (107)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhhccccC
Confidence 46677788887777777777777777766544
No 74
>PF01690 PLRV_ORF5: Potato leaf roll virus readthrough protein; InterPro: IPR002929 This family consists mainly of the Potato leafroll virus (PLrV) read through protein otherwise known as the minor capsid protein. This is generated via a readthrough of open reading frame 3, the coat protein, allowing transcription of open reading frame 5 to give an extended coat protein with a large C-terminal addition or read through domain []. The read through protein is essential for the circulative aphid transmission of PLrV [] and Beet western yellows virus []. The N-terminal region of the luteovirus readthrough domain determines virus binding to Buchnera GroEL and is essential for virus persistence in the aphid [].; GO: 0019028 viral capsid
Probab=20.11 E-value=76 Score=35.79 Aligned_cols=10 Identities=20% Similarity=0.002 Sum_probs=5.1
Q ss_pred ecceEEEEee
Q psy8304 259 WDQARKLRCT 268 (593)
Q Consensus 259 ~D~~~~V~C~ 268 (593)
+...|||+|.
T Consensus 181 ~~~SFhl~~~ 190 (465)
T PF01690_consen 181 GTISFHLEAT 190 (465)
T ss_pred eeEEEEEEec
Confidence 4444555555
No 75
>PF10717 ODV-E18: Occlusion-derived virus envelope protein ODV-E18; InterPro: IPR019655 Baculovirus occlusion-derived virus (ODV) derives its envelope from an intranuclear membrane source. Occlusion-derived viral envelope proteins that are detected in viral-induced intranuclear microvesicles, but not detected in the plasma membrane, cytoplasmic membranes, or the nuclear envelope. This entry represents ODV-E18 protein which is encoded by baculovirus late genes with transcription initiating from a TAAG motif. ODV-E18 exists as a dimer in the ODV envelope, which contains a hydrophobic domain that putatively acts as a target or retention signal for intranuclear microvesicles []. ; GO: 0019031 viral envelope
Probab=20.09 E-value=1e+02 Score=26.81 Aligned_cols=12 Identities=25% Similarity=0.667 Sum_probs=8.3
Q ss_pred cccCCCcccccc
Q psy8304 572 VSYTNPIVSVKK 583 (593)
Q Consensus 572 ~~~~~~~~~~~~ 583 (593)
..|.||+=+.+.
T Consensus 61 ~~f~NPLNaTMR 72 (85)
T PF10717_consen 61 MGFTNPLNATMR 72 (85)
T ss_pred ccccccchHHHh
Confidence 458998876553
No 76
>KOG3397|consensus
Probab=20.07 E-value=91 Score=30.96 Aligned_cols=10 Identities=10% Similarity=-0.140 Sum_probs=4.6
Q ss_pred chhhhcccCc
Q psy8304 47 TDQVASETSS 56 (593)
Q Consensus 47 ~~~~~~~~~~ 56 (593)
||...-|..+
T Consensus 126 DQ~~FYe~lG 135 (225)
T KOG3397|consen 126 DQCRFYESLG 135 (225)
T ss_pred cchhhhhhhc
Confidence 4444444443
No 77
>PF01213 CAP_N: Adenylate cyclase associated (CAP) N terminal; InterPro: IPR013992 Cyclase-associated proteins (CAPs) are highly conserved actin-binding proteins present in a wide range of organisms including yeast, fly, plants, and mammals. CAPs are multifunctional proteins that contain several structural domains. CAP is involved in species-specific signalling pathways [, , , ]. In Drosophila, CAP functions in Hedgehog-mediated eye development and in establishing oocyte polarity. In Dictyostelium (slim mold), CAP is involved in microfilament reorganisation near the plasma membrane in a PIP2-regulated manner and is required to perpetuate the cAMP relay signal to organise fruitbody formation. In plants, CAP is involved in plant signalling pathways required for co-ordinated organ expansion. In yeast, CAP is involved in adenylate cyclase activation, as well as in vesicle trafficking and endocytosis. In both yeast and mammals, CAPs appear to be involved in recycling G-actin monomers from ADF/cofilins for subsequent rounds of filament assembly [, ]. In mammals, there are two different CAPs (CAP1 and CAP2) that share 64% amino acid identity. All CAPs appear to contain a C-terminal actin-binding domain that regulates actin remodelling in response to cellular signals and is required for normal cellular morphology, cell division, growth and locomotion in eukaryotes. CAP directly regulates actin filament dynamics and has been implicated in a number of complex developmental and morphological processes, including mRNA localisation and the establishment of cell polarity. Actin exists both as globular (G) (monomeric) actin subunits and assembled into filamentous (F) actin. In cells, actin cycles between these two forms. Proteins that bind F-actin often regulate F-actin assembly and its interaction with other proteins, while proteins that interact with G-actin often control the availability of unpolymerised actin. CAPs bind G-actin. In addition to actin-binding, CAPs can have additional roles, and may act as bifunctional proteins. In Saccharomyces cerevisiae (Baker's yeast), CAP is a component of the adenylyl cyclase complex (Cyr1p) that serves as an effector of Ras during normal cell signalling. S. cerevisiae CAP functions to expose adenylate cyclase binding sites to Ras, thereby enabling adenylate cyclase to be activated by Ras regulatory signals. In Schizosaccharomyces pombe (Fission yeast), CAP is also required for adenylate cyclase activity, but not through the Ras pathway. In both organisms, the N-terminal domain is responsible for adenylate cyclase activation, but the S cerevisiae and S. pombe N-termini cannot complement one another. Yeast CAPs are unique among the CAP family of proteins, because they are the only ones to directly interact with and activate adenylate cyclase []. S. cerevisiae CAP has four major domains. In addition to the N-terminal adenylate cyclase-interacting domain, and the C-terminal actin-binding domain, it possesses two other domains: a proline-rich domain that interacts with Src homology 3 (SH3) domains of specific proteins, and a domain that is responsible for CAP oligomerisation to form multimeric complexes (although oligomerisation appears to involve the N- and C-terminal domains as well). The proline-rich domain interacts with profilin, a protein that catalyses nucleotide exchange on G-actin monomers and promotes addition to barbed ends of filamentous F-actin []. Since CAP can bind profilin via a proline-rich domain, and G-actin via a C-terminal domain, it has been suggested that a ternary G-actin/CAP/profilin complex could be formed. This entry represents the N-terminal domain of CAP proteins. This domain has an all-alpha structure consisting of six helices in a bundle with a left-handed twist and an up-and-down topology [].; GO: 0003779 actin binding, 0007010 cytoskeleton organization; PDB: 1TJF_B 1S0P_A.
Probab=20.01 E-value=35 Score=36.60 Aligned_cols=18 Identities=44% Similarity=1.040 Sum_probs=0.0
Q ss_pred CCCCCCCCCCCCCCCCCC
Q psy8304 106 GVYHGPPPPPPPLSAAKP 123 (593)
Q Consensus 106 ~~~~~p~~~~~~~~~~~~ 123 (593)
.+...||||||||+|+.+
T Consensus 228 ~~~~~PpPPppPpPP~~~ 245 (312)
T PF01213_consen 228 SAPAAPPPPPPPPPPPAP 245 (312)
T ss_dssp ------------------
T ss_pred cccCCCCCCCCCCccccc
Done!