Query psy14031
Match_columns 280
No_of_seqs 226 out of 1426
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 17:51:22 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14031.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14031hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00216 VWD von Willebrand 100.0 8.9E-38 1.9E-42 254.0 20.4 154 2-158 6-162 (162)
2 PF00094 VWD: von Willebrand f 100.0 5.5E-31 1.2E-35 212.8 13.4 149 8-159 1-159 (159)
3 smart00832 C8 C8 domain. This 99.9 1E-23 2.2E-28 148.6 4.2 71 204-274 3-76 (76)
4 PF08742 C8: C8 domain; Inter 99.8 7.7E-21 1.7E-25 134.0 3.6 71 204-274 3-74 (74)
5 KOG4338|consensus 98.5 7.9E-08 1.7E-12 98.0 5.4 169 5-181 1353-1532(1680)
6 PF06535 RGM_N: Repulsive guid 97.6 0.00016 3.5E-09 57.3 5.6 63 3-68 97-160 (161)
7 KOG1216|consensus 95.2 0.033 7.1E-07 53.9 5.6 235 3-240 289-535 (553)
8 PF07481 DUF1521: Domain of Un 94.6 0.67 1.5E-05 36.9 10.2 63 7-75 39-103 (171)
9 PF06668 ITI_HC_C: Inter-alpha 72.2 55 0.0012 27.1 10.3 89 57-148 23-133 (188)
10 PF07833 Cu_amine_oxidN1: Copp 70.9 14 0.00031 25.9 5.5 56 61-122 27-88 (93)
11 PF05910 DUF868: Plant protein 37.1 2.5E+02 0.0054 24.8 8.2 35 78-121 176-215 (274)
12 PRK10626 hypothetical protein; 34.3 50 0.0011 28.4 3.4 30 61-90 37-66 (239)
13 PF06534 RGM_C: Repulsive guid 32.6 32 0.00069 28.2 1.9 28 227-257 138-165 (179)
14 PF11101 DUF2884: Protein of u 32.5 55 0.0012 27.9 3.5 30 61-90 14-44 (229)
15 PF07944 DUF1680: Putative gly 30.3 5E+02 0.011 25.0 11.3 44 76-119 453-499 (520)
16 smart00700 JHBP Juvenile hormo 26.9 3.7E+02 0.0081 22.4 9.7 31 78-108 103-133 (225)
17 PF13464 DUF4115: Domain of un 26.3 2.1E+02 0.0046 19.4 5.8 21 71-91 41-63 (77)
18 PF13349 DUF4097: Domain of un 26.1 3.1E+02 0.0068 21.2 8.3 63 60-132 56-130 (166)
19 PF08394 Arc_trans_TRASH: Arch 24.8 42 0.0009 19.9 1.0 17 10-26 8-24 (37)
20 COG3588 Fructose-1,6-bisphosph 23.9 55 0.0012 29.0 2.0 18 246-263 156-173 (332)
21 KOG4291|consensus 23.5 1.1E+02 0.0025 32.2 4.4 59 111-171 521-579 (1043)
22 PRK14111 F0F1 ATP synthase sub 21.1 26 0.00056 31.1 -0.6 10 140-149 89-98 (290)
No 1
>smart00216 VWD von Willebrand factor (vWF) type D domain. Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Probab=100.00 E-value=8.9e-38 Score=254.01 Aligned_cols=154 Identities=35% Similarity=0.732 Sum_probs=138.5
Q ss_pred CccceEEEEecCCceecCCCCeeeeccCeEEEEeeecCCC-ceEEEEEecCCCCCCeeeEEEEEEEECCEEEEEcc-ccE
Q psy14031 2 TLGDGVCTVFGDPHYRTFDGKFYSFQGSCKYQLTADCTGG-AFSIRVTNDARDTKTSSWTKTVSIRIGDMKVNLGE-KLR 79 (280)
Q Consensus 2 ~~~~~~C~~~G~~h~~TFDg~~y~f~g~C~YvL~~~~~~~-~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~i~l~~-~~~ 79 (280)
.+|+++|+++|++||+||||+.|+|+|.|+|+|+++|... +|+|++++..+... .++.++|+|.+++..|++.+ +..
T Consensus 6 ~~~~~~C~v~g~~~~~TFDg~~y~~~g~C~yvL~~~~~~~~~f~V~~~~~~~~~~-~~~~~~v~v~~~~~~i~~~~~~~~ 84 (162)
T smart00216 6 GECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSEPTFSVLLKNVPCGGG-ATCLKSVKVELNGDEIELKDDNGT 84 (162)
T ss_pred CCCCCEEEEcCCCCeECcCCCEEeecCceEEEEEEECCCCCCEEEEEEecCCCCC-ceEEEEEEEEECCEEEEEEeCCCE
Confidence 3688899999999999999999999999999999998765 99999999886543 67899999999999999987 789
Q ss_pred EEECCEEeeeCccCCC-eEEEEEecCEEEEEeCCcEEEEEeCCeeEEEEcCccccccCceeeecccCCCCCCCcCcCCCC
Q psy14031 80 VKIDGQRVALPYDLPS-KVIVTKTAESVLVETAIGIKVLWDGNSFLEVSAPAKFKILDRLCGLCGNFNGISSDDLTTKRG 158 (280)
Q Consensus 80 V~vng~~~~lp~~~~~-~~~i~~~~~~v~v~~~~gl~v~~~~~~~v~V~~~~~~~~~g~~~GLCGn~ng~~~dD~~~p~g 158 (280)
|+|||+.+.+|+...+ .+++++.+.++++.+..||+|.|||...+.|.+++.|+ |+||||||||||++.|||++|+|
T Consensus 85 v~vng~~v~~p~~~~~~~v~~~~~~~~~v~~~~~gl~v~~dg~~~~~V~~~~~~~--g~~~GLCGn~ng~~~dDf~~p~g 162 (162)
T smart00216 85 VTVNGQQVSLPYKTSDGSIQIRSSGGYVVVITSLGLQVTFDGLTLLSVQLPSRYR--GKTCGLCGNFDGEPEDDFRTPDG 162 (162)
T ss_pred EEECCEEeeCCcCcCCceEEEEECceEEEEECCCCEEEEECCCcEEEEEECHHHC--CCeeEccCCCCCCccccccCCCC
Confidence 9999999999997654 35566666777788899999999998779999999999 99999999999999999999987
No 2
>PF00094 VWD: von Willebrand factor type D domain; InterPro: IPR001846 A family of growth regulators (originally called cef10, connective tissue growth factor, fisp-12, cyr61, or, alternatively, beta IG-M1 and beta IG-M2), all belong to immediate-early genes expressed after induction by growth factors or certain oncogenes. Sequence analysis of this family revealed the presence of four distinct modules. Each module has homologues in other extracellular mosaic proteins such as Von Willebrand factor, slit, thrombospondins, fibrillar collagens, IGF-binding proteins and mucins. Classification and analysis of these modules suggests the location of binding regions and, by analogy to better characterised modules in other proteins, sheds some light onto the structure of this new family []. The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. One of the functions of von Willebrand factor (vWF) is to serve as a carrier of clotting factor VIII (FVIII). The native conformation of the D' domain of vWF is not only required for factor VIII (FVIII) binding but also for normal multimerisation and optimal secretion. The interaction between blood clotting factor VIII and VWF is necessary for normal survival of blood clotting factor VIII in blood circulation. The VWFD domain is a highly structured region, in which the first conserved Cys has been found to form a disulphide bridge with the second conserved one [].
Probab=99.97 E-value=5.5e-31 Score=212.80 Aligned_cols=149 Identities=40% Similarity=0.765 Sum_probs=123.4
Q ss_pred EEEecCCceecCCCCeeeeccCeEEE--EeeecCCC---ceEEEEEecCCCCCCeeeEEEEEEEECCEEEEEccc--cEE
Q psy14031 8 CTVFGDPHYRTFDGKFYSFQGSCKYQ--LTADCTGG---AFSIRVTNDARDTKTSSWTKTVSIRIGDMKVNLGEK--LRV 80 (280)
Q Consensus 8 C~~~G~~h~~TFDg~~y~f~g~C~Yv--L~~~~~~~---~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~i~l~~~--~~V 80 (280)
|+++|++||+||||+.|.|++.|.|+ |++++... .|.|..++........+|+++|.|.+++..|+|.++ ..|
T Consensus 1 C~v~g~~~~~TFDg~~y~~~~~c~y~~vl~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~v~i~~~~~~i~i~~~~~~~v 80 (159)
T PF00094_consen 1 CSVYGDPHITTFDGKSYSFPGNCTYILVLAQDCSSDPKFSFSVENKNCPCGQSGTSCTRKVTIRLGNHEIEIKPNNGDQV 80 (159)
T ss_pred CEEeCCCeEEcCCCCEEecCCCceEEEEeecccCccccccccceeeeeccCCCCcceeeEEEEEcccceeeEeccccceE
Confidence 89999999999999999999999999 99998764 345555555554455569999999999999999987 799
Q ss_pred EECCEEeeeCccCCCeEEEEEec-CEEEEEe--CCcEEEEEeCCeeEEEEcCccccccCceeeecccCCCCCCCcCcCCC
Q psy14031 81 KIDGQRVALPYDLPSKVIVTKTA-ESVLVET--AIGIKVLWDGNSFLEVSAPAKFKILDRLCGLCGNFNGISSDDLTTKR 157 (280)
Q Consensus 81 ~vng~~~~lp~~~~~~~~i~~~~-~~v~v~~--~~gl~v~~~~~~~v~V~~~~~~~~~g~~~GLCGn~ng~~~dD~~~p~ 157 (280)
+|||+.+.+|+...+ ..|+..+ ..++|.. ..+|++.|++...+.|.+++.|+ |+|+||||+|||++.|||++|+
T Consensus 81 ~vng~~~~~p~~~~~-~~i~~~~~~~~~v~~~~~~~v~~~~~~~~~~~v~~~~~~~--g~t~GLCG~~dg~~~dDf~~~~ 157 (159)
T PF00094_consen 81 TVNGQPVSLPYWSSG-GSIERYSGGFVVVVFSSGVRVQVNWDGNMSVYVSVPPWYK--GKTCGLCGNFDGEPEDDFTTPD 157 (159)
T ss_pred EEeeecccCccccCc-ceEEeeccccEEEEEecCCeEEEEEecccccccccccccc--cccccccCCCCCCccceecCCC
Confidence 999999999997766 3555543 4455444 45556678875689999999999 9999999999999999999999
Q ss_pred Cc
Q psy14031 158 GR 159 (280)
Q Consensus 158 g~ 159 (280)
|+
T Consensus 158 g~ 159 (159)
T PF00094_consen 158 GC 159 (159)
T ss_pred CC
Confidence 85
No 3
>smart00832 C8 C8 domain. This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Probab=99.88 E-value=1e-23 Score=148.65 Aligned_cols=71 Identities=35% Similarity=0.803 Sum_probs=66.4
Q ss_pred hhhccCccccCC--CCccccccccchhchhhccchhcCCCC-CcccCHHHHHHHHHHHHCCCCcCCCCCCCCCC
Q psy14031 204 YRDKKCKPLRSD--VFSACHARLNHLMYFKSCLVDMCECPL-KNCHCESFTAYARECSRLGVQLGDWRKLTGCH 274 (280)
Q Consensus 204 ~a~~~C~~l~~~--~F~~C~~~vdp~~f~~~C~~d~C~~~~-~~~~C~~l~aYa~~C~~~gv~i~~WR~~~~C~ 274 (280)
.+.+.|.+|++. +|++||.+|||++||++|++|+|++++ .+++|++|++||++|+++||.|.+||++++||
T Consensus 3 ~a~~~C~~l~~~~g~F~~Ch~~V~p~~f~~~Cv~D~C~~~~~~~~lC~al~aYa~aC~~~Gv~v~~WR~~~~Cp 76 (76)
T smart00832 3 YACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76 (76)
T ss_pred chhhhhHhhcCCCCChHHHhCcCChHHHHHHHHHHHhCCCCCCcccCHHHHHHHHHHHHCcCcCCCCCCCCCCC
Confidence 467899999766 999999999999999999999999874 88999999999999999999998899999997
No 4
>PF08742 C8: C8 domain; InterPro: IPR014853 The proteins in this entry contained a domain rich in positionally conserved cysteine residues. Most proteins contains 7 or 8 cysteine residues. The domain is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing IPR001846 from INTERPRO and IPR002919 from INTERPRO.
Probab=99.81 E-value=7.7e-21 Score=134.02 Aligned_cols=71 Identities=39% Similarity=0.940 Sum_probs=67.0
Q ss_pred hhhccCccccCCCCccccccccchhchhhccchhcCCCC-CcccCHHHHHHHHHHHHCCCCcCCCCCCCCCC
Q psy14031 204 YRDKKCKPLRSDVFSACHARLNHLMYFKSCLVDMCECPL-KNCHCESFTAYARECSRLGVQLGDWRKLTGCH 274 (280)
Q Consensus 204 ~a~~~C~~l~~~~F~~C~~~vdp~~f~~~C~~d~C~~~~-~~~~C~~l~aYa~~C~~~gv~i~~WR~~~~C~ 274 (280)
++...|.+|.++.|++||.+|||++|+++|++|+|.+.+ .+++|++|++||++|+.+|+.+.+||+.++||
T Consensus 3 ~a~~~C~~l~~~~F~~C~~~v~~~~f~~~C~~d~C~~~~~~~~~C~~l~~Ya~~C~~~g~~~~~WR~~~~C~ 74 (74)
T PF08742_consen 3 RAESYCGILLDPEFAPCHSVVDPDPFYEACVYDMCACPGSQQCLCEALSAYARECQRAGICVGDWRTPTLCP 74 (74)
T ss_pred hHhHHhHHHcCchhhhhcccCccHHHHHHHHHHHcCCCCCcchhhHHHHHHHHHHHHCcCCCCCCCCcCCCc
Confidence 578899999877799999999999999999999999985 68999999999999999999999999999997
No 5
>KOG4338|consensus
Probab=98.55 E-value=7.9e-08 Score=98.04 Aligned_cols=169 Identities=23% Similarity=0.425 Sum_probs=127.8
Q ss_pred ceEEEEecCCceecCCCCeeeec-cCeEEEEeeecCC----CceEEEEEecCCCCCCeeeEEEEEEEECCEEEEEcc---
Q psy14031 5 DGVCTVFGDPHYRTFDGKFYSFQ-GSCKYQLTADCTG----GAFSIRVTNDARDTKTSSWTKTVSIRIGDMKVNLGE--- 76 (280)
Q Consensus 5 ~~~C~~~G~~h~~TFDg~~y~f~-g~C~YvL~~~~~~----~~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~i~l~~--- 76 (280)
.+.|.+ |-+-++|||-..|.=+ -.|.|+.+++|.. ..|.|..+....... ...|++..+..+|.+..
T Consensus 1353 ~a~~~v-~ls~~~~Fd~~~y~n~~~~~~sl~a~~~~~ek~~~~~~vl~~~~~~~~~----~~~vk~vy~e~ei~~~~~~~ 1427 (1680)
T KOG4338|consen 1353 TALCKV-GLSRINTFDIVAYNNPLTPCYSLTAKDCSEEKATPRFAVLIKKINKNSE----EVAVKVVYGEREIVVKKLHE 1427 (1680)
T ss_pred EEEEEe-ccccccccchhhhcCcccceEEEEeecccccccCceEEEEecCCCCCch----hhhheeeeeeeeeEEeeccc
Confidence 467888 6789999999999855 8999999999986 378888777653322 34566677777776653
Q ss_pred ccEEEECCEEeeeCccCC---CeEEEEEecCEEEEEeCCcEEEEEeCCeeEEEEcCccccccCceeeecccCCCCCCCcC
Q psy14031 77 KLRVKIDGQRVALPYDLP---SKVIVTKTAESVLVETAIGIKVLWDGNSFLEVSAPAKFKILDRLCGLCGNFNGISSDDL 153 (280)
Q Consensus 77 ~~~V~vng~~~~lp~~~~---~~~~i~~~~~~v~v~~~~gl~v~~~~~~~v~V~~~~~~~~~g~~~GLCGn~ng~~~dD~ 153 (280)
...++|+|..+....... -...|...+...++....|.+|.|||. .+.+.++.-++ .|+.||||||++++..++|
T Consensus 1428 ~~~~~~~~~k~~~t~~~~~~~~q~~i~~~~es~~~~~l~~gtv~~~g~-~~~~~~~~~~~-~~q~~~~~~n~~~~~~~~~ 1505 (1680)
T KOG4338|consen 1428 KKMVKVDGKKIITTLNIIYELKQYNIEYLGESLIVFDLPGGTVQFDGY-NARITLPSVRR-KNQLCGLCGNNDDESETEF 1505 (1680)
T ss_pred ceEEeecchhcccchHHHHHHHHHHHHhhccceeEeecCCceEEeccc-ccccccHHHhh-ccchhhhhccCCccccchh
Confidence 467899999886543211 114555556666666777999999994 57788776543 4999999999999999999
Q ss_pred cCCCCceecChhhhhcceeeCCCCCCCC
Q psy14031 154 TTKRGRVVTDANKFGASWRVGGRKACSR 181 (280)
Q Consensus 154 ~~p~g~~~~~~~~F~~sW~~~~~~~C~~ 181 (280)
.+...+-+.+..+|..|+-+.+. .|..
T Consensus 1506 ~t~~~~~~~~i~e~h~s~~l~n~-~~~~ 1532 (1680)
T KOG4338|consen 1506 YTALNIETEDIEEFHRSYLLKNE-ECEK 1532 (1680)
T ss_pred hhcccchhhhHHHHHHHHcccch-hhhh
Confidence 99999888899999999988874 5643
No 6
>PF06535 RGM_N: Repulsive guidance molecule (RGM) N-terminus; InterPro: IPR010536 This entry represents the N-terminal region of several mammalian and one bird sequence from Gallus gallus (Chicken). All of the mammalian proteins are hypothetical and have no known function but Q8JG54 from SWISSPROT from the chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum [].
Probab=97.57 E-value=0.00016 Score=57.33 Aligned_cols=63 Identities=27% Similarity=0.454 Sum_probs=46.6
Q ss_pred ccceEEEEecCCceecCCCCeeeeccCeEEEEeeecCCCceEEEEEecC-CCCCCeeeEEEEEEEEC
Q psy14031 3 LGDGVCTVFGDPHYRTFDGKFYSFQGSCKYQLTADCTGGAFSIRVTNDA-RDTKTSSWTKTVSIRIG 68 (280)
Q Consensus 3 ~~~~~C~~~G~~h~~TFDg~~y~f~g~C~YvL~~~~~~~~f~V~~~~~~-~~~~~~~~~~~v~v~~~ 68 (280)
+....|.++||||++||++..-.=.-.+.+=|+. +.-+.|++.|.+ ......+.+..|+|.+.
T Consensus 97 ~~y~hCglFGDPHLRTF~~~fqTC~v~GAWPLId---N~yL~VQvTn~pv~~gs~aTattKvTVI~K 160 (161)
T PF06535_consen 97 PNYRHCGLFGDPHLRTFNDEFQTCKVEGAWPLID---NDYLSVQVTNSPVVPGSSATATTKVTVIFK 160 (161)
T ss_pred cccceecccCChhHhhccCCceeeeeecceeeec---CCcEEEEeeCccccCCccceeeeeEEEEEc
Confidence 3467899999999999999877655445555554 357899999988 34445666788887764
No 7
>KOG1216|consensus
Probab=95.22 E-value=0.033 Score=53.94 Aligned_cols=235 Identities=17% Similarity=0.209 Sum_probs=110.8
Q ss_pred ccceEEEEecCCceecCCCCeeeeccCeEEEEeeecCCC--ceEEEEEecCCCCCCeeeEEEEEEEECCEEEEEccccEE
Q psy14031 3 LGDGVCTVFGDPHYRTFDGKFYSFQGSCKYQLTADCTGG--AFSIRVTNDARDTKTSSWTKTVSIRIGDMKVNLGEKLRV 80 (280)
Q Consensus 3 ~~~~~C~~~G~~h~~TFDg~~y~f~g~C~YvL~~~~~~~--~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~i~l~~~~~V 80 (280)
....+|......++.||||..+ +...|..+|.+.+... .+.....+.............+........+.+.....+
T Consensus 289 ~~~~~~~~~~~~~~~t~~g~~~-~~~~c~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (553)
T KOG1216|consen 289 EKMCVCTLPNGLPLLTTDGERL-RCSGCSCFLDDVCKKGDIPPQLVRVNETPVGRNCPVKFPDALTTFESAVLISPETTV 367 (553)
T ss_pred CcceEEecCCCCeeEeccCCCc-ccceeeeecccccccccCCccccccCccccCCCCcccCccccccchhhhccCCccce
Confidence 4457899999999999999999 4466777776654332 222221121111111111122222222222222333333
Q ss_pred EECCEEeeeCccCCCeEEEEEecCEEE-EEeCCcEEEEEeCCeeEEEE-cCccccccCceeeecccCCCCCCCcCcCCCC
Q psy14031 81 KIDGQRVALPYDLPSKVIVTKTAESVL-VETAIGIKVLWDGNSFLEVS-APAKFKILDRLCGLCGNFNGISSDDLTTKRG 158 (280)
Q Consensus 81 ~vng~~~~lp~~~~~~~~i~~~~~~v~-v~~~~gl~v~~~~~~~v~V~-~~~~~~~~g~~~GLCGn~ng~~~dD~~~p~g 158 (280)
.-+|.....+.-..........+.... .....++...|.....+.+. ...... .+.+|+++++|++..|++.+.++
T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~d~ 445 (553)
T KOG1216|consen 368 ERGGTSCTNSACKPAFCSSLSAGNCPGAPVVQVVLECSWPTDCVVELSRAAEESE--IRLCGLCADGNGNTPDGFPLGDG 445 (553)
T ss_pred eecCcccccceecCcceeeeccccCCcCCccceeccCCCCchheeccCcchhhcc--ccccceeccCCcCCCCCeeccCC
Confidence 333433332222222111122221111 12223444455554444443 456667 88999999999999888888887
Q ss_pred ceecChhhhh---cceeeCCCCCCCCCCCCC---CCCCCCccCCCCccccchhhccCcc--ccCCCCccccccccchhch
Q psy14031 159 RVVTDANKFG---ASWRVGGRKACSRPNTEN---SVGLGGVRCNHRLPQRKYRDKKCKP--LRSDVFSACHARLNHLMYF 230 (280)
Q Consensus 159 ~~~~~~~~F~---~sW~~~~~~~C~~~~~~~---~~~~~~~~C~~~~~~~~~a~~~C~~--l~~~~F~~C~~~vdp~~f~ 230 (280)
........+. -+|.......+..+.... .-....+.|.............|.. .....+..||...++..+|
T Consensus 446 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~ 525 (553)
T KOG1216|consen 446 NPLRQRVRLTNGLPSEDSSLLLINLVPDPSCVEPQISNDAFACQGTERPALYGQFVCQCSSSNENFPQTCPLTCPLLAVY 525 (553)
T ss_pred cceeeeEEecCCCCCcCcchhhhccCCCCCcceEecchhhhcCccccCcccccceeEeechhccCcccccceecCCcccc
Confidence 6544332211 122211111110000000 0000011233211111122222222 2222568899999999999
Q ss_pred hhccchhcCC
Q psy14031 231 KSCLVDMCEC 240 (280)
Q Consensus 231 ~~C~~d~C~~ 240 (280)
..|..++|.+
T Consensus 526 ~~~~~~~~~~ 535 (553)
T KOG1216|consen 526 RSTYTDSCES 535 (553)
T ss_pred eeeecccccc
Confidence 9999998876
No 8
>PF07481 DUF1521: Domain of Unknown Function (DUF1521); InterPro: IPR011086 This domain of unknown function is found in a limited set of Bradyrhizobium proteins. There appears to be a periodic -DG- motif in the domain.
Probab=94.57 E-value=0.67 Score=36.86 Aligned_cols=63 Identities=25% Similarity=0.460 Sum_probs=44.7
Q ss_pred EEEEecCCce-ecCCCCe-eeeccCeEEEEeeecCCCceEEEEEecCCCCCCeeeEEEEEEEECCEEEEEc
Q psy14031 7 VCTVFGDPHY-RTFDGKF-YSFQGSCKYQLTADCTGGAFSIRVTNDARDTKTSSWTKTVSIRIGDMKVNLG 75 (280)
Q Consensus 7 ~C~~~G~~h~-~TFDg~~-y~f~g~C~YvL~~~~~~~~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~i~l~ 75 (280)
+=++|||||| ..=||+. |+|.+.=.+.|-. --.|.+...+.++ .++....++|..++..|++.
T Consensus 39 ~TriwGDPHvd~~Gdg~~~fDFk~~~tf~L~D-----GTKItV~T~p~gn-g~T~askLtIt~Gd~~~~V~ 103 (171)
T PF07481_consen 39 TTRIWGDPHVDADGDGKTDFDFKGDMTFQLDD-----GTKITVDTVPWGN-GMTYASKLTITNGDNAWQVE 103 (171)
T ss_pred EeEEeCCCccccCCCcccceeecCceEEEeCC-----CCEEEEeeeecCC-CcEEeeeEEEEcCCceEEEe
Confidence 3489999999 5556555 9999988888863 2366666666443 67777888888877655443
No 9
>PF06668 ITI_HC_C: Inter-alpha-trypsin inhibitor heavy chain C-terminus; InterPro: IPR010600 This entry represents the C-terminal region of inter-alpha-trypsin inhibitor heavy chains. Inter-alpha-trypsin inhibitors are glycoproteins with a high inhibitory activity against trypsin, built up from different combinations of four polypeptides: bikunin and the three heavy chains that belong to this family (HC1, HC2, HC3). The heavy chains do not have any protease inhibitory properties but have the capacity to interact in vitro and in vivo with hyaluronic acid, which promotes the stability of the extra-cellular matrix. This domain is associated with the VWA domain IPR002035 from INTERPRO.; GO: 0004867 serine-type endopeptidase inhibitor activity, 0030212 hyaluronan metabolic process
Probab=72.21 E-value=55 Score=27.08 Aligned_cols=89 Identities=18% Similarity=0.357 Sum_probs=49.5
Q ss_pred eeeEEEEEEEECC--EEEEEccccEEEE-C-CEEeeeCccCC-----CeEEEEEe-cCEEEEEeCCcEEE--E----EeC
Q psy14031 57 SSWTKTVSIRIGD--MKVNLGEKLRVKI-D-GQRVALPYDLP-----SKVIVTKT-AESVLVETAIGIKV--L----WDG 120 (280)
Q Consensus 57 ~~~~~~v~v~~~~--~~i~l~~~~~V~v-n-g~~~~lp~~~~-----~~~~i~~~-~~~v~v~~~~gl~v--~----~~~ 120 (280)
-+....|.|.+.. ..|++.+.. |.| + +....+++... ..+.|... ...+.|....++++ . |..
T Consensus 23 ~TYF~~i~I~~~~~~~~I~vt~~~-I~l~~g~~~~~l~w~~t~~~~~~~l~v~v~k~~~l~v~~~~~v~F~Il~Hr~~~~ 101 (188)
T PF06668_consen 23 RTYFGRIGITFQKPDVKIEVTPEK-ITLTDGRRRSVLSWSDTASVKQPGLEVSVNKNKNLTVTLGDGVTFVILLHRVWKK 101 (188)
T ss_pred CceEEEEEEEECCCCeEEEEEccE-EEEecCCCceeEeEcceEEEccCcEEEEEECCceEEEEeCCceEEEEEEEeecCC
Confidence 4456777777754 456777653 333 4 44555666321 12666544 33456665555443 2 221
Q ss_pred ----CeeEEEEc--CccccccCceeeecccCCCC
Q psy14031 121 ----NSFLEVSA--PAKFKILDRLCGLCGNFNGI 148 (280)
Q Consensus 121 ----~~~v~V~~--~~~~~~~g~~~GLCGn~ng~ 148 (280)
...+-+.+ +..|. .++.||.|.|-.+
T Consensus 102 ~~~~~d~LGfYi~ds~~lS--~~vhGLLGQF~~~ 133 (188)
T PF06668_consen 102 HPYQRDFLGFYILDSHGLS--PSVHGLLGQFYHE 133 (188)
T ss_pred CCCCCCeeEEEecCCCCCC--CcccccccCccCC
Confidence 12344443 45677 8999999988654
No 10
>PF07833 Cu_amine_oxidN1: Copper amine oxidase N-terminal domain; InterPro: IPR012854 Amine oxidases (AO) are enzymes that catalyse the oxidation of a wide range of biogenic amines including many neurotransmitters, histamine and xenobiotic amines. There are two classes of amine oxidases: flavin-containing (1.4.3.4 from EC) and copper-containing (1.4.3.6 from EC). Copper-containing AO act as a disulphide-linked homodimer. They catalyse the oxidation of primary amines to aldehydes, with the subsequent release of ammonia and hydrogen peroxide, which requires one copper ion per subunit and topaquinone as cofactor []: RCH2NH2 + H2O + O2 = RCHO + NH3 + H2O2 Copper-containing amine oxidases are found in bacteria, fungi, plants and animals. In prokaryotes, the enzyme enables various amine substrates to be used as sources of carbon and nitrogen [, ]. In eukaryotes they have a broader range of functions, including cell differentiation and growth, wound healing, detoxification and cell signalling []. The copper amine oxidases occur as mushroom-shaped homodimers of 70-95 kDa, each monomer containing a copper ion and a covalently bound redox cofactor, topaquinone (TPQ). TPQ is formed by post-translational modification of a conserved tyrosine residue. The copper ion is coordinated with three histidine residues and two water molecules in a distorted square pyramidal geometry, and has a dual function in catalysis and TPQ biogenesis. The catalytic domain is the largest of the 3-4 domains found in copper amine oxidases, and consists of a beta sandwich of 18 strands in two sheets. The active site is buried and requires a conformational change to allow the substrate access. The two N-terminal domains share a common structural fold, its core consisting of a five-stranded antiparallel beta sheet twisted around an alpha helix. The D1 domains from the two subunits comprise the stalk, of the mushroom-shaped dimer, and interact with each other but do not pack tightly against each other [, ]. This entry represents a domain found at the N-terminal of certain copper amine oxidases, as well as in related proteins such as cell wall hydrolase and N-acetylmuramoyl-L-alanine amidase. This domain consists of a five-stranded antiparallel beta-sheet twisted around an alpha helix [, ]. ; PDB: 1SPU_A 2WGQ_A 2WO0_A 1JRQ_A 1QAF_B 1D6U_B 1QAL_A 2WOF_A 1OAC_B 1QAK_A ....
Probab=70.88 E-value=14 Score=25.92 Aligned_cols=56 Identities=27% Similarity=0.453 Sum_probs=28.6
Q ss_pred EEEEEEECCEEEEEcc-ccEEEECCEEeeeCccCCCeEEEEEecCEEEEEe-----CCcEEEEEeCCe
Q psy14031 61 KTVSIRIGDMKVNLGE-KLRVKIDGQRVALPYDLPSKVIVTKTAESVLVET-----AIGIKVLWDGNS 122 (280)
Q Consensus 61 ~~v~v~~~~~~i~l~~-~~~V~vng~~~~lp~~~~~~~~i~~~~~~v~v~~-----~~gl~v~~~~~~ 122 (280)
+.|.+..++..+.+.. ...+.+||+.+.++... . ...+.+.|-. .+|+.|.||+..
T Consensus 27 ~~v~i~~~~~~i~~~~~~~~~~~ng~~~~l~~~~----~--~~~g~~yvp~~~l~~~~g~~v~~d~~~ 88 (93)
T PF07833_consen 27 KTVTITKGKKSIKLKVGSNEATVNGKKITLSAPP----V--IINGRTYVPLRFLSEAFGYKVSWDNET 88 (93)
T ss_dssp TEEEEEETTEEEEE-TT-SEEEETTEEEE-SS-B----E--EETTEEEEETTHHHHHHT---------
T ss_pred cEEEEEeeeceeeecCCCceeeeeccccccccCC----E--EeCCEEEEEHHHHHHHhCCcccccccc
Confidence 4678888888888875 46889999999887321 1 1223444432 369999999853
No 11
>PF05910 DUF868: Plant protein of unknown function (DUF868); InterPro: IPR008586 This family consists of several hypothetical proteins from plants. The function of this family is unknown.
Probab=37.11 E-value=2.5e+02 Score=24.76 Aligned_cols=35 Identities=29% Similarity=0.634 Sum_probs=19.3
Q ss_pred cEEEECCEEeeeCccCCCeEEEEE-----ecCEEEEEeCCcEEEEEeCC
Q psy14031 78 LRVKIDGQRVALPYDLPSKVIVTK-----TAESVLVETAIGIKVLWDGN 121 (280)
Q Consensus 78 ~~V~vng~~~~lp~~~~~~~~i~~-----~~~~v~v~~~~gl~v~~~~~ 121 (280)
..|.|||+.+ ++|.+ .|+..++--+..++|.||-.
T Consensus 176 l~V~VDgk~v---------~~VkrL~WkFRGNqti~vdg~~V~V~WDVH 215 (274)
T PF05910_consen 176 LWVSVDGKKV---------VQVKRLRWKFRGNQTIFVDGLPVQVFWDVH 215 (274)
T ss_pred EEEEECCEEE---------EEEEEeeecccCceEEEECCeEEEEEEEhh
Confidence 4667777665 22222 35554444555677788743
No 12
>PRK10626 hypothetical protein; Provisional
Probab=34.33 E-value=50 Score=28.42 Aligned_cols=30 Identities=17% Similarity=0.208 Sum_probs=22.8
Q ss_pred EEEEEEECCEEEEEccccEEEECCEEeeeC
Q psy14031 61 KTVSIRIGDMKVNLGEKLRVKIDGQRVALP 90 (280)
Q Consensus 61 ~~v~v~~~~~~i~l~~~~~V~vng~~~~lp 90 (280)
..|.|.-.+..+.|.+++.|++||+.+.|.
T Consensus 37 q~v~V~~~sg~l~I~~dg~L~inGk~v~L~ 66 (239)
T PRK10626 37 QTVQVVGASGNLVISPDGNVMRNGKQLSLN 66 (239)
T ss_pred CeEEEEecCCceEEcCCCCEEECCEEecCC
Confidence 345555545558889999999999999864
No 13
>PF06534 RGM_C: Repulsive guidance molecule (RGM) C-terminus; InterPro: IPR009496 This entry contains of several mammalian and one bird sequence from Gallus gallus (Chicken) and represents the C-terminal region of several sequences, but in others it represents the full protein. All of the mammalian proteins are hypothetical and have no known function, but Q8JG54 from SWISSPROT from the chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum [].
Probab=32.64 E-value=32 Score=28.21 Aligned_cols=28 Identities=29% Similarity=0.400 Sum_probs=19.6
Q ss_pred hhchhhccchhcCCCCCcccCHHHHHHHHHH
Q psy14031 227 LMYFKSCLVDMCECPLKNCHCESFTAYAREC 257 (280)
Q Consensus 227 ~~f~~~C~~d~C~~~~~~~~C~~l~aYa~~C 257 (280)
..||++|++|+-.+++.. -+++||....
T Consensus 138 D~YFqSCVFDLlTTGD~n---ft~aA~~Ale 165 (179)
T PF06534_consen 138 DVYFQSCVFDLLTTGDAN---FTAAAYSALE 165 (179)
T ss_pred chhhhhceeeeeecCCcc---HHHHHHHHHH
Confidence 479999999999987522 2356665544
No 14
>PF11101 DUF2884: Protein of unknown function (DUF2884); InterPro: IPR021307 Some members in this bacterial family of proteins are annotated as YggN which currently has no known function.
Probab=32.53 E-value=55 Score=27.92 Aligned_cols=30 Identities=17% Similarity=0.339 Sum_probs=23.6
Q ss_pred EEEEEEECC-EEEEEccccEEEECCEEeeeC
Q psy14031 61 KTVSIRIGD-MKVNLGEKLRVKIDGQRVALP 90 (280)
Q Consensus 61 ~~v~v~~~~-~~i~l~~~~~V~vng~~~~lp 90 (280)
..|.|.-.+ ..+.|.+++.++|||+.++|-
T Consensus 14 ~~l~v~~~~~~~~~I~~~g~L~i~G~~v~L~ 44 (229)
T PF11101_consen 14 QSLEVVQASGEKLRIDPDGNLFINGKKVSLN 44 (229)
T ss_pred CEEEEEeCCCceEEEcCCCcEEECCEEccCC
Confidence 456666654 488898999999999999865
No 15
>PF07944 DUF1680: Putative glycosyl hydrolase of unknown function (DUF1680); InterPro: IPR012878 The members of this family are sequences derived from hypothetical bacterial and eukaryotic proteins of unknown function. One member of this family is annotated as a possible arabinosidase, but no references were found to back this.
Probab=30.34 E-value=5e+02 Score=24.99 Aligned_cols=44 Identities=7% Similarity=0.140 Sum_probs=28.5
Q ss_pred cccEEEECCEEeeeCccCCCeEEEEE---ecCEEEEEeCCcEEEEEe
Q psy14031 76 EKLRVKIDGQRVALPYDLPSKVIVTK---TAESVLVETAIGIKVLWD 119 (280)
Q Consensus 76 ~~~~V~vng~~~~lp~~~~~~~~i~~---~~~~v~v~~~~gl~v~~~ 119 (280)
.+.+|+|||+.+......++-+.|.+ .|+.|.|.....+++...
T Consensus 453 ~~~~i~vNG~~~~~~~~~~gy~~i~r~W~~gD~v~l~lpm~~r~~~~ 499 (520)
T PF07944_consen 453 KGATIRVNGEPVVDTAVPGGYLTIEREWKDGDVVELRLPMEVRLEPA 499 (520)
T ss_pred CCcEEEECCEeCCCCcCCCCeEEEEeeccCCcEEEEEecCeeEEEeC
Confidence 35689999999533333344367765 467777777777766543
No 16
>smart00700 JHBP Juvenile hormone binding protein domains in insects. The juvenile hormone exerts pleiotropic functions during insect life cycles and its binding proteins regulate these functions.
Probab=26.86 E-value=3.7e+02 Score=22.38 Aligned_cols=31 Identities=19% Similarity=0.174 Sum_probs=21.1
Q ss_pred cEEEECCEEeeeCccCCCeEEEEEecCEEEE
Q psy14031 78 LRVKIDGQRVALPYDLPSKVIVTKTAESVLV 108 (280)
Q Consensus 78 ~~V~vng~~~~lp~~~~~~~~i~~~~~~v~v 108 (280)
++..++|+-..+|....|...+...+-.+.+
T Consensus 103 g~Y~~~g~ll~lpi~g~G~~~~~l~n~~~~~ 133 (225)
T smart00700 103 GDYKLDGRLLGLPLNGKGDANFTLENVKIRG 133 (225)
T ss_pred eeeEeeeEEEEEEecCCCcEEEEEEeeEEEE
Confidence 5677788888889887776666655544333
No 17
>PF13464 DUF4115: Domain of unknown function (DUF4115)
Probab=26.34 E-value=2.1e+02 Score=19.40 Aligned_cols=21 Identities=24% Similarity=0.556 Sum_probs=14.1
Q ss_pred EEEEcc--ccEEEECCEEeeeCc
Q psy14031 71 KVNLGE--KLRVKIDGQRVALPY 91 (280)
Q Consensus 71 ~i~l~~--~~~V~vng~~~~lp~ 91 (280)
.|.+.. ..+|++||+.+.++-
T Consensus 41 ~i~iGna~~v~v~~nG~~~~~~~ 63 (77)
T PF13464_consen 41 RIRIGNAGAVEVTVNGKPVDLLG 63 (77)
T ss_pred EEEEeCCCcEEEEECCEECCCCC
Confidence 344442 357889999998743
No 18
>PF13349 DUF4097: Domain of unknown function (DUF4097)
Probab=26.08 E-value=3.1e+02 Score=21.22 Aligned_cols=63 Identities=16% Similarity=0.333 Sum_probs=34.9
Q ss_pred EEEEEEEECCEEEEEccc----cEEEECCEEeeeCccCCCeEEEEEecCEEEEEeC-------CcEEEEE-eCCeeEEEE
Q psy14031 60 TKTVSIRIGDMKVNLGEK----LRVKIDGQRVALPYDLPSKVIVTKTAESVLVETA-------IGIKVLW-DGNSFLEVS 127 (280)
Q Consensus 60 ~~~v~v~~~~~~i~l~~~----~~V~vng~~~~lp~~~~~~~~i~~~~~~v~v~~~-------~gl~v~~-~~~~~v~V~ 127 (280)
.+.+.|......|++.+. .+|.++++ .+.+...+..+.|... .++.+.- +....+.|+
T Consensus 56 i~~i~i~~~~~~V~I~~~~~~~i~v~~~~k----------~~~~~~~~~~L~I~~~~~~~~~~~~~~~~~~~~~~~i~I~ 125 (166)
T PF13349_consen 56 IKKIDIDTDNGDVEIKPSDDDKIKVEYNGK----------KPEISVEGGTLTIKSKDRESFFFKGFNFNNSDNKSKITIY 125 (166)
T ss_pred CCEEEEEcCceeEEEEEcCCccEEEEEcCc----------EEEEEEcCCEEEEEEecccccccceEEEcccCCCcEEEEE
Confidence 366777777766666542 23333333 2555666666655543 1221211 345679999
Q ss_pred cCccc
Q psy14031 128 APAKF 132 (280)
Q Consensus 128 ~~~~~ 132 (280)
+|..+
T Consensus 126 lP~~~ 130 (166)
T PF13349_consen 126 LPKDY 130 (166)
T ss_pred ECCCC
Confidence 99877
No 19
>PF08394 Arc_trans_TRASH: Archaeal TRASH domain; InterPro: IPR013603 This region is found in the C terminus of a number of archaeal transcriptional regulators. It is thought to function as a metal-sensing regulatory module [].
Probab=24.79 E-value=42 Score=19.92 Aligned_cols=17 Identities=29% Similarity=0.595 Sum_probs=14.9
Q ss_pred EecCCceecCCCCeeee
Q psy14031 10 VFGDPHYRTFDGKFYSF 26 (280)
Q Consensus 10 ~~G~~h~~TFDg~~y~f 26 (280)
+.|.|+...++++.|.|
T Consensus 8 I~~eP~~~k~~~~~y~f 24 (37)
T PF08394_consen 8 ITGEPIVVKIGNKVYYF 24 (37)
T ss_pred ccCCEEEEEECCeEEEE
Confidence 56889999999999987
No 20
>COG3588 Fructose-1,6-bisphosphate aldolase [Carbohydrate transport and metabolism]
Probab=23.88 E-value=55 Score=28.97 Aligned_cols=18 Identities=22% Similarity=0.519 Sum_probs=15.6
Q ss_pred cCHHHHHHHHHHHHCCCC
Q psy14031 246 HCESFTAYARECSRLGVQ 263 (280)
Q Consensus 246 ~C~~l~aYa~~C~~~gv~ 263 (280)
.|++++.||..|+++|+.
T Consensus 156 nv~~la~yAa~cq~aGlV 173 (332)
T COG3588 156 NVHQLAEYAALCQAAGLV 173 (332)
T ss_pred HHHHHHHHHHHHHHCCCc
Confidence 467899999999999963
No 21
>KOG4291|consensus
Probab=23.50 E-value=1.1e+02 Score=32.18 Aligned_cols=59 Identities=8% Similarity=-0.128 Sum_probs=43.4
Q ss_pred CCcEEEEEeCCeeEEEEcCccccccCceeeecccCCCCCCCcCcCCCCceecChhhhhcce
Q psy14031 111 AIGIKVLWDGNSFLEVSAPAKFKILDRLCGLCGNFNGISSDDLTTKRGRVVTDANKFGASW 171 (280)
Q Consensus 111 ~~gl~v~~~~~~~v~V~~~~~~~~~g~~~GLCGn~ng~~~dD~~~p~g~~~~~~~~F~~sW 171 (280)
........++.....+.++..+- +.+++++++.++.+.+||..+.+........+-.-|
T Consensus 521 ~~~~~~~~~~f~~~~~~l~~s~~--~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 579 (1043)
T KOG4291|consen 521 GYYEDKVRKKFREEIDTLAETFT--YSISNNGSYRWNPQEEDIQLNQVRQWYPGDKFYRFG 579 (1043)
T ss_pred cceeeccccCCccceeEeccccc--cccccccccccCChhhhhhccccccccccccceeeE
Confidence 34455556665556677888888 999999999999999999999988655433333333
No 22
>PRK14111 F0F1 ATP synthase subunit gamma; Provisional
Probab=21.06 E-value=26 Score=31.07 Aligned_cols=10 Identities=40% Similarity=0.703 Sum_probs=8.9
Q ss_pred eecccCCCCC
Q psy14031 140 GLCGNFNGIS 149 (280)
Q Consensus 140 GLCGn~ng~~ 149 (280)
||||.||.+-
T Consensus 89 GLCG~fN~~l 98 (290)
T PRK14111 89 GLAGSLNANV 98 (290)
T ss_pred CccccccHHH
Confidence 9999999874
Done!