Query psy7171
Match_columns 146
No_of_seqs 47 out of 49
Neff 2.2
Searched_HMMs 46136
Date Fri Aug 16 20:02:39 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7171.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7171hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03261 CDK5_activator: Cycli 100.0 3.7E-80 8E-85 529.9 6.0 126 11-137 219-346 (346)
2 KOG3932|consensus 100.0 1.1E-79 2.3E-84 526.4 7.6 126 12-137 221-348 (357)
3 PF02083 Urotensin_II: Urotens 68.6 1.6 3.5E-05 22.5 -0.0 11 93-103 2-12 (12)
4 PF12241 Enoyl_reductase: Tran 67.8 1.5 3.2E-05 37.6 -0.4 66 72-137 140-222 (237)
5 KOG1151|consensus 67.5 5.3 0.00011 38.6 3.1 47 79-126 702-748 (775)
6 COG3007 Uncharacterized paraqu 49.8 4.8 0.0001 36.6 -0.3 63 72-137 220-302 (398)
7 PF00134 Cyclin_N: Cyclin, N-t 49.8 24 0.00053 24.0 3.2 42 38-80 49-90 (127)
8 PF09027 GTPase_binding: GTPas 44.8 8.9 0.00019 27.3 0.4 14 79-92 5-19 (66)
9 cd00043 CYCLIN Cyclin box fold 41.1 22 0.00049 21.6 1.8 33 47-80 29-61 (88)
10 PF10140 YukC: WXG100 protein 40.8 2.9 6.2E-05 36.9 -3.1 59 33-91 80-153 (359)
11 cd03568 VHS_STAM VHS domain fa 38.3 58 0.0013 24.8 4.0 50 88-138 71-136 (144)
12 PF02129 Peptidase_S15: X-Pro 34.9 19 0.0004 28.4 0.9 17 20-36 224-243 (272)
13 PF07051 OCIA: Ovarian carcino 34.4 13 0.00029 28.5 0.0 14 92-105 36-49 (111)
14 cd00489 Barstar_like Barstar i 34.2 45 0.00098 23.4 2.7 35 52-100 3-37 (85)
15 PLN00214 putative protein; Pro 33.2 77 0.0017 24.8 4.0 57 39-107 48-113 (115)
16 smart00385 CYCLIN domain prese 31.6 1.2E+02 0.0026 18.2 4.2 36 45-81 21-56 (83)
17 PLN02341 pfkB-type carbohydrat 28.4 72 0.0016 28.1 3.5 41 68-108 381-428 (470)
18 PF04112 Mak10: Mak10 subunit, 27.1 43 0.00093 25.8 1.7 36 34-75 51-90 (168)
19 PF00790 VHS: VHS domain; Int 26.8 69 0.0015 23.6 2.7 34 100-134 107-140 (140)
20 cd03565 VHS_Tom1 VHS domain fa 26.7 69 0.0015 24.2 2.7 35 100-135 104-138 (141)
21 COG1553 DsrE Uncharacterized c 26.3 40 0.00087 26.5 1.4 29 42-70 92-120 (126)
22 smart00549 TAFH TAF homology. 25.1 82 0.0018 23.8 2.8 36 44-92 28-63 (92)
23 PRK13317 pantothenate kinase; 24.2 64 0.0014 27.0 2.3 31 58-93 72-102 (277)
24 PLN02920 pantothenate kinase 1 24.1 1.4E+02 0.003 27.3 4.6 82 11-93 68-171 (398)
25 PF00767 Poty_coat: Potyvirus 23.6 26 0.00056 29.6 -0.1 15 79-93 109-123 (237)
26 PF15017 AF1Q: Drug resistance 22.9 34 0.00074 25.3 0.4 11 32-42 72-82 (87)
27 TIGR03634 arc_protsome_B prote 22.8 2.2E+02 0.0047 21.2 4.8 58 23-82 98-175 (185)
28 PF08785 Ku_PK_bind: Ku C term 22.1 78 0.0017 23.2 2.2 44 90-134 34-77 (120)
29 cd03567 VHS_GGA VHS domain fam 21.9 86 0.0019 24.0 2.5 47 88-135 72-136 (139)
30 KOG0013|consensus 20.4 40 0.00086 29.1 0.4 20 84-103 37-56 (231)
No 1
>PF03261 CDK5_activator: Cyclin-dependent kinase 5 activator protein; InterPro: IPR004944 These proteins are neuron specific activators of cyclin-dependent kinase 5 (CDK5) []. They form a heterodimer with the catalytic subunit (CDK5) [].; GO: 0016534 cyclin-dependent protein kinase 5 activator activity, 0016533 cyclin-dependent protein kinase 5 holoenzyme complex; PDB: 3O0G_D 1H4L_E 1UNH_D 1UNL_E 1UNG_E.
Probab=100.00 E-value=3.7e-80 Score=529.90 Aligned_cols=126 Identities=65% Similarity=1.084 Sum_probs=104.0
Q ss_pred CChhhhhhhhcccc--ccccCCcccccccChhHHHHHHHHHhhhcCCcCCCHHHHHHHHHHHHHHhhhccCccccccCcc
Q psy7171 11 DDHLAIACWLFPVP--LRLSEINDVAFINPANVVFVYMLVRELVDGDKIDSESQLQAVVLTCLYLSYSYMGNEISYPLKP 88 (146)
Q Consensus 11 ~d~~~~v~WlR~VD--LLLQGWQD~~FI~PANvVFvYmL~rd~v~~~~i~se~eLqa~vLTCLYLsYSYmGNEISYPLKP 88 (146)
-|+.++|||||+|| ||||||||||||||||||||||||||+|++| ++||+||||+||||||||||||||||||||||
T Consensus 219 ~~~~~~vmWlR~vDRsLLlqgwqd~~fi~pan~vf~yml~r~~~~~~-~~~~~~l~~~~l~cly~sysy~gneisyplkp 297 (346)
T PF03261_consen 219 FSPGDVVMWLRAVDRSLLLQGWQDQAFINPANVVFVYMLCRDVVSGE-VSSERELQAIVLTCLYLSYSYMGNEISYPLKP 297 (346)
T ss_dssp --THHHHHHHHHHHHHHHHTTS-SS-S--HHHHHHHHHHHHHHS-TT---SHHHHHHHHHHHHHHHHHHH-SSS---SGG
T ss_pred CCccceEeehhhcCHHHHhccccccceeccchhhhhHHHHHHhhccc-cCCHHHHHHHHHHHHHHHhhhcCcccccccCC
Confidence 37899999999999 9999999999999999999999999999885 79999999999999999999999999999999
Q ss_pred cccccchhHhhhHHHHHHhhccccchhccCCcchhHHHHHHHHhcCCCC
Q psy7171 89 FLVEDCKEKFWDRCLHIVNNMSSSMLRINAEPGFFTEIFTELKACGTST 137 (146)
Q Consensus 89 FLve~~ke~FW~rcl~Ii~~~S~~MLriNadp~fFTevF~eLK~~~~~~ 137 (146)
||||++||+||||||+|||++|+||||||+|||||||||+|||+||+.+
T Consensus 298 flv~~~~~~fw~~~~~~~~~~s~~ml~~n~~~~~ft~~~~~lk~~~~~~ 346 (346)
T PF03261_consen 298 FLVEESKEAFWDRCLSIINRLSAKMLRINADPHFFTEVFTDLKNEGQSE 346 (346)
T ss_dssp G--SS-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHTT----
T ss_pred eeecccHHHHHHHHHHHHHHhhHHHhhhcCCCchHHHHHHHHHhcccCC
Confidence 9999999999999999999999999999999999999999999999753
No 2
>KOG3932|consensus
Probab=100.00 E-value=1.1e-79 Score=526.42 Aligned_cols=126 Identities=62% Similarity=1.067 Sum_probs=123.1
Q ss_pred Chhhhhhhhcccc--ccccCCcccccccChhHHHHHHHHHhhhcCCcCCCHHHHHHHHHHHHHHhhhccCccccccCccc
Q psy7171 12 DHLAIACWLFPVP--LRLSEINDVAFINPANVVFVYMLVRELVDGDKIDSESQLQAVVLTCLYLSYSYMGNEISYPLKPF 89 (146)
Q Consensus 12 d~~~~v~WlR~VD--LLLQGWQD~~FI~PANvVFvYmL~rd~v~~~~i~se~eLqa~vLTCLYLsYSYmGNEISYPLKPF 89 (146)
|..++|+|||+|| ||||||||||||||||||||||||||++++||++|++||||++||||||||||||||||||||||
T Consensus 221 dpgd~V~WlRsVDRSLLLQGWQD~aFInPANlVFvylLvRdvlsg~e~~s~~eLqA~~LtCLYlsYSYMGNEISYPLKPF 300 (357)
T KOG3932|consen 221 DPGDLVTWLRSVDRSLLLQGWQDQAFINPANLVFVYLLVRDVLSGEEIHSLEELQAWILTCLYLSYSYMGNEISYPLKPF 300 (357)
T ss_pred CccchhHHHHhhhHHHHhhhccccccccchheehhhhhHHhhcCchhhccHHHHHHHHHHHHHHHHHhhcccccccccce
Confidence 7889999999999 99999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccchhHhhhHHHHHHhhccccchhccCCcchhHHHHHHHHhcCCCC
Q psy7171 90 LVEDCKEKFWDRCLHIVNNMSSSMLRINAEPGFFTEIFTELKACGTST 137 (146)
Q Consensus 90 Lve~~ke~FW~rcl~Ii~~~S~~MLriNadp~fFTevF~eLK~~~~~~ 137 (146)
|||++||+||||||.|||++|.+||+||||||||||||+|||+|+..+
T Consensus 301 LVe~dke~FWdRCl~~vn~lS~qML~lNadp~fFteVFtdLK~cs~~~ 348 (357)
T KOG3932|consen 301 LVENDKETFWDRCLAMVNSLSRQMLLLNADPHFFTEVFTDLKNCSSSE 348 (357)
T ss_pred eecccHHHHHHHHHHHHHHhhHHHhhhcCCCchHHHHHHHhhhcccch
Confidence 999999999999999999999999999999999999999999996543
No 3
>PF02083 Urotensin_II: Urotensin II; InterPro: IPR001483 Urotensin II, a small peptide that contains a disulphide bridge, was originally isolated from the caudal portion of the spinal cord of teleost and elasmobranch fish []. The peptide has also been found in the brain of frogs []. Urotensin II seems to be involved in smooth muscle stimulation.; GO: 0005179 hormone activity, 0005576 extracellular region
Probab=68.57 E-value=1.6 Score=22.46 Aligned_cols=11 Identities=36% Similarity=0.951 Sum_probs=8.6
Q ss_pred cchhHhhhHHH
Q psy7171 93 DCKEKFWDRCL 103 (146)
Q Consensus 93 ~~ke~FW~rcl 103 (146)
+..|-||.+|+
T Consensus 2 ~~~~CFWKYCv 12 (12)
T PF02083_consen 2 GKSECFWKYCV 12 (12)
T ss_pred CccchhhhhcC
Confidence 35688999995
No 4
>PF12241 Enoyl_reductase: Trans-2-enoyl-CoA reductase catalytic region; PDB: 3ZU5_A 3ZU3_A 3ZU4_A 3ZU2_A 3S8M_A.
Probab=67.82 E-value=1.5 Score=37.56 Aligned_cols=66 Identities=23% Similarity=0.430 Sum_probs=33.7
Q ss_pred HHhhhccCccccccCcc-cccccchhHhhhHHHHHHhhccc----------cch------hccCCcchhHHHHHHHHhcC
Q psy7171 72 YLSYSYMGNEISYPLKP-FLVEDCKEKFWDRCLHIVNNMSS----------SML------RINAEPGFFTEIFTELKACG 134 (146)
Q Consensus 72 YLsYSYmGNEISYPLKP-FLve~~ke~FW~rcl~Ii~~~S~----------~ML------riNadp~fFTevF~eLK~~~ 134 (146)
=++|||.|.|+.||+=- =-+..-|+..=.....|-.+++. |-| -|-.-|-|..-+|+-+|..|
T Consensus 140 TvAySYIG~~~T~pIY~~GTiG~AK~dLe~ta~~i~~~L~~~~G~A~vsV~KAlVTqAS~aIP~~pLYi~~L~kVMKekG 219 (237)
T PF12241_consen 140 TVAYSYIGPELTWPIYRDGTIGKAKEDLEKTAHAINEKLAAIGGKAYVSVNKALVTQASSAIPVVPLYISLLYKVMKEKG 219 (237)
T ss_dssp EEEEEE---GGGCCCCTTCHHHHHHHHHHHHHHHHHHHHHTTT-EEEEEEE-----TTGGGSTCHHHHHHHHHHHHHHCT
T ss_pred EEEEeccCcccChhhhcCCcHHHHHHHHHHHHHHHHHHHHhcCCcEEEEEehhhhhhhhhcCccHHHHHHHHHHHHHhcC
Confidence 46899999999999721 11333444444444444444432 111 12334667778888888776
Q ss_pred CCC
Q psy7171 135 TST 137 (146)
Q Consensus 135 ~~~ 137 (146)
.-+
T Consensus 220 ~HE 222 (237)
T PF12241_consen 220 THE 222 (237)
T ss_dssp ---
T ss_pred CCc
Confidence 543
No 5
>KOG1151|consensus
Probab=67.48 E-value=5.3 Score=38.55 Aligned_cols=47 Identities=21% Similarity=0.477 Sum_probs=37.3
Q ss_pred CccccccCcccccccchhHhhhHHHHHHhhccccchhccCCcchhHHH
Q psy7171 79 GNEISYPLKPFLVEDCKEKFWDRCLHIVNNMSSSMLRINAEPGFFTEI 126 (146)
Q Consensus 79 GNEISYPLKPFLve~~ke~FW~rcl~Ii~~~S~~MLriNadp~fFTev 126 (146)
..|++||-||-. .....+|-.|||.--.+--.+.-++-.||-|.--+
T Consensus 702 AtEVqFP~KPvV-sseAkaFIRRCLaYRKeDR~DV~qLA~dpyllPh~ 748 (775)
T KOG1151|consen 702 ATEVQFPPKPVV-SSEAKAFIRRCLAYRKEDRIDVQQLACDPYLLPHI 748 (775)
T ss_pred ceeccCCCCCcc-CHHHHHHHHHHHHhhhhhhhhHHHHccCccccchh
Confidence 459999999964 45567899999998887777888888888766533
No 6
>COG3007 Uncharacterized paraquat-inducible protein B [Function unknown]
Probab=49.82 E-value=4.8 Score=36.63 Aligned_cols=63 Identities=24% Similarity=0.375 Sum_probs=37.4
Q ss_pred HHhhhccCccccccCc---------ccc------ccc-----chhHhhhHHHHHHhhccccchhccCCcchhHHHHHHHH
Q psy7171 72 YLSYSYMGNEISYPLK---------PFL------VED-----CKEKFWDRCLHIVNNMSSSMLRINAEPGFFTEIFTELK 131 (146)
Q Consensus 72 YLsYSYmGNEISYPLK---------PFL------ve~-----~ke~FW~rcl~Ii~~~S~~MLriNadp~fFTevF~eLK 131 (146)
=++|||||.+|.||+- .=| +.+ .-.++=.-|-.++..-|+- |-.-|-|..-+|+-+|
T Consensus 220 TiAfsYiG~~iT~~IYw~GtiG~AK~DLd~~~~~inekLa~~gG~A~vsVlKavVTqASsa---IP~~plYla~lfkvMK 296 (398)
T COG3007 220 TIAFSYIGEKITHPIYWDGTIGRAKKDLDQKSLAINEKLAALGGGARVSVLKAVVTQASSA---IPMMPLYLAILFKVMK 296 (398)
T ss_pred EEEEEecCCccccceeeccccchhhhcHHHHHHHHHHHHHhcCCCeeeeehHHHHhhhhhc---cccccHHHHHHHHHHH
Confidence 3799999999999961 000 000 1122233344444444442 4455888888888888
Q ss_pred hcCCCC
Q psy7171 132 ACGTST 137 (146)
Q Consensus 132 ~~~~~~ 137 (146)
+.|.-+
T Consensus 297 ekg~HE 302 (398)
T COG3007 297 EKGTHE 302 (398)
T ss_pred HcCcch
Confidence 877644
No 7
>PF00134 Cyclin_N: Cyclin, N-terminal domain; InterPro: IPR006671 Cyclins are eukaryotic proteins that play an active role in controlling nuclear cell division cycles [], and regulate cyclin dependent kinases (CDKs). Cyclins, together with the p34 (cdc2) or cdk2 kinases, form the Maturation Promoting Factor (MPF). There are two main groups of cyclins, G1/S cyclins, which are essential for the control of the cell cycle at the G1/S (start) transition, and G2/M cyclins, which are essential for the control of the cell cycle at the G2/M (mitosis) transition. G2/M cyclins accumulate steadily during G2 and are abruptly destroyed as cells exit from mitosis (at the end of the M-phase). In most species, there are multiple forms of G1 and G2 cyclins. For example, in vertebrates, there are two G2 cyclins, A and B, and at least three G1 cyclins, C, D, and E. Cyclin homologues have been found in various viruses, including Saimiriine herpesvirus 2 (Herpesvirus saimiri) and Human herpesvirus 8 (HHV-8) (Kaposi's sarcoma-associated herpesvirus). These viral homologues differ from their cellular counterparts in that the viral proteins have gained new functions and eliminated others to harness the cell and benefit the virus []. Cyclins contain two domains of similar all-alpha fold, of which this entry is associated with the N-terminal domain.; PDB: 2W2H_B 3RGF_B 1KXU_A 1JKW_A 1URC_D 2WMB_D 1H26_D 1GY3_B 2UUE_D 1E9H_D ....
Probab=49.78 E-value=24 Score=23.99 Aligned_cols=42 Identities=21% Similarity=0.430 Sum_probs=26.5
Q ss_pred ChhHHHHHHHHHhhhcCCcCCCHHHHHHHHHHHHHHhhhccCc
Q psy7171 38 PANVVFVYMLVRELVDGDKIDSESQLQAVVLTCLYLSYSYMGN 80 (146)
Q Consensus 38 PANvVFvYmL~rd~v~~~~i~se~eLqa~vLTCLYLsYSYmGN 80 (146)
|.-+...-.++...+.... -....+|-+.+||+++|.-+-..
T Consensus 49 ~~~~~~A~~~~dr~~~~~~-~~~~~~~li~~~cl~lA~K~~e~ 90 (127)
T PF00134_consen 49 PETLHLAIYLFDRFLSKRP-VNRSKLQLIALACLFLASKMEED 90 (127)
T ss_dssp HHHHHHHHHHHHHHHTTS--TTCCGHHHHHHHHHHHHHHHHTS
T ss_pred hhHHHHHHHHHHHHHhhcc-cccchhhhhhhhHHHHhhhhhcc
Confidence 3333333344444444432 56688999999999999886544
No 8
>PF09027 GTPase_binding: GTPase binding; InterPro: IPR015116 The GTPase binding domain binds to the G protein Cdc42, inhibiting both its intrinsic and stimulated GTPase activity. The domain is largely unstructured in the absence of Cdc42 []. ; PDB: 1CF4_B.
Probab=44.85 E-value=8.9 Score=27.26 Aligned_cols=14 Identities=57% Similarity=0.809 Sum_probs=4.2
Q ss_pred CccccccCc-ccccc
Q psy7171 79 GNEISYPLK-PFLVE 92 (146)
Q Consensus 79 GNEISYPLK-PFLve 92 (146)
..|||-||| +|+=.
T Consensus 5 aqeIs~PLk~sFlH~ 19 (66)
T PF09027_consen 5 AQEISVPLKNSFLHT 19 (66)
T ss_dssp STT---S-SS-----
T ss_pred hhhhccccccccccc
Confidence 579999999 88854
No 9
>cd00043 CYCLIN Cyclin box fold. Protein binding domain functioning in cell-cycle and transcription control. Present in cyclins, TFIIB and Retinoblastoma (RB).The cyclins consist of 8 classes of cell cycle regulators that regulate cyclin dependent kinases (CDKs). TFIIB is a transcription factor that binds the TATA box. Cyclins, TFIIB and RB contain 2 copies of the domain.
Probab=41.10 E-value=22 Score=21.58 Aligned_cols=33 Identities=15% Similarity=0.058 Sum_probs=23.2
Q ss_pred HHHhhhcCCcCCCHHHHHHHHHHHHHHhhhccCc
Q psy7171 47 LVRELVDGDKIDSESQLQAVVLTCLYLSYSYMGN 80 (146)
Q Consensus 47 L~rd~v~~~~i~se~eLqa~vLTCLYLsYSYmGN 80 (146)
++...+.... -.....|.+..+|+|+|--+.+.
T Consensus 29 ~~~~~~~~~~-~~~~~~~~ia~a~l~lA~k~~~~ 61 (88)
T cd00043 29 LLDRFLLDYS-VLGRSPSLVAAAALYLAAKVEEI 61 (88)
T ss_pred HHHHHHHhcc-cccCChHHHHHHHHHHHHHHcCC
Confidence 3444444333 34678899999999999887776
No 10
>PF10140 YukC: WXG100 protein secretion system (Wss), protein YukC; InterPro: IPR018778 Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This protein is designated YukC in Bacillus subtilis and EssB is Staphylococcus aureus. ; PDB: 4ANN_A.
Probab=40.82 E-value=2.9 Score=36.90 Aligned_cols=59 Identities=32% Similarity=0.493 Sum_probs=30.6
Q ss_pred cccccChhHHH-----HHHHHH---hhhcCCcCCCH---HHHHHHHHHHHHHhhh----ccCccccccCccccc
Q psy7171 33 VAFINPANVVF-----VYMLVR---ELVDGDKIDSE---SQLQAVVLTCLYLSYS----YMGNEISYPLKPFLV 91 (146)
Q Consensus 33 ~~FI~PANvVF-----vYmL~r---d~v~~~~i~se---~eLqa~vLTCLYLsYS----YmGNEISYPLKPFLv 91 (146)
.-|+.|.|+|| ++++-| +.+++.+.+.| .+.+|.+.++|-=-|| |.|..=.+.+.||+-
T Consensus 80 ~f~l~P~Nl~fd~~~~p~i~hrGi~~~l~P~~~~ee~fl~qyKali~~ll~~k~~Fe~ly~G~~e~~~~~~f~~ 153 (359)
T PF10140_consen 80 TFILHPENLVFDKNLMPKILHRGIKEILPPYELDEEDFLKQYKALIIALLDPKYSFEDLYNGSLELLKGSPFLK 153 (359)
T ss_dssp E---SGGGEEE-TTS-EEES--EETTTBS-----HHHHHHHHHHHHHHHH-TT--HHHHHHS-GGG----HHHH
T ss_pred eEEEechheEEcCCCCEEEEEcCCcCCCCCCCCCHHHHHHHHHHHHHHHHcCCCCHHHHHhChhhhcccCHHHH
Confidence 35789999999 445544 77776663332 3678999999987777 456888888888874
No 11
>cd03568 VHS_STAM VHS domain family, STAM subfamily; members include STAM (Signal Transducing Adaptor Molecule), EAST (EGFR-associated protein with SH3 and TAM domains) and Hbp (Hrs-binding protein). Collectively, they are referred to as STAM. All STAMs have at their N-termini a VHS domain, which is involved in cytokine-mediated intracellular signal transduction and has a superhelical structure similar to the structure of ARM (Armadillo) repeats, followed by a SH3 (Src homology 3) domain, a well-established protein-protein interaction domain. At the C-termini of most vertebrate STAMS, an ITAM (Immunoreceptor Tyrosine-based Activation) motif is present, which mediates the binding of HRS (hepatocyte growth factor-regulated tyrosine kinase substrate) in endocytic and exocytic machineries.
Probab=38.30 E-value=58 Score=24.85 Aligned_cols=50 Identities=14% Similarity=0.301 Sum_probs=30.8
Q ss_pred ccccccchhHhhhHHHHHHhh----------------ccccchhccCCcchhHHHHHHHHhcCCCCC
Q psy7171 88 PFLVEDCKEKFWDRCLHIVNN----------------MSSSMLRINAEPGFFTEIFTELKACGTSTP 138 (146)
Q Consensus 88 PFLve~~ke~FW~rcl~Ii~~----------------~S~~MLriNadp~fFTevF~eLK~~~~~~~ 138 (146)
+|..|-....|.+...++++. .+. ..+-..+-.++.+++..||++|-.-+
T Consensus 71 ~fh~evask~Fl~eL~kl~~~~~~~~Vk~kil~li~~W~~-~f~~~~~l~~i~~~y~~L~~~G~~f~ 136 (144)
T cd03568 71 RFHQEVASRDFTQELKKLINDRVHPTVKEKLREVVKQWAD-EFKNDPSLSLMSDLYKKLKNEGPDLV 136 (144)
T ss_pred HHHHHHhhHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHH-HhCCCcccHHHHHHHHHHHHcCCCCC
Confidence 455454555666665555555 332 12223344579999999999998755
No 12
>PF02129 Peptidase_S15: X-Pro dipeptidyl-peptidase (S15 family); InterPro: IPR000383 This entry represents a domain found peptidases Xaa-Pro dipeptidyl-peptidase and glutaryl-7-aminocephalosporanic-acid acylase, which belong to MEROPS peptidase families S15 and S45 respectively []. It is also found in hydrolases from the CocE/NonD family. Cocaine esterase (CocE) hydrolyzes cocaine endowing the bacteria with the ability to utilise cocaine as a sole source of carbon and energy []. ; GO: 0004177 aminopeptidase activity, 0006508 proteolysis; PDB: 1LNS_A 3PUI_A 3PUH_B 1JU3_A 3I2I_A 3I2G_A 1JU4_A 3I2K_A 3IDA_A 3I2H_A ....
Probab=34.88 E-value=19 Score=28.36 Aligned_cols=17 Identities=24% Similarity=0.284 Sum_probs=13.2
Q ss_pred hcccc---ccccCCcccccc
Q psy7171 20 LFPVP---LRLSEINDVAFI 36 (146)
Q Consensus 20 lR~VD---LLLQGWQD~~FI 36 (146)
++.+| |+++||+|..|.
T Consensus 224 ~~~i~vP~l~v~Gw~D~~~~ 243 (272)
T PF02129_consen 224 LDKIDVPVLIVGGWYDTLFL 243 (272)
T ss_dssp HGG--SEEEEEEETTCSSTS
T ss_pred HhhCCCCEEEecccCCcccc
Confidence 36777 999999999888
No 13
>PF07051 OCIA: Ovarian carcinoma immunoreactive antigen (OCIA); InterPro: IPR009764 This family consists of several ovarian carcinoma immunoreactive antigen (OCIA) and related eukaryotic sequences. The function of this family is unknown [,].
Probab=34.44 E-value=13 Score=28.50 Aligned_cols=14 Identities=43% Similarity=0.771 Sum_probs=11.2
Q ss_pred ccchhHhhhHHHHH
Q psy7171 92 EDCKEKFWDRCLHI 105 (146)
Q Consensus 92 e~~ke~FW~rcl~I 105 (146)
|=++|+||.|||=+
T Consensus 36 EC~~ESFwyRslPl 49 (111)
T PF07051_consen 36 ECNEESFWYRSLPL 49 (111)
T ss_pred HHHHhhhHhccCcH
Confidence 44889999999743
No 14
>cd00489 Barstar_like Barstar is an intracellular inhibitor of barnase, an extracellular ribonuclease of Bacillus amyloliquefaciens. Barstar binds tightly to the barnase active site and sterically blocks it, thus inhibiting its potentially lethal RNase activity inside the cell. Barstar also binds and inhibits a ribonuclease called RNase Sa (produced by Streptomyces aureofaciens) which belongs to the same enzyme family as does barnase.
Probab=34.25 E-value=45 Score=23.43 Aligned_cols=35 Identities=26% Similarity=0.542 Sum_probs=26.3
Q ss_pred hcCCcCCCHHHHHHHHHHHHHHhhhccCccccccCcccccccchhHhhh
Q psy7171 52 VDGDKIDSESQLQAVVLTCLYLSYSYMGNEISYPLKPFLVEDCKEKFWD 100 (146)
Q Consensus 52 v~~~~i~se~eLqa~vLTCLYLsYSYmGNEISYPLKPFLve~~ke~FW~ 100 (146)
|+++.|.|.++++..+-..|-+. +|.|+ |-|+|||
T Consensus 3 id~~~i~~~~~f~~~~~~~l~fp-~~fG~-------------NlDAl~D 37 (85)
T cd00489 3 IDGEDIADWEDFHARLKKKLGFP-DYYGH-------------NLDALWD 37 (85)
T ss_pred EecccCCCHHHHHHHHHHHhCCc-cccCC-------------CHHHHHH
Confidence 56788999999998887776332 44554 8899998
No 15
>PLN00214 putative protein; Provisional
Probab=33.23 E-value=77 Score=24.77 Aligned_cols=57 Identities=14% Similarity=0.323 Sum_probs=40.5
Q ss_pred hhHHHHHHH---------HHhhhcCCcCCCHHHHHHHHHHHHHHhhhccCccccccCcccccccchhHhhhHHHHHHh
Q psy7171 39 ANVVFVYML---------VRELVDGDKIDSESQLQAVVLTCLYLSYSYMGNEISYPLKPFLVEDCKEKFWDRCLHIVN 107 (146)
Q Consensus 39 ANvVFvYmL---------~rd~v~~~~i~se~eLqa~vLTCLYLsYSYmGNEISYPLKPFLve~~ke~FW~rcl~Ii~ 107 (146)
++-+|-.++ ||+++..+ .+=|-.+..-+.=.|.|-|+|=-|=- -.+.-|+.|++|..
T Consensus 48 a~EI~a~i~~N~t~s~~CC~~LVk~G-----K~CH~~LiK~i~~~p~~k~~as~~l~-------kSd~~Wn~Cv~i~~ 113 (115)
T PLN00214 48 ALDIIAVVFENGTLIDPCCNDLVKEG-----KVCHDTLIKYIADKPMLIAHETEYLK-------KSDDLWNHCVSISK 113 (115)
T ss_pred HHHHHHHHHcCCCCchHHHHHHHHHh-----hHHHHHHHHHHHcchhhhhhHHHhhc-------ccHhHHHHHHHHhc
Confidence 345565565 66666544 36677777777778999999876633 35789999999964
No 16
>smart00385 CYCLIN domain present in cyclins, TFIIB and Retinoblastoma. A helical domain present in cyclins and TFIIB (twice) and Retinoblastoma (once). A protein recognition domain functioning in cell-cycle and transcription control.
Probab=31.60 E-value=1.2e+02 Score=18.16 Aligned_cols=36 Identities=14% Similarity=0.099 Sum_probs=24.4
Q ss_pred HHHHHhhhcCCcCCCHHHHHHHHHHHHHHhhhccCcc
Q psy7171 45 YMLVRELVDGDKIDSESQLQAVVLTCLYLSYSYMGNE 81 (146)
Q Consensus 45 YmL~rd~v~~~~i~se~eLqa~vLTCLYLsYSYmGNE 81 (146)
..+++..+....+. ....|.+..+|+|+|--..+..
T Consensus 21 ~~~~~~~l~~~~~~-~~~~~~ia~a~l~lA~k~~~~~ 56 (83)
T smart00385 21 VNLLDRFLSDYKFL-KYSPSLIAAAALYLAAKTEEIP 56 (83)
T ss_pred HHHHHHHHHHhhcc-cCCHHHHHHHHHHHHHHHhcCC
Confidence 34455555432222 2788999999999998887654
No 17
>PLN02341 pfkB-type carbohydrate kinase family protein
Probab=28.41 E-value=72 Score=28.14 Aligned_cols=41 Identities=12% Similarity=0.105 Sum_probs=30.3
Q ss_pred HHHHHHhhhccCcccccc----Ccccccc---cchhHhhhHHHHHHhh
Q psy7171 68 LTCLYLSYSYMGNEISYP----LKPFLVE---DCKEKFWDRCLHIVNN 108 (146)
Q Consensus 68 LTCLYLsYSYmGNEISYP----LKPFLve---~~ke~FW~rcl~Ii~~ 108 (146)
-.|--++-+..|...++| ++.||-+ +.|..||+-|+.++++
T Consensus 381 ~a~aA~~v~~~Ga~~~~p~~~ev~~~l~~~~~~~~~~~~~~~~~~~~~ 428 (470)
T PLN02341 381 NAVGAATAMGCGAGRNVATLEKVLELLRASNLNEDDTFWAELLKNSDC 428 (470)
T ss_pred HHHHHHHHcCcCCCCCCCCHHHHHHHHHhcCCCcchhHHHHhhccccc
Confidence 344455667788888888 7778853 4688999999987654
No 18
>PF04112 Mak10: Mak10 subunit, NatC N(alpha)-terminal acetyltransferase; InterPro: IPR007244 NatC N(alpha)-terminal acetyltransferases contain Mak10p, Mak31p and Mak3p subunits. All three subunits are associated with each other to form the active complex [].
Probab=27.10 E-value=43 Score=25.78 Aligned_cols=36 Identities=33% Similarity=0.622 Sum_probs=21.9
Q ss_pred ccccChhHHHHH--HHHHhh--hcCCcCCCHHHHHHHHHHHHHHhh
Q psy7171 34 AFINPANVVFVY--MLVREL--VDGDKIDSESQLQAVVLTCLYLSY 75 (146)
Q Consensus 34 ~FI~PANvVFvY--mL~rd~--v~~~~i~se~eLqa~vLTCLYLsY 75 (146)
.-.+|..++-+. ||+.++ ..|. -|-.+|+||+|+--
T Consensus 51 ~~l~~~evi~I~D~Ll~~e~~Wl~G~------sL~qTv~TC~Y~~~ 90 (168)
T PF04112_consen 51 KPLSPEEVIGIMDRLLACEVSWLEGY------SLPQTVFTCLYLHH 90 (168)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHcCC------cHHHHHHHHHHHHH
Confidence 445666665543 444443 3333 37789999999843
No 19
>PF00790 VHS: VHS domain; InterPro: IPR002014 The VHS domain is a ~140 residues long domain, whose name is derived from its occurrence in VPS-27, Hrs and STAM. Based on regions surrounding the domain, VHS-proteins can be divided into 4 groups []: STAM/EAST/Hbp which all share the domain composition VHS-SH3-ITAM and carry one or two ubiquitin-interacting motifs Proteins with a FYVE domain (IPR000306 from INTERPRO) C-terminal to VHS which also carry one or two ubiquitin-interacting motifs GGA proteins with a domain composition VHS-GAT (GGA and Tom1) homology domain VHS domain alone or in combination with domains other than those listed above The VHS domain is always found at the N- terminus of proteins suggesting that such topology is important for function. The domain is considered to have a general membrane targeting/cargo recognition role in vesicular trafficking []. Resolution of the crystal structure of the VHS domain of Drosophila Hrs and human Tom1 revealed that it consists of eight helices arranged in a double-layer superhelix []. The existence of conserved patches of residues on the domain surface suggests that VHS domains may be involved in protein-protein recognition and docking. Overall, sequence similarity is low (approx 25%) amongst domain family members.; GO: 0006886 intracellular protein transport; PDB: 1X5B_A 2L0T_B 1DVP_A 3LDZ_C 3ZYQ_A 4AVX_A 3G2U_A 3G2W_A 1UJJ_A 3G2V_A ....
Probab=26.80 E-value=69 Score=23.58 Aligned_cols=34 Identities=15% Similarity=0.433 Sum_probs=22.3
Q ss_pred hHHHHHHhhccccchhccCCcchhHHHHHHHHhcC
Q psy7171 100 DRCLHIVNNMSSSMLRINAEPGFFTEIFTELKACG 134 (146)
Q Consensus 100 ~rcl~Ii~~~S~~MLriNadp~fFTevF~eLK~~~ 134 (146)
++++.+|..-+... +-+.+-.+|.+.|..||+.|
T Consensus 107 ~k~l~ll~~W~~~f-~~~~~~~~i~~~y~~Lk~~G 140 (140)
T PF00790_consen 107 EKILELLQEWAEAF-KSDPELSLIQDTYKRLKRKG 140 (140)
T ss_dssp HHHHHHHHHHHHHT-TTSTTGHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHH-CCCCCchHHHHHHHHHHHCc
Confidence 34444444443333 55666778999999999876
No 20
>cd03565 VHS_Tom1 VHS domain family, Tom1 subfamily; The VHS domain is an essential part of Tom1 (Target of myb1 - retroviral oncogene) protein. The VHS domain has a superhelical structure similar to the structure of the ARM repeats and is present at the very N-termini of proteins. It is a right-handed superhelix of eight alpha helices. The VHS domain has been found in a number of proteins, some of which have been implicated in intracellular trafficking and sorting. The VHS domain of the Tom1 protein is essential for the negative regulation of Interleukin-1 and Tumor Necrosis Factor-induced signaling pathways.
Probab=26.67 E-value=69 Score=24.23 Aligned_cols=35 Identities=20% Similarity=0.360 Sum_probs=23.5
Q ss_pred hHHHHHHhhccccchhccCCcchhHHHHHHHHhcCC
Q psy7171 100 DRCLHIVNNMSSSMLRINAEPGFFTEIFTELKACGT 135 (146)
Q Consensus 100 ~rcl~Ii~~~S~~MLriNadp~fFTevF~eLK~~~~ 135 (146)
+|++.+|...+. .++-..+-.++.++|.+||++|-
T Consensus 104 ~kil~li~~W~~-~f~~~~~l~~i~~~y~~L~~~G~ 138 (141)
T cd03565 104 EKVLALIQAWAD-AFRGSPDLTGVVEVYEELKKKGI 138 (141)
T ss_pred HHHHHHHHHHHH-HhCCCccchHHHHHHHHHHHcCC
Confidence 566666666653 23333444679999999999874
No 21
>COG1553 DsrE Uncharacterized conserved protein involved in intracellular sulfur reduction [Inorganic ion transport and metabolism]
Probab=26.26 E-value=40 Score=26.54 Aligned_cols=29 Identities=21% Similarity=0.260 Sum_probs=24.2
Q ss_pred HHHHHHHHhhhcCCcCCCHHHHHHHHHHH
Q psy7171 42 VFVYMLVRELVDGDKIDSESQLQAVVLTC 70 (146)
Q Consensus 42 VFvYmL~rd~v~~~~i~se~eLqa~vLTC 70 (146)
..-+=++|++++|-++++..||..++++|
T Consensus 92 ~~~~~~~~nl~~g~~lsgL~eLa~~~~~~ 120 (126)
T COG1553 92 AERLGLASNLIEGFELSGLGELAEATLTA 120 (126)
T ss_pred hhcccchhccccceeeccHHHHHHHHHhc
Confidence 44455677788888999999999999987
No 22
>smart00549 TAFH TAF homology. Domain in Drosophila nervy, CBFA2T1, human TAF105, human TAF130, and Drosophila TAF110. Also known as nervy homology region 1 (NHR1).
Probab=25.08 E-value=82 Score=23.75 Aligned_cols=36 Identities=28% Similarity=0.525 Sum_probs=25.9
Q ss_pred HHHHHHhhhcCCcCCCHHHHHHHHHHHHHHhhhccCccccccCcccccc
Q psy7171 44 VYMLVRELVDGDKIDSESQLQAVVLTCLYLSYSYMGNEISYPLKPFLVE 92 (146)
Q Consensus 44 vYmL~rd~v~~~~i~se~eLqa~vLTCLYLsYSYmGNEISYPLKPFLve 92 (146)
|==|+++++++. -+.+|.+.-+.. ++.+|++||+|-
T Consensus 28 Vr~LV~~L~~~~--i~~EeF~~~Lq~-----------~lns~~qP~lvP 63 (92)
T smart00549 28 VRTLVLGLVNGT--ITAEEFTSRLQE-----------ALNSPLQPYLIP 63 (92)
T ss_pred HHHHHHHHHhCC--CCHHHHHHHHHH-----------HHcCCCCchhHH
Confidence 445677788765 367777766554 678999999985
No 23
>PRK13317 pantothenate kinase; Provisional
Probab=24.24 E-value=64 Score=27.04 Aligned_cols=31 Identities=19% Similarity=0.252 Sum_probs=26.4
Q ss_pred CCHHHHHHHHHHHHHHhhhccCccccccCccccccc
Q psy7171 58 DSESQLQAVVLTCLYLSYSYMGNEISYPLKPFLVED 93 (146)
Q Consensus 58 ~se~eLqa~vLTCLYLsYSYmGNEISYPLKPFLve~ 93 (146)
..+.|++|.+.-.-|+. +|+.+|++||++++
T Consensus 72 ~~v~E~~a~~~g~~~l~-----~~~~~~~~~~~i~~ 102 (277)
T PRK13317 72 AEFVEFEATGLGVRYLL-----KEEGHDLNDYIFTN 102 (277)
T ss_pred eeeHHHHHHHHHHHHHH-----HhcCCCCCcEEEEE
Confidence 34899999999999887 57789999999876
No 24
>PLN02920 pantothenate kinase 1
Probab=24.10 E-value=1.4e+02 Score=27.34 Aligned_cols=82 Identities=13% Similarity=0.234 Sum_probs=52.3
Q ss_pred CChhhhhhhhccccccccCCcc---------cccccChhHHHHHHHHHhhhcCCcCCCHHHHHHHHHHHHHH-------h
Q psy7171 11 DDHLAIACWLFPVPLRLSEIND---------VAFINPANVVFVYMLVRELVDGDKIDSESQLQAVVLTCLYL-------S 74 (146)
Q Consensus 11 ~d~~~~v~WlR~VDLLLQGWQD---------~~FI~PANvVFvYmL~rd~v~~~~i~se~eLqa~vLTCLYL-------s 74 (146)
.+.++.+.|++.-.++.-|=+. +--+|=-.-...+=++++...- .+.-+.|+++.+.-+-|| .
T Consensus 68 ~~i~~~i~fl~~~~~~~~~~~~~~~~~~~~~~i~~TGGGA~k~~~~~~~~~~i-~~~k~DEm~~li~Gl~fLl~~~~~e~ 146 (398)
T PLN02920 68 RKINDCLEFISSNKLHHGGFQHHENPTHDKNFIKATGGGAYKFADLFKEKLGI-SLDKEDEMDCLVTGANFLLKAVHHEA 146 (398)
T ss_pred cCHHHHHHHHHhccccccccccccccCCCceEEEEECCcHHHHHHHHHhhhCC-CceeecHHHHHHHHHHHHHhhCCcce
Confidence 4667888899865443322111 1112333344455667776643 356789999999999999 5
Q ss_pred hhccCccccc------cCccccccc
Q psy7171 75 YSYMGNEISY------PLKPFLVED 93 (146)
Q Consensus 75 YSYmGNEISY------PLKPFLve~ 93 (146)
|.|...+-.+ ++.|||+-+
T Consensus 147 f~y~~~~~~~~~~~~~~lyPyLLVN 171 (398)
T PLN02920 147 FTYLDGQKEFVQIDHNDLYPYLLVN 171 (398)
T ss_pred eEeccCcccccccCccccCceEEEE
Confidence 7777665554 799999644
No 25
>PF00767 Poty_coat: Potyvirus coat protein; InterPro: IPR001592 This protease is found in genome polyproteins of potyviruses. The genome polyprotein contains: N-terminal protein (P1), helper component protease (3.4.22 from EC, HC-PRO), protein P3, 6KD protein (6K1), cytoplasmic inclusion protein (CI), 6KD protein 2 (6K2), genome-linked protein (VPG), nuclear inclusion protein A (3.4.22 from EC), nuclear inclusion protein B (2.7.7.48 from EC) and coat protein (CP). The coat protein is at the C terminus of the polyprotein.; GO: 0019028 viral capsid
Probab=23.62 E-value=26 Score=29.62 Aligned_cols=15 Identities=40% Similarity=0.973 Sum_probs=11.7
Q ss_pred CccccccCccccccc
Q psy7171 79 GNEISYPLKPFLVED 93 (146)
Q Consensus 79 GNEISYPLKPFLve~ 93 (146)
+..+.||||||+...
T Consensus 109 ~~Qveypl~P~i~~A 123 (237)
T PF00767_consen 109 EEQVEYPLKPFIENA 123 (237)
T ss_pred ccceeeccHHhhhcC
Confidence 345889999998755
No 26
>PF15017 AF1Q: Drug resistance and apoptosis regulator
Probab=22.87 E-value=34 Score=25.29 Aligned_cols=11 Identities=27% Similarity=0.585 Sum_probs=8.5
Q ss_pred ccccccChhHH
Q psy7171 32 DVAFINPANVV 42 (146)
Q Consensus 32 D~~FI~PANvV 42 (146)
+-|.|||+|+=
T Consensus 72 ~gGWITPsNIk 82 (87)
T PF15017_consen 72 GGGWITPSNIK 82 (87)
T ss_pred CCccccchhhh
Confidence 45789999973
No 27
>TIGR03634 arc_protsome_B proteasome endopeptidase complex, archaeal, beta subunit. This protein family describes the archaeal proteasome beta subunit, homologous to both the alpha subunit and to the alpha and beta subunits of eukaryotic proteasome subunits. This family is universal in the first 29 complete archaeal genomes but occasionally is duplicated.
Probab=22.76 E-value=2.2e+02 Score=21.24 Aligned_cols=58 Identities=17% Similarity=0.266 Sum_probs=41.5
Q ss_pred ccccccCCcccc----cccCh-------------hHHHHHHHHHhhhcCCcCCCHHHHHHHHHHHHHHhhh---ccCccc
Q psy7171 23 VPLRLSEINDVA----FINPA-------------NVVFVYMLVRELVDGDKIDSESQLQAVVLTCLYLSYS---YMGNEI 82 (146)
Q Consensus 23 VDLLLQGWQD~~----FI~PA-------------NvVFvYmL~rd~v~~~~i~se~eLqa~vLTCLYLsYS---YmGNEI 82 (146)
|.+++-||.+.+ .+.|. +--|.+=++++...++ -|.+|+.+....||..+.. +.|++|
T Consensus 98 v~~ivaG~d~~g~~Ly~~d~~G~~~~~~~~a~G~g~~~~~~~Le~~~~~~--~s~~ea~~l~~~~l~~~~~r~~~~~~~~ 175 (185)
T TIGR03634 98 VQLLVGGVDEEGPHLYSLDPAGGIIEDDYTATGSGSPVAYGVLEDEYRED--MSVEEAKKLAVRAIKSAIERDVASGNGI 175 (185)
T ss_pred EEEEEEEEeCCCCEEEEECCCCCeEECCEEEEcCcHHHHHHHHHhcCCCC--CCHHHHHHHHHHHHHHHHHhcccCCCCE
Confidence 347888886433 55555 4556777788887654 4889999999999998877 666543
No 28
>PF08785 Ku_PK_bind: Ku C terminal domain like; InterPro: IPR014893 The non-homologous end joining (NHEJ) pathway is one method by which double stranded breaks in chromosomal DNA are repaired. Ku is a component of a multi-protein complex that is involved in the NHEJ. Ku has affinity for DNA ends and recruits the DNA-dependent protein kinase catalytic subunit (DNA-PKcs). This domain is found at the C-terminal of Ku which binds to DNA-PKcs []. ; GO: 0016817 hydrolase activity, acting on acid anhydrides; PDB: 1RW2_A 1Q2Z_A 3ISM_C.
Probab=22.06 E-value=78 Score=23.19 Aligned_cols=44 Identities=20% Similarity=0.397 Sum_probs=34.0
Q ss_pred ccccchhHhhhHHHHHHhhccccchhccCCcchhHHHHHHHHhcC
Q psy7171 90 LVEDCKEKFWDRCLHIVNNMSSSMLRINAEPGFFTEIFTELKACG 134 (146)
Q Consensus 90 Lve~~ke~FW~rcl~Ii~~~S~~MLriNadp~fFTevF~eLK~~~ 134 (146)
||.++.+.+.++++.-|..|-...++.+ +|..|++...+||..-
T Consensus 34 Lv~~s~~~~y~kalecl~~lR~~~i~~~-ep~~yN~Fl~~LK~~~ 77 (120)
T PF08785_consen 34 LVSDSGDQNYDKALECLRALREECIEEE-EPDEYNDFLRKLKKKL 77 (120)
T ss_dssp HHHCSHCHHHHHHHHHHHHHHHHHHHHT--CHHHHHHHHHHHHHH
T ss_pred HHhccCcchHHHHHHHHHHHHHHHHhcC-CHHHHHHHHHHHHHHH
Confidence 3455578889998888888876666655 4999999999999854
No 29
>cd03567 VHS_GGA VHS domain family, GGA subfamily; GGA (Golgi-localized, Gamma-ear-containing, Arf-binding) comprise a subfamily of ubiquitously expressed, monomeric, motif-binding cargo/clathrin adaptor proteins. The VHS domain has a superhelical structure similar to the structure of the ARM (Armadillo) repeats and is present at the N-termini of proteins. GGA proteins have a multidomain structure consisting of an N-terminal VHS domain linked by a short proline-rich linker to a GAT (GGA and TOM) domain, which is followed by a long flexible linker to the C-terminal appendage, GAE (gamma-adaptin ear) domain. The VHS domain of GGA proteins binds to the acidic-cluster dileucine (DxxLL) motif found on the cytoplasmic tails of cargo proteins trafficked between the trans-Golgi network and the endosomal system.
Probab=21.89 E-value=86 Score=23.97 Aligned_cols=47 Identities=26% Similarity=0.410 Sum_probs=29.6
Q ss_pred ccccccchhHhhhHHHHHHhhc------c----ccch--------hccCCcchhHHHHHHHHhcCC
Q psy7171 88 PFLVEDCKEKFWDRCLHIVNNM------S----SSML--------RINAEPGFFTEIFTELKACGT 135 (146)
Q Consensus 88 PFLve~~ke~FW~rcl~Ii~~~------S----~~ML--------riNadp~fFTevF~eLK~~~~ 135 (146)
+|-.+=.+..|.+..++++... + .++| ....+|. |.++|.+||++|-
T Consensus 72 ~fh~evas~~Fl~el~kl~~~k~~~~~~~~~Vk~kil~li~~W~~~f~~~p~-~~~~Y~~Lk~~G~ 136 (139)
T cd03567 72 RFHSEVGKFRFLNELIKLVSPKYLGSRTSEKVKTKIIELLYSWTLELPHEPK-IKEAYDMLKKQGI 136 (139)
T ss_pred HHHHHHHhHHHHHHHHHHhccccCCCCCCHHHHHHHHHHHHHHHHHhcccch-HHHHHHHHHHCCC
Confidence 4555556666666666666421 1 1122 2346788 7999999999884
No 30
>KOG0013|consensus
Probab=20.40 E-value=40 Score=29.09 Aligned_cols=20 Identities=25% Similarity=0.527 Sum_probs=14.9
Q ss_pred ccCcccccccchhHhhhHHH
Q psy7171 84 YPLKPFLVEDCKEKFWDRCL 103 (146)
Q Consensus 84 YPLKPFLve~~ke~FW~rcl 103 (146)
|||--=-++..|+.|||.|-
T Consensus 37 ~~lT~gqlrskRdEFWdTap 56 (231)
T KOG0013|consen 37 PPLTKGQLRSKRDEFWDTAP 56 (231)
T ss_pred CCcchhhhhhhhhhhhhccc
Confidence 45555556778999999994
Done!