Query 022186
Match_columns 301
No_of_seqs 51 out of 53
Neff 3.1
Searched_HMMs 46136
Date Fri Mar 29 08:41:00 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/022186.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/022186hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF15003 HAUS2: HAUS augmin-li 100.0 3.2E-91 7E-96 645.8 20.1 269 9-299 2-276 (277)
2 PF15003 HAUS2: HAUS augmin-li 81.7 0.26 5.7E-06 47.4 -1.5 32 143-174 167-199 (277)
3 KOG0639 Transducin-like enhanc 58.4 4.2 9.2E-05 43.0 0.8 80 64-152 30-110 (705)
4 TIGR01386 cztS_silS_copS heavy 53.2 1.9E+02 0.0041 26.8 10.9 80 95-174 217-305 (457)
5 PF08653 DASH_Dam1: DASH compl 48.9 57 0.0012 24.9 5.4 56 144-225 2-58 (58)
6 PF10168 Nup88: Nuclear pore c 37.2 93 0.002 33.6 6.7 69 98-166 584-658 (717)
7 PF07862 Nif11: Nitrogen fixat 35.8 38 0.00082 23.7 2.5 26 17-46 23-48 (49)
8 PF10392 COG5: Golgi transport 34.4 1.3E+02 0.0028 25.2 5.9 69 63-131 32-100 (132)
9 PF14735 HAUS4: HAUS augmin-li 33.6 45 0.00097 31.5 3.3 66 92-170 163-232 (238)
10 PF15219 TEX12: Testis-express 32.7 81 0.0018 26.7 4.3 40 85-127 53-92 (100)
11 PF13424 TPR_12: Tetratricopep 32.6 47 0.001 23.8 2.7 35 176-210 43-77 (78)
12 PF09033 DFF-C: DNA Fragmentat 32.5 15 0.00032 33.4 0.0 82 6-124 1-89 (164)
13 cd07618 BAR_Rich1 The Bin/Amph 31.6 1.4E+02 0.0031 28.3 6.3 134 23-162 79-232 (246)
14 KOG3973 Uncharacterized conser 31.0 3.4E+02 0.0073 28.2 9.0 91 55-150 162-270 (465)
15 PRK09039 hypothetical protein; 30.8 1.6E+02 0.0034 28.8 6.6 126 64-203 144-292 (343)
16 cd07595 BAR_RhoGAP_Rich-like T 30.7 41 0.00088 31.5 2.5 96 62-162 119-230 (244)
17 PF11932 DUF3450: Protein of u 30.1 2.4E+02 0.0052 25.8 7.4 82 67-164 38-119 (251)
18 PF09325 Vps5: Vps5 C terminal 25.9 3.2E+02 0.007 23.9 7.2 75 143-217 78-152 (236)
19 PF04822 Takusan: Takusan; In 25.8 1.4E+02 0.003 24.3 4.5 32 96-127 12-43 (84)
20 cd07623 BAR_SNX1_2 The Bin/Amp 25.2 3E+02 0.0065 25.0 7.0 109 103-215 26-138 (224)
21 PF14678 FANCI_S4: FANCI solen 24.7 3.5E+02 0.0075 25.6 7.6 96 103-204 41-137 (256)
22 PRK09835 sensor kinase CusS; P 24.4 6E+02 0.013 23.9 12.0 74 95-168 238-320 (482)
23 PF12018 DUF3508: Domain of un 24.1 5.6E+02 0.012 24.3 8.9 112 97-216 13-145 (281)
24 COG3418 Flagellar biosynthesis 23.8 4.2E+02 0.0092 23.9 7.5 87 43-132 24-121 (146)
25 PF01017 STAT_alpha: STAT prot 23.6 2.2E+02 0.0047 25.3 5.7 61 92-154 120-181 (182)
26 PF01544 CorA: CorA-like Mg2+ 23.4 5.2E+02 0.011 22.8 9.0 53 66-128 127-179 (292)
27 COG5296 Transcription factor i 22.4 56 0.0012 34.0 2.0 72 66-147 323-395 (521)
28 PF06013 WXG100: Proteins of 1 22.4 2.9E+02 0.0062 19.5 9.7 42 93-134 4-45 (86)
29 cd09238 V_Alix_like_1 Protein- 22.2 4E+02 0.0086 25.8 7.6 21 108-128 252-272 (339)
30 PF08182 Pedibin: Pedibin/Hym- 22.2 53 0.0011 23.2 1.2 32 69-108 2-33 (35)
31 PRK14011 prefoldin subunit alp 21.7 1.7E+02 0.0036 25.7 4.5 34 95-128 90-123 (144)
32 PF12308 Noelin-1: Neurogenesi 21.3 5E+02 0.011 22.2 7.0 80 33-125 18-97 (101)
33 PRK10803 tol-pal system protei 21.0 2.3E+02 0.0051 26.5 5.6 30 53-82 50-79 (263)
34 PRK10604 sensor protein RstB; 20.6 7.6E+02 0.017 23.7 10.0 55 62-124 163-217 (433)
35 smart00721 BAR BAR domain. 20.5 3.1E+02 0.0067 23.8 5.9 97 60-157 130-238 (239)
36 cd07598 BAR_FAM92 The Bin/Amph 20.4 99 0.0021 28.4 3.0 56 107-162 153-208 (211)
37 PF11461 RILP: Rab interacting 20.3 81 0.0017 24.4 2.0 17 111-127 4-20 (60)
38 PRK11085 magnesium/nickel/coba 20.2 8.1E+02 0.018 23.8 10.3 56 68-129 150-205 (316)
No 1
>PF15003 HAUS2: HAUS augmin-like complex subunit 2
Probab=100.00 E-value=3.2e-91 Score=645.79 Aligned_cols=269 Identities=36% Similarity=0.547 Sum_probs=258.1
Q ss_pred CCCCCCcchhhccchHHHHHHHHhhCCCCCCCCCHHHHHHHhhCCCCchH-----HHHHHHHHHHHHHHHhhcccceeee
Q 022186 9 STWVGKKPLRRIGGMSDALSIAADLGFSVAPPPSQEELQNLCANGEKGDD-----LIRVLRELTAVQRKIADLQVELQGR 83 (301)
Q Consensus 9 ~~w~~~~~~~~lG~~~~~Lsia~~Lgh~vas~~sqE~lq~~~~~~a~s~~-----l~s~LrQIT~lQ~eLdq~nLEIelL 83 (301)
|||+|++|++++|||.++++||+++||.. |+ +.+++.++||+ +|++|+|||++|++|||+|+|||++
T Consensus 2 npW~p~~~~~~~agl~l~~~vAsg~~~~~-------~l-~~s~~~~~~f~~~s~~l~s~L~QIt~iQaeI~q~nlEielL 73 (277)
T PF15003_consen 2 NPWDPASPAPRAAGLLLALCVASGLGTQE-------ML-DISQKEAPCFSEKSSDLFSRLRQITNIQAEIDQLNLEIELL 73 (277)
T ss_pred CCCCCCCcCCchHHHHHHHHHHhccCccc-------cc-CcccccchhhhhhhhHHHHHHHHHHHHHHHHHhhhHHHHHH
Confidence 69999999999999999999999998765 45 55667777777 9999999999999999999999999
Q ss_pred ccccccccccChhHHHHHHHHHHHHHHHHHHHhhchHHHHHHHhccccCCCcchhhhhhhHHHHHHHHHHhHHHHHHHhH
Q 022186 84 KDDKNVAHLTHVSEMQKKIETLSRITTILKDVIQNKDRIIARLQQPYSLDCIPVEAEYQKQFSELLMKAASDYGALTASV 163 (301)
Q Consensus 84 klDKeTADltH~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLqkP~~~enLPVEA~yHr~vVeLL~lavsfIe~Lee~L 163 (301)
++||+||||+|++||++||++||+||+||++||+||++||+|||||||++||||||+|||||||||++||+||++|+++|
T Consensus 74 kleKeTADltH~~~L~~K~~~Lq~m~shLe~VLk~K~~Lr~RLqkP~~qe~LPVEA~yHr~vVeLL~laa~fi~~Le~~L 153 (277)
T PF15003_consen 74 KLEKETADLTHPDYLAEKCEALQSMNSHLEAVLKEKDRLRQRLQKPYCQENLPVEAQYHRYVVELLELAASFIEKLEEHL 153 (277)
T ss_pred HhhcchHhhhCHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHhhhhhcCccchhhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhhhcccccCCCcccc-cccchhhhhhhhhHHHHHHHHHHHHHHHHHHhhhcCCCCcccCCCCCCCCCCCCCCCCccCC
Q 022186 164 ADFQWSQSFKEPPSIWG-MLRPIPVALASCTRFFEAMSAMRESFATLQHLRVGDSASSLPITPDNNSSQRVPGGSDCVTP 242 (301)
Q Consensus 164 e~I~W~snFk~~p~v~~-~Lr~Ip~aLasc~~~~ea~s~~r~~~a~l~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (301)
++|+|+++|+++++.|+ +|++||+.+|.|++|||++.+||++++.+|.+|+|.++...|.+| ++|+||+|
T Consensus 154 etIrwip~~~~~~~~m~~aL~ki~~lvae~E~l~e~ilkwRe~~ke~~~~~~~~~~~~~~~~~--------~~d~~~~t- 224 (277)
T PF15003_consen 154 ETIRWIPNFDENPSNMDKALAKIDALVAECEELAEQILKWREQQKEVSSYIPKMLAEENPLHK--------PHDSDCIT- 224 (277)
T ss_pred HHHhccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCc--------CCCCCccC-
Confidence 99999999999999999 999999999999999999999999999999999999999888888 59999999
Q ss_pred CCCCCCCCchhHHHHHHHHhhhhhhhhhhhhhhhccccCCCCCCCccccCCCccccC
Q 022186 243 PPWTNESSLDDLVIRNLRRQELGRQEAEDAISEVSDLSQSDGTNNRRLSWPLQVKKS 299 (301)
Q Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (301)
|||++|++||||+||++|||+++++++++++|++| ||++|||||||||||++
T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 276 (277)
T PF15003_consen 225 PPWRAESSFDDLAIRSRRRQNNDQQEEDDEESEDG-----DDNSQRRLSWPPSVKTS 276 (277)
T ss_pred CccccCCchhHHHHhhhHHHHHHhhccccCcCCCC-----cccccccccCCCccCCC
Confidence 89999999999999999999999998888888887 49999999999999986
No 2
>PF15003 HAUS2: HAUS augmin-like complex subunit 2
Probab=81.74 E-value=0.26 Score=47.44 Aligned_cols=32 Identities=16% Similarity=0.259 Sum_probs=18.6
Q ss_pred hHHHHHHHHHHhHHHHHHHhHhhh-hhcccccC
Q 022186 143 KQFSELLMKAASDYGALTASVADF-QWSQSFKE 174 (301)
Q Consensus 143 r~vVeLL~lavsfIe~Lee~Le~I-~W~snFk~ 174 (301)
.-+.+.|..+..-+...|+..+.| +|-..+++
T Consensus 167 ~~m~~aL~ki~~lvae~E~l~e~ilkwRe~~ke 199 (277)
T PF15003_consen 167 SNMDKALAKIDALVAECEELAEQILKWREQQKE 199 (277)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445666666666666666665554 46555544
No 3
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=58.39 E-value=4.2 Score=42.96 Aligned_cols=80 Identities=23% Similarity=0.273 Sum_probs=57.1
Q ss_pred HHHHHHHHHHhhcccceeeeccccccccccChhHHHHHHHHHHHHHHHHHHHhhchHHHHHHHhccccCCCcch-hhhhh
Q 022186 64 RELTAVQRKIADLQVELQGRKDDKNVAHLTHVSEMQKKIETLSRITTILKDVIQNKDRIIARLQQPYSLDCIPV-EAEYQ 142 (301)
Q Consensus 64 rQIT~lQ~eLdq~nLEIelLklDKeTADltH~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLqkP~~~enLPV-EA~yH 142 (301)
.+..-+|++.-++.+|+|++-.||. .|++-+=+--.|..-|.--+.|+.+|.+||+.=..| -+|. ..++|
T Consensus 30 dEfqflqaqyhslkleceKlA~EKt--------eMqRhYvmYyEmSygLniemhKq~EI~KRLn~i~aQ-l~PfLsqehQ 100 (705)
T KOG0639|consen 30 EEFQFLQAQYHSLKLECEKLASEKT--------EMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTICAQ-LIPFLSQEHQ 100 (705)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhh--------hhhhheeeeeeeccccchhhHHHHHHHHHHHHHHHH-HhhhhhHHHH
Confidence 3455688899999999999988875 344444444457777888899999999999876666 5664 45566
Q ss_pred hHHHHHHHHH
Q 022186 143 KQFSELLMKA 152 (301)
Q Consensus 143 r~vVeLL~la 152 (301)
.+|..-++.+
T Consensus 101 qqvlqAvEra 110 (705)
T KOG0639|consen 101 QQVLQAVERA 110 (705)
T ss_pred HHHHHHHHHH
Confidence 6665555544
No 4
>TIGR01386 cztS_silS_copS heavy metal sensor kinase. Members of this family contain a sensor histidine kinase domain (Pfam:PF00512) and a domain found in bacterial signal proteins (Pfam:PF00672). This group is separated phylogenetically from related proteins with similar architecture and contains a number of proteins associated with heavy metal resistance efflux systems for copper, silver, cadmium, and/or zinc.
Probab=53.25 E-value=1.9e+02 Score=26.78 Aligned_cols=80 Identities=14% Similarity=0.221 Sum_probs=52.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhchHHHH----HHHhccccCC-----CcchhhhhhhHHHHHHHHHHhHHHHHHHhHhh
Q 022186 95 VSEMQKKIETLSRITTILKDVIQNKDRII----ARLQQPYSLD-----CIPVEAEYQKQFSELLMKAASDYGALTASVAD 165 (301)
Q Consensus 95 ~~~L~kK~e~LQ~mnsHLeaVLkeK~~Lr----qRLqkP~~~e-----nLPVEA~yHr~vVeLL~lavsfIe~Lee~Le~ 165 (301)
.+.+.+=.+++..|...|+..+++.+... ..|..|+..- .+.-...-.+.+.+.+..+...+..|...++.
T Consensus 217 ~dEi~~l~~~~n~m~~~l~~~~~~~~~~~~~~~h~l~tpl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~ 296 (457)
T TIGR01386 217 PAELRELAQSFNAMLGRLEDAFQRLSQFSADLAHELRTPLTNLLGQTQVALSQPRTGEEYREVLESNLEELERLSRMVSD 296 (457)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCcHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 47889999999999999998888775443 4455555421 01111112344556777777778888888888
Q ss_pred hhhcccccC
Q 022186 166 FQWSQSFKE 174 (301)
Q Consensus 166 I~W~snFk~ 174 (301)
+-+......
T Consensus 297 ll~~~~~~~ 305 (457)
T TIGR01386 297 MLFLARADN 305 (457)
T ss_pred HHHHHHhhc
Confidence 776655443
No 5
>PF08653 DASH_Dam1: DASH complex subunit Dam1; InterPro: IPR013962 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules [].
Probab=48.89 E-value=57 Score=24.95 Aligned_cols=56 Identities=21% Similarity=0.365 Sum_probs=40.2
Q ss_pred HHHHHHHHHHhHHHHHHHhHhhhhhcccccCCCcccccccchhhhhhhhhHHHHHHHHHHHHHHH-HHHhhhcCCCCccc
Q 022186 144 QFSELLMKAASDYGALTASVADFQWSQSFKEPPSIWGMLRPIPVALASCTRFFEAMSAMRESFAT-LQHLRVGDSASSLP 222 (301)
Q Consensus 144 ~vVeLL~lavsfIe~Lee~Le~I~W~snFk~~p~v~~~Lr~Ip~aLasc~~~~ea~s~~r~~~a~-l~~~r~~~~~~~~~ 222 (301)
+...-|..+.+.++.|..++.. | +..-++++..-||||+ ||-|+.+..-+-.|
T Consensus 2 ~l~~~f~eL~D~~~~L~~n~~~----------------L----------~~ihesL~~FNESFasfLYGl~mna~cvdfp 55 (58)
T PF08653_consen 2 FLEPQFAELSDSMETLDKNMEQ----------------L----------NQIHESLSDFNESFASFLYGLNMNAWCVDFP 55 (58)
T ss_pred chHHHHHHHHHHHHHHHHHHHH----------------H----------HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Confidence 3456788889999999999665 3 4556777778888887 57787776655555
Q ss_pred CCC
Q 022186 223 ITP 225 (301)
Q Consensus 223 ~~~ 225 (301)
-.|
T Consensus 56 ~~P 58 (58)
T PF08653_consen 56 EAP 58 (58)
T ss_pred CCC
Confidence 544
No 6
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=37.23 E-value=93 Score=33.63 Aligned_cols=69 Identities=16% Similarity=0.375 Sum_probs=49.7
Q ss_pred HHHHHHHH----HHHHHHHHHHhhchHHHHHHHhcccc--CCCcchhhhhhhHHHHHHHHHHhHHHHHHHhHhhh
Q 022186 98 MQKKIETL----SRITTILKDVIQNKDRIIARLQQPYS--LDCIPVEAEYQKQFSELLMKAASDYGALTASVADF 166 (301)
Q Consensus 98 L~kK~e~L----Q~mnsHLeaVLkeK~~LrqRLqkP~~--~enLPVEA~yHr~vVeLL~lavsfIe~Lee~Le~I 166 (301)
+.++.+.| ..+..-++.+..+|+.|.+|+.+-+. ..++|+-..+-|.|-+=|..+..-+..|...++.+
T Consensus 584 l~e~~~~l~~~ae~LaeR~e~a~d~Qe~L~~R~~~vl~~l~~~~P~LS~AEr~~~~EL~~~~~~l~~l~~si~~l 658 (717)
T PF10168_consen 584 LQEERKSLRESAEKLAERYEEAKDKQEKLMKRVDRVLQLLNSQLPVLSEAEREFKKELERMKDQLQDLKASIEQL 658 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 66666666 35566788999999999999875432 56788777777888777777777777776665543
No 7
>PF07862 Nif11: Nitrogen fixation protein of unknown function; InterPro: IPR012903 This domain is found in the cyanobacteria, and the nitrogen-fixing proteobacterium Azotobacter vinelandii and may be involved in nitrogen fixation, but no role has been assigned [].
Probab=35.78 E-value=38 Score=23.71 Aligned_cols=26 Identities=23% Similarity=0.523 Sum_probs=21.2
Q ss_pred hhhccchHHHHHHHHhhCCCCCCCCCHHHH
Q 022186 17 LRRIGGMSDALSIAADLGFSVAPPPSQEEL 46 (301)
Q Consensus 17 ~~~lG~~~~~Lsia~~Lgh~vas~~sqE~l 46 (301)
++......+++.||.+.||.+ |.++|
T Consensus 23 l~~~~~~~e~~~lA~~~Gy~f----t~~el 48 (49)
T PF07862_consen 23 LKACQNPEEVVALAREAGYDF----TEEEL 48 (49)
T ss_pred HHhcCCHHHHHHHHHHcCCCC----CHHHh
Confidence 344567889999999999998 77776
No 8
>PF10392 COG5: Golgi transport complex subunit 5; InterPro: IPR019465 The conserved oligomeric Golgi (COG) complex is a peripheral membrane complex involved in intra-Golgi protein trafficking. Subunit 5 is located in the smaller, B lobe, together with subunits 6-8, and has been shown to bind subunits 1 and 7 [].
Probab=34.36 E-value=1.3e+02 Score=25.22 Aligned_cols=69 Identities=13% Similarity=0.286 Sum_probs=46.7
Q ss_pred HHHHHHHHHHHhhcccceeeeccccccccccChhHHHHHHHHHHHHHHHHHHHhhchHHHHHHHhcccc
Q 022186 63 LRELTAVQRKIADLQVELQGRKDDKNVAHLTHVSEMQKKIETLSRITTILKDVIQNKDRIIARLQQPYS 131 (301)
Q Consensus 63 LrQIT~lQ~eLdq~nLEIelLklDKeTADltH~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLqkP~~ 131 (301)
-..+..++.-|..++-+|+..-.+.-..=+.|..-+.+--..++.|..+++.+-..=++|+..+..||-
T Consensus 32 ~~~l~kL~~~i~eld~~i~~~v~~~~~~LL~q~~~~~~~~~~l~~v~~~v~~L~~s~~RL~~eV~~Py~ 100 (132)
T PF10392_consen 32 STPLKKLNFDIQELDKRIRSQVTSNHEDLLSQASSIEELESVLQAVRSSVESLQSSYERLRSEVIEPYE 100 (132)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Confidence 344455555555555555544444333334555555555668889999999999999999999999984
No 9
>PF14735 HAUS4: HAUS augmin-like complex subunit 4
Probab=33.61 E-value=45 Score=31.47 Aligned_cols=66 Identities=15% Similarity=0.365 Sum_probs=40.7
Q ss_pred ccChhHHHHHHHHHHHHHHHHHHHhhchH----HHHHHHhccccCCCcchhhhhhhHHHHHHHHHHhHHHHHHHhHhhhh
Q 022186 92 LTHVSEMQKKIETLSRITTILKDVIQNKD----RIIARLQQPYSLDCIPVEAEYQKQFSELLMKAASDYGALTASVADFQ 167 (301)
Q Consensus 92 ltH~~~L~kK~e~LQ~mnsHLeaVLkeK~----~LrqRLqkP~~~enLPVEA~yHr~vVeLL~lavsfIe~Lee~Le~I~ 167 (301)
|.+-.|=.+++.+|..+-+||++.++..+ ..+++|. .| +++. . =|..+|.=|..|...|++.+
T Consensus 163 iL~~TYTpe~v~Al~~Ir~~L~~~~~~~e~~~~~a~~~L~-~Y--e~lg--~--------~F~~ivreY~~l~~~ie~k~ 229 (238)
T PF14735_consen 163 ILSDTYTPETVPALRKIRDHLEEAIEELEQELQKARQRLE-SY--EGLG--P--------EFEEIVREYTDLQQEIENKR 229 (238)
T ss_pred HHHccCCHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH--hccc--H--------hHHHHHHHHHHHHHHHHHHH
Confidence 33344444455556678888888876543 3444432 11 1111 1 17788888999999999999
Q ss_pred hcc
Q 022186 168 WSQ 170 (301)
Q Consensus 168 W~s 170 (301)
|.-
T Consensus 230 Wal 232 (238)
T PF14735_consen 230 WAL 232 (238)
T ss_pred HHH
Confidence 963
No 10
>PF15219 TEX12: Testis-expressed 12
Probab=32.74 E-value=81 Score=26.72 Aligned_cols=40 Identities=10% Similarity=0.198 Sum_probs=25.4
Q ss_pred cccccccccChhHHHHHHHHHHHHHHHHHHHhhchHHHHHHHh
Q 022186 85 DDKNVAHLTHVSEMQKKIETLSRITTILKDVIQNKDRIIARLQ 127 (301)
Q Consensus 85 lDKeTADltH~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLq 127 (301)
-|....|...+..+ =..++..++.=..++++|..|||||+
T Consensus 53 SEraavd~syi~ei---D~lfkEA~~lEnfLkqkre~LrQrlt 92 (100)
T PF15219_consen 53 SERAAVDASYITEI---DGLFKEANALENFLKQKRECLRQRLT 92 (100)
T ss_pred hHHHHhhhhhhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555665554444 44555555555566788889999986
No 11
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=32.59 E-value=47 Score=23.77 Aligned_cols=35 Identities=11% Similarity=0.173 Sum_probs=28.4
Q ss_pred CcccccccchhhhhhhhhHHHHHHHHHHHHHHHHH
Q 022186 176 PSIWGMLRPIPVALASCTRFFEAMSAMRESFATLQ 210 (301)
Q Consensus 176 p~v~~~Lr~Ip~aLasc~~~~ea~s~~r~~~a~l~ 210 (301)
|.+..++.+|-.+....+.|.+|+.-+++++.-.+
T Consensus 43 ~~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~i~~ 77 (78)
T PF13424_consen 43 PDTANTLNNLGECYYRLGDYEEALEYYQKALDIFE 77 (78)
T ss_dssp HHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhhc
Confidence 44455889999999999999999999998876543
No 12
>PF09033 DFF-C: DNA Fragmentation factor 45kDa, C terminal domain; InterPro: IPR015121 The C-terminal domain of DNA fragmentation factor 45 kDa (DFF-C) consists of four alpha-helices, which are folded in a helix-packing arrangement, with alpha-2 and alpha-3 packing against a long C-terminal helix (alpha-4). The main function of this domain is the inhibition of DFF40 by binding to its C-terminal catalytic domain through ionic interactions, thereby inhibiting the fragmentation of DNA in the apoptotic process. In addition to blocking the DNase activity of DFF40, the C-terminal region of DFF45 is also important for the DFF40-specific folding chaperone activity, as demonstrated by the ability of DFF45 to refold DFF40 []. ; PDB: 1KOY_A 1IYR_A.
Probab=32.51 E-value=15 Score=33.37 Aligned_cols=82 Identities=20% Similarity=0.330 Sum_probs=0.0
Q ss_pred CCCCCCCCCcchhh-----ccchHHHHHHHHhhCCCCCCC--CCHHHHHHHhhCCCCchHHHHHHHHHHHHHHHHhhccc
Q 022186 6 DTGSTWVGKKPLRR-----IGGMSDALSIAADLGFSVAPP--PSQEELQNLCANGEKGDDLIRVLRELTAVQRKIADLQV 78 (301)
Q Consensus 6 ~~~~~w~~~~~~~~-----lG~~~~~Lsia~~Lgh~vas~--~sqE~lq~~~~~~a~s~~l~s~LrQIT~lQ~eLdq~nL 78 (301)
|.|..|....-+-. -||...-..+|.+|.--+++- .++|+||-+
T Consensus 1 DGGTAWl~~eS~e~D~~ds~~g~~~WknlArQLK~DLssIILmSEeDLQ~L----------------------------- 51 (164)
T PF09033_consen 1 DGGTAWLSQESFEVDETDSGAGADKWKNLARQLKEDLSSIILMSEEDLQVL----------------------------- 51 (164)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred CCccchhcccccccccccccccchhHHHHHHHHHhhhHHHhccCHHHHHHH-----------------------------
Confidence 44567876655422 245556666777776666555 577776221
Q ss_pred ceeeeccccccccccChhHHHHHHHHHHHHHHHHHHHhhchHHHHH
Q 022186 79 ELQGRKDDKNVAHLTHVSEMQKKIETLSRITTILKDVIQNKDRIIA 124 (301)
Q Consensus 79 EIelLklDKeTADltH~~~L~kK~e~LQ~mnsHLeaVLkeK~~Lrq 124 (301)
-|-...|+. ..|.+.|...|.+.+.|+.||..|++-|+
T Consensus 52 ------iDvpcsdLA--~el~qs~~k~q~LQ~TLQqVLDrREE~RQ 89 (164)
T PF09033_consen 52 ------IDVPCSDLA--QELGQSCAKVQGLQNTLQQVLDRREEERQ 89 (164)
T ss_dssp ----------------------------------------------
T ss_pred ------hCCChHHHH--HHHcccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 122233333 24777888888888999999999988775
No 13
>cd07618 BAR_Rich1 The Bin/Amphiphysin/Rvs (BAR) domain of RhoGAP interacting with CIP4 homologs protein 1. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. RhoGAP interacting with CIP4 homologs protein 1 (Rich1) is also called Neuron-associated developmentally-regulated protein (Nadrin) or Rho GTPase activating protein 17 (ARHGAP17). It is a Cdc42- and Rac-specific GAP that binds to polarity proteins through the scaffold protein angiomotin and plays a role in maintaining the integrity of tight junctions. It may be a component of a sorting mechanism in the recycling of tight junction transmembrane proteins. Rich1 contains an N-terminal BAR domain followed by a Rho GAP domain and a C-terminal proline-rich domain. It interacts with the BAR domain proteins endophilin and amphiphysin through its proline-rich region. The BAR domain of Rich1 forms oligomers and can bind membranes and induce membrane tubulation.
Probab=31.60 E-value=1.4e+02 Score=28.30 Aligned_cols=134 Identities=13% Similarity=0.156 Sum_probs=73.1
Q ss_pred hHHHHHHHHhhCCCCCCC-CCHHHHHHHhhCCCCchHHHHHHHHHHHHHHHHhhcccceeeecccccccc----------
Q 022186 23 MSDALSIAADLGFSVAPP-PSQEELQNLCANGEKGDDLIRVLRELTAVQRKIADLQVELQGRKDDKNVAH---------- 91 (301)
Q Consensus 23 ~~~~Lsia~~Lgh~vas~-~sqE~lq~~~~~~a~s~~l~s~LrQIT~lQ~eLdq~nLEIelLklDKeTAD---------- 91 (301)
+..+|-.+.+..+-++.. +.+|+.....-..+-.--+=.-|+.|+..++.+.+..+++...|-....|.
T Consensus 79 ~g~aL~~~gea~~kla~~~~~~d~~ie~~fl~PL~~~le~dlk~I~K~RkkLe~~RLD~D~~K~r~~~a~~~~~~~~~~~ 158 (246)
T cd07618 79 IGKMLDTCGDAENKLAFELSQHEVLLEKDILDPLNQLAEVEIPNIQKQRKQLAKLVLDWDSARGRYNQAHKSSGTNFQAM 158 (246)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHhHHhhHHHHHHHHHhccccCccccccc
Confidence 346777777666666544 566653111000000000112356788888888888888876655443221
Q ss_pred ccC-------hhHHHHHHHHHH-HH-HHHHHHHhhchHHHHHHHhccccCCCcchhhhhhhHHHHHHHHHHhHHHHHHHh
Q 022186 92 LTH-------VSEMQKKIETLS-RI-TTILKDVIQNKDRIIARLQQPYSLDCIPVEAEYQKQFSELLMKAASDYGALTAS 162 (301)
Q Consensus 92 ltH-------~~~L~kK~e~LQ-~m-nsHLeaVLkeK~~LrqRLqkP~~~enLPVEA~yHr~vVeLL~lavsfIe~Lee~ 162 (301)
-.. ......|++.-. .+ +..+..+.++-+ .+.-| .+.+-..+.|||...++|..+..-|..+.+.
T Consensus 159 ~~K~~~l~ee~e~a~~k~E~~kD~~~~dm~~~l~~e~e-~~~~l-----~~lv~aQ~eYHr~a~e~Le~~~p~i~~~~~~ 232 (246)
T cd07618 159 PSKIDMLKEEMDEAGNKVEQCKDQLAADMYNFASKEGE-YAKFF-----VLLLEAQADYHRKALAVIEKVLPEIQAHQDK 232 (246)
T ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHcCHH-HHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 111 123444554433 11 223333333333 33322 2456678899999999999999998888754
No 14
>KOG3973 consensus Uncharacterized conserved glycine-rich protein [Function unknown]
Probab=30.98 E-value=3.4e+02 Score=28.17 Aligned_cols=91 Identities=19% Similarity=0.203 Sum_probs=59.3
Q ss_pred CchHHHHHHHHHHHHHHHHhhcccceeeeccccccccccChhHHHHHHHHHH--------HHHHHHHHHhh---------
Q 022186 55 KGDDLIRVLRELTAVQRKIADLQVELQGRKDDKNVAHLTHVSEMQKKIETLS--------RITTILKDVIQ--------- 117 (301)
Q Consensus 55 ~s~~l~s~LrQIT~lQ~eLdq~nLEIelLklDKeTADltH~~~L~kK~e~LQ--------~mnsHLeaVLk--------- 117 (301)
.-+.+|+.+++- +...|.+++-.....-+=|.+-|=.|--.+++-|+++. .|++-|+.-++
T Consensus 162 n~~~lfe~i~~k--l~~ai~kv~p~~~~~PLlKkpl~~a~w~~iE~~~~~~~~ey~~Rr~ll~sRL~vTVqSF~Wsdr~k 239 (465)
T KOG3973|consen 162 NEWKLFETIRQK--LDGAIKKVSPSQRSHPLLKKPLDEATWPEIEKQCESFSREYYNRRLLLNSRLKVTVQSFLWSDRLK 239 (465)
T ss_pred hHHHHHHHHHHH--HHhHHhcCCHhhcCCchhcCcCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccHHHH
Confidence 346678888775 66778888777665566667778888888998888864 67777765443
Q ss_pred -chHHHHHHHhccccCCCcchhhhhhhHHHHHHH
Q 022186 118 -NKDRIIARLQQPYSLDCIPVEAEYQKQFSELLM 150 (301)
Q Consensus 118 -eK~~LrqRLqkP~~~enLPVEA~yHr~vVeLL~ 150 (301)
.+.+|..+|-+|+ .+-+.|.| .-+|+|||-
T Consensus 240 ~~~~ei~~~~~~~~-rei~~~K~--~~dvahLLa 270 (465)
T KOG3973|consen 240 MHREEIQSILSARV-REIGRVKA--NSDVAHLLA 270 (465)
T ss_pred HHHHHHHHHHHHHH-HHhccccc--hhHHHHHHH
Confidence 2335666666554 23444444 556666653
No 15
>PRK09039 hypothetical protein; Validated
Probab=30.79 E-value=1.6e+02 Score=28.82 Aligned_cols=126 Identities=21% Similarity=0.298 Sum_probs=64.5
Q ss_pred HHHHHHHHHHhhcccceeeeccccccccccChhHHHHHHHHHH-HHHHHHH----HHhhchHHHHHHHh-ccccCCCcch
Q 022186 64 RELTAVQRKIADLQVELQGRKDDKNVAHLTHVSEMQKKIETLS-RITTILK----DVIQNKDRIIARLQ-QPYSLDCIPV 137 (301)
Q Consensus 64 rQIT~lQ~eLdq~nLEIelLklDKeTADltH~~~L~kK~e~LQ-~mnsHLe----aVLkeK~~LrqRLq-kP~~~enLPV 137 (301)
+||..++++|..++-+|...+... .....|++.|+ .++..|. .+-+=|..+..||. .--+.+.+.|
T Consensus 144 ~qI~aLr~Qla~le~~L~~ae~~~--------~~~~~~i~~L~~~L~~a~~~~~~~l~~~~~~~~~~l~~~~~~~~~iri 215 (343)
T PRK09039 144 QQIAALRRQLAALEAALDASEKRD--------RESQAKIADLGRRLNVALAQRVQELNRYRSEFFGRLREILGDREGIRI 215 (343)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhCCCCCcEE
Confidence 467777777777777776655443 22334444443 2333321 22334667777776 3334455555
Q ss_pred h----------------hhhhhHHHHHHHHHHhHHHHHHHhH-hhhhhcccccCCCcccccccchhhhhhhhhHHHHHHH
Q 022186 138 E----------------AEYQKQFSELLMKAASDYGALTASV-ADFQWSQSFKEPPSIWGMLRPIPVALASCTRFFEAMS 200 (301)
Q Consensus 138 E----------------A~yHr~vVeLL~lavsfIe~Lee~L-e~I~W~snFk~~p~v~~~Lr~Ip~aLasc~~~~ea~s 200 (301)
+ +..-......|..++..|..+...+ ..+.|.-. |.|-=++.|..-..-..|+..+|
T Consensus 216 ~g~~~~~~~~vlF~~gsa~L~~~~~~~L~~ia~~l~~~~~~~p~~i~~~I~------I~GHTD~~p~~~~g~~~~N~~LS 289 (343)
T PRK09039 216 VGDRFVFQSEVLFPTGSAELNPEGQAEIAKLAAALIELAKEIPPEINWVLR------VDGHTDNVPLSGTGRFRDNWELS 289 (343)
T ss_pred ECCEEEecCCceeCCCCcccCHHHHHHHHHHHHHHHHhhhccCCcCCeeEE------EEEecCCCCccCCCCcccHHHHH
Confidence 4 4445566666777777776654331 12222211 22333333332211235677888
Q ss_pred HHH
Q 022186 201 AMR 203 (301)
Q Consensus 201 ~~r 203 (301)
+.|
T Consensus 290 ~~R 292 (343)
T PRK09039 290 SAR 292 (343)
T ss_pred HHH
Confidence 877
No 16
>cd07595 BAR_RhoGAP_Rich-like The Bin/Amphiphysin/Rvs (BAR) domain of Rich-like Rho GTPase Activating Proteins. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. This subfamily is composed of Rho and Rac GTPase activating proteins (GAPs) with similarity to GAP interacting with CIP4 homologs proteins (Rich). Members contain an N-terminal BAR domain, followed by a Rho GAP domain, and a C-terminal prolin-rich region. Vertebrates harbor at least three Rho GAPs in this subfamily including Rich1, Rich2, and SH3-domain binding protein 1 (SH3BP1). Rich1 and Rich2 play complementary roles in the establishment and maintenance of cell polarity. Rich1 is a Cdc42- and Rac-specific GAP that binds to polarity proteins through the scaffold protein angiomotin and plays a role in maintaining the integrity of tight junctions. Rich2 is a Rac GAP that interacts with CD317 and plays a role in actin cytoskeleton organization and
Probab=30.67 E-value=41 Score=31.50 Aligned_cols=96 Identities=17% Similarity=0.303 Sum_probs=53.0
Q ss_pred HHHHHHHHHHHHhhcccceeeecccccccc--------ccChh----H---HHHHHHH-HHHHHHHHHHHhhchHHHHHH
Q 022186 62 VLRELTAVQRKIADLQVELQGRKDDKNVAH--------LTHVS----E---MQKKIET-LSRITTILKDVIQNKDRIIAR 125 (301)
Q Consensus 62 ~LrQIT~lQ~eLdq~nLEIelLklDKeTAD--------ltH~~----~---L~kK~e~-LQ~mnsHLeaVLkeK~~LrqR 125 (301)
-++.|...++.+.+..+.+...+-.-.-|- -.+.. . .+.|++. -....+-+..+|.+=...+.-
T Consensus 119 dik~i~k~RKkLe~~RLd~D~~k~r~~ka~k~~~~~~~~~K~~~l~eE~e~ae~k~e~~~e~~~~~M~~~l~~E~e~~~~ 198 (244)
T cd07595 119 EIPNIQKQKKRLSKLVLDMDSARSRYNAAHKSSGGQGAAAKVDALKDEYEEAELKLEQCRDALATDMYEFLAKEAEIASY 198 (244)
T ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHhccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHH
Confidence 467778888888888877765544332220 11111 1 1222211 112333333444442333333
Q ss_pred HhccccCCCcchhhhhhhHHHHHHHHHHhHHHHHHHh
Q 022186 126 LQQPYSLDCIPVEAEYQKQFSELLMKAASDYGALTAS 162 (301)
Q Consensus 126 LqkP~~~enLPVEA~yHr~vVeLL~lavsfIe~Lee~ 162 (301)
|. +.+-.++.||+...++|..+...|+.+-..
T Consensus 199 l~-----~lv~aQl~YH~~a~e~L~~l~~~l~~~~~~ 230 (244)
T cd07595 199 LI-----DLIEAQREYHRTALSVLEAVLPELQEQIEQ 230 (244)
T ss_pred HH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 32 356678899999999999998888766544
No 17
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=30.10 E-value=2.4e+02 Score=25.85 Aligned_cols=82 Identities=18% Similarity=0.227 Sum_probs=58.4
Q ss_pred HHHHHHHhhcccceeeeccccccccccChhHHHHHHHHHHHHHHHHHHHhhchHHHHHHHhccccCCCcchhhhhhhHHH
Q 022186 67 TAVQRKIADLQVELQGRKDDKNVAHLTHVSEMQKKIETLSRITTILKDVIQNKDRIIARLQQPYSLDCIPVEAEYQKQFS 146 (301)
Q Consensus 67 T~lQ~eLdq~nLEIelLklDKeTADltH~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLqkP~~~enLPVEA~yHr~vV 146 (301)
...|+.+++...|-+.++.+-+ -+.+..+.|+.-+.|++..+..++.-+.+|.+=+ ......--
T Consensus 38 ~~sQ~~id~~~~e~~~L~~e~~--------~l~~e~e~L~~~~~~l~~~v~~q~~el~~L~~qi--------~~~~~~~~ 101 (251)
T PF11932_consen 38 QQSQKRIDQWDDEKQELLAEYR--------QLEREIENLEVYNEQLERQVASQEQELASLEQQI--------EQIEETRQ 101 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHH
Confidence 3455666666666665555433 2677888899999999999999999998887643 23344455
Q ss_pred HHHHHHHhHHHHHHHhHh
Q 022186 147 ELLMKAASDYGALTASVA 164 (301)
Q Consensus 147 eLL~lavsfIe~Lee~Le 164 (301)
++.+++...++.|++.++
T Consensus 102 ~l~p~m~~m~~~L~~~v~ 119 (251)
T PF11932_consen 102 ELVPLMEQMIDELEQFVE 119 (251)
T ss_pred HHHHHHHHHHHHHHHHHh
Confidence 777888888888888765
No 18
>PF09325 Vps5: Vps5 C terminal like; InterPro: IPR015404 Vps5 is a sorting nexin that functions in membrane trafficking. This is the C-terminal dimerisation domain [].
Probab=25.88 E-value=3.2e+02 Score=23.87 Aligned_cols=75 Identities=15% Similarity=0.093 Sum_probs=42.2
Q ss_pred hHHHHHHHHHHhHHHHHHHhHhhhhhcccccCCCcccccccchhhhhhhhhHHHHHHHHHHHHHHHHHHhhhcCC
Q 022186 143 KQFSELLMKAASDYGALTASVADFQWSQSFKEPPSIWGMLRPIPVALASCTRFFEAMSAMRESFATLQHLRVGDS 217 (301)
Q Consensus 143 r~vVeLL~lavsfIe~Lee~Le~I~W~snFk~~p~v~~~Lr~Ip~aLasc~~~~ea~s~~r~~~a~l~~~r~~~~ 217 (301)
..+...|..++..++++.+.++...=.......+.+-+.++-+.++=+.+.+=..++..+..+.+.|.+.|-..-
T Consensus 78 ~~l~~~l~~l~~~~~~~~~~~~~~a~~~~~~l~~~L~ey~~~~~svk~~l~~R~~~~~~~~~a~~~l~kkk~~~~ 152 (236)
T PF09325_consen 78 KSLSEALSQLAEAFEKISELLEEQANQEEETLGEPLREYLRYIESVKEALNRRDKKLIEYQNAEKELQKKKAQLE 152 (236)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 446666777777666666655443221122222222225555555555566666667777777777877776554
No 19
>PF04822 Takusan: Takusan; InterPro: IPR006907 This family includes several uncharacterised muridae (mouse and rat) proteins.
Probab=25.78 E-value=1.4e+02 Score=24.31 Aligned_cols=32 Identities=22% Similarity=0.400 Sum_probs=28.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhchHHHHHHHh
Q 022186 96 SEMQKKIETLSRITTILKDVIQNKDRIIARLQ 127 (301)
Q Consensus 96 ~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLq 127 (301)
+--+.|.+-|..+...|+-|-+++++||.||.
T Consensus 12 s~~e~~~k~lE~L~~eL~~it~ERnELr~~L~ 43 (84)
T PF04822_consen 12 SKKEKKMKELERLKFELQKITKERNELRDILA 43 (84)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34566678899999999999999999999998
No 20
>cd07623 BAR_SNX1_2 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexins 1 and 2. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. This subfamily consists of SNX1, SNX2, and similar proteins. SNX1 and SNX2 are components of the retromer complex, a membrane coat multimeric complex required for endosomal retrieval of lysosomal hydrolase receptors to the Golgi. The retromer consists of a cargo-recognition subcomplex and a subcomplex formed by a dimer of sorting nexins (SNX1 and/or SNX2), wh
Probab=25.16 E-value=3e+02 Score=24.99 Aligned_cols=109 Identities=9% Similarity=0.094 Sum_probs=45.1
Q ss_pred HHHHHHHHHHHHHhhchHHHHHHHhccccCCCcc-hhhhhhhHHHHHHHHHHhHHHHHHHhHhhhhhcccccCCCccc--
Q 022186 103 ETLSRITTILKDVIQNKDRIIARLQQPYSLDCIP-VEAEYQKQFSELLMKAASDYGALTASVADFQWSQSFKEPPSIW-- 179 (301)
Q Consensus 103 e~LQ~mnsHLeaVLkeK~~LrqRLqkP~~~enLP-VEA~yHr~vVeLL~lavsfIe~Lee~Le~I~W~snFk~~p~v~-- 179 (301)
..|..+...++.+++.++.|..=+-.= .....- -..+-+..+...|..+++..+++....+.....-.+....-+=
T Consensus 26 ~~Lk~l~~~~e~lv~~r~ela~~~~~f-~~s~~~L~~~E~~~~Ls~al~~la~~~~ki~~~~~~qa~~d~~~l~e~L~eY 104 (224)
T cd07623 26 QQLRKLHASVESLVNHRKELALNTGSF-AKSAAMLSNCEEHTSLSRALSQLAEVEEKIEQLHGEQADTDFYILAELLKDY 104 (224)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455555556666666554322110 000000 0011133344444445544445444444433333333322222
Q ss_pred -ccccchhhhhhhhhHHHHHHHHHHHHHHHHHHhhhc
Q 022186 180 -GMLRPIPVALASCTRFFEAMSAMRESFATLQHLRVG 215 (301)
Q Consensus 180 -~~Lr~Ip~aLasc~~~~ea~s~~r~~~a~l~~~r~~ 215 (301)
+++..++.++ .+--.|...|+.+...|.+.|..
T Consensus 105 ~r~i~svk~~f---~~R~~a~~~~q~a~~~l~kkr~~ 138 (224)
T cd07623 105 IGLIGAIKDVF---HERVKVWQNWQNAQQTLTKKREA 138 (224)
T ss_pred HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH
Confidence 2233333333 33344555566666666666654
No 21
>PF14678 FANCI_S4: FANCI solenoid 4; PDB: 3S51_A 3S4Z_A 3S4W_A.
Probab=24.69 E-value=3.5e+02 Score=25.64 Aligned_cols=96 Identities=16% Similarity=0.175 Sum_probs=45.7
Q ss_pred HHHHHHHHHHHHHhhchHHHHHHHhccccCCCcchhhhhhhHHHHHHHHHHhHHHHHHHhHhhhhhcccccCCCc-cccc
Q 022186 103 ETLSRITTILKDVIQNKDRIIARLQQPYSLDCIPVEAEYQKQFSELLMKAASDYGALTASVADFQWSQSFKEPPS-IWGM 181 (301)
Q Consensus 103 e~LQ~mnsHLeaVLkeK~~LrqRLqkP~~~enLPVEA~yHr~vVeLL~lavsfIe~Lee~Le~I~W~snFk~~p~-v~~~ 181 (301)
..+..+.+|++.+|.+=+-++.||-.-....+.+.+..--...-.+...=-+.+..|..-+..+.=--+...||| ..+
T Consensus 41 sv~~~l~~~~~~~L~dvdwli~klk~~~~~~~~~~~~~~~~~~~~~~~~E~~lc~qL~~l~~~l~~L~~~~lp~G~~~d- 119 (256)
T PF14678_consen 41 SVLNLLLSHVESVLDDVDWLISKLKSLLNSDKLSSESDSDEWRGNLKSLEESLCSQLGHLVTVLSELVQTALPPGSCSD- 119 (256)
T ss_dssp THHHHHHHHHHHHHHHHHHHHHHHHTT------------------HHHHHHHHHHHHHHHHHHHHHHHHS---TTTHHH-
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcccchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHhhCCCcchHH-
Confidence 568889999999999999999999877777766622221111111111111222222222222222234566663 333
Q ss_pred ccchhhhhhhhhHHHHHHHHHHH
Q 022186 182 LRPIPVALASCTRFFEAMSAMRE 204 (301)
Q Consensus 182 Lr~Ip~aLasc~~~~ea~s~~r~ 204 (301)
..|-.|+++|-+++++-.
T Consensus 120 -----~lLK~l~klY~~Lt~l~K 137 (256)
T PF14678_consen 120 -----KLLKLLTKLYTLLTNLVK 137 (256)
T ss_dssp -----HHHHHHHHHHHHHHHHHH
T ss_pred -----HHHHHHHHHHHHHHHHHH
Confidence 234446999999988873
No 22
>PRK09835 sensor kinase CusS; Provisional
Probab=24.37 E-value=6e+02 Score=23.92 Aligned_cols=74 Identities=15% Similarity=0.294 Sum_probs=43.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhchHHHHHH----HhccccCCCcchh-----hhhhhHHHHHHHHHHhHHHHHHHhHhh
Q 022186 95 VSEMQKKIETLSRITTILKDVIQNKDRIIAR----LQQPYSLDCIPVE-----AEYQKQFSELLMKAASDYGALTASVAD 165 (301)
Q Consensus 95 ~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqR----LqkP~~~enLPVE-----A~yHr~vVeLL~lavsfIe~Lee~Le~ 165 (301)
.+.+.+-.+++..|...|+..+.+++++.+. |..|+..=....+ ..-+....+.+..+..-+..+...+++
T Consensus 238 ~dEl~~l~~~~n~m~~~l~~~~~~~~~~~~~laheL~tpl~~i~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~i~~ 317 (482)
T PRK09835 238 PIELEQLVLSFNHMIERIEDVFTRQSNFSADIAHEIRTPITNLITQTEIALSQSRSQKELEDVLYSNLEELTRMAKMVSD 317 (482)
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5689999999999999999999987766553 5556543111111 111223334444444455555555555
Q ss_pred hhh
Q 022186 166 FQW 168 (301)
Q Consensus 166 I~W 168 (301)
+..
T Consensus 318 ll~ 320 (482)
T PRK09835 318 MLF 320 (482)
T ss_pred HHH
Confidence 543
No 23
>PF12018 DUF3508: Domain of unknown function (DUF3508); InterPro: IPR021897 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 280 amino acids in length. This domain has two conserved sequence motifs: GFC and GLL. This family is also known as UPF0704.
Probab=24.05 E-value=5.6e+02 Score=24.26 Aligned_cols=112 Identities=13% Similarity=0.171 Sum_probs=65.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhchHHHHHHHhccccCCCcchh-----hhhhhHHHHHHHHHHhHHHHHHHhH--------
Q 022186 97 EMQKKIETLSRITTILKDVIQNKDRIIARLQQPYSLDCIPVE-----AEYQKQFSELLMKAASDYGALTASV-------- 163 (301)
Q Consensus 97 ~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLqkP~~~enLPVE-----A~yHr~vVeLL~lavsfIe~Lee~L-------- 163 (301)
.+....+..+.++.+.++||.+... .|-....++++ -.+.|++.-.|..+.+++....+++
T Consensus 13 ~i~~eL~~~~~l~~~yta~l~~~~~------~~~~~~~~~~~~lke~L~n~RQ~e~fLr~ll~dl~~~~~~V~~l~~~~~ 86 (281)
T PF12018_consen 13 HIDTELEEAQELCYRYTAVLEKQSQ------SPQMESELPPELLKEELYNRRQYEIFLRILLSDLITCAQRVEELIKRFE 86 (281)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc------ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3555666677777777777776654 44444444422 2344555555555555544444433
Q ss_pred ---hhhhhcccccCCCcccc-cccc----hhhhhhhhhHHHHHHHHHHHHHHHHHHhhhcC
Q 022186 164 ---ADFQWSQSFKEPPSIWG-MLRP----IPVALASCTRFFEAMSAMRESFATLQHLRVGD 216 (301)
Q Consensus 164 ---e~I~W~snFk~~p~v~~-~Lr~----Ip~aLasc~~~~ea~s~~r~~~a~l~~~r~~~ 216 (301)
+.++=.- +.-.+|.- .+=| +-.+-.+++.++..++.+..-+.+|+.+....
T Consensus 87 ~~l~~L~~tv--~~rtAVPt~~VyP~Fi~Ls~~W~~lqde~~ll~~l~~l~~~L~~~~~~~ 145 (281)
T PF12018_consen 87 AQLEKLKETV--KSRTAVPTAQVYPLFIALSQLWSGLQDELNLLSVLNNLLENLQPFSKSF 145 (281)
T ss_pred HHHHHHHHHH--hcccccchhhcCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 3333222 22233433 3334 34444578999999999999999999987743
No 24
>COG3418 Flagellar biosynthesis/type III secretory pathway chaperone [Cell motility and secretion / Intracellular trafficking and secretion / Posttranslational modification, protein turnover, chaperones]
Probab=23.82 E-value=4.2e+02 Score=23.91 Aligned_cols=87 Identities=13% Similarity=0.077 Sum_probs=60.6
Q ss_pred HHHHHHHhhCCCCchHHHHHHHHHHHHHHHHhhcccceeeeccccccccccC-----------hhHHHHHHHHHHHHHHH
Q 022186 43 QEELQNLCANGEKGDDLIRVLRELTAVQRKIADLQVELQGRKDDKNVAHLTH-----------VSEMQKKIETLSRITTI 111 (301)
Q Consensus 43 qE~lq~~~~~~a~s~~l~s~LrQIT~lQ~eLdq~nLEIelLklDKeTADltH-----------~~~L~kK~e~LQ~mnsH 111 (301)
.+|-|.++...-++.++-+.++|-..+=+.|+.+.-..-++ ..+|-++- ...+.++|+-|+.+|-|
T Consensus 24 dqE~q~L~~~~~~~~~lq~i~~qK~sLl~~L~~l~Q~R~~~---~~~ani~~dye~~~~L~erwq~i~~~~~~lrq~NL~ 100 (146)
T COG3418 24 DQEQQALSSGQINGSVLQEITEQKSSLLATLDYLDQDRAKE---PNEANIFPDYESNNDLNERWQEIIELTERLRQANLH 100 (146)
T ss_pred HHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHHHHHHhc---hhhcccCCCccchHHHHHHHHHHHHHHHHHHHHHhh
Confidence 44556888888888888888887777777777665443333 23344443 34566788888999999
Q ss_pred HHHHhhchHHHHHHHhccccC
Q 022186 112 LKDVIQNKDRIIARLQQPYSL 132 (301)
Q Consensus 112 LeaVLkeK~~LrqRLqkP~~~ 132 (301)
.-.+|..+-..-+++..-+.+
T Consensus 101 NG~ll~~~~~~n~q~L~ll~~ 121 (146)
T COG3418 101 NGWLLEGQIESNQQALELLKP 121 (146)
T ss_pred hHHHHHHHHHHHHHHHHHhcc
Confidence 999999998888887554433
No 25
>PF01017 STAT_alpha: STAT protein, all-alpha domain; InterPro: IPR013800 The STAT protein (Signal Transducers and Activators of Transcription) family contains transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors, hence they act as signal transducers in the cytoplasm and transcription activators in the nucleus []. Binding of these factors to cell-surface receptors leads to receptor autophosphorylation at a tyrosine, the phosphotyrosine being recognised by the STAT SH2 domain, which mediates the recruitment of STAT proteins from the cytosol and their association with the activated receptor. The STAT proteins are then activated by phosphorylation via members of the JAK family of protein kinases, causing them to dimerise and translocate to the nucleus, where they bind to specific promoter sequences in target genes. In mammals, STATs comprise a family of seven structurally and functionally related proteins: Stat1, Stat2, Stat3, Stat4, Stat5a and Stat5b, Stat6. STAT proteins play a critical role in regulating innate and acquired host immune responses. Dysregulation of at least two STAT signalling cascades (i.e. Stat3 and Stat5) is associated with cellular transformation. Signalling through the JAK/STAT pathway is initiated when a cytokine binds to its corresponding receptor. This leads to conformational changes in the cytoplasmic portion of the receptor, initiating activation of receptor associated members of the JAK family of kinases. The JAKs, in turn, mediate phosphorylation at the specific receptor tyrosine residues, which then serve as docking sites for STATs and other signalling molecules. Once recruited to the receptor, STATs also become phosphorylated by JAKs, on a single tyrosine residue. Activated STATs dissociate from the receptor, dimerise, translocate to the nucleus and bind to members of the GAS (gamma activated site) family of enhancers.
The seven STAT proteins identified in mammals range in size from 750 to 850 amino acids. The chromosomal distribution of these STATs, as well as the identification of STATs in more primitive eukaryotes, suggest that this family arose from a single primordial gene. STATs share structurally and functionally conserved domains including: an N-terminal domain that strengthens interactions between STAT dimers on adjacent DNA-binding sites; a coiled-coil STAT domain that is implicated in protein-protein interactions; a DNA-binding domain with an immunoglobulin-like fold similar to p53 tumour suppressor protein; an EF-hand-like linker domain connecting the DNA-binding and SH2 domains; an SH2 domain (IPR000980 from INTERPRO) that acts as a phosphorylation-dependent switch to control receptor recognition and DNA-binding; and a C-terminal transactivation domain []. The crystal structure of the N terminus of Stat4 reveals a dimer. The interface of this dimer is formed by a ring-shaped element consisting of five short helices. Several studies suggest that this N-terminal dimerisation promotes cooperativity of binding to tandem GAS elements and with the transcriptional coactivator CBP/p300. This entry represents the all-alpha helical domain, which consists of four long helices arranged in a bundle with a left-handed twist (coiled-coil), which in turn forms a right-handed superhelix.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0004871 signal transducer activity, 0006355 regulation of transcription, DNA-dependent, 0007165 signal transduction, 0005634 nucleus; PDB: 1YVL_A 1BF5_A 3CWG_B 1BG1_A 1Y1U_B.
Probab=23.56 E-value=2.2e+02 Score=25.26 Aligned_cols=61 Identities=21% Similarity=0.349 Sum_probs=41.2
Q ss_pred ccChhHHHHHHHHHHHHHHHHHHHhhchHHHHHHHhccccCCCcch-hhhhhhHHHHHHHHHHh
Q 022186 92 LTHVSEMQKKIETLSRITTILKDVIQNKDRIIARLQQPYSLDCIPV-EAEYQKQFSELLMKAAS 154 (301)
Q Consensus 92 ltH~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLqkP~~~enLPV-EA~yHr~vVeLL~lavs 154 (301)
-+..+-|++-|+.|..+.-++...|++=..|.+++ ||.++.+|- -..+...+-.||..+++
T Consensus 120 ~~~LD~LQ~wfe~LAe~l~qlrqqlk~l~~l~~k~--~~~~d~~~~~~~~L~~~v~~ll~~Lv~ 181 (182)
T PF01017_consen 120 DSSLDQLQNWFESLAEILWQLRQQLKKLEELQQKL--TYENDPIPDQLPQLNERVTELLKNLVT 181 (182)
T ss_dssp ---THHHHHHHHHHHHHHHHHHHHHHHHHHHHTTS----TT-THHHHHHHHHHHHHHHHHHHHH
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--CCCCCchhhhHHHHHHHHHHHHHHHhc
Confidence 35567799999999999999999999999998876 788877662 23444555555555443
No 26
>PF01544 CorA: CorA-like Mg2+ transporter protein; InterPro: IPR002523 The CorA transport system is the primary Mg2+ influx system of Salmonella typhimurium and Escherichia coli [, ]. CorA is virtually ubiquitous in the Bacteria and Archaea. There are also eukaryotic relatives of this protein. Transporter ZntB mediates efflux of zinc ions [].; GO: 0046873 metal ion transmembrane transporter activity, 0030001 metal ion transport, 0055085 transmembrane transport, 0016020 membrane; PDB: 2HN1_A 3NWI_D 3NVO_B 3CK6_A 2IUB_E 2BBJ_E 2HN2_A 2BBH_A.
Probab=23.40 E-value=5.2e+02 Score=22.83 Aligned_cols=53 Identities=19% Similarity=0.346 Sum_probs=25.2
Q ss_pred HHHHHHHHhhcccceeeeccccccccccChhHHHHHHHHHHHHHHHHHHHhhchHHHHHHHhc
Q 022186 66 LTAVQRKIADLQVELQGRKDDKNVAHLTHVSEMQKKIETLSRITTILKDVIQNKDRIIARLQQ 128 (301)
Q Consensus 66 IT~lQ~eLdq~nLEIelLklDKeTADltH~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLqk 128 (301)
+..+..+++++.-++ ........+-....+. .--.++...+....+...|+.+
T Consensus 127 l~~l~~~l~~le~~~---~~~~~~~~~~~l~~l~-------~~l~~l~~~l~~~~~~l~~~~~ 179 (292)
T PF01544_consen 127 LEELEDELDELEDEL---DDRPSNELLRELFDLR-------RELSRLRRSLSPLREVLQRLLR 179 (292)
T ss_dssp HHHHHHHHHHHHHHH---THTTTHHHCCHHHHHH-------HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhc---ccccchhhHHHHHHHH-------HHHHHHHHHhhhHHHHHHHHHH
Confidence 344555555555555 2222223333333344 3444445555555566656666
No 27
>COG5296 Transcription factor involved in TATA site selection and in elongation by RNA polymerase II [Transcription]
Probab=22.42 E-value=56 Score=34.00 Aligned_cols=72 Identities=22% Similarity=0.338 Sum_probs=55.0
Q ss_pred HHHHHHHHhhcccceeeeccccccccccC-hhHHHHHHHHHHHHHHHHHHHhhchHHHHHHHhccccCCCcchhhhhhhH
Q 022186 66 LTAVQRKIADLQVELQGRKDDKNVAHLTH-VSEMQKKIETLSRITTILKDVIQNKDRIIARLQQPYSLDCIPVEAEYQKQ 144 (301)
Q Consensus 66 IT~lQ~eLdq~nLEIelLklDKeTADltH-~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLqkP~~~enLPVEA~yHr~ 144 (301)
|..+++.++++---+-...-||++.+++. .. .++--+..|+-+|.+||+|+|.-+...|=..-+.|+++
T Consensus 323 ~~~v~~k~~~l~d~~~~~LSdkeis~~V~~k~----------e~~~k~sNvi~eKt~Lrqkrq~A~e~~n~k~~~ey~~q 392 (521)
T COG5296 323 IAKVKEKYDKLVDTMGRRLSDKEISKMVACKD----------EVHPKRSNVIHEKTELRQKRQRAIELKNKKAAMEYQRQ 392 (521)
T ss_pred HHHHHHHHHHHHHHhCCcCchhHHHHHHHHHH----------hcCccchhHHHHHHHHHHHHHHHHHccCHHHHHHHHHH
Confidence 44566777777777777778999988753 22 24445668999999999999999999998888888876
Q ss_pred HHH
Q 022186 145 FSE 147 (301)
Q Consensus 145 vVe 147 (301)
.-+
T Consensus 393 L~~ 395 (521)
T COG5296 393 LEE 395 (521)
T ss_pred HHH
Confidence 533
No 28
>PF06013 WXG100: Proteins of 100 residues with WXG; InterPro: IPR010310 ESAT-6 is a small protein that appears to be of fundamental importance in virulence and protective immunity in Mycobacterium tuberculosis. Homologues have been detected in other Gram-positive bacterial species. It may represent a novel secretion system potentially driven by the PF01580 from PFAM domains in the YukA-like proteins []. Members of this protein family include secretion targets for the two main variants of type VII secretion systems (T7SS), one found in the Actinobacteria, one found in the Firmicutes. This model was derived through iteration from PF06013 from PFAM. The best characterised member of this family is ESAT-6 from Mycobacterium tuberculosis. Members of this family usually are ~100 amino acids in length but occasionally have a long C-terminal extension. ; PDB: 3FAV_A 1WA8_A 3Q4H_B 2KG7_A 2VRZ_B 2VS0_B 3OGI_A 3H6P_B 3GVM_B 3GWK_C ....
Probab=22.42 E-value=2.9e+02 Score=19.50 Aligned_cols=42 Identities=14% Similarity=0.219 Sum_probs=35.7
Q ss_pred cChhHHHHHHHHHHHHHHHHHHHhhchHHHHHHHhccccCCC
Q 022186 93 THVSEMQKKIETLSRITTILKDVIQNKDRIIARLQQPYSLDC 134 (301)
Q Consensus 93 tH~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLqkP~~~en 134 (301)
+.+..|..-...++...+.|+..++.=...+..|.--..|+-
T Consensus 4 vd~~~l~~~a~~~~~~~~~l~~~~~~l~~~~~~l~~~W~G~a 45 (86)
T PF06013_consen 4 VDPEQLRAAAQQLQAQADELQSQLQQLESSIDSLQASWQGEA 45 (86)
T ss_dssp SCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHGGGBTSST
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhCCchH
Confidence 356678888899999999999999999999999977777764
No 29
>cd09238 V_Alix_like_1 Protein-interacting V-domain of an uncharacterized family of the V_Alix_like superfamily. This domain family is comprised of uncharacterized plant proteins. It belongs to the V_Alix_like superfamily which includes the V-shaped (V) domains of Bro1 and Rim20 (also known as PalA) from Saccharomyces cerevisiae, mammalian Alix (apoptosis-linked gene-2 interacting protein X), (His-Domain) type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), and related domains. Alix, also known as apoptosis-linked gene-2 interacting protein 1 (AIP1), participates in membrane remodeling processes during the budding of enveloped viruses, vesicle budding inside late endosomal multivesicular bodies (MVBs), and the abscission reactions of mammalian cell division. It also functions in apoptosis. HD-PTP functions in cell migration and endosomal trafficking, Bro1 in endosomal trafficking, and Rim20 in the response to the external pH via the Rim101 pathway. Alix, HD-PTP, Bro1, a
Probab=22.25 E-value=4e+02 Score=25.82 Aligned_cols=21 Identities=19% Similarity=0.476 Sum_probs=14.7
Q ss_pred HHHHHHHHhhchHHHHHHHhc
Q 022186 108 ITTILKDVIQNKDRIIARLQQ 128 (301)
Q Consensus 108 mnsHLeaVLkeK~~LrqRLqk 128 (301)
....++.-+.+|..|.+.|+.
T Consensus 252 ~~~~v~~~~~~Q~~ll~~i~~ 272 (339)
T cd09238 252 VREAVSKNISSQDDLLSRLRA 272 (339)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 344455667788888888874
No 30
>PF08182 Pedibin: Pedibin/Hym-346 family; InterPro: IPR012594 This family consists of the pedibin and Hym-346 signalling peptides. These two peptides have been isolated from Hydra attenuata (Hydra) (Hydra vulgaris) and Hydra magnipapillata (Hydra). Experiments have indicated that both cause a reduction in the positional value gradient, the principle patterning process governing the maintenance of form in the adult hydra. The peptides cause an increase in the rate of foot regeneration following bisection of the body column. Thus both play important signalling roles in patterning processes in cnidaria and maybe in more complex metazoans [].
Probab=22.18 E-value=53 Score=23.22 Aligned_cols=32 Identities=25% Similarity=0.374 Sum_probs=19.9
Q ss_pred HHHHHhhcccceeeeccccccccccChhHHHHHHHHHHHH
Q 022186 69 VQRKIADLQVELQGRKDDKNVAHLTHVSEMQKKIETLSRI 108 (301)
Q Consensus 69 lQ~eLdq~nLEIelLklDKeTADltH~~~L~kK~e~LQ~m 108 (301)
+++||+.+|... -+--|+-| .|++||+.|+.+
T Consensus 2 L~~EI~~Lq~~~------a~Gedv~~--~LE~Kek~L~n~ 33 (35)
T PF08182_consen 2 LCAEIDVLQIQL------ADGEDVCK--ELEQKEKELSNF 33 (35)
T ss_pred HHHHHHHHHHHH------hcchhHHH--HHHHHHHHHHhc
Confidence 467777766433 12234444 589999998864
No 31
>PRK14011 prefoldin subunit alpha; Provisional
Probab=21.70 E-value=1.7e+02 Score=25.71 Aligned_cols=34 Identities=18% Similarity=0.387 Sum_probs=28.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhchHHHHHHHhc
Q 022186 95 VSEMQKKIETLSRITTILKDVIQNKDRIIARLQQ 128 (301)
Q Consensus 95 ~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLqk 128 (301)
..|+.+|++.|+.--.-|..+|+++...+.+|++
T Consensus 90 ~~~~~~ri~~l~~~~~~l~~~i~~~~~~~~~l~~ 123 (144)
T PRK14011 90 IEDFKKSVEELDKTKKEGNKKIEELNKEITKLRK 123 (144)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3589999999999888888888888888877764
No 32
>PF12308 Noelin-1: Neurogenesis glycoprotein; InterPro: IPR022082 This domain family is found in eukaryotes, and is approximately 100 amino acids in length. The family is found in association with PF02191 from PFAM. There are two conserved sequence motifs: SAQ and VQN. Noelin-1 is a glycoprotein which is secreted mainly by postmitotic neurogenic tissues in the developing central and peripheral nervous systems, first appearing after neural tube closure. It is likely that it forms large multimeric complexes. It has a divergent function in neurogenesis. In animal caps neuralized by expression of noggin, co-expression of Noelin-1 causes expression of neuronal differentiation markers several stages before neurogenesis normally occurs in this tissue. Finally, only secreted forms of the protein can activate sensory marker expression, while all forms of the protein can induce early neurogenesis.
Probab=21.30 E-value=5e+02 Score=22.22 Aligned_cols=80 Identities=18% Similarity=0.313 Sum_probs=60.7
Q ss_pred hCCCCCCCCCHHHHHHHhhCCCCchHHHHHHHHHHHHHHHHhhcccceeeeccccccccccChhHHHHHHHHHHHHHHHH
Q 022186 33 LGFSVAPPPSQEELQNLCANGEKGDDLIRVLRELTAVQRKIADLQVELQGRKDDKNVAHLTHVSEMQKKIETLSRITTIL 112 (301)
Q Consensus 33 Lgh~vas~~sqE~lq~~~~~~a~s~~l~s~LrQIT~lQ~eLdq~nLEIelLklDKeTADltH~~~L~kK~e~LQ~mnsHL 112 (301)
-||||-.-+.. .||+|..-+++=.+-..|++..+|.+.|..+++.... | ..|+.+=--.|..+.+-|
T Consensus 18 dGrCvCTVvaP--~q~~CSrD~r~~qlrqllekVqNmSqsievL~~RT~r--------d---lqyv~~~E~~mk~l~~k~ 84 (101)
T PF12308_consen 18 DGRCVCTVVAP--QQNLCSRDARSRQLRQLLEKVQNMSQSIEVLDLRTQR--------D---LQYVRKMETQMKGLESKF 84 (101)
T ss_pred CCCEEEEEecC--CcchhccCccHHHHHHHHHHHHHHHHHHHHHHhhccc--------h---HHHHHHHHHHHHHHHHHH
Confidence 36776444221 2588999999999999999999999999999887642 2 345666666777888888
Q ss_pred HHHhhchHHHHHH
Q 022186 113 KDVIQNKDRIIAR 125 (301)
Q Consensus 113 eaVLkeK~~LrqR 125 (301)
..|-.+++.|.+|
T Consensus 85 ~~~e~~~~~l~~k 97 (101)
T PF12308_consen 85 RQVEDDRKSLSAK 97 (101)
T ss_pred HHHhcCHHHhhhh
Confidence 8888888888776
No 33
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=21.02 E-value=2.3e+02 Score=26.53 Aligned_cols=30 Identities=20% Similarity=0.284 Sum_probs=23.5
Q ss_pred CCCchHHHHHHHHHHHHHHHHhhcccceee
Q 022186 53 GEKGDDLIRVLRELTAVQRKIADLQVELQG 82 (301)
Q Consensus 53 ~a~s~~l~s~LrQIT~lQ~eLdq~nLEIel 82 (301)
...+-.++...+||.++|+||++++=+||.
T Consensus 50 ~~~~~~~~~l~~ql~~lq~ev~~LrG~~E~ 79 (263)
T PRK10803 50 NAHSQLLTQLQQQLSDNQSDIDSLRGQIQE 79 (263)
T ss_pred HhhhHHHHHHHHHHHHHHHHHHHHhhHHHH
Confidence 355567788888999999999988877754
No 34
>PRK10604 sensor protein RstB; Provisional
Probab=20.63 E-value=7.6e+02 Score=23.71 Aligned_cols=55 Identities=15% Similarity=0.285 Sum_probs=37.3
Q ss_pred HHHHHHHHHHHHhhcccceeeeccccccccccChhHHHHHHHHHHHHHHHHHHHhhchHHHHH
Q 022186 62 VLRELTAVQRKIADLQVELQGRKDDKNVAHLTHVSEMQKKIETLSRITTILKDVIQNKDRIIA 124 (301)
Q Consensus 62 ~LrQIT~lQ~eLdq~nLEIelLklDKeTADltH~~~L~kK~e~LQ~mnsHLeaVLkeK~~Lrq 124 (301)
.|+++....+.+.+-++... ..+...+++..-.++++.|...++..++.++++.+
T Consensus 163 ~l~~L~~~~~~~~~g~~~~~--------~~~~~~~el~~L~~~fn~m~~~l~~~~~~~~~l~~ 217 (433)
T PRK10604 163 DMLKLEAAAQRLGDGHLAER--------IHFDEGSSLERLGVAFNQMADNINALIASKKQLID 217 (433)
T ss_pred HHHHHHHHHHHHhcCCCccc--------cCCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35555555556655554432 22345567888888999999999999988877654
No 35
>smart00721 BAR BAR domain.
Probab=20.50 E-value=3.1e+02 Score=23.76 Aligned_cols=97 Identities=7% Similarity=0.145 Sum_probs=56.5
Q ss_pred HHHHHHHHHHHHHHhhcccceeeecc-------ccccc-cccChhHHHHHHHHH----HHHHHHHHHHhhchHHHHHHHh
Q 022186 60 IRVLRELTAVQRKIADLQVELQGRKD-------DKNVA-HLTHVSEMQKKIETL----SRITTILKDVIQNKDRIIARLQ 127 (301)
Q Consensus 60 ~s~LrQIT~lQ~eLdq~nLEIelLkl-------DKeTA-DltH~~~L~kK~e~L----Q~mnsHLeaVLkeK~~LrqRLq 127 (301)
...+.++....+..+...++....+- .+... |- -..-.+++++.. ..++..|..-|..=-..+.-..
T Consensus 130 ~~~~~~~~~~~kk~~~~~lDyD~~~~kl~~~~~~~~~~~~~-kl~~~e~el~~ak~~fe~~~~~l~~~l~~l~~~~~~~~ 208 (239)
T smart00721 130 LGEFKEIKKARKKLERKLLDYDSARHKLKKAKKSKEKKKDE-KLAKAEEELRKAKQEFEESNAQLVEELPQLVASRVDFF 208 (239)
T ss_pred HHHhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhccCChhh-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhH
Confidence 35566677777777777776655441 11111 11 222234333222 2344555554444444444556
Q ss_pred ccccCCCcchhhhhhhHHHHHHHHHHhHHH
Q 022186 128 QPYSLDCIPVEAEYQKQFSELLMKAASDYG 157 (301)
Q Consensus 128 kP~~~enLPVEA~yHr~vVeLL~lavsfIe 157 (301)
.|.....+-.++.||+.+.++|..+...+.
T Consensus 209 ~~~l~~~~~aq~~y~~~~~~~l~~l~~~l~ 238 (239)
T smart00721 209 VNCLQALIEAQLNFHRESYKLLQQLQQQLD 238 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 667777788899999999999988877653
No 36
>cd07598 BAR_FAM92 The Bin/Amphiphysin/Rvs (BAR) domain of Family with sequence similarity 92 (FAM92). BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions including organelle biogenesis, membrane trafficking or remodeling, and cell division and migration. This group is composed of proteins from the family with sequence similarity 92 (FAM92), which were originally identified by the presence of the unknown domain DUF1208. This domain shows similarity to the BAR domains of sorting nexins. Mammals contain at least two member types, FAM92A and FAM92B, which may exist in many variants. The Xenopus homolog of FAM92A1, xVAP019, is essential for embryo survival and cell differentiation. FAM92A1 may be involved in regulating cell proliferation and apoptosis. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=20.35 E-value=99 Score=28.42 Aligned_cols=56 Identities=18% Similarity=0.180 Sum_probs=43.4
Q ss_pred HHHHHHHHHhhchHHHHHHHhccccCCCcchhhhhhhHHHHHHHHHHhHHHHHHHh
Q 022186 107 RITTILKDVIQNKDRIIARLQQPYSLDCIPVEAEYQKQFSELLMKAASDYGALTAS 162 (301)
Q Consensus 107 ~mnsHLeaVLkeK~~LrqRLqkP~~~enLPVEA~yHr~vVeLL~lavsfIe~Lee~ 162 (301)
..+.+|+.-+..=..=+.+=.||.-.+++.++..||..+++++..+...|+++.+.
T Consensus 153 r~s~~l~ee~~rFe~~k~~d~K~~l~~fv~~~m~~~~kale~~~~~~~~~~~~~~~ 208 (211)
T cd07598 153 RSTKELEEQMDNFEKQKIRDIKTIFSDFVLIEMLFHAKALEVYTAAYQDIQNIDEE 208 (211)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc
Confidence 45555555555444445566789999999999999999999999999999887653
No 37
>PF11461 RILP: Rab interacting lysosomal protein; InterPro: IPR021563 RILP contains a domain which contains two coiled-coil regions and is found mainly in the cytosol. RILP is recruited onto late endosomal and lysosomal membranes by Rab7 and acts as a downstream effector of Rab7. This recruitment process is important for phagosome maturation and fusion with late endosomes and lysosomes. ; PDB: 1YHN_B.
Probab=20.31 E-value=81 Score=24.40 Aligned_cols=17 Identities=29% Similarity=0.565 Sum_probs=15.0
Q ss_pred HHHHHhhchHHHHHHHh
Q 022186 111 ILKDVIQNKDRIIARLQ 127 (301)
Q Consensus 111 HLeaVLkeK~~LrqRLq 127 (301)
-|++||++|..|..||.
T Consensus 4 ELr~VL~ERNeLK~~v~ 20 (60)
T PF11461_consen 4 ELREVLQERNELKARVF 20 (60)
T ss_dssp THHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 37899999999999985
No 38
>PRK11085 magnesium/nickel/cobalt transporter CorA; Provisional
Probab=20.18 E-value=8.1e+02 Score=23.85 Aligned_cols=56 Identities=11% Similarity=0.127 Sum_probs=37.3
Q ss_pred HHHHHHhhcccceeeeccccccccccChhHHHHHHHHHHHHHHHHHHHhhchHHHHHHHhcc
Q 022186 68 AVQRKIADLQVELQGRKDDKNVAHLTHVSEMQKKIETLSRITTILKDVIQNKDRIIARLQQP 129 (301)
Q Consensus 68 ~lQ~eLdq~nLEIelLklDKeTADltH~~~L~kK~e~LQ~mnsHLeaVLkeK~~LrqRLqkP 129 (301)
.+..+|+++.-+|=. . ....++ ..+-++.-.+..++..+..++--++++..+|.++
T Consensus 150 ~~~~~ld~ls~~if~-~--~~~~~~---~~~l~~i~~l~~~~~~~r~~l~~~~r~l~~l~~~ 205 (316)
T PRK11085 150 NIYSDLEKLSRVIME-G--HQGDEY---DEALSTLAELEDIGWKVRLCLMDTQRALNFLVRK 205 (316)
T ss_pred HHHHHHHHHHHHhcc-C--CCchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 344566666666621 1 111122 2233778888999999999999999999999864
Done!