RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9770
         (811 letters)



>gnl|CDD|190637 pfam03451, HELP, HELP motif.  The founding member of the EMAP
           protein family is the 75 kDa Echinoderm
           Microtubule-Associated Protein, so-named for its
           abundance in sea urchin, sand dollar and starfish eggs.
           The Hydrophobic EMAP-Like Protein (HELP) motif was
           identified initially in the human EMAP-Like Protein 2
           (EML2) and subsequently in the entire EMAP Protein
           family. The HELP motif is approximately 60-70 amino
           acids in length and is conserved amongst metazoans.
           Although the HELP motif is hydrophobic, there is no
           evidence that EMAP-Like Proteins are
           membrane-associated. All members of the EMAP-Like
           Protein family, identified to-date, are constructed with
           an amino terminal HELP motif followed by a WD domain. In
           C. elegans, EMAP-Like Protein-1 (ELP-1) is required for
           touch sensation indicating that ELP-1 may play a role in
           mechanosensation. The localization of ELP-1 to
           microtubules and adhesion sites implies that ELP-1 may
           transmit forces between the body surface and the touch
           receptor neurons.
          Length = 77

 Score =  122 bits (309), Expect = 2e-33
 Identities = 52/78 (66%), Positives = 63/78 (80%), Gaps = 1/78 (1%)

Query: 211 EESSLRMYLRGRPINLYAPSDLAESYDITKVAAPPQNKLKLDWVYGYRGKDCRSNLYLLP 270
           E   ++M++RGRP+ +Y PSD  E+YD+     PP  KLKL+WVYGYRG+DCR+NLYLLP
Sbjct: 1   ESGYVKMFIRGRPVTMYIPSD-VENYDLEADKDPPTKKLKLEWVYGYRGRDCRANLYLLP 59

Query: 271 TGEIVYFVAAVAVLYNVE 288
           TGEIVYFVAAV VLYNVE
Sbjct: 60  TGEIVYFVAAVVVLYNVE 77


>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score = 97.8 bits (244), Expect = 3e-22
 Identities = 74/296 (25%), Positives = 130/296 (43%), Gaps = 19/296 (6%)

Query: 445 IRDKPKYVTCLAFTQGGEVL-SGDSNGNIIVWTRGSNTISKFIRNVHEGSVFSICVLKEG 503
           ++     VTC+AF+  G++L +G  +G I VW   +  + + ++  H G V  +    +G
Sbjct: 5   LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADG 63

Query: 504 SIISAGGKDGRLLQFDSNLQPTGYEAQIAEHLGGIRTVSEGRGSQLIVGTTR-NCILVGS 562
           + +++G  D  +  +D           +  H   + +V+     +++  ++R   I V  
Sbjct: 64  TYLASGSSDKTIRLWDLETGECVRT--LTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWD 121

Query: 563 LNLG-FSPVVLGHTDELWALAAHPSLSQFLTAG-FDRIVQLWDSMSHSVVWS-KDIAEQA 619
           +  G     + GHTD + ++A  P    F+ +   D  ++LWD  +   V +      + 
Sbjct: 122 VETGKCLTTLRGHTDWVNSVAFSPD-GTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEV 180

Query: 620 QSATFSPDGSIIIVGCIAGR---WLVMDSETREAYSIHTDGNEPIQVVKFSPDGKLLALG 676
            S  FSPDG  ++     G    W +   +       H +    +  V FSPDG LLA G
Sbjct: 181 NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHEN---GVNSVAFSPDGYLLASG 237

Query: 677 SRDNYIYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSEDSMYIRSNSGDYELLYWN 732
           S D  I V+    D R    V   +GH++ +T L WS D   + S S D  +  W+
Sbjct: 238 SEDGTIRVW----DLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289



 Score = 95.5 bits (238), Expect = 1e-21
 Identities = 64/318 (20%), Positives = 119/318 (37%), Gaps = 38/318 (11%)

Query: 293 RHYLGHTDDIKCLSIHPNKLLIATGQVAGHDAREGKPHIRIWNSVSLFTVSIIGIGEFER 352
           R   GHT  + C++  P+  L+ATG     D       I++W+ +    +     G    
Sbjct: 3   RTLKGHTGGVTCVAFSPDGKLLATG---SGDGT-----IKVWD-LETGELLRTLKGH-TG 52

Query: 353 SISCLSFSKADGGTFLCAVDEGNDYNISVWDWQKSDRGHKITETKCSVDTVVAAEFHPLD 412
            +  ++ S ADG          +D  I +WD +  +    +T     V +V    F P  
Sbjct: 53  PVRDVAAS-ADGTYLASG---SSDKTIRLWDLETGECVRTLTGHTSYVSSVA---FSPDG 105

Query: 413 RNNIVTCGKSH--INFWTIDAGGTLYKKQGIFEMIRDKPKYVTCLAFTQGGEVL-SGDSN 469
           R  I++       I  W ++ G  L         +R    +V  +AF+  G  + S   +
Sbjct: 106 R--ILSSSSRDKTIKVWDVETGKCLTT-------LRGHTDWVNSVAFSPDGTFVASSSQD 156

Query: 470 GNIIVWTRGSNTISKFIRN--VHEGSVFSICVLKEGSIISAGGKDGRLLQFDSNLQPTGY 527
           G I +W        K +     H G V S+    +G  + +   DG +  +D +      
Sbjct: 157 GTIKLW---DLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCL- 212

Query: 528 EAQIAEHLGGIRTVSEGRGSQLIVGTTRNCILVG--SLNLGFSPVVLGHTDELWALAAHP 585
              +  H  G+ +V+      L+   + +  +             + GHT+ + +LA  P
Sbjct: 213 -GTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271

Query: 586 SLSQFLTAGFDRIVQLWD 603
              +  +   D  +++WD
Sbjct: 272 DGKRLASGSADGTIRIWD 289



 Score = 87.0 bits (216), Expect = 9e-19
 Identities = 70/292 (23%), Positives = 113/292 (38%), Gaps = 46/292 (15%)

Query: 529 AQIAEHLGGIRTVSEGRGSQLIV-GTTRNCILVGSLNLGFSPVVL-GHTDELWALAAHPS 586
             +  H GG+  V+     +L+  G+    I V  L  G     L GHT  +  +AA   
Sbjct: 3   RTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD 62

Query: 587 LSQFLTAGFDRIVQLWDSMSHSVVW-----SKDIAEQAQSATFSPDGSIIIVGCIAGRWL 641
            +   +   D+ ++LWD  +   V      +  ++    S  FSPDG I+          
Sbjct: 63  GTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVS----SVAFSPDGRILSSSSRDKTIK 118

Query: 642 VMDSETRE---AYSIHTDGNEPIQVVKFSPDGKLLALGSRDNYIYVYEVSGDSRKYSRVG 698
           V D ET +       HTD    +  V FSPDG  +A  S+D  I ++    D R    V 
Sbjct: 119 VWDVETGKCLTTLRGHTD---WVNSVAFSPDGTFVASSSQDGTIKLW----DLRTGKCVA 171

Query: 699 RCTGHSSFITHLDWSEDSMYIRSNSGDYELLYWNAGTCRQIPQSSQLRDVVWESNSCTLT 758
             TGH+  +  + +S D   + S+S D  +  W+  T + +                TL 
Sbjct: 172 TLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCL---------------GTLR 216

Query: 759 FTTIGIWPEGADGTDVNSCERSHDARVIATGDDFGKLKLYSYPAVQPKLCLS 810
                          VNS   S D  ++A+G + G ++++     +    LS
Sbjct: 217 ----------GHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLS 258


>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
           only].
          Length = 466

 Score =  100 bits (248), Expect = 4e-22
 Identities = 101/464 (21%), Positives = 181/464 (39%), Gaps = 44/464 (9%)

Query: 270 PTGEIVYFVAAVAVLYNVEEQTQRHYLGHTDDIKCLSIHPNKLLIATGQVAGHDAREGKP 329
               I+        L ++ + +     GH D I  ++  P+  L+ +G   G        
Sbjct: 36  SESGILLLALLSDSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGT------- 88

Query: 330 HIRIWNSVSLFTVSIIGIGEFERSISCLSFSKADGGTFLCAVDEGNDYNISVWDWQKSDR 389
            I++W+  +   +     G  + S+S L+ S  DG + L A     D  + +WD   S  
Sbjct: 89  -IKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLAS-SSLDGTVKLWDL--STP 144

Query: 390 GHKITETKCSVDTVVAAEFHPLDRNNIVTCGKSH-INFWTIDAGGTLYKKQGIFEMIRDK 448
           G  I   +   ++V +  F P  +           I  W +  G  L         +   
Sbjct: 145 GKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLST-------LAGH 197

Query: 449 PKYVTCLAFTQGGE--VLSGDSNGNIIVWTRGSNTISKFIRNVHEGSVFSICVLKEGSII 506
              V+ LAF+  G   + SG S+G I +W   +  + +   + H  SV S     +GS++
Sbjct: 198 TDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVS-SFSPDGSLL 256

Query: 507 SAGGKDGRLLQFDSNLQPTGYEAQIAEHLGGIRTVS-EGRGSQLIVGTTRNCILVGSLNL 565
           ++G  DG +  +D     +     ++ H   + +V+    G  L  G++   + +  L  
Sbjct: 257 ASGSSDGTIRLWDLRSSSSLL-RTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLET 315

Query: 566 GFSPVVL---GHTDELWALAAHPSLSQFLTAG-FDRIVQLWDSMSHSVVWSKDIAEQAQS 621
           G     L   GH   + +L+  P  S  ++ G  D  ++LWD  +   + + +      S
Sbjct: 316 GKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLS 375

Query: 622 ATFSPDGSIIIVGCIAGRWLVMDSETREAYSIHTDGNEPIQVVKFSPDGKLLALGSRDNY 681
            +FSPDG ++  G   G   + D  T             +  + FSPDGK LA GS DN 
Sbjct: 376 VSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNT 435

Query: 682 IYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSEDSMYIRSNSGD 725
           I ++++                 + +  + +S D   + S S D
Sbjct: 436 IRLWDL----------------KTSLKSVSFSPDGKVLASKSSD 463



 Score = 98.6 bits (244), Expect = 1e-21
 Identities = 93/424 (21%), Positives = 167/424 (39%), Gaps = 28/424 (6%)

Query: 337 VSLFTVSIIGIGEFERSISCLSFSKADGGTFLCAVDEGNDYNISVWDWQKSDRGHKITET 396
           VSL  +S + +   E SI+ ++FS    G  L +    +D  I +WD          +  
Sbjct: 51  VSLPDLSSLLLRGHEDSITSIAFSP--DGELLLS--GSSDGTIKLWDLDNG-EKLIKSLE 105

Query: 397 KCSVDTVVAAEFHPLDRNNIVTCGKSHINFWTIDAGGTLYKKQGIFEMIRDKPKYVTCLA 456
                +V        D N+I+    S      +    T  K     E      + VT LA
Sbjct: 106 GLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLE---GHSESVTSLA 162

Query: 457 FTQGGEVL--SGDSNGNIIVWTRGSNTISKFIRNVHEGSVFSICVLKEGSIISAGGKDGR 514
           F+  G++L      +G I +W   +      +   H   V S+    +G ++ A G    
Sbjct: 163 FSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAG-HTDPVSSLAFSPDGGLLIASGSSDG 221

Query: 515 LLQFDSNLQPTGYEAQIAEHLGGIRTVSEGRGSQLIVGTTRNCILVGSLNLGFSPVVL-- 572
            ++           + ++ H   + +     GS L  G++   I +  L    S +    
Sbjct: 222 TIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLS 281

Query: 573 GHTDELWALAAHPSLSQFLTAGFDRIVQLWDSMSHSVVWSKDI---AEQAQSATFSPDGS 629
           GH+  + ++A  P      +   D  V+LWD  +  ++ S  +        S +FSPDGS
Sbjct: 282 GHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGS 341

Query: 630 IIIVGC----IAGRWLVMDSETREAYSIHTDGNEPIQVVKFSPDGKLLALGSRDNYIYVY 685
           +++ G         W +   +  +    H++       V FSPDG++++ GS D  + ++
Sbjct: 342 LLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLS----VSFSPDGRVVSSGSTDGTVRLW 397

Query: 686 EVSGDSRKYSRVGRCTGHSSFITHLDWSEDSMYIRSNSGDYELLYWNAGTCRQIPQSSQL 745
           ++S  S   +      GH+S +T LD+S D   + S S D  +  W+  T  +    S  
Sbjct: 398 DLSTGSLLRNL----DGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSPD 453

Query: 746 RDVV 749
             V+
Sbjct: 454 GKVL 457


>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
          Length = 793

 Score = 42.8 bits (100), Expect = 7e-04
 Identities = 43/160 (26%), Positives = 74/160 (46%), Gaps = 32/160 (20%)

Query: 588 SQFLTAGFDRIVQLWDSMSHSV----------VWSKDI--AEQAQSATFSPDGSIIIVGC 635
           SQ  ++ F+ +VQ+WD     +          VWS D   A+    A+ S DGS+ +   
Sbjct: 546 SQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSI 605

Query: 636 IAGRWLVMDSETREAYSIHTDGNEPIQVVKFSPD-GKLLALGSRDNYIYVYEVSGDSRKY 694
             G  +          +I T  N  I  V+F  + G+ LA GS D+ +Y Y++     + 
Sbjct: 606 NQGVSI---------GTIKTKAN--ICCVQFPSESGRSLAFGSADHKVYYYDL-----RN 649

Query: 695 SRVGRCT--GHSSFITHLDWSEDSMYIRSNSGDYELLYWN 732
            ++  CT  GHS  ++++ + + S  + S+S D  L  W+
Sbjct: 650 PKLPLCTMIGHSKTVSYVRFVDSSTLV-SSSTDNTLKLWD 688


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 36.2 bits (84), Expect = 0.002
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 573 GHTDELWALAAHPSLSQFLTAGFDRIVQLWD 603
           GHT  + ++A  P  +   +   D  V++WD
Sbjct: 9   GHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 32.3 bits (74), Expect = 0.047
 Identities = 11/47 (23%), Positives = 21/47 (44%), Gaps = 8/47 (17%)

Query: 289 EQTQRHYLGHTDDIKCLSIHPNKLLIATGQVAGHDAREGKPHIRIWN 335
            +  R   GHT  +  ++  P+  L+A+G     D       +R+W+
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASG---SDDG-----TVRVWD 39



 Score = 31.6 bits (72), Expect = 0.11
 Identities = 16/35 (45%), Positives = 19/35 (54%), Gaps = 3/35 (8%)

Query: 651 YSIHTDGNEPIQVVKFSPDGKLLALGSRDNYIYVY 685
              HT    P+  V FSPDG LLA GS D  + V+
Sbjct: 7   LKGHTG---PVTSVAFSPDGNLLASGSDDGTVRVW 38



 Score = 27.7 bits (62), Expect = 2.2
 Identities = 9/38 (23%), Positives = 17/38 (44%)

Query: 695 SRVGRCTGHSSFITHLDWSEDSMYIRSNSGDYELLYWN 732
             +    GH+  +T + +S D   + S S D  +  W+
Sbjct: 2   KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 26.5 bits (59), Expect = 6.1
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 490 HEGSVFSICVLKEGSIISAGGKDGRLLQFD 519
           H G V S+    +G+++++G  DG +  +D
Sbjct: 10  HTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are
           permuted with respect to the structural repeats (blades)
           of the beta propeller domain.
          Length = 40

 Score = 35.8 bits (83), Expect = 0.003
 Identities = 9/31 (29%), Positives = 15/31 (48%)

Query: 573 GHTDELWALAAHPSLSQFLTAGFDRIVQLWD 603
           GHT  + ++A  P      +   D  ++LWD
Sbjct: 10  GHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 32.3 bits (74), Expect = 0.046
 Identities = 16/36 (44%), Positives = 20/36 (55%), Gaps = 3/36 (8%)

Query: 651 YSIHTDGNEPIQVVKFSPDGKLLALGSRDNYIYVYE 686
              HT    P+  V FSPDGK LA GS D  I +++
Sbjct: 8   LKGHTG---PVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 31.1 bits (71), Expect = 0.13
 Identities = 9/39 (23%), Positives = 18/39 (46%), Gaps = 8/39 (20%)

Query: 297 GHTDDIKCLSIHPNKLLIATGQVAGHDAREGKPHIRIWN 335
           GHT  +  ++  P+   +A+G     D       I++W+
Sbjct: 10  GHTGPVTSVAFSPDGKYLASG---SDDG-----TIKLWD 40



 Score = 30.7 bits (70), Expect = 0.21
 Identities = 10/36 (27%), Positives = 18/36 (50%)

Query: 697 VGRCTGHSSFITHLDWSEDSMYIRSNSGDYELLYWN 732
           +    GH+  +T + +S D  Y+ S S D  +  W+
Sbjct: 5   LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 27.3 bits (61), Expect = 2.9
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query: 490 HEGSVFSICVLKEGSIISAGGKDGRLLQFD 519
           H G V S+    +G  +++G  DG +  +D
Sbjct: 11  HTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 26.5 bits (59), Expect = 5.4
 Identities = 11/27 (40%), Positives = 16/27 (59%), Gaps = 1/27 (3%)

Query: 450 KYVTCLAFTQGGEVL-SGDSNGNIIVW 475
             VT +AF+  G+ L SG  +G I +W
Sbjct: 13  GPVTSVAFSPDGKYLASGSDDGTIKLW 39


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases [Signal transduction
           mechanisms].
          Length = 1175

 Score = 37.6 bits (87), Expect = 0.030
 Identities = 25/138 (18%), Positives = 45/138 (32%), Gaps = 16/138 (11%)

Query: 74  SPISSSPLLKNGTLKGKSFSSSKETLRYRPLNDQINRDPVRRVVSY---TPSSLP-QRRG 129
            PI+ SP   + T    S    +    + P +D ++ +P               P Q+R 
Sbjct: 90  GPITHSPSATSSTSSLNSNDGDQ----FSPASDSLSFNPSSTQSRKDSGPGDGSPVQKRK 145

Query: 130 VHYQSTGSLHS-------DSPSSSSV-SPVPSSPRPLPPKTSWTSNLHTSKRWSSTGDFH 181
                + S H           + S   +P   S + +P   S +  L +           
Sbjct: 146 NPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSD 205

Query: 182 HHTMSSHHSVHQNHHNSS 199
                S+ S+ +N  +SS
Sbjct: 206 TSNGFSYPSIRKNSRHSS 223



 Score = 31.4 bits (71), Expect = 2.4
 Identities = 25/141 (17%), Positives = 38/141 (26%), Gaps = 17/141 (12%)

Query: 63  SSKNIANNS---FGSPISSSPLLKNGTLKGKSFSSSKETLRYRPLNDQINRDPVRRVVSY 119
           SS     +S    GSP+      KN  L   S   +   + +   N      P  R    
Sbjct: 125 SSTQSRKDSGPGDGSPVQKR---KNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKE 181

Query: 120 TPSSLPQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPPKTSWTSNLHTSKRWSSTGD 179
            PS            + S+   SP          +       +   ++ H+S    S   
Sbjct: 182 IPSL----------GSQSMQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPH 231

Query: 180 FHHHTMSSHHS-VHQNHHNSS 199
                +   HS        SS
Sbjct: 232 SSTAVLLKRHSGSSGASLISS 252


>gnl|CDD|216201 pfam00930, DPPIV_N, Dipeptidyl peptidase IV (DPP IV) N-terminal
           region.  This family is an alignment of the region to
           the N-terminal side of the active site. The Prosite
           motif does not correspond to this Pfam entry.
          Length = 348

 Score = 35.7 bits (83), Expect = 0.082
 Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 11/77 (14%)

Query: 625 SPDGSIIIVGC---------IAGRWLVMDSETREAYSIHTDGNEPIQVVKFSPDGKLLAL 675
           SPDG  +++               + + D ET     +   G   IQ  K+SPDG  LA 
Sbjct: 1   SPDGKYLLLATNYEKIWRHSYTASYYIYDLETGTVEPL-PPGEGKIQYAKWSPDGHKLAY 59

Query: 676 GSRDNYIYVYEVSGDSR 692
             RDN +YV E++    
Sbjct: 60  V-RDNNLYVRELATGKE 75


>gnl|CDD|227282 COG4946, COG4946, Uncharacterized protein related to the
           periplasmic component of the Tol biopolymer transport
           system [Function unknown].
          Length = 668

 Score = 35.6 bits (82), Expect = 0.11
 Identities = 27/101 (26%), Positives = 43/101 (42%), Gaps = 14/101 (13%)

Query: 624 FSPDGSIIIVGCIAGRWL-VMDSETREAYSIHTD-GNEPIQVVKFSPDGKLLALGSRDNY 681
              D    ++G   G  L + D +  E   I  D GN  I+ VK SPDGK + + +    
Sbjct: 367 IQVDPEGDVIGTNDGDKLGIYDKDGGEVKRIEKDLGN--IEAVKVSPDGKKVVVANDRFE 424

Query: 682 IYVYEVSGDSRKY---SRVGRCTGHSSFITHLDWSEDSMYI 719
           ++V ++   + +    S  G        IT  DW  +S +I
Sbjct: 425 LWVIDIDNGNVRLIDKSEYG-------LITDFDWHPNSRWI 458


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
          mutation causes misplacement of the septum during cell
          division, resulting in the formation of small,
          circular, anucleate mini-cells. Inactivation of divIVA
          produces a mini-cell phenotype, whereas overproduction
          of DivIVA results in a filamentation phenotype. These
          proteins appear to contain coiled-coils.
          Length = 131

 Score = 33.4 bits (77), Expect = 0.12
 Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 6/57 (10%)

Query: 19 LETENESLRERVSDLEKKVLEHGDEIVCLRSTL------ADVLRRLAQLESSKNIAN 69
          L  ENE L+E +  LE+K+ E+ +    L+ TL      A+ ++  AQ E+   I  
Sbjct: 37 LYKENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAELIIKE 93


>gnl|CDD|215562 PLN03078, PLN03078, Putative tRNA pseudouridine synthase;
           Provisional.
          Length = 513

 Score = 34.9 bits (80), Expect = 0.16
 Identities = 24/67 (35%), Positives = 31/67 (46%), Gaps = 5/67 (7%)

Query: 110 RDPVRRVVSYTPSSLPQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPPKTSWTSNLH 169
           R P      ++ +SL +R       T S  S SPSS+S+SP PSS    PP T  +    
Sbjct: 6   RAPWPLWRFWSSTSLLRRVNPRILRTSSSASSSPSSTSLSP-PSSD---PPLTDNSDQGG 61

Query: 170 TSKRWSS 176
             K W S
Sbjct: 62  GMK-WES 67


>gnl|CDD|222371 pfam13775, DUF4171, Domain of unknown function (DUF4171).  This
           short family is frequently found at the N-terminus of
           Homeobox proteins.
          Length = 128

 Score = 32.8 bits (74), Expect = 0.26
 Identities = 17/68 (25%), Positives = 26/68 (38%)

Query: 165 TSNLHTSKRWSSTGDFHHHTMSSHHSVHQNHHNSSGGGAKDAQYNEEESSLRMYLRGRPI 224
            S     K+ S       H  SS     + HH   G G+ D   ++ + S  +Y+   P+
Sbjct: 17  GSEPPLHKKPSKKLGEPSHASSSSSESRKEHHRKKGSGSSDKHSHKSKKSKPLYVNTEPL 76

Query: 225 NLYAPSDL 232
            L  P  L
Sbjct: 77  TLREPDGL 84


>gnl|CDD|237510 PRK13795, PRK13795, hypothetical protein; Provisional.
          Length = 636

 Score = 34.2 bits (79), Expect = 0.28
 Identities = 10/64 (15%), Positives = 28/64 (43%), Gaps = 12/64 (18%)

Query: 625 SPDGSIIIVGCIAGRWLVMDSETREA----YSIH----TDGNEPIQ----VVKFSPDGKL 672
             +G+  ++     +W+++D    E      ++      + +  I+    VV  + DG++
Sbjct: 113 RLEGAKRLLKKRLKKWVIVDKGALEPIKNGKNVLAPGVVEADLDIKKGDEVVVVTEDGEV 172

Query: 673 LALG 676
           + +G
Sbjct: 173 VGVG 176


>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
           This is a family of eukaryotic translation initiation
           factors.
          Length = 194

 Score = 33.0 bits (76), Expect = 0.37
 Identities = 19/70 (27%), Positives = 31/70 (44%), Gaps = 9/70 (12%)

Query: 653 IHTDGNEPIQVVKFSPDGKLLALGSRDN---YIYVYEVSGDSRKYSRVGRCTGHSSFITH 709
           IH+ G +P   + +SP G+L+ L    N    I  ++V        ++   T  +S  T 
Sbjct: 95  IHSLGEQPRNTIFWSPFGRLVLLAGFGNLAGQIEFWDVK----NKKKIA--TAEASNATD 148

Query: 710 LDWSEDSMYI 719
            +WS D  Y 
Sbjct: 149 CEWSPDGRYF 158


>gnl|CDD|219401 pfam07404, TEBP_beta, Telomere-binding protein beta subunit (TEBP
           beta).  This family consists of several telomere-binding
           protein beta subunits which appear to be specific to the
           family Oxytrichidae. Telomeres are specialised
           protein-DNA complexes that compose the ends of
           eukaryotic chromosomes. Telomeres protect chromosome
           termini from degradation and recombination and act
           together with telomerase to ensure complete genome
           replication. TEBP beta forms a complex with TEBP alpha
           and this complex is able to recognise and bind ssDNA to
           form a sequence-specific, telomeric nucleoprotein
           complex that caps the very 3' ends of chromosomes.
          Length = 375

 Score = 33.5 bits (76), Expect = 0.43
 Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 1/73 (1%)

Query: 110 RDPVRRVVSYTPSSLPQRRGVHYQS-TGSLHSDSPSSSSVSPVPSSPRPLPPKTSWTSNL 168
           R  V ++V YTP+    R+    +S  G   +   ++ S   VP++P P   K++ T++ 
Sbjct: 275 RKSVDKIVKYTPNKPSSRKETPQKSQAGKSSAKKTTTGSKKAVPANPSPSGKKSTKTTDQ 334

Query: 169 HTSKRWSSTGDFH 181
            T  ++    D+H
Sbjct: 335 MTMAQFKKYLDWH 347


>gnl|CDD|192630 pfam10568, Tom37, Outer mitochondrial membrane transport complex
           protein.  The TOM37 protein is one of the outer membrane
           proteins that make up the TOM complex for guiding
           cytosolic mitochondrial beta-barrel proteins from the
           cytosol across the outer mitochondrial membrane into the
           intramembrane space. In conjunction with TOM70 it guides
           peptides without an MTS into TOM40, the protein that
           forms the passage through the outer membrane. It has
           homology with Metaxin-1, also part of the outer
           mitochondrial membrane beta-barrel protein transport
           complex.
          Length = 71

 Score = 30.4 bits (69), Expect = 0.50
 Identities = 18/66 (27%), Positives = 24/66 (36%), Gaps = 16/66 (24%)

Query: 610 VWSKDIAEQAQSATFSPDGSIIIVGCIAGRWLVMDSETREAYSIHTDGNEPIQVVKFSPD 669
           VW  D         F    S I   C+A  W +      EA+ +    N  +     SP 
Sbjct: 3   VWGPD---------FGL--SSISPECLALIWYLKLCLPPEAFEVVFSNNTNL-----SPT 46

Query: 670 GKLLAL 675
           G+L AL
Sbjct: 47  GELPAL 52


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 33.4 bits (76), Expect = 0.60
 Identities = 20/78 (25%), Positives = 29/78 (37%), Gaps = 5/78 (6%)

Query: 142 SPSSSSVSP-VPSSPRPLPPKTSWTSNLHTSKRWSSTG--DFHHHTMSSHHSVHQNHHNS 198
           S   +S    VP          S +SN  T +   S    D HH     HHS+  N+H+S
Sbjct: 221 SKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHS 280

Query: 199 SG--GGAKDAQYNEEESS 214
                  +  +  E  S+
Sbjct: 281 KHADSKLQTIEVIENHSN 298


>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal.  Members
           of this family are found in the N terminal region of
           cytochrome B561, as well as in various other putative
           uncharacterized proteins.
          Length = 559

 Score = 33.2 bits (76), Expect = 0.62
 Identities = 22/82 (26%), Positives = 28/82 (34%), Gaps = 10/82 (12%)

Query: 103 PLNDQINRDPVRRVVSYTPSSLP-------QRRGVHY--QSTGSLHSDSPSSSSVSPVPS 153
           P+N      P     S +PS              +    Q T S    SPSSS  SP  S
Sbjct: 128 PMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTIQQSPQLTPSNKPASPSSSYQSPSYS 187

Query: 154 SPRPLPPKTSWTSNLHTSKRWS 175
           S    P  +S   +   S  W+
Sbjct: 188 SSLG-PVNSSGNRSNLRSSPWA 208


>gnl|CDD|234377 TIGR03866, PQQ_ABC_repeats, PQQ-dependent catabolism-associated
           beta-propeller protein.  Members of this protein family
           consist of seven repeats each of the YVTN family
           beta-propeller repeat (see TIGR02276). Members occur
           invariably as part of a transport operon that is
           associated with PQQ-dependent catabolism of alcohols
           such as phenylethanol.
          Length = 300

 Score = 32.7 bits (75), Expect = 0.63
 Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 8/89 (8%)

Query: 596 DRIVQLWDSMSHSVVWSKDIAEQAQSATFSPDGSIIIVGCI--AGRWLVMDSETREAYSI 653
           D  + + D+ +  V  +  + ++ +  TFS DG ++ V C   +    V+D  T E   +
Sbjct: 10  DNTISVIDTATLEVTRTFPVGQRPRGITFSKDGKLLYV-CASDSDTVQVIDPATGEV--L 66

Query: 654 HT--DGNEPIQVVKFSPDGKLLALGSRDN 680
           HT   G +P       P+GK+L + + D+
Sbjct: 67  HTLPSGPDPELFA-LHPNGKILYIANEDD 94


>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
          Length = 493

 Score = 32.9 bits (75), Expect = 0.66
 Identities = 21/69 (30%), Positives = 35/69 (50%), Gaps = 5/69 (7%)

Query: 573 GHTDELWALAAHPSLSQFL-TAGFDRIVQLWDSMSHSVVWS-KDIAEQAQSATFSPDGSI 630
           GHT ++  ++ HPS    L +AG D +V +WD      V   K  ++Q  S  ++ DGS+
Sbjct: 123 GHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSL 182

Query: 631 IIVGCIAGR 639
           +   C   +
Sbjct: 183 L---CTTSK 188


>gnl|CDD|166839 PRK00178, tolB, translocation protein TolB; Provisional.
          Length = 430

 Score = 32.8 bits (75), Expect = 0.67
 Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 10/64 (15%)

Query: 666 FSPDGKLLALG-SRDNY--IYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSED--SMYIR 720
           +SPDG  LA   S+D    IYV +++  SR+ SRV   T H +  T   W +D  ++Y  
Sbjct: 250 WSPDGSKLAFVLSKDGNPEIYVMDLA--SRQLSRV---TNHPAIDTEPFWGKDGRTLYFT 304

Query: 721 SNSG 724
           S+ G
Sbjct: 305 SDRG 308


>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper. 
          Length = 65

 Score = 29.8 bits (68), Expect = 0.70
 Identities = 7/19 (36%), Positives = 13/19 (68%)

Query: 19 LETENESLRERVSDLEKKV 37
          LE ENE L++ +  L +++
Sbjct: 38 LEAENERLKKEIERLRREL 56


>gnl|CDD|237616 PRK14127, PRK14127, cell division protein GpsB; Provisional.
          Length = 109

 Score = 30.8 bits (70), Expect = 0.83
 Identities = 14/52 (26%), Positives = 25/52 (48%), Gaps = 6/52 (11%)

Query: 17 DMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLA------DVLRRLAQLE 62
          + L+ EN  L+ +V +L K+V           +  +      D+L+RL+ LE
Sbjct: 47 EELQQENARLKAQVDELTKQVSVGASSSSVATTQPSSSATNYDILKRLSNLE 98


>gnl|CDD|234019 TIGR02800, propeller_TolB, tol-pal system beta propeller repeat
           protein TolB.  Members of this protein family are the
           TolB periplasmic protein of Gram-negative bacteria. TolB
           is part of the Tol-Pal (peptidoglycan-associated
           lipoprotein) multiprotein complex, comprising five
           envelope proteins, TolQ, TolR, TolA, TolB and Pal, which
           form two complexes. The TolQ, TolR and TolA
           inner-membrane proteins interact via their transmembrane
           domains. The {beta}-propeller domain of the periplasmic
           protein TolB is responsible for its interaction with
           Pal. TolB also interacts with the outer-membrane
           peptidoglycan-associated proteins Lpp and OmpA. TolA
           undergoes a conformational change in response to changes
           in the proton-motive force, and interacts with Pal in an
           energy-dependent manner. The C-terminal periplasmic
           domain of TolA also interacts with the N-terminal domain
           of TolB. The Tol-PAL system is required for bacterial
           outer membrane integrity. E. coli TolB is involved in
           the tonB-independent uptake of group A colicins
           (colicins A, E1, E2, E3 and K), and is necessary for the
           colicins to reach their respective targets after initial
           binding to the bacteria. It is also involved in uptake
           of filamentous DNA. Study of its structure suggest that
           the TolB protein might be involved in the recycling of
           peptidoglycan or in its covalent linking with
           lipoproteins. The Tol-Pal system is also implicated in
           pathogenesis of E. coli, Haemophilus ducreyi ,
           Salmonella enterica and Vibrio cholerae, but the
           mechanism(s) is unclear [Transport and binding proteins,
           Other, Cellular processes, Pathogenesis].
          Length = 417

 Score = 32.6 bits (75), Expect = 0.94
 Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 7/79 (8%)

Query: 621 SATFSPDG-SIIIVGCIAGR--WLVMDSETREAYSIHTDGNEPIQVVKFSPDGKLLALGS 677
             ++SPDG SI       G     +MD++  EA  +   GN        SPDG L+A   
Sbjct: 282 EPSWSPDGKSIAFTSDRGGSPQIYMMDADGGEARRLTFRGNYNASPSW-SPDGDLIAFVH 340

Query: 678 RDN---YIYVYEVSGDSRK 693
           R+     I V ++ G   +
Sbjct: 341 REGGGFNIAVMDLDGGGER 359


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
           eukaryotes, this family of proteins induces
           mitochondrial fission.
          Length = 248

 Score = 31.6 bits (72), Expect = 1.1
 Identities = 15/44 (34%), Positives = 16/44 (36%)

Query: 117 VSYTPSSLPQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPP 160
            S T   L     V   ST S     P+   V  VP  P P PP
Sbjct: 149 NSTTSDLLSSDESVPSSSTTSFPISPPTEEPVLEVPPPPPPPPP 192


>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1.  Mating in
           fungi is controlled by the loci that determine the
           mating type of an individual, and only individuals with
           differing mating types can mate. Basidiomycete fungi
           have evolved a unique mating system, termed tetrapolar
           or bifactorial incompatibility, in which mating type is
           determined by two unlinked loci; compatibility at both
           loci is required for mating to occur. The multi-allelic
           tetrapolar mating system is considered to be a novel
           innovation that could have only evolved once, and is
           thus unique to the mushroom fungi. This domain is
           C-terminal to the homeodomain transcription factor
           region.
          Length = 418

 Score = 32.1 bits (73), Expect = 1.3
 Identities = 26/116 (22%), Positives = 41/116 (35%), Gaps = 21/116 (18%)

Query: 73  GSPISSSPLLKNGTLKGKSFSSS-------KETLRYRPLNDQI--NRDPVRRVVSYTPSS 123
            S +  SP+L +     +  SSS        E    RP +D I  +  P +   +  PS 
Sbjct: 90  PSVLDLSPVLASPQTGKRRRSSSPSDDEDEAERPSKRPRSDSISSSSSPAKPPEACLPSP 149

Query: 124 LPQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPPKTSWTSNLHTSKRWSSTGD 179
                        S   +  S +S +P+P+     P   + T+     KR  S G 
Sbjct: 150 AA-----------STQDEL-SEASAAPLPTPSLSPPHTPTDTAPSGKRKRRLSDGF 193


>gnl|CDD|235312 PRK04792, tolB, translocation protein TolB; Provisional.
          Length = 448

 Score = 31.9 bits (73), Expect = 1.3
 Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 4/52 (7%)

Query: 665 KFSPDGKLLALG-SRDNYIYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSED 715
           +FSPDGK LAL  S+D    +Y V   ++  +R+   T H +  T   W  D
Sbjct: 268 RFSPDGKKLALVLSKDGQPEIYVVDIATKALTRI---TRHRAIDTEPSWHPD 316


>gnl|CDD|213367 cd12833, ZntB-like_1, Salmonella typhimurium Zn2+ transporter
           ZntB-like subgroup.  A bacterial subgroup belonging to
           the Escherichia coli CorA-Salmonella typhimurium
           ZntB_like family (EcCorA_ZntB-like) of the MIT
           superfamily of essential membrane proteins involved in
           transporting divalent cations (uptake or efflux) across
           membranes. This subgroup includes the Zn2+ transporter
           Salmonella typhimurium ZntB which mediates the efflux of
           Zn2+ (and Cd2+). Structures of the intracellular domain
           of Vibrio parahaemolyticus and Salmonella typhimurium
           ZntB form funnel-shaped homopentamers, the tip of the
           funnel is formed from two C-terminal transmembrane (TM)
           helices from each monomer, and the large opening of the
           funnel from the N-terminal cytoplasmic domains. The GMN
           signature motif of the MIT superfamily occurs just after
           TM1, mutation within this motif is known to abolish Mg2+
           transport through Salmonella typhimurium CorA, and
           Mrs2p. Natural variants such as GVN and GIN, which occur
           in proteins belonging to this subfamily, may be
           associated with the transport of different divalent
           cations, such as zinc and cadmium. The functional
           diversity of MIT transporters may also be due to minor
           structural differences regulating gating, substrate
           selection, and transport.
          Length = 290

 Score = 31.7 bits (73), Expect = 1.4
 Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 3/48 (6%)

Query: 14  LHDDMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQL 61
           L D M E   + L +R+ +LE++VLE  DE   LR  LA++ R+   L
Sbjct: 116 LTDRM-EDVIDELEDRLDELEERVLEEEDEE--LRGELAELRRQAIAL 160


>gnl|CDD|179892 PRK04922, tolB, translocation protein TolB; Provisional.
          Length = 433

 Score = 31.9 bits (73), Expect = 1.4
 Identities = 24/62 (38%), Positives = 32/62 (51%), Gaps = 6/62 (9%)

Query: 666 FSPDGKLLALG-SRDNYIYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSED--SMYIRSN 722
           FSPDG+ LAL  SRD    +Y +   SR   ++ R T H    T   W+ D  S+Y  S+
Sbjct: 255 FSPDGRRLALTLSRDGNPEIYVMDLGSR---QLTRLTNHFGIDTEPTWAPDGKSIYFTSD 311

Query: 723 SG 724
            G
Sbjct: 312 RG 313


>gnl|CDD|205130 pfam12894, Apc4_WD40, Anaphase-promoting complex subunit 4 WD40
           domain.  Apc4 contains an N-terminal propeller-shaped
           WD40 domain.The N-terminus of Afi1 serves to stabilise
           the union between Apc4 and Apc5, both of which lie
           towards the bottom-front of the APC,.
          Length = 47

 Score = 28.4 bits (64), Expect = 1.4
 Identities = 5/36 (13%), Positives = 18/36 (50%)

Query: 654 HTDGNEPIQVVKFSPDGKLLALGSRDNYIYVYEVSG 689
             +    ++++ + P   L+AL + D  + ++ ++ 
Sbjct: 7   EKNLPARVKILSWCPTMDLIALATEDGELLLHRLNW 42


>gnl|CDD|129323 TIGR00219, mreC, rod shape-determining protein MreC.  MreC (murein
           formation C) is involved in the rod shape determination
           in E. coli, and more generally in cell shape
           determination of bacteria whether or not they are
           rod-shaped. Cells defective in MreC are round. Species
           with MreC include many of the Proteobacteria,
           Gram-positives, and spirochetes [Cell envelope,
           Biosynthesis and degradation of murein sacculus and
           peptidoglycan].
          Length = 283

 Score = 31.4 bits (71), Expect = 1.5
 Identities = 29/118 (24%), Positives = 38/118 (32%), Gaps = 22/118 (18%)

Query: 17  DMLETENESLRE----RVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLESSKNIANNSF 72
           + LE EN  LR+    +   LE        E V LR  L   L       S++ I  N  
Sbjct: 69  NNLEYENYKLRQELLKKNQQLEILTQNLKQENVRLRELLNSPLSSDEYKISAEVIYLNY- 127

Query: 73  GSPISSSPLLKNGT----------------LKGKSFSSSKETLRYRPLNDQINRDPVR 114
               S+  ++  G                 L GK  S    T R   L D  N  P +
Sbjct: 128 -DNYSTQVVINKGFNDGVYKDMPVIADGKGLVGKVVSVGSNTSRVLLLTDYTNFVPAQ 184


>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566).  Family of
           related proteins that is plant specific.
          Length = 313

 Score = 31.5 bits (71), Expect = 1.6
 Identities = 40/187 (21%), Positives = 59/187 (31%), Gaps = 41/187 (21%)

Query: 48  RSTLADVLRRLAQLES--SKNIANNSFGSPISSSPLLKNGTLKGKSFSSSKETLRYRPLN 105
           RS+    L       S  S+    N+  SP SSSP         ++ SSS          
Sbjct: 18  RSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSP--------ARNTSSSSS-------- 61

Query: 106 DQINRDPVRRVVSYTPSSLPQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPPKTSWT 165
                   +R  S +   L  R              SPS  S S   S    L   ++  
Sbjct: 62  ---FGLSKQRPSSLSRGRLSSR------------FVSPSRGSPSAAASLNGSLATASTSG 106

Query: 166 SNLHTSKRWSSTGDFHHHTMSSHHSVHQNHHNSSGGGAKDAQYNEEESSLRM----YLRG 221
           S+  +  R +++ D       S  S   +       G K     E+   LR+     L+ 
Sbjct: 107 SSSPSRSRRTTSSDLSSGNGPSVLSFMAD----VKRGKKGPSKIEDAHQLRLLYNRLLQW 162

Query: 222 RPINLYA 228
           R +N  A
Sbjct: 163 RFVNARA 169


>gnl|CDD|237292 PRK13182, racA, polar chromosome segregation protein; Reviewed.
          Length = 175

 Score = 30.4 bits (69), Expect = 2.1
 Identities = 13/50 (26%), Positives = 29/50 (58%), Gaps = 5/50 (10%)

Query: 19  LETENESLRERVSDLEKKVLEHGDEIVCL-----RSTLADVLRRLAQLES 63
           LE +  ++  R+ +LE+++ +  D++V       R  + ++L RL +LE+
Sbjct: 90  LEAQLNTITRRLDELERQLQQKADDVVSYQLLQHRREMEEMLERLQKLEA 139


>gnl|CDD|115305 pfam06635, NolV, Nodulation protein NolV.  This family consists of
           several nodulation protein NolV sequences from different
           Rhizobium species. The function of this family is
           unclear.
          Length = 207

 Score = 30.8 bits (69), Expect = 2.4
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 304 CLSIHPNKLLIATGQVAGHDAREGKPHIRI 333
           CL + P K  +   + A  D  +G+P +RI
Sbjct: 134 CLHVSPLKADLLAREFADWDGLDGRPKLRI 163


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 31.2 bits (71), Expect = 2.9
 Identities = 15/53 (28%), Positives = 21/53 (39%), Gaps = 7/53 (13%)

Query: 20  ETENESLRERVSDLEKKVLEHGDEIVCLRSTLA-------DVLRRLAQLESSK 65
           E E E L E V DL +++ E  +E   L +           V  R  +LE   
Sbjct: 271 EREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRD 323



 Score = 29.6 bits (67), Expect = 9.0
 Identities = 12/46 (26%), Positives = 23/46 (50%)

Query: 17  DMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLE 62
           D LE   E LRE  ++LE ++ E  + +   R  + ++   + +L 
Sbjct: 352 DDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELR 397


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 31.0 bits (70), Expect = 3.4
 Identities = 25/69 (36%), Positives = 34/69 (49%), Gaps = 19/69 (27%)

Query: 11  AKLLHD-DMLETENESLRERVSDL------------EKKVLEHGDEIVCLRSTLADV--L 55
           A+LL D   LE EN SL+++VS L            E + LE  +E   L S L +   L
Sbjct: 93  ARLLQDYSELEEENISLQKQVSVLRQSQVEFEGLKHEIRRLE--EETELLNSQLEEAARL 150

Query: 56  RRLA--QLE 62
           + +A  QLE
Sbjct: 151 KEIAEKQLE 159


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 30.8 bits (70), Expect = 3.7
 Identities = 16/45 (35%), Positives = 27/45 (60%)

Query: 18  MLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLE 62
            LE E E L+ER+ +LE+ +     EI  ++S L ++  R+ +LE
Sbjct: 727 QLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELE 771


>gnl|CDD|223893 COG0823, TolB, Periplasmic component of the Tol biopolymer
           transport system [Intracellular trafficking and
           secretion].
          Length = 425

 Score = 30.5 bits (69), Expect = 3.8
 Identities = 23/98 (23%), Positives = 35/98 (35%), Gaps = 16/98 (16%)

Query: 665 KFSPDGKLLALGSRDNY---IYVYEVSGDSRKYSRVGRCTGHSSFITHLDWSEDSMYI-- 719
            FSPDG  LA  S  +    IY+ ++ G +       R T      T   WS D   I  
Sbjct: 244 AFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLP-----RLTNGFGINTSPSWSPDGSKIVF 298

Query: 720 RSNSGDYELLYWNAGTCRQIPQSSQLRDVVWESNSCTL 757
            S+ G    +Y          + SQ+  + +     + 
Sbjct: 299 TSDRGGRPQIY------LYDLEGSQVTRLTFSGGGNSN 330


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 30.5 bits (69), Expect = 4.4
 Identities = 16/44 (36%), Positives = 27/44 (61%)

Query: 19  LETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLE 62
           LE + +SL   ++ LE +++E  D++  LR  +AD+  RL  LE
Sbjct: 64  LEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALE 107


>gnl|CDD|225333 COG2706, COG2706, 3-carboxymuconate cyclase [Carbohydrate transport
           and metabolism].
          Length = 346

 Score = 30.0 bits (68), Expect = 4.5
 Identities = 13/81 (16%), Positives = 31/81 (38%), Gaps = 4/81 (4%)

Query: 643 MDSETREAYSIHT-DGNEPIQVVKFSPDGKLLALG---SRDNYIYVYEVSGDSRKYSRVG 698
           +D++T E   +           +  +PD + L +      +  +  Y +  D  + + + 
Sbjct: 23  LDTKTGELSLLQLVAELGNPTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLN 82

Query: 699 RCTGHSSFITHLDWSEDSMYI 719
           R T   S   ++   ED  ++
Sbjct: 83  RQTLPGSPPCYVSVDEDGRFV 103


>gnl|CDD|114269 pfam05538, Campylo_MOMP, Campylobacter major outer membrane
           protein.  This family consists of Campylobacter major
           outer membrane proteins. The major outer membrane
           protein (MOMP), a putative porin and a multifunction
           surface protein of Campylobacter jejuni, may play an
           important role in the adaptation of the organism to
           various host environments.
          Length = 437

 Score = 30.0 bits (67), Expect = 4.9
 Identities = 23/95 (24%), Positives = 36/95 (37%), Gaps = 6/95 (6%)

Query: 399 SVDTVVAAEFHPLDRNNIVTCGKSHINFWTIDAGGTLY--KKQGIFEMIRDKPKYVTCLA 456
           S+D  +  +    + N     G   +N W    GG  Y  K +     I D+    + LA
Sbjct: 271 SLDNELDDKLDAANGNFFALKGSIEVNGWDASLGGLYYGDKDKASLTTIEDQGNLGSLLA 330

Query: 457 ----FTQGGEVLSGDSNGNIIVWTRGSNTISKFIR 487
               F   G  L+GD   NI  +     T ++ +R
Sbjct: 331 GEEIFYTNGSNLNGDIGRNIFGYVTAGYTFNETVR 365


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 30.1 bits (68), Expect = 5.1
 Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 7/56 (12%)

Query: 20  ETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRL-------AQLESSKNIA 68
           E E E+L+E++ +LE  + E   EI  L S +  V   L        +LE    I 
Sbjct: 269 EAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIK 324


>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein.  This is is a family of
           Papillomavirus proteins, E4, coded for by ORF4. A splice
           variant, E1--E4, exists but neither the function of E4
           or E1--E4 is known.
          Length = 95

 Score = 28.0 bits (63), Expect = 5.2
 Identities = 13/41 (31%), Positives = 17/41 (41%), Gaps = 5/41 (12%)

Query: 125 PQRRGVHYQSTGSLHSDSPSSSSVSPVPSSPRPLPPKTSWT 165
           P++R         L SDS SSS+ SP          +  WT
Sbjct: 34  PKKRCRR-----RLESDSDSSSTTSPSSPPTPQPTTECPWT 69


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score = 30.3 bits (68), Expect = 5.3
 Identities = 33/113 (29%), Positives = 54/113 (47%), Gaps = 9/113 (7%)

Query: 660 PIQVVKFSP-DGKLLALGSRDNYIYVYEV--SGDSRKYSRVGRC--TGHSSFITHLDWSE 714
            I  ++F+P   ++LA GS D  I V+E+  + +S K  +  +C   GH   I+ +DW+ 
Sbjct: 76  SILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNP 135

Query: 715 DSMYIRSNSG-DYELLYW---NAGTCRQIPQSSQLRDVVWESNSCTLTFTTIG 763
            + YI  +SG D  +  W   N     QI    +L  + W      L+ T +G
Sbjct: 136 MNYYIMCSSGFDSFVNIWDIENEKRAFQINMPKKLSSLKWNIKGNLLSGTCVG 188


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 30.1 bits (67), Expect = 5.7
 Identities = 39/154 (25%), Positives = 57/154 (37%), Gaps = 32/154 (20%)

Query: 4   GDKDSVYAKLLHDDMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLES 63
           G  DSV   + HD M    ++SLR  V D              +R    D +R  A    
Sbjct: 591 GVHDSVRGGV-HDSMRSGVHDSLRGGVHD-------------SIRGGAHDGVRSGAHDGV 636

Query: 64  SKNIANNSFGSPISSSPLLKNGTLKGKSFSSSKETLRYRPLNDQINRDPVRRVVSYTPSS 123
            K+      G   + +P    GT  G +FS  K++  +              V SY P +
Sbjct: 637 RKSSGVKVEGMGNARAPPPNQGTHDG-NFSRGKDSSDH--------------VSSYKPYA 681

Query: 124 LPQRRGVHYQST---GSLHSDSPSSSSVSPVPSS 154
              R+  +YQ T     +H    +S   + VPSS
Sbjct: 682 YDSRKAKNYQDTFDRNDMHGYMKNSPISTIVPSS 715


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 30.1 bits (68), Expect = 5.8
 Identities = 16/50 (32%), Positives = 25/50 (50%)

Query: 17  DMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLESSKN 66
           + LE E E L+E + +LE +  E  DE+  L     ++   L +LES   
Sbjct: 852 EELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELA 901


>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein. 
          Length = 326

 Score = 29.9 bits (67), Expect = 5.8
 Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 3/77 (3%)

Query: 2  NLGDKDSVYAKLLHDDMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQL 61
          N GD   +Y +L +   LE+   SL E VS +  ++ +   ++  + ++L D    L  L
Sbjct: 26 NPGDLTPIYERLTN---LESATASLSESVSTISSRLSDLSADLQDMTTSLDDAKATLNGL 82

Query: 62 ESSKNIANNSFGSPISS 78
           ++      S  S  S 
Sbjct: 83 ITTVQALQTSVDSLSSE 99


>gnl|CDD|152790 pfam12355, Dscam_C, Down syndrome cell adhesion molecule C
           terminal.  This domain family is found in eukaryotes,
           and is approximately 120 amino acids in length. The
           family is found in association with pfam00047,
           pfam07679, pfam00041. The Down syndrome cell adhesion
           molecule (Dscam) belongs to a family of cell membrane
           molecules involved in the differentiation of the nervous
           system. This is the C terminal cytoplasmic tail region
           of Dscam.
          Length = 125

 Score = 28.6 bits (63), Expect = 6.4
 Identities = 13/30 (43%), Positives = 15/30 (50%)

Query: 131 HYQSTGSLHSDSPSSSSVSPVPSSPRPLPP 160
           HY S GS+   S  S+   P   SP P PP
Sbjct: 40  HYGSRGSMGRRSIGSARNIPGNGSPEPPPP 69


>gnl|CDD|180423 PRK06143, PRK06143, enoyl-CoA hydratase; Provisional.
          Length = 256

 Score = 29.6 bits (67), Expect = 6.5
 Identities = 19/60 (31%), Positives = 28/60 (46%), Gaps = 9/60 (15%)

Query: 561 GSLNLGFSPVVLGHTDELWALAAHPSLSQFLTAGFDRIVQLWDSMSHSVVWSKDIAEQAQ 620
           GSLN+  +PV+L  T  L  LAA P +         R++ L  +   + +   DI E A 
Sbjct: 26  GSLNILGTPVILALTQALRWLAADPDV---------RVLVLRGAGEKAFIGGADIKEMAT 76


>gnl|CDD|218912 pfam06148, COG2, COG (conserved oligomeric Golgi) complex
           component, COG2.  The COG complex comprises eight
           proteins COG1-8. The COG complex plays critical roles in
           Golgi structure and function. The proposed function of
           the complex is to mediate the initial physical contact
           between transport vesicles and their membrane targets. A
           comparable role in tethering vesicles has been suggested
           for at least six additional large multisubunit
           complexes, including the exocyst, a complex that
           mediates trafficking to the plasma membrane. COG2
           structure reveals a six-helix bundle with few conserved
           surface features but a general resemblance to recently
           determined crystal structures of four different exocyst
           subunits. These bundles inCOG2 may act as platforms for
           interaction with other trafficing proteins including
           SNAREs (soluble N-ethylmaleimide factor attachment
           protein receptors) and Rabs.
          Length = 133

 Score = 28.4 bits (64), Expect = 6.7
 Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 14/76 (18%)

Query: 10  YAKLLHDDMLETENE-------------SLRERVSDLEKKVLEHGDEIVCLRSTLADVLR 56
           Y KLL  +++E  NE              L E++  +   +L    E+  +RS L   L 
Sbjct: 38  YLKLLKSELIELINEDYADFVSLSTNLVGLDEKLERIRVPLLALRKEVESVRSELEARLE 97

Query: 57  RL-AQLESSKNIANNS 71
            + A+LE  K++    
Sbjct: 98  EVEAKLEEKKDLREKK 113


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 29.7 bits (67), Expect = 7.8
 Identities = 20/105 (19%), Positives = 30/105 (28%), Gaps = 12/105 (11%)

Query: 16  DDMLETENESLRERVSDLEKKVLEHGDEIVCLRSTLADVLRRLAQLESSKNIANNS---- 71
            D LE E       VS   +  LE        +  L     +L   +S+  I +      
Sbjct: 13  ADALEEELLISEGGVSSALEVNLELELIEGEKKLELVVCDTKLGNADSAFEIPSEVGEDL 72

Query: 72  -------FGSPISSSPLLKNGTLKGKSFSSSKETLRYRP-LNDQI 108
                   G   SS        L G     S+  +R      D++
Sbjct: 73  RENAEEALGELSSSDEDDYYRMLLGLGHELSRIKVREEVDKEDKL 117


>gnl|CDD|222217 pfam13552, DUF4127, Protein of unknown function (DUF4127).  This
           family of uncharacterized bacterial proteins are about
           500 amino acids in length.
          Length = 495

 Score = 29.6 bits (67), Expect = 8.5
 Identities = 12/28 (42%), Positives = 15/28 (53%), Gaps = 1/28 (3%)

Query: 222 RPINLYAPSDLAESYDITKVAAPPQNKL 249
           RP N   P+DLA+   I  V  PP+  L
Sbjct: 10  RPCNYDFPADLAKIAGIE-VLTPPKELL 36


>gnl|CDD|224614 COG1700, COG1700, Uncharacterized conserved protein [Function
          unknown].
          Length = 503

 Score = 29.1 bits (65), Expect = 9.9
 Identities = 13/40 (32%), Positives = 17/40 (42%)

Query: 6  KDSVYAKLLHDDMLETENESLRERVSDLEKKVLEHGDEIV 45
          +D  Y     + ML  E E      S  + KV  HGDE+ 
Sbjct: 13 EDCPYYDNESEIMLIEETELFETEGSLKDVKVEPHGDELF 52


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.133    0.403 

Gapped
Lambda     K      H
   0.267   0.0799    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 40,958,247
Number of extensions: 3960391
Number of successful extensions: 4007
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3912
Number of HSP's successfully gapped: 96
Length of query: 811
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 706
Effective length of database: 6,280,432
Effective search space: 4433984992
Effective search space used: 4433984992
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 63 (27.9 bits)