RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11552
         (714 letters)



>gnl|CDD|214539 smart00147, RasGEF, Guanine nucleotide exchange factor for Ras-like
           small GTPases. 
          Length = 242

 Score =  205 bits (523), Expect = 3e-61
 Identities = 76/198 (38%), Positives = 113/198 (57%), Gaps = 13/198 (6%)

Query: 467 RYKFRGPNRFQQITANLDVFLRRFNEIQYWVITEILLVTSLNKRVQILRKMIKLAAYCKE 526
              +   ++      NL+ F+RRFNE+  WV TEIL  T+   R ++L K I++A +C+E
Sbjct: 31  GSVWGKRSKKSPSPLNLEAFIRRFNEVSNWVATEILKQTTPKDRAELLSKFIQVAKHCRE 90

Query: 527 YRNINALFAVLMGLSNVAVSRLSLTWDKLPSKSKKTYTELEALIDPSKNHRAYRQAVSKL 586
             N N+L A++  LS+  +SRL  TW+KLPSK KK + ELE L+ P +N++ YR+A+S  
Sbjct: 91  LNNFNSLMAIVSALSSSPISRLKKTWEKLPSKYKKLFEELEELLSPERNYKNYREALSSC 150

Query: 587 -QSPVIPFMPLLLKDLAFTHDGNKTVV-DGLVNFEKMHMFAQTLRTLRYCRTRHLDHDKT 644
              P IPF+ +LLKDL F  +GN   + +GLVNFEK    A+ LR +R  +++       
Sbjct: 151 NLPPCIPFLGVLLKDLTFIDEGNPDFLENGLVNFEKRRQIAEILREIRQLQSQPY----- 205

Query: 645 VVDGLVNFEKMHMFAQTL 662
                 N        Q+L
Sbjct: 206 ------NLRPNRSDIQSL 217


>gnl|CDD|216025 pfam00617, RasGEF, RasGEF domain.  Guanine nucleotide exchange
           factor for Ras-like small GTPases.
          Length = 185

 Score =  200 bits (510), Expect = 3e-60
 Identities = 65/151 (43%), Positives = 97/151 (64%), Gaps = 1/151 (0%)

Query: 474 NRFQQITANLDVFLRRFNEIQYWVITEILLVTSLNKRVQILRKMIKLAAYCKEYRNINAL 533
            +  +++ N++ F+ RFN++  WV TEIL    L KR ++++K IK+A +C+E  N N+L
Sbjct: 35  KKDSKLSPNIEAFISRFNQLSRWVATEILSEEDLKKRAKVIKKFIKIAQHCRELNNFNSL 94

Query: 534 FAVLMGLSNVAVSRLSLTWDKLPSKSKKTYTELEALIDPSKNHRAYRQAVSKLQSPVIPF 593
            A++ GL++  +SRL  TW+ +  K KK   ELE L+ PSKN + YR+A+ K   P IPF
Sbjct: 95  MAIISGLNSSPISRLKKTWELVSKKYKKLLEELEELMSPSKNFKNYREALKKANPPCIPF 154

Query: 594 MPLLLKDLAFTHDGNKTVVD-GLVNFEKMHM 623
           + L LKDL F  +GN   +D GL+NFEK   
Sbjct: 155 LGLYLKDLTFIEEGNPDFLDNGLINFEKRRK 185


>gnl|CDD|238087 cd00155, RasGEF, Guanine nucleotide exchange factor for Ras-like
           small GTPases. Small GTP-binding proteins of the Ras
           superfamily function as molecular switches in
           fundamental events such as signal transduction,
           cytoskeleton dynamics and intracellular trafficking.
           Guanine-nucleotide-exchange factors (GEFs) positively
           regulate these GTP-binding proteins in response to a
           variety of signals. GEFs catalyze the dissociation of
           GDP from the inactive GTP-binding proteins. GTP can then
           bind and induce structural changes that allow
           interaction with effectors.
          Length = 237

 Score =  196 bits (501), Expect = 4e-58
 Identities = 72/188 (38%), Positives = 114/188 (60%), Gaps = 5/188 (2%)

Query: 480 TANLDVFLRRFNEIQYWVITEILLVTSLNKRVQILRKMIKLAAYCKEYRNINALFAVLMG 539
           + NL+ F+ RFN +  WV +EILL T+  KR ++L K I++A +C+E  N N+L A++  
Sbjct: 44  SPNLERFIERFNNLSNWVASEILLCTNPKKRARLLSKFIQVAKHCRELNNFNSLMAIVSA 103

Query: 540 LSNVAVSRLSLTWDKLPSKSKKTYTELEALIDPSKNHRAYRQAVSKL--QSPVIPFMPLL 597
           LS+  +SRL  TW+ L SK KK + ELE L+DPS+N + YR+ +  +    P +PF+ + 
Sbjct: 104 LSSSPISRLKKTWEVLSSKLKKLFEELEELVDPSRNFKNYRKLLKSVGPNPPCVPFLGVY 163

Query: 598 LKDLAFTHDGNKTVVD-GLVNFEKMHMFAQTLRTLRYCRTRH--LDHDKTVVDGLVNFEK 654
           LKDL F H+GN   ++  LVNFEK    A+ LR +R  ++    L+ D+ ++  L    +
Sbjct: 164 LKDLTFLHEGNPDFLEGNLVNFEKRRKIAEILREIRQLQSNSYELNRDEDILAFLWKLLE 223

Query: 655 MHMFAQTL 662
           + +    L
Sbjct: 224 LILNEDEL 231


>gnl|CDD|239884 cd04437, DEP_Epac, DEP (Dishevelled, Egl-10, and Pleckstrin) domain
           found in Epac-like proteins. Epac (exchange proteins
           directly activated by cAMP) proteins are GEFs
           (guanine-nucleotide-exchange factors) for the small
           GTPases, Rap1 and Rap2. They are directly regulated by
           cyclic AMP, a second messenger that plays a role in the
           control of diverse cellular processes, such as cell
           adhesion and insulin secretion.  Epac-like proteins
           share a common domain architecture, containing RasGEF,
           DEP and CAP-effector (cAMP binding) domains. The DEP
           domain is involved in membrane localization.
          Length = 125

 Score =  115 bits (291), Expect = 2e-30
 Identities = 48/106 (45%), Positives = 63/106 (59%), Gaps = 5/106 (4%)

Query: 143 TIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCILYN 202
              R+C  G+ELVDWL+  +P +  SR    GMWQ LLEEGV+ HV+ E  F+DK   Y 
Sbjct: 25  RTYRQCCVGTELVDWLLQQSPCVQ-SRSQAVGMWQVLLEEGVLLHVDQELHFQDKYQFYR 83

Query: 203 FWQDKEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRK 248
           F  D+          +  EAEE L+EA+  L++  PDA+LR ILRK
Sbjct: 84  FSDDECSP----APLEKREAEEELQEAVTLLSQLGPDALLRMILRK 125



 Score = 81.6 bits (202), Expect = 2e-18
 Identities = 33/71 (46%), Positives = 42/71 (59%), Gaps = 3/71 (4%)

Query: 44  AGWVLRTLLLNDESGTLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMW 103
           AG  LR  +L+D    +RDRK        R+C  G+ELVDWL+  +P +  SR    GMW
Sbjct: 1   AGRALRNAILSDAPHLIRDRKYHL--RTYRQCCVGTELVDWLLQQSPCVQ-SRSQAVGMW 57

Query: 104 QALLEEGVIYH 114
           Q LLEEGV+ H
Sbjct: 58  QVLLEEGVLLH 68


>gnl|CDD|237999 cd00038, CAP_ED, effector domain of the CAP family of transcription
           factors; members include CAP (or cAMP receptor protein
           (CRP)), which binds cAMP, FNR (fumarate and nitrate
           reduction), which uses an iron-sulfur cluster to sense
           oxygen) and CooA, a heme containing CO sensor. In all
           cases binding of the effector leads to conformational
           changes and the ability to activate transcription.
           Cyclic nucleotide-binding domain similar to CAP are also
           present in cAMP- and cGMP-dependent protein kinases
           (cAPK and cGPK) and vertebrate cyclic nucleotide-gated
           ion-channels.  Cyclic nucleotide-monophosphate binding
           domain; proteins that bind cyclic nucleotides (cAMP or
           cGMP) share a structural domain of about 120 residues;
           the best studied is the prokaryotic catabolite gene
           activator, CAP, where such a domain is known to be
           composed of three alpha-helices and a distinctive
           eight-stranded, antiparallel beta-barrel structure;
           three conserved glycine residues are thought to be
           essential for maintenance of the structural integrity of
           the beta-barrel; CooA is a homodimeric transcription
           factor that belongs to CAP family; cAMP- and
           cGMP-dependent protein kinases (cAPK and cGPK) contain
           two tandem copies of the cyclic nucleotide-binding
           domain; cAPK's are composed of two different subunits, a
           catalytic chain and a regulatory chain, which contains
           both copies of the domain; cGPK's are single chain
           enzymes that include the two copies of the domain in
           their N-terminal section; also found in vertebrate
           cyclic nucleotide-gated ion-channels.
          Length = 115

 Score = 62.0 bits (151), Expect = 9e-12
 Identities = 28/81 (34%), Positives = 39/81 (48%), Gaps = 5/81 (6%)

Query: 310 PLHHLSNSVRRELAGVVMFEAHPRKGEILFHQGDEGKSWYIIIQGSVDVVIYGKG----C 365
               L +    ELA   + E     GE++  QGD   S YI++ GSV+V    +      
Sbjct: 1   LFSGLDDEELEELADA-LEERRFPAGEVIIRQGDPADSLYIVLSGSVEVYKLDEDGREQI 59

Query: 366 VTSLYAGEDFGKLALVNNAPR 386
           V  L  G+ FG+LAL+ N PR
Sbjct: 60  VGFLGPGDLFGELALLGNGPR 80



 Score = 47.3 bits (113), Expect = 2e-06
 Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 4/55 (7%)

Query: 391 SMVFHQGDEGKSWYIIIQGSVDVVIYGKG----CVTSLYAGEDFGKLALVNNAPS 441
            ++  QGD   S YI++ GSV+V    +      V  L  G+ FG+LAL+ N P 
Sbjct: 26  EVIIRQGDPADSLYIVLSGSVEVYKLDEDGREQIVGFLGPGDLFGELALLGNGPR 80


>gnl|CDD|239836 cd04371, DEP, DEP domain, named after Dishevelled, Egl-10, and
           Pleckstrin, where this domain was first discovered. The
           function of this domain is still not clear, but it is
           believed to be important for the membrane association of
           the signaling proteins in which it is present. New
           studies show that the DEP domain of Sst2, a yeast RGS
           protein is necessary and sufficient for receptor
           interaction.
          Length = 81

 Score = 57.7 bits (140), Expect = 1e-10
 Identities = 25/63 (39%), Positives = 37/63 (58%), Gaps = 4/63 (6%)

Query: 140 GGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHV-NGEQAFRDKC 198
             +T    C +GSELVDWL+      A++R+    + QALL+ G+I+HV + +  FRD  
Sbjct: 21  HLKTY-PNCFTGSELVDWLLDN--LEAITREEAVELGQALLKHGLIHHVSDDKHTFRDSY 77

Query: 199 ILY 201
            LY
Sbjct: 78  ALY 80



 Score = 51.2 bits (123), Expect = 3e-08
 Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 4/68 (5%)

Query: 47  VLRTLLLNDESGTLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQAL 106
           ++R +L +D    ++DRK    +T    C +GSELVDWL+      A++R+    + QAL
Sbjct: 2   LVRIMLDSDSGVPIKDRKY-HLKTY-PNCFTGSELVDWLLDN--LEAITREEAVELGQAL 57

Query: 107 LEEGVIYH 114
           L+ G+I+H
Sbjct: 58  LKHGLIHH 65


>gnl|CDD|216020 pfam00610, DEP, Domain found in Dishevelled, Egl-10, and Pleckstrin
           (DEP).  The DEP domain is responsible for mediating
           intracellular protein targeting and regulation of
           protein stability in the cell. The DEP domain is present
           in a number of signaling molecules, including Regulator
           of G protein Signaling (RGS) proteins, and has been
           implicated in membrane targeting. New findings in yeast,
           however, demonstrate a major role for a DEP domain in
           mediating the interaction of an RGS protein to the
           C-terminal tail of a GPCR, thus placing RGS in close
           proximity with its substrate G protein alpha subunit.
          Length = 74

 Score = 56.5 bits (137), Expect = 3e-10
 Identities = 24/66 (36%), Positives = 34/66 (51%), Gaps = 3/66 (4%)

Query: 140 GGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGE-QAFRD-K 197
             +T    C +GSE VDWLM     L + R+    + Q LL+ G+I+HV  + + F D K
Sbjct: 10  HLKTY-PNCFTGSEAVDWLMDNFEGLVIDREEAVELGQLLLDHGLIHHVGDKHRGFLDSK 68

Query: 198 CILYNF 203
              Y F
Sbjct: 69  YSFYRF 74



 Score = 53.8 bits (130), Expect = 3e-09
 Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 2/56 (3%)

Query: 59  TLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYH 114
            L+DRK    +T    C +GSE VDWLM     L + R+    + Q LL+ G+I+H
Sbjct: 3   KLKDRK-KHLKTY-PNCFTGSEAVDWLMDNFEGLVIDREEAVELGQLLLDHGLIHH 56


>gnl|CDD|197516 smart00100, cNMP, Cyclic nucleotide-monophosphate binding domain.
           Catabolite gene activator protein (CAP) is a prokaryotic
           homologue of eukaryotic cNMP-binding domains, present in
           ion channels, and cNMP-dependent kinases.
          Length = 120

 Score = 56.6 bits (137), Expect = 7e-10
 Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 5/78 (6%)

Query: 313 HLSNSVRRELAGVVMFEAHPRKGEILFHQGDEGKSWYIIIQGSVDVVI---YGKGCVTSL 369
           +L     RELA   +       GE++  QGD G S+YII+ G V+V      G+  +   
Sbjct: 4   NLDAEELRELADA-LEPVRYPAGEVIIRQGDVGDSFYIIVSGEVEVYKVLEDGEEQIVGT 62

Query: 370 YAGED-FGKLALVNNAPR 386
               D FG+LAL+ N+ R
Sbjct: 63  LGPGDFFGELALLTNSRR 80



 Score = 44.7 bits (106), Expect = 1e-05
 Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 393 VFHQGDEGKSWYIIIQGSVDVVI---YGKGCVTSLYAGED-FGKLALVNNAPSYQ 443
           +  QGD G S+YII+ G V+V      G+  +       D FG+LAL+ N+    
Sbjct: 28  IIRQGDVGDSFYIIVSGEVEVYKVLEDGEEQIVGTLGPGDFFGELALLTNSRRAA 82


>gnl|CDD|214489 smart00049, DEP, Domain found in Dishevelled, Egl-10, and
           Pleckstrin.  Domain of unknown function present in
           signalling proteins that contain PH, rasGEF, rhoGEF,
           rhoGAP, RGS, PDZ domains. DEP domain in Drosophila
           dishevelled is essential to rescue planar polarity
           defects and induce JNK signalling (Cell 94, 109-118).
          Length = 77

 Score = 55.0 bits (133), Expect = 1e-09
 Identities = 26/63 (41%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 143 TIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNG--EQAFRDKCIL 200
                C +GSELVDWLM     +   R+    + Q LL+EG+I+HVNG  +  F+D   L
Sbjct: 15  KTYPNCFTGSELVDWLMDNLEII--DREEAVHLGQLLLDEGLIHHVNGPNKHTFKDSKAL 72

Query: 201 YNF 203
           Y F
Sbjct: 73  YRF 75



 Score = 45.7 bits (109), Expect = 2e-06
 Identities = 21/60 (35%), Positives = 31/60 (51%), Gaps = 3/60 (5%)

Query: 55  DESGTLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYH 114
            E+G     +    +T    C +GSELVDWLM     +   R+    + Q LL+EG+I+H
Sbjct: 1   PETGLKLRDRKYFLKTY-PNCFTGSELVDWLMDNLEII--DREEAVHLGQLLLDEGLIHH 57


>gnl|CDD|215664 pfam00027, cNMP_binding, Cyclic nucleotide-binding domain. 
          Length = 91

 Score = 51.1 bits (123), Expect = 3e-08
 Identities = 21/58 (36%), Positives = 32/58 (55%), Gaps = 4/58 (6%)

Query: 333 RKGEILFHQGDEGKSWYIIIQGSVDVVIYGKG----CVTSLYAGEDFGKLALVNNAPR 386
           +KGE++F +GD   S YI++ G V V    +      +  L  G+ FG+LAL+   PR
Sbjct: 5   KKGEVIFREGDPADSLYIVLSGKVKVYKLDEDGREQILAFLGPGDFFGELALLGGEPR 62



 Score = 42.7 bits (101), Expect = 3e-05
 Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 4/55 (7%)

Query: 391 SMVFHQGDEGKSWYIIIQGSVDVVIYGKG----CVTSLYAGEDFGKLALVNNAPS 441
            ++F +GD   S YI++ G V V    +      +  L  G+ FG+LAL+   P 
Sbjct: 8   EVIFREGDPADSLYIVLSGKVKVYKLDEDGREQILAFLGPGDFFGELALLGGEPR 62


>gnl|CDD|239890 cd04443, DEP_GPR155, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain found in GPR155-like proteins. GRP155-like
           proteins, also known as PGR22, contain an N-terminal
           permease domain, a central transmembrane region and a
           C-terminal DEP domain. They are orphan receptors of the
           class B G protein-coupled receptors. Their function is
           unknown.
          Length = 83

 Score = 46.6 bits (111), Expect = 1e-06
 Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 2/63 (3%)

Query: 141 GRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCIL 200
           G    +    G +LV WL+ +   LA  R       + LL+ GV+ H+  E  FRD+ +L
Sbjct: 23  GLRTYKGVFCGCDLVSWLIEVG--LAQDRGEAVLYGRRLLQGGVLQHITNEHHFRDENLL 80

Query: 201 YNF 203
           Y F
Sbjct: 81  YRF 83



 Score = 28.1 bits (63), Expect = 3.5
 Identities = 14/47 (29%), Positives = 21/47 (44%), Gaps = 2/47 (4%)

Query: 68  GRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYH 114
           G    +    G +LV WL+ +   LA  R       + LL+ GV+ H
Sbjct: 23  GLRTYKGVFCGCDLVSWLIEVG--LAQDRGEAVLYGRRLLQGGVLQH 67


>gnl|CDD|239886 cd04439, DEP_1_P-Rex, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain 1 found in P-Rex-like proteins. The P-Rex family
           is the guanine-nucleotide exchange factor (GEF) for the
           small GTPase Rac that contains an N-terminal RhoGEF
           domain, two DEP and PDZ domains. Rac-GEF activity is
           stimulated by phosphatidylinositol (3,4,5)-trisphosphate
           (PtdIns(3,4,5)P3), a lipid second messenger, and by the
           G beta-gamma subunits of heterotrimeric G proteins. The
           DEP domains are not involved in mediating these stimuli,
           but may be of importance for basal and stimulated levels
           Rac-GEF activity.
          Length = 81

 Score = 44.5 bits (105), Expect = 6e-06
 Identities = 20/61 (32%), Positives = 34/61 (55%), Gaps = 2/61 (3%)

Query: 143 TIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCILYN 202
           +   +C  G+E V WL+ +         +  G  QALLE G+I+HV+ +  F+++ +LY 
Sbjct: 23  STFPKCFLGNEFVSWLLEIGEISKPEEGVNLG--QALLENGIIHHVSDKHQFKNEQVLYR 80

Query: 203 F 203
           F
Sbjct: 81  F 81



 Score = 34.9 bits (80), Expect = 0.014
 Identities = 17/63 (26%), Positives = 31/63 (49%), Gaps = 4/63 (6%)

Query: 52  LLNDESGTLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGV 111
           ++  +   ++DR+     +   +C  G+E V WL+ +         +  G  QALLE G+
Sbjct: 7   MMCKQGSLIKDRRRK--LSTFPKCFLGNEFVSWLLEIGEISKPEEGVNLG--QALLENGI 62

Query: 112 IYH 114
           I+H
Sbjct: 63  IHH 65


>gnl|CDD|239888 cd04441, DEP_2_DEP6, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain 2 found in DEP6-like proteins. DEP6 proteins
           contain two DEP and a PDZ domain. Their function is
           unknown.
          Length = 85

 Score = 43.6 bits (103), Expect = 2e-05
 Identities = 23/58 (39%), Positives = 31/58 (53%), Gaps = 2/58 (3%)

Query: 146 RRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCILYNF 203
            R   GSE +DWL  L    A SR+    + + LLE G+I HV+ +  F D  +LY F
Sbjct: 30  ERTFVGSEFIDWL--LQEGEAESRREAVQLCRRLLEHGIIQHVSNKHHFFDSNLLYQF 85



 Score = 35.5 bits (82), Expect = 0.011
 Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 4/63 (6%)

Query: 52  LLNDESGTLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGV 111
           L++ E+  L+ R+  G +    R   GSE +DWL  L    A SR+    + + LLE G+
Sbjct: 11  LMSTENSILQVREEEGVKY--ERTFVGSEFIDWL--LQEGEAESRREAVQLCRRLLEHGI 66

Query: 112 IYH 114
           I H
Sbjct: 67  IQH 69


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 41.4 bits (98), Expect = 7e-04
 Identities = 32/114 (28%), Positives = 55/114 (48%), Gaps = 19/114 (16%)

Query: 206 DKEGSSSQATAQDI-----AEAEEHLEEALLALARRAPDAILRY---------ILRKQLK 251
           + + SS++  A+ I      EAE   +EALL     A + I +            R +L+
Sbjct: 26  EAKISSAEEEAKQIIEEAKKEAEALKKEALLE----AKEEIHKLRAEAERELKERRNELQ 81

Query: 252 KLQNRYLPREYML-HLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
           + + R L +E  L   +  +E KEE  +E+E+E     + L+   E+LE++ EE
Sbjct: 82  RQEKRLLQKEETLDRKDESLEKKEESLEEKEKELAARQQQLEEKEEELEELIEE 135


>gnl|CDD|239895 cd04448, DEP_PIKfyve, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain found in fungal RhoGEF (GDP/GTP exchange factor)
           PIKfyve-like proteins. PIKfyve contains N-terminal Fyve
           finger and DEP domains, a central chaperonin-like domain
           and a C-terminal PIPK (phosphatidylinositol phosphate
           kinase) domain. PIKfyve-like proteins are important
           phosphatidylinositol (3)-monophosphate
           (PtdIns(3)P)-5-kinases, producing PtdIns(3,5)P2, which
           plays a major role in multivesicular body (MVB) sorting
           and control of retrograde traffic from the vacuole back
           to the endosome and/or Golgi. PIKfyve itself has been
           shown to be play a role in regulating
           early-endosome-to-trans-Golgi network (TGN) retrograde
           trafficking.
          Length = 81

 Score = 37.8 bits (88), Expect = 0.001
 Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 2/54 (3%)

Query: 148 CASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCILY 201
           C  G ELV+WL  +    A +R     + QALL+ G I  V+ +  FRD+  LY
Sbjct: 28  CILGKELVNWL--IRQGKAATRVQAIAIGQALLDAGWIECVSDDDLFRDEYALY 79



 Score = 30.9 bits (70), Expect = 0.34
 Identities = 15/38 (39%), Positives = 20/38 (52%), Gaps = 2/38 (5%)

Query: 75  CASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVI 112
           C  G ELV+WL  +    A +R     + QALL+ G I
Sbjct: 28  CILGKELVNWL--IRQGKAATRVQAIAIGQALLDAGWI 63


>gnl|CDD|239889 cd04442, DEP_1_DEP6, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain 1 found in DEP6-like proteins. DEP6 proteins
           contain two DEP and a PDZ domain. Their function is
           unknown.
          Length = 82

 Score = 38.0 bits (88), Expect = 0.001
 Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 3/57 (5%)

Query: 148 CASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGE-QAFRDKCILYNF 203
           C  G EL+DWL+      A  R+    + Q LL+  +I+HV  E + F+D  + Y F
Sbjct: 28  CFVGKELIDWLIE--HKEASDRETAIKIMQKLLDHSIIHHVCDEHKEFKDAKLFYRF 82



 Score = 29.1 bits (65), Expect = 1.5
 Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 2/40 (5%)

Query: 75  CASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYH 114
           C  G EL+DWL+      A  R+    + Q LL+  +I+H
Sbjct: 28  CFVGKELIDWLIE--HKEASDRETAIKIMQKLLDHSIIHH 65


>gnl|CDD|239891 cd04444, DEP_PLEK2, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain found in pleckstrin 2-like proteins.  Pleckstrin
           2 is found in a wide variety of cell types, which
           suggest a more general role in signaling than pleckstrin
           1.  Pleckstrin-like proteins contain a central DEP
           domain, flanked by 2 PH (pleckstrin homology) domains.
          Length = 109

 Score = 37.6 bits (87), Expect = 0.003
 Identities = 26/82 (31%), Positives = 35/82 (42%), Gaps = 16/82 (19%)

Query: 146 RRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVN------------GEQA 193
           ++   GS LVDWL+S   S A SR     +   L+EE  +  V              EQ 
Sbjct: 29  KKTFLGSALVDWLIS--NSFAASRLEAVTLASMLMEENFLRPVGVRSMGAIRSGDLAEQF 86

Query: 194 FRDKCILYNFWQD--KEGSSSQ 213
             D   LY F +   K+GSS +
Sbjct: 87  LDDSTALYTFAESYKKKGSSKE 108



 Score = 33.4 bits (76), Expect = 0.10
 Identities = 15/38 (39%), Positives = 20/38 (52%), Gaps = 2/38 (5%)

Query: 73  RRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEG 110
           ++   GS LVDWL+S   S A SR     +   L+EE 
Sbjct: 29  KKTFLGSALVDWLIS--NSFAASRLEAVTLASMLMEEN 64


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 40.5 bits (95), Expect = 0.003
 Identities = 25/95 (26%), Positives = 49/95 (51%)

Query: 212 SQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIE 271
           S   ++   E EE  EE    L +    A L  +L+  L++L+ +    E  +  E +  
Sbjct: 618 SLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRI 677

Query: 272 DKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELL 306
           + EE+ +E+ EE  Q+ + L++  E+LE++ ++L 
Sbjct: 678 ENEEQLEEKLEELEQLEEELEQLREELEELLKKLG 712



 Score = 36.3 bits (84), Expect = 0.069
 Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 11/101 (10%)

Query: 216 AQDIAEAEEHLEEA------LLALARRAPDAILRYILRKQLKKLQNRYL-----PREYML 264
            +++ E E  LEE       L  L R   +        + L +     L       E + 
Sbjct: 273 EEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLE 332

Query: 265 HLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
            LE ++E  E E +E  EE+ ++ K L+   ++LE+  EEL
Sbjct: 333 KLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEEL 373



 Score = 34.4 bits (79), Expect = 0.27
 Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 13/101 (12%)

Query: 218 DIAEAEEHLEEALLAL-----ARRAPDAI--LRYILRKQLKKLQNRYLPREYMLHLEREI 270
           ++ E    LEE L +L          +A   L   L K   + +   L +  +  LE ++
Sbjct: 604 ELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKV 663

Query: 271 E------DKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
           E       +E +  E EE+  +  + L++  E+LEQ+ EEL
Sbjct: 664 EELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREEL 704



 Score = 33.6 bits (77), Expect = 0.43
 Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 3/90 (3%)

Query: 246 LRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEE-EERQIGKPLDRTSEDLEQIYEE 304
            + ++++L+ +    E +  +E  +E  EEE KE ++ EE Q  +  +   +++E + E 
Sbjct: 183 AKAKIEELEGQL--SELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEER 240

Query: 305 LLHLKPLHHLSNSVRRELAGVVMFEAHPRK 334
           L  L+        ++  L  +   E    K
Sbjct: 241 LAELEEEKERLEELKARLLEIESLELEALK 270



 Score = 30.1 bits (68), Expect = 5.0
 Identities = 23/93 (24%), Positives = 42/93 (45%), Gaps = 9/93 (9%)

Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEE 276
               + EE LE+ + AL  R  +       +++L++L+ R L  E +     +I ++E  
Sbjct: 221 IQEEQEEEELEQEIEALEERLAE---LEEEKERLEELKARLLEIESLELEALKIREEELR 277

Query: 277 EKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
           E E   EE +         E LE++  E+  L+
Sbjct: 278 ELERLLEELE------EKIERLEELEREIEELE 304



 Score = 29.7 bits (67), Expect = 6.4
 Identities = 18/97 (18%), Positives = 41/97 (42%)

Query: 216 AQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
            +++      L E +  L     + +      +++++ Q      + +  LE  + + EE
Sbjct: 187 IEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEE 246

Query: 276 EEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLH 312
           E++  EE + ++ +      E L+   EEL  L+ L 
Sbjct: 247 EKERLEELKARLLEIESLELEALKIREEELRELERLL 283



 Score = 29.3 bits (66), Expect = 8.7
 Identities = 24/101 (23%), Positives = 43/101 (42%), Gaps = 14/101 (13%)

Query: 215 TAQDIAEAEEHLEEALLALARR-----APDAILRYILRKQLKKLQNRY---------LPR 260
              ++ E  E LE+ L  L          +  L+  L ++L+KL+N           L  
Sbjct: 495 EEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQL 554

Query: 261 EYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQI 301
           + +    R++ED+ +E KE  EE R +    +   E  E++
Sbjct: 555 QQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERL 595



 Score = 29.3 bits (66), Expect = 9.2
 Identities = 23/92 (25%), Positives = 40/92 (43%), Gaps = 15/92 (16%)

Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEE 276
           + + + E  LEE        A       +L ++LK+L+ R         LE+E+E   E 
Sbjct: 336 EKLEKLESELEELAEEKNELAK------LLEERLKELEERLE------ELEKELEKALER 383

Query: 277 EK---EEEEEERQIGKPLDRTSEDLEQIYEEL 305
            K   E  +E ++    L    E++++  EEL
Sbjct: 384 LKQLEEAIQELKEELAELSAALEEIQEELEEL 415



 Score = 29.3 bits (66), Expect = 9.9
 Identities = 26/116 (22%), Positives = 46/116 (39%), Gaps = 12/116 (10%)

Query: 219 IAEAEEHLEEA---LLALARRAPDAILRYI---LRKQLKKLQNRYLPREYMLHLEREIED 272
           + E ++ L+E    L  L        L      L +  ++L++          LE  ++ 
Sbjct: 595 LKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQA 654

Query: 273 KEEE---EKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGV 325
             EE   + EE E E +      +  E+ EQ+ E+L  L+ L      +R EL  +
Sbjct: 655 ALEELEEKVEELEAEIR---RELQRIENEEQLEEKLEELEQLEEELEQLREELEEL 707


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 39.9 bits (94), Expect = 0.005
 Identities = 35/134 (26%), Positives = 65/134 (48%), Gaps = 25/134 (18%)

Query: 200 LYNFWQDKEGSSSQATAQDI-----AEAEEHLEEALLALARRAPDAILRY---------I 245
           L     +K+  S++  A+ I      EAE   +EALL     A + + +           
Sbjct: 18  LRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLE----AKEEVHKLRAELERELKE 73

Query: 246 LRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIG---KPLDRTSEDLEQIY 302
            R +L++L+ R L RE    L+R++E  +++E+  E++E+++    K LD   E+LE++ 
Sbjct: 74  RRNELQRLERRLLQRE--ETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELI 131

Query: 303 EELLHLKPLHHLSN 316
            E    + L  +S 
Sbjct: 132 AE--QREELERISG 143


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 39.8 bits (94), Expect = 0.005
 Identities = 35/107 (32%), Positives = 56/107 (52%), Gaps = 16/107 (14%)

Query: 220 AEAEEHLEEALLALARRAPDAILRY---------ILRKQLKKLQNRYLPREYMLHLEREI 270
            EAE   +EALL     A + I +            R +L+KL+ R L +E  L  + E+
Sbjct: 49  KEAEAIKKEALLE----AKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLEL 104

Query: 271 EDKEEEEKEEEEEE-RQIGKPLDRTSEDLEQIYEELLHLKPLHHLSN 316
            +K EEE E++E+E  Q  + L++  E+LE++ EE   L+ L  +S 
Sbjct: 105 LEKREEELEKKEKELEQKQQELEKKEEELEELIEE--QLQELERISG 149



 Score = 33.6 bits (78), Expect = 0.39
 Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 219 IAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEK 278
           I EAEE   + +L  A++  +AI      K+   L+ +    +     E+E+ ++  E +
Sbjct: 33  IKEAEE-EAKRILEEAKKEAEAI------KKEALLEAKEEIHKLRNEFEKELRERRNELQ 85

Query: 279 EEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
           + E+   Q  + LDR  E LE+  EEL   +
Sbjct: 86  KLEKRLLQKEENLDRKLELLEKREEELEKKE 116


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 39.7 bits (93), Expect = 0.006
 Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 16/103 (15%)

Query: 215 TAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREY------MLHLER 268
             +++ +A E L E    L            LRK+L++L+ +Y   EY       L L R
Sbjct: 624 LEEELDKAFEELAETEKRLEE----------LRKELEELEKKYSEEEYEELREEYLELSR 673

Query: 269 EIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPL 311
           E+     E +E E+   +I K L++  E+LE+  +    L+ L
Sbjct: 674 ELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKL 716



 Score = 39.3 bits (92), Expect = 0.008
 Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 4/92 (4%)

Query: 219 IAEAEEHLEEALLALARRAPDAILRYILR-KQLKKLQNRYLPREYMLHLEREIEDKEEEE 277
           + E EE L E L  L     +++     R K+L+   N YL    +   E+E+E +E+E 
Sbjct: 565 LDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLE---LKDAEKELEREEKEL 621

Query: 278 KEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
           K+ EEE  +  + L  T + LE++ +EL  L+
Sbjct: 622 KKLEEELDKAFEELAETEKRLEELRKELEELE 653



 Score = 32.0 bits (73), Expect = 1.5
 Identities = 28/99 (28%), Positives = 44/99 (44%), Gaps = 5/99 (5%)

Query: 207 KEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHL 266
           KE    +   +++ +  + LE+ L  L  R       Y   K  K+   R   R   L  
Sbjct: 331 KELEEKEERLEELKKKLKELEKRLEELEERH----ELYEEAKAKKEELERLKKRLTGLTP 386

Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
           E+  ++ EE EK +EE E +I K   R  E L++  +EL
Sbjct: 387 EKLEKELEELEKAKEEIEEEISKITARIGE-LKKEIKEL 424



 Score = 30.4 bits (69), Expect = 4.1
 Identities = 24/109 (22%), Positives = 44/109 (40%), Gaps = 23/109 (21%)

Query: 219 IAEAEEHLEEALLALARRAPD---AILRYILRKQLKKLQNRYLPREYMLHLE-------- 267
           I E ++ +EE L    +   +       YI   +L +    YL     +           
Sbjct: 268 IEELKKEIEE-LEEKVKELKELKEKAEEYI---KLSEFYEEYLDELREIEKRLSRLEEEI 323

Query: 268 -------REIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
                  +E+E+KEE  +E +++ +++ K L+   E  E +YEE    K
Sbjct: 324 NGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHE-LYEEAKAKK 371



 Score = 29.6 bits (67), Expect = 6.3
 Identities = 28/100 (28%), Positives = 51/100 (51%), Gaps = 10/100 (10%)

Query: 219 IAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLP----REYMLHLEREIEDKE 274
           I E E+ LEE L  +   +        LR++L+KL+         +E +  LE+E+E  E
Sbjct: 195 IKEKEKELEEVLREINEISS---ELPELREELEKLEKEVKELEELKEEIEELEKELESLE 251

Query: 275 EEEKEEEEEERQIGKPLDRT---SEDLEQIYEELLHLKPL 311
             +++ EE+ R++ + ++      E+LE+  +EL  LK  
Sbjct: 252 GSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEK 291



 Score = 29.6 bits (67), Expect = 7.6
 Identities = 22/84 (26%), Positives = 41/84 (48%), Gaps = 4/84 (4%)

Query: 246 LRKQLKKLQNRYLPREYML-HLEREIEDKEEEEKEEEEEERQIG--KPLDRTSEDLEQIY 302
           L K+L+ L+      E  +  LE  IE+ ++E +E EE+ +++   K        L + Y
Sbjct: 243 LEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFY 302

Query: 303 EELLH-LKPLHHLSNSVRRELAGV 325
           EE L  L+ +    + +  E+ G+
Sbjct: 303 EEYLDELREIEKRLSRLEEEINGI 326



 Score = 29.3 bits (66), Expect = 9.2
 Identities = 24/105 (22%), Positives = 41/105 (39%), Gaps = 20/105 (19%)

Query: 239 DAILRYILRKQLKKLQNRYLP-REYMLHLEREIEDKEEEEKEEEEEERQIG--------- 288
           + ++R IL   L   +N Y    E +  ++R IE  E+  K  E  E  I          
Sbjct: 148 EKVVRQIL--GLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEV 205

Query: 289 --------KPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGV 325
                     L    E+LE++ +E+  L+ L      + +EL  +
Sbjct: 206 LREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESL 250


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 38.0 bits (89), Expect = 0.015
 Identities = 24/89 (26%), Positives = 38/89 (42%), Gaps = 8/89 (8%)

Query: 216 AQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
            ++ AE E   E  L   A    + + +    K+  K        E+   LE++IE+KEE
Sbjct: 251 QEERAEEEAERERMLEKQAED--EELEQENAEKRRMKRL------EHRRELEQQIEEKEE 302

Query: 276 EEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
               E EEE + G+ L     + +   EE
Sbjct: 303 RRAAEREEELEEGERLREEEAERQARIEE 331



 Score = 33.0 bits (76), Expect = 0.46
 Identities = 22/70 (31%), Positives = 31/70 (44%), Gaps = 7/70 (10%)

Query: 221 EAEEHLEEALLALA--RRAPDAILRYILRKQLKKLQNRYLPREYMLHLERE-IEDKEEEE 277
           E EE L+E         R  +        K+ K+ + R    E +     E IE KEEE+
Sbjct: 90  EYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLR----EEIDEFNEERIERKEEEK 145

Query: 278 KEEEEEERQI 287
           + E EEE +I
Sbjct: 146 EREREEELKI 155



 Score = 33.0 bits (76), Expect = 0.57
 Identities = 29/109 (26%), Positives = 50/109 (45%), Gaps = 10/109 (9%)

Query: 217 QDIAEAEEHLEEALLALARRAPDAILRY-ILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
           ++  +  E L+E    L +   +   R     +  K+ + +   +        E E++ +
Sbjct: 192 EEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQ 251

Query: 276 EEKEEEEEERQIGKPLDRTSED--LEQIYEELLHLKPLHHLSNSVRREL 322
           EE+ EEE ER+  + L++ +ED  LEQ   E   +K L H     RREL
Sbjct: 252 EERAEEEAERE--RMLEKQAEDEELEQENAEKRRMKRLEH-----RREL 293



 Score = 31.4 bits (72), Expect = 1.4
 Identities = 23/98 (23%), Positives = 42/98 (42%), Gaps = 13/98 (13%)

Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLERE-----IE 271
                 EE  E       R       R +L++Q+++ + R          ERE     IE
Sbjct: 52  LKALAEEEERERKRKEERRE-----GRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIE 106

Query: 272 DKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
             +EE++ E +E+R+  K   +  E++++  EE +  K
Sbjct: 107 RIQEEDEAEAQEKREKQK---KLREEIDEFNEERIERK 141



 Score = 30.3 bits (69), Expect = 3.8
 Identities = 23/96 (23%), Positives = 42/96 (43%), Gaps = 18/96 (18%)

Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIED 272
           +    +  E E  ++E +     +A    L     ++ K+ + R   RE    L+ +IE+
Sbjct: 30  KRIKAEEKEEERRIDEMMEEERLKA----LAEEEERERKRKEER---REGRAVLQEQIEE 82

Query: 273 KE----EEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
           +E    EE +E  +E  Q+        E +E+I EE
Sbjct: 83  REKRRQEEYEERLQEREQM-------DEIIERIQEE 111



 Score = 29.5 bits (67), Expect = 6.7
 Identities = 16/70 (22%), Positives = 28/70 (40%), Gaps = 6/70 (8%)

Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEE 276
           +++ E  E L  A +   R A       I  K+  K + +   R     +E E      E
Sbjct: 4   EELRELNEKLRAAKVNKERDAQ------IEEKKRIKAEEKEEERRIDEMMEEERLKALAE 57

Query: 277 EKEEEEEERQ 286
           E+E E + ++
Sbjct: 58  EEERERKRKE 67


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 37.4 bits (87), Expect = 0.030
 Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 11/95 (11%)

Query: 219 IAEAEEHLEEALLALARRAPDAI--LRYILRKQLKKLQNRY---LPREYML---HLEREI 270
           + EAEEH+  AL ++  R P  +  L+  L  QL+ L+  Y       Y L   +++  +
Sbjct: 199 LEEAEEHMI-ALRSIMERIPSLLAELQTELPGQLQDLKAGYRDMKEEGYHLEHVNIDSRL 257

Query: 271 EDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
           E  +E+  E  E   Q+   LD   E+L  I E++
Sbjct: 258 ERLKEQLVENSELLTQL--ELDEAEEELGLIQEKI 290


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 37.3 bits (86), Expect = 0.031
 Identities = 15/25 (60%), Positives = 20/25 (80%)

Query: 267 EREIEDKEEEEKEEEEEERQIGKPL 291
           E E E++EEEE+EEEEEE +  +PL
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEEPL 894



 Score = 33.4 bits (76), Expect = 0.53
 Identities = 13/20 (65%), Positives = 17/20 (85%)

Query: 267 EREIEDKEEEEKEEEEEERQ 286
           E E E++EEEE+EEEEEE +
Sbjct: 866 EEEEEEEEEEEEEEEEEEEE 885



 Score = 32.3 bits (73), Expect = 1.1
 Identities = 13/20 (65%), Positives = 17/20 (85%)

Query: 267 EREIEDKEEEEKEEEEEERQ 286
           E E E++EEEE+EEEEEE +
Sbjct: 867 EEEEEEEEEEEEEEEEEEEE 886



 Score = 32.3 bits (73), Expect = 1.1
 Identities = 13/20 (65%), Positives = 17/20 (85%)

Query: 267 EREIEDKEEEEKEEEEEERQ 286
           E E E++EEEE+EEEEEE +
Sbjct: 865 EEEEEEEEEEEEEEEEEEEE 884



 Score = 31.9 bits (72), Expect = 1.5
 Identities = 17/41 (41%), Positives = 26/41 (63%), Gaps = 5/41 (12%)

Query: 269 EIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
           + E++EEEE+EEEEEE +     +   E+ E+  EE L L+
Sbjct: 862 DSEEEEEEEEEEEEEEEE-----EEEEEEEEEENEEPLSLE 897



 Score = 31.5 bits (71), Expect = 1.9
 Identities = 13/20 (65%), Positives = 17/20 (85%)

Query: 267 EREIEDKEEEEKEEEEEERQ 286
           E E E++EEEE+EEEEEE +
Sbjct: 868 EEEEEEEEEEEEEEEEEEEE 887



 Score = 31.5 bits (71), Expect = 2.1
 Identities = 13/20 (65%), Positives = 17/20 (85%)

Query: 267 EREIEDKEEEEKEEEEEERQ 286
           E E E++EEEE+EEEEEE +
Sbjct: 869 EEEEEEEEEEEEEEEEEEEE 888



 Score = 31.1 bits (70), Expect = 2.3
 Identities = 13/20 (65%), Positives = 17/20 (85%)

Query: 267 EREIEDKEEEEKEEEEEERQ 286
           E E E++EEEE+EEEEEE +
Sbjct: 864 EEEEEEEEEEEEEEEEEEEE 883


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 37.4 bits (87), Expect = 0.032
 Identities = 20/93 (21%), Positives = 45/93 (48%), Gaps = 8/93 (8%)

Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIED 272
               + + E E+ LEE    +     +  LR  +++++ +L+        +  LER I +
Sbjct: 261 SELEKRLEEIEQLLEELNKKIKDLGEEEQLR--VKEKIGELEAE------IASLERSIAE 312

Query: 273 KEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
           KE E ++ EE   ++   +D+   ++E++  E+
Sbjct: 313 KERELEDAEERLAKLEAEIDKLLAEIEELEREI 345



 Score = 32.7 bits (75), Expect = 0.94
 Identities = 24/104 (23%), Positives = 49/104 (47%), Gaps = 16/104 (15%)

Query: 218 DIAEAEEHLEEALLALA---------RRAPDAILRY-ILRKQLKKLQNRYLPREYMLH-- 265
           ++ E EE++E   L +          RR  +   RY  L K+ ++ +   L +E      
Sbjct: 178 ELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALER 237

Query: 266 ----LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
               +ER++   EEE ++  EE  ++ K L+   + LE++ +++
Sbjct: 238 QKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKI 281



 Score = 32.0 bits (73), Expect = 1.4
 Identities = 27/94 (28%), Positives = 38/94 (40%), Gaps = 16/94 (17%)

Query: 218 DIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEE 277
              E EE LEE   AL            L  +L  L+      E  L   RE+E K EE 
Sbjct: 862 KKEELEEELEELEAALRD----------LESRLGDLKKERDELEAQL---RELERKIEEL 908

Query: 278 KEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPL 311
           + + E++R+    L      LE + EEL  ++  
Sbjct: 909 EAQIEKKRKR---LSELKAKLEALEEELSEIEDP 939



 Score = 29.7 bits (67), Expect = 8.2
 Identities = 18/87 (20%), Positives = 34/87 (39%), Gaps = 1/87 (1%)

Query: 235 RRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRT 294
               +A LR I +K  +    +    + +  L+ +  D +E+ K  E+E   +    +  
Sbjct: 807 VSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEEL 866

Query: 295 SEDLEQIYEELLHL-KPLHHLSNSVRR 320
            E+LE++   L  L   L  L      
Sbjct: 867 EEELEELEAALRDLESRLGDLKKERDE 893


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 34.6 bits (79), Expect = 0.036
 Identities = 16/37 (43%), Positives = 20/37 (54%)

Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYE 303
           + E E+ E EE+E EEEE   G    R +ED E   E
Sbjct: 60  DDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAE 96


>gnl|CDD|236969 PRK11753, PRK11753, DNA-binding transcriptional dual regulator Crp;
           Provisional.
          Length = 211

 Score = 35.3 bits (82), Expect = 0.055
 Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 4/49 (8%)

Query: 391 SMVFHQGDEGKSWYIIIQGSVDVVIY---GKGCVTS-LYAGEDFGKLAL 435
           S + H G++ ++ Y I++GSV V+I    GK  + S L  G+  G+L L
Sbjct: 29  STLIHAGEKAETLYYIVKGSVAVLIKDEEGKEMILSYLNQGDFIGELGL 77



 Score = 35.3 bits (82), Expect = 0.069
 Identities = 18/47 (38%), Positives = 28/47 (59%), Gaps = 4/47 (8%)

Query: 338 LFHQGDEGKSWYIIIQGSVDVVIY---GKGCVTS-LYAGEDFGKLAL 380
           L H G++ ++ Y I++GSV V+I    GK  + S L  G+  G+L L
Sbjct: 31  LIHAGEKAETLYYIVKGSVAVLIKDEEGKEMILSYLNQGDFIGELGL 77


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 36.2 bits (84), Expect = 0.062
 Identities = 27/102 (26%), Positives = 47/102 (46%), Gaps = 11/102 (10%)

Query: 211 SSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLH-LERE 269
             +   +++ EAE+ +EE    L            LR++L++LQ   L  +  +  LE E
Sbjct: 254 ELEELQEELEEAEKEIEELKSELEE----------LREELEELQEELLELKEEIEELEGE 303

Query: 270 IEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPL 311
           I    E  +E E E  ++ + L+   E +E + EEL   + L
Sbjct: 304 ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345



 Score = 34.7 bits (80), Expect = 0.24
 Identities = 28/122 (22%), Positives = 56/122 (45%), Gaps = 9/122 (7%)

Query: 205 QDKEGSSSQATAQD-IAEAEEHLEEALLALARRAPDAILRYI--LRKQLKKLQNRYLP-R 260
           Q ++      + ++ +   E+ LEE      RR  + + R +  L+++L  L+      +
Sbjct: 682 QLEKLEEELKSLKNELRSLEDLLEEL-----RRQLEELERQLEELKRELAALEEELEQLQ 736

Query: 261 EYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRR 320
             +  LE E+E+ EEE +E +E   ++ + L+   E L ++ EE+  L+           
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796

Query: 321 EL 322
           EL
Sbjct: 797 EL 798



 Score = 32.4 bits (74), Expect = 1.0
 Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 11/103 (10%)

Query: 213 QATAQDIAEAE---EHLEEALLALARRAPDAILRYI-LRKQLKKLQNRYLPREYMLH--- 265
           + T +++   E   E LE+ L  L R+A  A  RY  L+ +L++L+   L  +       
Sbjct: 182 ERTEENLERLEDLLEELEKQLEKLERQAEKA-ERYQELKAELRELELALLLAKLKELRKE 240

Query: 266 ---LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
              LE E+   EEE +E +EE  +  K ++    +LE++ EEL
Sbjct: 241 LEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREEL 283



 Score = 31.6 bits (72), Expect = 1.9
 Identities = 22/94 (23%), Positives = 41/94 (43%), Gaps = 11/94 (11%)

Query: 217 QDIAEAEEHLEEALLALARR-APDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
           + + E EE LE    ALA+       L    +   ++L+           LE E+E+ E 
Sbjct: 758 ERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEE----------LEEELEEAER 807

Query: 276 EEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
                E E   + +  +R  +++E++ EE+  L+
Sbjct: 808 RLDALERELESLEQRRERLEQEIEELEEEIEELE 841



 Score = 31.2 bits (71), Expect = 2.3
 Identities = 31/122 (25%), Positives = 50/122 (40%), Gaps = 18/122 (14%)

Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRY-----ILRKQLKKLQNRYLPREYMLHLE 267
           +    ++ EA+E LEE L AL     +           L  +L +++N       +  L+
Sbjct: 350 EQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNE------LEELK 403

Query: 268 REIEDKEEE-EKEEEEEERQIGK------PLDRTSEDLEQIYEELLHLKPLHHLSNSVRR 320
           REIE  EE  E+  E  E    +       L+    +LE++ EEL  L+          +
Sbjct: 404 REIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLK 463

Query: 321 EL 322
           EL
Sbjct: 464 EL 465



 Score = 30.8 bits (70), Expect = 3.2
 Identities = 25/105 (23%), Positives = 50/105 (47%), Gaps = 10/105 (9%)

Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLH------- 265
             + ++     +   E L    +   + +    L ++L++ + R    E  L        
Sbjct: 767 LESLEEALAKLKEEIEELEEKRQALQEELE--ELEEELEEAERRLDALERELESLEQRRE 824

Query: 266 -LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
            LE+EIE+ EEE +E EE+  ++ + L+   ++LE++ EEL  L+
Sbjct: 825 RLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELE 869



 Score = 30.5 bits (69), Expect = 4.7
 Identities = 25/105 (23%), Positives = 46/105 (43%), Gaps = 10/105 (9%)

Query: 219 IAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEK 278
           + E +E LEE          D +    L ++ ++L+     RE    LE E+ + +EE +
Sbjct: 858 LEELKEELEELEAEKEELE-DELKE--LEEEKEELEEEL--RE----LESELAELKEEIE 908

Query: 279 EEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELA 323
           +  E   ++   L+R   +L ++ EEL        L   + RE+ 
Sbjct: 909 KLRERLEELEAKLERLEVELPELEEELEEEYED-TLETELEREIE 952



 Score = 29.7 bits (67), Expect = 6.9
 Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 4/90 (4%)

Query: 217 QDIAEAEEHLEEALLALARRAPDAIL-RYILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
           +++ E  E L+E LL L     +      +LR++L++L+N     E +     E+++K E
Sbjct: 277 EELREELEELQEELLELKEEIEELEGEISLLRERLEELENE---LEELEERLEELKEKIE 333

Query: 276 EEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
             KEE EE   + + L++   +LE+  EEL
Sbjct: 334 ALKEELEERETLLEELEQLLAELEEAKEEL 363



 Score = 29.3 bits (66), Expect = 9.0
 Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 7/90 (7%)

Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLH-LEREIEDKEE 275
           Q + E  E LEE L    RR         L ++L+ L+ R    E  +  LE EIE+ EE
Sbjct: 789 QALQEELEELEEELEEAERRLDA------LERELESLEQRRERLEQEIEELEEEIEELEE 842

Query: 276 EEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
           +  E EEE  ++ K L+   E+LE++  E 
Sbjct: 843 KLDELEEELEELEKELEELKEELEELEAEK 872


>gnl|CDD|223736 COG0664, Crp, cAMP-binding proteins - catabolite gene activator and
           regulatory subunit of cAMP-dependent protein kinases
           [Signal transduction mechanisms].
          Length = 214

 Score = 35.2 bits (81), Expect = 0.065
 Identities = 22/58 (37%), Positives = 31/58 (53%), Gaps = 4/58 (6%)

Query: 333 RKGEILFHQGDEGKSWYIIIQGSVDVVIY---GKGCVTSLYAGED-FGKLALVNNAPR 386
            KGE+LF +G+E  S YII+ G V +      G+  +       D FG+LAL+   PR
Sbjct: 29  PKGEVLFTEGEEADSLYIILSGIVKLYANTEDGREIILGFLGPGDFFGELALLGGDPR 86



 Score = 28.6 bits (64), Expect = 9.3
 Identities = 12/53 (22%), Positives = 20/53 (37%), Gaps = 3/53 (5%)

Query: 365 CVTSLYAGEDFGKLALVNNAPRLIIT---SMVFHQGDEGKSWYIIIQGSVDVV 414
               L        L L+     +       ++F +G+E  S YII+ G V + 
Sbjct: 3   KENPLLNLLPSELLELLALKLEVRKLPKGEVLFTEGEEADSLYIILSGIVKLY 55


>gnl|CDD|221528 pfam12320, SbcD_C, Type 5 capsule protein repressor C-terminal
           domain.  This domain is found in bacteria and archaea.
           This domain is about 90 amino acids in length. This
           domain is found associated with pfam00149. SbcD works in
           complex with SbdC (SbcDC) which is a transcription
           regulator. It down-regulates transcription of arl and
           mgr to inhibit type 5 capsule protein production. It
           acts as part of the SOS pathway of bacteria.
          Length = 97

 Score = 33.4 bits (77), Expect = 0.066
 Identities = 27/98 (27%), Positives = 44/98 (44%), Gaps = 14/98 (14%)

Query: 217 QDIAEAEEHLEEALLALA-RRAPDAILRYIL---------RKQLKKLQNRYLPREYMLHL 266
           + +   +  LEE L AL   +  DA L   L         R++L++     LP   +L +
Sbjct: 4   RPLRRIKGSLEELLAALEELKDRDAYLEVELTDEEPLPDLRERLREA----LPNIELLRI 59

Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
            R+ E +E   +EEEEEE +   PL+      ++   E
Sbjct: 60  RRKRERRERALEEEEEEELEELSPLELFERFYKEETGE 97


>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972).  This
           family consists of several hypothetical bacterial
           sequences. The function of this family is unknown.
          Length = 106

 Score = 33.4 bits (77), Expect = 0.069
 Identities = 28/96 (29%), Positives = 52/96 (54%), Gaps = 17/96 (17%)

Query: 219 IAEAEEHLEEALLALARRAPDAILRYILRKQLKKL--QNRYLPREYMLHLEREIEDKEEE 276
           ++E E+ L + L  L            L++QL +L  +N  L  E   HL   +E+ E+E
Sbjct: 10  LSELEQQLNQLLAELGE----------LKQQLAELLEENAELRIE-NEHLRERLEELEQE 58

Query: 277 EKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLH 312
           +KE+E+ ++++G+  D    +L ++Y+E  H+  LH
Sbjct: 59  QKEKEKGKKKLGEGYD----NLARLYQEGFHICNLH 90


>gnl|CDD|224450 COG1533, SplB, DNA repair photolyase [DNA replication,
           recombination, and repair].
          Length = 297

 Score = 35.4 bits (82), Expect = 0.082
 Identities = 21/73 (28%), Positives = 30/73 (41%), Gaps = 2/73 (2%)

Query: 220 AEAEEHLEEALLALARRAPDAILRY--ILRKQLKKLQNRYLPREYMLHLEREIEDKEEEE 277
            E E  LE A  A AR      LR    + ++ K+   R  P  Y L+LE + E +  + 
Sbjct: 199 EELERILEAAAEAGARVVVYGTLRLRLDILRRFKEYLKRIEPERYRLYLELDREKRRGKF 258

Query: 278 KEEEEEERQIGKP 290
              E  +R  G  
Sbjct: 259 YAPEYGKRVYGDG 271


>gnl|CDD|226251 COG3728, XtmA, Phage terminase, small subunit [DNA replication,
           recombination, and repair].
          Length = 179

 Score = 34.8 bits (80), Expect = 0.085
 Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 2/69 (2%)

Query: 241 ILRYILRKQLKKLQNRYLPREYMLHLEREI--EDKEEEEKEEEEEERQIGKPLDRTSEDL 298
           I  YI  KQ + ++   +P   +L +  EI   D E E   + ++   +  P    ++  
Sbjct: 47  IKAYIDEKQCETVEEDEVPAYELLKVLTEIALGDIETEAIVQPKDTLPVLTPPVVIAQLE 106

Query: 299 EQIYEELLH 307
           E I EEL  
Sbjct: 107 EAIEEELED 115


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 34.7 bits (80), Expect = 0.11
 Identities = 25/102 (24%), Positives = 42/102 (41%), Gaps = 2/102 (1%)

Query: 207 KEGSSSQATAQDIAEAEEHLEEA--LLALARRAPDAILRYILRKQLKKLQNRYLPREYML 264
           KE    + TA ++ E  +  EE   LL       +   R +  +     + R      + 
Sbjct: 26  KELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVD 85

Query: 265 HLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELL 306
               E+   EEE +++E E RQ+ + L    E  E+  +ELL
Sbjct: 86  EATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELL 127


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 35.3 bits (82), Expect = 0.12
 Identities = 23/104 (22%), Positives = 52/104 (50%), Gaps = 8/104 (7%)

Query: 216 AQDIAEA---EEHLEEALLALARRAPDAILRY----ILRKQLKKLQNRYLPREYMLHLER 268
            Q++ E    E+ L++AL  L +      L+      + ++++K Q  Y  RE +  +++
Sbjct: 178 LQEVLETVNIEKRLKKALELLKKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKK 237

Query: 269 EIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLH 312
           E+  +++++ E E+ + ++ + L    E  + I +EL  L  L 
Sbjct: 238 ELGIEKDDKDELEKLKEKL-EELKLPEEVKKVIEKELTKLSLLE 280


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 35.4 bits (82), Expect = 0.13
 Identities = 30/112 (26%), Positives = 46/112 (41%), Gaps = 10/112 (8%)

Query: 204 WQDKEGSSSQATAQDIA--EAEEHLEEAL--LALARRAPDAILRYI--LRKQLKKL---Q 254
            +  E    +    + A  E  + LEE    L   R+  + + R I  LRK L +L    
Sbjct: 683 EEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEV 742

Query: 255 NRYLPREYMLHLER-EIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
            +   R   L  E  E+E + EE +E  EE  +     +   E+LE   E+L
Sbjct: 743 EQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQL 794



 Score = 31.6 bits (72), Expect = 1.7
 Identities = 22/96 (22%), Positives = 38/96 (39%), Gaps = 8/96 (8%)

Query: 213 QATAQDIAEAEEHLEEALLALA--RRAPDAILRYILRKQLKKLQNRYLPREYMLH-LERE 269
           +   + I E  E +E     +       +      L  +L+ L N     E  L  L  E
Sbjct: 841 EDLEEQIEELSEDIESLAAEIEELEELIEE-----LESELEALLNERASLEEALALLRSE 895

Query: 270 IEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
           +E+  EE +E E +  ++ + L+   E L Q+   L
Sbjct: 896 LEELSEELRELESKRSELRRELEELREKLAQLELRL 931



 Score = 31.6 bits (72), Expect = 1.7
 Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 3/77 (3%)

Query: 226 LEEALLALARRAPDAILRYI-LRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEE 284
           LE  L +L R+A  A  RY  L+ +L++L+   L       L  E+E+ +EE KE EEE 
Sbjct: 198 LERQLKSLERQAEKAE-RYKELKAELRELELALLVLRL-EELREELEELQEELKEAEEEL 255

Query: 285 RQIGKPLDRTSEDLEQI 301
            ++   L    E LE++
Sbjct: 256 EELTAELQELEEKLEEL 272



 Score = 31.2 bits (71), Expect = 2.8
 Identities = 17/43 (39%), Positives = 28/43 (65%)

Query: 263 MLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
           +L   REIE+ EE+ +E EE+  ++ K L    ++LE++ EEL
Sbjct: 672 ILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEEL 714


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 33.4 bits (77), Expect = 0.13
 Identities = 16/42 (38%), Positives = 27/42 (64%), Gaps = 1/42 (2%)

Query: 268 REIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
            E E ++EE++ +EE+E++ GK L +  E+ E+  EEL   K
Sbjct: 19  EEKEREKEEKERKEEKEKEWGKGLVQ-KEEREKRLEELEKAK 59


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 34.6 bits (80), Expect = 0.14
 Identities = 28/116 (24%), Positives = 46/116 (39%), Gaps = 20/116 (17%)

Query: 199 ILYNFWQDKEGSSS-------QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLK 251
           +L  F   KE             TA++ A   E  +         A   +LR   +++ +
Sbjct: 177 VLQEFLNSKEAVEEAILQTDQALTAKEKAIEAERAKAEAA----EAEQELLREKQKEEEQ 232

Query: 252 KLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLH 307
            ++ +   R Y  H+++ IE K E E+E+   E       +R  E   Q  EELL 
Sbjct: 233 MMEAQ--ERSYQEHVKQLIE-KMEAEREKLLAE------QERMLEHKLQEQEELLK 279


>gnl|CDD|173171 PRK14708, PRK14708, flagellin; Provisional.
          Length = 888

 Score = 34.9 bits (80), Expect = 0.19
 Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 6/80 (7%)

Query: 8   ISARIRTITFNLSVTNTRVVHYTTTLQSSPSVPMARAGW--VLRTLLLNDESG----TLR 61
           IS  + TI+ N   T + V     T+QSS    +   G   +L+ L L   +G    TL 
Sbjct: 412 ISNGVATISTNAGQTASIVTAGIVTIQSSTGADLNVTGPTDLLKNLGLTTSTGSGPLTLT 471

Query: 62  DRKTSGGRTIARRCASGSEL 81
            ++T+ G T+    A GS L
Sbjct: 472 KQRTTSGTTMGTLIADGSTL 491


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 31.8 bits (73), Expect = 0.25
 Identities = 12/18 (66%), Positives = 16/18 (88%)

Query: 267 EREIEDKEEEEKEEEEEE 284
           E++ E++EEEEKEE EEE
Sbjct: 79  EKKEEEEEEEEKEESEEE 96



 Score = 31.1 bits (71), Expect = 0.54
 Identities = 10/18 (55%), Positives = 15/18 (83%)

Query: 267 EREIEDKEEEEKEEEEEE 284
           E + E++EEEE++EE EE
Sbjct: 78  EEKKEEEEEEEEKEESEE 95


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 31.4 bits (72), Expect = 0.29
 Identities = 9/18 (50%), Positives = 16/18 (88%)

Query: 271 EDKEEEEKEEEEEERQIG 288
           E++++EE+EEEEE+  +G
Sbjct: 67  EEEKKEEEEEEEEDDDMG 84


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 31.8 bits (72), Expect = 0.32
 Identities = 11/14 (78%), Positives = 13/14 (92%)

Query: 271 EDKEEEEKEEEEEE 284
           E K+EE+KEEEEEE
Sbjct: 89  EAKKEEKKEEEEEE 102



 Score = 30.6 bits (69), Expect = 0.82
 Identities = 10/18 (55%), Positives = 14/18 (77%)

Query: 271 EDKEEEEKEEEEEERQIG 288
             KEE+++EEEEEE  +G
Sbjct: 90  AKKEEKKEEEEEEEDDLG 107


>gnl|CDD|239896 cd04449, DEP_DEPDC5-like, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain found in DEPDC5-like proteins. DEPDC5, in human
           also known as KIAA0645, is a DEP domain containing
           protein of unknown function.
          Length = 83

 Score = 31.1 bits (71), Expect = 0.38
 Identities = 18/49 (36%), Positives = 27/49 (55%), Gaps = 1/49 (2%)

Query: 148 CASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRD 196
           C  GSE V WL++    +  +R+    + Q L+ EG+I HV+G   F D
Sbjct: 29  CFIGSEAVSWLINNFEDV-DTREEAVELGQELMNEGLIEHVSGRHPFLD 76


>gnl|CDD|183876 PRK13169, PRK13169, DNA replication intiation control protein YabA;
           Reviewed.
          Length = 110

 Score = 31.4 bits (72), Expect = 0.44
 Identities = 16/53 (30%), Positives = 30/53 (56%), Gaps = 8/53 (15%)

Query: 264 LHLE----REIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLH 312
           L LE    RE  ++ E E+  +E++++ G+  D    +L ++Y+E  H+  LH
Sbjct: 41  LRLENDKLRERLEELEAEEPAKEKKKKEGEGKD----NLARLYQEGFHICNLH 89


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 31.3 bits (71), Expect = 0.50
 Identities = 14/18 (77%), Positives = 16/18 (88%)

Query: 267 EREIEDKEEEEKEEEEEE 284
           E+E E K+EEEKEEEEEE
Sbjct: 80  EKEEEKKKEEEKEEEEEE 97



 Score = 30.2 bits (68), Expect = 1.1
 Identities = 11/18 (61%), Positives = 16/18 (88%)

Query: 267 EREIEDKEEEEKEEEEEE 284
           E + E+KEEE+K+EEE+E
Sbjct: 75  EEKAEEKEEEKKKEEEKE 92



 Score = 29.0 bits (65), Expect = 2.5
 Identities = 10/18 (55%), Positives = 15/18 (83%)

Query: 267 EREIEDKEEEEKEEEEEE 284
           E+  E +EE++KEEE+EE
Sbjct: 76  EKAEEKEEEKKKEEEKEE 93


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
           function of this eukaryotic protein family is unknown.
           The yeast orthologues have been implicated in cell cycle
           progression and biogenesis of 60S ribosomal subunits.
           The Schistosoma mansoni Mak16 has been shown to target
           protein transport to the nucleolus.
          Length = 97

 Score = 31.0 bits (70), Expect = 0.55
 Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHL 308
           LE E  ++ +EE+EEEEEE   G+       D E++ EE+  L
Sbjct: 55  LEAEESEENDEEEEEEEEEEDEGEIEY--VSDDEELEEEIEDL 95


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 30.8 bits (70), Expect = 0.62
 Identities = 12/18 (66%), Positives = 15/18 (83%)

Query: 267 EREIEDKEEEEKEEEEEE 284
           E E E++EEEE+EE EEE
Sbjct: 78  EEEEEEEEEEEEEESEEE 95



 Score = 30.8 bits (70), Expect = 0.68
 Identities = 12/18 (66%), Positives = 15/18 (83%)

Query: 267 EREIEDKEEEEKEEEEEE 284
           E E E++EEEE+EEE EE
Sbjct: 77  EEEEEEEEEEEEEEESEE 94



 Score = 29.2 bits (66), Expect = 2.4
 Identities = 11/18 (61%), Positives = 14/18 (77%)

Query: 267 EREIEDKEEEEKEEEEEE 284
               E++EEEE+EEEEEE
Sbjct: 72  AAAAEEEEEEEEEEEEEE 89



 Score = 29.2 bits (66), Expect = 2.5
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 267 EREIEDKEEEEKEEEEEERQIGKPL 291
           E E E++EEEE+EEEE E +    L
Sbjct: 76  EEEEEEEEEEEEEEEESEEEAMAGL 100


>gnl|CDD|219996 pfam08738, Gon7, Gon7 family.  In S. cerevisiae Gon7 is a member of
           the KEOPS protein complex. A protein complex proposed to
           be involved in transcription and promoting telomere
           uncapping and telomere elongation.
          Length = 97

 Score = 30.5 bits (69), Expect = 0.63
 Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 4/45 (8%)

Query: 246 LRKQLKKLQ---NRYLPREYMLHLEREIEDKEEEEKEEEEEERQI 287
           LR  L  LQ   N +L  E M   +++   K   EKE E++E + 
Sbjct: 52  LRASLTTLQDDINEFLT-ERMEEDKKKAAAKGRGEKEIEDKEEEN 95


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 32.7 bits (75), Expect = 0.67
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 267 EREIEDKEEEEKEEEEEERQIGKPL 291
           E+E E++EEEE+E EE E + G PL
Sbjct: 352 EQEDEEEEEEEEEPEEPEPEEGPPL 376



 Score = 29.6 bits (67), Expect = 6.7
 Identities = 13/18 (72%), Positives = 15/18 (83%)

Query: 267 EREIEDKEEEEKEEEEEE 284
           E E ED+EEEE+EEE EE
Sbjct: 350 EEEQEDEEEEEEEEEPEE 367


>gnl|CDD|212745 cd11811, SH3_CHK, Src Homology 3 domain of CSK homologous kinase.
           CHK is also referred to as megakaryocyte-associated
           tyrosine kinase (Matk). It inhibits Src kinases using a
           noncatalytic mechanism by simply binding to them. As a
           negative regulator of Src kinases, Chk may play
           important roles in cell proliferation, survival, and
           differentiation, and consequently, in cancer development
           and progression. To inhibit Src kinases that are
           anchored to the plasma membrane, CHK is translocated to
           the membrane via binding to specific transmembrane
           proteins, G-proteins, or adaptor proteins near the
           membrane. CHK also plays a role in neural
           differentiation in a manner independent of Src by
           enhancing MAPK activation via Ras-mediated signaling. It
           is a cytoplasmic (or nonreceptor) tyr kinase containing
           the Src homology domains, SH3 and SH2, N-terminal to the
           catalytic tyr kinase domain. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 59

 Score = 29.8 bits (67), Expect = 0.67
 Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 8/26 (30%)

Query: 332 PRKGEILFHQGD--------EGKSWY 349
           P+ GE+ FH+GD        E K WY
Sbjct: 13  PKPGELAFHKGDIVTIVETCERKGWY 38


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 32.7 bits (75), Expect = 0.67
 Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 11/74 (14%)

Query: 247 RKQLKKLQNRYLPREYMLH------LEREIEDKE-----EEEKEEEEEERQIGKPLDRTS 295
            ++L  L+N   P    +H       ERE++ K      E+EK  EE E+ + +     +
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348

Query: 296 EDLEQIYEELLHLK 309
             LE+I ++L  L+
Sbjct: 349 SKLEEIQKKLEDLE 362


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 32.0 bits (73), Expect = 0.70
 Identities = 17/56 (30%), Positives = 26/56 (46%), Gaps = 3/56 (5%)

Query: 245 ILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQ 300
           + R++ +      L ++       E E+  EEE EEEEEE Q G   +   ED + 
Sbjct: 64  LDREEAR---LELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDS 116


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 32.4 bits (74), Expect = 0.72
 Identities = 26/93 (27%), Positives = 40/93 (43%), Gaps = 14/93 (15%)

Query: 200 LYNFWQDKEGSSSQATAQDIAEAEEHLEEALLAL------ARRAPDAILRYILRKQLKKL 253
            Y   +  E    Q    +    EE   +A+  L      ++R  + + R    ++LK+L
Sbjct: 107 NYEADKLDEE---QEERVEKEREEELAGDAMKKLENRTADSKREMEVLERL---EELKEL 160

Query: 254 QNRYLP--REYMLHLEREIEDKEEEEKEEEEEE 284
           Q+R        ML      E KEEEE+EEE+E 
Sbjct: 161 QSRRADVDVNSMLEALFRREKKEEEEEEEEDEA 193



 Score = 30.9 bits (70), Expect = 2.0
 Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 1/77 (1%)

Query: 260 REYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVR 319
           R Y      E +++  E++ EEE      K L+  + D ++  E L  L+ L  L  S R
Sbjct: 106 RNYEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKEL-QSRR 164

Query: 320 RELAGVVMFEAHPRKGE 336
            ++    M EA  R+ +
Sbjct: 165 ADVDVNSMLEALFRREK 181


>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region.  This family
           represents an N-terminal conserved region found in
           several huntingtin-associated protein 1 (HAP1)
           homologues. HAP1 binds to huntingtin in a polyglutamine
           repeat-length-dependent manner. However, its possible
           role in the pathogenesis of Huntington's disease is
           unclear. This family also includes a similar N-terminal
           conserved region from hypothetical protein products of
           ALS2CR3 genes found in the human juvenile amyotrophic
           lateral sclerosis critical region 2q33-2q34.
          Length = 307

 Score = 32.5 bits (74), Expect = 0.74
 Identities = 26/76 (34%), Positives = 39/76 (51%), Gaps = 14/76 (18%)

Query: 246 LRKQLKKLQ--NRYLPREYMLHLEREIEDKEEEE--------KEEEEEERQI---GKPLD 292
           L+++LK L+  N +L R    HL+ E    EE+E        K+  E   QI    + L 
Sbjct: 166 LQEKLKLLEEENEHL-RSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIASLSEELA 224

Query: 293 RTSEDLEQIYEELLHL 308
           + +EDLE+  EE+ HL
Sbjct: 225 KKTEDLERQQEEITHL 240


>gnl|CDD|226873 COG4467, COG4467, Regulator of replication initiation timing
           [Replication,    recombination, and repair].
          Length = 114

 Score = 30.9 bits (70), Expect = 0.75
 Identities = 23/90 (25%), Positives = 38/90 (42%), Gaps = 10/90 (11%)

Query: 226 LEEALLALARRAPDAILRYILRKQLKKL--QNRYLPREYMLHLEREIEDKEEEEKEEEEE 283
           LEE L  L            L++ L  L  +N  L  E    L RE   +   EK   ++
Sbjct: 13  LEEQLGVLLAELGG------LKQHLGSLVEENTALRLE-NEKL-RERLGEPTLEKTAVKK 64

Query: 284 ERQIGKPLDRTSEDLEQIYEELLHLKPLHH 313
           E+   K      ++L ++Y+E  H+  +H+
Sbjct: 65  EKPAVKKKGEGYDNLARLYQEGFHICNVHY 94


>gnl|CDD|239887 cd04440, DEP_2_P-Rex, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain 2 found in P-Rex-like proteins. The P-Rex family
           is the guanine-nucleotide exchange factor (GEF) for the
           small GTPase Rac that contains an N-terminal RhoGEF
           domain, two DEP and PDZ domains. Rac-GEF activity is
           stimulated by phosphatidylinositol (3,4,5)-trisphosphate
           (PtdIns(3,4,5)P3), a lipid second messenger, and the G
           beta-gamma subunits of heterotrimeric G proteins. The
           DEP domains are not involved in mediating these stimuli,
           but may be of importance for basal and stimulated levels
           Rac-GEF activity.
          Length = 93

 Score = 30.3 bits (68), Expect = 0.79
 Identities = 16/55 (29%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 152 SELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCILYNFWQD 206
           S+LVDWL++          +  G+   L   G ++HV  +  F+D+ +L+ F+ D
Sbjct: 41  SKLVDWLLAQGDCRTREEAVILGV--GLCNNGFMHHVLEKSEFKDEPLLFRFYAD 93


>gnl|CDD|239894 cd04447, DEP_BRCC3, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain found in BBRC3-like proteins. BBRC3, also known
           as DEPDC1B, is a DEP containing protein of unknown
           function.
          Length = 92

 Score = 30.2 bits (68), Expect = 0.81
 Identities = 21/65 (32%), Positives = 30/65 (46%), Gaps = 8/65 (12%)

Query: 147 RCASGSELVDWLMSLAPSLA-----VSRQITTGMWQALLEEGVIYHVN---GEQAFRDKC 198
            C + SE VDWL  L  S +     V+RQ T  + +  L+  VI  +    G++   D  
Sbjct: 28  NCFTASEAVDWLHELLRSNSNFGPEVTRQQTVQLLKKFLKNHVIEDIKGRWGKEDLEDNN 87

Query: 199 ILYNF 203
            LY F
Sbjct: 88  HLYRF 92


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 31.8 bits (73), Expect = 0.90
 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 3/40 (7%)

Query: 271 EDKEEEEKEEEEEE-RQIGKPLDRTSEDLEQIYEELLHLK 309
           E  EE+  EEE E+  +    L  + ED E+I E  L L+
Sbjct: 5   ETLEEKLNEEEREKLEKKKSSL--SEEDKEKIIERGLELE 42


>gnl|CDD|241266 cd01233, PH_KIFIA_KIFIB, KIFIA and KIFIB protein pleckstrin
           homology (PH) domain.  The kinesin-3 family motors KIFIA
           (Caenorhabditis elegans homolog unc-104) and KIFIB
           transport synaptic vesicle precursors that contain
           synaptic vesicle proteins, such as synaptophysin,
           synaptotagmin and the small GTPase RAB3A, but they do
           not transport organelles that contain plasma membrane
           proteins. They have a N-terminal motor domain, followed
           by a coiled-coil domain, and a C-terminal PH domain.
           KIF1A adopts a monomeric form in vitro, but acts as a
           processive dimer in vivo. KIF1B has alternatively
           spliced isoforms distinguished by the presence or
           absence of insertion sequences in the conserved
           amino-terminal region of the protein; this results in
           their different motor activities. KIF1A and KIF1B bind
           to RAB3 proteins through the adaptor protein
           mitogen-activated protein kinase (MAPK) -activating
           death domain (MADD; also calledDENN), which was first
           identified as a RAB3 guanine nucleotide exchange factor
           (GEF). PH domains have diverse functions, but in general
           are involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 111

 Score = 30.2 bits (69), Expect = 1.0
 Identities = 9/32 (28%), Positives = 18/32 (56%)

Query: 135 YSLYSGGRTIARRCASGSELVDWLMSLAPSLA 166
           +++Y+   +   +  S  E+ DWL ++ P LA
Sbjct: 80  FAVYTPTNSYLLQARSEKEMHDWLYAIDPLLA 111



 Score = 27.6 bits (62), Expect = 8.7
 Identities = 8/25 (32%), Positives = 13/25 (52%)

Query: 69  RTIARRCASGSELVDWLMSLAPSLA 93
            +   +  S  E+ DWL ++ P LA
Sbjct: 87  NSYLLQARSEKEMHDWLYAIDPLLA 111


>gnl|CDD|220494 pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203).
           This domain, found in various hypothetical bacterial
           proteins, has no known function.
          Length = 120

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 9/105 (8%)

Query: 247 RKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELL 306
            + L +L+      E +   + E+E+ E    E E      G   +   E  EQI     
Sbjct: 9   NELLPELRRLL---EEVREAKAELEELEAALSEGERSLEARGL-KEELRELEEQIRA--- 61

Query: 307 HLKPLHHLSNSVRRELAGVVMFEAHPRKGEILF--HQGDEGKSWY 349
            +  +  L   V+    G+V F A     E+      G+   +++
Sbjct: 62  AIAEIEELGIEVKDLDIGLVDFPALRDGEEVYLCWKYGEPEIAFW 106


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 32.2 bits (73), Expect = 1.1
 Identities = 15/76 (19%), Positives = 28/76 (36%), Gaps = 1/76 (1%)

Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRY-ILRKQLKKLQNRYLPREYMLHLEREIE 271
             + ++  + E+   E          +          +   +  +Y      L LE   E
Sbjct: 889 LESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADE 948

Query: 272 DKEEEEKEEEEEERQI 287
            ++EE+ +EEEEER  
Sbjct: 949 KEKEEDNKEEEEERNK 964


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 32.3 bits (73), Expect = 1.3
 Identities = 14/33 (42%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLE 299
           + + +D +EE+ EEEEEE +  K  D   ED E
Sbjct: 159 DEDEDDDDEEDDEEEEEEEEEIKGFD--DEDEE 189



 Score = 31.9 bits (72), Expect = 1.6
 Identities = 11/53 (20%), Positives = 30/53 (56%), Gaps = 3/53 (5%)

Query: 250 LKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIY 302
           +K+ + R+L  E M   +  + D ++E+++E++++    +  +   E+ E+I 
Sbjct: 132 IKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDD---EEDDEEEEEEEEEIK 181


>gnl|CDD|178459 PLN02868, PLN02868, acyl-CoA thioesterase family protein.
          Length = 413

 Score = 31.6 bits (72), Expect = 1.4
 Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 1/62 (1%)

Query: 297 DLEQIYEELLHLKPLHHLSNSVRRELAGVVMFEAHPRKGEILFHQGDEGKSWYIIIQGSV 356
           + E + E L  +  L  L +S  +++A VV+ +    KGE +  +G+ G   Y I +G  
Sbjct: 2   NTESVVEFLGSVPLLQRLPSSSLKKIAEVVVPK-RYGKGEYVVREGEPGDGLYFIWKGEA 60

Query: 357 DV 358
           +V
Sbjct: 61  EV 62


>gnl|CDD|204076 pfam08824, Serine_rich, Serine rich protein interaction domain.
           This is a serine rich domain that is found in the
           docking protein p130(cas) (Crk-associated substrate).
           This domain folds into a four helix bundle which is
           associated with protein-protein interactions.
          Length = 159

 Score = 30.8 bits (70), Expect = 1.5
 Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 10/55 (18%)

Query: 213 QATAQDIAEAEEHLEEAL----------LALARRAPDAILRYILRKQLKKLQNRY 257
           +A   DI  A + +EE+L          +  A +  D  L   LR+QL+ L++ Y
Sbjct: 39  EANINDIRRAVDRVEESLREFLEFARGAVGNATQLTDRNLHAKLRRQLQTLEDSY 93


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 30.9 bits (70), Expect = 1.5
 Identities = 35/140 (25%), Positives = 57/140 (40%), Gaps = 13/140 (9%)

Query: 207 KEGSSSQATAQDIAEAEEHLEEALLALARR----APDAI--LRYILRKQ-LKKLQNRYLP 259
                  A A+D A      + A  +   +      DA     Y L+K+ L+KL++    
Sbjct: 30  FAAKPVGAEAEDAAAELAEKKAAAQSEIEKFCANIADAAADQLYALQKELLEKLKDLAEV 89

Query: 260 REYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVR 319
            E +    + +E K+ E K+ EEE   I +        L +IY ++        L N + 
Sbjct: 90  NERL----KALEKKKAELKDLEEEREGILRSKQAEDGKLVKIYSKMKPDAAAAILEN-LP 144

Query: 320 RELAGVVMFEAHPRK-GEIL 338
            E A  ++ +  PRK G IL
Sbjct: 145 DEEAAAILMKLKPRKLGLIL 164


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 31.3 bits (71), Expect = 1.5
 Identities = 18/39 (46%), Positives = 27/39 (69%)

Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
           E E+E+ EE+E+EE EEE +  +PL  T E+L +  E+L
Sbjct: 81  EEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTEASEQL 119


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 31.6 bits (72), Expect = 1.5
 Identities = 15/40 (37%), Positives = 25/40 (62%), Gaps = 4/40 (10%)

Query: 267 EREIEDK----EEEEKEEEEEERQIGKPLDRTSEDLEQIY 302
           E+ ++ K    E E+KEEE  E+   K +++  EDLE++Y
Sbjct: 86  EQGLQRKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVY 125


>gnl|CDD|218954 pfam06246, Isy1, Isy1-like splicing family.  Isy1 protein is
           important in the optimisation of splicing.
          Length = 253

 Score = 31.1 bits (71), Expect = 1.6
 Identities = 14/46 (30%), Positives = 18/46 (39%)

Query: 260 REYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
              +L  E E E +  E+  EE +E Q         E  E IY  L
Sbjct: 174 DGVLLEYEAEREKELIEKLVEEWKEIQDEDLQAEEDEREEDIYIPL 219


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 31.4 bits (71), Expect = 1.6
 Identities = 30/129 (23%), Positives = 60/129 (46%), Gaps = 8/129 (6%)

Query: 220 AEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKE-EEEK 278
            E E   E+AL   A+R  +   R  L+   ++++     ++    LE  I+D + ++E 
Sbjct: 53  REREAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDEN 112

Query: 279 EEEEEE----RQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRR--ELAGVVMFEAHP 332
           EEEE E    R++ + + R  E+ E++  E   ++ + +++   RR        +     
Sbjct: 113 EEEEYEAWKLRELKR-IKRDREEREEMEREKAEIEKMRNMTEEERRAELRKNPKVSTNKA 171

Query: 333 RKGEILFHQ 341
           +KG+  F Q
Sbjct: 172 KKGKYKFLQ 180


>gnl|CDD|235359 PRK05182, PRK05182, DNA-directed RNA polymerase subunit alpha;
           Provisional.
          Length = 310

 Score = 31.2 bits (72), Expect = 1.7
 Identities = 15/37 (40%), Positives = 21/37 (56%), Gaps = 2/37 (5%)

Query: 265 HLE--REIEDKEEEEKEEEEEERQIGKPLDRTSEDLE 299
            L    ++E+ EE E+E EEEE +    L R  E+LE
Sbjct: 219 QLSVFVDLEEAEEAEEEVEEEEPEFDPILLRPIEELE 255


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 30.8 bits (70), Expect = 1.8
 Identities = 26/77 (33%), Positives = 32/77 (41%), Gaps = 12/77 (15%)

Query: 245 ILRKQLKKL---QNRYLPREYMLHLEREIEDKEE------EEKEEEEEERQIGKPLDRTS 295
           I  K+  KL   Q R   RE       E +  EE      +E+EE EEER+  K  +   
Sbjct: 3   IGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERK 62

Query: 296 EDLEQI---YEELLHLK 309
           E  EQ     EE   LK
Sbjct: 63  EREEQARKEQEEYEKLK 79



 Score = 28.9 bits (65), Expect = 6.2
 Identities = 19/66 (28%), Positives = 29/66 (43%), Gaps = 5/66 (7%)

Query: 234 ARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDR 293
           ARR           ++ K  + R   R+    LE E E K+EEE+ +E EE+       +
Sbjct: 16  ARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQAR-----K 70

Query: 294 TSEDLE 299
             E+ E
Sbjct: 71  EQEEYE 76


>gnl|CDD|223771 COG0699, COG0699, Predicted GTPases (dynamin-related) [General
           function prediction only].
          Length = 546

 Score = 31.4 bits (71), Expect = 2.0
 Identities = 19/80 (23%), Positives = 27/80 (33%)

Query: 212 SQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIE 271
                Q I    E L      +      AI   +  K+ +  Q + L R Y+  LE E+ 
Sbjct: 458 KTLEKQLIKSLLESLLILAQKIRDSVLKAIFELLKNKRKRLAQKQRLKRLYLEQLEDELL 517

Query: 272 DKEEEEKEEEEEERQIGKPL 291
              EE  E      Q  + L
Sbjct: 518 RTAEEILELRLLLEQFLEAL 537


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 31.3 bits (72), Expect = 2.0
 Identities = 22/106 (20%), Positives = 39/106 (36%), Gaps = 20/106 (18%)

Query: 208 EGSSSQATA--QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLH 265
                      Q   EAE  L+EA            L+  L ++ +KLQ      + +  
Sbjct: 523 ASLEELERELEQKAEEAEALLKEA----------EKLKEELEEKKEKLQEEE--DKLLEE 570

Query: 266 LEREIEDKEEEEKEEEEE------ERQIGKPLDRTSEDLEQIYEEL 305
            E+E +   +E K+E +E      + Q G      + +L +  + L
Sbjct: 571 AEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRL 616


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 31.4 bits (71), Expect = 2.0
 Identities = 18/65 (27%), Positives = 29/65 (44%), Gaps = 1/65 (1%)

Query: 260 REYMLHLEREIEDKEEEEKEEEEE-ERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSV 318
           +  ++ LE E+  KE E K  + E ER+     +    +L++  E  + LK L  L    
Sbjct: 67  KSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKA 126

Query: 319 RRELA 323
             E A
Sbjct: 127 ENEAA 131


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 29.3 bits (66), Expect = 2.1
 Identities = 11/16 (68%), Positives = 13/16 (81%)

Query: 269 EIEDKEEEEKEEEEEE 284
           E ++ EEEEKEEE EE
Sbjct: 82  EADEAEEEEKEEEAEE 97


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 31.3 bits (71), Expect = 2.2
 Identities = 13/45 (28%), Positives = 25/45 (55%)

Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPL 311
           E E +  ++EE+ +EEEE++  K   +  ++    +E L   KP+
Sbjct: 49  EEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTKPI 93



 Score = 29.4 bits (66), Expect = 8.3
 Identities = 20/70 (28%), Positives = 44/70 (62%), Gaps = 16/70 (22%)

Query: 244 YILRKQLKKLQNRYLPREYM-----LHLEREIE----DKEEEEKEEEEEERQIGKPLDRT 294
           Y+  +++K+L  ++   E++     L +E+E+E    D+EEEE++EE++E +     ++T
Sbjct: 2   YLEERRIKELVKKH--SEFINYPIYLWVEKEVEKEVPDEEEEEEKEEKKEEE-----EKT 54

Query: 295 SEDLEQIYEE 304
           ++  E++ EE
Sbjct: 55  TDKEEEVDEE 64


>gnl|CDD|131219 TIGR02164, torA, trimethylamine-N-oxide reductase TorA.  This very
           narrowly defined family represents TorA, part of a
           family of related molybdoenzymes that include biotin
           sulfoxide reductases, dimethyl sulfoxide reductases, and
           at least two different subfamilies of
           trimethylamine-N-oxide reductases. A single enzyme from
           the larger family may have more than one activity. TorA
           typically is located in the periplasm, has a Tat
           (twin-arginine translocation)-dependent signal sequence,
           and is encoded in a torCAD operon.
          Length = 822

 Score = 31.4 bits (71), Expect = 2.3
 Identities = 16/67 (23%), Positives = 32/67 (47%), Gaps = 12/67 (17%)

Query: 385 PRLIITSMVFHQGDEGKSWYIIIQGSVDVVIYGKGCVTSLYAGEDFGKLALVNNAPSYQL 444
           P ++ ++ V+ QG    SW +I++ S  +V++    V +L  G          N  +++ 
Sbjct: 189 PYVLGSTEVYAQG---TSWPLILENSDTIVLWANDPVKNLQVGW---------NCETHES 236

Query: 445 HMYLEQM 451
             YL Q+
Sbjct: 237 FAYLAQL 243


>gnl|CDD|183384 PRK11924, PRK11924, RNA polymerase sigma factor; Provisional.
          Length = 179

 Score = 30.3 bits (69), Expect = 2.4
 Identities = 24/121 (19%), Positives = 42/121 (34%), Gaps = 17/121 (14%)

Query: 218 DIAEAEEHLEEALLALARRAPD------------AILRYILRKQLKKLQNRYLPREYMLH 265
           D A AE+ ++EA L   R+A               I R +    L++ +           
Sbjct: 36  DRALAEDAVQEAFLRAWRKADLFNGKGSARTWLLTIARNVCYDLLRRRRREKAVLSDD-- 93

Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGV 325
                  +  E  E     +     +DR  + L     E+  L+ +  LS    RE+A +
Sbjct: 94  ALEPEFAETAETPEAALLAKDDLARIDRCLDALPVKQREVFLLRYVEGLS---YREIAEI 150

Query: 326 V 326
           +
Sbjct: 151 L 151


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 29.9 bits (68), Expect = 2.4
 Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 3/44 (6%)

Query: 247 RKQLKKLQN-RYLPREYMLHL--EREIEDKEEEEKEEEEEERQI 287
           +K  KKL   R   R+ +     E+E E++EE E EE +EE QI
Sbjct: 77  KKDFKKLLKWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQI 120


>gnl|CDD|183004 PRK11161, PRK11161, fumarate/nitrate reduction transcriptional
           regulator; Provisional.
          Length = 235

 Score = 30.5 bits (69), Expect = 2.6
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 334 KGEILFHQGDEGKSWYIIIQGSV 356
           KG+ LF  GDE KS Y I  G++
Sbjct: 44  KGQTLFKAGDELKSLYAIRSGTI 66


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 30.6 bits (69), Expect = 2.6
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
           L++E E   +E +E E+E+  +   L    E+ EQ+  E
Sbjct: 75  LKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENE 113


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 30.1 bits (68), Expect = 2.7
 Identities = 26/99 (26%), Positives = 45/99 (45%), Gaps = 4/99 (4%)

Query: 210 SSSQATAQDIAEAEEHLEEALLALARRAPD-AILRYILRKQLKKLQNRY--LPREYMLHL 266
           S  +    ++ E  E LEE +  L     D      +LR+ LK L+ R   L       L
Sbjct: 82  SELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKE-L 140

Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
            +E+ +  ++ +EE EE R+  + L    + L++  +EL
Sbjct: 141 AKELRELRQDLREEVEELREELERLQENLQRLQEAIQEL 179


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 30.4 bits (69), Expect = 2.7
 Identities = 25/104 (24%), Positives = 37/104 (35%), Gaps = 19/104 (18%)

Query: 204 WQDKEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYM 263
             D+     +   +D  E  E  EE       +  D          ++KLQ         
Sbjct: 53  ANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDE-----ASSIIEKLQ--------- 98

Query: 264 LHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLH 307
           + +ERE E+ EEE +   EE +  G       E  E+  EE L 
Sbjct: 99  MQIEREQEEWEEELERLIEEAKAEGY-----EEGYEKGREEGLE 137


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 30.5 bits (69), Expect = 2.7
 Identities = 16/62 (25%), Positives = 30/62 (48%), Gaps = 2/62 (3%)

Query: 224 EHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEE 283
             L+E  + L R+A +A   ++  KQ ++ +      E     ERE E +E E ++   +
Sbjct: 74  RSLKE--VKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAK 131

Query: 284 ER 285
           E+
Sbjct: 132 EK 133


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 30.8 bits (70), Expect = 2.8
 Identities = 33/194 (17%), Positives = 66/194 (34%), Gaps = 34/194 (17%)

Query: 221 EAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRY----LPREYMLHLEREIEDKEEE 276
           E  E     L          I    L  +L++ +            +   +R IE  E+E
Sbjct: 432 ERLEEENSELKRELEELKREI--EKLESELERFRREVRDKVRKDREIRARDRRIERLEKE 489

Query: 277 EKEEEEEERQIGKPLDRTSEDLEQIYE--------ELLHLKPLHHLSNSVRRELAGVVMF 328
            +E+++   +    L+R   +L ++ +         +  ++ L   +     E  G+   
Sbjct: 490 LEEKKKRVEE----LERKLAELRKMRKLELSGKGTPVKVVEKLTLEAIEEAEEEYGI--- 542

Query: 329 EAHPRKGEILFHQGDEGKSWYII---IQGSVDVVIYGKGCVTSLYAGEDFGKLALVNNAP 385
               ++G+++  +   G         I      +I G+    S  A E+F K    N  P
Sbjct: 543 ----KEGDVILVEDPSGGGARTAEELIDKKPRAIIRGEE--MSHAAAEEFFK----NEIP 592

Query: 386 RLIITSMVFHQGDE 399
            L    +   + DE
Sbjct: 593 VLPEGDVQIIRLDE 606


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 30.0 bits (68), Expect = 3.5
 Identities = 15/67 (22%), Positives = 21/67 (31%), Gaps = 2/67 (2%)

Query: 538 MGLSNVAVSRLSLTWDKLPSKSKKTYTELEALIDPSKNHRAYRQAVSKLQSPVIPFMPLL 597
           MG+ N A  +L     K P++S  T      L    K       A  +         P  
Sbjct: 6   MGIKNFAFCKLKGPSMKAPTRSLPTLNAWLKLNLDEKIGADDAPATDEKSKIS--LSPRY 63

Query: 598 LKDLAFT 604
              LA+ 
Sbjct: 64  YAFLAYK 70


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 30.4 bits (69), Expect = 3.7
 Identities = 20/50 (40%), Positives = 28/50 (56%)

Query: 235 RRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEE 284
           RR  + I R+I R  L++ + R  P       E E E++E  E+EEEEEE
Sbjct: 275 RRRGEMIRRFIDRLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEE 324


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 29.5 bits (66), Expect = 3.8
 Identities = 18/58 (31%), Positives = 29/58 (50%), Gaps = 3/58 (5%)

Query: 247 RKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
           +   KKL+ R  P +     E + + + EE +E+EEE       LD   E+ E++ EE
Sbjct: 4   KGNAKKLKVRPPPTKDE---EEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEE 58


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 30.2 bits (68), Expect = 3.9
 Identities = 17/84 (20%), Positives = 31/84 (36%), Gaps = 4/84 (4%)

Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQL-KKLQNRYLPREYMLHLERE-I 270
            A  +   +    +EE + AL     D + +  L  +L K + ++           R+ I
Sbjct: 30  NAEKKSKNDKLARIEEEIEALKTLDLDQLAKKYLFSKLIKVITSKPTKAWAESPEFRQII 89

Query: 271 EDKEEEEK--EEEEEERQIGKPLD 292
            DK  +      E E  +  K L+
Sbjct: 90  NDKSHKANPSRSEAEVVEKLKSLN 113


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 30.0 bits (68), Expect = 3.9
 Identities = 21/98 (21%), Positives = 44/98 (44%), Gaps = 11/98 (11%)

Query: 214 ATAQDI-AEAEEHLEEA-LLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIE 271
           A  Q++  E  E  E+     L  R+  ++ R  + +  KK Q   L  E    L ++I+
Sbjct: 83  AKLQELRKEYRELKEKRNEFNLGGRSIKSLER-EIERLEKKQQTSVLTPEEERELVQKIK 141

Query: 272 DKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
           +  +E ++ ++   +        +E L+++  E+  LK
Sbjct: 142 ELRKELEDAKKALEE--------NEKLKELKAEIDELK 171


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 30.3 bits (69), Expect = 4.0
 Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 4/67 (5%)

Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEE 276
           +++ E EE LE+    L +    A  R  L  + ++   +    E    L  E+E+ EEE
Sbjct: 341 EELKELEEELEKIKKLLKKLPKKA--RGQLPPEKREQLEKL--LETKEKLSEELEELEEE 396

Query: 277 EKEEEEE 283
            KE +EE
Sbjct: 397 LKELKEE 403


>gnl|CDD|220222 pfam09403, FadA, Adhesion protein FadA.  FadA (Fusobacterium
           adhesin A) is an adhesin which forms two alpha helices.
          Length = 126

 Score = 28.7 bits (64), Expect = 4.3
 Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 3/70 (4%)

Query: 206 DKEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLH 265
           ++E    +   +++ E +E         A+   ++  R+  + + KKL  RY   EY+  
Sbjct: 48  EEEKQEKETAEKEVQELKERQLGREELYAKLKEESKTRF-HKDEYKKLLKRY--DEYLNK 104

Query: 266 LEREIEDKEE 275
           LE++I DKE+
Sbjct: 105 LEQKITDKEQ 114


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 29.5 bits (67), Expect = 4.3
 Identities = 17/67 (25%), Positives = 32/67 (47%), Gaps = 1/67 (1%)

Query: 244 YILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIY- 302
           + +RK L+  Q +    + +  LE E E+ E+   E E +   I K  +   +  E+ + 
Sbjct: 110 FGMRKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHA 169

Query: 303 EELLHLK 309
           +E+  LK
Sbjct: 170 DEIAFLK 176


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 29.4 bits (66), Expect = 4.4
 Identities = 11/40 (27%), Positives = 24/40 (60%), Gaps = 5/40 (12%)

Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDL--EQIYEE 304
           E   ++K+E+++EEEEE+ +    ++   +    E+I E+
Sbjct: 41  EDVKDEKQEDDEEEEEEDEEE---IEEPEDIEDEEEIVED 77


>gnl|CDD|205495 pfam13315, DUF4085, Protein of unknown function (DUF4085).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 101 and 269 amino
           acids in length.
          Length = 208

 Score = 29.8 bits (67), Expect = 4.5
 Identities = 17/47 (36%), Positives = 23/47 (48%), Gaps = 3/47 (6%)

Query: 264 LHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKP 310
           L      E+ EEE +EEEE        L    E++E+I E+LL   P
Sbjct: 17  LSFPESQEEWEEELREEEESGNDY---LQSLKEEIEEIKEDLLKFLP 60


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 30.1 bits (67), Expect = 5.0
 Identities = 28/109 (25%), Positives = 50/109 (45%), Gaps = 11/109 (10%)

Query: 207  KEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRK----------QLKKLQN- 255
            K+    +  A++  +AEE  ++A  AL + A +A     L+K          +LKK +  
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727

Query: 256  RYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
              +  E       E + K EE K++EEE+++I        +  E+I +E
Sbjct: 1728 NKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKE 1776


>gnl|CDD|239885 cd04438, DEP_dishevelled, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain found in dishevelled-like proteins.
           Dishevelled-like proteins play a key role in the
           transduction of the Wnt signal from the cell surface to
           the nucleus, which in turn is an important regulatory
           pathway for cellular development and growth. They
           contain an N-terminal DIX domain, a central PDZ domain,
           and a C-terminal DEP domain.
          Length = 84

 Score = 27.7 bits (62), Expect = 5.1
 Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 7/51 (13%)

Query: 151 GSELVDWLMSLAPSLAVS---RQITTGMWQALLEEGVIYHVNGEQAFRDKC 198
           GS+LVDWL+S    L      R+  + + +     G I H   +  F ++C
Sbjct: 32  GSDLVDWLLSHVEGLTDRREARKYASSLLKL----GYIRHTVNKITFSEQC 78


>gnl|CDD|182730 PRK10787, PRK10787, DNA-binding ATP-dependent protease La;
           Provisional.
          Length = 784

 Score = 29.9 bits (67), Expect = 5.1
 Identities = 30/124 (24%), Positives = 56/124 (45%), Gaps = 13/124 (10%)

Query: 218 DIAEAEEHL------EEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIE 271
           D+ E  E+L      E  LL + +R     +R  ++KQ++K Q  Y   E M  +++E+ 
Sbjct: 188 DVNERLEYLMAMMESEIDLLQVEKR-----IRNRVKKQMEKSQREYYLNEQMKAIQKELG 242

Query: 272 DKEEEEKEEEEEERQI--GKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGVVMFE 329
           + ++   E E  +R+I   K      E  E   ++L  + P+   +  VR  +  +V   
Sbjct: 243 EMDDAPDENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMSAEATVVRGYIDWMVQVP 302

Query: 330 AHPR 333
            + R
Sbjct: 303 WNAR 306


>gnl|CDD|177972 PLN02338, PLN02338, 3-phosphoshikimate 1-carboxyvinyltransferase.
          Length = 443

 Score = 30.1 bits (68), Expect = 5.1
 Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 4/53 (7%)

Query: 596 LLLKDLAFTHDGNKTVVDGLVNFEKMHMFAQTLRTLRYCRTRHLDHDKTVVDG 648
           LLL  L+   +G  TVVD L++ + +      L+TL        ++++ VV+G
Sbjct: 29  LLLAALS---EG-TTVVDNLLDSDDIRYMLGALKTLGLNVEEDSENNRAVVEG 77


>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
           transcription initiation/nucleotide excision repair
           factor TFIIH, subunit TFB3 [Cell division and chromosome
           partitioning / Transcription / DNA replication,
           recombination, and repair].
          Length = 314

 Score = 29.6 bits (66), Expect = 5.1
 Identities = 20/87 (22%), Positives = 35/87 (40%), Gaps = 4/87 (4%)

Query: 223 EEHLEEALLALARRAPDAILRYILRKQLK----KLQNRYLPREYMLHLEREIEDKEEEEK 278
               EE +        D+IL  + R   +    K +N    R  +   + E + +EEE  
Sbjct: 126 VSLTEEKVKKYEEMNQDSILNNLERPTPEVMPGKQKNVLQKRMKLKKRQLERQIEEEERM 185

Query: 279 EEEEEERQIGKPLDRTSEDLEQIYEEL 305
            +E  +++IG  L   SED  +  +  
Sbjct: 186 NKEWTKKEIGNRLGTASEDGNKTIKIG 212


>gnl|CDD|238663 cd01367, KISc_KIF2_like, Kinesin motor domain, KIF2-like group.
           KIF2 is a protein expressed in neurons, which has been
           associated with axonal transport and neuron development;
           alternative splice forms have been implicated in
           lysosomal translocation. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Kinesins are microtubule-dependent
           molecular motors that play important roles in
           intracellular transport and in cell division. In this
           subgroup the motor domain is found in the middle
           (M-type) of the protein chain. M-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second (KIF2 may be slower). To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 322

 Score = 29.5 bits (67), Expect = 5.3
 Identities = 21/69 (30%), Positives = 30/69 (43%), Gaps = 16/69 (23%)

Query: 571 DPSKNHRAYRQAVSKLQSPVIPFMPLLLKDLAFTHDGN-KTVVDGLVNFEKMHMFAQTLR 629
             +K H  +R   SKL          +L+D +F   GN KTV+   ++         TL 
Sbjct: 268 ASNKAHVPFRG--SKLT--------QVLRD-SF--IGNSKTVMIATIS-PSASSCEHTLN 313

Query: 630 TLRY-CRTR 637
           TLRY  R +
Sbjct: 314 TLRYADRVK 322


>gnl|CDD|222480 pfam13968, DUF4220, Domain of unknown function (DUF4220).  This
           family is found in plants and is often associated with
           DUF294, pfam04578.
          Length = 362

 Score = 29.6 bits (67), Expect = 5.3
 Identities = 21/116 (18%), Positives = 40/116 (34%), Gaps = 6/116 (5%)

Query: 228 EALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIE---DKEEEEKEEEEEE 284
           E + AL   + ++I   +L +             Y   +   ++       EE  +    
Sbjct: 104 ERIWALRSASMESIRSSMLGEPDPGPNYAKFMEGYKYLVAEGLQAKIVIGREEDVDLNSV 163

Query: 285 RQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGVVMFEAHPRKGEILFH 340
            Q+ + L       +   E+ L L   H L   +RR    +++ E   RK +  F 
Sbjct: 164 EQVWEQLLLIILK-DGDEEKDLCLA--HALFKLLRRLFVDLILSEQDRRKSQEFFR 216


>gnl|CDD|215276 PLN02500, PLN02500, cytochrome P450 90B1.
          Length = 490

 Score = 29.8 bits (67), Expect = 5.5
 Identities = 22/71 (30%), Positives = 38/71 (53%), Gaps = 4/71 (5%)

Query: 266 LEREIED---KEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRREL 322
           +ER++E+   K +EE E  EE+  +G  L  ++   EQI + +L L    H ++SV   L
Sbjct: 242 IERKMEERIEKLKEEDESVEEDDLLGWVLKHSNLSTEQILDLILSLLFAGHETSSVAIAL 301

Query: 323 AGVVMFEAHPR 333
           A +   +  P+
Sbjct: 302 A-IFFLQGCPK 311


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 29.8 bits (68), Expect = 5.6
 Identities = 18/67 (26%), Positives = 38/67 (56%), Gaps = 1/67 (1%)

Query: 246 LRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
           +++Q++K Q  Y  RE +  +++E+ + ++++ E EE   +I K L    E  E+  +EL
Sbjct: 218 VKEQMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELREKIEK-LKLPKEAKEKAEKEL 276

Query: 306 LHLKPLH 312
             L+ + 
Sbjct: 277 KKLETMS 283


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 557

 Score = 29.9 bits (68), Expect = 5.7
 Identities = 34/173 (19%), Positives = 61/173 (35%), Gaps = 36/173 (20%)

Query: 210 SSSQATAQDIAEAEEHLEEALLALARR---APDAILRYILRKQLKKLQNRYLPREYMLHL 266
              +     + E EE L   L +LAR+     + +L Y     L K++            
Sbjct: 290 DELEFDPNRLEEVEERLFA-LKSLARKYGVTIEDLLEY-----LDKIKEEL--------- 334

Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGVV 326
             ++++ EE  +  E+E +++   L   +E L  I ++         L   V  EL  + 
Sbjct: 335 -AQLDNSEESLEALEKEVKKLKAELLEAAEALSAIRKKA-----AKELEKEVTAELKALA 388

Query: 327 MFEAHPRKGEILFHQGDEGKSWYIIIQGSVDVVIYGKGCVTSLYAGEDFGKLA 379
           M +A   +  +     +E  +      G+  V         S   GE    LA
Sbjct: 389 MEKA---RFTVELKPLEESPT----ADGADKVEFL-----ISTNPGEPLKPLA 429


>gnl|CDD|220212 pfam09376, NurA, NurA domain.  This family includes NurA a nuclease
           exhibiting both single-stranded endonuclease activity
           and 5'-3' exonuclease activity on single-stranded and
           double-stranded DNA from the hyperthermophilic archaeon
           Sulfolobus acidocaldarius.
          Length = 260

 Score = 29.5 bits (66), Expect = 6.0
 Identities = 24/139 (17%), Positives = 38/139 (27%), Gaps = 26/139 (18%)

Query: 256 RYLPREYMLHLEREIEDKEEEEKE---------------EEEEERQIGKPLDRTSEDLEQ 300
           R   RE M  LE        +E+                   E  +    L      LE 
Sbjct: 71  RRRVRELMALLELAAALLLLKEEGPDLILLDGSLIPTLIRLPEYLERILELLYLLRLLEL 130

Query: 301 IYEELLHLKPLHHLSNSVRRELAGVVMFEAHPRK---------GEILFHQGDEGKSWYII 351
               L  +  +  +  S   EL  ++  E              G +L   G       ++
Sbjct: 131 A--LLEGIPLVGIVKTSRSSELLNLLRDELEEILPGISDRALLGLLLEKGGYREPIPLLL 188

Query: 352 IQGSVDVVIYGKGCVTSLY 370
           I G + V +Y    +   Y
Sbjct: 189 ISGPLLVELYEDDEIYFFY 207


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 29.6 bits (66), Expect = 6.1
 Identities = 23/144 (15%), Positives = 52/144 (36%), Gaps = 8/144 (5%)

Query: 205 QDKEGSSSQATAQDIAEA--EEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREY 262
           + K  S+ Q+ ++       +   E   +    +      R       +  +   + +  
Sbjct: 83  EFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREERE------EVEETEGVTKSE 136

Query: 263 MLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRREL 322
             +  R+ E+ ++EEKE E EE +  K       + E +  +L H +       +   ++
Sbjct: 137 QKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQV 196

Query: 323 AGVVMFEAHPRKGEILFHQGDEGK 346
                FE   +K +    + +E K
Sbjct: 197 EAGKEFEKLKQKQQEAALELEELK 220


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 27.3 bits (61), Expect = 6.2
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 266 LEREIEDKEEEEKEEEEEERQ 286
           L+ E E +EEEE+  + EER+
Sbjct: 29  LKAEEEKREEEEEARKREERK 49



 Score = 26.9 bits (60), Expect = 6.7
 Identities = 13/44 (29%), Positives = 25/44 (56%), Gaps = 4/44 (9%)

Query: 263 MLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELL 306
           +   ++E++ +EE+ +EEEE      K  +R   +  + +EELL
Sbjct: 22  LKAKKKELKAEEEKREEEEEAR----KREERKEREKNKSFEELL 61


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 29.4 bits (67), Expect = 6.3
 Identities = 11/18 (61%), Positives = 14/18 (77%)

Query: 267 EREIEDKEEEEKEEEEEE 284
           E E E++EEEE+EE  EE
Sbjct: 302 EEEEEEEEEEEEEEPSEE 319



 Score = 29.1 bits (66), Expect = 9.3
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 267 EREIEDKEEEEKEEEEEERQIG 288
             E E++EEEE+EEE  E +  
Sbjct: 301 AEEEEEEEEEEEEEEPSEEEAA 322


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 29.9 bits (68), Expect = 6.4
 Identities = 20/98 (20%), Positives = 41/98 (41%), Gaps = 12/98 (12%)

Query: 216  AQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREY---MLHLEREIED 272
            A ++      L E L     +A     R   R+QL++ Q +    +Y   +  L+   + 
Sbjct: 976  AAEMLAKNSDLNEKLRQRLEQAEQE--RTRAREQLRQAQAQL--AQYNQVLASLKSSYDA 1031

Query: 273  KEEEEKEEEEEERQIGKPLD-----RTSEDLEQIYEEL 305
            K +  +E ++E + +G P D     R     ++++  L
Sbjct: 1032 KRQMLQELKQELQDLGVPADSGAEERARARRDELHARL 1069


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 29.4 bits (66), Expect = 6.5
 Identities = 13/47 (27%), Positives = 25/47 (53%), Gaps = 5/47 (10%)

Query: 261 EYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLH 307
           E  L  E+E E++     ++EEEE+Q+ K  ++     + + +EL  
Sbjct: 148 EEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNK-----QALLDELET 189


>gnl|CDD|233703 TIGR02063, RNase_R, ribonuclease R.  This family consists of an
           exoribonuclease, ribonuclease R, also called VacB. It is
           one of the eight exoribonucleases reported in E. coli
           and is broadly distributed throughout the bacteria. In
           E. coli, double mutants of this protein and
           polynucleotide phosphorylase are not viable. Scoring
           between trusted and noise cutoffs to the model are
           shorter, divergent forms from the Chlamydiae, and
           divergent forms from the Campylobacterales (including
           Helicobacter pylori) and Leptospira interrogans
           [Transcription, Degradation of RNA].
          Length = 709

 Score = 29.5 bits (67), Expect = 6.5
 Identities = 16/57 (28%), Positives = 29/57 (50%), Gaps = 5/57 (8%)

Query: 235 RRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEE----EKEEEEEERQI 287
           RR PD I+  +++K L   +N     +   +LE ++E+  E     E+  +E ER +
Sbjct: 559 RRYPDLIVHRLIKKALFGGEN-TTTEKEREYLEAKLEEIAEHSSKTERRADEAERDV 614


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score = 29.5 bits (67), Expect = 6.5
 Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 5/52 (9%)

Query: 250 LKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQI 301
           L KL++  L       LE+E      EE+  E E  ++ + +DR  E++E++
Sbjct: 22  LNKLEDTKLKL-----LEKEKRLLLLEEQRLEAEGLRLKREVDRLREEIERL 68


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 29.7 bits (67), Expect = 6.7
 Identities = 23/96 (23%), Positives = 36/96 (37%), Gaps = 16/96 (16%)

Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIED 272
            A  Q++A A+  L +AL  L     D              +   LP      ++     
Sbjct: 413 AALEQEVAVAKRELAQALSRLGLWRGDL------------EELVALPVPSAEQVQA---- 456

Query: 273 KEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHL 308
             +E +E  + +RQ    L    EDLE +  +L HL
Sbjct: 457 FLKEAEEIMQAKRQKRDRLLELEEDLETLELQLRHL 492


>gnl|CDD|147887 pfam05974, DUF892, Domain of unknown function (DUF892).  This
           family consists of several hypothetical bacterial
           proteins of unknown function.
          Length = 159

 Score = 28.7 bits (65), Expect = 7.0
 Identities = 13/45 (28%), Positives = 23/45 (51%), Gaps = 7/45 (15%)

Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYIL-------RKQLKKLQ 254
           +DI  AE+   +AL  +A+ A D  L+  L         Q+++L+
Sbjct: 11  RDIYAAEKQALKALPKMAKAAEDPELKAALEQHLEETEGQIERLE 55


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 29.4 bits (66), Expect = 7.3
 Identities = 14/34 (41%), Positives = 21/34 (61%)

Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLE 299
           +ER  E++EEEE+EEEEE  +   P++      E
Sbjct: 398 VERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFE 431


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 28.4 bits (64), Expect = 7.4
 Identities = 28/99 (28%), Positives = 41/99 (41%), Gaps = 21/99 (21%)

Query: 216 AQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
           A D+AEAE   EEA   LA             ++L++ + +    E +   ++E E   E
Sbjct: 43  ADDLAEAERLKEEAQALLAEY----------EQELEEAREQA--SEIIEQAKKEAEQIAE 90

Query: 276 EEKEEEEEER---------QIGKPLDRTSEDLEQIYEEL 305
           E K E EEE          +I    +R  E+L     EL
Sbjct: 91  EIKAEAEEELERIKEAAEAEIEAEKERALEELRAEVAEL 129


>gnl|CDD|237722 PRK14476, PRK14476, nitrogenase molybdenum-cofactor biosynthesis
           protein NifN; Provisional.
          Length = 455

 Score = 29.4 bits (67), Expect = 8.1
 Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 5/34 (14%)

Query: 213 QATAQD----IAEAEEHLEEALLALARRA-PDAI 241
           Q TA D    I   +E++EEA+L + ++A P  I
Sbjct: 63  QTTAMDEVTTILGGDENVEEAILNICKKAKPKII 96


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 29.3 bits (66), Expect = 8.3
 Identities = 11/39 (28%), Positives = 16/39 (41%)

Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
            +R  E +EEE ++ EEE  Q  +     S        E
Sbjct: 379 EQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSE 417


>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin. 
          Length = 237

 Score = 28.8 bits (65), Expect = 8.6
 Identities = 20/90 (22%), Positives = 42/90 (46%), Gaps = 14/90 (15%)

Query: 211 SSQATAQDIAEAEEHLEEALLALARRAPDAILRY-ILRKQLKKLQNRYLPREYMLHLERE 269
           S + + +  ++ E+  EE +  L  +  +A  R     + ++KL             E+E
Sbjct: 159 SLEVSEEKASQREDSYEEKIRDLTEKLKEAETRAEFAERSVQKL-------------EKE 205

Query: 270 IEDKEEEEKEEEEEERQIGKPLDRTSEDLE 299
           ++  E+E   E+E+ + I + LD+T  +L 
Sbjct: 206 VDRLEDELLAEKEKYKAISEELDQTLAELT 235


>gnl|CDD|224228 COG1309, AcrR, Transcriptional regulator [Transcription].
          Length = 201

 Score = 28.8 bits (63), Expect = 8.6
 Identities = 19/86 (22%), Positives = 27/86 (31%)

Query: 200 LYNFWQDKEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLP 259
           LY  +  KE        + + E    LEEALL  A    +A+L  +L        +  L 
Sbjct: 49  LYRHFPSKEDLLLALLERALEELRALLEEALLDGAPEELEALLALLLAYLEFARDDPRLL 108

Query: 260 REYMLHLEREIEDKEEEEKEEEEEER 285
              +          E E    E    
Sbjct: 109 LLLLAAALAGELAPELEAILRELLAL 134


>gnl|CDD|219663 pfam07957, DUF3294, Protein of unknown function (DUF3294).  This
           family was annotated as mitochondrial Ribosomal protein
           MRP8, based on the presumed similarity of the
           S.cerevisiae protein to an E.coli mitochondrial
           ribosomal protein; however, this similarity is spurious,
           and the function is not known [Wood, V].
          Length = 217

 Score = 28.6 bits (64), Expect = 8.8
 Identities = 7/43 (16%), Positives = 23/43 (53%)

Query: 245 ILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQI 287
           +L    ++ Q     ++   H+E E  D++ E++ ++  ++++
Sbjct: 153 LLLADQEEFQEFLQGKKEAFHIESEERDEDIEKEIKKYSKKEL 195


>gnl|CDD|216831 pfam01988, VIT1, VIT family.  This family includes the vacuolar
           Fe2+/Mn2+ uptake transporter, Ccc1 and the vacuolar iron
           transporter VIT1.
          Length = 209

 Score = 28.8 bits (65), Expect = 9.0
 Identities = 14/49 (28%), Positives = 23/49 (46%)

Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHL 314
           LERE  + E + + E EE  +I +    + E   Q+  EL+    L  +
Sbjct: 67  LEREKRELENDPEGEREELAEIYRARGLSEELARQVARELMKKDALDAM 115


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 29.3 bits (66), Expect = 9.3
 Identities = 19/78 (24%), Positives = 33/78 (42%), Gaps = 8/78 (10%)

Query: 217 QDIAEAEEHLEEAL--LALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKE 274
             + E E+  +E L  + L  +  D         +LKK +  Y P +     +R I  K 
Sbjct: 156 TGVLEDEDEGDE-LENVELVEKEKDK-----KNLELKKKKPDYDPDDDDKFNKRSILSKY 209

Query: 275 EEEKEEEEEERQIGKPLD 292
           +EE E ++++      LD
Sbjct: 210 DEEIEGKKKKSDNLFTLD 227


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.135    0.398 

Gapped
Lambda     K      H
   0.267   0.0696    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 37,146,043
Number of extensions: 3795955
Number of successful extensions: 7461
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6801
Number of HSP's successfully gapped: 334
Length of query: 714
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 610
Effective length of database: 6,324,786
Effective search space: 3858119460
Effective search space used: 3858119460
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.1 bits)