RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4663
         (966 letters)



>gnl|CDD|214539 smart00147, RasGEF, Guanine nucleotide exchange factor for Ras-like
           small GTPases. 
          Length = 242

 Score =  240 bits (614), Expect = 6e-73
 Identities = 102/249 (40%), Positives = 153/249 (61%), Gaps = 13/249 (5%)

Query: 439 ILVLHPVEIARQLTLLESEYYRAVKPSEIVGSAWTKSNKEEKSP-NLLKIIKHTTNFTRW 497
           +L+L P E+A QLTLL+ E +R + PSE++GS W K +K+  SP NL   I+     + W
Sbjct: 1   LLLLDPKELAEQLTLLDFELFRKIDPSELLGSVWGKRSKKSPSPLNLEAFIRRFNEVSNW 60

Query: 498 LEKTIVETENLEERIAIVSRAIEIMMVLNDLNNFNGVLAIVSALGSASVYRLKCTFQALP 557
           +   I++    ++R  ++S+ I++     +LNNFN ++AIVSAL S+ + RLK T++ LP
Sbjct: 61  VATEILKQTTPKDRAELLSKFIQVAKHCRELNNFNSLMAIVSALSSSPISRLKKTWEKLP 120

Query: 558 ARLENALVELREL--GNDHFRKYQERLRSIN-PPCVPFFGMYLTNILHIEEGNPDYLPNS 614
           ++ +    EL EL     +++ Y+E L S N PPC+PF G+ L ++  I+EGNPD+L N 
Sbjct: 121 SKYKKLFEELEELLSPERNYKNYREALSSCNLPPCIPFLGVLLKDLTFIDEGNPDFLEN- 179

Query: 615 NEELINFSKRRRVAEITGEIQQFQNSPYCLSVEPR-IRRFLENLTPFDNWKEIDISNYLY 673
              L+NF KRR++AEI  EI+Q Q+ PY L      I+  L+ L   D+  E      LY
Sbjct: 180 --GLVNFEKRRQIAEILREIRQLQSQPYNLRPNRSDIQSLLQQL--LDHLDEE---EELY 232

Query: 674 EQSLKIEPR 682
           + SLKIEPR
Sbjct: 233 QLSLKIEPR 241


>gnl|CDD|238087 cd00155, RasGEF, Guanine nucleotide exchange factor for Ras-like
           small GTPases. Small GTP-binding proteins of the Ras
           superfamily function as molecular switches in
           fundamental events such as signal transduction,
           cytoskeleton dynamics and intracellular trafficking.
           Guanine-nucleotide-exchange factors (GEFs) positively
           regulate these GTP-binding proteins in response to a
           variety of signals. GEFs catalyze the dissociation of
           GDP from the inactive GTP-binding proteins. GTP can then
           bind and induce structural changes that allow
           interaction with effectors.
          Length = 237

 Score =  239 bits (611), Expect = 1e-72
 Identities = 98/245 (40%), Positives = 152/245 (62%), Gaps = 13/245 (5%)

Query: 439 ILVLHPVEIARQLTLLESEYYRAVKPSEIVGSAWTKSNKE-EKSPNLLKIIKHTTNFTRW 497
            L L P E+A QLTLL+ E +R ++P E++GS W+K +K    SPNL + I+   N + W
Sbjct: 1   FLSLDPKELAEQLTLLDFELFRKIEPFELLGSLWSKKDKNIHLSPNLERFIERFNNLSNW 60

Query: 498 LEKTIVETENLEERIAIVSRAIEIMMVLNDLNNFNGVLAIVSALGSASVYRLKCTFQALP 557
           +   I+   N ++R  ++S+ I++     +LNNFN ++AIVSAL S+ + RLK T++ L 
Sbjct: 61  VASEILLCTNPKKRARLLSKFIQVAKHCRELNNFNSLMAIVSALSSSPISRLKKTWEVLS 120

Query: 558 ARLENALVELRELG--NDHFRKYQERLRSI--NPPCVPFFGMYLTNILHIEEGNPDYLPN 613
           ++L+    EL EL   + +F+ Y++ L+S+  NPPCVPF G+YL ++  + EGNPD+L  
Sbjct: 121 SKLKKLFEELEELVDPSRNFKNYRKLLKSVGPNPPCVPFLGVYLKDLTFLHEGNPDFLEG 180

Query: 614 SNEELINFSKRRRVAEITGEIQQFQNSPYCLSVEPRIRRFLENLTPFDNWKEIDISNYLY 673
               L+NF KRR++AEI  EI+Q Q++ Y L+ +  I  FL  L      + I   + LY
Sbjct: 181 ---NLVNFEKRRKIAEILREIRQLQSNSYELNRDEDILAFLWKL-----LELILNEDELY 232

Query: 674 EQSLK 678
           E SL+
Sbjct: 233 ELSLE 237


>gnl|CDD|216025 pfam00617, RasGEF, RasGEF domain.  Guanine nucleotide exchange
           factor for Ras-like small GTPases.
          Length = 185

 Score =  210 bits (538), Expect = 5e-63
 Identities = 80/189 (42%), Positives = 122/189 (64%), Gaps = 6/189 (3%)

Query: 440 LVLHPVEIARQLTLLESEYYRAVKPSEIVGSAWTKSNKEEKSPNLLKIIKHTTNFTRWLE 499
           L L P+E+ARQLTL+ESE + A+ P E++ SAW K + +  SPN+   I      +RW+ 
Sbjct: 1   LDLDPLELARQLTLIESELFSAIDPRELLDSAWGKKDSK-LSPNIEAFISRFNQLSRWVA 59

Query: 500 KTIVETENLEERIAIVSRAIEIMMVLNDLNNFNGVLAIVSALGSASVYRLKCTFQALPAR 559
             I+  E+L++R  ++ + I+I     +LNNFN ++AI+S L S+ + RLK T++ +  +
Sbjct: 60  TEILSEEDLKKRAKVIKKFIKIAQHCRELNNFNSLMAIISGLNSSPISRLKKTWELVSKK 119

Query: 560 LENALVELREL--GNDHFRKYQERLRSINPPCVPFFGMYLTNILHIEEGNPDYLPNSNEE 617
            +  L EL EL   + +F+ Y+E L+  NPPC+PF G+YL ++  IEEGNPD+L N    
Sbjct: 120 YKKLLEELEELMSPSKNFKNYREALKKANPPCIPFLGLYLKDLTFIEEGNPDFLDN---G 176

Query: 618 LINFSKRRR 626
           LINF KRR+
Sbjct: 177 LINFEKRRK 185


>gnl|CDD|241292 cd01261, PH_SOS, Son of Sevenless (SOS) Pleckstrin homology (PH)
           domain.  SOS is a Ras guanine nucleotide exchange
           factor. SOS is thought to transmit signals from
           activated receptor tyrosine kinases to the Ras signaling
           pathway. SOS contains a histone domain, Dbl-homology
           (DH), a PH domain, Rem domain, Cdc25 domain, and a Grb2
           binding domain. The SOS PH domain binds to
           phosphatidylinositol-4,5-bisphosphate (PIP2) and
           phosphatidic acid (PA). SOS is dependent on Ras binding
           to the allosteric site via its histone domain for both a
           lower level of activity (Ras GDP) and maximal activity
           (Ras GTP). The DH domain blocks the allosteric Ras
           binding site in SOS. The PH domain is closely associated
           with the DH domain and the action of the DH-PH unit
           gates a reciprocal interaction between Ras and SOS. The
           C-terminal proline-rich domain of SOS binds to the
           adapter protein Grb2 which localizes the Sos protein to
           the plasma membrane and diminishes the negative effect
           of the C-terminal domain on the guanine nucleotide
           exchange activity of the CDC25-homology domain of SOS.
           PH domains have diverse functions, but in general are
           involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 109

 Score =  160 bits (406), Expect = 4e-46
 Identities = 65/117 (55%), Positives = 76/117 (64%), Gaps = 8/117 (6%)

Query: 108 QCCNEFFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEY 167
           QCCNEF  E TL K+  G    STER  FLFDGLLLLCK N   T+   + P      EY
Sbjct: 1   QCCNEFIMEGTLTKV--GAGKRSTERHLFLFDGLLLLCKSNQSSTASGQSKP------EY 52

Query: 168 RLKERFFIRRIEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADLIMLN 224
           RLKE+ FIR++EI DL D+D+ KNAF+IVPR     IL A S E+KNNWMA L+ L 
Sbjct: 53  RLKEKIFIRKVEINDLEDTDELKNAFEIVPRDEPSYILFAKSAEEKNNWMAALVTLQ 109


>gnl|CDD|214571 smart00229, RasGEFN, Guanine nucleotide exchange factor for
           Ras-like GTPases; N-terminal motif.  A subset of guanine
           nucleotide exchange factor for Ras-like small GTPases
           appear to possess this domain N-terminal to the RasGef
           (Cdc25-like) domain. The recent crystal structureof Sos
           shows that this domain is alpha-helical and plays a
           "purely structural role" (Nature 394, 337-343).
          Length = 127

 Score =  128 bits (325), Expect = 7e-35
 Identities = 52/132 (39%), Positives = 72/132 (54%), Gaps = 7/132 (5%)

Query: 275 GVPLIKGATLYKLVERLTYHIY-ADPAFVRTFLTTYRSFCTPLNFLQLLIERFSIPEFQG 333
              LIKG TL  L+E LT     ADP+FV TFL TYRSF T    LQLL+ R++    + 
Sbjct: 1   DGGLIKGGTLEALIEHLTEAFDKADPSFVETFLLTYRSFITTQELLQLLLYRYNAIPPES 60

Query: 334 TVREDLKKFRKEYVQPVQFRVLNVLRHWVDHHFYDFEQDAHLLEQLKLFLETINGKSMRK 393
            V        K   + V+ RVLN+LR WV++++ DFE D  L+  L  FLE ++ +    
Sbjct: 61  WV------EEKVNPRRVKNRVLNILRTWVENYWEDFEDDPKLISFLLEFLELVDDEKYPG 114

Query: 394 WVDSVSKVIFRK 405
            V S+  ++ R 
Sbjct: 115 LVTSLLNLLRRL 126


>gnl|CDD|100121 cd06224, REM, Guanine nucleotide exchange factor for Ras-like
           GTPases; N-terminal domain (RasGef_N), also called REM
           domain (Ras exchanger motif). This domain is common in
           nucleotide exchange factors for Ras-like small GTPases
           and is typically found immediately N-terminal to the
           RasGef (Cdc25-like) domain. REM contacts the GTPase and
           is assumed to participate in the catalytic activity of
           the exchange factor. Proteins with the REM domain
           include Sos1 and Sos2, which relay signals from
           tyrosine-kinase mediated signalling to Ras, RasGRP1-4,
           RasGRF1,2, CNrasGEF, and RAP-specific nucleotide
           exchange factors, to name a few.
          Length = 122

 Score =  117 bits (294), Expect = 7e-31
 Identities = 49/123 (39%), Positives = 71/123 (57%), Gaps = 6/123 (4%)

Query: 283 TLYKLVERLTYH-IYADPAFVRTFLTTYRSFCTPLNFLQLLIERFSIPEFQGTVREDLKK 341
           TL  L+E LT      DP+FV TFL TYRSF TP   L+ LIER+ I   +     +   
Sbjct: 1   TLEALIEHLTSTFDMPDPSFVSTFLLTYRSFTTPTELLEKLIERYEIAPPENL---EYND 57

Query: 342 FRKEYVQPVQFRVLNVLRHWVDHHFYDFEQDAHLLEQLKLFLETIN--GKSMRKWVDSVS 399
           + K+  +P++ RVLNVLR WV+++ YDF  D  LLE L+ FL  +   G  +++    + 
Sbjct: 58  WDKKKSKPIRLRVLNVLRTWVENYPYDFFDDEELLELLEEFLNRLVQEGALLQELKKLLR 117

Query: 400 KVI 402
           K++
Sbjct: 118 KLL 120


>gnl|CDD|216026 pfam00618, RasGEF_N, RasGEF N-terminal motif.  A subset of guanine
           nucleotide exchange factor for Ras-like small GTPases
           appear to possess this motif/domain N-terminal to the
           RasGef (Cdc25-like) domain.
          Length = 94

 Score = 99.4 bits (248), Expect = 6e-25
 Identities = 44/105 (41%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 279 IKGATLYKLVERLTYH-IYADPAFVRTFLTTYRSFCTPLNFLQLLIERFSIPEFQGTVRE 337
           +KG TL  LVE LT      D +F+ TFL TYRSF TP   L+LL+ERF  P    + +E
Sbjct: 2   VKGGTLEALVEHLTSTDDMLDDSFLETFLLTYRSFTTPGELLELLLERFLSPPLGLSYQE 61

Query: 338 DLKKFRKEYVQPVQFRVLNVLRHWVDHHFYDFEQDAHLLEQLKLF 382
                       ++ RVLNVLR W++++  DF  D  LL  L+ F
Sbjct: 62  ------------IRIRVLNVLRKWIENYNSDFVDDPELLSLLEEF 94


>gnl|CDD|214574 smart00233, PH, Pleckstrin homology domain.  Domain commonly found
           in eukaryotic signalling proteins. The domain family
           possesses multiple functions including the abilities to
           bind inositol phosphates, and various proteins. PH
           domains have been found to possess inserted domains
           (such as in PLC gamma, syntrophins) and to be inserted
           within other domains. Mutations in Brutons tyrosine
           kinase (Btk) within its PH domain cause X-linked
           agammaglobulinaemia (XLA) in patients. Point mutations
           cluster into the positively charged end of the molecule
           around the predicted binding site for
           phosphatidylinositol lipids.
          Length = 102

 Score = 59.1 bits (143), Expect = 1e-10
 Identities = 25/110 (22%), Positives = 41/110 (37%), Gaps = 15/110 (13%)

Query: 113 FFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKER 172
             +E  L K + G K    +R   LF+  LL  K    + S             Y+ K  
Sbjct: 1   VIKEGWLYKKSGGGKKSWKKRYFVLFNSTLLYYKSKKDKKS-------------YKPKGS 47

Query: 173 FFIRRIEILDLPDSD--DYKNAFQIVPRCGSPVILIANSPEDKNNWMADL 220
             +    + + PD D     + F+I       ++L A S E++  W+  L
Sbjct: 48  IDLSGCTVREAPDPDSSKKPHCFEIKTSDRKTLLLQAESEEEREKWVEAL 97


>gnl|CDD|241258 cd01224, PH_Collybistin_ASEF, Collybistin/APC-stimulated guanine
           nucleotide exchange factor pleckstrin homology (PH)
           domain.  Collybistin (also called PEM2) is homologous to
           the Dbl proteins ASEF (also called ARHGEF4/RhoGEF4) and
           SPATA13 (Spermatogenesis-associated protein 13; also
           called ASEF2). It activates CDC42 specifically and not
           any other Rho-family GTPases. Collybistin consists of an
           SH3 domain, followed by a RhoGEF/DH and PH domain. In
           Dbl proteins, the DH and PH domains catalyze the
           exchange of GDP for GTP in Rho GTPases, allowing them to
           signal to downstream effectors. It induces submembrane
           clustering of the receptor-associated peripheral
           membrane protein gephyrin, which is thought to form a
           scaffold underneath the postsynaptic membrane linking
           receptors to the cytoskeleton. It also acts as a tumor
           suppressor that links adenomatous polyposis coli (APC)
           protein, a negative regulator of the Wnt signaling
           pathway and promotes the phosphorylation and degradation
           of beta-catenin, to Cdc42. Autoinhibition of collybistin
           is accomplished by the binding of its SH3 domain with
           both the RhoGEF and PH domains to block access of Cdc42
           to the GTPase-binding site. Inactivation promotes cancer
           progression. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 139

 Score = 57.6 bits (140), Expect = 9e-10
 Identities = 36/137 (26%), Positives = 56/137 (40%), Gaps = 21/137 (15%)

Query: 88  IEKTNELSKTVDGWTSVDIGQCCNEFFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKP 147
           +EK      +++GW   D+    +E      L K++   K +  ER  FLFD  L+ CK 
Sbjct: 4   LEKLAAWQSSIEGWEGEDLLDRSSELIHSGELSKISSKGKAQ--ERTFFLFDHQLVYCKK 61

Query: 148 NSKRTSVSVTAPLGGNQGEYRLKERFFIRRIEILDLPD-SDDY-----KNAFQIVPRCGS 201
           +  +                  K R  +  +EI DL D  D       KNA++I     +
Sbjct: 62  DLLKRDN------------LIYKGRIDLDNMEIEDLEDGKDKDSGVTVKNAWKIYNTSKN 109

Query: 202 PV-ILIANSPEDKNNWM 217
              +L A S E+K  W+
Sbjct: 110 KWYLLCAKSAEEKQRWL 126


>gnl|CDD|215766 pfam00169, PH, PH domain.  PH stands for pleckstrin homology.
          Length = 101

 Score = 51.7 bits (124), Expect = 3e-08
 Identities = 28/112 (25%), Positives = 44/112 (39%), Gaps = 16/112 (14%)

Query: 114 FREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERF 173
            +E  LLK   G +    +R   LFDG+LL  K + K                 R K   
Sbjct: 2   IKEGWLLKKGSGGRKSWKKRYFVLFDGVLLYYKDSKKS--------------SSRPKGSI 47

Query: 174 FIRRIEILDLPDSD--DYKNAFQIVPRCGSPVILIANSPEDKNNWMADLIML 223
            +   ++  +PDS+    KN F+I        +L A S E++  W+  +   
Sbjct: 48  PLSGCQVTKVPDSEDGKRKNCFEIRTGDRETFLLQAESEEERKEWVKAIRSA 99


>gnl|CDD|241540 cd13389, PH1_FGD5_FGD6, FYVE, RhoGEF and PH domain
           containing/faciogenital dysplasia proteins 5 and 6
           Pleckstrin Homology (PH) domain.  FGD5 regulates
           promotes angiogenesis of vascular endothelial growth
           factor (VEGF) in vascular endothelial cells, including
           network formation, permeability, directional movement,
           and proliferation. The specific function of FGD6 is
           unknown. In general, FGDs have a RhoGEF (DH) domain,
           followed by a PH domain, a FYVE domain and a C-terminal
           PH domain. All FGDs are guanine nucleotide exchange
           factors that activate the Rho GTPase Cdc42, an important
           regulator of membrane trafficking. The RhoGEF domain is
           responsible for GEF catalytic activity, while the PH
           domain is involved in intracellular targeting of the DH
           domain. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 125

 Score = 51.5 bits (124), Expect = 9e-08
 Identities = 30/109 (27%), Positives = 51/109 (46%), Gaps = 17/109 (15%)

Query: 112 EFFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKE 171
              +E  L+K++  K  E   R  FLF+ +LL               P+    G+Y+LK 
Sbjct: 13  VLIKEGELMKVS-RK--EMQPRYLFLFNDVLLYT------------TPVQSGVGKYKLKN 57

Query: 172 RFFIRRIEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADL 220
              +  +++  LP+ ++Y N FQI     S   L A+S E+++ W+  L
Sbjct: 58  ELPLSGMKVS-LPEDEEYSNEFQIESTKRS-FTLSASSAEERDEWVKAL 104


>gnl|CDD|241231 cd00821, PH, Pleckstrin homology (PH) domain.  PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 92

 Score = 45.4 bits (107), Expect = 5e-06
 Identities = 23/105 (21%), Positives = 36/105 (34%), Gaps = 15/105 (14%)

Query: 115 REDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFF 174
           +E  L K          +R   LFD +LL  K     +               + K    
Sbjct: 1   KEGWLKKRGGKGLKSWKKRWFVLFDDVLLYYKSKKDSS--------------KKPKGLIP 46

Query: 175 IRRIEILDLPDSDDYKNAFQIV-PRCGSPVILIANSPEDKNNWMA 218
           +     ++L  S    N F++V P  G    L A S E++  W+ 
Sbjct: 47  LSDGLEVELVSSSGKPNCFELVTPDRGRTYYLQAESEEEREEWLE 91


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 49.0 bits (116), Expect = 1e-05
 Identities = 45/194 (23%), Positives = 73/194 (37%), Gaps = 20/194 (10%)

Query: 720 TVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTA--FS 777
           T    TG  +S L E D +P +  T  T N+     +P        + SP +  T+   +
Sbjct: 470 TSGTPTGTTSSTLPE-DTSPTSRTTSATPNAT----SPTPAVTTPNATSPTTQKTSDTPN 524

Query: 778 MFSTQSVQSLASSTITLRPGSSSGPEFS-----GEDSPGLMGSTFDGLYSPPPLPPRTNR 832
             S   +    ++T T  P  ++    +      E+SP +  +    + S P +      
Sbjct: 525 ATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESP-VNNTNTPVVTSAPSVLTSAVT 583

Query: 833 RREV-----STSDQSVHSRDISPPPLPPRRETRPSLSS--PPGYASSPDPSPSSPGRFHV 885
             +       TS Q          P      T P L+S  P G  +  + +PS P   HV
Sbjct: 584 TGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHV 643

Query: 886 SSPSPSKPSGSTSR 899
           S+ SP    G+TS+
Sbjct: 644 STLSPGPGPGTTSQ 657



 Score = 36.7 bits (84), Expect = 0.072
 Identities = 41/213 (19%), Positives = 68/213 (31%), Gaps = 23/213 (10%)

Query: 718 ALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFS 777
             +V   T P+ +  S  + T    +T   +     V             SP S      
Sbjct: 546 TTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTT----GQHGTGSSPTSQQPGIP 601

Query: 778 MFSTQSVQSLASSTITLRPGSSSGP---EFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRR 834
             S  + +S  +ST T    +S+ P   E   E++P +  +T     SP P P  T++  
Sbjct: 602 SSSHSTPRS--NSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVS 659

Query: 835 EVSTSDQS-----VHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPS 889
               S  S     VH  +  P          P+ +SP   +      P+       ++ +
Sbjct: 660 GPGNSSTSRYPGEVHVTEGMPN---------PNATSPSAPSGQKTAVPTVTSTGGKANST 710

Query: 890 PSKPSGSTSRLPLLPPKPFSFGKQVPMFQFNAP 922
             + SGST      P       +  P       
Sbjct: 711 TKETSGSTLMASTSPHTNEGAFRTTPYNATTYL 743



 Score = 35.1 bits (80), Expect = 0.21
 Identities = 45/227 (19%), Positives = 68/227 (29%), Gaps = 38/227 (16%)

Query: 707 SKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNAN-- 764
           S+   A  N  + T    T    SP ++         +P       T  A    +     
Sbjct: 490 SRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSV 549

Query: 765 --------GSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGST 816
                     +SP + +    + S  SV + A +T     GSS   +  G  S       
Sbjct: 550 PNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPR 609

Query: 817 FDGLYSPPPL----PPRTNRRREVS---TSDQSVHSRDISPPPLPPRRETRPSLSSPPGY 869
            +   + P L    P       E +    S   V +    P P                 
Sbjct: 610 SNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGP------------GTTSQ 657

Query: 870 ASSPDPSPSS--PGRFHVS-------SPSPSKPSGSTSRLPLLPPKP 907
            S P  S +S  PG  HV+       + SPS PSG  + +P +    
Sbjct: 658 VSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTG 704



 Score = 30.5 bits (68), Expect = 5.9
 Identities = 27/138 (19%), Positives = 44/138 (31%), Gaps = 18/138 (13%)

Query: 717 PALTVNKITGPKNSPLSESDGTPQ-TPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTA 775
              T ++++GP NS  S   G    T   P                NA    +P    TA
Sbjct: 651 GPGTTSQVSGPGNSSTSRYPGEVHVTEGMPN--------------PNATSPSAPSGQKTA 696

Query: 776 FSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPP-LPPRTNRRR 834
               ++   ++ +++  T   GS+     S   + G   +T     +  P       R R
Sbjct: 697 VPTVTSTGGKANSTTKET--SGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPR 754

Query: 835 EVSTSDQSVHSRDISPPP 852
              TS      +   P P
Sbjct: 755 WTFTSPPVTTKQATVPVP 772


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 47.9 bits (114), Expect = 3e-05
 Identities = 42/218 (19%), Positives = 60/218 (27%), Gaps = 13/218 (5%)

Query: 701 SPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTV--FAPI 758
            PG +A  N        +L+      P         G       PPT          AP 
Sbjct: 74  GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPD 133

Query: 759 MLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGS---SSGPEFSGEDSPGLMGS 815
           +        SPG    A    +  S  ++AS   + R  +   SS  E     +P    +
Sbjct: 134 LSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEE--TARAPSSPPA 191

Query: 816 TFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDP 875
                  P    PR  RR    ++  S      SP P P R     + +S    +SS   
Sbjct: 192 EPPPSTPPAAASPRPPRRSSPISASAS------SPAPAPGRSAADDAGASSSDSSSSESS 245

Query: 876 SPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQ 913
                       P P+  +  T           S    
Sbjct: 246 GCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPG 283



 Score = 46.7 bits (111), Expect = 6e-05
 Identities = 34/181 (18%), Positives = 53/181 (29%), Gaps = 12/181 (6%)

Query: 727 PKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQS 786
             +SP +E   +       P      +  +         + SP       +     +  S
Sbjct: 185 APSSPPAEPPPSTPPAAASPRPPRRSSPISA-------SASSPAPAPGRSAADD--AGAS 235

Query: 787 LASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSR 846
            + S+ +   G   GPE    + P    +                       +  S   R
Sbjct: 236 SSDSSSSESSGCGWGPE---NECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPR 292

Query: 847 DISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPK 906
           + SP P P    + P+ SSP   +SS     SS      SS S    + S    P   P 
Sbjct: 293 ERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPS 352

Query: 907 P 907
           P
Sbjct: 353 P 353



 Score = 42.1 bits (99), Expect = 0.002
 Identities = 32/217 (14%), Positives = 62/217 (28%), Gaps = 9/217 (4%)

Query: 691 PRRWPNLSLKSPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANS 750
           PRR   +S  +     +    A ++  A + +  +    S         + PL  P   +
Sbjct: 207 PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSE--SSGCGWGPENECPLPRPAPIT 264

Query: 751 EFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSP 810
             T        N   S+   + S++     + S    +  +        +    S     
Sbjct: 265 LPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRES 324

Query: 811 GLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYA 870
               ++     S               +  +S       PP  P     RP  S  P   
Sbjct: 325 SSSSTSSSSESSRGAAVSPG------PSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSP 378

Query: 871 SSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
           ++    P+   R   +    ++   +T R P   P+P
Sbjct: 379 AASAGRPTRR-RARAAVAGRARRRDATGRFPAGRPRP 414



 Score = 41.7 bits (98), Expect = 0.002
 Identities = 30/142 (21%), Positives = 50/142 (35%), Gaps = 11/142 (7%)

Query: 764 NGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSP 823
           +   S  + S+  +  S+ S +  +SS+ +    SS G   S   SP    S      S 
Sbjct: 301 SSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSP-----SR 355

Query: 824 PPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRF 883
           PP P   +  R+     ++  S   S    P RR  R +++       +    P+   R 
Sbjct: 356 PPPPADPSSPRKRPRPSRAPSSPAASAGR-PTRRRARAAVAGRARRRDATGRFPAGRPR- 413

Query: 884 HVSSPSPSKPSGSTSRLPLLPP 905
               PSP     ++       P
Sbjct: 414 ----PSPLDAGAASGAFYARYP 431



 Score = 38.6 bits (90), Expect = 0.022
 Identities = 30/174 (17%), Positives = 49/174 (28%), Gaps = 20/174 (11%)

Query: 756 APIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDS------ 809
            P  L+ A     P               ++       LR    +    S   S      
Sbjct: 764 VPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRS 823

Query: 810 --PGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQS------VHSRDISPPPLPPRRETRP 861
             P     +      P         R   S+  +          ++    P PP    RP
Sbjct: 824 HTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARP 883

Query: 862 SLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
             ++PP  A++  P+ +   R     P+P    G    +P   P P    ++VP
Sbjct: 884 GAAAPPKAAAAAPPAGAPAPRPR---PAPRVKLG---PMPPGGPDPRGGFRRVP 931



 Score = 38.2 bits (89), Expect = 0.025
 Identities = 33/159 (20%), Positives = 52/159 (32%), Gaps = 11/159 (6%)

Query: 725 TGPKNSPLSESDGTPQ--TPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQ 782
               +S   E   +P   +P + P  +S          S+++ S+   S ST+ S  S++
Sbjct: 284 PASSSSSPRERSPSPSPSSPGSGPAPSSPRAS------SSSSSSRESSSSSTSSSSESSR 337

Query: 783 SVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQS 842
                   + +  P  S  P  +   SP           SP     R  RRR  +     
Sbjct: 338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGR 397

Query: 843 VHSRDISPP---PLPPRRETRPSLSSPPGYASSPDPSPS 878
              RD +       P         +S   YA  P  +PS
Sbjct: 398 ARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPS 436



 Score = 36.3 bits (84), Expect = 0.099
 Identities = 22/118 (18%), Positives = 29/118 (24%), Gaps = 8/118 (6%)

Query: 795 RPGSSSGPEFSGEDSPGLMG----STFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISP 850
             G              + G      F+    PPP P       E  +   +     ++P
Sbjct: 39  SQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRS-TPTWSLSTLAP 97

Query: 851 -PPLPPRRETRPSLSSPPGYASSPDP--SPSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
             P      T P  SSP     +P P   P SP         P    G          
Sbjct: 98  ASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAA 155



 Score = 31.7 bits (72), Expect = 2.3
 Identities = 22/139 (15%), Positives = 37/139 (26%), Gaps = 10/139 (7%)

Query: 788 ASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVS----TSDQSV 843
           A              E +      L  S      +      R +R         +S  + 
Sbjct: 778 AEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPAR 837

Query: 844 HSRDISPPPLPPRRETRPSLSSPPGYASSPD-----PSPSSPGRFH-VSSPSPSKPSGST 897
                + PP     E+  S  +  G  +        P P  P      ++P  +  +   
Sbjct: 838 PPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPP 897

Query: 898 SRLPLLPPKPFSFGKQVPM 916
           +  P   P+P    K  PM
Sbjct: 898 AGAPAPRPRPAPRVKLGPM 916



 Score = 30.9 bits (70), Expect = 4.0
 Identities = 22/117 (18%), Positives = 31/117 (26%), Gaps = 8/117 (6%)

Query: 783 SVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQS 842
           +V  +A +    R    +GP            S     +S   L P +  R    T    
Sbjct: 52  AVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGP 111

Query: 843 VHSRDISPPPLPPRRETRPSL--SSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGST 897
                 S P  PP      S   S  P  +    P  S       S P+      + 
Sbjct: 112 ------SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAV 162


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 44.2 bits (104), Expect = 5e-04
 Identities = 27/112 (24%), Positives = 36/112 (32%), Gaps = 2/112 (1%)

Query: 796  PGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPP 855
            P  S  P  +  D                P P R +R R          ++  SPP  P 
Sbjct: 2627 PPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARR--LGRAAQASSPPQRPR 2684

Query: 856  RRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
            RR  RP++ S    A  P P P+     H    +   P G  +     P  P
Sbjct: 2685 RRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALP 2736



 Score = 42.6 bits (100), Expect = 0.001
 Identities = 45/224 (20%), Positives = 60/224 (26%), Gaps = 28/224 (12%)

Query: 695  PNLSLKSPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTV 754
            P      P         AR    A        P  +P +     P   LT P   S  + 
Sbjct: 2741 PPAVPAGPATPGGPARPARPPTTA------GPPAPAPPAAPAAGPPRRLTRPAVAS-LSE 2793

Query: 755  FAPIMLSNANGSQSPGSVSTAFSMF-STQSVQSLASSTITLRPGSSSGPEFSGEDSPGLM 813
                + S  + +  P +V    +      S         + +P +   P      S  L 
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG 2853

Query: 814  GSTFDG--------LYSPPPLP-----PRTNR--RREVSTSDQSVHSRDISPPPLP-PRR 857
            GS   G          SP   P     P   R  R  VS S +S       P   P P+ 
Sbjct: 2854 GSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQA 2913

Query: 858  ETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLP 901
               P     P     P P P  P R       P  P+   +   
Sbjct: 2914 PPPPQPQPQPPPPPQPQPPPPPPPR----PQPPLAPTTDPAGAG 2953



 Score = 40.7 bits (95), Expect = 0.005
 Identities = 25/92 (27%), Positives = 31/92 (33%), Gaps = 11/92 (11%)

Query: 823  PPPLPPRTNRRREVSTSDQSVHSRDISPPPLPP---RRETRPSLSSPPGYASSP-DPSPS 878
            PPPLPP        +  D+SV     +P P  P    R  RP          +P D    
Sbjct: 2552 PPPLPPAA----PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGD 2607

Query: 879  SPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSF 910
              G    + PSP  P       P   P P + 
Sbjct: 2608 PRGP---APPSPLPPDTHAPDPPPPSPSPAAN 2636



 Score = 39.5 bits (92), Expect = 0.010
 Identities = 32/177 (18%), Positives = 46/177 (25%), Gaps = 13/177 (7%)

Query: 738  TPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPG 797
             P  P  P           P           P + +   +     +  +     +T    
Sbjct: 2735 LPAAPAPPAVPA------GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV 2788

Query: 798  SSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRR 857
            +S         SP         + +P    P             S       PPP PP  
Sbjct: 2789 ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848

Query: 858  ETRPSLSSPPG-------YASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
                  S  PG        + SP   P++P R  V   +    S ST    L P +P
Sbjct: 2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905



 Score = 38.8 bits (90), Expect = 0.017
 Identities = 24/84 (28%), Positives = 30/84 (35%), Gaps = 8/84 (9%)

Query: 824  PPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRF 883
            P  P  T+R R      QS   R        PR    P  S  P    +PDP P SP   
Sbjct: 2577 PSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPP--SPLPPDTHAPDPPPPSP--- 2631

Query: 884  HVSSPSPSKPSGSTSRLPLLPPKP 907
               SP+ ++P          P +P
Sbjct: 2632 ---SPAANEPDPHPPPTVPPPERP 2652



 Score = 38.4 bits (89), Expect = 0.026
 Identities = 23/95 (24%), Positives = 34/95 (35%), Gaps = 16/95 (16%)

Query: 822  SPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPG 881
             P   P                  +   PPP PPR +  P L+     A + +PS + P 
Sbjct: 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ--PPLAPTTDPAGAGEPSGAVPQ 2961

Query: 882  --------------RFHVSSPSPSKPSGSTSRLPL 902
                          RF V  P+PS+ + ++S  PL
Sbjct: 2962 PWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPL 2996



 Score = 36.8 bits (85), Expect = 0.062
 Identities = 40/209 (19%), Positives = 60/209 (28%), Gaps = 41/209 (19%)

Query: 691  PRRWPNLSLKSPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANS 750
            P+R    + +      S   +A    P  T      P+ +P +    TP  P  P  A  
Sbjct: 2680 PQRPRRRAARPT--VGSLTSLADPPPPPPT------PEPAPHALVSATP-LPPGPAAARQ 2730

Query: 751  EF--TVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGED 808
                   AP   +   G  +PG  +              A    T  P + + P      
Sbjct: 2731 ASPALPAAPAPPAVPAGPATPGGPARP------------ARPPTTAGPPAPAPPA----- 2773

Query: 809  SPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPG 868
                         +P   PPR   R  V++  +S  S      P  P        ++ P 
Sbjct: 2774 -------------APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820

Query: 869  YASSPDPSPSSPGRFHVSSPSPSKPSGST 897
             AS   P P        + P P  P   +
Sbjct: 2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849



 Score = 33.8 bits (77), Expect = 0.68
 Identities = 28/125 (22%), Positives = 41/125 (32%), Gaps = 16/125 (12%)

Query: 795  RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNR-------RREVSTSDQSVHSRD 847
            RP +   P+ +   +P        G   P PLPP T+            +          
Sbjct: 2588 RPDAP--PQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645

Query: 848  ISPPPLP-----PRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPL 902
            + PP  P     P R +RP  +   G A+     P  P R   ++        S +  P 
Sbjct: 2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRR--RAARPTVGSLTSLADPPP 2703

Query: 903  LPPKP 907
             PP P
Sbjct: 2704 PPPTP 2708



 Score = 33.0 bits (75), Expect = 1.0
 Identities = 22/87 (25%), Positives = 29/87 (33%), Gaps = 2/87 (2%)

Query: 822  SPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPG 881
            SPP  P R   R  V +           P P P       +   PPG A++   SP+ P 
Sbjct: 2678 SPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPA 2737

Query: 882  RFHVSSPSPSKPSG-STSRLPLLPPKP 907
                    P+ P+       P  PP  
Sbjct: 2738 A-PAPPAVPAGPATPGGPARPARPPTT 2763



 Score = 32.2 bits (73), Expect = 1.7
 Identities = 22/92 (23%), Positives = 32/92 (34%), Gaps = 8/92 (8%)

Query: 819 GLYSPPPLPPRTNRRREVSTSDQSV---HSRDISPPPLPPRRETRPSLSSPPGYASSPDP 875
           G + P      T +RR    +          D    P  P   + P+ +  P  AS+P P
Sbjct: 371 GRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAP-P 429

Query: 876 SPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
            P++P    + S  P    G        PP P
Sbjct: 430 PPATP----LPSAEPGSDDGPAPPPERQPPAP 457



 Score = 31.4 bits (71), Expect = 3.0
 Identities = 40/168 (23%), Positives = 58/168 (34%), Gaps = 15/168 (8%)

Query: 727  PKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQ--SV 784
            P+  P       PQ PL P T  +     +  +     G+  PG V+           S 
Sbjct: 2927 PQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSR 2986

Query: 785  QSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVH 844
            ++ ASST  L   S S    S   S  L   T      PPP+    + ++ +   D +  
Sbjct: 2987 EAPASSTPPLTGHSLSRVS-SWASSLALHEET-----DPPPV----SLKQTLWPPDDTED 3036

Query: 845  SRDISPPPLPPRRETRPSLSSPPGYASSP---DPSPSSPGRFHVSSPS 889
            S   S       R    +L   P     P   +P P++P      SPS
Sbjct: 3037 SDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARESPS 3084



 Score = 31.1 bits (70), Expect = 4.2
 Identities = 21/86 (24%), Positives = 28/86 (32%), Gaps = 15/86 (17%)

Query: 832 RRREVSTSDQSV-------HSRDISPPPLPPRRETRPS---LSSPPGYASSPDPSPSSPG 881
           RRR   T   S+       H    +  P   RR  R +    +  PG      P+   P 
Sbjct: 354 RRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPA 413

Query: 882 RFHVSSPSPSKPSGSTSRLPLLPPKP 907
               S P+P+      S  P  P  P
Sbjct: 414 ----SVPTPAPTPVPASA-PPPPATP 434



 Score = 30.7 bits (69), Expect = 5.1
 Identities = 31/120 (25%), Positives = 40/120 (33%), Gaps = 16/120 (13%)

Query: 796  PGSSSGPEFSGEDSPGLMGSTFDGLYSPP---PLPPRT---NRRREVSTSDQSVHSRDIS 849
            PG    P+     +P  +      L   P   P+ PR     R  E   SD +       
Sbjct: 2498 PGGGGPPDPDAPPAPSRLAPAI--LPDEPVGEPVHPRMLTWIRGLEELASDDAGDP---- 2551

Query: 850  PPPLPPR-RETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPF 908
            PPPLPP      P  S PP     P P PS P     +    + P  +  R P+      
Sbjct: 2552 PPPLPPAAPPAAPDRSVPP---PRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDP 2608


>gnl|CDD|241253 cd01218, PH_Phafin2-like, Phafin2 (also called EAPF, FLJ13187,
           ZFYVE18 or PLEKHF2) Pleckstrin Homology (PH) domain.
           Phafin2 is differentially expressed in the liver cancer
           cell and regulates the structure and function of the
           endosomes through Rab5-dependent processes. Phafin2
           modulates the cell's response to extracellular
           stimulation by modulating the receptor density on the
           cell surface. Phafin2 contains a PH domain and a FYVE
           domain. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 123

 Score = 40.3 bits (95), Expect = 6e-04
 Identities = 14/43 (32%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 178 IEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADL 220
           ++I DL D+ + KN +QI+    S V+  A +  +K+ WM  +
Sbjct: 79  VKIEDLEDTGELKNGWQIISPKKSFVV-YAATATEKSEWMDHI 120


>gnl|CDD|241263 cd01229, PH_Ect2, Epithelial cell transforming 2 (Ect2) pleckstrin
           homology (PH) domain.  Ect2, a mammalian ortholog of
           Drosophila pebble, plays a role in neuronal
           differentiation and brain development. Pebble and Ect2
           have been identified as Rho-family guanine nucleotide
           exchange factors (GEF) that mediate activation of Rho
           during cytokinesis, but are proposed to play slightly
           different roles. PH domains have diverse functions, but
           in general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 180

 Score = 39.9 bits (94), Expect = 0.002
 Identities = 30/128 (23%), Positives = 55/128 (42%), Gaps = 27/128 (21%)

Query: 120 LKLNIGKKTESTERVC-FLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKE-RFFIRR 177
           ++L    K+   + +  FLF  LL +CK   KR+SV  TA          L+E +   + 
Sbjct: 49  VELGDSLKSGRGDSLALFLFSDLLEICK---KRSSVKGTAKSPRGSTGSGLRESKKKYKH 105

Query: 178 IE---------ILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADLIMLN---T 225
           I+         ++D+ +++D +N F ++ R          SP +    +    +L+    
Sbjct: 106 IKLMPLSNIKRVIDITETEDCQNVFALLFR----------SPTELKEKLYSFQILDEETD 155

Query: 226 KSMFERTL 233
           K  F +TL
Sbjct: 156 KEKFLKTL 163


>gnl|CDD|241539 cd13388, PH1_FGD4, FYVE, RhoGEF and PH domain
           containing/faciogenital dysplasia protein 4 pleckstrin
           homology (PH), N-terminal domain.  In general, FGDs have
           a RhoGEF (DH) domain, followed by an N-terminal PH
           domain, a FYVE domain and a C-terminal PH domain. All
           FGDs are guanine nucleotide exchange factors that
           activates the Rho GTPase Cdc42, an important regulator
           of membrane trafficking. The RhoGEF domain is
           responsible for GEF catalytic activity, while the
           N-terminal PH domain is involved in intracellular
           targeting of the DH domain. FGD4 is one of the genes
           associated with Charcot-Marie-Tooth neuropathy type 4
           (CMT4), a group of progressive motor and sensory axonal
           and demyelinating neuropathies that are distinguished
           from other forms of CMT by autosomal recessive
           inheritance. Those affected have distal muscle weakness
           and atrophy associated with sensory loss and,
           frequently, pes cavus foot deformity. PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 92

 Score = 38.2 bits (89), Expect = 0.002
 Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 16/100 (16%)

Query: 120 LKLNIGKKTEST-ERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFFIRRI 178
             L I  ++  T ER  FLF+ LLLLC   S R              +Y+++  F I  +
Sbjct: 5   KLLKISSRSGDTQERHLFLFNDLLLLC---SPRLIP--------VGSKYKVRALFGISGM 53

Query: 179 EILDLPDSDDYKNAFQIVPRCGSPVI-LIANSPEDKNNWM 217
           ++L+  D+ + +N F +  R     + L   + ++K  W+
Sbjct: 54  QVLE-GDNLETENTFYV--RGKQRCVELCTGTQKEKEEWI 90


>gnl|CDD|217512 pfam03359, GKAP, Guanylate-kinase-associated protein (GKAP)
           protein. 
          Length = 342

 Score = 39.8 bits (93), Expect = 0.006
 Identities = 32/150 (21%), Positives = 44/150 (29%), Gaps = 13/150 (8%)

Query: 727 PKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQS 786
            K  P S      ++          F  F  + +   +            S   T +VQ 
Sbjct: 8   KKRKPSSIGISDDESATDSEAEPKGFREFQSVGVQVEDEKLQRRRF--KRSSSVTANVQ- 64

Query: 787 LASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTN----RRREVSTSDQS 842
            A   +   PG       S ED     G +F    S    PP ++      R VST  QS
Sbjct: 65  -ADLELEGFPGLPVVRHVSTEDKALQFGPSFQRESSEDSPPPSSSTYSAGTRTVSTQGQS 123

Query: 843 VHSRDISPPPLPPRRET-----RPSLSSPP 867
            +  D    P     E+       S   PP
Sbjct: 124 AYLSDPKRRPSSEASESETVAFDESDLPPP 153


>gnl|CDD|241482 cd13328, PH1_FDG_family, FYVE, RhoGEF and PH domain
           containing/faciogenital dysplasia family proteins
           pleckstrin homology (PH), N-terminal domain.  In
           general, FGDs have a RhoGEF (DH) domain, followed by an
           N-terminal PH domain, a FYVE domain and a C-terminal PH
           domain. All FGDs are guanine nucleotide exchange factors
           that activates the Rho GTPase Cdc42, an important
           regulator of membrane trafficking. The RhoGEF domain is
           responsible for GEF catalytic activity, while the
           N-terminal PH domain is involved in intracellular
           targeting of the DH domain. Mutations in the FGD1 gene
           are responsible for the X-linked disorder known as
           faciogenital dysplasia (FGDY). PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 91

 Score = 36.7 bits (85), Expect = 0.006
 Identities = 27/102 (26%), Positives = 46/102 (45%), Gaps = 14/102 (13%)

Query: 115 REDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFF 174
           +E  +LKL   K   S ER  FLF+ +LL C P            L     +Y ++ R  
Sbjct: 1   KEGHILKL-SAKNGTSQERHLFLFNDMLLYCVPK-----------LRLIGQKYGVRARMD 48

Query: 175 IRRIEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNW 216
           +  + +  +   D+  + F I  +  S + L A++ E+K+ W
Sbjct: 49  VDGMNVQVVKG-DEVPHTFYIEGKQKS-LELQASTAEEKDAW 88


>gnl|CDD|241397 cd13243, PH_PLEKHG1_G2_G3, Pleckstrin homology domain-containing
           family G members 1, 2, and 3 pleckstrin homology (PH)
           domain.  PLEKHG1 (also called ARHGEF41), PLEKHG2 (also
           called ARHGEF42 or CLG/common-site lymphoma/leukemia
           guanine nucleotide exchange factor2), and PLEKHG3 (also
           called ARHGEF43) have RhoGEF DH/double-homology domains
           in tandem with a PH domain which is involved in
           phospholipid binding. They function as a guanine
           nucleotide exchange factor (GEF) and are involved in the
           regulation of Rho protein signal transduction. Mutations
           in PLEKHG1 have been associated panic disorder (PD), an
           anxiety disorder characterized by panic attacks and
           anticipatory anxiety. PH domains have diverse functions,
           but in general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 147

 Score = 38.1 bits (89), Expect = 0.006
 Identities = 36/154 (23%), Positives = 56/154 (36%), Gaps = 26/154 (16%)

Query: 74  TGIRAY-NRLKR-QQAIEKTNELSKTVDGWTSVDIGQCCNEFFREDTLLKLNIGKKTEST 131
           T +  + N +KR  +   +  E+   +DGW   D+     E   E +      G K E  
Sbjct: 12  TQVAWHINDMKRKHEHAVRLQEIQSLLDGWLGPDL-TTYGELVLEGSFRVQ--GAKNE-- 66

Query: 132 ERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFFIRRIEILDLPDSDDYKN 191
            R+ FLFD +LL+ K   KR                  K       + +++    D    
Sbjct: 67  -RLLFLFDKMLLITK---KRE-----------DDHLVYKTHIMCSNLMLVESIPKD--PL 109

Query: 192 AFQIVPRCGSPV--ILIANSPEDKNNWMADLIML 223
           +FQ++P         L A + E K  W   L  L
Sbjct: 110 SFQVLPFDNPKAQYTLQAKNLEQKRLWTQHLKRL 143


>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1.  Mating in
           fungi is controlled by the loci that determine the
           mating type of an individual, and only individuals with
           differing mating types can mate. Basidiomycete fungi
           have evolved a unique mating system, termed tetrapolar
           or bifactorial incompatibility, in which mating type is
           determined by two unlinked loci; compatibility at both
           loci is required for mating to occur. The multi-allelic
           tetrapolar mating system is considered to be a novel
           innovation that could have only evolved once, and is
           thus unique to the mushroom fungi. This domain is
           C-terminal to the homeodomain transcription factor
           region.
          Length = 418

 Score = 39.0 bits (91), Expect = 0.010
 Identities = 29/111 (26%), Positives = 40/111 (36%), Gaps = 13/111 (11%)

Query: 796 PGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLP-PRTNRRREVSTSDQSVHSRDISPPPLP 854
           P  SS    S   S          L   P L  P+T +RR  S+   S    +   P   
Sbjct: 78  PALSSERLLSPSPSV---------LDLSPVLASPQTGKRRRSSS--PSDDEDEAERPSKR 126

Query: 855 PRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
           PR ++  S SSP     +  PSP +       S + + P  + S  P   P
Sbjct: 127 PRSDSISSSSSPAKPPEACLPSP-AASTQDELSEASAAPLPTPSLSPPHTP 176



 Score = 39.0 bits (91), Expect = 0.012
 Identities = 27/115 (23%), Positives = 39/115 (33%), Gaps = 7/115 (6%)

Query: 786 SLASSTITLRPGSSSGPEFSGEDSPGLMG-STFDGLYSPPPLPPRTNRRREVSTSDQSVH 844
           S +SST  +   +SS  + S   +P   G S F       PL P  +    +     S  
Sbjct: 284 SSSSSTFAVPALTSSSVDQSA--TPLDQGFSNFGSNMYSEPLNPTND---SLLYGLPSSS 338

Query: 845 SRDISPPPLPPRRETRPSLSSPPGYASSPDPSP-SSPGRFHVSSPSPSKPSGSTS 898
           S   +    P    T  S        + P PSP +S      + P+   P    S
Sbjct: 339 SLYANRTIFPAWASTSVSPLDFSTLFNQPSPSPMASQSILAPAQPTSPSPVALPS 393



 Score = 32.8 bits (75), Expect = 0.89
 Identities = 35/189 (18%), Positives = 52/189 (27%), Gaps = 29/189 (15%)

Query: 730 SPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLAS 789
           SP      + +  L+P  +  + +              SP +     S  S+ S     +
Sbjct: 73  SPERSPALSSERLLSPSPSVLDLSPVL----------ASPQTGKRRRS--SSPSDDEDEA 120

Query: 790 STITLRPGSSSGPEFSGEDSPGLMG------STFDGLYSPPP-------LPPRTNRRREV 836
              + RP S S    S    P          ST D L            L P        
Sbjct: 121 ERPSKRPRSDSISSSSSPAKPPEACLPSPAASTQDELSEASAAPLPTPSLSPPHTPTDTA 180

Query: 837 STSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGS 896
            +  +     D    P P R +T    SS P   S P P  ++         + S     
Sbjct: 181 PSGKRKRRLSDGFQLPAPKRPQT----SSRPQTVSDPLPLHATTDWDTWFQATVSSSPSL 236

Query: 897 TSRLPLLPP 905
                + PP
Sbjct: 237 LLTGDIPPP 245


>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein.  This is is a family of
           Papillomavirus proteins, E4, coded for by ORF4. A splice
           variant, E1--E4, exists but neither the function of E4
           or E1--E4 is known.
          Length = 95

 Score = 35.7 bits (83), Expect = 0.016
 Identities = 15/39 (38%), Positives = 17/39 (43%), Gaps = 1/39 (2%)

Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRP 861
           PP  P +  RRR  S SD S  +   S PP P      P
Sbjct: 30  PPWAPKKRCRRRLESDSD-SSSTTSPSSPPTPQPTTECP 67


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 38.8 bits (91), Expect = 0.017
 Identities = 29/183 (15%), Positives = 46/183 (25%), Gaps = 35/183 (19%)

Query: 725 TGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSV 784
            GP     S        P  P                 A  + +P   + A +  S    
Sbjct: 598 EGPPAPASSGPPEEAARPAAPAAPA-------------APAAPAPAGAAAAPAEASAAPA 644

Query: 785 QSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVH 844
             +A+     +  +       G+  P   G          P  P          +     
Sbjct: 645 PGVAAPEHHPKHVAVPDASDGGDGWPAKAG-------GAAPAAP--------PPAPAPAA 689

Query: 845 SRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLP 904
               +    P +    P+ + P G A  P   P    +        S PS +      LP
Sbjct: 690 PAAPAGAA-PAQPAPAPAATPPAGQADDPAAQPPQAAQ------GASAPSPAADDPVPLP 742

Query: 905 PKP 907
           P+P
Sbjct: 743 PEP 745



 Score = 32.3 bits (74), Expect = 1.6
 Identities = 22/166 (13%), Positives = 38/166 (22%), Gaps = 19/166 (11%)

Query: 733 SESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTI 792
           +        P     A         +  ++  G   P     A       +    A +  
Sbjct: 634 AAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAP 693

Query: 793 TLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPP 852
                +   P  +     G          + PP   +                  +  PP
Sbjct: 694 AGAAPAQPAPAPAATPPAGQADDP----AAQPPQAAQGASAP------SPAADDPVPLPP 743

Query: 853 LPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTS 898
            P           PP  A +P   P  P     ++P+ + P    S
Sbjct: 744 EPD---------DPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 38.0 bits (88), Expect = 0.019
 Identities = 38/222 (17%), Positives = 62/222 (27%), Gaps = 36/222 (16%)

Query: 681 PRYAKLRKDIPRRWPNLSLKSPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQ 740
                    +P   P L      + A+     R      T    +   + P +       
Sbjct: 181 SADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPST 240

Query: 741 TPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTA-----FSMFSTQSVQ-SLASSTITL 794
           T   P    +      P   +   G   P + + A     + +  TQ +Q ++ +S I  
Sbjct: 241 TIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPEASRYELTVTQIIQIAIPASIIAC 300

Query: 795 -------------------------RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPR 829
                                    RP S   P  S  +    +    D L   PP  PR
Sbjct: 301 VFLGSCACCLHRRCRRRRRRPARIYRPPSPVAPSISAVNEAA-LARLGDELKRHPPESPR 359

Query: 830 TNRRREVSTSDQSVH--SRDISPPPLP--PRRETRPSLSSPP 867
            ++RR   T   S+   S +   P +    R   RP   +  
Sbjct: 360 RSKRRSSQTMVPSLTAISEESEAPAVVELSRSPRRPGGPTAR 401



 Score = 31.1 bits (70), Expect = 2.8
 Identities = 15/57 (26%), Positives = 21/57 (36%), Gaps = 5/57 (8%)

Query: 851 PPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
           P   PR    P  +  P   +SP PS + P      S + + P   T+      P P
Sbjct: 208 PRPTPRTTASPETTPTPSTTTSP-PSTTIPA----PSTTIAAPQAGTTPEAEGTPAP 259



 Score = 31.1 bits (70), Expect = 3.5
 Identities = 18/79 (22%), Positives = 28/79 (35%), Gaps = 3/79 (3%)

Query: 824 PPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRF 883
           P   PRT    E + +  +  S   +  P P      P   + P    +P P     G  
Sbjct: 208 PRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGE- 266

Query: 884 HVSSPSPSKPSGSTSRLPL 902
             + P+ + P+   SR  L
Sbjct: 267 --APPANATPAPEASRYEL 283


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 37.3 bits (86), Expect = 0.038
 Identities = 36/160 (22%), Positives = 55/160 (34%), Gaps = 18/160 (11%)

Query: 781 TQSVQSLASSTITLRPGSSSGPEFSGEDSPG----LMGSTFDGLYSPPPLPPRTNRRREV 836
           TQ  Q+ A++   L      G + +   S          +  G    P  PP T  + + 
Sbjct: 201 TQGHQT-ATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQS 259

Query: 837 STSDQSVHSRDISPPPLPPRRETRPSLSSPP-------GYASSPDPS---PSSPGRFHVS 886
           +T D   H++    PP    R +  S ++PP           +P P+    S     H S
Sbjct: 260 TTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSS 319

Query: 887 SPSPSK-PSGSTSRL--PLLPPKPFSFGKQVPMFQFNAPG 923
            P     P+         L PPKP S    V ++    P 
Sbjct: 320 PPGVQANPTTQNLVDCKELDPPKPNSICYGVGIYNEALPR 359


>gnl|CDD|241308 cd10572, PH_RhoGEF3_XPLN, Rho guanine nucleotide exchange factor 3
           Pleckstrin homology (PH) domain.  RhoGEF3/XPLN, a Rho
           family GEF, preferentially stimulates guanine nucleotide
           exchange on RhoA and RhoB, but not RhoC, RhoG, Rac1, or
           Cdc42 in vitro. It also possesses transforming activity.
           RhoGEF3/XPLN contains a tandem Dbl homology and PH
           domain, but lacks homology with other known functional
           domains or motifs. It is expressed in the brain,
           skeletal muscle, heart, kidney, platelets, and
           macrophage and neuronal cell lines. PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 143

 Score = 35.4 bits (82), Expect = 0.040
 Identities = 30/133 (22%), Positives = 51/133 (38%), Gaps = 30/133 (22%)

Query: 104 VDIGQCCNEFFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGN 163
           +D  Q          LL     K    T+   FLF+ +L+L +P ++             
Sbjct: 15  LDEKQKDPLIDESRALLCHGELKNKRGTKLHVFLFEDVLVLTRPVTR-----------NE 63

Query: 164 QGEYRL-KERFFIRRIEILDLPD---------------SDDYKNAFQIVPRCGSP---VI 204
           Q  Y++ ++   ++ + + DLPD               +D  KN F++  + GS      
Sbjct: 64  QLCYQVYRQPIPVQDLVLEDLPDGEVRLGGSFRGAFSNNDKAKNFFRVSFKDGSKGQSHT 123

Query: 205 LIANSPEDKNNWM 217
           L AN   DK  W+
Sbjct: 124 LQANDEFDKQQWL 136


>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509).  This
           family consists of several uncharacterized viral
           proteins from the Marek's disease-like viruses. Members
           of this family are typically around 400 residues in
           length. The function of this family is unknown.
          Length = 377

 Score = 36.9 bits (85), Expect = 0.049
 Identities = 25/98 (25%), Positives = 33/98 (33%), Gaps = 20/98 (20%)

Query: 823 PPPLPPRTNRRRE------VSTSDQSV-HSRDISPPPLPPRRETRPSLSSPPGYASSPDP 875
           P P P   NR         V   D  V  +R  +P P  PR +T             PDP
Sbjct: 201 PQPGPSGQNRSPRTPTLSNVRVLDAPVATNRGEAPSP--PRTDT-----------LDPDP 247

Query: 876 SPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQ 913
           + + P R    +PSP   S           +P  +G  
Sbjct: 248 AIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPDPWGTN 285


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 34.3 bits (79), Expect = 0.061
 Identities = 34/118 (28%), Positives = 43/118 (36%), Gaps = 24/118 (20%)

Query: 790 STITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDIS 849
           ST+  R  S+SG  FS   SP       +  +   P P    R   V T           
Sbjct: 13  STLYTRTWSTSG--FSSCFSP------PESAHPDDPPPVGDPRPPVVDT----------- 53

Query: 850 PPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
           PPP+       P  S P    ++P P P  P       PSP  P     + PL PP+P
Sbjct: 54  PPPVSAVW-VLPPPSEP----AAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRP 106


>gnl|CDD|241271 cd01238, PH_Btk, Bruton's tyrosine kinase pleckstrin homology (PH)
           domain.  Btk is a member of the Tec family of
           cytoplasmic protein tyrosine kinases that includes BMX,
           IL2-inducible T-cell kinase (Itk) and Tec. Btk plays a
           role in the maturation of B cells. Tec proteins general
           have an N-terminal PH domain, followed by a Tek homology
           (TH) domain, a SH3 domain, a SH2 domain and a kinase
           domain. The Btk PH domain binds phosphatidylinositol
           3,4,5-trisphosphate and responds to signalling via
           phosphatidylinositol 3-kinase. The PH domain is also
           involved in membrane anchoring which is confirmed by the
           discovery of a mutation of a critical arginine residue
           in the BTK PH domain. This results in severe human
           immunodeficiency known as X-linked agammaglobulinemia
           (XLA) in humans and a related disorder is mice.PH
           domains have diverse functions, but in general are
           involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 140

 Score = 34.9 bits (81), Expect = 0.063
 Identities = 13/46 (28%), Positives = 23/46 (50%), Gaps = 1/46 (2%)

Query: 175 IRRIEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADL 220
           I+ +E +    S + K  FQ+V      + + A S ED++ W+  L
Sbjct: 54  IKCVEEVKDDASFERKYPFQVV-YDDYTLYVFAPSEEDRDEWILAL 98


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 36.2 bits (84), Expect = 0.068
 Identities = 19/115 (16%), Positives = 28/115 (24%), Gaps = 7/115 (6%)

Query: 795 RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLP 854
               +S P    + +              P  P                 S     P   
Sbjct: 166 TNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPE------DPSSPSDSSLPPAPSSF 219

Query: 855 PRRETRPSLSSPPGYASSPDP-SPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPF 908
                 PS  SP   +  P P +P  P    V   S +KP+  ++     P    
Sbjct: 220 QSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGI 274



 Score = 32.3 bits (74), Expect = 1.00
 Identities = 21/104 (20%), Positives = 29/104 (27%), Gaps = 1/104 (0%)

Query: 806 GEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSS 865
            E+      +T +   S P                 S        PP  P   +  SL  
Sbjct: 155 DEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPP 214

Query: 866 PPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFS 909
            P    S  P P SP      SP P   +     +  +PP   +
Sbjct: 215 APSSFQSDTP-PPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTA 257



 Score = 30.8 bits (70), Expect = 3.1
 Identities = 22/87 (25%), Positives = 26/87 (29%), Gaps = 11/87 (12%)

Query: 850 PPPLPPRRE------TRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGST-----S 898
           P P PP  E         + S         DP+ +SP     SSP               
Sbjct: 148 PNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSP 207

Query: 899 RLPLLPPKPFSFGKQVPMFQFNAPGND 925
               LPP P SF    P     +P N 
Sbjct: 208 SDSSLPPAPSSFQSDTPPPSPESPTNP 234



 Score = 30.8 bits (70), Expect = 3.4
 Identities = 20/88 (22%), Positives = 24/88 (27%), Gaps = 7/88 (7%)

Query: 835 EVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPS 894
             S S     S     P  P      P         SS  P+PSS        P   +  
Sbjct: 179 PASASPSDPPSSSPGVPSFPS-----PPEDPSSPSDSSLPPAPSSFQSDT--PPPSPESP 231

Query: 895 GSTSRLPLLPPKPFSFGKQVPMFQFNAP 922
            + S  P     P    +QVP      P
Sbjct: 232 TNPSPPPGPAAPPPPPVQQVPPLSTAKP 259


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 36.7 bits (85), Expect = 0.076
 Identities = 36/197 (18%), Positives = 53/197 (26%), Gaps = 34/197 (17%)

Query: 732 LSESDGTPQTPLTPP----TANSEFTVFAPIMLSNANGSQSPGSVSTAFSM--------- 778
             + DG  +  L P       N    V  P  L++   +     V  A +          
Sbjct: 561 TDDGDGNVREHLPPTREFAEGNVTVRVTRPTGLASGASNAEDPEVRRARTRGARALAQAR 620

Query: 779 -FSTQSVQSL-------ASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRT 830
            F   +V  +          T   RP  + G E  G   PG +        + P    R 
Sbjct: 621 TFGRATVGEMIISGFPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARG 680

Query: 831 NRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSP 890
           +  R  +      H+      P  PR           G    PD   ++      SS   
Sbjct: 681 DPVRPTA-----HHAA--LRAPQAPRPG--GPPGGGGGLPPPPDLPAAAGPAPCGSSLIA 731

Query: 891 SKPSGSTSRLPLLPPKP 907
           S     T+     PP  
Sbjct: 732 S----PTAPPEPEPPGA 744


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria and eukaryotes, and
           is typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 36.0 bits (83), Expect = 0.081
 Identities = 31/110 (28%), Positives = 40/110 (36%), Gaps = 13/110 (11%)

Query: 794 LRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNR----RREVSTSDQSV--HSRD 847
           L  G S G  F    +     S  D L   P    R NR    R   S S+ ++   +++
Sbjct: 32  LPSGLSRGNSFLSNRNSDAAPSGTDSLSGRPA--SRLNREPSSRPGSSHSEATIVRQAKE 89

Query: 848 ISPPPLPPRRET-----RPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSK 892
              P  PP         RPSL S P   S+   +     R     PSPSK
Sbjct: 90  GERPATPPEARPDEGFVRPSLPSHPRSRSASVSNSKDGDRPSDLPPSPSK 139



 Score = 32.5 bits (74), Expect = 1.1
 Identities = 34/144 (23%), Positives = 45/144 (31%), Gaps = 12/144 (8%)

Query: 768 SPGSVSTAFSMFST---QSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPP 824
           S   V+    M +       +S + S I   P S    E +  + P    S+ D   S  
Sbjct: 197 SFKEVTPVGLMRTPPPGSHSKSPSKSGIPDLPSSRD-SEKTKPEKPQQETSSMDTEKSSA 255

Query: 825 PLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFH 884
           P P  T   +    +        I       +        S    A    PS  SP    
Sbjct: 256 PKPRETLDPKSPEKAPP------IDTTEEELKSPEASPKESEEASARKRSPSLLSPSP-K 308

Query: 885 VSSPSPSKPSGSTSRLPLLP-PKP 907
             SP P    G + R PL P PKP
Sbjct: 309 AESPKPLASPGKSPRDPLSPRPKP 332


>gnl|CDD|220233 pfam09422, WTX, WTX protein.  The WTX protein is found to be
           inactivated in one third of Wilms tumours. The WTX
           protein is functionally uncharacterized.
          Length = 467

 Score = 35.3 bits (81), Expect = 0.15
 Identities = 16/58 (27%), Positives = 21/58 (36%), Gaps = 3/58 (5%)

Query: 850 PPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
             P   R E  P   +  G  S P  +  +PG   V    P+KP   +S      P P
Sbjct: 128 ETPRAAREENAPPQDADGGKVS-PASALEAPGTSCVECGDPAKPGSESSA--FEDPGP 182


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 35.1 bits (81), Expect = 0.17
 Identities = 23/96 (23%), Positives = 30/96 (31%), Gaps = 25/96 (26%)

Query: 808 DSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPP 867
            SPG++G        P PL P T              +R   P P   R  + P    P 
Sbjct: 236 HSPGILG--------PSPLHPHT--------------TRPSPPRPAFSRSPSSPLSPLPR 273

Query: 868 GYASS---PDPSPSSPGRFHVSSPSPSKPSGSTSRL 900
                   P+P      R H+  P+ S P      L
Sbjct: 274 PSTRRGLLPNPRLPRASRGHLPPPTSSAPPRPNGGL 309



 Score = 33.1 bits (76), Expect = 0.74
 Identities = 24/92 (26%), Positives = 30/92 (32%), Gaps = 4/92 (4%)

Query: 824 PPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRF 883
            P+PP   +   ++     V      P P PP        +S   +   P PS     R 
Sbjct: 370 LPVPPPKVQALPLTALAPLVRHSPSIPLPHPPSALPSHVGASSSKHHRLP-PSVLPGPRL 428

Query: 884 HVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
              SPSPS P   T R    PP         P
Sbjct: 429 SSPSPSPSLP---TRRPGTPPPPASPPTPSPP 457


>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
          Length = 445

 Score = 35.1 bits (81), Expect = 0.19
 Identities = 29/112 (25%), Positives = 34/112 (30%), Gaps = 18/112 (16%)

Query: 795 RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLP 854
             GSSS   +S +DSP L          PP LPP+   RR VS  ++             
Sbjct: 3   GRGSSSDSPYSSDDSPSL---------EPPELPPKKKGRRRVSPVEEEE--------EEE 45

Query: 855 PRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGS-TSRLPLLPP 905
                    S PP   S                 S  K S     R PL  P
Sbjct: 46  EAEVVAVGFSYPPVRISRGKDGKRPVRPLKEEKDSEKKASTEAAVRNPLSDP 97



 Score = 30.1 bits (68), Expect = 6.2
 Identities = 17/79 (21%), Positives = 25/79 (31%), Gaps = 1/79 (1%)

Query: 832 RRREVSTSD-QSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSP 890
            R   S S   S  S  + PP LPP+++ R  +S         +    + G  +      
Sbjct: 3   GRGSSSDSPYSSDDSPSLEPPELPPKKKGRRRVSPVEEEEEEEEAEVVAVGFSYPPVRIS 62

Query: 891 SKPSGSTSRLPLLPPKPFS 909
               G     PL   K   
Sbjct: 63  RGKDGKRPVRPLKEEKDSE 81


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 35.1 bits (80), Expect = 0.19
 Identities = 23/97 (23%), Positives = 39/97 (40%), Gaps = 2/97 (2%)

Query: 804 FSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSL 863
           F+G  +PG   ST  G  +       ++    V     S  +  +  PP  P +   PS 
Sbjct: 562 FAG--NPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPST 619

Query: 864 SSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRL 900
           S P  +  SP  +PSSP      + + +    S+ ++
Sbjct: 620 SPPASHLGSPSTTPSSPESSIKVASTETASPESSIKV 656



 Score = 30.4 bits (68), Expect = 5.5
 Identities = 18/55 (32%), Positives = 22/55 (40%), Gaps = 5/55 (9%)

Query: 855 PRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFS 909
           P     PS S P G+  SP  +PS      + SPS S P+         P  P S
Sbjct: 589 PSTVVAPSTSPPAGHLGSPPATPSK-----IVSPSTSPPASHLGSPSTTPSSPES 638



 Score = 30.4 bits (68), Expect = 6.2
 Identities = 24/112 (21%), Positives = 35/112 (31%), Gaps = 12/112 (10%)

Query: 788 ASSTIT--LRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHS 845
            +ST T      +++    S         S   G    PP  P       + +   S  +
Sbjct: 569 TNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSK-----IVSPSTSPPA 623

Query: 846 RDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGST 897
             +  P   P      S  S    AS+   SP S  +   +  S S  S ST
Sbjct: 624 SHLGSPSTTP-----SSPESSIKVASTETASPESSIKVASTESSVSMVSMST 670


>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
          Length = 496

 Score = 34.9 bits (80), Expect = 0.23
 Identities = 23/97 (23%), Positives = 32/97 (32%), Gaps = 5/97 (5%)

Query: 824 PPLPP--RTNRRREVSTSDQSVHSRDI---SPPPLPPRRETRPSLSSPPGYASSPDPSPS 878
           P L     +  RRE +    S  +R          P  + +RP   S  G A    P PS
Sbjct: 8   PHLHHSYSSLDRREKTCLRSSQKTRRFPKPKASLHPSIKRSRPGRCSTNGAAVPESPKPS 67

Query: 879 SPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
             GR    + SP K   +  R      +    G +  
Sbjct: 68  RRGRKKKPTSSPPKAKTTRRRTKKTDQELDPEGAEED 104


>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
          Length = 2033

 Score = 34.8 bits (79), Expect = 0.25
 Identities = 54/203 (26%), Positives = 82/203 (40%), Gaps = 27/203 (13%)

Query: 712  ARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGS---QS 768
            +R+  P+L+ +  + P+ +P+  S    +     P+AN     +AP   S A+ S    +
Sbjct: 1836 SRSATPSLSPSP-SPPRRAPVDRSRSGRRRERDRPSANP--FRWAPRQRSRADHSPDGTA 1892

Query: 769  PGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPP---- 824
            PG           +       S+ T  P S SGPE S +++      T D   +PP    
Sbjct: 1893 PGDAPLNLEDGPGRGRPIWTPSSATTLP-SRSGPEDSVDET-----ETEDS--APPARLA 1944

Query: 825  PLPPRTNRRREVSTSDQSVHSR---DISPPPLPPRRETRPSLSSP--PGYASSPDPSPSS 879
            P P  T+R  +   S+   +S      SPP L  R   RPS   P  P    +     S+
Sbjct: 1945 PSPLETSRAEDSEDSEYPEYSNPRLGKSPPALKSREARRPSSKQPRRPSSGKNGHTDVSA 2004

Query: 880  PGRFHVSSPSPSKPSGSTSRLPL 902
               F +  P+P    GS  R PL
Sbjct: 2005 ASAFFLGRPAP----GSCGRRPL 2023



 Score = 31.4 bits (70), Expect = 3.1
 Identities = 21/54 (38%), Positives = 24/54 (44%), Gaps = 1/54 (1%)

Query: 827  PPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSP 880
            P R   +R  S+S  S  S D SP     RR+ RPS    P    S  P PS P
Sbjct: 1557 PQRPGGKRPRSSSSSSSASHDRSPSSSSRRRDGRPSSRRRPSRRMSARP-PSRP 1609


>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210).  This is
           a family of proteins conserved in yeasts. The function
           is not known. The Schizosaccharomyces pombe member is
           SPBC18E5.07 and the Saccharomyces cerevisiae member is
           AIM21.
          Length = 671

 Score = 34.5 bits (79), Expect = 0.31
 Identities = 26/91 (28%), Positives = 37/91 (40%), Gaps = 18/91 (19%)

Query: 824 PPLPPRTNRR--REVSTSDQSVHSRDISPPPLPPRR---ETRPSLSSP------------ 866
           P +PPR  RR  R VS +  S     ++ PP  P+R    T PSL               
Sbjct: 1   PKVPPRPRRRGDRSVSPNPDSFAPSPLNEPPDVPQRPPSVTLPSLGEEGAEYEALEEAEL 60

Query: 867 -PGYASSPDPSPSSPGRFHVSSPSPSKPSGS 896
              + S+P  + +      + +P PS PS S
Sbjct: 61  SDSHHSTPAQTRNVGEDLKLHAPKPSLPSSS 91


>gnl|CDD|241398 cd13244, PH_PLEKHG5_G6, Pleckstrin homology domain-containing
           family G member 5 and 6 pleckstrin homology (PH) domain.
            PLEKHG5 has a RhoGEF DH/double-homology domain in
           tandem with a PH domain which is involved in
           phospholipid binding. PLEKHG5 activates the nuclear
           factor kappa B (NFKB1) signaling pathway. Mutations in
           PLEKHG5 are associated with autosomal recessive distal
           spinal muscular atrophy. PLEKHG6 (also called MyoGEF)
           has no known function to date. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 154

 Score = 33.0 bits (76), Expect = 0.31
 Identities = 15/39 (38%), Positives = 19/39 (48%), Gaps = 3/39 (7%)

Query: 113 FFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKR 151
               D  LK     K  S +  CFLF  +LL+CKP  K+
Sbjct: 49  LLEGDLKLK---EGKGSSVDVHCFLFTDMLLICKPVKKK 84


>gnl|CDD|241479 cd13325, PH_unc89, unc89 pleckstrin homology (PH) domain.  unc89 is
           a myofibrillar protein. unc89-B the largest isoform is
           composed of 53 immunoglobulin (Ig) domains, 2 Fn3
           domains, a triplet of SH3, DH and PH domains at its
           N-terminus, and 2 protein kinase domains (PK1 and PK2)
           at its C-terminus. unc-89 mutants display
           disorganization of muscle A-bands, and usually lack
           M-lines. The COOH-terminal region of obscurin, the human
           homolog of unc89, interacts via two specific Ig-like
           domains with the NH(2)-terminal Z-disk region of titin,
           a protein that connects the Z line to the M line in the
           sarcomere and contributes to the contraction of striated
           muscle. obscurin is also thought to be involved in
           Ca2+/calmodulin via its IQ domains, as well as G
           protein-coupled signal transduction in the sarcomere via
           its RhoGEF/DH domain. The DH-PH region of OBSCN and
           unc89, the C. elegans homolog, has exchange activity for
           RhoA and Rho-1 respectively, but not for the small
           GTPases homologous to Cdc42 or Rac. PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 113

 Score = 31.9 bits (73), Expect = 0.42
 Identities = 21/98 (21%), Positives = 44/98 (44%), Gaps = 16/98 (16%)

Query: 126 KKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFFIRRIEILDLPD 185
            + ++ ER  FLF   +L+ K   +R S         ++  + LK+   +  + +   PD
Sbjct: 20  GEGKAKERYLFLFKSRILITK--VRRIS--------EDRSVFILKDIIRLPEVNVKQHPD 69

Query: 186 SDDYKNAFQIVPRC---GSPVILIANSPEDKNNWMADL 220
               +  F++ P+    G P+   A+  E K+ W+ ++
Sbjct: 70  D---ERTFELQPKPAFKGYPIDFKAHKDEIKDAWLNEI 104


>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
          Length = 318

 Score = 33.7 bits (77), Expect = 0.42
 Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 1/71 (1%)

Query: 846 RDISPPPLPPRRETRPSLSSPPGYASSPDPS-PSSPGRFHVSSPSPSKPSGSTSRLPLLP 904
           R  S  P PPRR + P  + P    +SP+ + PS PG      P  +    +  R P   
Sbjct: 3   RARSRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGC 62

Query: 905 PKPFSFGKQVP 915
           P   +F    P
Sbjct: 63  PAGVTFSSSAP 73



 Score = 30.6 bits (69), Expect = 3.4
 Identities = 23/87 (26%), Positives = 30/87 (34%), Gaps = 6/87 (6%)

Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSS-PG 881
            PP  P   R     + D S      SPP   P  E  P  ++ P   ++P   P   P 
Sbjct: 10  SPPRRPSPPRPTPPRSPDASPEETPPSPP--GPGAEPPPGRAAGP---AAPRRRPRGCPA 64

Query: 882 RFHVSSPSPSKPSGSTSRLPLLPPKPF 908
               SS +P +P       P   P P 
Sbjct: 65  GVTFSSSAPPRPPLGLDDAPAATPPPL 91


>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
           homology (PH) domain.  Fission yeast Scd1 is an exchange
           factor for Cdc42 and an effector of Ras1, the homolog of
           the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
           binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
           Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
           exchange factor for Cdc42, a member of the Rho family of
           Ras-like proteins. Cdc42 then activates the Shk1/Orb2
           protein kinase. Scd1 interacts with Klp5 and Klp6
           kinesins to mediate cytokinesis. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 148

 Score = 32.6 bits (75), Expect = 0.45
 Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 11/67 (16%)

Query: 121 KLNIGKKTESTER--VCFLFDGLLLLCK------PNSKRTSVSVTAPLGGNQ-GEYRLKE 171
              + K  + +ER    +LF+ +LL CK        S   S S ++     + G  +LK 
Sbjct: 36  TFTVRK--DDSEREYHVYLFERILLCCKEVKKKKKKSSLKSKSSSSSKKKKKKGPLQLKG 93

Query: 172 RFFIRRI 178
           R FI  I
Sbjct: 94  RIFISNI 100


>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus.  This family represents
           the C-terminus of eukaryotic neogenin precursor
           proteins, which contains several potential
           phosphorylation sites. Neogenin is a member of the N-CAM
           family of cell adhesion molecules (and therefore
           contains multiple copies of pfam00047 and pfam00041) and
           is closely related to the DCC tumour suppressor gene
           product - these proteins may play an integral role in
           regulating differentiation programmes and/or cell
           migration events within many adult and embryonic
           tissues.
          Length = 295

 Score = 33.4 bits (76), Expect = 0.47
 Identities = 33/164 (20%), Positives = 48/164 (29%), Gaps = 31/164 (18%)

Query: 766 SQSPGSVSTAF---SMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYS 822
           SQ P  V +A    ++ + Q    L S             +FS    P     T   LY 
Sbjct: 89  SQPPQPVVSAHPIHTLDNPQYPGILPSPRC-----GYLHHQFSLRPMP-FSTLTVQRLYQ 142

Query: 823 PPPLPPRTNRRREV---------STSDQSVHSRDISPP-PLPPRRETRP----SLSSPPG 868
                      R+           +   +      S   P    R T P    ++ + P 
Sbjct: 143 HGDRAESVESVRQTPEPPYLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLKSFAVPALPA 202

Query: 869 YASSPDPSPSSPGRFHVSSPSPSKPSGST--------SRLPLLP 904
             S+ +P   S        P+  K S  T        +R PLLP
Sbjct: 203 SMSTIEPKLPSTPLLTQQGPTLPKHSVKTASVGTLGRARSPLLP 246


>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566).  Family of
           related proteins that is plant specific.
          Length = 313

 Score = 33.4 bits (76), Expect = 0.48
 Identities = 37/150 (24%), Positives = 48/150 (32%), Gaps = 18/150 (12%)

Query: 768 SPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLP 827
           S  S ST+    S +S      S+  L   +SS P      +            SPP   
Sbjct: 4   SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPA------------SPPSSS 51

Query: 828 PR--TNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHV 885
           P   T+       S Q   S           R   PS  SP   AS    S ++      
Sbjct: 52  PARNTSSSSSFGLSKQRPSSL---SRGRLSSRFVSPSRGSPSAAASLN-GSLATASTSGS 107

Query: 886 SSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
           SSPS S+ + S+       P   SF   V 
Sbjct: 108 SSPSRSRRTTSSDLSSGNGPSVLSFMADVK 137



 Score = 30.3 bits (68), Expect = 5.6
 Identities = 18/82 (21%), Positives = 25/82 (30%), Gaps = 4/82 (4%)

Query: 834 REVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKP 893
           R  S S  S      SP     RR +   LS+          +P+SP     SS      
Sbjct: 1   RAASVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASP----PSSSPARNT 56

Query: 894 SGSTSRLPLLPPKPFSFGKQVP 915
           S S+S              ++ 
Sbjct: 57  SSSSSFGLSKQRPSSLSRGRLS 78



 Score = 29.5 bits (66), Expect = 9.0
 Identities = 28/113 (24%), Positives = 33/113 (29%), Gaps = 14/113 (12%)

Query: 795 RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLP 854
           R  S S    SG+ S     S              +   R    +  +      SPP   
Sbjct: 1   RAASVSSGSTSGDASSPRSSSRR---RLSSSFLSTSASSRPRRLNAPA------SPPSSS 51

Query: 855 PRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKP 907
           P R T  S SS           PSS  R  +SS   S   GS S    L    
Sbjct: 52  PARNTS-SSSS----FGLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSL 99


>gnl|CDD|144068 pfam00336, DNA_pol_viral_C, DNA polymerase (viral) C-terminal
           domain. 
          Length = 245

 Score = 33.0 bits (75), Expect = 0.58
 Identities = 16/37 (43%), Positives = 18/37 (48%), Gaps = 1/37 (2%)

Query: 852 PLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSP 888
           PLP R   RPS      YA+SP   P  P R   +SP
Sbjct: 200 PLP-RLRFRPSTGRTSLYAASPPVPPHRPVRVAFASP 235


>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein.  This family consists of
           Ebola and Marburg virus nucleoproteins. These proteins
           are responsible for encapsidation of genomic RNA. It has
           been found that nucleoprotein DNA vaccines can offer
           protection from the virus.
          Length = 717

 Score = 33.4 bits (76), Expect = 0.62
 Identities = 34/169 (20%), Positives = 50/169 (29%), Gaps = 22/169 (13%)

Query: 728 KNSPLSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSL 787
             S   ++    Q    PP  + +    A   L+N N  Q   S +           Q L
Sbjct: 484 AASTYGQTSRERQGIPEPPPGSHQPGNRASQDLNNNNQKQEDESTNPIGKTSLR--YQEL 541

Query: 788 ASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRD 847
                              ED P               LPP  +     S ++Q V   +
Sbjct: 542 --------------TPVQEEDEP-----EDQTDDDDSSLPPLESDDDPGSDNEQGVDLTE 582

Query: 848 ISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGS 896
           ++PP  P  R+ +     P    +  DP+ S          S SKPS  
Sbjct: 583 VAPPA-PVYRDEKEQDEIPHPAQNPQDPTGSIGNVDSDILRSNSKPSAP 630


>gnl|CDD|241423 cd13269, PH_alsin, Alsin Pleckstrin homology (PH) domain.  The ALS2
           gene encodes alsin, a GEF, that has dual specificity for
           Rac1 and Rab5 GTPases. Alsin mutations in the form of
           truncated proteins are responsible for motor function
           disorders including juvenile-onset amyotrophic lateral
           sclerosis, familial juvenile primary lateral sclerosis,
           and infantile-onset ascending hereditary spastic
           paralysis. The alsin protein is widely expressed in the
           developing CNS including neurons of the cerebral cortex,
           brain stem, spinal cord, and cerebellum. Alsin contains
           a regulator of chromosome condensation 1 (RCC1) domain,
           a Rho guanine nucleotide exchanging factor (RhoGEF)
           domain, a PH domain, a Membrane Occupation and
           Recognition Nexus (MORN), a vacuolar protein sorting 9
           (Vps9) domain, and a Dbl homology (DH) domain. Alsin
           interacts with Rab5 through its Vps9 domain and through
           this interaction modulates early endosome fusion and
           trafficking. The GEF activity of alsin towards Rab5 is
           regulated by Rac1 function. The GEF activity of alsin
           for Rac1 occurs via its DH domain and this interaction
           plays a role in promoting spinal motor neuron survival
           via multiple Rac-dependent signaling pathways. PH
           domains have diverse functions, but in general are
           involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 106

 Score = 31.2 bits (71), Expect = 0.65
 Identities = 12/36 (33%), Positives = 21/36 (58%), Gaps = 1/36 (2%)

Query: 182 DLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWM 217
            +PD D  +NA +I     S   L+A++P++K  W+
Sbjct: 59  PIPDEDSGQNALKITTPEES-FTLVASTPQEKAEWL 93


>gnl|CDD|241401 cd13247, BAR-PH_APPL, Adaptor protein containing PH domain, PTB
           domain, and Leucine zipper motif Bin1/amphiphysin/Rvs167
           (BAR)-Pleckstrin homology (PH) domain.  APPL (also
           called DCC-interacting protein (DIP)-13alpha) interacts
           with oncoprotein serine/threonine kinase AKT2, tumor
           suppressor protein DCC (deleted in colorectal cancer),
           Rab5, GIPC (GAIP-interacting protein, C terminus), human
           follicle-stimulating hormone receptor (FSHR), and the
           adiponectin receptors AdipoR1 and AdipoR2. There are two
           isoforms of human APPL: APPL1 and APPL2, which share
           about 50% sequence identity. APPL has a BAR and a PH
           domain near its N terminus, and the two domains are
           thought to function as a unit (BAR-PH domain).
           C-terminal to this is a PTB domain. Lipid binding assays
           show that the BAR, PH, and PTB domains can bind
           phospholipids. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 125

 Score = 31.6 bits (72), Expect = 0.69
 Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 185 DSDDYKNAFQIV-PRCGSPVILIANSPEDKNNWMA 218
           D +D +  FQI  P     ++L A S +D+  W+A
Sbjct: 83  DCEDRRFVFQITSPDGKKAIVLQAESEKDREEWIA 117


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.1 bits (76), Expect = 0.78
 Identities = 29/103 (28%), Positives = 45/103 (43%), Gaps = 4/103 (3%)

Query: 483 NLLKIIKHTTNFTRWLEKTIVETENLEERIAIVSRAIEIMMVLNDLNNFNGVLAIVSALG 542
           NL ++IK        LEK I  TEN+EE I    +  E+  VL ++N  +  L  +    
Sbjct: 166 NLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEK--ELEEVLREINEISSELPELREEL 223

Query: 543 SASVYRLKCTFQALPARLENALVELRELGNDHFRKYQERLRSI 585
                 +K   + L   +E    EL  L     RK +E++R +
Sbjct: 224 EKLEKEVK-ELEELKEEIEELEKELESL-EGSKRKLEEKIREL 264


>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
          Length = 580

 Score = 33.2 bits (76), Expect = 0.80
 Identities = 28/131 (21%), Positives = 36/131 (27%), Gaps = 18/131 (13%)

Query: 795 RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVH---------S 845
           +PG  +  + S E   G     FD       L P  +     S  D  ++          
Sbjct: 8   QPGRWN-YDSSDESPEGSRDENFDA-ERDDFLTPLGSTSEATSEDDDDLYPPRETGSGGG 65

Query: 846 RDISPPPLPPRRETRPSL------SSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSR 899
              S     PR    P        S P      P P+P  PG F  +S        STS 
Sbjct: 66  VATSTIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASSPRLGAD-STSP 124

Query: 900 LPLLPPKPFSF 910
             L        
Sbjct: 125 RFLYQVNFPVI 135


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 33.1 bits (75), Expect = 0.89
 Identities = 32/127 (25%), Positives = 44/127 (34%), Gaps = 19/127 (14%)

Query: 796 PGSSSGPEFSGEDSPGLM--------GSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRD 847
           PG   G E   EDS            G T +G     P P + ++  ++ T      S+ 
Sbjct: 524 PGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPT-----LSKK 578

Query: 848 ISPP--PLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSS--PSPSKPSGSTS-RLPL 902
              P  P  P+    P     P  A  P   P SP    +     SP +P    S + P 
Sbjct: 579 PEFPKDPKHPKDPEEPKKPKRPRSAQRP-TRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637

Query: 903 LPPKPFS 909
            P +P S
Sbjct: 638 PPQRPSS 644



 Score = 30.4 bits (68), Expect = 6.5
 Identities = 44/250 (17%), Positives = 65/250 (26%), Gaps = 43/250 (17%)

Query: 687 RKDIPRRWPNLSLKSPGIKASKN----YMARNNLPALTVNKITGPKNSPLSESDGTPQTP 742
           ++  P + P LS K    K  K+       +      +  + T PK+  L E    P++P
Sbjct: 565 KEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSP 624

Query: 743 LTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQ----------------SVQS 786
             P +  S      P   S+    + P  + +     S +                   +
Sbjct: 625 KRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAA 684

Query: 787 LASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSR 846
            +  T T      S      E  P   G+ F      PP  PR          D      
Sbjct: 685 KSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQP 744

Query: 847 DISPPPLPPRRET---------------------RPSLSSPPGYASSPDPSPSSPGRFHV 885
           D      PP  E                         + +  G        P SP     
Sbjct: 745 DDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPSEHED 804

Query: 886 SSPS--PSKP 893
             P   PS P
Sbjct: 805 KPPGDHPSLP 814


>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 694

 Score = 32.9 bits (75), Expect = 0.99
 Identities = 29/126 (23%), Positives = 45/126 (35%), Gaps = 22/126 (17%)

Query: 802 PEFSGEDSPGLMGSTFDGLYSP----PP------LPPRTNRR--REVSTSDQSVHSRDIS 849
           P    E++P L+ S     Y P    PP      LP    R       T+  +       
Sbjct: 282 PGLGFEEAPELLWSFLGQFYGPQSTIPPRIVVPWLPDTEGREGDDLAPTAVCTDAGLLPD 341

Query: 850 PPPLPPRRE-----TRPSLSSPPGYASSP----DPSPSSPGRFHVSSPSPSKPSGSTSRL 900
            P LP   E       P  ++ P  AS P      +P+S      + P+ ++P+ + + L
Sbjct: 342 TPLLPDAPEGSSDPVVPVAAATPVDASLPDVRTGTAPTSLANVSHADPAVAQPTQAAT-L 400

Query: 901 PLLPPK 906
               PK
Sbjct: 401 AGAAPK 406


>gnl|CDD|241419 cd13265, PH_evt, Evectin Pleckstrin homology (PH) domain.  There
           are 2 members of the evectin family (also called
           pleckstrin homology domain containing, family B): evt-1
           (also called PLEKHB1) and evt-2 (also called PLEKHB2).
           evt-1 is specific to the nervous system, where it is
           expressed in photoreceptors and myelinating glia. evt-2
           is widely expressed in both neural and nonneural
           tissues. Evectins possess a single N-terminal PH domain
           and a C-terminal hydrophobic region. evt-1 is thought to
           function as a mediator of post-Golgi trafficking in
           cells that produce large membrane-rich organelles. It is
           a candidate gene for the inherited human retinopathy
           autosomal dominant familial exudative vitreoretinopathy
           and a susceptibility gene for multiple sclerosis. evt-2
           is essential for retrograde endosomal membrane transport
           from the plasma membrane (PM) to the Golgi. Two membrane
           trafficking pathways pass through recycling endosomes: a
           recycling pathway and a retrograde pathway that links
           the PM to the Golgi/ER. Its PH domain that is unique in
           that it specifically recognizes phosphatidylserine (PS),
           but not polyphosphoinositides. PS is an anionic
           phospholipid class in eukaryotic biomembranes, is highly
           enriched in the PM, and plays key roles in various
           physiological processes such as the coagulation cascade,
           recruitment and activation of signaling molecules, and
           clearance of apoptotic cells. PH domains are only found
           in eukaryotes. They share little sequence conservation,
           but all have a common fold, which is electrostatically
           polarized. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 108

 Score = 30.7 bits (70), Expect = 0.99
 Identities = 13/40 (32%), Positives = 17/40 (42%)

Query: 181 LDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADL 220
           +  P+        QIV R G  +IL A S +D   W   L
Sbjct: 65  VQPPEGRSRDCLLQIVTRDGKRLILCAESADDALAWKLAL 104


>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
           UL32.  The large phosphorylated protein (UL32-like) of
           herpes viruses is the polypeptide most frequently
           reactive in immuno-blotting analyses with antisera when
           compared with other viral proteins.
          Length = 777

 Score = 33.0 bits (75), Expect = 1.0
 Identities = 37/249 (14%), Positives = 60/249 (24%), Gaps = 53/249 (21%)

Query: 714 NNLPALTVNKITGPKNSPLSESDGTPQTPLTP--------PTANSEFTVFAPIMLSN--- 762
           N+   +     T   + PL+ +D T   P            +    + +  P +  N   
Sbjct: 378 NSGKEMPGILTTENLDLPLASTDSTEMDPEDKRGGAVKINNSGILAWGLKTPGLAVNDER 437

Query: 763 ANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGS------------------SSGPEF 804
           +    S G           +   S         P                    ++ PE 
Sbjct: 438 SIAVSSDGITDVLDPPSPLRLHSSDKVIDSVSPPSKRRVSAPASRLDDAKRPEVTATPES 497

Query: 805 SGEDSPG---------------------LMGSTFDGLYSPPPLPPRTNRRREVSTSDQSV 843
           SG DS G                     L         SPPP  P  +RR  +     + 
Sbjct: 498 SGSDSEGGASGREDETSSDAESVVSIKELRPRIGFINKSPPPKSPPKSRRTLIVALSLAS 557

Query: 844 HSRDISPPPLPPRRE---TRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRL 900
            S   SP P P   +        +       + +    +     V     +     T   
Sbjct: 558 PSTAGSPRPKPSLGKFVIGTDPFAFANTVRLTDNMRGGNGVGSSVKPKGSASSKPLTGPG 617

Query: 901 PLLPPKPFS 909
             L P   +
Sbjct: 618 SDLKPATLN 626


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 32.7 bits (74), Expect = 1.1
 Identities = 33/142 (23%), Positives = 43/142 (30%), Gaps = 20/142 (14%)

Query: 782 QSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQ 841
           Q +Q     +I + PG++  P                    PP   P   +        Q
Sbjct: 171 QLLQPQGPPSIQVPPGAALAPSAPPPTPSAQA--------VPPQGSPIAAQ--PAPQPQQ 220

Query: 842 SVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSS----PSPSKPSGST 897
                 IS P L P+R   P     P  AS   P P +P   H  S    P P  P    
Sbjct: 221 PSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQ 280

Query: 898 SRLPLL------PPKPFSFGKQ 913
                L      PP+PF   + 
Sbjct: 281 QGPVFLQHPSSNPPQPFGLAQS 302



 Score = 30.4 bits (68), Expect = 5.7
 Identities = 41/191 (21%), Positives = 52/191 (27%), Gaps = 25/191 (13%)

Query: 727 PKNSPLSESDGTPQTPLTPPTA-NSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQ 785
           P   P + S  +PQ P        S      P M          G V       +     
Sbjct: 242 PPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHAL----QQGPVFLQHPSSNPPQPF 297

Query: 786 SLASSTITLRPG-SSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVH 844
            LA S +   P  S + P      S   +          P  PPR        +      
Sbjct: 298 GLAQSQVPPLPLPSQAQPHSHTPPSQSALQ---------PQQPPREQPLPPAPSMPH--- 345

Query: 845 SRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKP--SGSTSRLPL 902
              I PPP  P  +        P +   P P P  P   ++  P   KP  S  T   P 
Sbjct: 346 ---IKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPS--NLPPPPALKPLSSLPTHHPPS 400

Query: 903 LPPKPFSFGKQ 913
             P P     Q
Sbjct: 401 AHPPPLQLMPQ 411



 Score = 29.7 bits (66), Expect = 9.1
 Identities = 33/193 (17%), Positives = 46/193 (23%), Gaps = 36/193 (18%)

Query: 732 LSESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASST 791
           L +  G P   + P  A +                 SP +   A      Q       S 
Sbjct: 172 LLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQ--PQQPSPLSLISA 229

Query: 792 ITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPP 851
            +L P     P                     PPL P+T  ++                P
Sbjct: 230 PSLHPQRLPSP--------------------HPPLQPQTASQQSPQ-------------P 256

Query: 852 PLPPRRETRPSLSSP-PGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSF 910
           P P  R  + S   P P    +    P        + P P   + S      LP +    
Sbjct: 257 PAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPH 316

Query: 911 GKQVPMFQFNAPG 923
               P      P 
Sbjct: 317 SHTPPSQSALQPQ 329


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 30.4 bits (69), Expect = 1.2
 Identities = 15/58 (25%), Positives = 24/58 (41%), Gaps = 7/58 (12%)

Query: 848 ISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
           +   PLP         ++PP Y   P   P+ PG +  + P P+    +  + P  PP
Sbjct: 54  VHLEPLPAY----GQYAAPPPYGPPPPYYPAPPGVYP-TPPPPNSGYMADPQEP--PP 104


>gnl|CDD|219118 pfam06640, P_C, P protein C-terminus.  This family represents the
           C-terminus of plant P proteins. The maize P gene is a
           transcriptional regulator of genes encoding enzymes for
           flavonoid biosynthesis in the pathway leading to the
           production of a red phlobaphene pigment, and P proteins
           are homologous to the DNA-binding domain of myb-like
           transcription factors. All members of this family
           contain the pfam00249 domain.
          Length = 209

 Score = 32.0 bits (72), Expect = 1.2
 Identities = 24/82 (29%), Positives = 29/82 (35%), Gaps = 12/82 (14%)

Query: 830 TNRRREVST---SDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVS 886
            ++RR   T   S +S  SR        P  E           ASSP    S   R  V 
Sbjct: 12  ADKRRGGRTPGRSPKSSASRTKQADADQPGGEAA----GDAAAASSP--RHSDGARSAVV 65

Query: 887 SPSPSKP---SGSTSRLPLLPP 905
            P P++P   SGST      P 
Sbjct: 66  DPGPNQPNSSSGSTGTAEEGPC 87


>gnl|CDD|233411 TIGR01431, adm_rel, adenosine deaminase-related growth factor.
           Members of this family have been described as secreted
           proteins with growth factor activity and regions of
           adenosine deaminase homology in insects, mollusks, and
           vertebrates.
          Length = 479

 Score = 32.5 bits (74), Expect = 1.3
 Identities = 22/107 (20%), Positives = 40/107 (37%), Gaps = 11/107 (10%)

Query: 318 FLQLLIERFSIPEFQGTVREDLKKFRKEYVQPVQFRVLNVLRHWVDHHFYDFEQDAHLLE 377
           F  L       P F+      L++F ++ VQ ++ R   +         Y+     H  E
Sbjct: 182 FFTLSGLLHYAPVFRDYYFRALEEFYEDNVQYMELRS-RLFP------LYELSGTHHDEE 234

Query: 378 QLKLFLETINGKSMRKWVDSV-SKVIF---RKTDPLELQKQIKFAFD 420
                 + +  K + +  D +  K+I+   R  D  E+ + IK A  
Sbjct: 235 WSVKTYKEVTEKFVEEHPDFIGIKIIYSDLRSKDVEEIAEYIKMAMG 281


>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 226

 Score = 31.8 bits (72), Expect = 1.5
 Identities = 20/93 (21%), Positives = 30/93 (32%), Gaps = 6/93 (6%)

Query: 825 PLPPRTNRRREVSTSDQ-SVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPS--PSSPG 881
           P  P     +E+  +   +  S D  P P+          +  P  A +P P   P  P 
Sbjct: 67  PTQPPEGVAQEIQDAGDAAAASVD--PQPVAQPPVESTP-AGVPVAAQTPKPVKPPKQPP 123

Query: 882 RFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQV 914
              V +    KP       P   P   +F  Q+
Sbjct: 124 AGAVPAKPTPKPEPKPVAEPAAAPTGQAFVVQL 156


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 32.1 bits (73), Expect = 1.6
 Identities = 17/79 (21%), Positives = 25/79 (31%), Gaps = 11/79 (13%)

Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGR 882
            P     +  R   + S +   +   + PP  P RET      PP     P   P     
Sbjct: 370 KPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETAT----PPPVPPRPVAPP----- 420

Query: 883 FHVSSPSPSKPSGSTSRLP 901
             V     S P  + + +P
Sbjct: 421 --VPHTPESAPKLTRAAIP 437


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 32.1 bits (73), Expect = 1.7
 Identities = 18/100 (18%), Positives = 22/100 (22%), Gaps = 16/100 (16%)

Query: 816 TFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDP 875
           T D     P  PP   RR    TS+  +      P                P     P  
Sbjct: 87  TVDPSAGEPAPPPPHARR----TSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTA 142

Query: 876 SPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
            P+ P       P               P     +G Q  
Sbjct: 143 RPAYPAYQQRPEPGA------------WPRAADDYGWQQQ 170



 Score = 31.3 bits (71), Expect = 3.0
 Identities = 18/120 (15%), Positives = 26/120 (21%), Gaps = 7/120 (5%)

Query: 795 RPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLP 854
            PG+              +G      Y+ P        R            R        
Sbjct: 154 EPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAG----RPEYDQRRR 209

Query: 855 PRRETRPSLSSPPGYASS-PDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQ 913
                RP    P    +  P+P P +          P +        P+ P  P     Q
Sbjct: 210 DYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVV--PIRPSAPGPLAAQ 267


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 32.0 bits (72), Expect = 1.7
 Identities = 43/234 (18%), Positives = 74/234 (31%), Gaps = 24/234 (10%)

Query: 738 TPQTPLTPPTANSEFTVFAPIMLSNANGS-QSPGSVSTAFSMFSTQSVQSLASSTITLRP 796
           TP   L  P   S     A +  + A     +  +V+ A  +       +  SS++    
Sbjct: 283 TPPDALLQPATVSAIPAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATPSSSLPTDI 342

Query: 797 GSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPR 856
           G+             ++ S        PPLP         +            P P+PP 
Sbjct: 343 GN-----------KAVVSSAKKEAEEVPPLPQ--------AAPAVVKPGPMEIPTPVPPP 383

Query: 857 RETRPSLSSPPGYASSPDPSPS---SPGRFHVSSPSPSKPSGSTSRLPLL-PPKPFSFGK 912
               PSL +PPG  +  + +PS   SP +       P         L    P K     +
Sbjct: 384 GLAIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVTFGALDDTLAWKEPSKEDQTSE 443

Query: 913 QVPMFQFNAPGNDKWAMHEAKEAIEKSKKKISCLVLPVEKIHYLLSKDVLQYSG 966
              M           A+   K+  EK  +++   ++   +   L S++ +   G
Sbjct: 444 DGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMNSEDASLASQEGMSIRG 497


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 32.0 bits (72), Expect = 1.7
 Identities = 21/86 (24%), Positives = 36/86 (41%), Gaps = 6/86 (6%)

Query: 831 NRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSP 890
            +   +S S+ +  S+      +P   + +PS   PP          S+P      +PS 
Sbjct: 160 TKVAIISKSEDAA-SQVTPSQKIPETTDPKPS---PPAEDKQKPKVESAPVAEKPKAPSS 215

Query: 891 SKPSGSTSRLPLLPPKPFSFGKQVPM 916
             P   +++ P LPPK     ++VPM
Sbjct: 216 PPPPKQSAKEPQLPPKERE--RRVPM 239


>gnl|CDD|215361 PLN02673, PLN02673, quinolinate synthetase A.
          Length = 724

 Score = 31.9 bits (72), Expect = 1.7
 Identities = 21/79 (26%), Positives = 30/79 (37%), Gaps = 15/79 (18%)

Query: 837 STSDQSVHSRDISPPPLPPRRETRPSLSS---PPG-----------YASSPDPSPSSPGR 882
            TS  S     + P P P  R T P+  S                  +  PD +PS+   
Sbjct: 5   PTSSSSSSFLSLLPNPSPNFRTTHPNFGSQRRIGTINPLFKSFKCIQSPPPDSAPSNASP 64

Query: 883 FHVSSPSPSKPSGSTSRLP 901
           F  S+ + S PS +T  +P
Sbjct: 65  FSCSAVAFS-PSQTTELVP 82


>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
          Length = 540

 Score = 31.8 bits (72), Expect = 1.8
 Identities = 24/63 (38%), Positives = 27/63 (42%), Gaps = 5/63 (7%)

Query: 799 SSGPEFSGEDSPGLMGSTFDGL--YSPPPLPPRTNRRREVST--SDQSVHSRDISPPPLP 854
           S      G DSP   GS F  L    P P  P    RR+++T  SD     RD S  P P
Sbjct: 478 SDPDRLPGTDSPPKRGSGFQILSGTKPDPWSPGARSRRDLTTFRSDDPDRYRD-SQAPDP 536

Query: 855 PRR 857
           P R
Sbjct: 537 PHR 539


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 32.0 bits (72), Expect = 1.9
 Identities = 23/110 (20%), Positives = 36/110 (32%), Gaps = 8/110 (7%)

Query: 821 YSPPPLPPRTNRRREVSTSDQSVHSR---DISPPPLPPRRETRPSLSS-----PPGYASS 872
           Y P P    T    + +        R    + PP  PP R  RP+ ++     P      
Sbjct: 674 YQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGR 733

Query: 873 PDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVPMFQFNAP 922
             P  ++PGR    + +P +     +      P   + G   P     AP
Sbjct: 734 ARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAP 783



 Score = 29.6 bits (66), Expect = 9.7
 Identities = 19/108 (17%), Positives = 34/108 (31%), Gaps = 2/108 (1%)

Query: 798 SSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRR 857
           +SS P ++    P    S      +     P T+  R+     + +  R +   P+    
Sbjct: 586 ASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNV 645

Query: 858 ETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
              P+   PP    +P     +        PSP+    +T       P
Sbjct: 646 LVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGA--NTMLPIQWAP 691


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 31.4 bits (71), Expect = 2.0
 Identities = 15/67 (22%), Positives = 24/67 (35%), Gaps = 1/67 (1%)

Query: 842 SVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSS-PGRFHVSSPSPSKPSGSTSRL 900
           +  + D +PPP PP         + P     P   P+  P   + + P P     +    
Sbjct: 35  ATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPP 94

Query: 901 PLLPPKP 907
           P+ P  P
Sbjct: 95  PVDPNAP 101


>gnl|CDD|241472 cd13318, PH_IQSEC, IQ motif and SEC7 domain-containing protein
           family Pleckstrin homology domain.  The IQSEC (also
           called BRAG/Brefeldin A-resistant Arf-gunanine
           nucleotide exchange factor) family are a subset of Arf
           GEFs that have been shown to activate Arf6, which acts
           in the endocytic pathway to control the trafficking of a
           subset of cargo proteins including integrins and have
           key roles in the function and organization of distinct
           excitatory and inhibitory synapses in the retina. The
           family consists of 3 members: IQSEC1 (also called
           BRAG2/GEP100), IQSEC2 (also called BRAG1), and IQSEC3
           (also called SynArfGEF, BRAG3, or KIAA1110). IQSEC1
           interacts with clathrin and modulates cell adhesion by
           regulating integrin surface expression and in addition
           to Arf6, it also activates the class II Arfs, Arf4 and
           Arf5. Mutations in IQSEC2 cause non-syndromic X-linked
           intellectual disability as well as reduced activation of
           Arf substrates (Arf1, Arf6). IQSEC3 regulates Arf6 at
           inhibitory synapses and associates with the
           dystrophin-associated glycoprotein complex and S-SCAM.
           These members contains a IQ domain that may bind
           calmodulin, a PH domain that is thought to mediate
           membrane localization by binding of phosphoinositides,
           and a SEC7 domain that can promote GEF activity on ARF.
           PH domains have diverse functions, but in general are
           involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 128

 Score = 30.3 bits (69), Expect = 2.0
 Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 13/91 (14%)

Query: 132 ERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFFIRRIEILDLPDSDDYKN 191
           +R  FLF+ LL++ K  SK+ S SVT         Y  ++ F +  +++L L ++  Y +
Sbjct: 27  QREVFLFNDLLVVTKIFSKKKS-SVT---------YTFRQSFPLLGMQVL-LFENSYYPH 75

Query: 192 AFQIVPRCGSPVILIANSP--EDKNNWMADL 220
             ++     + V++  N+P   D+  ++ DL
Sbjct: 76  GIRLTSPLDNKVLITFNAPNESDRKKFVEDL 106


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
           component of pyruvate dehydrogenase complex.
          Length = 539

 Score = 31.7 bits (72), Expect = 2.3
 Identities = 19/56 (33%), Positives = 22/56 (39%), Gaps = 1/56 (1%)

Query: 837 STSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSK 892
           S+S      +    PP P   E     SSP   AS P   PSS  R   +SP   K
Sbjct: 203 SSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSSGDRI-FASPLARK 257


>gnl|CDD|223652 COG0579, COG0579, Predicted dehydrogenase [General function
           prediction only].
          Length = 429

 Score = 31.5 bits (72), Expect = 2.4
 Identities = 17/70 (24%), Positives = 29/70 (41%), Gaps = 10/70 (14%)

Query: 645 SVEPRIRRFLENLTPFDNWKEIDISNYLYEQSLKIEPRYAKLRKDIP-----RRWPNLS- 698
           SV     R + +L      K   ++NY   + +  EPR   L   IP       + N++ 
Sbjct: 297 SVRKANSRGMPDL----GIKNNVLANYAGIRPILKEPRLPALDFIIPEAKDEDWFINVAG 352

Query: 699 LKSPGIKASK 708
           ++S G+ A  
Sbjct: 353 IRSQGLTADP 362


>gnl|CDD|220815 pfam10577, UPF0560, Uncharacterized protein family UPF0560.  This
           family of proteins has no known function.
          Length = 805

 Score = 31.8 bits (72), Expect = 2.4
 Identities = 22/122 (18%), Positives = 38/122 (31%), Gaps = 18/122 (14%)

Query: 812 LMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPG--- 868
           ++ S  +     P L    +  R+ ++S++ +  +      LP   E+ P  +S  G   
Sbjct: 341 MVSSAGEADMHTPMLKSAFSSSRDFTSSEELLAHKAEDKSQLPQSGESFPLKASRSGEQK 400

Query: 869 ----------YASSPDPSPSSPGRFHVSSPSPSK-----PSGSTSRLPLLPPKPFSFGKQ 913
                     Y        SS    H  S   +         + S  P   P P    K+
Sbjct: 401 EEYSKLETEEYRRGYGTVESSSLENHRDSFGLANQNHYSAPPTVSIQPPSGPVPPKPEKE 460

Query: 914 VP 915
           V 
Sbjct: 461 VL 462


>gnl|CDD|223034 PHA03293, PHA03293, deoxyribonuclease; Provisional.
          Length = 523

 Score = 31.6 bits (72), Expect = 2.6
 Identities = 25/111 (22%), Positives = 38/111 (34%), Gaps = 29/111 (26%)

Query: 530 NFNGVLAIVSALGSASVYRLKCTFQAL--------PARLENALVELRELGNDHFRKYQER 581
           + +GVLA      +   + +KC  +           ++    L+  R       R +   
Sbjct: 257 DPHGVLAPHPTQTTLEFFEIKCRAKYAFPPDDGSPLSQAYERLLARRTAAA--LRAF--- 311

Query: 582 LRSINPPCVPFFGMYLTNILHIEEGNPDYLPNSNEELINFS---KRRRVAE 629
           LRSI  P V +F              PD +P + E LI      K  R A 
Sbjct: 312 LRSIRNPGVEYFA-------------PDRVPGAAEALITCDDAWKPGRAAG 349


>gnl|CDD|237138 PRK12563, PRK12563, sulfate adenylyltransferase subunit 2;
           Provisional.
          Length = 312

 Score = 31.3 bits (71), Expect = 2.6
 Identities = 10/35 (28%), Positives = 18/35 (51%)

Query: 659 PFDNWKEIDISNYLYEQSLKIEPRYAKLRKDIPRR 693
           P  NW E+D+  Y+  + + + P Y   R+ +  R
Sbjct: 195 PLSNWTELDVWQYIAREKIPLVPLYFAKRRPVVER 229


>gnl|CDD|223327 COG0249, MutS, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 843

 Score = 31.5 bits (72), Expect = 2.7
 Identities = 33/175 (18%), Positives = 59/175 (33%), Gaps = 22/175 (12%)

Query: 473 TKSNKEEKSPNLLKIIKHT-TNF-----TRWLEKTIVETENLEERIAIVS---RAIEIMM 523
           +      K  +L  ++  T T        RWL + +++ E +EER+  V       E+  
Sbjct: 274 SNLRGNGKKGSLFWLLDRTVTPMGSRLLKRWLLQPLLDKEEIEERLDAVEELKDNPELRE 333

Query: 524 VLND-LNNFNGVLAIVS--ALGSAS---VYRLKCTFQALPARLENALVELRELGNDHFRK 577
            L + L     +  ++S  +LG AS   + RL+ + + +P   +       E       +
Sbjct: 334 KLREMLKKVPDLERLLSRLSLGRASPRDLLRLRDSLEKIPEIFKLLSSLKSESDLLLLLE 393

Query: 578 YQERLRSINPPCVPFFGMYLTNI-LHIEEGNPDYLPNS--NEELINFSKRRRVAE 629
             E L            +  T I         D +     N EL         A+
Sbjct: 394 DIESL----DYLAELLELLETAINEDPPLAVRDGIIKEGYNIELDELRDLLNNAK 444


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 31.3 bits (71), Expect = 2.7
 Identities = 12/48 (25%), Positives = 21/48 (43%)

Query: 846 RDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKP 893
           R    PP        P+  + P  A++P   P++P R   + P+P+  
Sbjct: 379 RGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPA 426


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 31.2 bits (70), Expect = 2.9
 Identities = 29/134 (21%), Positives = 44/134 (32%), Gaps = 39/134 (29%)

Query: 765 GSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPP 824
           G+Q P   + A  M  + +  +      T  P ++  P    +D+P              
Sbjct: 401 GTQGPQGTAPAAGMTPSSAAPA------TPAPSAAPSPRVPWDDAP-------------- 440

Query: 825 PLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFH 884
           P PPR+                 I P P P   E  P   +P   AS+ D  P+      
Sbjct: 441 PAPPRSG----------------IPPRPAPRMPEASPVPGAPDSVASASDAPPTLGD--- 481

Query: 885 VSSPSPSKPSGSTS 898
            S  +   PSG  +
Sbjct: 482 PSDTAEHTPSGPRT 495


>gnl|CDD|184628 PRK14331, PRK14331, (dimethylallyl)adenosine tRNA
           methylthiotransferase; Provisional.
          Length = 437

 Score = 31.3 bits (71), Expect = 3.0
 Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 8/60 (13%)

Query: 192 AFQIVPRCGSPVILIAN--SPEDKNNWMADLIMLNTKSMFER------TLDSILLDEEKK 243
           +F+  PR G+P   +      E K   M  L+ L  +  F++      T+  +L++EEK+
Sbjct: 331 SFKYSPRPGTPAAYMEGQEPDEVKTKRMNRLLELQKEITFKKALSYEGTVQEVLVEEEKE 390


>gnl|CDD|241259 cd01225, PH_Cool_Pix, Cloned out of library/PAK-interactive
           exchange factor pleckstrin homology (PH) domain.  There
           are two forms of Pix proteins: alpha Pix (also called
           Rho guanine nucleotide exchange factor (GEF) 6/90Cool-2)
           and beta Pix (GEF7/p85Cool-1). betaPix contains  an
           N-terminal SH3 domain, a RhoGEF/DH domain, a PH domain,
           a GIT1 binding domain (GBD), and a C-terminal
           coiled-coil (CC) domain. alphaPix differs in that it
           contains a calponin homology (CH) domain, which
           interacts with beta-parvin, N-terminal to the SH3
           domain. alphaPix is an exchange factor for Rac1 and
           Cdc42 and mediates Pak activation on cell adhesion to
           fibronectin. Mutations in alphaPix can cause X-linked
           mental retardation. alphaPix also interacts with
           Huntington's disease protein (htt), and enhances the
           aggregation of mutant htt (muthtt) by facilitating
           SDS-soluble muthtt-muthtt interactions. The DH-PH domain
           of a Pix was required for its binding to htt. In the
           majority of Rho GEF proteins, the DH-PH domain is
           responsible for the exchange activity. PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 100

 Score = 29.2 bits (66), Expect = 3.1
 Identities = 11/35 (31%), Positives = 20/35 (57%)

Query: 183 LPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWM 217
           L D++  K+AF+I       +++I NS  D+  W+
Sbjct: 57  LEDTEGIKHAFEISGPMIERIVVICNSSNDQQEWI 91


>gnl|CDD|147993 pfam06126, Herpes_LAMP2, Herpesvirus Latent membrane protein 2.
           Family of Kaposi's sarcoma-associated herpesvirus (HHV8)
           latent membrane protein.
          Length = 510

 Score = 31.2 bits (70), Expect = 3.1
 Identities = 19/57 (33%), Positives = 23/57 (40%), Gaps = 15/57 (26%)

Query: 832 RRREVSTSDQSVHSRD--------------ISPPPLPPR-RETRPSLSSPPGYASSP 873
           RR  + T D ++H+                ISPPPLPP  R     LS     A SP
Sbjct: 369 RRLPIYTPDDNLHAHAGRICPDVNHLANNLISPPPLPPFFRIHSRILSHTTDMALSP 425


>gnl|CDD|218233 pfam04732, Filament_head, Intermediate filament head (DNA binding)
           region.  This family represents the N-terminal head
           region of intermediate filaments. Intermediate filament
           heads bind DNA. Vimentin heads are able to alter nuclear
           architecture and chromatin distribution, and the
           liberation of heads by HIV-1 protease liberates may play
           an important role in HIV-1 associated cytopathogenesis
           and carcinogenesis. Phosphorylation of the head region
           can affect filament stability. The head has been shown
           to interaction with the rod domain of the same protein.
          Length = 89

 Score = 29.0 bits (65), Expect = 3.2
 Identities = 19/53 (35%), Positives = 21/53 (39%), Gaps = 1/53 (1%)

Query: 854 PPR-RETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
           P R   +R S  S    +SS      S  R   SSPSPS  S    R    PP
Sbjct: 13  PTRSSSSRSSGGSSGRSSSSSGFRSRSVSRSSSSSPSPSLKSSYRKRSSSAPP 65


>gnl|CDD|132626 TIGR03587, Pse_Me-ase, pseudaminic acid biosynthesis-associated
           methylase.  Members of this small clade are
           methyltransferases of the pfam08241 family and are
           observed within operons for the biosynthesis of
           pseudaminic acid, a component of exopolysaccharide and
           flagellin glycosyl modifications. Notable among these
           genomes is Pseudomonas fluorescens PfO-1. Possibly one
           of the two hydroxyl groups of pseudaminic acid, at
           positions 4 and 8 is converted to a methoxy group by
           this enzyme.
          Length = 204

 Score = 30.5 bits (69), Expect = 3.3
 Identities = 22/78 (28%), Positives = 31/78 (39%), Gaps = 10/78 (12%)

Query: 554 QALPARLENALVELRELGNDHFRKYQERLRSIN-----PPCVPFFGMYLTN--ILHIEEG 606
           + L        VE+ E   +  + Y   +  I      P    FF + LT   ++HI   
Sbjct: 62  KRLLPFKHIYGVEINEYAVEKAKAYLPNINIIQGSLFDPFKDNFFDLVLTKGVLIHI--- 118

Query: 607 NPDYLPNSNEELINFSKR 624
           NPD LP +  EL   S R
Sbjct: 119 NPDNLPTAYRELYRCSNR 136


>gnl|CDD|241255 cd01220, PH1_FARP1-like, FERM, RhoGEF and pleckstrin
           domain-containing protein 1 and related proteins
           Pleckstrin Homology (PH) domain, repeat 1.  Members here
           include FARP1 (also called Chondrocyte-derived
           ezrin-like protein; PH domain-containing family C member
           2), FARP2 (also called FIR/FERM domain including RhoGEF;
           FGD1-related Cdc42-GEF/FRG), and FARP6 (also called Zinc
           finger FYVE domain-containing protein 24). They are
           members of the Dbl family guanine nucleotide exchange
           factors (GEFs) which are upstream positive regulators of
           Rho GTPases. Little is known about FARP1 and FARP6,
           though FARP1 has increased expression in differentiated
           chondrocytes. FARP2 is thought to regulate neurite
           remodeling by mediating the signaling pathways from
           membrane proteins to Rac. It is found in brain, lung,
           and testis, as well as embryonic hippocampal and
           cortical neurons. FARP1 and FARP2 are composed of a
           N-terminal FERM domain, a proline-rich (PR) domain,
           Dbl-homology (DH), and two C-terminal PH domains. FARP6
           is composed of Dbl-homology (DH), and two C-terminal PH
           domains separated by a FYVE domain. This hierarchy
           contains the first PH repeat. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 123

 Score = 29.6 bits (67), Expect = 3.4
 Identities = 29/112 (25%), Positives = 51/112 (45%), Gaps = 18/112 (16%)

Query: 112 EFFREDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKE 171
           EF RE  LLKL+   K    +R+ FLF  +L+        TS   T+ L     ++++  
Sbjct: 21  EFIREGCLLKLS---KKGLQQRMFFLFSDVLIY-------TSRGPTSTL-----QFKVHG 65

Query: 172 RFFIRRIEILDLPDSDDYKNAFQIVPRCGSPVILI-ANSPEDKNNWMADLIM 222
           +  +R + + +        + F I    G+  + + A+S E+K  W+ DL  
Sbjct: 66  QLPLRGLMVEESEPEMGVPHCFTI--YGGNRALTVAASSEEEKERWLEDLSR 115


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 30.6 bits (69), Expect = 3.4
 Identities = 14/55 (25%), Positives = 17/55 (30%), Gaps = 7/55 (12%)

Query: 849 SPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLL 903
             PP P      P  S P    +   P+ S P     +  SP       S  P L
Sbjct: 153 MQPPPPH---AMPPASPPAAQPAPSAPASSPPP----TPASPPPAKAPKSSHPPL 200


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 31.0 bits (70), Expect = 3.6
 Identities = 43/181 (23%), Positives = 60/181 (33%), Gaps = 28/181 (15%)

Query: 733 SESDGTPQTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTI 792
           SE     Q    PP+ N+  +  +      A+ S S  S S            S +SS  
Sbjct: 365 SEDSDEEQATEKPPSRNTPPSAPSSNPEPAASSSGSSSSSSG-----------SESSS-- 411

Query: 793 TLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRR------EVSTSDQSVHSR 846
               GS S  E S  DS            SP P PP TN+ +      +V+    S    
Sbjct: 412 ----GSDSESESSSSDSEENEPPRTA---SPEPEPPSTNKWQLDNWLNKVNPHKVSPAES 464

Query: 847 DISPPPL--PPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLP 904
             S PP+  P  +E +   S    +  S +P P S  +      +  K   S       P
Sbjct: 465 VSSNPPIKQPMEKEGKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESGRGKQKSP 524

Query: 905 P 905
            
Sbjct: 525 A 525


>gnl|CDD|235765 PRK06274, PRK06274, indolepyruvate oxidoreductase subunit B;
           Reviewed.
          Length = 197

 Score = 30.0 bits (68), Expect = 3.9
 Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 7/64 (10%)

Query: 439 ILVLHPVEIARQLTLLESEYYRAVKPSEIVGSAWTKSNKEEKSPNLLKIIK------HTT 492
           +L L P E+AR L  L+      V    I  +  T  +++      +K  K         
Sbjct: 71  LLALEPAEVARNLHFLKKGGKIIVNAYAIHPAT-TVGSEKYDPEKEIKFAKEKICDVICI 129

Query: 493 NFTR 496
           +FT+
Sbjct: 130 DFTK 133


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 31.0 bits (70), Expect = 3.9
 Identities = 22/76 (28%), Positives = 32/76 (42%), Gaps = 9/76 (11%)

Query: 848 ISPPPLPPRRETRPSLSSPPGYASSP-DPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPK 906
             PPP         S S  P  +S+P +  P       +  PSP++PS   + +P  PP 
Sbjct: 187 QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIP-APPI 245

Query: 907 PFSFGKQVPMFQFNAP 922
           P       P+ Q+ AP
Sbjct: 246 P-------PVIQYVAP 254


>gnl|CDD|227950 COG5663, COG5663, Uncharacterized conserved protein [Function
           unknown].
          Length = 194

 Score = 30.2 bits (68), Expect = 4.1
 Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 181 LDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWMADLIMLNTKSMFERTLDSIL 237
           +DL   D + NA QI    G PVILI NSP ++     ++I  N  +     +DS L
Sbjct: 135 IDLFFEDSHDNAGQIAKNAGIPVILI-NSPYNRKPAAKNIIRANNWAEAYEWVDSRL 190


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 30.8 bits (69), Expect = 4.1
 Identities = 18/90 (20%), Positives = 25/90 (27%), Gaps = 2/90 (2%)

Query: 791 TITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISP 850
              L   +S+GP+    D P   G        PP L   +        S +   S     
Sbjct: 247 QPALPRQNSAGPQ-RRVDQPSPPGGGSHRGRIPPSLL-SSLPSEGSMLSSEWPQSGARPR 304

Query: 851 PPLPPRRETRPSLSSPPGYASSPDPSPSSP 880
                 +   P L    G+   P P   S 
Sbjct: 305 QQSSSSKGDSPELRPAAGHLQQPSPVNMSA 334


>gnl|CDD|223328 COG0250, NusG, Transcription antiterminator [Transcription].
          Length = 178

 Score = 30.0 bits (68), Expect = 4.1
 Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 473 TKSNKEEKSPNLLKIIKHTTNFTRWLEKTIVETENLEE----RIAIVSRA-----IEIMM 523
           T S +E+K    L+           + + +V TE + E    R  IV R      + + M
Sbjct: 10  TYSGQEKKVKENLERKAELLGMEDLIFEVLVPTEEVVEVKGKRKVIVERKLFPGYVLVEM 69

Query: 524 VLND-----LNNFNGVLAIVSALGS 543
            + D     + N  GV   V + G+
Sbjct: 70  DMTDEAWHLVRNTPGVTGFVGSGGA 94


>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
           N-terminal.  This is the eukaryotic DNA polymerase alpha
           subunit B N-terminal domain which is involved in complex
           formation. Also see pfam04058.
          Length = 239

 Score = 30.4 bits (69), Expect = 4.2
 Identities = 25/137 (18%), Positives = 36/137 (26%), Gaps = 7/137 (5%)

Query: 788 ASSTITLRPGSSSGPEFSGEDSPGLMGST----FDGLYSPPPLPPRTNRRREVSTSDQSV 843
              T      S    E    +   L   T      G+      P    R+R  S S    
Sbjct: 65  RVRTPASIKTSKRLIEVPEAEESLLDSYTTPSDKGGMLRILSTPELPKRKRSFSASSLES 124

Query: 844 HSRDISPPPLPPRRETRPSLSSPPGYASSPDPSP--SSPGRFHVSSPSPSKPSGSTSRLP 901
            S   SP    P         +   ++S  +P     +       +P    P  S  ++ 
Sbjct: 125 PSLFFSPASFSPSAAPSTPSPNSAKFSSRSNPGEVVETLNPHLGQTPEGGGPD-SDPKVK 183

Query: 902 LLPPKPFSFGKQVPMFQ 918
           L         K   MFQ
Sbjct: 184 LSANFDAKKYKYRTMFQ 200


>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
           family.  Included in this family of heterogeneous
           ribonucleoproteins are PTB (polypyrimidine tract binding
           protein ) and hnRNP-L. These proteins contain four RNA
           recognition motifs (rrm: pfam00067).
          Length = 481

 Score = 30.6 bits (69), Expect = 4.3
 Identities = 17/93 (18%), Positives = 22/93 (23%), Gaps = 11/93 (11%)

Query: 802 PEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRP 861
           P+  G   PGL  +      +     P        S       S      PL       P
Sbjct: 191 PDLPGRRDPGLDQTHRQRQPALLGQHPS-------SYGHDGYSSHGGPLAPLAGGDRMGP 243

Query: 862 SLSSP----PGYASSPDPSPSSPGRFHVSSPSP 890
               P    P Y ++P     S        P  
Sbjct: 244 PHGPPSRYRPAYEAAPLAPAISSYGPAGGGPGS 276


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
           modification].
          Length = 2365

 Score = 30.7 bits (69), Expect = 5.2
 Identities = 15/51 (29%), Positives = 18/51 (35%), Gaps = 10/51 (19%)

Query: 850 PPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRL 900
           PPP PP          PPG+     P P  P   +V   S  + S     L
Sbjct: 9   PPPPPP----------PPGFEPPSQPPPPPPPGVNVKKRSRKQLSIVGDIL 49


>gnl|CDD|241537 cd13386, PH1_FGD2, FYVE, RhoGEF and PH domain
           containing/faciogenital dysplasia protein 2 pleckstrin
           homology (PH), N-terminal domain.  In general, FGDs have
           a RhoGEF (DH) domain, followed by an N-terminal PH
           domain, a FYVE domain and a C-terminal PH domain. All
           FGDs are guanine nucleotide exchange factors that
           activates the Rho GTPase Cdc42, an important regulator
           of membrane trafficking. The RhoGEF domain is
           responsible for GEF catalytic activity, while the
           N-terminal PH domain is involved in intracellular
           targeting of the DH domain. Not much is known about
           FGD2.  FGD1 is the best characterized member of the
           group with mutations here leading to the X-linked
           disorder known as faciogenital dysplasia (FGDY). PH
           domains have diverse functions, but in general are
           involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 107

 Score = 28.8 bits (64), Expect = 5.4
 Identities = 23/103 (22%), Positives = 52/103 (50%), Gaps = 14/103 (13%)

Query: 115 REDTLLKLNIGKKTESTERVCFLFDGLLLLCKPNSKRTSVSVTAPLGGNQGEYRLKERFF 174
           +E  + K++  ++  + E+  FLF+ +LL C P      + V A       E++++ R  
Sbjct: 2   KEGPIQKISF-RRNSTMEKYLFLFNNMLLYCVPK----VIQVGA-------EFQVRTRID 49

Query: 175 IRRIEILDLPDSDDYKNAFQIVPRCGSPVILIANSPEDKNNWM 217
           +  +++ +L D+ ++ ++F +V      + L A S E+   W+
Sbjct: 50  VAGMKVRELMDA-EFPHSF-LVSGKQRTLELQARSQEEMIAWI 90


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 30.5 bits (69), Expect = 5.5
 Identities = 18/121 (14%), Positives = 32/121 (26%), Gaps = 12/121 (9%)

Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGR 882
             P P         + +  S      +  P  P+  T+P+ + P      P   P +P  
Sbjct: 388 AAPQPS------AAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNP-- 439

Query: 883 FHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVPMFQFNAPGNDKWAMHEAKEAIEKSKKK 942
                 +  +                      P          + A    KEA   ++K+
Sbjct: 440 ----PSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQKE 495

Query: 943 I 943
           I
Sbjct: 496 I 496



 Score = 29.7 bits (67), Expect = 9.4
 Identities = 16/68 (23%), Positives = 29/68 (42%), Gaps = 7/68 (10%)

Query: 847 DISPPPLPPRRETRPSLSSP-----PGYASSPDPSPSSPGRFHVSS--PSPSKPSGSTSR 899
           D +     P++  +P  + P     P  A++  PSPS        S   S ++P+G+   
Sbjct: 367 DDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPT 426

Query: 900 LPLLPPKP 907
           + + PP  
Sbjct: 427 VSVDPPAA 434


>gnl|CDD|215430 PLN02800, PLN02800, imidazoleglycerol-phosphate dehydratase.
          Length = 261

 Score = 29.8 bits (67), Expect = 5.6
 Identities = 10/61 (16%), Positives = 18/61 (29%), Gaps = 4/61 (6%)

Query: 822 SPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPG 881
           +   L P+ +    +      V S   S P          S        ++   S ++ G
Sbjct: 9   AAQLLRPKLSFIDLLPRRAAIVSSPSSSLPRFLRME----SQLRQSISCAASSSSSNALG 64

Query: 882 R 882
           R
Sbjct: 65  R 65


>gnl|CDD|220966 pfam11072, DUF2859, Protein of unknown function (DUF2859).  This is
           a bacterial family of uncharacterized proteins.
          Length = 142

 Score = 29.2 bits (66), Expect = 5.8
 Identities = 14/59 (23%), Positives = 14/59 (23%), Gaps = 2/59 (3%)

Query: 849 SPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSP--SPSKPSGSTSRLPLLPP 905
                P      P    P     SP           V SP  SP         LP L P
Sbjct: 6   GVSAAPYYEALNPQPDEPQPSPPSPPAGALEAAMLPVRSPRLSPGVVERRALNLPGLQP 64


>gnl|CDD|225979 COG3448, COG3448, CBS-domain-containing membrane protein [Signal
           transduction mechanisms].
          Length = 382

 Score = 30.2 bits (68), Expect = 6.3
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 515 VSRAIEIMMVLNDLNNFNGVLAIVSALGSASVYRLKCTFQALPARLENALV 565
           VS AI  M  L  L+  +G +A+ + LG  +V+RL   F   P    N+L+
Sbjct: 113 VSLAIGAMFALRCLHPPSGAVALTAVLGGPAVHRLGYNFVLWPV-ALNSLI 162


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 29.9 bits (67), Expect = 6.5
 Identities = 15/55 (27%), Positives = 21/55 (38%), Gaps = 6/55 (10%)

Query: 850 PPPLPPRRETRPSLSSPPGYASSPDPSPSS------PGRFHVSSPSPSKPSGSTS 898
           PP   P     P+ S P   AS    +P+S      P     ++ +P  PS S  
Sbjct: 86  PPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVP 140


>gnl|CDD|114448 pfam05722, Ustilago_mating, Ustilago B locus mating-type protein.
           This family consists of several Ustilago mating-type
           proteins. The b locus of the phytopathogenic fungus
           Ustilago maydis encodes a multiallelic recognition
           function that controls the ability of the fungus to form
           a dikaryon and complete the sexual stage of the life
           cycle. The b locus has at least 25 alleles and any
           combination of two different alleles, brought together
           by mating between haploid cells, allows the fungus to
           cause disease and undergo sexual development within the
           plant.
          Length = 239

 Score = 29.8 bits (66), Expect = 6.7
 Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 7/92 (7%)

Query: 827 PPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSP--SSPGRFH 884
           PP+    R V+T+ +   +R  +P   P  ++ +P  S  P   S+ D S   S+P    
Sbjct: 95  PPKPGMARAVTTAAKRHPARKTAPAAKPKPKKAKPRASKTPSIDSTLDSSALESTP---E 151

Query: 885 VSSPSPSKPSGST--SRLPLLPPKPFSFGKQV 914
           +S+ S +  S S+  S   L    PF  G  +
Sbjct: 152 LSACSTADNSFSSFCSNGSLGHHDPFQDGNDI 183


>gnl|CDD|220819 pfam10599, Nup_retrotrp_bd, Retro-transposon transporting motif.
           This is the highly conserved C-terminal motif
           GRKIxxxxxRRKx of nucleoporins that plays a critical and
           unique role in the nuclear import of retro-transposons
           in both yeasts and higher organisms. It would appear
           that the arginine residues at positions 2 and 9-10
           constitute a bipartite nuclear localisation signal, with
           two basic peptide motifs separated by an interchangeable
           spacer sequence, that is crucial for the
           retro-transposon activity.
          Length = 101

 Score = 28.2 bits (62), Expect = 6.8
 Identities = 9/67 (13%), Positives = 21/67 (31%)

Query: 740 QTPLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSS 799
            +   P T+++  T F      +   + +  S S   +   + +      +  +      
Sbjct: 24  GSNNQPHTSSTPSTPFTFGASPSTPANAASPSGSNPPAFSGSPASAPNQFNVGSNNSNMF 83

Query: 800 SGPEFSG 806
           S    SG
Sbjct: 84  SSSSVSG 90


>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
           factor.  RNA polymerase I-specific
           transcription-initiation factor Rrn6 and Rrn7 represent
           components of a multisubunit transcription factor
           essential for the initiation of rDNA transcription by
           Pol I. These proteins are found in fungi.
          Length = 753

 Score = 30.1 bits (68), Expect = 7.2
 Identities = 11/64 (17%), Positives = 26/64 (40%)

Query: 698 SLKSPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAP 757
             + P I++S+         +  V     P+ +  S+   +  +   PP+++  F++   
Sbjct: 665 QSQIPTIRSSQQVSQTRKGGSSVVPSAPAPRLAQSSQPPTSQSSSDLPPSSSQAFSLSDL 724

Query: 758 IMLS 761
            M S
Sbjct: 725 PMQS 728


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 29.9 bits (67), Expect = 7.3
 Identities = 17/64 (26%), Positives = 22/64 (34%), Gaps = 5/64 (7%)

Query: 849 SPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSK---PSG--STSRLPLL 903
           SP   P  ++  P    P          P +PG      PSP+    P      +R   L
Sbjct: 383 SPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPL 442

Query: 904 PPKP 907
           PP P
Sbjct: 443 PPSP 446


>gnl|CDD|184643 PRK14354, glmU, bifunctional N-acetylglucosamine-1-phosphate
           uridyltransferase/glucosamine-1-phosphate
           acetyltransferase; Provisional.
          Length = 458

 Score = 29.8 bits (68), Expect = 7.4
 Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 10/51 (19%)

Query: 263 ANIILEEKDNHTGV-----PLIKGATLYKLVERLTYHIYADPAFVRTFLTT 308
           A   L +K+  T V     PLI   TL  L++   +H   +     T LT 
Sbjct: 85  AEEFLADKEGTTLVICGDTPLITAETLKNLID---FHE--EHKAAATILTA 130


>gnl|CDD|219741 pfam08193, INO80_Ies4, INO80 complex subunit Ies4.  The INO80
           ATPase is a member of the SNF2 family of ATPases and
           functions as an integral component of a multisubunit
           ATP-dependent chromatin remodelling complex. This family
           of proteins corresponds to the fungal Ies4 subunit of
           INO80.
          Length = 228

 Score = 29.6 bits (66), Expect = 7.8
 Identities = 22/89 (24%), Positives = 29/89 (32%), Gaps = 6/89 (6%)

Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISPPPL------PPRRETRPSLSSPPGYASSPDPS 876
            P     TN R   S   + +    +SP  L       P +E  PS S     A  P PS
Sbjct: 2   APASKATTNGRSSRSGPSKLIVVLKLSPELLRAFLAPSPVKEDEPSDSPASSAADPPVPS 61

Query: 877 PSSPGRFHVSSPSPSKPSGSTSRLPLLPP 905
                    S+P+    +  T R    P 
Sbjct: 62  SVDNASDAASTPAAGTSATDTPRRKGGPG 90


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 29.8 bits (67), Expect = 7.8
 Identities = 29/182 (15%), Positives = 51/182 (28%), Gaps = 19/182 (10%)

Query: 701 SPGIKASKNYMARNNLPALTVNKITGPKNSPLSESDGTPQTPLTPPTANSEFTVFAPIML 760
           +PG +A+    A        V    G   +P + +         PP A       AP   
Sbjct: 382 APGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAP------APPAT 435

Query: 761 SNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGL 820
           ++     + G         +  S  S        +P + SG   +               
Sbjct: 436 ADRGDDAADGDAPVPAKANARASADSRCDER-DAQPPADSGSASAPASD----------- 483

Query: 821 YSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSP 880
            +PP        R    ++       D   P    R +   + + P   A  P P+ ++P
Sbjct: 484 -APPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAP 542

Query: 881 GR 882
             
Sbjct: 543 AA 544


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 30.0 bits (67), Expect = 8.3
 Identities = 25/143 (17%), Positives = 41/143 (28%), Gaps = 22/143 (15%)

Query: 782 QSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQ 841
            + Q      +      +      G  +  +       + SP P+P           ++Q
Sbjct: 373 MNQQVGQGGQMVALGYLNIQGNQGGLGANPMQQGQPGMMSSPSPVPQ--------VQTNQ 424

Query: 842 SVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLP 901
           S+      P P  P     P    P   +    PSP +     + SPSP       S+  
Sbjct: 425 SMPQ---PPQPSVPSPGG-PGSQPPQSVSGGMIPSPPA----LMPSPSPQMSQSPASQ-- 474

Query: 902 LLPPKPFSFGKQVPMFQFNAPGN 924
               +        P+   N PG 
Sbjct: 475 -RTIQQDMVSPGGPL---NTPGQ 493


>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein.  SOG2 proteins in
           Saccharomyces cerevisiae are involved in cell separation
           and cytokinesis.
          Length = 419

 Score = 29.7 bits (67), Expect = 8.3
 Identities = 18/99 (18%), Positives = 29/99 (29%), Gaps = 8/99 (8%)

Query: 823 PPPLPPRTNRRREVSTSDQSVHSRDISP-----PPLPPRRETRPSLSSPPGYASSPDPSP 877
                P +   R    SD+    R ++P     P   PR  + P+    P         P
Sbjct: 159 AVTASPSSMIARNTPISDRL-RPRSVTPTRGRRPSSSPRSLSNPTTLESPSNLQVTTDVP 217

Query: 878 SSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVPM 916
             P   + +S S +  S +   +      P S       
Sbjct: 218 --PPYSNGTSRSSTMSSSANLSIISSLATPRSGESFRST 254


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 30.1 bits (67), Expect = 8.6
 Identities = 16/101 (15%), Positives = 29/101 (28%), Gaps = 6/101 (5%)

Query: 820 LYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPP--GYASSPDPSP 877
           L++P   P +  ++            +    P  P  +  +P     P   Y     P  
Sbjct: 744 LFTPIVEPVQQPQQPVAPQQQYQQPQQ----PVAPQPQYQQPQQPVAPQPQYQQPQQPVA 799

Query: 878 SSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVPMFQ 918
             P       P   +P     + P+ P   +   +Q    Q
Sbjct: 800 PQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ 840


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 28.8 bits (64), Expect = 9.0
 Identities = 27/97 (27%), Positives = 31/97 (31%), Gaps = 7/97 (7%)

Query: 819 GLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPS 878
            +  PPPLP  T     +           I PPP  P     P     PG  + P P P 
Sbjct: 23  CIPPPPPLPGGTG----IPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLPGATAIP-PPPP 77

Query: 879 SPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVP 915
            PG   +  P P    G     P  PP P       P
Sbjct: 78  LPGAAGIPPPPPL--PGGAGIPPPPPPLPGGAAVPPP 112



 Score = 28.8 bits (64), Expect = 9.4
 Identities = 25/67 (37%), Positives = 26/67 (38%), Gaps = 6/67 (8%)

Query: 850 PPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLL-----P 904
           PPPLP      P     PG A+ P P P  PG   V  P P  P       P       P
Sbjct: 87  PPPLPGGAGIPPPPPPLPGGAAVP-PPPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPP 145

Query: 905 PKPFSFG 911
           P PF FG
Sbjct: 146 PPPFGFG 152


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 30.0 bits (67), Expect = 9.1
 Identities = 51/203 (25%), Positives = 67/203 (33%), Gaps = 42/203 (20%)

Query: 708 KNYMARNNLPA---LTVNKITGPKNSPLSESDGTPQ-TPLTPPTANSEFTVFAPIMLSNA 763
           ++ + +N L     +  N  T         +  TPQ T L    A SE  V   IM  + 
Sbjct: 275 RSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQVTISIMTGS- 333

Query: 764 NGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSGPEFSGEDSPGLMGSTFDGLYSP 823
             S +    STA        +++  S T       +S      E +P           S 
Sbjct: 334 --SPAETKASTA-----AWKIRNPLSRTSAPAVRIASATFRGLEKNP-----------ST 375

Query: 824 PPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRPSLSSPPGYASSPDPSPSSPGRF 883
            P  P T R R V T+ Q  H   + P P                 A    PSPS     
Sbjct: 376 APSTPATPRVRAVLTT-QVHHCVVVKPAP-----------------AVPTTPSPSLTTAL 417

Query: 884 HVSSPSPSKPSGSTSRLPLLPPK 906
              +PSPS PS      P L PK
Sbjct: 418 FPEAPSPS-PSALPPGQPDLHPK 439


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 29.5 bits (66), Expect = 9.5
 Identities = 35/176 (19%), Positives = 55/176 (31%), Gaps = 44/176 (25%)

Query: 742 PLTPPTANSEFTVFAPIMLSNANGSQSPGSVSTAFSMFSTQSVQSLASSTITLRPGSSSG 801
           PL  P  N       P          S  S +T     + Q+++S  S++ T    + S 
Sbjct: 78  PLALPMPN--LAPITPPSSEPTPAPPSSESTATRTPDPNQQALESTESTSTTSADCNDSE 135

Query: 802 PEFSGEDSPGLMGSTFDGLYSPPPLPPRTNRRREVSTSDQSVHSRDISPPPLPPRRETRP 861
                               S P           +++SD S  S    P     R  +  
Sbjct: 136 QS------------------STPN----------LNSSDTSTSSSGALPSTSVVRGFSPS 167

Query: 862 SLSSPPGYASSPDPSPSSPGRFHVSSPSPSKPSGSTSRLPLLPPKPFSFGKQVPMF 917
            +SS   Y S+   +          +PSP+K +  T+       KP    K+  MF
Sbjct: 168 HISS--SYRSTAQLNK---------APSPTKSAEPTAA---PQAKPELPKKKQAMF 209


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.135    0.404 

Gapped
Lambda     K      H
   0.267   0.0705    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 50,408,231
Number of extensions: 5079593
Number of successful extensions: 6418
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5884
Number of HSP's successfully gapped: 260
Length of query: 966
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 860
Effective length of database: 6,236,078
Effective search space: 5363027080
Effective search space used: 5363027080
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.5 bits)