RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14673
         (912 letters)



>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
           SFB2/subunit SFB3 [Intracellular trafficking and
           secretion].
          Length = 861

 Score =  389 bits (1001), Expect = e-121
 Identities = 222/620 (35%), Positives = 346/620 (55%), Gaps = 17/620 (2%)

Query: 285 GDPYRRPEIQSATVEYLATADYTVRDPPQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQL 344
            D Y RPE++S  V++LA  +Y++R PP P +Y+FL+DVS  A ++G +      +L  L
Sbjct: 248 SDRYSRPELKSGVVDFLAPKEYSLRQPP-PPVYVFLIDVSFEAIKNGLVKAAIRAILENL 306

Query: 345 KSMPG-DRRTSIAIITYDSAVHFYSLAEGQTQPSQMILTDIDDIFLPSPEN-ILVNLSEC 402
             +P  D RT IAII +DS++HF+ L+        +I++D+D+ FLP P    ++ L  C
Sbjct: 307 DQIPNFDPRTKIAIICFDSSLHFFKLSPD-LDEQMLIVSDLDEPFLPFPSGLFVLPLKSC 365

Query: 403 YDMIVDLLKQLPGKFKDSLDSGSALGPALQAAFKLLQPTGGRITIFQTCLPNRGPGALQS 462
             +I  LL ++P  F+D+    +ALGPAL+AA  L+  TGG+I +F + LPN G G LQ 
Sbjct: 366 KQIIETLLDRVPRIFQDNKSPKNALGPALKAAKSLIGGTGGKIIVFLSTLPNMGIGKLQL 425

Query: 463 RENPNQRSNDDPPHMNPATDFYKKIALECTSFYIAVDLFILNSQYVDLATLSGVSKFSSG 522
           RE+           ++    FYK+ A+EC+   I+VDLF+ +  Y+D+ATLS + +++ G
Sbjct: 426 REDKESS------LLSCKDSFYKEFAIECSKVGISVDLFLTSEDYIDVATLSHLCRYTGG 479

Query: 523 CIHHIPLYSGKNMSHVYQLDRMFSRYLTRKIGFESVMRIRCTKGMSIHTFHGNFFVRSTD 582
             +  P +S    +   +L      +L+ +IG+E+VMR+RC+ G+ + +F+GNFF RS+D
Sbjct: 480 QTYFYPNFSATRPNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSD 539

Query: 583 LLALPNVNPDAGYGIQISIDENLTHIQYACFQVAVLYTSSKGDRRIRVHTLCLPVVSNIN 642
           L A   +  D    ++ SIDE L       FQVA+LYT + G+RRIRV  L LP  S+I 
Sbjct: 540 LCAFSTMPRDTSLLVEFSIDEKLMT-SDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIR 598

Query: 643 DVLVGADQACIAGLLAKMAVDRSLQHSLSDAREAFLNVVCDVLSTYKMTQAGHLTGSLLA 702
           +V   ADQ  IA +LAK A  ++L  SL +AR      + D+L  YK       T + L 
Sbjct: 599 EVYASADQLAIACILAKKASTKALNSSLKEARVLINKSMVDILKAYKKELVKSNTSTQLP 658

Query: 703 -PSSLKLLPLYVLALLKHPAFRIGQSTRLDDRLFAMTEMKCLPLNSLMLSIYPELYPIHT 761
            P++LKLLPL +LALLK  AFR G ST  D R+ A+  +  LPL  LM +IYP LY +H 
Sbjct: 659 LPANLKLLPLLMLALLKSSAFRSG-STPSDIRISALNRLTSLPLKQLMRNIYPTLYALHD 717

Query: 762 LDQDPTIEWNELQVPIPRILQLSAERLQATGTYLLNMPDLIIILVRHGTSPAICSGLFGF 821
           +        +E  + +P  +  ++  L++ G YL++    I +       P++   LFG 
Sbjct: 718 M-PIEAGLPDEGLLVLPSPINATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGV 776

Query: 822 QSVAQMPETLFELPQLETQLSLKLNSFIS--YLNDDKAYVSPVKVIKDSSPD-RMEFYDK 878
            S++ +P   F LP    + + ++ + I      +D + +  V V     P  R+ F+  
Sbjct: 777 DSLSDIPSGKFTLPPTGNEFNERVRNIIGELRSVNDDSTLPLVLVRGGGDPSLRLWFFST 836

Query: 879 LIEDKNAGGMAYYEFLLRIR 898
           L+EDK     +Y ++L  + 
Sbjct: 837 LVEDKTLNIPSYLDYLQILH 856



 Score = 30.9 bits (70), Expect = 3.4
 Identities = 12/59 (20%), Positives = 19/59 (32%), Gaps = 7/59 (11%)

Query: 102 SQQTFRMPPNTYSQ-------SSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIP 153
           SQ    + P   SQ       S K+A  +      +   +          QQQ ++ I 
Sbjct: 2   SQHKKGVYPQAQSQVHTGAASSKKSARPHRAYANFSAGQMGMPPYTTPPLQQQSRRQID 60


>gnl|CDD|238756 cd01479, Sec24-like, Sec24-like: Protein and membrane traffic in
           eukaryotes is mediated by at least in part by the
           budding and fusion of intracellular transport vesicles
           that selectively carry cargo proteins and lipids from
           donor to acceptor organelles. The two main classes of
           vesicular carriers within the endocytic and the
           biosynthetic pathways are COP- and clathrin-coated
           vesicles. Formation of COPII vesicles requires the
           ordered assembly of the coat built from several
           cytosolic components GTPase Sar1, complexes of
           Sec23-Sec24 and Sec13-Sec31. The process is initiated by
           the conversion of GDP to GTP by the GTPase Sar1 which
           then recruits the heterodimeric complex of Sec23 and
           Sec24. This heterodimeric complex generates the
           pre-budding complex. The final step leading to membrane
           deformation and budding of COPII-coated vesicles is
           carried by the heterodimeric complex Sec13-Sec31. The
           members of this CD belong to the Sec23-like family. Sec
           24 is very similar to Sec23. The Sec23 and Sec24
           polypeptides fold into five distinct domains: a
           beta-barrel, a zinc finger, a vWA or trunk, an all
           helical region and a carboxy Gelsolin domain. The
           members of this subgroup carry a partial MIDAS motif and
           have the overall Para-Rossmann type fold that is
           characteristic of this superfamily.
          Length = 244

 Score =  311 bits (800), Expect = e-100
 Identities = 120/244 (49%), Positives = 168/244 (68%), Gaps = 2/244 (0%)

Query: 312 PQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDR-RTSIAIITYDSAVHFYSLA 370
           PQPA+Y+FL+DVS  A +SG L T CE LL+ L ++PGD  RT +  IT+DS +HF++L 
Sbjct: 1   PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGDDPRTRVGFITFDSTLHFFNLK 60

Query: 371 EGQTQPSQMILTDIDDIFLPSPENILVNLSECYDMIVDLLKQLPGKFKDSLDSGSALGPA 430
               QP  M+++D+DD FLP P+ +LVNL E   +I DLL Q+P  F+D+ ++ SALGPA
Sbjct: 61  SSLEQPQMMVVSDLDDPFLPLPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPA 120

Query: 431 LQAAFKLLQPTGGRITIFQTCLPNRGPGALQSRENPNQRSNDD-PPHMNPATDFYKKIAL 489
           LQAAF LL+ TGG+I +FQ+ LP  G G L+SRE+P   S D     + P TDFYKK+AL
Sbjct: 121 LQAAFLLLKETGGKIIVFQSSLPTLGAGKLKSREDPKLLSTDKEKQLLQPQTDFYKKLAL 180

Query: 490 ECTSFYIAVDLFILNSQYVDLATLSGVSKFSSGCIHHIPLYSGKNMSHVYQLDRMFSRYL 549
           EC    I+VDLF+ ++QYVD+ATL  +S+ + G +++ P ++    + V +L    +RYL
Sbjct: 181 ECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYPSFNFSAPNDVEKLVNELARYL 240

Query: 550 TRKI 553
           TRKI
Sbjct: 241 TRKI 244


>gnl|CDD|238745 cd01468, trunk_domain, trunk domain. COPII-coated vesicles carry
           proteins from the endoplasmic reticulum to the Golgi
           complex. This vesicular transport can be reconstituted
           by using three cytosolic components containing five
           proteins: the small GTPase Sar1p, the Sec23p/24p
           complex, and the Sec13p/Sec31p complex. This domain is
           known as the trunk domain and has an alpha/beta vWA fold
           and forms the dimer interface. Some members of this
           family possess a partial MIDAS motif that is a
           characteristic feature of most vWA domain proteins.
          Length = 239

 Score =  255 bits (654), Expect = 7e-79
 Identities = 103/230 (44%), Positives = 140/230 (60%), Gaps = 5/230 (2%)

Query: 312 PQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDRRTSIAIITYDSAVHFYSLAE 371
           PQP +++F++DVS  A + G L  + E LL  L  +PGD R  + +ITYDS VHFY+L+ 
Sbjct: 1   PQPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPGDPRARVGLITYDSTVHFYNLSS 60

Query: 372 GQTQPSQMILTDIDDIFLPSPENILVNLSECYDMIVDLLKQLPGKFKD--SLDSGSALGP 429
              QP   +++D+ D+FLP P+  LV LSEC  +I DLL+QLP  F    +      LGP
Sbjct: 61  DLAQPKMYVVSDLKDVFLPLPDRFLVPLSECKKVIHDLLEQLPPMFWPVPTHRPERCLGP 120

Query: 430 ALQAAFKLLQPT--GGRITIFQTCLPNRGPGALQSRENPNQ-RSNDDPPHMNPATDFYKK 486
           ALQAAF LL+ T  GGRI +FQ  LP  GPG L+SRE+    RS+D+   + PAT FYK 
Sbjct: 121 ALQAAFLLLKGTFAGGRIIVFQGGLPTVGPGKLKSREDKEPIRSHDEAQLLKPATKFYKS 180

Query: 487 IALECTSFYIAVDLFILNSQYVDLATLSGVSKFSSGCIHHIPLYSGKNMS 536
           +A EC    I VDLF  +  YVD+ATL  ++K + G ++    +   N  
Sbjct: 181 LAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYDSFQAPNDG 230


>gnl|CDD|147125 pfam04811, Sec23_trunk, Sec23/Sec24 trunk domain.  COPII-coated
           vesicles carry proteins from the endoplasmic reticulum
           to the Golgi complex. This vesicular transport can be
           reconstituted by using three cytosolic components
           containing five proteins: the small GTPase Sar1p, the
           Sec23p/24p complex, and the Sec13p/Sec31p complex. This
           domain is known as the trunk domain and has an
           alpha/beta vWA fold and forms the dimer interface.
          Length = 241

 Score =  238 bits (609), Expect = 2e-72
 Identities = 94/243 (38%), Positives = 140/243 (57%), Gaps = 7/243 (2%)

Query: 312 PQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDRRTSIAIITYDSAVHFYSLAE 371
           PQP ++LF++DVS  A +SG L  + E LL  L  +PGD R  +  IT+DS VHF++L+ 
Sbjct: 1   PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGDPRALVGFITFDSTVHFFNLSS 60

Query: 372 GQTQPSQMILTDIDDIFLPSPENILVNLSECYDMIVDLLKQLPGKFKDSLDSGSALGPAL 431
              QP  ++++D+ D+FLP P+  LV LSEC  ++ DLL++LP  F  +      LGPAL
Sbjct: 61  SLRQPKMLVVSDLQDMFLPLPDRFLVPLSECRFVLEDLLEELPRMFPVTKRPERCLGPAL 120

Query: 432 QAAFKLLQ--PTGGRITIFQTCLPNRGP-GALQSRENPNQR--SNDDPPHMNPATDFYKK 486
           QAA  LL+   TGG+I +FQ  LP  GP G L+SR + +      +    +  A  FYK 
Sbjct: 121 QAAVLLLKAAFTGGKIMLFQGGLPTVGPGGKLKSRLDESHHDTDKEKAKLVKKADKFYKS 180

Query: 487 IALECTSFYIAVDLFILNSQYVDLATLSGVSKFSSGCIHHIPLYSGKNMSHVYQLDRMFS 546
           +A EC +   +VDLF  +  YVD+A L  +S+ + G ++  P +     +  ++ D    
Sbjct: 181 LAKECVAQGHSVDLFAFSLDYVDVAELGCLSRLTGGQVYLYPSFQADVDTSKFKQD--LQ 238

Query: 547 RYL 549
           RY 
Sbjct: 239 RYF 241


>gnl|CDD|219707 pfam08033, Sec23_BS, Sec23/Sec24 beta-sandwich domain. 
          Length = 86

 Score =  117 bits (295), Expect = 2e-31
 Identities = 41/86 (47%), Positives = 55/86 (63%), Gaps = 1/86 (1%)

Query: 554 GFESVMRIRCTKGMSIHTFHGNFFVRS-TDLLALPNVNPDAGYGIQISIDENLTHIQYAC 612
           GF +V+R+R +KG+ +  F GNFF RS  D   LP+++PD  Y  +  IDE L   + A 
Sbjct: 1   GFNAVLRVRTSKGLKVSGFIGNFFSRSSGDTWKLPSLDPDTSYAFEFDIDEPLDSGKQAY 60

Query: 613 FQVAVLYTSSKGDRRIRVHTLCLPVV 638
            Q A+LYT S G+RRIRV T+ LPV 
Sbjct: 61  IQFALLYTHSSGERRIRVTTVALPVT 86


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score =  132 bits (332), Expect = 3e-31
 Identities = 126/512 (24%), Positives = 247/512 (48%), Gaps = 41/512 (8%)

Query: 314  PAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDRRTSIAIITYDSAVHFYSLAEGQ 373
            P  ++F+++ S  A  +   YT+ E +   ++++   + T IAIIT++S+++FY    G+
Sbjct: 952  PPYFVFVVECSYNAIYNNITYTILEGIRYAVQNVKCPQ-TKIAIITFNSSIYFYHCKGGK 1010

Query: 374  TQPSQ-------------MILTDIDDIFLPSP-ENILVNLSECYDMIVDLLKQLPGKFKD 419
                +             ++++D+DD FLP P E++     E  D I  L+  +      
Sbjct: 1011 GVSGEEGDGGGGSGNHQVIVMSDVDDPFLPLPLEDLFFGCVEEIDKINTLIDTIKSVSTT 1070

Query: 420  SLDSGSALGPALQAAFKLLQPTGG--RITIFQTCLPNRGPGALQSRENPNQRSNDDPPHM 477
                GS    AL+ A  +L+   G   I +F T  PN G GA++  +   Q +  +    
Sbjct: 1071 MQSYGSCGNSALKIAMDMLKERNGLGSICMFYTTTPNCGIGAIKELKKDLQENFLEVKQ- 1129

Query: 478  NPATDFYKKIALECTSFYIAVDLFILNSQYVDLA--TLSGVSKFSSGCIHHIPLYS-GKN 534
                 FY  + L+  +F I+VD+FI++S  V +   +L  V++ + G I  +  +   K+
Sbjct: 1130 ---KIFYDSLLLDLYAFNISVDIFIISSNNVRVCVPSLQYVAQNTGGKILFVENFLWQKD 1186

Query: 535  MSHVYQLDRMFSRYLTRKIGFESVMRIRCTKGMSIHTF---HGNF-FVRSTDLLALPNVN 590
               +Y    +     +  I +   +++R +  MS+      + NF  + S D + +P + 
Sbjct: 1187 YKEIYM--NIMDTLTSEDIAYCCELKLRYSHHMSVKKLFCCNNNFNSIISVDTIKIPKIR 1244

Query: 591  PDAGYGIQISIDENLTHIQYACFQVAVLYTSSKGDRRIRVHTLCLPVVSNINDVLVGADQ 650
             D  +   ++  +     +   FQ A +YT+  GDR +R+HT  + + S+++ V    D 
Sbjct: 1245 HDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDRFVRLHTTHMNLTSSLSTVFRYTDA 1304

Query: 651  ACIAGLLAKMAVDRSLQHSLSDAREAFLNVVCDVLSTYKMTQAGHL-TGSLLAPSSLKLL 709
              +  +L K      L +   +  +  ++ +  +L +Y++  A    +G L+ P +LKLL
Sbjct: 1305 EALMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRINCASSAHSGQLILPDTLKLL 1362

Query: 710  PLYVLALLKHPAFRIGQSTRLDDRLFAMTEMKCLPLNSLMLSIYPELYPIHTLDQDPTIE 769
            PL+  +LLKH   +  +    D +++++ ++  +P+ S +L +YP +Y IH   +  T E
Sbjct: 1363 PLFTSSLLKHNVTK--KEILHDLKVYSLIKLLSMPIISSLLYVYPVMYVIHI--KGKTNE 1418

Query: 770  WNELQVP----IPRILQLSAERLQATGTYLLN 797
             + + V     IP+ +  SAE++ + G YLL+
Sbjct: 1419 IDSMDVDDDLFIPKTIPSSAEKIYSNGIYLLD 1450


>gnl|CDD|218277 pfam04815, Sec23_helical, Sec23/Sec24 helical domain.  COPII-coated
           vesicles carry proteins from the endoplasmic reticulum
           to the Golgi complex. This vesicular transport can be
           reconstituted by using three cytosolic components
           containing five proteins: the small GTPase Sar1p, the
           Sec23p/24p complex, and the Sec13p/Sec31p complex. This
           domain is composed of five alpha helices.
          Length = 103

 Score = 99.1 bits (248), Expect = 8e-25
 Identities = 47/104 (45%), Positives = 64/104 (61%), Gaps = 2/104 (1%)

Query: 650 QACIAGLLAKMAVDRSLQHSLSDAREAFLNVVCDVLSTY-KMTQAGHLTGSLLAPSSLKL 708
           Q  IA LLAK AV+++L  SL DAR      + D+L+ Y K   +   +G L+ P SLKL
Sbjct: 1   QEAIAVLLAKKAVEKALTSSLKDARRWLDKKLVDILAAYRKYCASSSSSGQLILPESLKL 60

Query: 709 LPLYVLALLKHPAFRIGQSTRLDDRLFAMTEMKCLPLNSLMLSI 752
           LPLY+LALLK PA R G +   D+R +A   +  LP+ +L+L I
Sbjct: 61  LPLYMLALLKSPALR-GGNVSPDERAYARCLLLSLPVENLLLMI 103


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 63.6 bits (155), Expect = 3e-10
 Identities = 40/202 (19%), Positives = 50/202 (24%), Gaps = 7/202 (3%)

Query: 28  SNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPV 87
             P  P     P +  Q  + +     Q          P                     
Sbjct: 135 QPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQ-----PPQQVLPQGMPPRQA 189

Query: 88  PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQ--QQQQQQ 145
               Q P +Q   + Q     P     Q    AP      P     L QQ    QQ Q  
Sbjct: 190 AFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFP 249

Query: 146 QQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQ 205
              QQ  P  P  P  QQ     Q      N     P    G    +P           Q
Sbjct: 250 GLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQ 309

Query: 206 QQQQQQQQQQQQQQTLNSQFSN 227
           Q Q QQ+  Q ++Q +      
Sbjct: 310 QPQGQQRGPQFREQLVQLSQQQ 331



 Score = 62.5 bits (152), Expect = 8e-10
 Identities = 40/197 (20%), Positives = 53/197 (26%), Gaps = 21/197 (10%)

Query: 28  SNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPV 87
                P    PP   L   +P   ++            P       Q P           
Sbjct: 165 QRQQAPQLPQPPQQVLPQGMPPRQAAFPQ------QGPPEQPPGYPQPPQ---GHPEQVQ 215

Query: 88  PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
           PQQF     Q    +     +PP       +  P      P        QQ      Q  
Sbjct: 216 PQQFLPAPSQ----APAQPPLPPQ---LPQQPPPLQQPQFPGL-----SQQMPPPPPQPP 263

Query: 148 QQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQ 207
           QQQ  P  P      QN     P      N  +PP  QP   P V           +++Q
Sbjct: 264 QQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQ 323

Query: 208 QQQQQQQQQQQQTLNSQ 224
             Q  QQQ++  +    
Sbjct: 324 LVQLSQQQREALSQEEA 340



 Score = 47.1 bits (112), Expect = 4e-05
 Identities = 33/236 (13%), Positives = 45/236 (19%), Gaps = 59/236 (25%)

Query: 71  NRFQAPGATINSLSTPVPQQFQRPQQQNA----------------NHSQQTFRMPPNTYS 114
                   +  S  T      Q   + +                    +   + P    S
Sbjct: 83  GAPSVGPDSDLSQKTSTFSPCQSGYEASTDPEYIPDLQPDPSLWGTAPKPEPQPPQAPES 142

Query: 115 QSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPI---------PNIPPLPNSQQNS 165
           Q     P       L+   +  Q QQ+QQ  Q  Q P                       
Sbjct: 143 QPQPQTP---AQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQ 199

Query: 166 SNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPS-------LNQAGYQQQQQQQQQQQQQQ 218
               P     +  QV P+         PA  P               Q     QQ     
Sbjct: 200 PPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPP 259

Query: 219 QTLNSQFSNLSVNQGMNKMWGYEPHDLLNTRDILPATKVEPPPIRLPQNLPEIHEQ 274
                Q       Q                          PPP   P   P + + 
Sbjct: 260 PQPPQQQQQPPQPQAQ------------------------PPPQNQPTPHPGLPQG 291



 Score = 43.6 bits (103), Expect = 5e-04
 Identities = 27/149 (18%), Positives = 41/149 (27%), Gaps = 14/149 (9%)

Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAG 195
               Q   +     + IP++ P P+    +   +P        Q P        P+    
Sbjct: 100 FSPCQSGYEASTDPEYIPDLQPDPSLWGTAPKPEPQPP-----QAPESQPQPQTPAQKML 154

Query: 196 YPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVN---------QGMNKMWGYEPHDLL 246
                +A  QQ+QQ  Q  Q  QQ L                               + +
Sbjct: 155 SLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQV 214

Query: 247 NTRDILPATKVEPPPIRLPQNLPEIHEQF 275
             +  LPA    P    LP  LP+     
Sbjct: 215 QPQQFLPAPSQAPAQPPLPPQLPQQPPPL 243



 Score = 37.1 bits (86), Expect = 0.055
 Identities = 27/130 (20%), Positives = 35/130 (26%), Gaps = 4/130 (3%)

Query: 21  MQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATI 80
              P         + A PP+    PQ P      Q P  S  M  PP    + Q      
Sbjct: 213 QVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQ--PP 270

Query: 81  NSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQ--QQ 138
              + P PQ    P              P           P      P     L Q  QQ
Sbjct: 271 QPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQ 330

Query: 139 QQQQQQQQQQ 148
           Q++   Q++ 
Sbjct: 331 QREALSQEEA 340



 Score = 32.8 bits (75), Expect = 1.1
 Identities = 22/130 (16%), Positives = 32/130 (24%), Gaps = 20/130 (15%)

Query: 22  QVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATIN 81
           Q P     P Q     PP+        +S      P       + P        P     
Sbjct: 225 QAPAQPPLPPQLPQQPPPL-QQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQP- 282

Query: 82  SLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQ 141
           +    +PQ    P               P    Q  +  P              Q ++Q 
Sbjct: 283 TPHPGLPQGQNAPLPPPQQPQLLPLVQQP----QGQQRGP--------------QFREQL 324

Query: 142 QQQQQQQQQP 151
            Q  QQQ++ 
Sbjct: 325 VQLSQQQREA 334


>gnl|CDD|215083 PLN00162, PLN00162, transport protein sec23; Provisional.
          Length = 761

 Score = 54.9 bits (133), Expect = 1e-07
 Identities = 54/250 (21%), Positives = 86/250 (34%), Gaps = 49/250 (19%)

Query: 294 QSATVEYLATADYTVRDPPQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDRRT 353
           Q  TVEY  T        P P +++F++D   I  + G L +     +  L   P +   
Sbjct: 106 QYTTVEY--TLPPGSGGAPSPPVFVFVVDTCMIEEELGALKSALLQAIALL---PENAL- 159

Query: 354 SIAIITYDSAVHFYSLAEG-----------------------------QTQPSQMILTDI 384
            + +IT+ + VH + L                                +      I    
Sbjct: 160 -VGLITFGTHVHVHELGFSECSKSYVFRGNKEVSKDQILEQLGLGGKKRRPAGGGIAGAR 218

Query: 385 DDIFLPSPENILVNLSEC---YDMIVDLLKQLPGKFKDSLDSGSALGPALQAAFKLLQP- 440
           D +        L+  SEC    +  ++ L++ P             G AL  A  LL   
Sbjct: 219 DGLSSSGVNRFLLPASECEFTLNSALEELQKDPWPVPPGHRPARCTGAALSVAAGLLGAC 278

Query: 441 ---TGGRITIFQTCLPNRGPGALQSRENPNQ-RS-----NDDPPHMNPATDFYKKIALEC 491
              TG RI  F       GPGA+ S++     RS      D  P+   A  FY+ +A + 
Sbjct: 279 VPGTGARIMAFVGGPCTEGPGAIVSKDLSEPIRSHKDLDKDAAPYYKKAVKFYEGLAKQL 338

Query: 492 TSFYIAVDLF 501
            +    +D+F
Sbjct: 339 VAQGHVLDVF 348



 Score = 32.6 bits (75), Expect = 1.1
 Identities = 16/54 (29%), Positives = 27/54 (50%), Gaps = 2/54 (3%)

Query: 613 FQVAVLYTSSKGDRRIRVHTLCLPVV--SNINDVLVGADQACIAGLLAKMAVDR 664
            Q    Y  S G  R+RV T+    V  S+  +++ G DQ   A ++A++A  +
Sbjct: 480 LQFLTRYQHSNGQTRLRVTTVTRRWVEGSSSEELVAGFDQEAAAVVMARLASHK 533


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 52.3 bits (125), Expect = 9e-07
 Identities = 41/201 (20%), Positives = 55/201 (27%), Gaps = 21/201 (10%)

Query: 52  SSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQ-------------QQN 98
           ++ Q  L     M  P+   +      T      P+     RP               Q+
Sbjct: 57  AAQQQVLQGGQGMPDPINALQNLTGQGTRGPQMGPMGPGPGRPMGQQMGGPGTASNLLQS 116

Query: 99  ANHSQQ----TFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPN 154
            N   Q       M P+  S+     P       +  +S   Q QQ  Q   QQ Q    
Sbjct: 117 LNVRGQMPMGAAGMGPHQMSRVGTMQPGGQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQ 176

Query: 155 IPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQ 214
              +   QQ     Q    +      P     G    +        Q     Q QQQ Q 
Sbjct: 177 AGGMNQGQQGPVGQQQPPQMG----QPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQN 232

Query: 215 QQQQQTLNSQFSNLSVNQGMN 235
           QQQQQ    Q    +  Q   
Sbjct: 233 QQQQQMDQQQGPADAQAQMGQ 253



 Score = 51.9 bits (124), Expect = 1e-06
 Identities = 56/258 (21%), Positives = 67/258 (25%), Gaps = 45/258 (17%)

Query: 21  MQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAP---- 76
           M +      P Q S       G Q    +  SS Q      N M P     + QA     
Sbjct: 123 MPMGAAGMGPHQMSRVGTMQPGGQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQ 182

Query: 77  GATINSLSTPVPQQFQ--RPQQQNANHSQQTFRMPPNTYSQSSKTAPFNN---------G 125
           G          PQ  Q   P        QQ  +                N          
Sbjct: 183 GQQGPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQ 242

Query: 126 LTPLTNASLYQQQQQQQQQQQQQQQPIPN------IPPLPNSQQNSSNFQPS-------- 171
                 A + QQQQ Q   Q QQ Q           PP    QQ+     P+        
Sbjct: 243 GPADAQAQMGQQQQGQGGMQPQQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGG 302

Query: 172 --NNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQ--------------QQQQQQQQQQ 215
                   +  PP+          A    +  AG Q              Q QQQQQQQ 
Sbjct: 303 GQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQQQQG 362

Query: 216 QQQQTLNSQFSNLSVNQG 233
                 + Q  N  V QG
Sbjct: 363 GNHPAAHQQQMNQQVGQG 380



 Score = 49.2 bits (117), Expect = 8e-06
 Identities = 51/264 (19%), Positives = 66/264 (25%), Gaps = 36/264 (13%)

Query: 15  GSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQ 74
           G A  +MQ         QP+   P     Q Q    N   Q P+G     +         
Sbjct: 145 GQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPGMPGG 204

Query: 75  APGATINSLSTPVPQQFQRPQQQNANHSQQTFRM-----PPNTYSQSSKTAPFNNGLTPL 129
                +     P  QQ Q PQ Q    +QQ  +M     P +  +Q  +      G+ P 
Sbjct: 205 GGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQ 264

Query: 130 TNASLYQQQQQQQQQQQQQQQPIP---------------------NIPPLPNSQQNSSNF 168
                  Q   QQQ  QQQ Q                         + P P         
Sbjct: 265 QMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQG 324

Query: 169 QPS----------NNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQ 218
             +            L        R Q            +   A  QQ  QQ  Q  Q  
Sbjct: 325 GQAVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGGQMV 384

Query: 219 QTLNSQFSNLSVNQGMNKMWGYEP 242
                         G N M   +P
Sbjct: 385 ALGYLNIQGNQGGLGANPMQQGQP 408



 Score = 48.5 bits (115), Expect = 2e-05
 Identities = 57/304 (18%), Positives = 78/304 (25%), Gaps = 5/304 (1%)

Query: 30  PSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQ 89
             Q  G     S L   + V     Q P+G++ M    +       PG     +      
Sbjct: 100 MGQQMGGPGTASNLLQSLNVRG---QMPMGAAGMGPHQMSRVGTMQPGGQAGGMMQQSSG 156

Query: 90  QFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQ 149
           Q  + QQ N    QQ             +  P      P             Q Q QQQ 
Sbjct: 157 Q-PQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPG-MPGGGGQGQMQQQG 214

Query: 150 QPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQ 209
           QP       P  QQ   N Q             +AQ G       G       G Q Q  
Sbjct: 215 QPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQGGQMQVP 274

Query: 210 QQQQQQQQQQTLNSQFSNLSVNQGMNKMWGYEPHDLLNTRDILPATKVEPPPIRLPQNLP 269
            QQQ  QQQ   +      +  Q M       P   +      P    +         + 
Sbjct: 275 MQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMS 334

Query: 270 EIHEQFLYDRNTEMFGDPYRRPEIQSATVEYLATADYTVRDPPQPAMYLFLLDVSQIAAQ 329
              +Q    +   M G    + + Q     + A     +         +  L    I   
Sbjct: 335 AGQQQLKQMKLRNMRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGGQMVALGYLNIQGN 394

Query: 330 SGYL 333
            G L
Sbjct: 395 QGGL 398



 Score = 43.5 bits (102), Expect = 6e-04
 Identities = 42/232 (18%), Positives = 60/232 (25%), Gaps = 12/232 (5%)

Query: 2   QPLVSNKQGPNSYGSASNVMQVPLIASN-PSQPSGAAPPISGLQPQVPVSNSSLQTPLGS 60
           QP    +Q P       N  Q  +     P+            Q  +          +  
Sbjct: 215 QPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQ-GGQMQV 273

Query: 61  SNMMRPPLVTNRFQAPGATINSLSTPVPQQFQR-PQQQNANHSQQTFRMPPNTYS--QSS 117
               +PP    +    G   N +   +P   Q  P Q      Q+   +P    +  Q  
Sbjct: 274 PMQQQPPQQQPQQSQLGMLPNQMQQ-MPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGV 332

Query: 118 KTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPI-----PNIPPLPNSQQNSSNFQPSN 172
            +A               QQQ QQQQQQQ    P       N       Q  +  +    
Sbjct: 333 MSAGQQQLKQMKLRNMRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGGQMVALGYLNIQ 392

Query: 173 NLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
                +   P  Q G    + +  P       Q   Q  Q          SQ
Sbjct: 393 GNQGGLGANP-MQQGQPGMMSSPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQ 443



 Score = 38.8 bits (90), Expect = 0.014
 Identities = 41/229 (17%), Positives = 58/229 (25%), Gaps = 9/229 (3%)

Query: 1   MQPLVSNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQP--QVPVSNSSLQTPL 58
           M      + G          MQVP+    P Q    +          Q+P          
Sbjct: 251 MGQQQQGQGGMQPQQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQP 310

Query: 59  GSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRP-----QQQNANHSQQTFRMPPNTY 113
                 RP  V    QA    + S      +Q +       QQ      QQ    P    
Sbjct: 311 MGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQQQQGGNHPAAHQ 370

Query: 114 SQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNN 173
            Q ++       +  L   ++   Q        QQ QP     P P  Q  ++   P   
Sbjct: 371 QQMNQQVGQGGQMVALGYLNIQGNQGGLGANPMQQGQPGMMSSPSPVPQVQTNQSMPQPP 430

Query: 174 LYNNIQ-VPPRAQPGNY-PSVPAGYPSLNQAGYQQQQQQQQQQQQQQQT 220
             +      P +QP           P         Q  Q    Q+  Q 
Sbjct: 431 QPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPASQRTIQQ 479


>gnl|CDD|227380 COG5047, SEC23, Vesicle coat complex COPII, subunit SEC23
           [Intracellular trafficking and secretion].
          Length = 755

 Score = 51.4 bits (123), Expect = 2e-06
 Identities = 81/443 (18%), Positives = 158/443 (35%), Gaps = 88/443 (19%)

Query: 291 PEIQSATVEYLATADYTVRDPPQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGD 350
              QS+T+EY        +    P ++ F++D      +   L    + L+  L  +P +
Sbjct: 104 LLPQSSTIEYT-----LSKPVILPPVFFFVVDACCDEEELTALK---DSLIVSLSLLPPE 155

Query: 351 RRTSIAIITYDSAVHFYSLA----------EGQTQPSQMILTD----------------I 384
               + +ITY +++  + L            G  + ++  L +                I
Sbjct: 156 --ALVGLITYGTSIQVHELNAENHRRSYVFSGNKEYTKENLQELLALSKPTKSGGFESKI 213

Query: 385 DDIFLPSPENILVNLSECYDMIVDLLKQL-------PGKFKDSLDSGSALGPALQAAFKL 437
             I   +    L+   +C   ++++L+QL       P   +    +GSAL  A     + 
Sbjct: 214 SGIGQFASSRFLLPTQQCEFKLLNILEQLQPDPWPVPAGKRPLRCTGSALNIASSLLEQC 273

Query: 438 LQPTGGRITIFQTCLPNRGPGALQSRE------NPNQRSNDDPPHMNPATDFYKKIALEC 491
               G  I +F       GPG + S E      + +   +D   H   AT FYK +A   
Sbjct: 274 FPNAGCHIVLFAGGPCTVGPGTVVSTELKEPMRSHHDIESDSAQHSKKATKFYKGLAERV 333

Query: 492 TSFYIAVDLFILNSQYVDLATLSGVSKFSSGCIHHIPLYSGKNMSHVYQLDRM---FSRY 548
            +   A+D+F      + +  +  ++  + G +          +S  +        F R 
Sbjct: 334 ANQGHALDIFAGCLDQIGIMEMEPLTTSTGGAL---------VLSDSFTTSIFKQSFQRI 384

Query: 549 LTR------KIGFESVMRIRCTKGMSIHTFHGN---------------FFVRSTDLLALP 587
             R      K+GF + M ++ +K + I    G+                 + +T+   + 
Sbjct: 385 FNRDSEGYLKMGFNANMEVKTSKNLKIKGLIGHAVSVKKKANNISDSEIGIGATNSWKMA 444

Query: 588 NVNPDAGYGI--QISIDENLTHIQYACF---QVAVLYTSSKGDRRIRVHTLCLPVVSNIN 642
           +++P + Y +  +I++       Q       Q    Y  S G  RIRV T+         
Sbjct: 445 SLSPKSNYALYFEIALGAASGSAQRPAEAYIQFITTYQHSSGTYRIRVTTVARMFTDGGL 504

Query: 643 DVLVGA-DQACIAGLLAKMAVDR 664
             +  + DQ   A  +A++A  +
Sbjct: 505 PKINRSFDQEAAAVFMARIAAFK 527


>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal.  Mediator is a
           large complex of up to 33 proteins that is conserved
           from plants to fungi to humans - the number and
           representation of individual subunits varying with
           species. It is arranged into four different sections, a
           core, a head, a tail and a kinase-activity part, and the
           number of subunits within each of these is what varies
           with species. Overall, Mediator regulates the
           transcriptional activity of RNA polymerase II but it
           would appear that each of the four different sections
           has a slightly different function. Mediator subunit
           Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
           transcriptional co-repressor.
          Length = 381

 Score = 47.7 bits (113), Expect = 2e-05
 Identities = 52/257 (20%), Positives = 83/257 (32%), Gaps = 36/257 (14%)

Query: 6   SNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQV-----PVSNSSLQTPLGS 60
           SN     +  S +N       A+   + + A+ P +            +S ++  TP  +
Sbjct: 135 SNGSDAATTSSTANTPA----AAKVLKANAASAPNTTTGVGSAATTAAISATTATTPTTT 190

Query: 61  SNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQS---- 116
               R P  T +     A     S     Q     Q  +    Q   M     + +    
Sbjct: 191 QKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSMLAQIPNPTPLMQ 250

Query: 117 --SKTAPFNNGLTPLTNASLYQQ-QQQQQQQQQQQQQPIPNIPPLPNSQQNSS---NFQP 170
             +  +P N   +PL N S  +   Q   Q    Q  P  N   + N  + +S      P
Sbjct: 251 LLNGVSPNNAMASPLNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQSRENSMNQGMTP 310

Query: 171 SNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSV 230
           S ++ N   + P                LN +        Q Q  QQ Q  N Q    ++
Sbjct: 311 SASMINLNNITPAN-------------ILNMSLNLAFDLNQNQTPQQLQPQNQQ----NM 353

Query: 231 NQGMNKMWGYEPHDLLN 247
           N GMN +  ++  DL N
Sbjct: 354 NMGMNDVNNFDLLDLNN 370


>gnl|CDD|151935 pfam11498, Activator_LAG-3, Transcriptional activator LAG-3.  The
           C.elegans Notch pathway, involved in the control of
           growth, differentiation and patterning in animal
           development, relies on either of the receptors GLP-1 or
           LIN-12. Both these receptors promote signalling by the
           recruitment of LAG-3 to target promoters, where it then
           acts as a transcriptional activator. LAG-3 works as a
           ternary complex together with the DNA binding protein,
           LAG-1.
          Length = 476

 Score = 45.7 bits (107), Expect = 9e-05
 Identities = 36/143 (25%), Positives = 51/143 (35%), Gaps = 7/143 (4%)

Query: 84  STPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTN-----ASLYQQQ 138
             P  +  + P +   +   Q+   P       ++ A   N +  L         + QQ+
Sbjct: 290 MEPEGETKKSPMEAGGDRMPQSAPPPAMNPQHIAQLAQQQNKMRLLQQQEMEMQRIEQQR 349

Query: 139 QQQ--QQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGY 196
           QQQ   Q QQQQQQ       L   QQ     Q  + +    Q   +A           +
Sbjct: 350 QQQIMHQHQQQQQQEHQQQQMLLQQQQQMHQLQQHHQMNGGGQFATQAHQHAAYLQQMQH 409

Query: 197 PSLNQAGYQQQQQQQQQQQQQQQ 219
             L +    QQQQ Q  QQ QQQ
Sbjct: 410 MRLQEQIQHQQQQAQHHQQAQQQ 432



 Score = 31.1 bits (69), Expect = 3.5
 Identities = 38/190 (20%), Positives = 58/190 (30%), Gaps = 14/190 (7%)

Query: 10  GPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLV 69
           GP++Y    +     L A  P   +  +P  +G       +      P   + + +    
Sbjct: 276 GPDTY----DTFLAELDAMEPEGETKKSPMEAGGDRMPQSAPPPAMNPQHIAQLAQQQNK 331

Query: 70  TNRFQAPGATINSLSTPVPQQF--QRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNG-- 125
               Q     +  +     QQ   Q  QQQ   H QQ   +         +     NG  
Sbjct: 332 MRLLQQQEMEMQRIEQQRQQQIMHQHQQQQQQEHQQQQMLLQQQQQMHQLQQHHQMNGGG 391

Query: 126 --LTPLTNASLYQQQQQ----QQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQ 179
              T     + Y QQ Q    Q+Q Q QQQQ   +       QQ + + Q    + N   
Sbjct: 392 QFATQAHQHAAYLQQMQHMRLQEQIQHQQQQAQHHQQAQQQHQQPAQHGQMGYGIPNGYP 451

Query: 180 VPPRAQPGNY 189
                    Y
Sbjct: 452 AHMHGHAPAY 461



 Score = 29.9 bits (66), Expect = 6.7
 Identities = 17/39 (43%), Positives = 21/39 (53%)

Query: 201 QAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWG 239
           Q    Q QQQQQQ+ QQQQ L  Q   +   Q  ++M G
Sbjct: 351 QQIMHQHQQQQQQEHQQQQMLLQQQQQMHQLQQHHQMNG 389


>gnl|CDD|238755 cd01478, Sec23-like, Sec23-like: Protein and membrane traffic in
           eukaryotes is mediated by at least in part by the
           budding and fusion of intracellular transport vesicles
           that selectively carry cargo proteins and lipids from
           donor to acceptor organelles. The two main classes of
           vesicular carriers within the endocytic and the
           biosynthetic pathways are COP- and clathrin-coated
           vesicles. Formation of COPII vesicles requires the
           ordered assembly of the coat built from several
           cytosolic components GTPase Sar1, complexes of
           Sec23-Sec24 and Sec13-Sec31. The process is initiated by
           the conversion of GDP to GTP by the GTPase Sar1 which
           then recruits the heterodimeric complex of Sec23 and
           Sec24. This heterodimeric complex generates the
           pre-budding complex. The final step leading to membrane
           deformation and budding of COPII-coated vesicles is
           carried by the heterodimeric complex Sec13-Sec31. The
           members of this CD belong to the Sec23-like family. Sec
           23 is very similar to Sec24. The Sec23 and Sec24
           polypeptides fold into five distinct domains: a
           beta-barrel, a zinc finger, a vWA or trunk, an all
           helical region and a carboxy Gelsolin domain. The
           members of this subgroup lack the consensus MIDAS motif
           but have the overall Para-Rossmann type fold that is
           characteristic of this superfamily.
          Length = 267

 Score = 45.1 bits (107), Expect = 9e-05
 Identities = 55/236 (23%), Positives = 89/236 (37%), Gaps = 55/236 (23%)

Query: 312 PQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDRRTSIAIITYDSAVHFYSLA- 370
             P ++LF++D      +   L    E L+  L  +P +    + +IT+ + V  + L  
Sbjct: 1   TSPPVFLFVVDTCMDEEELDALK---ESLIMSLSLLPPN--ALVGLITFGTMVQVHELGF 55

Query: 371 ---------EGQTQPSQMILTDIDDIFLPS-------------------PENILVNLSEC 402
                     G    +   + D+  +  P+                       L+ +S+C
Sbjct: 56  EECSKSYVFRGNKDYTAKQIQDMLGLGGPAMRPSASQHPGAGNPLPSAAASRFLLPVSQC 115

Query: 403 YDMIVDLLKQL-------PGKFKDSLDSGSALGPA---LQAAFKLLQPTGGRITIFQTCL 452
              + DLL+QL       P   +    +G AL  A   L+A F     TG RI +F    
Sbjct: 116 EFTLTDLLEQLQPDPWPVPAGHRPLRCTGVALSIAVGLLEACFP---NTGARIMLFAGGP 172

Query: 453 PNRGPGALQSRE--NPNQRS-----NDDPPHMNPATDFYKKIALECTSFYIAVDLF 501
              GPGA+ S E  +P  RS      D+  +   A  FY  +A    +   AVD+F
Sbjct: 173 CTVGPGAVVSTELKDP-IRSHHDIDKDNAKYYKKAVKFYDSLAKRLAANGHAVDIF 227


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 45.1 bits (107), Expect = 1e-04
 Identities = 25/147 (17%), Positives = 37/147 (25%), Gaps = 8/147 (5%)

Query: 81  NSLSTPVPQQFQRPQQQNANHSQQTFRMP-PNTYSQSSKTAPFNNGLTPLTNASLYQQQQ 139
                PV Q  Q   Q     + Q         ++  + +   N   T    A       
Sbjct: 53  PQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPA-----GP 107

Query: 140 QQQQQQQQQQQPIPNIPPLPNSQQ--NSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYP 197
                Q +  Q  P   P+  +Q   NS   QP+              P +         
Sbjct: 108 AGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQS 167

Query: 198 SLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
           +      Q QQQ   +   QQ      
Sbjct: 168 AQKNDESQLQQQPNGETPPQQTDGAGD 194



 Score = 44.7 bits (106), Expect = 1e-04
 Identities = 23/143 (16%), Positives = 36/143 (25%), Gaps = 14/143 (9%)

Query: 93  RPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPI 152
            P  Q      Q  +  P      +  A       P  +   +                +
Sbjct: 50  DPSPQAPPPVAQLPQPLPQPPPTQALQAL------PAGDQQQHNTPTGSPAANPPATFAL 103

Query: 153 PNIPPLPNSQQNSSNF--------QPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGY 204
           P  P  P  Q                 N   + +  P + +         G P+  Q   
Sbjct: 104 PAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPS 163

Query: 205 QQQQQQQQQQQQQQQTLNSQFSN 227
           QQQ  Q+  + Q QQ  N +   
Sbjct: 164 QQQSAQKNDESQLQQQPNGETPP 186



 Score = 44.0 bits (104), Expect = 2e-04
 Identities = 36/208 (17%), Positives = 52/208 (25%), Gaps = 34/208 (16%)

Query: 27  ASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTP 86
           A  P  PS  APP     PQ        Q                    P       +TP
Sbjct: 45  AEFPWDPSPQAPPPVAQLPQPLPQPPPTQALQ---------------ALPAGDQQQHNTP 89

Query: 87  VPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQ 146
                       A +   TF +P      + +T P       +    + Q        Q 
Sbjct: 90  --------TGSPAANPPATFALPAGPAGPTIQTEPGQLYPVQVPV-MVTQNPANSPLDQP 140

Query: 147 QQQ-QPIPNIPPL--PNSQQNSSNFQPSNNLY-------NNIQVPPRAQPGNYPSVPAGY 196
            QQ            P S Q  S  Q +            N + PP+   G         
Sbjct: 141 AQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQPNGETPPQQTDGAGDDESEAL 200

Query: 197 PSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
             L +A    +Q+ +  +      +  Q
Sbjct: 201 VRLREADGTLEQRIKGAEGGGAMKVLKQ 228



 Score = 41.7 bits (98), Expect = 0.001
 Identities = 23/127 (18%), Positives = 31/127 (24%), Gaps = 6/127 (4%)

Query: 128 PLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPG 187
           P   A L Q   Q    Q  Q  P  +         + +   P+          P  Q  
Sbjct: 56  PPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTIQTE 115

Query: 188 NYPSVPAGYPSLN--QAGYQQQQQQQQ----QQQQQQQTLNSQFSNLSVNQGMNKMWGYE 241
                P   P +           Q  Q    QQ QQ+    +     S  Q   K    +
Sbjct: 116 PGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQ 175

Query: 242 PHDLLNT 248
                N 
Sbjct: 176 LQQQPNG 182



 Score = 37.0 bits (86), Expect = 0.038
 Identities = 31/228 (13%), Positives = 53/228 (23%), Gaps = 57/228 (25%)

Query: 2   QPLVSNKQGPNSY-----GSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQT 56
             L      P          A +  Q      +P+    A   +             +QT
Sbjct: 60  AQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPT-----IQT 114

Query: 57  PLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQS 116
             G    ++ P++  +  A             QQ Q+     A+              Q 
Sbjct: 115 EPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPAS-------------GQL 161

Query: 117 SKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYN 176
                               QQQ  Q+  + Q Q  PN    P  Q + +    S  L  
Sbjct: 162 P------------------SQQQSAQKNDESQLQQQPN-GETPPQQTDGAGDDESEALVR 202

Query: 177 NIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
             +                  +L Q     +     +  +Q +     
Sbjct: 203 LREAD---------------GTLEQRIKGAEGGGAMKVLKQPKKQAKS 235



 Score = 31.3 bits (71), Expect = 2.2
 Identities = 25/158 (15%), Positives = 42/158 (26%), Gaps = 14/158 (8%)

Query: 1   MQPLVSNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGS 60
             P  S    P +  +       P I + P Q      P+   Q     +NS L  P   
Sbjct: 87  NTPTGSPAANPPATFALPAGPAGPTIQTEPGQLYPVQVPVMVTQ---NPANSPLDQP--- 140

Query: 61  SNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPN----TYSQS 116
           +       +  R+ AP     S   P  QQ  +   ++    Q     PP          
Sbjct: 141 AQQRALQQLQQRYGAPA----SGQLPSQQQSAQKNDESQLQQQPNGETPPQQTDGAGDDE 196

Query: 117 SKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPN 154
           S+               +   +     +  +Q +    
Sbjct: 197 SEALVRLREADGTLEQRIKGAEGGGAMKVLKQPKKQAK 234


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 45.8 bits (108), Expect = 1e-04
 Identities = 41/141 (29%), Positives = 54/141 (38%), Gaps = 12/141 (8%)

Query: 121 PFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQV 180
           P      P+     YQQ QQ    Q Q QQP   + P P  QQ      P    Y   Q 
Sbjct: 751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ-YQQPQQ 809

Query: 181 PPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWGY 240
           P   QP         Y    Q    Q Q QQ QQ    Q  ++    L +  G ++   +
Sbjct: 810 PVAPQP--------QYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPL-H 860

Query: 241 EPHDLLNTRDIL--PATKVEP 259
           +P   L + D+L  P ++VEP
Sbjct: 861 KPTTPLPSLDLLTPPPSEVEP 881



 Score = 40.1 bits (93), Expect = 0.006
 Identities = 24/89 (26%), Positives = 32/89 (35%), Gaps = 1/89 (1%)

Query: 86  PVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQ 145
           PV  Q Q  Q Q     Q  ++ P    +   +       + P       QQ    Q Q 
Sbjct: 758 PVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQY 817

Query: 146 QQQQQPIPNIPPLPNSQQNSSNFQPSNNL 174
           QQ QQP+   P     QQ  +  QP + L
Sbjct: 818 QQPQQPVAPQPQYQQPQQPVAP-QPQDTL 845



 Score = 39.3 bits (91), Expect = 0.011
 Identities = 39/189 (20%), Positives = 56/189 (29%), Gaps = 23/189 (12%)

Query: 24  PLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAP-----GA 78
           P+IA     P G        QP V   N  LQ P+        P      Q P       
Sbjct: 371 PVIAP---APEGYPQQSQYAQPAVQ-YNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPE 426

Query: 79  TINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQ 138
                    P   Q          +Q     P +  Q+ +T        P     LYQQ 
Sbjct: 427 QPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQ-----QPAAQEPLYQQP 481

Query: 139 QQQQQQQQQQQQPI-----PNIPPL----PNSQQNSSNFQPSNNLYNNIQVPPRAQPGNY 189
           Q  +QQ   + +P+     P  PPL       ++ +   +     Y  I  P +      
Sbjct: 482 QPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIK 541

Query: 190 PSVPAGYPS 198
            S+ A   +
Sbjct: 542 SSLKAPSVA 550



 Score = 34.7 bits (79), Expect = 0.28
 Identities = 27/88 (30%), Positives = 33/88 (37%), Gaps = 4/88 (4%)

Query: 132 ASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPS 191
           A L +Q  Q QQQ+   +QP    P       +   F P   L ++    P   P   P 
Sbjct: 697 AELARQFAQTQQQRYSGEQPAGANP----FSLDDFEFSPMKALLDDGPHEPLFTPIVEPV 752

Query: 192 VPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
                P   Q  YQQ QQ    Q Q QQ
Sbjct: 753 QQPQQPVAPQQQYQQPQQPVAPQPQYQQ 780



 Score = 34.3 bits (78), Expect = 0.39
 Identities = 25/129 (19%), Positives = 41/129 (31%), Gaps = 3/129 (2%)

Query: 135 YQQQQQQQQQQQQQQQPIPN-IPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVP 193
           Y QQ Q  Q   Q  +P+   + P       ++        Y      P  QP   P+  
Sbjct: 380 YPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPE 439

Query: 194 AGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWGYEPHDLLNTRDILP 253
              P    A   ++QQ     Q   QT  +     +      +    E   ++    ++ 
Sbjct: 440 Q--PVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVE 497

Query: 254 ATKVEPPPI 262
            TK   PP+
Sbjct: 498 ETKPARPPL 506



 Score = 31.2 bits (70), Expect = 3.1
 Identities = 28/97 (28%), Positives = 30/97 (30%), Gaps = 17/97 (17%)

Query: 128 PLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPG 187
           PL    +   QQ QQ    QQQ   P  P  P  Q      QP         V P+ Q  
Sbjct: 743 PLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQ----QPQQ------PVAPQPQYQ 792

Query: 188 NYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
                 A  P       Q QQ QQ    Q Q     Q
Sbjct: 793 QPQQPVAPQP-------QYQQPQQPVAPQPQYQQPQQ 822



 Score = 30.4 bits (68), Expect = 5.2
 Identities = 27/136 (19%), Positives = 38/136 (27%), Gaps = 4/136 (2%)

Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNS-QQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA 194
            Q     Q     Q   P I P P    Q S   QP+      +Q P + Q   Y     
Sbjct: 354 AQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAE 413

Query: 195 GYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWGYEPHDLLNTRDILPA 254
                 Q  Y     +Q  QQ        Q    +  Q   +   + P     T      
Sbjct: 414 QPA---QQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQ 470

Query: 255 TKVEPPPIRLPQNLPE 270
              + P  + PQ + +
Sbjct: 471 PAAQEPLYQQPQPVEQ 486


>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175). 
          Length = 820

 Score = 45.3 bits (108), Expect = 1e-04
 Identities = 27/98 (27%), Positives = 36/98 (36%), Gaps = 6/98 (6%)

Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQ-PGNYPSVPA 194
           Q  Q  Q  Q + QQ +  +      QQ       S+  + ++Q    AQ          
Sbjct: 574 QVTQGGQGGQSEMQQAMEGLGETLREQQ-----GLSDETFRDLQEQFNAQRGEQQGQQGQ 628

Query: 195 GYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQ 232
           G     Q G Q QQ Q QQQ QQ Q       +L+  Q
Sbjct: 629 GGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQ 666



 Score = 43.4 bits (103), Expect = 5e-04
 Identities = 19/99 (19%), Positives = 28/99 (28%), Gaps = 21/99 (21%)

Query: 135 YQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA 194
           ++  Q+Q   Q+ +QQ           Q      Q         Q   + Q G       
Sbjct: 608 FRDLQEQFNAQRGEQQGQQGQGGQGQGQPGQQGQQG--------QGQQQGQQG------- 652

Query: 195 GYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQG 233
                 Q G   Q    ++QQ  +  L  Q   L    G
Sbjct: 653 ------QGGQGGQGSLAERQQALRDELGRQRGGLPGMGG 685



 Score = 41.8 bits (99), Expect = 0.002
 Identities = 26/94 (27%), Positives = 33/94 (35%), Gaps = 16/94 (17%)

Query: 134 LYQQQQQQQQQQQQQQQPIPNIPPLPNS---QQNSSNFQPSNNLYNNIQVPPRAQPGNYP 190
           L   Q  Q  Q + QQ     +  L  +   QQ       S+  + ++Q    AQ G   
Sbjct: 573 LQVTQGGQGGQSEMQQA----MEGLGETLREQQ-----GLSDETFRDLQEQFNAQRGEQQ 623

Query: 191 SVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
               G       G  Q  QQ QQ Q QQQ    Q
Sbjct: 624 ----GQQGQGGQGQGQPGQQGQQGQGQQQGQQGQ 653



 Score = 32.2 bits (74), Expect = 1.5
 Identities = 20/126 (15%), Positives = 33/126 (26%), Gaps = 54/126 (42%)

Query: 134 LYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNS---------------------------S 166
           L +Q Q+  QQ  Q            NS + +                            
Sbjct: 512 LAEQAQRNPQQGDQPPDQ-------GNSMELTQQDLQRMMDRIEELMESGRRAEAQQLLE 564

Query: 167 NFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAG-------------YQQQQQQQQQ 213
             Q    +  N+QV    Q G          ++   G             ++  Q+Q   
Sbjct: 565 QLQ---QMMENLQVTQGGQGGQ----SEMQQAMEGLGETLREQQGLSDETFRDLQEQFNA 617

Query: 214 QQQQQQ 219
           Q+ +QQ
Sbjct: 618 QRGEQQ 623



 Score = 29.5 bits (67), Expect = 9.5
 Identities = 10/15 (66%), Positives = 10/15 (66%)

Query: 136 QQQQQQQQQQQQQQQ 150
            QQQ QQQ  Q QQQ
Sbjct: 740 AQQQGQQQGGQGQQQ 754



 Score = 29.5 bits (67), Expect = 9.5
 Identities = 10/15 (66%), Positives = 10/15 (66%)

Query: 205 QQQQQQQQQQQQQQQ 219
            QQQ QQQ  Q QQQ
Sbjct: 740 AQQQGQQQGGQGQQQ 754


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 43.0 bits (101), Expect = 9e-04
 Identities = 34/194 (17%), Positives = 56/194 (28%), Gaps = 4/194 (2%)

Query: 11   PNSYGSASNVMQVPLIASNPSQ-PSGAAPPISGLQPQVPVSNSS-LQTPLGSSNMMRPPL 68
            P         +  P  A  P+  P+G  PP +  QP  P          L     + P  
Sbjct: 2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG 2860

Query: 69   VTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTP 128
               R     +     + P     +R  +   + S ++F +PP+   +  +          
Sbjct: 2861 DVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQ 2920

Query: 129  LTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNN--LYNNIQVPPRAQP 186
                   Q Q       + Q    P   P    + + +  QP     +   + VP    P
Sbjct: 2921 PQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVP 2980

Query: 187  GNYPSVPAGYPSLN 200
               PS  A   S  
Sbjct: 2981 QPAPSREAPASSTP 2994


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
           includes the B. subtilis YppG protein, which is
           functionally uncharacterized. This family of proteins is
           found in bacteria. Proteins in this family are typically
           between 115 and 181 amino acids in length. There are two
           completely conserved residues (F and G) that may be
           functionally important.
          Length = 110

 Score = 36.2 bits (84), Expect = 0.012
 Identities = 35/121 (28%), Positives = 39/121 (32%), Gaps = 38/121 (31%)

Query: 124 NGLTPLTNASLYQQQQQQ--QQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVP 181
           N     TN    Q QQQQ  QQQ   QQ P P   P                       P
Sbjct: 1   NPYQQNTNQYPPQNQQQQPYQQQPYHQQMPPPPYSP-----------------------P 37

Query: 182 PRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSN----LSVNQGMNKM 237
            + Q    P  P  YP          Q  QQQQ  Q  +  SQF N       N+ MN +
Sbjct: 38  QQQQGHFMPPQPQPYPK---------QSPQQQQPPQFSSFLSQFKNSDGTFDFNKMMNTV 88

Query: 238 W 238
            
Sbjct: 89  G 89


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 38.8 bits (90), Expect = 0.012
 Identities = 36/144 (25%), Positives = 43/144 (29%), Gaps = 22/144 (15%)

Query: 89  QQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASL---YQQQQQQQQQQ 145
           Q      Q +  HS            +        +      NA+L      Q  Q  QQ
Sbjct: 28  QLSHEEAQSSEAHSFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQ 87

Query: 146 QQQQQP-------------IPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSV 192
           QQ Q P             + ++P  P  QQ      PS       Q P   QP   P V
Sbjct: 88  QQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQV 147

Query: 193 PAGYPSLNQAGYQQQQQQQQQQQQ 216
           P       Q  YQ   QQ Q QQ 
Sbjct: 148 P------QQQQYQSPPQQPQYQQN 165



 Score = 34.9 bits (80), Expect = 0.20
 Identities = 40/197 (20%), Positives = 55/197 (27%), Gaps = 18/197 (9%)

Query: 27  ASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTP 86
           A +    S   PP      +  ++      PL   N   PP    +   P     S + P
Sbjct: 39  AHSFHVDSTKQPPAPEQVAKHELA----DAPLQQVNAALPPAPAPQSPQPDQQQQSQAPP 94

Query: 87  VPQQ-FQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLT---------NASLYQ 136
             Q   Q P QQ  +  QQ        Y   S+  P                      YQ
Sbjct: 95  SHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQ 154

Query: 137 QQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGY 196
              QQ Q QQ       + P +       S +QP +  Y   +  P +     P   +G 
Sbjct: 155 SPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQS--YPPNEPLPSSMAMQPP--YSGA 210

Query: 197 PSLNQAGYQQQQQQQQQ 213
           P   Q     Q      
Sbjct: 211 PPSQQFYGPPQPSPYMY 227



 Score = 32.6 bits (74), Expect = 1.0
 Identities = 40/216 (18%), Positives = 51/216 (23%), Gaps = 17/216 (7%)

Query: 18  SNVMQVPLIASNPSQPSGAAPPISGLQPQVPVS--NSSLQTPLGSSNMMRPPLVTNRFQA 75
            N    P  A    QP       +    Q P       +Q+        + P      Q 
Sbjct: 69  VNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQP 128

Query: 76  PGATINSLSTPVPQ-QFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASL 134
                       PQ   Q PQQQ      Q            +++AP  +GL P   +  
Sbjct: 129 QPPPAQQPQAQQPQPPPQVPQQQQYQSPPQ-QPQYQQNPPPQAQSAPQVSGLYP-EESPY 186

Query: 135 YQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNN------------IQVPP 182
             Q     +         P     P SQQ     QPS  +Y               Q PP
Sbjct: 187 QPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPP 246

Query: 183 RAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQ 218
                 Y          N         Q   Q    
Sbjct: 247 SQGQEGYGYSGPPPSKGNHGSVASYAPQGSSQSYST 282


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 38.6 bits (90), Expect = 0.013
 Identities = 27/169 (15%), Positives = 38/169 (22%), Gaps = 30/169 (17%)

Query: 63  MMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPF 122
            MR   + +                PQQ    Q            MP             
Sbjct: 381 RMRQLPMGSPMG-GAMGQPPYYGQGPQQQFNGQPLGWPRMSM---MPTPMGPGGPLRP-- 434

Query: 123 NNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPP 182
            NGL P+       +   +  Q   Q+ P+  +   PN Q                    
Sbjct: 435 -NGLAPMNA----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPL-------------SQD 476

Query: 183 RAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVN 231
             QP       +      Q     Q       Q Q+Q L  +   L   
Sbjct: 477 LPQP------QSTASQGGQNKKLAQVLASATPQMQKQVLGERLFPLVEA 519



 Score = 37.5 bits (87), Expect = 0.030
 Identities = 22/136 (16%), Positives = 34/136 (25%), Gaps = 12/136 (8%)

Query: 111 NTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPI------PNIPPLPNSQQN 164
           + + Q           +P+  A        Q  QQQ   QP+          P+      
Sbjct: 373 DQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432

Query: 165 SSNFQPSNNL-----YNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
             N     N       N      +         P  Y SL  +    Q Q    Q  Q +
Sbjct: 433 RPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPP-NYQSLPLSQDLPQPQSTASQGGQNK 491

Query: 220 TLNSQFSNLSVNQGMN 235
            L    ++ +      
Sbjct: 492 KLAQVLASATPQMQKQ 507



 Score = 34.8 bits (80), Expect = 0.20
 Identities = 18/92 (19%), Positives = 24/92 (26%), Gaps = 4/92 (4%)

Query: 132 ASLYQQQQQQQQQQQQQQQPIPNIPPL---PNSQQNSSNFQPSNNLYNNIQVP-PRAQPG 187
           A L  Q  Q Q + +Q     P    +   P   Q          L        P     
Sbjct: 369 AHLQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGP 428

Query: 188 NYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
             P  P G   +N      +  Q   Q+   Q
Sbjct: 429 GGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQ 460



 Score = 33.6 bits (77), Expect = 0.49
 Identities = 29/125 (23%), Positives = 38/125 (30%), Gaps = 18/125 (14%)

Query: 29  NPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVP 88
           +P   +   PP  G  PQ   +   L  P  S  MM  P+       P    N L+    
Sbjct: 389 SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMS--MMPTPMGPGGPLRP----NGLAPMNA 442

Query: 89  QQFQRPQQQNA--NHSQQTFRMPPNTYSQSS-------KTAPFNNGLTPLTN---ASLYQ 136
            +      QNA      Q    PPN  S          ++     G         AS   
Sbjct: 443 VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATP 502

Query: 137 QQQQQ 141
           Q Q+Q
Sbjct: 503 QMQKQ 507


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 38.5 bits (89), Expect = 0.017
 Identities = 23/112 (20%), Positives = 39/112 (34%), Gaps = 21/112 (18%)

Query: 108 MPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSN 167
           +PPN      + AP        T++  ++ +   +  QQ   Q     P + N  Q    
Sbjct: 172 IPPN----PPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQ---- 223

Query: 168 FQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
                         P+ QP   P  PA  P+      Q  Q +Q  ++ ++Q
Sbjct: 224 -------------QPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRSREQ 262



 Score = 35.0 bits (80), Expect = 0.17
 Identities = 16/60 (26%), Positives = 19/60 (31%)

Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAG 195
            QQ   Q   QQ     P   P    QQ            N  Q PP+ + G+  S   G
Sbjct: 204 TQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRSREQG 263



 Score = 32.7 bits (74), Expect = 0.87
 Identities = 15/63 (23%), Positives = 24/63 (38%), Gaps = 1/63 (1%)

Query: 88  PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
           P+  Q+P  QN    Q T + P     Q  +  P      P       Q  Q +Q  ++ 
Sbjct: 201 PKPTQQPTVQNPAQ-QPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRS 259

Query: 148 QQQ 150
           ++Q
Sbjct: 260 REQ 262


>gnl|CDD|111993 pfam03157, Glutenin_hmw, High molecular weight glutenin subunit.
           Members of this family include high molecular weight
           subunits of glutenin. This group of gluten proteins is
           thought to be largely responsible for the elastic
           properties of gluten, and hence, doughs. Indeed,
           glutenin high molecular weight subunits are classified
           as elastomeric proteins, because the glutenin network
           can withstand significant deformations without breaking,
           and return to the original conformation when the stress
           is removed. Elastomeric proteins differ considerably in
           amino acid sequence, but they are all polymers whose
           subunits consist of elastomeric domains, composed of
           repeated motifs, and non-elastic domains that mediate
           cross-linking between the subunits. The elastomeric
           domain motifs are all rich in glycine residues in
           addition to other hydrophobic residues. High molecular
           weight glutenin subunits have an extensive central
           elastomeric domain, flanked by two terminal non-elastic
           domains that form disulphide cross-links. The central
           elastomeric domain is characterized by the following
           three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It
           possesses overlapping beta-turns within and between the
           repeated motifs, and assumes a regular helical secondary
           structure with a diameter of approx. 1.9 nm and a pitch
           of approx. 1.5 nm.
          Length = 779

 Score = 38.5 bits (88), Expect = 0.019
 Identities = 37/138 (26%), Positives = 48/138 (34%), Gaps = 2/138 (1%)

Query: 88  PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
           P Q Q+  Q    H   + +            +P   G            Q QQ +Q QQ
Sbjct: 216 PGQLQQSGQGQPGHYPTSLQQLGQGQQGHYLASPQQPGQGQQPGQLQQPAQGQQPEQGQQ 275

Query: 148 QQQPIPNIP-PLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQ 206
            QQP        P   Q     QP +    + Q P + QPG+YP+         + G  Q
Sbjct: 276 GQQPAQGQQGHQPAQGQQPGQGQPGH-YPASPQQPGQGQPGHYPASSQQPTQSQEPGQGQ 334

Query: 207 QQQQQQQQQQQQQTLNSQ 224
           Q QQ  Q QQ Q     Q
Sbjct: 335 QGQQVGQGQQAQIPAQGQ 352


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 37.3 bits (87), Expect = 0.024
 Identities = 30/118 (25%), Positives = 39/118 (33%), Gaps = 11/118 (9%)

Query: 77  GATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQ 136
           G  +NS +    +Q Q  +Q  A+  QQ  ++    Y          N  TP    S   
Sbjct: 71  GGEVNSPTQLTDEQRQLLEQMQADMRQQPTQLSEVPY----------NEQTPQVPRST-V 119

Query: 137 QQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA 194
           Q QQQ QQQQ         P  P  Q  +     +          P  Q    P V A
Sbjct: 120 QIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAPKVEA 177


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 38.1 bits (88), Expect = 0.028
 Identities = 49/182 (26%), Positives = 64/182 (35%), Gaps = 12/182 (6%)

Query: 12  NSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQT--PLGSSNMMRPPLV 69
           N   S S+  Q  L    P  PS   PP + L P  P    S Q   P GS    +P   
Sbjct: 160 NESDSDSSAQQQLLQPQGP--PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQ 217

Query: 70  TNRFQAPGATINSLSTPVPQQFQRP----QQQNANHSQQTFRMPPNTYSQSSKTAPFNNG 125
               Q    ++ S  +  PQ+   P    Q Q A+        P + + QSS   P    
Sbjct: 218 PQ--QPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPM 275

Query: 126 LTPLTNASLYQQQQQQQQQQQQQQQPIPNIP-PLPNSQQNSSNFQPSNNLYNNIQVPPRA 184
              L    ++ Q       Q          P PLP+  Q  S+  PS +     Q PPR 
Sbjct: 276 PHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQ-PPRE 334

Query: 185 QP 186
           QP
Sbjct: 335 QP 336


>gnl|CDD|172294 PRK13756, PRK13756, tetracycline repressor protein TetR;
           Provisional.
          Length = 205

 Score = 36.4 bits (84), Expect = 0.040
 Identities = 24/113 (21%), Positives = 51/113 (45%), Gaps = 12/113 (10%)

Query: 47  VPVSNSSLQTPLGSSNM-MRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQT 105
           +P+   S Q  L ++    R  L++ R    GA ++  + P  +Q++  + Q A   QQ 
Sbjct: 68  LPLEGESWQDFLRNNAKSFRCALLSYR---DGAKVHLGTRPTEKQYETLENQLAFLCQQG 124

Query: 106 FRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPL 158
           F +    Y+ S+        +   T   + + Q+ Q  +++++     ++PPL
Sbjct: 125 FSLENALYALSA--------VGHFTLGCVLEDQEHQVAKEERETPTTDSMPPL 169


>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
          Length = 332

 Score = 36.6 bits (85), Expect = 0.044
 Identities = 23/137 (16%), Positives = 41/137 (29%), Gaps = 12/137 (8%)

Query: 91  FQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQ 150
           F R +  +   S +    P     QS           P T  S   + + + Q QQ+ +Q
Sbjct: 40  FTRTRPPSRPISNEEADQPNTLNPQSYVETTPPPFQQPQTEES---ESENEVQIQQEVEQ 96

Query: 151 PIPNIP-PLPNSQQNSSNFQPSNNLYNNIQVPPRAQP--GNYPSVPAGYPSLNQAGYQQ- 206
            +  I   LPN +         +      Q P    P   N  S+            +  
Sbjct: 97  SVDEIKITLPNQEPAYYMQNHRSEPIQPTQ-PQYQSPTQTNVASMTIEETQSPNVPIEGI 155

Query: 207 ----QQQQQQQQQQQQQ 219
               +Q + +  +   +
Sbjct: 156 NSSSEQLRVELAELAAE 172



 Score = 35.5 bits (82), Expect = 0.10
 Identities = 23/145 (15%), Positives = 44/145 (30%), Gaps = 3/145 (2%)

Query: 74  QAPGATINSLSTPVPQQFQRPQ--QQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTN 131
           Q       S     P  FQ+PQ  +  + +  Q  +    +  +   T P N        
Sbjct: 57  QPNTLNPQSYVETTPPPFQQPQTEESESENEVQIQQEVEQSVDEIKITLP-NQEPAYYMQ 115

Query: 132 ASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPS 191
               +  Q  Q Q Q   Q       +  +Q  +   +  N+    ++V          S
Sbjct: 116 NHRSEPIQPTQPQYQSPTQTNVASMTIEETQSPNVPIEGINSSSEQLRVELAELAAEIYS 175

Query: 192 VPAGYPSLNQAGYQQQQQQQQQQQQ 216
             +    L +   + Q + + Q + 
Sbjct: 176 DASHRVELAKNFMEPQAETEAQPEA 200



 Score = 32.4 bits (74), Expect = 1.0
 Identities = 26/181 (14%), Positives = 47/181 (25%), Gaps = 9/181 (4%)

Query: 6   SNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMR 65
           SN++  + Y S +N        S P     A  P + L PQ  V  +        +    
Sbjct: 25  SNRREKSQYFSNANTFTRTRPPSRPISNEEADQPNT-LNPQSYVETTPPPFQQPQTEESE 83

Query: 66  PPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNG 125
                   Q    +++ +   +P Q      QN          P       +  A     
Sbjct: 84  SENEVQIQQEVEQSVDEIKITLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVASMTIE 143

Query: 126 LTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLP--------NSQQNSSNFQPSNNLYNN 177
            T   N  +       +Q + +  +    I            N  +  +  +       N
Sbjct: 144 ETQSPNVPIEGINSSSEQLRVELAELAAEIYSDASHRVELAKNFMEPQAETEAQPEATTN 203

Query: 178 I 178
            
Sbjct: 204 F 204



 Score = 29.3 bits (66), Expect = 8.9
 Identities = 23/169 (13%), Positives = 45/169 (26%), Gaps = 16/169 (9%)

Query: 111 NTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQP 170
           NT++++   +   +        +L  Q   +      QQ            Q      Q 
Sbjct: 38  NTFTRTRPPSRPISNEEADQPNTLNPQSYVETTPPPFQQPQTEESESENEVQIQ----QE 93

Query: 171 SNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNL-- 228
                + I++         P+    Y   N      Q  Q Q Q   Q  + S       
Sbjct: 94  VEQSVDEIKI-------TLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVASMTIEETQ 146

Query: 229 SVNQGMNKMWGYEPHDLLNTRDILP-ATKVEPPPIRLPQNL--PEIHEQ 274
           S N  +  +        +   ++           + L +N   P+   +
Sbjct: 147 SPNVPIEGINSSSEQLRVELAELAAEIYSDASHRVELAKNFMEPQAETE 195


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 37.2 bits (86), Expect = 0.045
 Identities = 35/149 (23%), Positives = 43/149 (28%), Gaps = 29/149 (19%)

Query: 77  GATINSLS---TPVPQQFQRPQQQNANHSQQ-TFRMPPNTYSQSSKTAPFNNGLTPLTNA 132
           G +I  L       P    R + Q    SQ    R   N   QSS+              
Sbjct: 432 GQSIRFLPQSDQQRPVSRGRGRGQRGPRSQPQNQRRQQNRGRQSSQPPR----------- 480

Query: 133 SLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPG-NYPS 191
              QQQ +  Q  Q+Q                    Q   NL      P R   G     
Sbjct: 481 ---QQQNRSNQNNQRQS----------QGPNQGPRGQGGYNLRPRTYQPQRYGGGQGRRW 527

Query: 192 VPAGYPSLNQAGYQQQQQQQQQQQQQQQT 220
            P  Y    Q   QQQQ Q + +  Q +T
Sbjct: 528 NPNPYRQSGQGRSQQQQPQPEARGNQSRT 556


>gnl|CDD|227400 COG5068, ARG80, Regulator of arginine metabolism and related MADS
           box-containing transcription factors [Transcription].
          Length = 412

 Score = 36.5 bits (84), Expect = 0.063
 Identities = 30/221 (13%), Positives = 49/221 (22%), Gaps = 44/221 (19%)

Query: 28  SNPSQPSGAAPPISGLQPQVPVSNS---SLQTPLGSSNMMRPPLVTNRFQA--------- 75
           SN    S   P  S      P  N+   S Q          P       Q          
Sbjct: 160 SNAPSDSSEEPSSSASFSVDPNDNNPMGSFQHNGSPQTNFIPLQNPQTQQYQQHSSRKDH 219

Query: 76  -PGATINSLSTPVPQQFQRPQQQNANHSQQTFRMP---------------PNTYSQSSKT 119
                 N+ +   P +F  P+  +++ +                      P   +    T
Sbjct: 220 PTVPHSNTNNGRPPAKFMIPELHSSHSTLDLPSDFISDSGFPNQSSTSIFPLDSAIIQIT 279

Query: 120 APFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQ 179
            P      P  N             +                 +N  N  P+ +  NN  
Sbjct: 280 PPHLPNNPPQENRHELYSNDSSMVSET-------------PPPKNLPNGSPNQSPLNN-- 324

Query: 180 VPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQT 220
              R  P +  S+      +    +  +Q           T
Sbjct: 325 -LSRGNPASPNSIIRENNQVEDESFNGRQGSAIWNALISTT 364



 Score = 33.5 bits (76), Expect = 0.59
 Identities = 31/193 (16%), Positives = 50/193 (25%), Gaps = 22/193 (11%)

Query: 2   QPLVSNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSS 61
           Q   S K  P    S +N  + P     P   S     +      +  S    Q    SS
Sbjct: 211 QQHSSRKDHPTVPHSNTNNGRPPAKFMIPELHSSH-STLDLPSDFISDSGFPNQ----SS 265

Query: 62  NMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAP 121
             + P         P    N+       +       ++  S+     P N  + S   +P
Sbjct: 266 TSIFPLDSAIIQITPPHLPNNPPQENRHELY--SNDSSMVSETP--PPKNLPNGSPNQSP 321

Query: 122 FNNGLTPLTNAS-------------LYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNF 168
            NN       +               +  +Q             PN      +    S+ 
Sbjct: 322 LNNLSRGNPASPNSIIRENNQVEDESFNGRQGSAIWNALISTTQPNSGLHTEASTAPSST 381

Query: 169 QPSNNLYNNIQVP 181
            P++ L N  Q  
Sbjct: 382 IPADPLKNAAQTN 394


>gnl|CDD|220774 pfam10477, EIF4E-T, Nucleocytoplasmic shuttling protein for mRNA
           cap-binding EIF4E.  EIF4E-T is the transporter protein
           for shuttling the mRNA cap-binding protein EIF4E
           protein, targeting it for nuclear import. EIF4E-T
           contains several key binding domains including two
           functional leucine-rich NESs (nuclear export signals)
           between residues 438-447 and 613-638 in the human
           protein. The other two binding domains are an
           EIF4E-binding site, between residues 27-42 in Q9EST3,
           and a bipartite NLS (nuclear localisation signals)
           between 194-211, and these lie in family EIF4E-T_N.
           EIF4E is the eukaryotic translation initiation factor 4E
           that is the rate-limiting factor for cap-dependent
           translation initiation.
          Length = 520

 Score = 35.7 bits (81), Expect = 0.12
 Identities = 37/188 (19%), Positives = 56/188 (29%), Gaps = 18/188 (9%)

Query: 22  QVPLIASNPSQPSGAAPPISGLQPQ----VPVSNSSLQTPLGSSNMMRPPLVTNRFQAPG 77
           Q+P +AS P Q      PI+G         P  N++ +  L    MM      N   A  
Sbjct: 326 QMPKLASRPQQ--DLINPIAGSLGPGGDPPPDQNNAFKELLD---MMAQNRANNNLVAQQ 380

Query: 78  ATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPL-------T 130
              N    P+P       Q+         R+     S S      +N  TP        T
Sbjct: 381 QQYNV--PPMPTSSHVFLQKRLEIQHLIQRLHCGDVSHSFLEKFLDNPSTPAATKDVIAT 438

Query: 131 NASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYP 190
              +  + QQ +             P  P S     + +  +       + P    G  P
Sbjct: 439 LLQMLPKNQQSEVVDGDPNIATQQSPLQPQSVHQHYSQELHSQNTARHNISPVISHGTSP 498

Query: 191 SVPAGYPS 198
             P+  P+
Sbjct: 499 LSPSFTPT 506


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 35.1 bits (81), Expect = 0.13
 Identities = 18/106 (16%), Positives = 31/106 (29%), Gaps = 13/106 (12%)

Query: 127 TPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQ--------PSNNLYNNI 178
             L++ S            +    P    P   +S+ +++                    
Sbjct: 4   NSLSSISSGSSAPIGTSALRGTNTPAAAKPAPSSSEASNAGSGSSEQKAKLKGQASTAAG 63

Query: 179 QVPPRAQPGNYPSVPAG-----YPSLNQAGYQQQQQQQQQQQQQQQ 219
                 Q     S          P  +QA   +Q  Q+QQ+QQQQ+
Sbjct: 64  SASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQE 109



 Score = 30.9 bits (70), Expect = 2.4
 Identities = 20/125 (16%), Positives = 42/125 (33%), Gaps = 16/125 (12%)

Query: 28  SNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPV 87
           S  S  SG++ PI     +   + ++ +    SS                   N+ S   
Sbjct: 5   SLSSISSGSSAPIGTSALRGTNTPAAAKPAPSSS----------------EASNAGSGSS 48

Query: 88  PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
            Q+ +   Q +      +  +       + +              +  Q  Q+QQ+QQQQ
Sbjct: 49  EQKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQ 108

Query: 148 QQQPI 152
           +++ +
Sbjct: 109 EEKQV 113



 Score = 29.4 bits (66), Expect = 7.7
 Identities = 19/138 (13%), Positives = 39/138 (28%), Gaps = 27/138 (19%)

Query: 80  INSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQ 139
           I++  + +      P   +A     T           S +   N G       +  + Q 
Sbjct: 2   ISNSLSSISSGSSAPIGTSALRGTNTPAA---AKPAPSSSEASNAGSGSSEQKAKLKGQA 58

Query: 140 QQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSL 199
                   Q+                   + ++      + P + Q              
Sbjct: 59  STAAGSASQEL-------------QKQASESNDEEVVGEEEPEKDQAEA----------- 94

Query: 200 NQAGYQQQQQQQQQQQQQ 217
            +   Q+QQ+QQQQ+++Q
Sbjct: 95  GKQLSQKQQEQQQQEEKQ 112


>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
           (DUF2076).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function. The domain,
           however, is found in various periplasmic ligand-binding
           sensor proteins.
          Length = 234

 Score = 34.6 bits (80), Expect = 0.16
 Identities = 18/88 (20%), Positives = 25/88 (28%), Gaps = 14/88 (15%)

Query: 132 ASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPS 191
           A+L Q   + ++ + Q Q P           Q+S  F     L           P   P+
Sbjct: 55  AALKQANARIEELEAQAQHP---------QSQSSGGF-----LSGMFGGGAPRPPPAAPA 100

Query: 192 VPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
           V    P            QQ   QQ   
Sbjct: 101 VQPPAPPARPGWGSGGPSQQGAGQQPGY 128



 Score = 33.1 bits (76), Expect = 0.44
 Identities = 18/86 (20%), Positives = 23/86 (26%), Gaps = 13/86 (15%)

Query: 132 ASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPS 191
                + ++ + Q Q  Q            Q +      S         PP A P   P 
Sbjct: 58  KQANARIEELEAQAQHPQS-----------QSSGGFL--SGMFGGGAPRPPPAAPAVQPP 104

Query: 192 VPAGYPSLNQAGYQQQQQQQQQQQQQ 217
            P   P     G  QQ   QQ    Q
Sbjct: 105 APPARPGWGSGGPSQQGAGQQPGYAQ 130


>gnl|CDD|113452 pfam04684, BAF1_ABF1, BAF1 / ABF1 chromatin reorganising factor.
           ABF1 is a sequence-specific DNA binding protein involved
           in transcription activation, gene silencing and
           initiation of DNA replication. ABF1 is known to remodel
           chromatin, and it is proposed that it mediates its
           effects on transcription and gene expression by
           modifying local chromatin architecture. These functions
           require a conserved stretch of 20 amino acids in the
           C-terminal region of ABF1 (amino acids 639 to 662 S.
           cerevisiae). The N-terminal two thirds of the protein
           are necessary for DNA binding, and the N-terminus (amino
           acids 9 to 91 in S. cerevisiae) is thought to contain a
           novel zinc-finger motif which may stabilise the protein
           structure.
          Length = 508

 Score = 35.1 bits (80), Expect = 0.17
 Identities = 19/66 (28%), Positives = 33/66 (50%), Gaps = 4/66 (6%)

Query: 158 LPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQ 217
           LPN+ Q S      +  +++  +P   Q      +P G+   +Q  +Q QQQ  Q Q+Q 
Sbjct: 280 LPNAHQISGADHHQHQHHHHPSIPAHHQH----QLPEGHQHDHQHHHQHQQQDLQHQEQH 335

Query: 218 QQTLNS 223
           QQ +++
Sbjct: 336 QQHVDN 341



 Score = 30.9 bits (69), Expect = 3.3
 Identities = 13/40 (32%), Positives = 18/40 (45%)

Query: 185 QPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
           Q  ++PS+PA +      G+Q   Q   Q QQQ      Q
Sbjct: 295 QHHHHPSIPAHHQHQLPEGHQHDHQHHHQHQQQDLQHQEQ 334


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 35.1 bits (81), Expect = 0.20
 Identities = 22/114 (19%), Positives = 38/114 (33%), Gaps = 7/114 (6%)

Query: 2   QPLVSNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQP-QVPVSNSSLQTPLGS 60
            P  S     +   S S+    P    + +QP+G  P +S   P  VPV       P  +
Sbjct: 389 APQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPV-----NPPSTA 443

Query: 61  SNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYS 114
              +RP       + P + ++SL     +  Q   +Q    + +          
Sbjct: 444 PQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATG-NIKEAPTGTQKEI 496



 Score = 31.3 bits (71), Expect = 2.5
 Identities = 23/143 (16%), Positives = 37/143 (25%), Gaps = 29/143 (20%)

Query: 39  PISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQN 98
           P +  QP    + S   +                 Q+  A   S      Q    P   +
Sbjct: 386 PAAAPQPSAAAAASPSPS-----------------QSSAAAQPSAPQSATQPAGTPPTVS 428

Query: 99  ANHSQQTFRMPPNTYSQSSKTAPFNN------GLTPLTNASLYQQQQQQQQQQQQQQQPI 152
            +        PP+T  Q+ + A F                S  +  Q++ +Q        
Sbjct: 429 VDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATG----- 483

Query: 153 PNIPPLPNSQQNSSNFQPSNNLY 175
            NI   P   Q     +     Y
Sbjct: 484 -NIKEAPTGTQKEIFTEEDLQYY 505


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 33.9 bits (78), Expect = 0.31
 Identities = 16/85 (18%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 137 QQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA-- 194
           ++  +Q+ QQ  +Q  P          +SS    +         P R    N     A  
Sbjct: 135 EEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSA--------KPER----NVSQEEAKK 182

Query: 195 GYPSLNQAGYQQQQQQQQQQQQQQQ 219
                     +QQQQ++++++++Q+
Sbjct: 183 RLQEWELKKLKQQQQKREEERRKQR 207


>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
           factor (transactivator).  This family includes EBV BRLF1
           and similar ORF 50 proteins from other herpesviruses.
          Length = 500

 Score = 34.0 bits (78), Expect = 0.39
 Identities = 27/187 (14%), Positives = 50/187 (26%), Gaps = 23/187 (12%)

Query: 33  PSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQ 92
            S  A P  G+Q  VP S+ S  +   + +  +P  +                  P+Q  
Sbjct: 285 SSPQAEPEQGVQSYVPQSSDSRPSCFPAPSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEG 344

Query: 93  RPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPI 152
                              T  ++ +  P  +G +P +N+ L             Q   +
Sbjct: 345 GAAVSQN--------HDGGT-VRAPRGRPSGSGQSPPSNSPLLSSLADTPSGAAHQPASL 395

Query: 153 PNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQ 212
                +    +++S+ QP               PG+   VP                + Q
Sbjct: 396 LPPAVVQQQLEDASDKQPP-------------TPGSSL-VPQPDEQELGPSVMALLDRDQ 441

Query: 213 QQQQQQQ 219
                  
Sbjct: 442 GILALIL 448



 Score = 32.1 bits (73), Expect = 1.7
 Identities = 26/180 (14%), Positives = 46/180 (25%), Gaps = 27/180 (15%)

Query: 54  LQTPLGSSNMMRPPLVTNRFQAPGATINSLST----PVPQQFQRPQQQNANHSQQTFRMP 109
           LQ P         PL  ++F       +               RPQQ  +          
Sbjct: 196 LQNPEEILETRASPL--SQFHGFTPHPSLPQPQSPLKPSPSSARPQQSESFS-------- 245

Query: 110 PNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQ 169
            + +  S+++        PL   +      ++    Q++Q    +    P     S   Q
Sbjct: 246 -DVWPASTQSPREETSAEPL---APASPSSRRPSTAQEEQIACSSPQAEPEQGVQSYVPQ 301

Query: 170 PSNNLYNNIQVPPRAQPGNYP---------SVPAGYPSLNQAGYQQQQQQQQQQQQQQQT 220
            S++  +    P   QP   P                +  Q G     Q       +   
Sbjct: 302 SSDSRPSCFPAPSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTVRAPR 361


>gnl|CDD|180234 PRK05751, PRK05751, preprotein translocase subunit SecB; Validated.
          Length = 156

 Score = 32.5 bits (75), Expect = 0.45
 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 6/33 (18%)

Query: 194 AGYPSLN------QAGYQQQQQQQQQQQQQQQT 220
            G+P LN       A Y+Q+ QQ QQ   + Q 
Sbjct: 123 GGFPPLNLAPINFDALYEQRLQQAQQAAAEAQE 155



 Score = 32.1 bits (74), Expect = 0.53
 Identities = 12/31 (38%), Positives = 17/31 (54%), Gaps = 2/31 (6%)

Query: 121 PFNNGLTPLTNASLYQQQQQQQQQQQQQQQP 151
           P N  L P+   +LY+Q+ QQ QQ   + Q 
Sbjct: 127 PLN--LAPINFDALYEQRLQQAQQAAAEAQE 155


>gnl|CDD|227358 COG5025, COG5025, Transcription factor of the Forkhead/HNF3 family
           [Transcription].
          Length = 610

 Score = 34.0 bits (78), Expect = 0.46
 Identities = 30/186 (16%), Positives = 56/186 (30%), Gaps = 25/186 (13%)

Query: 7   NKQGPNSYGSASNVMQVPLIASNPSQPSGAAP----PISGLQPQVPVSNSSLQTPLGSSN 62
           N + P    SA +V Q   +  N    S A        S +  Q    ++ + +  G SN
Sbjct: 426 NPRSPKKSPSAHSVHQKLSLHVNDLYQSPATSDIASSSSQVNSQPEFISTQIHSSKGVSN 485

Query: 63  MMRPPLVTNRFQAPG-------------ATINSLSTPVPQQFQRPQQQNANHSQQTFRMP 109
           +      + +  + G               INS S       Q+ Q  + N         
Sbjct: 486 VDLTEQDSQKEASKGNFLDDSGSLSPNTNEINSFSLNTTDS-QQKQSPSHNAPTNNSLNE 544

Query: 110 PNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPN-------IPPLPNSQ 162
             + + +S+T   N+        ++     Q ++     Q   P        +   PN  
Sbjct: 545 MASKNSNSQTQASNSNENVAAVKAILDASAQMEKPYDLSQAATPTKATESASVRQAPNPP 604

Query: 163 QNSSNF 168
            + +  
Sbjct: 605 PHQNLV 610


>gnl|CDD|227911 COG5624, TAF61, Transcription initiation factor TFIID, subunit
           TAF12 (also component of histone acetyltransferase SAGA)
           [Transcription].
          Length = 505

 Score = 33.9 bits (77), Expect = 0.47
 Identities = 14/67 (20%), Positives = 21/67 (31%), Gaps = 3/67 (4%)

Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQ-NSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA 194
              +  Q QQ Q+Q   P    L   ++  +S   P      +  +     PG    VP 
Sbjct: 230 LVDRYPQFQQGQKQVLSPQQRFLHGMERYEASGMPPPAEWAGSNGL--HVLPGRREEVPR 287

Query: 195 GYPSLNQ 201
           G      
Sbjct: 288 GIFRCPS 294



 Score = 32.7 bits (74), Expect = 0.87
 Identities = 6/36 (16%), Positives = 11/36 (30%)

Query: 201 QAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNK 236
             G  +++   +  Q QQ              GM +
Sbjct: 222 HDGRSKRRLVDRYPQFQQGQKQVLSPQQRFLHGMER 257



 Score = 30.0 bits (67), Expect = 6.2
 Identities = 15/91 (16%), Positives = 29/91 (31%), Gaps = 8/91 (8%)

Query: 89  QQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQ 148
               +  Q+     QQ F      Y  S        G+ P    +          ++++ 
Sbjct: 234 YPQFQQGQKQVLSPQQRFLHGMERYEAS--------GMPPPAEWAGSNGLHVLPGRREEV 285

Query: 149 QQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQ 179
            + I   P   +S+   ++    N + NN Q
Sbjct: 286 PRGIFRCPSPESSRGEPTHLDYRNGMANNAQ 316


>gnl|CDD|213398 cd12191, gal11_coact, gall11 coactivator domain.  Gall11/MED15 acts
           in the general regulation of GAL structural genes and is
           required for full expression for several genes in this
           pathway, including GALs 1,7, and 10 in Saccharomyces
           cerevisiae. GAL11 function is dependent on GCN4
           functionality and binds GCN4 in a degenerate manner with
           multiple orientations found at the GCN4-Gal11 interface.
          Length = 90

 Score = 31.2 bits (71), Expect = 0.49
 Identities = 28/102 (27%), Positives = 36/102 (35%), Gaps = 27/102 (26%)

Query: 136 QQQQQQQQQQQQQQQPIPN----------IPP-----LPNSQQNSSNFQPSNNLYNNIQV 180
            QQQQQ QQQQQQQ P             IPP     +PN       ++          +
Sbjct: 1   PQQQQQPQQQQQQQMPQNPQLVNMMDNMPIPPQLLAKIPNLPPEVKTWRQLKAWLQQNNI 60

Query: 181 PPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLN 222
           PP+A                +   Q   Q   Q++QQQQ   
Sbjct: 61  PPQAME------------QIKEVQQTHFQLLMQRRQQQQQQQ 90


>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
          Length = 110

 Score = 31.4 bits (72), Expect = 0.52
 Identities = 9/24 (37%), Positives = 15/24 (62%)

Query: 201 QAGYQQQQQQQQQQQQQQQTLNSQ 224
           Q   Q+QQQ Q++Q++ Q    +Q
Sbjct: 15  QKQAQEQQQAQEEQEEAQAQAEAQ 38



 Score = 28.3 bits (64), Expect = 7.4
 Identities = 7/15 (46%), Positives = 12/15 (80%)

Query: 136 QQQQQQQQQQQQQQQ 150
           +QQQ Q++Q++ Q Q
Sbjct: 20  EQQQAQEEQEEAQAQ 34



 Score = 28.3 bits (64), Expect = 7.7
 Identities = 7/16 (43%), Positives = 12/16 (75%)

Query: 135 YQQQQQQQQQQQQQQQ 150
            Q+QQQ Q++Q++ Q 
Sbjct: 18  AQEQQQAQEEQEEAQA 33



 Score = 27.9 bits (63), Expect = 9.0
 Identities = 8/15 (53%), Positives = 13/15 (86%)

Query: 136 QQQQQQQQQQQQQQQ 150
           Q+Q Q+QQQ Q++Q+
Sbjct: 15  QKQAQEQQQAQEEQE 29


>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
           Provisional.
          Length = 319

 Score = 33.1 bits (75), Expect = 0.61
 Identities = 19/89 (21%), Positives = 29/89 (32%), Gaps = 1/89 (1%)

Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAG 195
           Q+Q  +Q Q   +QQP   +  +P ++Q     Q +       Q     Q     S    
Sbjct: 118 QRQLLEQMQADMRQQPT-QLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSRTTE 176

Query: 196 YPSLNQAGYQQQQQQQQQQQQQQQTLNSQ 224
                Q    Q    Q Q +Q +     Q
Sbjct: 177 QSWQQQTRTSQAAPVQAQPRQSKPASTQQ 205



 Score = 32.0 bits (72), Expect = 1.6
 Identities = 29/127 (22%), Positives = 43/127 (33%), Gaps = 12/127 (9%)

Query: 88  PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
           P+Q    Q+Q     Q   R  P    +     P+N   TP       +Q+QQ  Q+Q+Q
Sbjct: 111 PEQLTPEQRQLLEQMQADMRQQPTQLVE----VPWNEQ-TP-------EQRQQTLQRQRQ 158

Query: 148 QQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQ 207
            QQ          S+    ++Q          V  + +     S    Y  L Q      
Sbjct: 159 AQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQDLLQTPAHTT 218

Query: 208 QQQQQQQ 214
            Q + QQ
Sbjct: 219 AQSKPQQ 225


>gnl|CDD|217451 pfam03249, TSA, Type specific antigen.  There are several antigenic
           variants in Rickettsia tsutsugamushi, and a
           type-specific antigen (TSA) of 56-kilodaltons located on
           the rickettsial surface is responsible for the
           variation. TSA proteins are probably integral membrane
           proteins.
          Length = 502

 Score = 33.3 bits (76), Expect = 0.63
 Identities = 24/72 (33%), Positives = 34/72 (47%), Gaps = 8/72 (11%)

Query: 154 NIP--PLPNS---QQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQ 208
           ++P  PLPNS   +Q  +  Q  N++   ++    +  G   +  A    LN    QQ Q
Sbjct: 255 DVPDTPLPNSASVEQIQNKMQELNDVLEELR---ESFDGYIGNAFANQIQLNFVIPQQAQ 311

Query: 209 QQQQQQQQQQQT 220
           QQ Q QQQQ Q 
Sbjct: 312 QQGQGQQQQAQA 323



 Score = 32.5 bits (74), Expect = 1.0
 Identities = 11/15 (73%), Positives = 11/15 (73%)

Query: 136 QQQQQQQQQQQQQQQ 150
           QQ QQQ Q QQQQ Q
Sbjct: 308 QQAQQQGQGQQQQAQ 322



 Score = 31.8 bits (72), Expect = 2.0
 Identities = 11/16 (68%), Positives = 11/16 (68%)

Query: 205 QQQQQQQQQQQQQQQT 220
           Q QQQ Q QQQQ Q T
Sbjct: 309 QAQQQGQGQQQQAQAT 324



 Score = 31.8 bits (72), Expect = 2.1
 Identities = 11/19 (57%), Positives = 12/19 (63%)

Query: 206 QQQQQQQQQQQQQQTLNSQ 224
           QQ QQQ Q QQQQ    +Q
Sbjct: 308 QQAQQQGQGQQQQAQATAQ 326



 Score = 30.6 bits (69), Expect = 3.9
 Identities = 10/14 (71%), Positives = 10/14 (71%)

Query: 138 QQQQQQQQQQQQQP 151
           QQ QQQ Q QQQQ 
Sbjct: 308 QQAQQQGQGQQQQA 321


>gnl|CDD|220232 pfam09421, FRQ, Frequency clock protein.  The frequency clock
           protein, is the central component of the frq-based
           circadian negative feedback loop, regulates various
           aspects of the circadian clock in Neurospora crassa.
           This protein has been shown to interact with itself via
           a coiled-coil.
          Length = 989

 Score = 33.4 bits (76), Expect = 0.65
 Identities = 15/49 (30%), Positives = 20/49 (40%), Gaps = 7/49 (14%)

Query: 114 SQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQ 162
           +QS+ T   +  L P+       Q QQQ+  Q    QP  N    P S 
Sbjct: 319 TQSTLTPSVDAALPPV-------QAQQQEGTQMAPPQPPSNFITNPFSS 360



 Score = 31.1 bits (70), Expect = 3.0
 Identities = 14/52 (26%), Positives = 19/52 (36%), Gaps = 4/52 (7%)

Query: 183 RAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLSVNQGM 234
           R Q    PSV A  P +      QQQ+  Q    Q  +        SV + +
Sbjct: 318 RTQSTLTPSVDAALPPVQA----QQQEGTQMAPPQPPSNFITNPFSSVPEPL 365


>gnl|CDD|234683 PRK00191, tatA, twin arginine translocase protein A; Provisional.
          Length = 84

 Score = 30.7 bits (69), Expect = 0.68
 Identities = 16/34 (47%), Positives = 18/34 (52%)

Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQ 169
           Q  +QQQQQ QQQ  P     P PN QQ+    Q
Sbjct: 50  QTPEQQQQQPQQQIAPNQIEAPQPNFQQHYQGQQ 83


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 32.7 bits (75), Expect = 0.75
 Identities = 22/129 (17%), Positives = 33/129 (25%), Gaps = 16/129 (12%)

Query: 88  PQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQ 147
                 PQ Q            P      ++  P +            QQ   Q Q +Q 
Sbjct: 82  EAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQP 141

Query: 148 QQQPI-PNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQ 206
            QQP+ P + P P                     P  AQ    P+ P   P         
Sbjct: 142 LQQPVSPQVAPAPQPVH---------------SAPQPAQQAFQPAEPVAAPQPEPVAEPA 186

Query: 207 QQQQQQQQQ 215
               + +++
Sbjct: 187 PVMDKPKRK 195


>gnl|CDD|220882 pfam10818, DUF2547, Protein of unknown function (DUF2547).  This
           bacterial family of proteins has no known function.
          Length = 90

 Score = 30.5 bits (69), Expect = 0.85
 Identities = 14/68 (20%), Positives = 17/68 (25%), Gaps = 11/68 (16%)

Query: 91  FQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQ 150
           F  P+ Q             NT                +   +   QQ QQQQ      Q
Sbjct: 16  FALPEAQQLQAQNANNENEQNTVINQ-----------QIAQTAQIAQQVQQQQLFIALSQ 64

Query: 151 PIPNIPPL 158
               I P 
Sbjct: 65  KPLQIAPH 72


>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor.  This family is secreted by
           gram-negative Gammaproteobacteria such as Pseudomonas
           syringae of tomato and the fire blight plant pathogen
           Erwinia amylovora, amongst others. It is an essential
           pathogenicity factor of approximately 198 kDa. Its
           injection into the host-plant is dependent upon the
           bacterial type III or Hrp secretion system. The family
           is long and carries a number of predicted functional
           regions, including an ERMS or endoplasmic reticulum
           membrane retention signal at both the C- and the
           N-termini, a leucine-zipper motif from residues 539-560,
           and a nuclear localisation signal at 1358-1361. this
           conserved AvrE-family of effectors is among the few that
           are required for full virulence of many phytopathogenic
           pseudomonads, erwinias and pantoeas.
          Length = 1771

 Score = 32.8 bits (75), Expect = 1.1
 Identities = 23/148 (15%), Positives = 43/148 (29%), Gaps = 7/148 (4%)

Query: 9   QGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQ----PQVPVSNSSLQTPLGSSNMM 64
           Q P S  SA  + +   +A +  +     P +         Q  ++ SS       S   
Sbjct: 104 QAPESSPSARRLTRSEGVARHEMEDLAGRPVVKPDADRQLRQDILNKSSSSRRPPVSKEE 163

Query: 65  RPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNN 124
                        A +         Q     + +     +  R   N  +     AP   
Sbjct: 164 GTSSKMPATALASAALFKDDEIR--QEVDAARSDQASQSRLSRSRGNPPAIPPDAAPRQP 221

Query: 125 GLTPLTNASLYQQQQQQQQQQQQQQQPI 152
            LT  +    ++ + +  ++  Q Q PI
Sbjct: 222 MLTR-SAGGRFEGEDENLERNLQPQSPI 248


>gnl|CDD|233230 TIGR01000, bacteriocin_acc, bacteriocin secretion accessory
           protein.  This family represents an accessory protein
           that works with the bacteriocin maturation and ABC
           transport secretion protein described by TIGR01193
           [Transport and binding proteins, Other].
          Length = 457

 Score = 32.4 bits (74), Expect = 1.1
 Identities = 28/118 (23%), Positives = 41/118 (34%), Gaps = 9/118 (7%)

Query: 112 TYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPS 171
              Q +  +   N     T A L QQ  +  Q+ Q  Q     I    ++    +NF P 
Sbjct: 155 ETQQQNDKSQTQNEAAEKTKAQLDQQISKTDQKLQDYQALKNAI----SNGTKVANFNPY 210

Query: 172 NNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNLS 229
            +LY N Q    AQ     S       +        QQQ  Q Q+   +   Q + L+
Sbjct: 211 QSLYENYQ----AQ-LKSASDKDQKNQVKSTILATIQQQIDQLQKSIASYQVQKAGLT 263



 Score = 30.1 bits (68), Expect = 6.4
 Identities = 21/108 (19%), Positives = 40/108 (37%), Gaps = 10/108 (9%)

Query: 126 LTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSN------NLYNNIQ 179
           L    N +   Q+Q  +QQ    +    ++  L  S +N  N  P++      NL+N   
Sbjct: 87  LVVYDNGNEENQKQLLEQQLDNLKDQKKSLDTLKQSIENGRNQFPTDDSFGYRNLFNGYL 146

Query: 180 VPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSN 227
               AQ  +  S        +Q   +  ++ + Q  QQ    + +  +
Sbjct: 147 ----AQVESLTSETQQQNDKSQTQNEAAEKTKAQLDQQISKTDQKLQD 190


>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
           Reviewed.
          Length = 830

 Score = 32.4 bits (74), Expect = 1.3
 Identities = 31/114 (27%), Positives = 40/114 (35%), Gaps = 27/114 (23%)

Query: 123 NNGLTPLTNAS----LYQQQQQQQQQQQQQQQPIP--NIPP---LPNSQQNSSNFQPSNN 173
           NN  T L  AS    +YQ+  + Q        P P    PP   +        NF   + 
Sbjct: 723 NNQPTKLYGASGAMTIYQRYLENQT-------PTPLNLTPPEDIVDMGVDYDGNFVCGSG 775

Query: 174 LYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSN 227
               + V        +   P    SL Q    QQQ  Q QQQ QQQ    + S+
Sbjct: 776 GMRVLPV--------WTDDPQ---SLCQQSEMQQQPSQPQQQPQQQPAEQKDSD 818


>gnl|CDD|184256 PRK13700, PRK13700, conjugal transfer protein TraD; Provisional.
          Length = 732

 Score = 32.2 bits (73), Expect = 1.4
 Identities = 26/99 (26%), Positives = 46/99 (46%), Gaps = 6/99 (6%)

Query: 130 TNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNY 189
           T A   QQ QQ QQ QQ QQ   P  P + + + ++    P+  +   +++ P  +    
Sbjct: 617 TQAEQPQQPQQPQQPQQPQQPQQPVSPVINDKKSDAGVNVPAGGIEQELKMKPEEEMEQQ 676

Query: 190 --PSVPAGYPSLNQAGYQQQQQQQ----QQQQQQQQTLN 222
             P +      ++ A Y+  QQ+     QQQ Q+++ +N
Sbjct: 677 LPPGISESGEVVDMAAYEAWQQENHPDIQQQMQRREEVN 715


>gnl|CDD|220888 pfam10846, DUF2722, Protein of unknown function (DUF2722).  This
           eukaryotic family of proteins has no known function.
          Length = 373

 Score = 32.2 bits (73), Expect = 1.4
 Identities = 34/172 (19%), Positives = 58/172 (33%), Gaps = 18/172 (10%)

Query: 29  NPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVP 88
            P Q    A P+     + P S+S L          R P    R  A  A + +L+TP  
Sbjct: 135 APEQNGTNASPVPPSSYKFPPSSSGLTPRHTVLPTHRRPNSPARIGA--AAVANLATPTT 192

Query: 89  ---------QQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQ 139
                        R ++  +   Q+   +P NT +  +  +      T + N      Q 
Sbjct: 193 PYKEESLGASSPLRRKKFGSQLHQRNMSLPSNTPTSGNTNSNIPKPATSVLNFKPSPAQP 252

Query: 140 QQQQ---QQQQQQQPIPN----IPPLPNSQQNSSNFQPSNNLYNNIQVPPRA 184
             +Q     Q  Q+ + +    I   P SQQ       S++ +  I +   +
Sbjct: 253 LHKQSKSAPQPSQESMTSFQHIIQWKPESQQKKHRRTKSSSSFGVIDLNSIS 304


>gnl|CDD|234371 TIGR03839, termin_org_P1, adhesin P1.  Members of this protein
           family are the major adhesin of the Mycoplasma terminal
           organelle. The protein is called adhesin P1, cytadhesin
           P1, P140, attachment protein, and MgPa, with locus names
           MG191 in Mycoplasma genitalium and MPN141 in M.
           pneumoniae. A conserved C-terminal region is shared by
           additional paralogs in M. pneumoniae and M.
           gallisepticum, as well as by the member of this family
           [Cell envelope, Surface structures, Cellular processes,
           Pathogenesis].
          Length = 1425

 Score = 32.4 bits (73), Expect = 1.5
 Identities = 44/215 (20%), Positives = 70/215 (32%), Gaps = 21/215 (9%)

Query: 10  GPNSYGSASNVMQ--VPLIASNPSQPSGAAPPISG--LQPQVPVSN------SSLQTPLG 59
           G N   S  + +   VP I +     +  AP  S          SN        L + L 
Sbjct: 753 GTNLDSSVLDQVLDYVPWIGNGYRYGNNTAPQTSAGSSSGLPTFSNITVGTKQQLASTLA 812

Query: 60  SSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNAN-----HSQQTFRMPPNTYS 114
             N M        +   G   +  S     +F   Q Q+        S+Q          
Sbjct: 813 GENTMITGGSPQLWTGAGWRNDKASDENHTKFTSAQGQSGTSAGNPRSKQDNISKSGDSQ 872

Query: 115 QSSKTAPFNNGLTPLT---NASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSN-FQP 170
           +++        LTP     NA  +  +   Q+ Q   +  + NIP L N+  N++N FQ 
Sbjct: 873 EATNYTNLPPNLTPTADWLNALTFTNKNNDQRDQLFLRGLLGNIPVLVNTSGNNTNKFQA 932

Query: 171 SNNLYNNIQVPPRAQPGNYPSVPAGY-PSLNQAGY 204
           ++  +          P  +  V   Y  +LN   Y
Sbjct: 933 TDQKWTYTYQVGGNLPT-WGEVNGLYNTALNITYY 966


>gnl|CDD|221868 pfam12938, M_domain, M domain of GW182. 
          Length = 238

 Score = 31.4 bits (71), Expect = 1.6
 Identities = 40/204 (19%), Positives = 57/204 (27%), Gaps = 11/204 (5%)

Query: 26  IASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSS-NMMRPPLVTNRFQAPGATINSLS 84
           +           P         P S ++L   LG            N      A+++SL+
Sbjct: 5   MGFAGPFGGDRFPSGGSSVNSPPFSQNNLPNNLGGGGGGPGGGGGGNN--PNLASLSSLT 62

Query: 85  TPVPQQF---QRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQ 141
           +    +     +P               P         AP N            QQ +  
Sbjct: 63  SQGLGKILSGLQPPPLGNGGGSGAGGPGPVGGGGGPGVAPNNIQPNAQAQQPSTQQLRML 122

Query: 142 QQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQ-----PGNYPSVPAGY 196
            QQ Q   Q     P + N           N L  NI+V  +AQ      GN   V    
Sbjct: 123 VQQIQMAVQKGYLNPQILNQPLAPQTLLLLNQLLQNIKVLQQAQQSLQRGGNASGVEPVQ 182

Query: 197 PSLNQAGYQQQQQQQQQQQQQQQT 220
            ++     +QQ  Q Q Q   QQ 
Sbjct: 183 LNVEINKLKQQISQLQNQINAQQA 206


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 31.7 bits (72), Expect = 1.7
 Identities = 16/68 (23%), Positives = 27/68 (39%), Gaps = 4/68 (5%)

Query: 87  VPQQFQRPQQQNANHS----QQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQ 142
           V +Q+ R QQQ  +      Q+  +          K A     L  L    L  Q+Q++Q
Sbjct: 60  VVEQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ 119

Query: 143 QQQQQQQQ 150
            ++  +Q 
Sbjct: 120 AEEAAKQA 127


>gnl|CDD|237624 PRK14143, PRK14143, heat shock protein GrpE; Provisional.
          Length = 238

 Score = 31.2 bits (71), Expect = 1.9
 Identities = 17/89 (19%), Positives = 32/89 (35%), Gaps = 2/89 (2%)

Query: 139 QQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPS 198
             +Q   + +   I       NS ++S           N +        +  S  A   +
Sbjct: 2   TPEQDPLEVKLAVISESEAEDNSPESSEEVTEQEAELTNPEGDAAEAESSPDSGSAASET 61

Query: 199 L--NQAGYQQQQQQQQQQQQQQQTLNSQF 225
              N A   Q +Q+ +  +Q+ + LNSQ+
Sbjct: 62  AADNAARLAQLEQELESLKQELEELNSQY 90


>gnl|CDD|216828 pfam01984, dsDNA_bind, Double-stranded DNA-binding domain.  This
           domain is believed to bind double-stranded DNA of 20
           bases length.
          Length = 105

 Score = 29.8 bits (68), Expect = 1.9
 Identities = 10/20 (50%), Positives = 16/20 (80%)

Query: 200 NQAGYQQQQQQQQQQQQQQQ 219
            Q G  ++QQQQ++QQQQ++
Sbjct: 11  QQQGGLEEQQQQEEQQQQEE 30



 Score = 29.1 bits (66), Expect = 3.7
 Identities = 10/20 (50%), Positives = 16/20 (80%)

Query: 200 NQAGYQQQQQQQQQQQQQQQ 219
            Q G ++QQQQ++QQQQ++ 
Sbjct: 12  QQGGLEEQQQQEEQQQQEEA 31



 Score = 28.3 bits (64), Expect = 5.7
 Identities = 9/23 (39%), Positives = 15/23 (65%)

Query: 201 QAGYQQQQQQQQQQQQQQQTLNS 223
           +   QQ++QQQQ++ Q+Q  L  
Sbjct: 17  EEQQQQEEQQQQEEAQKQAILRQ 39



 Score = 28.3 bits (64), Expect = 6.0
 Identities = 10/21 (47%), Positives = 16/21 (76%)

Query: 205 QQQQQQQQQQQQQQQTLNSQF 225
           QQQ++QQQQ++ Q+Q +  Q 
Sbjct: 20  QQQEEQQQQEEAQKQAILRQI 40


>gnl|CDD|221501 pfam12273, RCR, Chitin synthesis regulation, resistance to Congo
           red.  RCR proteins are ER membrane proteins that
           regulate chitin deposition in fungal cell walls.
           Although chitin, a linear polymer of beta-1,4-linked
           N-acetylglucosamine, constitutes only 2% of the cell
           wall it plays a vital role in the overall protection of
           the cell wall against stress, noxious chemicals and
           osmotic pressure changes. Congo red is a cell
           wall-disrupting benzidine-type dye extensively used in
           many cell wall mutant studies that specifically targets
           chitin in yeast cells and inhibits growth. RCR proteins
           render the yeasts resistant to Congo red by diminishing
           the content of chitin in the cell wall. RCR proteins are
           probably regulating chitin synthase III interact
           directly with ubiquitin ligase Rsp5, and the VPEY motif
           is necessary for this, via interaction with the WW
           domains of Rsp5.
          Length = 124

 Score = 30.1 bits (68), Expect = 1.9
 Identities = 14/87 (16%), Positives = 26/87 (29%), Gaps = 11/87 (12%)

Query: 146 QQQQQPIPNI----PPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVP-----AGY 196
           ++  QPI       PP     Q   N  P         VPP  +  N   +        +
Sbjct: 28  RRGLQPIYGTAWLAPPSYRQSQQQYNNTPQGRPPQ--YVPPYTETANENDLGYYDGQGEF 85

Query: 197 PSLNQAGYQQQQQQQQQQQQQQQTLNS 223
               +A   + Q      ++  ++  +
Sbjct: 86  HPNPKAPAIELQPPPNAYERGTRSPTT 112



 Score = 29.0 bits (65), Expect = 5.6
 Identities = 14/83 (16%), Positives = 22/83 (26%), Gaps = 11/83 (13%)

Query: 123 NNGLTPLTNASLYQ---QQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNF--------QPS 171
             GL P+   +       +Q QQQ     Q   P   P      N ++            
Sbjct: 28  RRGLQPIYGTAWLAPPSYRQSQQQYNNTPQGRPPQYVPPYTETANENDLGYYDGQGEFHP 87

Query: 172 NNLYNNIQVPPRAQPGNYPSVPA 194
           N     I++ P        +   
Sbjct: 88  NPKAPAIELQPPPNAYERGTRSP 110


>gnl|CDD|227823 COG5536, BET4, Protein prenyltransferase, alpha subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 328

 Score = 31.4 bits (71), Expect = 1.9
 Identities = 18/86 (20%), Positives = 28/86 (32%), Gaps = 9/86 (10%)

Query: 237 MWGYEPHDLLNTRDILPATKVEPPPIRLPQNLPEIHEQFLYDRNTEMFG-DPYRRPEIQS 295
           M G +  DL     I+   ++    +   +NLP +H     +            R   Q 
Sbjct: 246 MIGEKVEDLGKYIVIINGKEL---DLGPKENLPCLHSLLELEFLCHAEKALLTERDIEQK 302

Query: 296 ATVEYLATADYTVRDPPQPAMYLFLL 321
           A VE          DP +  +Y  L 
Sbjct: 303 ALVELAIKV-----DPARRNLYSTLH 323


>gnl|CDD|223716 COG0643, CheA, Chemotaxis protein histidine kinase and related
           kinases [Cell motility and secretion / Signal
           transduction mechanisms].
          Length = 716

 Score = 31.9 bits (73), Expect = 2.0
 Identities = 17/104 (16%), Positives = 39/104 (37%), Gaps = 24/104 (23%)

Query: 709 LPLYVLALLKHPAFRIGQSTRLDDRLFAMTEMKCLPLNSLMLSIYPELYPIHTLDQDPTI 768
           LPL  LA+++          R+   ++A      +PL S+  ++  +   I  ++    +
Sbjct: 571 LPL-TLAIIQ------ALLVRVGGEVYA------IPLASVEETLRIDPEDIKPVEGQEVL 617

Query: 769 EWNELQVPIPR---ILQLSAERLQATGTYLLNMPDLIIILVRHG 809
                 +P+ R   +L +     +             +++VR G
Sbjct: 618 RLRGEVLPLVRLGELLNVPPASPEPE--------QGPVVIVRSG 653


>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein;
           Provisional.
          Length = 182

 Score = 31.0 bits (70), Expect = 2.1
 Identities = 12/59 (20%), Positives = 21/59 (35%), Gaps = 1/59 (1%)

Query: 169 QPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSN 227
           +  N   N +Q     +  N  + P  Y + +Q  Y  Q Q   Q    ++   +Q   
Sbjct: 108 KSDNPQANAMQDNSFHENFNN-NYPGNYNNPSQDPYMNQAQSYNQNAYAKENQQAQPPK 165



 Score = 30.2 bits (68), Expect = 3.1
 Identities = 16/66 (24%), Positives = 26/66 (39%), Gaps = 5/66 (7%)

Query: 154 NIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPSLNQAGYQQQQQQQQQ 213
           +  P  N+ Q++S  +  NN Y      P   P    +      S NQ  Y ++ QQ Q 
Sbjct: 109 SDNPQANAMQDNSFHENFNNNYPGNYNNPSQDPYMNQA-----QSYNQNAYAKENQQAQP 163

Query: 214 QQQQQQ 219
            + +  
Sbjct: 164 PKYENS 169


>gnl|CDD|227549 COG5224, HAP2, CCAAT-binding factor, subunit B [Transcription].
          Length = 248

 Score = 31.3 bits (70), Expect = 2.2
 Identities = 17/86 (19%), Positives = 21/86 (24%), Gaps = 11/86 (12%)

Query: 157 PLPNSQQNSSNFQPSNNLYNNIQVPPRAQPGNYPSVPAGYPS-----------LNQAGYQ 205
           P P +   +S    +N   NNI  P      +  S P    S           L Q    
Sbjct: 61  PAPINSATASMSPANNTSGNNITSPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSN 120

Query: 206 QQQQQQQQQQQQQQTLNSQFSNLSVN 231
                    Q       S   N S  
Sbjct: 121 TSNLMSSGSQLGSFATQSTNGNNSTT 146


>gnl|CDD|214621 smart00327, VWA, von Willebrand factor (vWF) type A domain.  VWA
           domains in extracellular eukaryotic proteins mediate
           adhesion via metal ion-dependent adhesion sites (MIDAS).
           Intracellular VWA domains and homologues in prokaryotes
           have recently been identified. The proposed VWA domains
           in integrin beta subunits have recently been
           substantiated using sequence-based methods.
          Length = 175

 Score = 30.5 bits (69), Expect = 2.3
 Identities = 24/123 (19%), Positives = 39/123 (31%), Gaps = 35/123 (28%)

Query: 319 FLLDVSQ-IAAQSGYLYTVCEVLLNQLKSMP-GDRRTSIAIITYDSAVHFYSLAEGQTQP 376
           FLLD S  +           E +L  ++ +  G     + ++T+                
Sbjct: 4   FLLDGSGSMGGN--RFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDAR----------- 50

Query: 377 SQMILTDIDDIFLPSPENILVNLSECYDMIVDLLKQLPGKFKDSLDSGSALGPALQAAFK 436
                               +N S   D +++ L  L  K    L  G+ LG ALQ A +
Sbjct: 51  ----------------VLFPLNDSRSKDALLEALASLSYK----LGGGTNLGAALQYALE 90

Query: 437 LLQ 439
            L 
Sbjct: 91  NLF 93


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 31.6 bits (71), Expect = 2.6
 Identities = 34/198 (17%), Positives = 64/198 (32%), Gaps = 12/198 (6%)

Query: 5   VSNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMM 64
           V  K GP      S   ++P ++  P  P     P    +P+ P    S Q P    +  
Sbjct: 557 VGKKPGPAKEHKPS---KIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPK 613

Query: 65  RPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNN 124
            P L+      P +     S   P++   PQ+ ++    +  ++  +     S   PF+ 
Sbjct: 614 LPELL----DIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFD- 668

Query: 125 GLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPP-R 183
              P      Y        + ++ +  +       +  + +    P         +PP  
Sbjct: 669 ---PKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKL 725

Query: 184 AQPGNYPSVPAGYPSLNQ 201
            +   +P  P G P   Q
Sbjct: 726 PRDEEFPFEPIGDPDAEQ 743


>gnl|CDD|185081 PRK15127, PRK15127, multidrug efflux system protein AcrB;
           Provisional.
          Length = 1049

 Score = 31.4 bits (71), Expect = 2.6
 Identities = 33/126 (26%), Positives = 53/126 (42%), Gaps = 24/126 (19%)

Query: 52  SSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQNANHSQQTFRMPPN 111
           + L+ P+     + PP VT     PGA   ++   V Q      +QN N       M  N
Sbjct: 26  AILKLPVAQYPTIAPPAVTISASYPGADAKTVQDTVTQVI----EQNMNGIDNLMYMSSN 81

Query: 112 TYSQSSK--TAPFNNGLTPLTNASLYQQQQQQQQQ-------QQQQQQPIPNIPPLPNSQ 162
           + S  +   T  F +G    T+A + Q Q Q + Q       Q+ QQQ +       + +
Sbjct: 82  SDSTGTVQITLTFESG----TDADIAQVQVQNKLQLAMPLLPQEVQQQGV-------SVE 130

Query: 163 QNSSNF 168
           ++SS+F
Sbjct: 131 KSSSSF 136


>gnl|CDD|220724 pfam10380, CRF1, Transcription factor CRF1.  CRF1 is a
           transcription factor that co-represses ribosomal genes
           with FHL1 via the TOR signalling pathway and protein
           kinase A.
          Length = 114

 Score = 29.7 bits (67), Expect = 2.8
 Identities = 7/41 (17%), Positives = 16/41 (39%)

Query: 129 LTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQ 169
           L+  SLY    Q  + ++ + Q I        +  + ++  
Sbjct: 57  LSTRSLYPLITQNNKSEEPRLQSITQTDASEENDVSDASDD 97


>gnl|CDD|114726 pfam06020, Roughex, Drosophila roughex protein.  This family
           consists of several roughex (RUX) proteins specific to
           Drosophila species. Roughex can influence the
           intracellular distribution of cyclin A and is therefore
           defined as a distinct and specialised cell cycle
           inhibitor for cyclin A-dependent kinase activity. Rux is
           though to regulate the metaphase to anaphase transition
           during development.
          Length = 334

 Score = 31.1 bits (70), Expect = 2.8
 Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 136 QQQQQQQQQQQQQQQPIPNIP---PLPNSQQNSSNFQPSNNLY--NNIQVPPRAQPG 187
           +Q+Q   QQ ++  +   NIP   P P    N S++ P       N  +VPP+  PG
Sbjct: 272 EQEQPAPQQAEETAREAVNIPVDLPTPLETTNFSSYTPRKRQQTTNGNEVPPKRTPG 328


>gnl|CDD|220771 pfam10471, APC_CDC26, Anaphase-promoting complex APC subunit 1.
           The anaphase-promoting complex (APC) or cyclosome is a
           cell cycle-regulated ubiquitin-protein ligase that
           regulates important events in mitosis such as the
           initiation of anaphase and exit from telophase. The APC,
           in conjunction with other enzymes, assembles
           multi-ubiquitin chains on a variety of regulatory
           proteins thereby targeting them for proteolysis by the
           26S proteasome. CDC26 is one of the nine or so subunits
           identified within APC but its exact function is not
           known. The APC/C becomes active at the
           metaphase/anaphase transition and remains active during
           G1 phase. One mechanism linked to activation of the
           APC/C is phosphorylation. The yeast APC/C is composed of
           at least 13 subunits, but the function of many of the
           subunits is unknown. Hcn1 is the smallest subunit of the
           S. pombe APC/C, and is found to be essential for cell
           viability, APC/C integrity, and proper APC/C regulation.
           In addition, Hcn1 phosphorylation indicates a specific
           role for the phosphorylation of this subunit late in the
           cell cycle.
          Length = 78

 Score = 28.6 bits (64), Expect = 2.8
 Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 3/48 (6%)

Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPR 183
           ++Q+ QQQQ  QQQ    N       +  +S   P+++     Q   R
Sbjct: 24  KEQKLQQQQNAQQQSVSSN---SAAGETAASTDSPNDSSKPLPQDSAR 68


>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein.  This family includes FlxA from
           E. coli. The expression of FlxA is regulated by the FliA
           sigma factor, a transcription factor specific for class
           3 flagellar operons. However FlxA is not required for
           flagellar function or formation.
          Length = 106

 Score = 29.3 bits (66), Expect = 3.0
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 205 QQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNK 236
           Q QQQQ ++ QQQQQ+      + +  +G+N+
Sbjct: 69  QLQQQQSEEAQQQQQSSIEDSPSQTSAEGVNR 100


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 31.2 bits (70), Expect = 3.2
 Identities = 33/174 (18%), Positives = 53/174 (30%), Gaps = 18/174 (10%)

Query: 22  QVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATIN 81
           Q P +   P +P+         QP          +P G++ M+         Q P     
Sbjct: 653 QPPQVEITPYKPTWTQIGHIPYQP----------SPTGANTMLPIQWAPGTMQPPPRAPT 702

Query: 82  SLSTPV--PQQFQRPQQQNAN----HSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLY 135
            +  P   P + QRP           +      PP      ++      G      A+  
Sbjct: 703 PMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG 762

Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNS--SNFQPSNNLYNNIQVPPRAQPG 187
           + +           QP P  PP P  +     +   P      ++Q+ PRA PG
Sbjct: 763 RARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPG 816


>gnl|CDD|238163 cd00261, AAI_SS, AAI_SS: Alpha-Amylase Inhibitors (AAIs) and Seed
           Storage (SS) Protein subfamily; composed of cereal-type
           AAIs and SS proteins. They are mainly present in the
           seeds of a variety of plants. AAIs play an important
           role in the natural defenses of plants against insects
           and pathogens such as fungi, bacteria and viruses. AAIs
           impede the digestion of plant starch and proteins by
           inhibiting digestive alpha-amylases and proteinases.
           Also included in this subfamily are SS proteins such as
           2S albumin, gamma-gliadin, napin, and prolamin. These
           AAIs and SS proteins are also known allergens in humans.
          Length = 110

 Score = 29.3 bits (66), Expect = 3.3
 Identities = 26/94 (27%), Positives = 32/94 (34%), Gaps = 22/94 (23%)

Query: 135 YQQQQQQQQQQQ---------QQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQ 185
             Q  QQQ QQ          QQ   +   P  P         Q    L    Q+P + +
Sbjct: 3   QCQPGQQQPQQPLNSCREYLRQQCSGVGGPPVWPQQSCEVLRQQCCQQLA---QIPEQCR 59

Query: 186 PGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQ 219
                       +L Q      QQQQQQQ+QQQ 
Sbjct: 60  ----------CEALRQMVQGVIQQQQQQQEQQQG 83



 Score = 28.1 bits (63), Expect = 7.3
 Identities = 12/17 (70%), Positives = 12/17 (70%)

Query: 207 QQQQQQQQQQQQQTLNS 223
           QQQ Q  QQQ QQ LNS
Sbjct: 1   QQQCQPGQQQPQQPLNS 17


>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
           Provisional.
          Length = 172

 Score = 30.3 bits (68), Expect = 3.3
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 136 QQQQQQQQQQQQQQQPIPNIP 156
           QQ Q+ QQ  QQQ +P P  P
Sbjct: 135 QQAQRPQQAAQQQSRPAPQQP 155



 Score = 28.7 bits (64), Expect = 9.5
 Identities = 14/48 (29%), Positives = 17/48 (35%), Gaps = 3/48 (6%)

Query: 110 PNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQQQQQQ---QPIPN 154
           P    Q  +     N   P   A   QQ  QQQ +   QQ   QP  +
Sbjct: 115 PQGDDQGGQGGGNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQPAAD 162


>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
          Length = 246

 Score = 30.3 bits (69), Expect = 3.5
 Identities = 9/27 (33%), Positives = 12/27 (44%)

Query: 201 QAGYQQQQQQQQQQQQQQQTLNSQFSN 227
           + G  + Q Q   Q  Q Q L +QF  
Sbjct: 98  EQGLAEGQAQAAPQAAQLQALAAQFQE 124



 Score = 29.1 bits (66), Expect = 8.5
 Identities = 9/33 (27%), Positives = 14/33 (42%), Gaps = 5/33 (15%)

Query: 201 QAGYQQ-----QQQQQQQQQQQQQTLNSQFSNL 228
           +AGYQ+      +Q   + Q Q     +Q   L
Sbjct: 86  EAGYQEGLAQGLEQGLAEGQAQAAPQAAQLQAL 118


>gnl|CDD|215106 PLN00212, PLN00212, glutelin; Provisional.
          Length = 493

 Score = 30.6 bits (69), Expect = 3.9
 Identities = 15/41 (36%), Positives = 19/41 (46%)

Query: 124 NGLTPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQN 164
           NGL  L      QQ+Q QQQQQ+  QQ         + + N
Sbjct: 261 NGLQLLQPTLTQQQEQAQQQQQRLYQQVQYQQSQQTSGRWN 301


>gnl|CDD|182380 PRK10328, PRK10328, DNA binding protein, nucleoid-associated;
           Provisional.
          Length = 134

 Score = 29.5 bits (66), Expect = 4.0
 Identities = 10/43 (23%), Positives = 23/43 (53%), Gaps = 7/43 (16%)

Query: 205 QQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWGYEPHDLLN 247
           ++++QQQ++  ++Q+ +N+          + K  G  P +LL 
Sbjct: 42  EEEEQQQRELAERQEKINTWLE-------LMKADGINPEELLG 77


>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
          Length = 222

 Score = 30.0 bits (67), Expect = 4.2
 Identities = 13/54 (24%), Positives = 26/54 (48%), Gaps = 3/54 (5%)

Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQ---VPPRAQP 186
           +Q++ ++ + Q ++Q  P+  P    Q      QP+NN    ++    P R +P
Sbjct: 160 KQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNNKRKAMEEGLSPDRTRP 213



 Score = 30.0 bits (67), Expect = 5.0
 Identities = 10/38 (26%), Positives = 18/38 (47%)

Query: 129 LTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSS 166
           L  + +  ++Q+Q  QQ Q Q      P  PN+ +  +
Sbjct: 164 LEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNNKRKA 201


>gnl|CDD|221872 pfam12954, DUF3843, Protein of unknown function (DUF3843).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 325

 Score = 30.4 bits (69), Expect = 4.3
 Identities = 17/58 (29%), Positives = 25/58 (43%), Gaps = 6/58 (10%)

Query: 219 QTLNSQFSNLSVNQGMNKMWGYEPHDLLNTRDILPATKVEPPPIRLPQNLPEIHEQFL 276
             L   F    +N G + ++      L   RD+L       PPI   + LPE +E+FL
Sbjct: 140 GLLEEAFETAPINDGPSSLF------LHEGRDLLKMPFTPLPPIEPEEKLPEDYEKFL 191


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 30.7 bits (70), Expect = 4.6
 Identities = 30/114 (26%), Positives = 40/114 (35%), Gaps = 17/114 (14%)

Query: 127 TPLTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNN---LYNNIQVPPR 183
             L+  SL Q + +  Q   Q Q    ++    NSQ  S   QP      LY N Q   R
Sbjct: 116 ETLSTLSLRQLESRLAQTLDQLQNAQNDLAEY-NSQLVSLQTQPERAQAALYANSQ---R 171

Query: 184 AQP------GNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQTL---NSQFSNL 228
            Q       G      A  PS  +   Q +Q     Q   Q+     N+Q  +L
Sbjct: 172 LQQIRNLLKGGKVGGKALRPSQ-RVLLQAEQALLNAQNDLQRKSLEGNTQLQDL 224


>gnl|CDD|215597 PLN03137, PLN03137, ATP-dependent DNA helicase; Q4-like;
           Provisional.
          Length = 1195

 Score = 30.6 bits (69), Expect = 4.7
 Identities = 24/95 (25%), Positives = 40/95 (42%), Gaps = 2/95 (2%)

Query: 38  PPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINSLSTPVPQQFQRPQQQ 97
           P  +  Q    + N S Q    SSN +   L + + +   A   + +    Q    PQ +
Sbjct: 26  PQTNWSQHAKALENISSQRKFLSSNFLYS-LESQKDREMAARSINCAPVNVQTLASPQIE 84

Query: 98  NANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNA 132
            A H+  +  +  N Y +  KT P +NG T L++ 
Sbjct: 85  KAWHALSSLSIN-NNYLRPGKTPPIDNGSTDLSSD 118


>gnl|CDD|214832 smart00817, Amelin, Ameloblastin precursor (Amelin).  This family
           consists of several mammalian Ameloblastin precursor
           (Amelin) proteins. Matrix proteins of tooth enamel
           consist mainly of amelogenin but also of non-amelogenin
           proteins, which, although their volumetric percentage is
           low, have an important role in enamel mineralisation.
           One of the non-amelogenin proteins is ameloblastin, also
           known as amelin and sheathlin. Ameloblastin (AMBN) is
           one of the enamel sheath proteins which is though to
           have a role in determining the prismatic structure of
           growing enamel crystals.
          Length = 411

 Score = 30.2 bits (68), Expect = 4.9
 Identities = 40/157 (25%), Positives = 56/157 (35%), Gaps = 22/157 (14%)

Query: 74  QAPGAT-INSLSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFN----NGLTP 128
           Q PG   + SLS    +Q    Q  N          P + YS+     PFN    +GL P
Sbjct: 22  QQPGTPGMASLSLETMRQLGSLQGLN----------PLSQYSRFGFGKPFNSLWLHGLLP 71

Query: 129 LTNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNNIQVPPRAQ--- 185
               S +   + ++ + QQ +  +P  PP   SQ +    QP    +      P  Q   
Sbjct: 72  --PHSSFPWLRPREHETQQYEYSLPVHPPPLPSQPSLQPQQPGLKPFLQPTALPTNQATP 129

Query: 186 --PGNYPSVPAGYPSLNQAGYQQQQQQQQQQQQQQQT 220
              G  P +  G P L QA       Q     +  QT
Sbjct: 130 QKNGPQPPMHLGQPPLQQAELPMIPPQVAPSDKPPQT 166


>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
           C-methyltransferase; Provisional.
          Length = 390

 Score = 30.1 bits (68), Expect = 5.4
 Identities = 10/35 (28%), Positives = 15/35 (42%), Gaps = 4/35 (11%)

Query: 200 NQAGYQQ----QQQQQQQQQQQQQTLNSQFSNLSV 230
           N A   Q    Q+ Q+ Q+Q+ +  L  Q   L  
Sbjct: 69  NDALANQLTALQKAQESQKQELEGILKQQAKALDQ 103


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 30.0 bits (68), Expect = 5.8
 Identities = 22/98 (22%), Positives = 27/98 (27%), Gaps = 8/98 (8%)

Query: 6   SNKQGPNSYGSASNVMQVPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMR 65
           S    P+S     +    P   S+PS  S    P S      P S  S       +N   
Sbjct: 183 SPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPES------PTNPSP 236

Query: 66  PPLVTNRFQAPGATINSLST--PVPQQFQRPQQQNANH 101
           PP        P   +  LST  P P             
Sbjct: 237 PPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGI 274


>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein. 
          Length = 122

 Score = 28.4 bits (64), Expect = 6.5
 Identities = 8/46 (17%), Positives = 17/46 (36%), Gaps = 1/46 (2%)

Query: 204 YQQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKMWGYEPHDLLNTR 249
             + QQ+ +Q +++ + L          Q      G    +L N +
Sbjct: 7   LAEAQQELEQAEEKLEELQQYRLEYR-QQLSGSGQGISAAELRNYQ 51


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 30.0 bits (68), Expect = 7.0
 Identities = 9/20 (45%), Positives = 13/20 (65%), Gaps = 3/20 (15%)

Query: 205 QQQQQQQQ---QQQQQQQTL 221
           ++Q QQ+    +QQ QQQ L
Sbjct: 191 EKQSQQKTLLYEQQAQQQKL 210


>gnl|CDD|221416 pfam12090, Spt20, Spt20 family.  This presumed domain is found in
           the Spt20 proteins from both human and yeast. The Spt20
           protein is part of the SAGA complex which is a large
           cmplex mediating histone deacetylation. Yeast Spt20 has
           been shown to play a role in structural integrity of the
           SAGA complex as as no intact SAGA could be purified in
           spt20 deletion strains.
          Length = 191

 Score = 29.3 bits (66), Expect = 7.1
 Identities = 10/41 (24%), Positives = 15/41 (36%)

Query: 130 TNASLYQQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQP 170
              S  Q Q Q Q+QQ ++  P  N       Q+     + 
Sbjct: 87  PQQSQQQAQAQSQEQQDKENSPAQNQQADGQKQKPVKKPRV 127


>gnl|CDD|150393 pfam09712, PHA_synth_III_E, Poly(R)-hydroxyalkanoic acid synthase
           subunit (PHA_synth_III_E).  This entry represents the
           PhaE subunit of the heterodimeric class (class III) of
           polymerase for poly(R)-hydroxyalkanoic acids (PHAs),
           carbon and energy storage polymers of many bacteria. The
           most common PHA is polyhydroxybutyrate but about 150
           different constituent hydroxyalkanoic acids (HAs) have
           been identified in various species.
          Length = 293

 Score = 29.8 bits (67), Expect = 7.1
 Identities = 35/154 (22%), Positives = 48/154 (31%), Gaps = 18/154 (11%)

Query: 83  LSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQ 142
           +S   P Q Q             F        Q  +             A L Q Q Q  
Sbjct: 45  MSPAAPDQTQAAMDALMEQGGNFFGTAEQFLRQFEEQNA----------AHLEQMQAQFA 94

Query: 143 QQQQQQQQPIPNIP-----PLPNSQQ--NSSNFQPSNNLYNNIQVPPRAQPGNYPSVPA- 194
                              PL   Q+  +S +  P + L N      R+Q   + S+PA 
Sbjct: 95  GSLDTGDTTSDRAAQAWQLPLDGWQRMFSSLSPMPGDGLRNMPHQQFRSQLDPFLSMPAL 154

Query: 195 GYPSLNQAGYQQQQQQQQQQQQQQQTLNSQFSNL 228
           GY   +QA YQ   + Q + QQ  Q    Q+S  
Sbjct: 155 GYTREHQAQYQALIRAQMEYQQALQEYTGQYSQA 188


>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
          Length = 935

 Score = 30.0 bits (67), Expect = 7.2
 Identities = 33/139 (23%), Positives = 42/139 (30%), Gaps = 16/139 (11%)

Query: 35  GAAPPISGLQPQVPVSNSSLQTPLGSSNM-------MRPPLVTNRFQAPGATINSLSTP- 86
           G  PP+   QPQ      +     G+S         M  PLV  R+   GAT++    P 
Sbjct: 680 GPVPPVPATQPQYFDIPLTEPINQGASAAHFLPQQPMEGPLVPERWMFQGATLSQSVRPG 739

Query: 87  --VPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQQQ 144
               Q F  P  Q  NH             Q     P+         A   Q     Q Q
Sbjct: 740 VAQSQYFDLPLTQPINHGAPAAHFLH----QPPMEGPWVPEQWMFQGAPPSQGTDVVQHQ 795

Query: 145 QQQQQQP--IPNIPPLPNS 161
                    + N P +P S
Sbjct: 796 LDALGYVLHVLNHPGVPVS 814


>gnl|CDD|220844 pfam10675, DUF2489, Protein of unknown function (DUF2489).  This is
           a bacterial family of uncharacterized proteins.
          Length = 131

 Score = 28.7 bits (65), Expect = 7.5
 Identities = 5/17 (29%), Positives = 12/17 (70%)

Query: 134 LYQQQQQQQQQQQQQQQ 150
           L +Q++ Q++ Q + +Q
Sbjct: 15  LKKQKELQEKAQAEAKQ 31


>gnl|CDD|226036 COG3505, VirD4, Type IV secretory pathway, VirD4 components
           [Intracellular trafficking and secretion].
          Length = 596

 Score = 29.7 bits (67), Expect = 7.7
 Identities = 20/113 (17%), Positives = 33/113 (29%), Gaps = 15/113 (13%)

Query: 295 SATVEYLATADYTVRDP---PQPAMYLFLLDVSQIAAQSGYLYTVCEVLLNQLKSMPGDR 351
           +  ++ L      +RD     + +    L  +    A         +  L Q  S   D 
Sbjct: 271 TQAIKGLEGVFGRLRDILSSTELSSLAMLTRILLGDALPW-----VKAALGQFFSNKTDE 325

Query: 352 RTSIAIITYDSAVHFYSLAE--GQTQPSQMILTDI-----DDIFLPSPENILV 397
            TS  + T  SA+           T  S   + D+        F   P+ I  
Sbjct: 326 ETSGVLSTASSALTLLMNPLVAAATSGSDFDIRDLRKKKTTVYFGLDPDRISR 378


>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 106

 Score = 28.0 bits (63), Expect = 7.8
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 200 NQAGYQQQQQQQQQQQQQQQTLNSQFSNL 228
               +QQ QQQ Q   QQ + L +Q   +
Sbjct: 3   LINKFQQLQQQLQLLAQQIKQLETQLKEI 31


>gnl|CDD|151422 pfam10975, DUF2802, Protein of unknown function (DUF2802).  This
           bacterial family of proteins has no known function.
          Length = 70

 Score = 27.1 bits (61), Expect = 7.8
 Identities = 5/33 (15%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 205 QQQQQQQQQQQQQQQTLNSQFSNLSVNQGMNKM 237
           + +QQ ++ Q +Q++       +   ++   K+
Sbjct: 9   ELEQQLKELQDRQEELEQQDPQSRLYSRAA-KL 40


>gnl|CDD|223555 COG0479, FrdB, Succinate dehydrogenase/fumarate reductase, Fe-S
           protein subunit [Energy production and conversion].
          Length = 234

 Score = 29.2 bits (66), Expect = 8.3
 Identities = 11/35 (31%), Positives = 16/35 (45%), Gaps = 3/35 (8%)

Query: 848 FISYLNDDKAYVSP---VKVIKDSSPDRMEFYDKL 879
            +  L +    + P     VI+D   D  EFY+KL
Sbjct: 76  LMKDLEEGVITIEPLPNFPVIRDLVVDMEEFYEKL 110


>gnl|CDD|184029 PRK13399, PRK13399, fructose-1,6-bisphosphate aldolase;
           Provisional.
          Length = 347

 Score = 29.4 bits (66), Expect = 8.4
 Identities = 23/59 (38%), Positives = 25/59 (42%), Gaps = 12/59 (20%)

Query: 778 PRILQLSA-ERLQATGTYLLNM--------PDLIIIL-VRHGTSPAICSGLF--GFQSV 824
           P ILQ S   R  A    L +M        PD+ I L   HG SPA C      GF SV
Sbjct: 44  PVILQASRGARKYAGDAMLRHMVLAAAEMYPDIPICLHQDHGNSPATCQSAIRSGFTSV 102


>gnl|CDD|233516 TIGR01661, ELAV_HUD_SF, ELAV/HuD family splicing factor.  This
           model describes the ELAV/HuD subfamily of splicing
           factors found in metazoa. HuD stands for the human
           paraneoplastic encephalomyelitis antigen D of which
           there are 4 variants in human. ELAV stnds for the
           Drosophila Embryonic lethal abnormal visual protein.
           ELAV-like splicing factors are also known in human as
           HuB (ELAV-like protein 2), HuC (ELAV-like protein 3,
           Paraneoplastic cerebellar degeneration-associated
           antigen) and HuR (ELAV-like protein 1). These genes are
           most closely related to the sex-lethal subfamily of
           splicing factors found in Dipteran insects (TIGR01659).
           These proteins contain 3 RNA-recognition motifs (rrm:
           pfam00076).
          Length = 352

 Score = 29.5 bits (66), Expect = 9.0
 Identities = 29/143 (20%), Positives = 43/143 (30%), Gaps = 33/143 (23%)

Query: 23  VPLIASNPSQPSGAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAPGATINS 82
             +   N + PSG   PI+     V  +N+   +   +S  +   L     Q P  T   
Sbjct: 146 RAIKTLNGTTPSGCTEPIT-----VKFANNPSSS---NSKGLLSQLEA--VQNPQTTRVP 195

Query: 83  LSTPVPQQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLYQQQQQQQ 142
           LST +      P     +H+   FR     ++                        QQQQ
Sbjct: 196 LSTILTAAGIGP----MHHAAARFRPSAGDFTA-------------------VLAHQQQQ 232

Query: 143 QQQQQQQQPIPNIPPLPNSQQNS 165
               QQ       PP  + Q   
Sbjct: 233 HAVAQQHAAQRASPPATDGQTAG 255


>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
          Length = 780

 Score = 29.8 bits (67), Expect = 9.3
 Identities = 31/206 (15%), Positives = 51/206 (24%), Gaps = 35/206 (16%)

Query: 18  SNVMQVPLIASNPSQPS-GAAPPISGLQPQVPVSNSSLQTPLGSSNMMRPPLVTNRFQAP 76
           S   Q   +    +Q + G     S        S SSLQ+ L  S        +    A 
Sbjct: 304 SLGDQFSSLQEALTQIAQGLKQKTSSSLEAAQGSLSSLQSMLALSK-------SLDLTAE 356

Query: 77  GATINSLSTPVP-QQFQRPQQQNANHSQQTFRMPPNTYSQSSKTAPFNNGLTPLTNASLY 135
           GAT+++L  P   Q     Q+  A  S+                             +L 
Sbjct: 357 GATVDALGAPDGVQWLDESQKTLATLSELLS---------------TGIDGVSEGLDALE 401

Query: 136 QQQQQQQQQQQQQQQPIPNIPPLPNSQQN-----------SSNFQPSNNLYNNIQVPPRA 184
           Q   Q  +   + +  +  I                        +  N L   +     A
Sbjct: 402 QASAQLAKSLAKLKTAVAQIAASIAQLLPGASEVLKTLKSKGLDKLLNQLNGALAKGSNA 461

Query: 185 QPGNYPSVPAGYPSLNQAGYQQQQQQ 210
                      + S+  A  +     
Sbjct: 462 LVQGLSDANDSFRSITSAQLKAGLNT 487


>gnl|CDD|119430 cd05170, PIKKc_SMG1, Suppressor of morphogenetic effect on
           genitalia-1 (SMG-1), catalytic domain; The SMG-1
           catalytic domain subfamily is part of a larger
           superfamily that includes the catalytic domains of other
           kinases such as the typical serine/threonine/tyrosine
           protein kinases (PKs), aminoglycoside
           phosphotransferase, choline kinase, and RIO kinases.
           SMG-1 is a member of the phosphoinositide
           3-kinase-related protein kinase (PIKK) subfamily. PIKKs
           have intrinsic serine/threonine kinase activity and are
           distinguished from other PKs by their unique catalytic
           domain, similar to that of lipid PI3K, and their large
           molecular weight (240-470 kDa). In addition to its
           catalytic domain, SMG-1 contains a FATC (FRAP, ATM and
           TRRAP, C-terminal) domain at the C-terminus. SMG-1 plays
           a critical role in the mRNA surveillance mechanism known
           as non-sense mediated mRNA decay (NMD). NMD protects the
           cells from the accumulation of aberrant mRNAs with
           premature termination codons (PTCs) generated by genome
           mutations and by errors during transcription and
           splicing. SMG-1 phosphorylates Upf1, another central
           component of NMD, at the C-terminus upon recognition of
           PTCs. The phosphorylation/dephosphorylation cycle of
           Upf1 is essential for promoting NMD.
          Length = 307

 Score = 29.3 bits (66), Expect = 9.4
 Identities = 19/61 (31%), Positives = 22/61 (36%), Gaps = 20/61 (32%)

Query: 125 GLTPLTNASLY-------QQQQQQQQQQQQQQQPIPNIPPLPNSQQNSSNFQPSNNLYNN 177
           G TPL    LY          Q Q+ Q   Q   IP I P           +PS+  YN 
Sbjct: 90  GATPLFG--LYKRWQQREAVLQAQKSQVGYQNPQIPGIVP-----------RPSDLFYNK 136

Query: 178 I 178
           I
Sbjct: 137 I 137


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.133    0.389 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 45,949,362
Number of extensions: 4482886
Number of successful extensions: 11411
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8540
Number of HSP's successfully gapped: 367
Length of query: 912
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 806
Effective length of database: 6,236,078
Effective search space: 5026278868
Effective search space used: 5026278868
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.6 bits)