RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10778
         (556 letters)



>gnl|CDD|214982 smart01026, Beach, Beige/BEACH domain.  The BEACH domain was
           described in the BEIGE protein (D1035670) and in the
           highly homologous CHS protein. The BEACH domain is
           usually followed by a series of WD repeats. The function
           of the BEACH domain is unknown.
          Length = 280

 Score =  334 bits (859), Expect = e-112
 Identities = 116/195 (59%), Positives = 144/195 (73%), Gaps = 2/195 (1%)

Query: 9   NQEPYHYGSHYSNSGTVLHFLVRIPPFTSMFLNYQDNNFDLPDRTFHNLATTWRLTSSES 68
           +  P+HYG+HYS++G VL++L+R+ PFT++FL  Q   FD  DR FH++A TWR  S ES
Sbjct: 84  DIPPFHYGTHYSSAGIVLYYLIRLEPFTTLFLQLQGGRFDHADRLFHSVAATWRSASLES 143

Query: 69  TTDVKELIPELFYLADILVNNEGFDFGVRQNGQRVDSVILPPWA-SDPRTFIMVHRQSLE 127
            TDVKELIPE FYL + LVN  GFDFG RQ+G+ VD V LPPWA   P  FI  HR++LE
Sbjct: 144 MTDVKELIPEFFYLPEFLVNINGFDFGTRQDGEDVDDVELPPWAKGSPEEFIRKHREALE 203

Query: 128 SELVSENLPHWIDLVFGYKQSGKAAVDAINVFHPATYYG-FDIAAIPDPLERTAWETMIR 186
           SE VS++L HWIDL+FGYKQ GK AV+A+NVFHP TY G  D+ +I DP+ER A E  I 
Sbjct: 204 SEYVSQHLHHWIDLIFGYKQRGKEAVEALNVFHPLTYEGAVDLDSIEDPVERKALEGQIH 263

Query: 187 TYGQTPRQLFKSAHP 201
            +GQTP+QLFK  HP
Sbjct: 264 NFGQTPKQLFKEPHP 278


>gnl|CDD|216896 pfam02138, Beach, Beige/BEACH domain. 
          Length = 274

 Score =  292 bits (749), Expect = 1e-95
 Identities = 100/193 (51%), Positives = 130/193 (67%), Gaps = 3/193 (1%)

Query: 9   NQEPYHYGSHYSNSGTVLHFLVRIPPFTSMFLNYQDNNFDLPDRTFHNLATTWRLTSSES 68
              P+HYGSHYS+ G+VLH+L+R+ PFT++ +  Q   FD  DR F ++   W+   SES
Sbjct: 83  VGPPFHYGSHYSSPGSVLHYLIRLEPFTTLHIELQGGKFDHADRLFSSIEEAWKS-VSES 141

Query: 69  TTDVKELIPELFYLADILVNNEGFDFGVRQNGQRVDSVILPPWASDPRTFIMVHRQSLES 128
             DVKELIPE FYL + L N   FD G R +G++VD V LPPWA  P  F+  +R++LES
Sbjct: 142 MNDVKELIPEFFYLPEFLKNINNFDLGKR-DGEKVDDVELPPWAKSPEEFVYKNRKALES 200

Query: 129 ELVSENLPHWIDLVFGYKQSGKAAVDAINVFHPATYYGFDIAAIPDPLERTAWETMIRTY 188
           + VSENL +WIDL+FGYKQ G+ A++A NVFHP TY    +  I DP+ER A E  I  +
Sbjct: 201 DYVSENLHNWIDLIFGYKQRGEEAIEAKNVFHPLTYEDI-VDKIKDPVEREAIEAQISNF 259

Query: 189 GQTPRQLFKSAHP 201
           GQTP+QLF   HP
Sbjct: 260 GQTPKQLFTEPHP 272


>gnl|CDD|100117 cd06071, Beach, BEACH (Beige and Chediak-Higashi) domains,
           implicated in membrane trafficking,  are present in a
           family of proteins conserved throughout eukaryotes. This
           group contains human lysosomal trafficking regulator
           (LYST), LPS-responsive and beige-like anchor (LRBA) and
           neurobeachin. Disruption of LYST leads to
           Chediak-Higashi syndrome, characterized by severe
           immunodeficiency, albinism, poor blood coagulation and
           neurologic problems. Neurobeachin is a candidate gene
           linked to autism. LBRA seems to be upregulated in
           several cancer types. It has been shown that the BEACH
           domain itself is important for the function of these
           proteins.
          Length = 275

 Score =  276 bits (709), Expect = 1e-89
 Identities = 102/192 (53%), Positives = 133/192 (69%), Gaps = 3/192 (1%)

Query: 10  QEPYHYGSHYSNSGTVLHFLVRIPPFTSMFLNYQDNNFDLPDRTFHNLATTWRLTSSEST 69
             P+HYGSHYSN   VL++LVR+ PFT++ L+ Q  +FD  DR F+++ ++WR ++SE+ 
Sbjct: 85  DPPFHYGSHYSNPAIVLYYLVRLEPFTTLHLSLQGGHFDAADRLFNSIPSSWR-SASENP 143

Query: 70  TDVKELIPELFYLADILVNNEGFDFGVRQNGQRVDSVILPPWASDPRTFIMVHRQSLESE 129
           +DVKELIPE +YL +  +N   FDFG  Q+G++V+ V LPPWA  P  FI  HR++LESE
Sbjct: 144 SDVKELIPEFYYLPEFFLNINKFDFGK-QDGEKVNDVELPPWAKSPEEFIRKHREALESE 202

Query: 130 LVSENLPHWIDLVFGYKQSGKAAVDAINVFHPATYYGFDIAAIPDPLERTAWETMIRTYG 189
            VS+NL HWIDL+FGYKQ G+ AV A NVFHP TY G       D +ER A E  I  +G
Sbjct: 203 YVSKNLHHWIDLIFGYKQRGEEAVKAKNVFHPLTYEGSVDLDSID-VEREAIEAQINNFG 261

Query: 190 QTPRQLFKSAHP 201
           QTP QLF   HP
Sbjct: 262 QTPVQLFTKPHP 273


>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score = 62.0 bits (151), Expect = 1e-10
 Identities = 56/213 (26%), Positives = 88/213 (41%), Gaps = 26/213 (12%)

Query: 334 PDCHTLWLGLTSGNILVYEYELNASTNKIEFCKPPTILLGHSACVTDLYLCAGFSVAVSS 393
           PD   L  G   G I V++ E           +    L GH+  V D+   A  +   S 
Sbjct: 19  PDGKLLATGSGDGTIKVWDLETG---------ELLRTLKGHTGPVRDVAASADGTYLASG 69

Query: 394 SEDGTCLLWDINTIGYIREIPCVDPNTPVSHVTMSETNGEIASVLQTTKTGGSILRLHTI 453
           S D T  LWD+ T   +R +        VS V  S  +G I S   ++    +I      
Sbjct: 70  SSDKTIRLWDLETGECVRTLTGHTS--YVSSVAFS-PDGRILS---SSSRDKTIKVWDVE 123

Query: 454 NAQLITSLASELR-ITSVTFSNAPEGVSQNVIVTGHSDGTIQLWSGY--KLAHLLTLNTN 510
             + +T+L      + SV FS          + +   DGTI+LW     K    LT +T 
Sbjct: 124 TGKCLTTLRGHTDWVNSVAFS-----PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTG 178

Query: 511 VALPVIAVTYSCDSQHLYASNTEGLVIIWESSS 543
               V +V +S D + L +S+++G + +W+ S+
Sbjct: 179 ---EVNSVAFSPDGEKLLSSSSDGTIKLWDLST 208



 Score = 54.3 bits (131), Expect = 5e-08
 Identities = 41/174 (23%), Positives = 76/174 (43%), Gaps = 15/174 (8%)

Query: 369 TILLGHSACVTDLYLCAGFSVAVSSSEDGTCLLWDINTIGYIREIPCVDPNTPVSHVTMS 428
           T L GH+  V  +      +   SSS+DGT  LWD+ T   +  +        V+ V  S
Sbjct: 129 TTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLT--GHTGEVNSVAFS 186

Query: 429 ETNGEIASVLQTTKTGGSILRLHTINA-QLITSLAS-ELRITSVTFSNAPEGVSQNVIVT 486
               ++ S      +    ++L  ++  + + +L   E  + SV FS         ++ +
Sbjct: 187 PDGEKLLSS-----SSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-----DGYLLAS 236

Query: 487 GHSDGTIQLWSGYKLAHLLTLNTNVALPVIAVTYSCDSQHLYASNTEGLVIIWE 540
           G  DGTI++W       + TL +     V ++ +S D + L + + +G + IW+
Sbjct: 237 GSEDGTIRVWDLRTGECVQTL-SGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289



 Score = 52.0 bits (125), Expect = 3e-07
 Identities = 40/177 (22%), Positives = 73/177 (41%), Gaps = 19/177 (10%)

Query: 371 LLGHSACVTDLYLCAGFSVAVSSSEDGTCLLWDINTIGYIREIPCVDPNTPVSHVTMSET 430
           L GH+  VT +       +  + S DGT  +WD+ T   +R         PV  V  S  
Sbjct: 5   LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRT--LKGHTGPVRDVAASAD 62

Query: 431 NGEIASVLQTTKTGGSILRLHTINA-QLITSLAS-ELRITSVTFSNAPEGVSQNVIVTGH 488
              +AS      +    +RL  +   + + +L      ++SV FS         ++ +  
Sbjct: 63  GTYLASG-----SSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSP-----DGRILSSSS 112

Query: 489 SDGTIQLWSG--YKLAHLLTLNTNVALPVIAVTYSCDSQHLYASNTEGLVIIWESSS 543
            D TI++W     K    L  +T+    V +V +S D   + +S+ +G + +W+  +
Sbjct: 113 RDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTFVASSSQDGTIKLWDLRT 166


>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
           only].
          Length = 466

 Score = 48.9 bits (115), Expect = 4e-06
 Identities = 41/178 (23%), Positives = 70/178 (39%), Gaps = 9/178 (5%)

Query: 366 KPPTILLGHSACVTDLYLCAGFSVAVSSSEDGTCLLWDINTIGYIREIPCVDPNTPVSHV 425
                L GHS+ V  +       +  S S DGT  LWD+ T   +  +       PVS +
Sbjct: 275 SLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSL 334

Query: 426 TMSETNGEIASVLQTTKTGGSILRLHTINAQLITSLASELRITSVTFSNAPEGVSQNVIV 485
           + S     + S      T   +  L T          S   + SV+FS         V+ 
Sbjct: 335 SFSPDGSLLVSGGSDDGT-IRLWDLRTGKPLKTLEGHS--NVLSVSFSP-----DGRVVS 386

Query: 486 TGHSDGTIQLWSGYKLAHLLTLNTNVALPVIAVTYSCDSQHLYASNTEGLVIIWESSS 543
           +G +DGT++LW       LL         V ++ +S D + L + +++  + +W+  +
Sbjct: 387 SGSTDGTVRLWD-LSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKT 443



 Score = 47.0 bits (110), Expect = 2e-05
 Identities = 46/208 (22%), Positives = 81/208 (38%), Gaps = 9/208 (4%)

Query: 338 TLWLGLTSGNILVYEYELNASTNKIEFCKPPTILLGHSACVTDLYLCAGFSVAVSSSEDG 397
            L LG +   IL+     ++  +  +      +L GH   +T +       + +S S DG
Sbjct: 30  LLSLGSSESGILLLALLSDSLVSLPDL--SSLLLRGHEDSITSIAFSPDGELLLSGSSDG 87

Query: 398 TCLLWDINT-IGYIREIPCVDPNTPVSHVTMSETNGEIASVLQTTKTGGSILRLHTINAQ 456
           T  LWD++     I+ +  +  ++       S     I     +      +  L T    
Sbjct: 88  TIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKL 147

Query: 457 LITSLASELRITSVTFSNAPEGVSQNVIVTGHSDGTIQLWSGYKLAHLLTLNTNVALPVI 516
           + T       +TS+ FS       + +      DGTI+LW       L TL  +   PV 
Sbjct: 148 IRTLEGHSESVTSLAFSPD----GKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTD-PVS 202

Query: 517 AVTYSCDSQHLYASNTE-GLVIIWESSS 543
           ++ +S D   L AS +  G + +W+ S+
Sbjct: 203 SLAFSPDGGLLIASGSSDGTIRLWDLST 230


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 33.9 bits (78), Expect = 0.009
 Identities = 13/39 (33%), Positives = 19/39 (48%)

Query: 365 CKPPTILLGHSACVTDLYLCAGFSVAVSSSEDGTCLLWD 403
            K    L GH+  VT +      ++  S S+DGT  +WD
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 29.6 bits (67), Expect = 0.33
 Identities = 11/32 (34%), Positives = 18/32 (56%), Gaps = 5/32 (15%)

Query: 466 RITSVTFSNAPEGVSQNVIVTGHSDGTIQLWS 497
            +TSV FS        N++ +G  DGT+++W 
Sbjct: 13  PVTSVAFSP-----DGNLLASGSDDGTVRVWD 39


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are
           permuted with respect to the structural repeats (blades)
           of the beta propeller domain.
          Length = 40

 Score = 33.8 bits (78), Expect = 0.010
 Identities = 13/38 (34%), Positives = 17/38 (44%)

Query: 366 KPPTILLGHSACVTDLYLCAGFSVAVSSSEDGTCLLWD 403
           +    L GH+  VT +          S S+DGT  LWD
Sbjct: 3   ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 30.4 bits (69), Expect = 0.20
 Identities = 12/32 (37%), Positives = 16/32 (50%), Gaps = 5/32 (15%)

Query: 466 RITSVTFSNAPEGVSQNVIVTGHSDGTIQLWS 497
            +TSV FS          + +G  DGTI+LW 
Sbjct: 14  PVTSVAFSP-----DGKYLASGSDDGTIKLWD 40


>gnl|CDD|217535 pfam03401, TctC, Tripartite tricarboxylate transporter family
           receptor.  These probable extra-cytoplasmic solute
           receptors are strongly overrepresented in several
           beta-proteobacteria. This family, formerly known as Bug
           - Bordetella uptake gene (bug) product - is a family of
           bacterial tripartite tricarboxylate receptors of the
           extracytoplasmic solute binding receptor-dependent
           transporter group of families, distinct from the ABC and
           TRAP-T families. The TctABC system has been
           characterized in S. typhimurium, and TctC is the
           extracytoplasmic tricarboxylate-binding receptor which
           binds the transporters TctA and TctB, two integral
           membrane proteins. Complete three-component systems are
           found only in bacteria.
          Length = 274

 Score = 33.5 bits (77), Expect = 0.22
 Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 7/56 (12%)

Query: 304 LIRLKMSKELPATPVI---RPPLGATIATAT---TAPDCHTLWLGLTSGNILVYEY 353
            +   MSK+L   PVI   +P  G  I        APD +T+ LG T   +  + Y
Sbjct: 4   ALAQGMSKQL-GQPVIVENKPGAGGIIGADAVAKAAPDGYTILLGSTGLAVAPHLY 58


>gnl|CDD|187865 cd09734, Csb2_I-U, CRISPR/Cas system-associated protein Csb2.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Duplicated RAMP domains; also known as GSU0054 family.
          Length = 495

 Score = 32.7 bits (74), Expect = 0.63
 Identities = 17/58 (29%), Positives = 21/58 (36%), Gaps = 6/58 (10%)

Query: 50  PDRTFHNLATTWRLTSSESTTDVKELI-----PELFYLADILVNNEGFDFGVRQNGQR 102
           P R F  L  TW L       DV   +     P +F L   LV N    + V +   R
Sbjct: 21  PWRIFQALVATWALKVDAEDKDVLNWLESLPEPPVFALPPALVANATRHY-VPRAWGR 77


>gnl|CDD|182837 PRK10918, PRK10918, phosphate ABC transporter periplasmic
           substrate-binding protein PstS; Provisional.
          Length = 346

 Score = 32.1 bits (73), Expect = 0.66
 Identities = 24/79 (30%), Positives = 35/79 (44%), Gaps = 18/79 (22%)

Query: 111 WASDPRTFIMVHRQSLESELVSENLPHWIDLVFGYKQSGKAAVDAINVFHPATYYGFDIA 170
           W     TFI+VH+   + E  +E L  + D  + YK   K A D             D A
Sbjct: 275 WPITSTTFILVHKDQKKPEQGAEVL-KFFD--WAYKNGAKQAND------------LDYA 319

Query: 171 AIPDPLE---RTAWETMIR 186
           ++PD +    R AW+T I+
Sbjct: 320 SLPDSVVEQVRAAWKTNIK 338


>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
          Length = 793

 Score = 31.2 bits (70), Expect = 1.6
 Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 8/75 (10%)

Query: 341 LGLTSGNILVYEYELNASTNKIEFCKPPTILLGHSACVTDLYLCAGFSVAVSSSEDGTCL 400
           L   S +  VY Y+L     K+  C     ++GHS  V+ +      S  VSSS D T  
Sbjct: 633 LAFGSADHKVYYYDLR--NPKLPLCT----MIGHSKTVSYVRFVDS-STLVSSSTDNTLK 685

Query: 401 LWDIN-TIGYIREIP 414
           LWD++ +I  I E P
Sbjct: 686 LWDLSMSISGINETP 700


>gnl|CDD|151125 pfam10596, U6-snRNA_bdg, U6-snRNA interacting domain of PrP8.  This
           domain incorporates the interacting site for the
           U6-snRNA as part of the U4/U6.U5 tri-snRNPs complex of
           the spliceosome, and is the prime candidate for the role
           of cofactor for the spliceosome's RNA core. The
           essential spliceosomal protein Prp8 interacts with U5
           and U6 snRNAs and with specific pre-mRNA sequences that
           participate in catalysis. This close association with
           crucial RNA sequences, together with extensive genetic
           evidence, suggests that Prp8 could directly affect the
           function of the catalytic core, perhaps acting as a
           splicing cofactor.
          Length = 160

 Score = 29.8 bits (67), Expect = 2.0
 Identities = 14/27 (51%), Positives = 16/27 (59%)

Query: 202 MNVKKLTPAQESRVEVIPNIRGLVWGS 228
           M  KKLT AQ S +  IPN R  +W S
Sbjct: 72  MKFKKLTNAQRSGLSQIPNRRFTLWWS 98


>gnl|CDD|234930 PRK01261, aroD, 3-dehydroquinate dehydratase; Provisional.
          Length = 229

 Score = 29.9 bits (67), Expect = 3.0
 Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 9/68 (13%)

Query: 268 TTLLHSYHKDKNTNLISSLN-VIGAGAGMI---C-WNE----CDNLIRLKMSKELPATPV 318
           T L+ SYH + + N+ + L+ +       +   C +N+     D+L  + M K+    P+
Sbjct: 120 TMLMVSYHTNNSDNMPAILDIMNEKNPDYVKVACNYNDNKKFVDDLQYILMKKDEKYKPI 179

Query: 319 IRPPLGAT 326
           +  P+G  
Sbjct: 180 VFIPMGRE 187


>gnl|CDD|222861 PHA02537, M, terminase endonuclease subunit; Provisional.
          Length = 230

 Score = 29.7 bits (67), Expect = 3.2
 Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 2/66 (3%)

Query: 112 ASDPRTFIMVHRQSLESELVSENLPHWIDLVFGYKQSGKAAVDAINVFHPATYYGFDIAA 171
           A+D R    +     ++EL  + LP ++  V G   +G    D  +V      + FDI  
Sbjct: 41  AADRRALKGIQSIERKAELKRDLLPKYLPWVEGVLAAGAGYQD--DVLMTVMVWRFDIGD 98

Query: 172 IPDPLE 177
               LE
Sbjct: 99  FDGALE 104


>gnl|CDD|235780 PRK06330, PRK06330, transcript cleavage factor/unknown domain
           fusion protein; Validated.
          Length = 718

 Score = 30.1 bits (68), Expect = 3.6
 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 4/48 (8%)

Query: 99  NGQRVDSVILPPWASDPRTFIMVHRQSLESELVSENLPHWIDLVFGYK 146
            G +V   IL PW +DP   I+    SL+S+L  E L   +     ++
Sbjct: 660 QGDKVCYTILGPWDADPDNKIL----SLKSKLAQEMLGKKVGDSVQFQ 703


>gnl|CDD|187764 cd09324, TDT_TehA, Tellurite-resistance/Dicarboxylate Transporter
           (TDT) family includes TehA protein.  This subfamily
           includes Tellurite resistance protein TehA that belongs
           to the C4-dicarboxylate transporter/malic acid transport
           (TDT) protein family and is a homolog of plant Slow
           Anion Channel-Associated 1 (SLAC1). The tehA gene
           encodes an integral membrane protein that has been shown
           to have efflux activity of quaternary ammonium
           compounds. TehA protein of Escherichia coli functions as
           a tellurite-resistance uptake permease.
          Length = 301

 Score = 29.5 bits (67), Expect = 3.8
 Identities = 21/59 (35%), Positives = 29/59 (49%), Gaps = 7/59 (11%)

Query: 292 GAGMICWNECDNLI--RLKMSKELPATPVIRPPLGATIATATTAPDCHTLWLGLTSGNI 348
           GAGM  W   + +I  RL+ S  LP  P +RP LG  +A A       + +L L  G +
Sbjct: 161 GAGMFSWLSLEPVILQRLRTSPPLP--PALRPTLGIQLAPAFV---GCSAYLSLNGGEV 214


>gnl|CDD|146355 pfam03677, UPF0137, Uncharacterized protein family (UPF0137).  This
           family includes GP6-D a virulence plasmid encoded
           protein.
          Length = 244

 Score = 29.4 bits (66), Expect = 4.0
 Identities = 24/87 (27%), Positives = 39/87 (44%), Gaps = 22/87 (25%)

Query: 138 WIDLVFGYKQSGKAAVDAINVFHPATYYGFDIAAIPDPLERTAWETMIRTYGQTPRQLFK 197
           WI LVFG K S   A+          YY   I  +PD   +  ++++       P +   
Sbjct: 108 WILLVFGTKSSAYNAL---------VYYELFI-TLPDKSLKLEFQSI-------PYK--- 147

Query: 198 SAHPMNVKKLTPAQESRVEVIPNIRGL 224
           +A+ +  +K +  QE + +VI N RG 
Sbjct: 148 AAYTLASRKGS--QEKKEDVIRNYRGE 172


>gnl|CDD|221994 pfam13230, GATase_4, Glutamine amidotransferases class-II.  This
           family captures members that are not found in pfam00310.
          Length = 272

 Score = 29.2 bits (66), Expect = 5.4
 Identities = 15/53 (28%), Positives = 26/53 (49%), Gaps = 2/53 (3%)

Query: 19  YSNSGTVLHFLVRIPPFTSMFLNYQDNNFDLPDRTFHNLATTWRLTSSESTTD 71
           +++  T LH+LVR PPF +  L  +D + D    T         + +++  TD
Sbjct: 183 FAHCSTRLHYLVRRPPFGTAHLIDEDVSVDFAKVTTPEDRVA--VIATQPLTD 233


>gnl|CDD|234799 PRK00578, prfB, peptide chain release factor 2; Validated.
          Length = 367

 Score = 29.1 bits (66), Expect = 6.5
 Identities = 14/26 (53%), Positives = 15/26 (57%), Gaps = 2/26 (7%)

Query: 12  PYHYGSHYSNSGTVLHFLVRIPPFTS 37
           PY YG   S +G   H LVRI PF S
Sbjct: 184 PYAYGYLKSETGV--HRLVRISPFDS 207


>gnl|CDD|233554 TIGR01739, tegu_FGAM_synt, herpesvirus tegument
           protein/v-FGAM-synthase.  This model describes a family
           of large proteins of herpesvirues. The protein is
           described variably as tegument protein or
           phosphoribosylformylglycinamidine synthase
           (FGAM-synthase). Most of the length of the protein shows
           homology to eukaryotic FGAM-synthase. Functional
           characterizations were not verified during construction
           of this model.
          Length = 1202

 Score = 29.3 bits (66), Expect = 6.8
 Identities = 16/70 (22%), Positives = 28/70 (40%), Gaps = 13/70 (18%)

Query: 424 HVTMSETNGEIASVLQTTKTGGSILRLHTINAQLITSLASELRITSVTFSNAPEGVSQNV 483
            +++  +     S+ +    G S +RL  ++   +  L           S     V + V
Sbjct: 721 WLSLHPSYTLAGSIFEQI-LGLSFIRLPALSPVSLKKL----------LSALQTLVKEGV 769

Query: 484 IVTGH--SDG 491
           IV+GH  SDG
Sbjct: 770 IVSGHDVSDG 779


>gnl|CDD|204739 pfam11768, DUF3312, Protein of unknown function (DUF3312).  This is
           a eukaryotic family of uncharacterized proteins. This
           family shows similarity to WD40 repeat proteins.
          Length = 544

 Score = 28.9 bits (65), Expect = 7.3
 Identities = 18/93 (19%), Positives = 35/93 (37%), Gaps = 10/93 (10%)

Query: 285 SLNVIGAGAGMICWNECDNLIRLKMSKELPATPVIRPPLGATIATATTAPDCHTLWLGLT 344
            ++  G  +   C  EC    R KM +      V   PL + + +   +PD   L LG  
Sbjct: 226 KVSKKGEPSADSCIYEC---ARNKMRR----VAVTSIPLRSQVCSCARSPDEEKLLLGCI 278

Query: 345 SGNILVYEYELNAS-TNKIEFCKPPTILLGHSA 376
             ++++Y+     +   +      P ++  H  
Sbjct: 279 DSSLVLYDEHRGVTLLAQAAV--IPHLIAWHPD 309


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
            modification].
          Length = 2365

 Score = 29.2 bits (65), Expect = 8.1
 Identities = 16/38 (42%), Positives = 18/38 (47%), Gaps = 8/38 (21%)

Query: 202  MNVKKLTPAQESRVEVIPNIRGLVWGS--------YVG 231
            M  KKLT AQ   +  IPN R  +W S        YVG
Sbjct: 1542 MKFKKLTNAQRMGLSQIPNRRFTLWWSPTINRANVYVG 1579


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.415 

Gapped
Lambda     K      H
   0.267   0.0774    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 27,809,394
Number of extensions: 2638847
Number of successful extensions: 2140
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2105
Number of HSP's successfully gapped: 35
Length of query: 556
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 454
Effective length of database: 6,413,494
Effective search space: 2911726276
Effective search space used: 2911726276
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.6 bits)