RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy949
         (569 letters)



>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
           translational repressor [Translation, ribosomal
           structure and biogenesis].
          Length = 777

 Score = 42.8 bits (101), Expect = 4e-04
 Identities = 47/242 (19%), Positives = 74/242 (30%), Gaps = 26/242 (10%)

Query: 166 RNTPDPSLIDSSSDTSTNNNPYASMFYIFFQPRLPEPDLIRLDSS-DDVLPTSSQCAGVE 224
            N    S I++ + +S+                   P L  L    + ++  SS  A   
Sbjct: 162 TNQSGSSFINTPASSSSQPLTNLV-----VSSIKRFPYLTSLSPFFNYLIDPSSDSATAS 216

Query: 225 NPLYPYFVPKNLQPQPEKQEEASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSN 284
               P F P             S +N  S S    +    ++ N+      S  +  +S 
Sbjct: 217 ADTSPSFNPPP---------NLSPNNLFSTSDLSPLPDTQSVENNIILNSSSSINELTSI 267

Query: 285 YHKAFSFHGKSQDTSNSFDFLTSSSTHPTHTN-----GFTTSSISSTFPSEPLPSSQPKV 339
           Y    S        S    FL  SS+    +         T S S+   +  LP S P  
Sbjct: 268 YGSVPSIRNLRGLNSALVSFLNVSSSSLAFSALNGKEVSPTGSPSTRSFARVLPKSSPNN 327

Query: 340 LST-----FMNHSQP-PPLSPRKLNNTESKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYP 393
           L T      +N  Q  P L      +T +  +L   +  +N  +     T+   S L Y 
Sbjct: 328 LLTEILTTGVNPPQSLPSLLNPVFLSTSTGFSLTNLSGYLNPNKNLKKNTLSSLSNLGYS 387

Query: 394 KP 395
             
Sbjct: 388 SN 389



 Score = 33.2 bits (76), Expect = 0.43
 Identities = 53/248 (21%), Positives = 86/248 (34%), Gaps = 22/248 (8%)

Query: 246 ASGSNSSSNSFKQNMDFF---SNLSNSSSSQLKSPPSRPSSNYHKAF-SFHGKSQDTSNS 301
            S + SSS S  Q++      S+ + S+SS+ KS  +  S+    A  S    S   S+ 
Sbjct: 84  WSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSSTASSM 143

Query: 302 FDFLTSSSTHPTHTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNHSQPPPLSPRKLNNTE 361
           F+       +P H+N  TT+   S+F + P  SS   +            +S  K     
Sbjct: 144 FNSNKLPLPNPNHSNSATTNQSGSSFINTPASSSSQPL--------TNLVVSSIKRFPYL 195

Query: 362 SKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYPKPPVLNNGFSHPVQVSRPPSYLSYPKPP 421
           +    P  N  ++ P           S    P P +  N       +S  P   S     
Sbjct: 196 T-SLSPFFNYLID-PSSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSVENNI 253

Query: 422 VLN------NGFSHPVQVSSNSSSSEFNSEAKDFFDSILNNNSTTYPSVAQRNGVFASKP 475
           +LN         S    V S  +    NS    F +     +S  + ++  +       P
Sbjct: 254 ILNSSSSINELTSIYGSVPSIRNLRGLNSALVSFLNVSS--SSLAFSALNGKEVSPTGSP 311

Query: 476 STSTYASR 483
           ST ++A  
Sbjct: 312 STRSFARV 319


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 36.6 bits (84), Expect = 0.039
 Identities = 37/132 (28%), Positives = 55/132 (41%), Gaps = 10/132 (7%)

Query: 314 HTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNHSQPPPLSPRKLNNTESKQNLPLNNLPV 373
             +  T  S S+  P +P P  QP   +  M H +PPP +P      +S ++ P    P 
Sbjct: 314 QPHSHTPPSQSALQPQQP-PREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPS 372

Query: 374 NAPR----LPLVPTVGPPSYLSYPKPPVLNNGFSHPVQVSRPPSYLSYP--KPPVLNNGF 427
             P+    LP  P + P S L    PP   +    P+Q+      L     +PPVL    
Sbjct: 373 PFPQMPSNLPPPPALKPLSSLPTHHPP---SAHPPPLQLMPQSQPLQSVPAQPPVLTQSQ 429

Query: 428 SHPVQVSSNSSS 439
           S P + S++  S
Sbjct: 430 SLPPKASTHPHS 441



 Score = 30.0 bits (67), Expect = 4.8
 Identities = 41/208 (19%), Positives = 68/208 (32%), Gaps = 20/208 (9%)

Query: 241 EKQEEASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSNYHKAFSFHGKSQDTSN 300
           E + E  G   SS+S   N +  S+  +       S PS PS           +S   S+
Sbjct: 115 EGEGEGEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQ-------DNESDSDSS 167

Query: 301 SFDFLTSSSTHPTHTNGFTTSSISSTFPSEP----LPSSQPKVLSTFMNHSQPPPLSPRK 356
           +   L      P+       +   S  P  P    +P     + +      Q P      
Sbjct: 168 AQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLI 227

Query: 357 LNNTESKQNLPLNNLPV-------NAPRLPLVPTVGPPSYLSYPKPPVLNNGFSHPVQVS 409
              +   Q LP  + P+        +P+ P   +  P S    P PP+ +     PV + 
Sbjct: 228 SAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFL- 286

Query: 410 RPPSYLSYPKPPVLNNGFSHPVQVSSNS 437
                 + P+P  L      P+ + S +
Sbjct: 287 -QHPSSNPPQPFGLAQSQVPPLPLPSQA 313


>gnl|CDD|225687 COG3145, AlkB, Alkylated DNA repair protein [DNA replication,
           recombination, and repair].
          Length = 194

 Score = 34.3 bits (79), Expect = 0.086
 Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 9/84 (10%)

Query: 81  IKAHVD-SVRFCGNTIAGLSLLSDSVMKLVDEKTKTQEILVLLKQRSLYVMKDDARYKFT 139
           I  H D         +A LSL +  + +L   + +   + + L+   + VM   +R  + 
Sbjct: 119 IGWHQDKDEEDDRPPVASLSLGAPCIFRLRGRRRRGPGLRLRLEHGDVVVMGGPSRLAWH 178

Query: 140 HEVLENERSYFGDLFVPRGRRISV 163
           H + +  R          G+RI++
Sbjct: 179 HIIPKTSRL--------TGQRINL 194


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 33.5 bits (77), Expect = 0.23
 Identities = 20/129 (15%), Positives = 41/129 (31%), Gaps = 5/129 (3%)

Query: 232 VPKNLQPQPEKQEEASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSNYHKAFSF 291
            P     + ++  + + +NS ++   ++ D  S   +   S     PS PS     +   
Sbjct: 149 NPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPS 208

Query: 292 HGKSQDTSNSF-----DFLTSSSTHPTHTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNH 346
                   +SF          S T+P+   G             PL +++P   S     
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATP 268

Query: 347 SQPPPLSPR 355
           +    ++  
Sbjct: 269 APIGGITLD 277



 Score = 29.7 bits (67), Expect = 4.5
 Identities = 15/110 (13%), Positives = 27/110 (24%)

Query: 328 PSEPLPSSQPKVLSTFMNHSQPPPLSPRKLNNTESKQNLPLNNLPVNAPRLPLVPTVGPP 387
           P+   P  +    +     +          +   +  + P ++ P              P
Sbjct: 148 PNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSP 207

Query: 388 SYLSYPKPPVLNNGFSHPVQVSRPPSYLSYPKPPVLNNGFSHPVQVSSNS 437
           S  S P  P      + P     P +    P P          V   S +
Sbjct: 208 SDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTA 257



 Score = 28.5 bits (64), Expect = 9.3
 Identities = 22/102 (21%), Positives = 30/102 (29%), Gaps = 1/102 (0%)

Query: 295 SQDTSNSFDFLTSSSTHPTHTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNHSQPPPLSP 354
           S    ++     S S  P+ + G  +       PS P  SS P    +      PPP   
Sbjct: 171 SFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA-PSSFQSDTPPPSPE 229

Query: 355 RKLNNTESKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYPKPP 396
              N +           PV            PPS  + P P 
Sbjct: 230 SPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPI 271


>gnl|CDD|217538 pfam03405, FA_desaturase_2, Fatty acid desaturase. 
          Length = 330

 Score = 33.6 bits (77), Expect = 0.28
 Identities = 13/26 (50%), Positives = 13/26 (50%)

Query: 173 LIDSSSDTSTNNNPYASMFYIFFQPR 198
           LI S  D  T NNPY    Y  FQ R
Sbjct: 140 LIGSGMDPGTENNPYLGFIYTSFQER 165


>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin.  Amelogenins play a role in
           biomineralisation. They seem to regulate the formation
           of crystallites during the secretory stage of tooth
           enamel development. thought to play a major role in the
           structural organisation and mineralisation of developing
           enamel. They are found in the extracellular matrix.
           Mutations in X-chromosomal amelogenin can cause
           Amelogenesis imperfecta.
          Length = 174

 Score = 31.8 bits (72), Expect = 0.53
 Identities = 22/75 (29%), Positives = 30/75 (40%), Gaps = 6/75 (8%)

Query: 328 PSEPLPSSQPKVLSTFMNHSQPPP----LSPRKLNNTESKQNLPLNNLPVNAPRLPLVPT 383
           P  P+P   P V  T       PP    L P    N + +Q  P +  P   P+ P  P 
Sbjct: 82  PMMPVPGHHPMVPMTGQQPHLQPPAQHPLQPTYGQNPQPQQ--PTHTQPPVQPQQPADPQ 139

Query: 384 VGPPSYLSYPKPPVL 398
            G P +   P PP++
Sbjct: 140 PGQPMFPMQPLPPLV 154


>gnl|CDD|222204 pfam13532, 2OG-FeII_Oxy_2, 2OG-Fe(II) oxygenase superfamily. 
          Length = 190

 Score = 31.1 bits (71), Expect = 1.1
 Identities = 18/87 (20%), Positives = 32/87 (36%), Gaps = 5/87 (5%)

Query: 78  KGYIKAHVD-SVRFCGNTIAGLSLLSDSVMKLVDEKTKTQEILVLLKQRSLYVMKDDARY 136
              +  H D      G  I  +SL   ++ +   +      I + L+   + VM   +R 
Sbjct: 105 GARMGLHQDKDELDFGAPIVSVSLGDPALFRFGGKTRDDPTIALPLESGDVLVMGGPSRL 164

Query: 137 KFTHEVLENERSYFGDLFVPRGRRISV 163
            + H V    R   G   +  G RI++
Sbjct: 165 AY-HGVP---RIKPGTHPLLGGGRINL 187


>gnl|CDD|227820 COG5533, UBP5, Ubiquitin C-terminal hydrolase [Posttranslational
           modification, protein turnover, chaperones].
          Length = 415

 Score = 31.5 bits (71), Expect = 1.2
 Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 10/80 (12%)

Query: 296 QDTSNSFDFLTSSSTHPTHTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNHSQPPPL--- 352
            D+S  +DFL S+  HP+    ++              S   K+ S   +    P     
Sbjct: 1   MDSSKIYDFLKSAPFHPSMVICYSLERYFEDV------SLAYKLYSMLRSLKLDPHFMEL 54

Query: 353 -SPRKLNNTESKQNLPLNNL 371
            +P+K+  ++ K NLP N L
Sbjct: 55  ANPKKMVVSKRKDNLPPNGL 74


>gnl|CDD|218573 pfam05387, Chorion_3, Chorion family 3.  This family consists of
           several Drosophila chorion proteins S36 and S38. The
           chorion genes of Drosophila are amplified in response to
           developmental signals in the follicle cells of the
           ovary.
          Length = 277

 Score = 31.2 bits (70), Expect = 1.2
 Identities = 35/190 (18%), Positives = 55/190 (28%), Gaps = 16/190 (8%)

Query: 305 LTSSSTHPTHTNGFTTSSISS--TFPSEPLPSSQPKVLSTFMN----HSQPPPLSPRKLN 358
           L + + +P +     +++I+     P       Q K+          H     L P   N
Sbjct: 84  LQALNANPPYQKLKNSNTIAESLAEPQLASNIRQGKIKVVSPQFVDQHLFRSLLVPSGHN 143

Query: 359 NTESKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYPKPPVLNNGFS----------HPVQV 408
           N +     PL  + V  P  P    V  P  +  P P +     S           P  +
Sbjct: 144 NHQVIATQPLPPIIVKQPGAPPKVLVNGPPLVVKPAPVIYKIKPSVIYQQEVINKVPTPL 203

Query: 409 SRPPSYLSYPKPPVLNNGFSHPVQVSSNSSSSEFNSEAKDFFDSILNNNSTTYPSVAQRN 468
           S  P Y+   KP         P      S  S   SE     +    ++   Y +    +
Sbjct: 204 SLNPVYVKVYKPGKKIEAPLVPEVQQVYSQPSYGGSEYSQPREQASPSSQPGYGAAGAAS 263

Query: 469 GVFASKPSTS 478
                  S S
Sbjct: 264 SNAGGASSAS 273


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 31.2 bits (70), Expect = 1.7
 Identities = 25/121 (20%), Positives = 37/121 (30%), Gaps = 16/121 (13%)

Query: 328 PSEPL--PSSQPKVLSTFMNHSQPPPLSPRKLNNTESKQNLPLNNL-PVNAPRLPLVPTV 384
           P EPL  P  +P          Q     P++    + +   P   + P    + P  P  
Sbjct: 740 PHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA 799

Query: 385 GPPSYLSYPKPPVLNNGFSHPVQVSRPPSYLSYPKPPV-------------LNNGFSHPV 431
             P Y    +P      +  P Q   P      P+ PV             + NG S P+
Sbjct: 800 PQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPL 859

Query: 432 Q 432
            
Sbjct: 860 H 860


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 30.7 bits (69), Expect = 2.4
 Identities = 27/129 (20%), Positives = 44/129 (34%), Gaps = 13/129 (10%)

Query: 225 NPLYPYFVPKNLQPQPEKQEEASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSN 284
               P   P    P   +   A  S+ S+ +   + +  +  S  S+S   +      S 
Sbjct: 80  ALPMPNLAPI--TPPSSEPTPAPPSSESTATRTPDPNQQALESTESTST--TSADCNDSE 135

Query: 285 YHKAFSFHGKSQDTSNSFDFLTSSSTHPTH--TNGFTTSSISSTFPSEPLPSSQPKVLST 342
                +    S DTS S     SS   P+     GF+ S ISS++ S    +  P    +
Sbjct: 136 QSSTPN--LNSSDTSTS-----SSGALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKS 188

Query: 343 FMNHSQPPP 351
               + P  
Sbjct: 189 AEPTAAPQA 197


>gnl|CDD|191511 pfam06409, NPIP, Nuclear pore complex interacting protein (NPIP).
           This family consists of a series of primate specific
           nuclear pore complex interacting protein (NPIP)
           sequences. The function of this family is unknown but is
           well conserved from African apes to humans.
          Length = 350

 Score = 30.2 bits (67), Expect = 3.4
 Identities = 30/124 (24%), Positives = 44/124 (35%), Gaps = 6/124 (4%)

Query: 324 SSTFPSEPLPSSQPKVLSTFMNHSQPPPLSPRKL-NNTESKQNLP---LNNLPVNAPRLP 379
           SS  P  P  +++   L   M H  PPP     + +N+ S +  P   L  LP +A    
Sbjct: 225 SSGLPYWPYLTAE--TLKNRMGHQPPPPTQQHSIIDNSLSLKTPPECLLTPLPPSALPSA 282

Query: 380 LVPTVGPPSYLSYPKPPVLNNGFSHPVQVSRPPSYLSYPKPPVLNNGFSHPVQVSSNSSS 439
                 P   L  P PP  ++         +PP  L       L      P+  S+   S
Sbjct: 283 DDNLKTPAECLLRPLPPSADDNIKECPLALKPPPPLPPSVDDNLKECLLAPLPPSALPPS 342

Query: 440 SEFN 443
            + N
Sbjct: 343 VDDN 346


>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
          Length = 1213

 Score = 30.3 bits (68), Expect = 3.7
 Identities = 11/71 (15%), Positives = 19/71 (26%), Gaps = 3/71 (4%)

Query: 432  QVSSNSSSSEFNSEAKDFFDSILNNNSTTYPSVAQRNGVFASKPSTSTYASRAAFLNEQL 491
            QVS +S  S F+    +     L         +   N            ++      +  
Sbjct: 1084 QVSPHSQISLFDRGLANLLSDTLQWVKLKKRKL---NDGIVKGLKEQGDSATNQISLDDE 1140

Query: 492  GPAVPSIAPKK 502
              A  +   KK
Sbjct: 1141 LNAYQAEWSKK 1151


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases [Signal transduction
           mechanisms].
          Length = 1175

 Score = 30.2 bits (68), Expect = 4.0
 Identities = 34/137 (24%), Positives = 50/137 (36%), Gaps = 15/137 (10%)

Query: 246 ASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSNYHKAFSFHGKSQDTSNSFDFL 305
           A+ S SS NS   + D FS  S+S S    S  SR  S         G       +   L
Sbjct: 98  ATSSTSSLNSN--DGDQFSPASDSLSFNPSSTQSRKDSGPG-----DGSPVQKRKNP-LL 149

Query: 306 TSSSTHPTHTNGFTTSSISSTFPS------EPLPSSQPKVLSTFMNHSQPPPLSPRKLNN 359
            SSSTH TH     T +  S   +      + +PS   + +     H +    S    +N
Sbjct: 150 PSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSD-TSN 208

Query: 360 TESKQNLPLNNLPVNAP 376
             S  ++  N+   +  
Sbjct: 209 GFSYPSIRKNSRHSSNS 225


>gnl|CDD|226880 COG4474, COG4474, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 180

 Score = 28.9 bits (65), Expect = 4.8
 Identities = 18/75 (24%), Positives = 29/75 (38%), Gaps = 7/75 (9%)

Query: 14  KLLYEYDHWDDAIHGFRETERSKWNEENTKIIARVQNLA-FPPNVTPIQYVHVLDLEQKG 72
           +L  EY H   A+    E +   WNE+N    AR+   A F  ++T   Y      E   
Sbjct: 64  ELKEEYPHIKLAVITPFEEQGKNWNEDNQMKYARILKAADFVKSITERPY------ESPA 117

Query: 73  YIKAHKGYIKAHVDS 87
             + +  ++    D 
Sbjct: 118 QFRQYNQFLLEKSDG 132


>gnl|CDD|227785 COG5498, ACF2, Predicted glycosyl hydrolase [Cell envelope
           biogenesis, outer membrane].
          Length = 760

 Score = 29.4 bits (66), Expect = 5.8
 Identities = 27/117 (23%), Positives = 35/117 (29%), Gaps = 15/117 (12%)

Query: 358 NNTESKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYPKPPVLNNGFSHPVQVSRPPSYLSY 417
              +     PL  L V     P   +  P S +S PK             V  PP     
Sbjct: 14  ATMKGANVQPLFALTVIILLFPTGASASPHS-VSVPK---------GDSNVEFPPILSGT 63

Query: 418 PKPPVLNNGFSHPVQVSSNSSSSEFNSEAKDFFDSIL----NNNSTTYP-SVAQRNG 469
             PP       HPV      S S    +   F+ ++L     N    +P SV  R  
Sbjct: 64  INPPSTFRSVQHPVPPPGVRSDSTGPIQTNKFYGNLLVGSRTNPQYPHPYSVWWRAS 120


>gnl|CDD|183966 PRK13309, ureC, urease subunit alpha; Reviewed.
          Length = 572

 Score = 29.5 bits (66), Expect = 6.2
 Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 6/29 (20%)

Query: 202 PDLIRLDSSDDVLPTSSQCAGVENPLYPY 230
           PD+I++ S  +VLP+S+      NP  PY
Sbjct: 287 PDIIKVASQTNVLPSST------NPTLPY 309


>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
           Provisional.
          Length = 576

 Score = 29.2 bits (65), Expect = 6.5
 Identities = 17/85 (20%), Positives = 31/85 (36%), Gaps = 1/85 (1%)

Query: 328 PSEPLPSSQPKVLSTFMNHSQPPPLSPRKLNNTESKQNLPLNNLPVNAPRLPLVPTVGPP 387
              P P + P   ++  +     P +P   +  E + N+P ++       +P+ P     
Sbjct: 359 NVPPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPEDSNKEVPEDVPMEPEDDRD 418

Query: 388 SYLSYPKPPVLNNGFSHPVQVSRPP 412
           +  + PK P  N G      V   P
Sbjct: 419 NNFNEPKKPE-NKGDGQNEPVIPKP 442


>gnl|CDD|233930 TIGR02564, cas_Csy1, CRISPR type I-F/YPEST-associated protein Csy1.
            CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) is a widespread family of
           prokaryotic direct repeats with spacers of unique
           sequence between consecutive repeats. This protein
           family, typified by YPO2465 of Yersinia pestis, is a
           CRISPR-associated (Cas) family strictly associated with
           the Ypest subtype of CRISPR/Cas locus. This family is
           designated Csy1, for CRISPR/Cas Subtype Ypest protein 1
           [Mobile and extrachromosomal element functions, Other].
          Length = 384

 Score = 29.0 bits (65), Expect = 7.6
 Identities = 25/115 (21%), Positives = 38/115 (33%), Gaps = 23/115 (20%)

Query: 385 GPPSYLSYPKPPVLNNGFSHPVQVS------RPPSYLSYPKPPVLNNGFSHPVQVSSNSS 438
               Y  Y    V   G + P  +S      +  SYL    PPV  N    P++ SS   
Sbjct: 188 HQKRYADYLDLAVQKLGGTKPQNISQLNSSRKGRSYLLPSLPPVWKNIDKPPIKFSS--- 244

Query: 439 SSEFNSEAKDFFDSILNNNSTTYPSVAQRNGVFAS-KPSTSTY---ASRAAFLNE 489
                       +S   + S    ++ Q   +    KP  + Y     R A L++
Sbjct: 245 ----------ILESRFFSKSLARRTLDQLKILLLVVKPKENNYDIRNMRLAALDQ 289


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 29.2 bits (66), Expect = 7.9
 Identities = 24/110 (21%), Positives = 35/110 (31%), Gaps = 17/110 (15%)

Query: 354 PRKLN--------NTESKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYPKPPVLNNGFSHP 405
             KLN        +  +K   P    P      P +P     +    P PP      +  
Sbjct: 505 SIKLNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASS 564

Query: 406 VQVSRPPSYLSYPKPPVLNNGFSHPVQVSSNSSSSEF-NSEAK---DFFD 451
              ++ P+  S P P         P    +  SS E  +  AK   DFF+
Sbjct: 565 NAPAQIPADSSPPPPIP-----EEPTPSPTKDSSPEEIDKAAKNLADFFN 609


>gnl|CDD|187806 cd09675, Csy1_I-F, CRISPR/Cas system-associated protein Csy1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Large proteins; Predicted subunit of the Cascade
           complex; signature gene for I-F subtype; also known as
           Csy1 family.
          Length = 384

 Score = 28.6 bits (64), Expect = 8.3
 Identities = 25/115 (21%), Positives = 38/115 (33%), Gaps = 23/115 (20%)

Query: 385 GPPSYLSYPKPPVLNNGFSHPVQVS------RPPSYLSYPKPPVLNNGFSHPVQVSSNSS 438
               Y  Y    V   G + P  +S      +  SYL    PPV  N    P++ SS   
Sbjct: 187 HQKRYADYLDLAVQKLGGTKPQNISQLNSSRKGRSYLLPSLPPVWKNIDKPPIKFSS--- 243

Query: 439 SSEFNSEAKDFFDSILNNNSTTYPSVAQRNGVFAS-KPSTSTY---ASRAAFLNE 489
                       +S   + S    ++ Q   +    KP  + Y     R A L++
Sbjct: 244 ----------ILESRFFSKSLARRTLDQLKILLLVVKPKENNYDIRNMRLAALDQ 288


>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
          Length = 568

 Score = 28.7 bits (64), Expect = 8.8
 Identities = 12/53 (22%), Positives = 19/53 (35%)

Query: 433 VSSNSSSSEFNSEAKDFFDSILNNNSTTYPSVAQRNGVFASKPSTSTYASRAA 485
               S +   NSEA  F     +  S+   +  Q         +   YA++AA
Sbjct: 197 KQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAA 249


>gnl|CDD|240111 cd04794, euk_LANCL, eukaryotic Lanthionine synthetase C-like
           protein. This family contains the lanthionine synthetase
           C-like proteins 1 and 2 which are related to the
           bacterial lanthionine synthetase components C (LanC).
           LANCL1 and LANCL2 (testes-specific adriamycin
           sensitivity protein) are thought to be peptide-modifying
           enzyme components in eukaryotic cells. Both proteins are
           produced in large quantities in the brain and testes and
           may have role in the immune surveillance of these
           organs.
          Length = 343

 Score = 28.8 bits (65), Expect = 9.1
 Identities = 14/52 (26%), Positives = 21/52 (40%), Gaps = 2/52 (3%)

Query: 73  YIKAHKGYIKAHVDSVRFCGNTIAGLSLLSDSVMKLVDEKTKTQEILVLLKQ 124
           YIKA    I         CGN  AG+  +  +V   + ++ +  E L    Q
Sbjct: 34  YIKASLRAIPDDRRISFLCGN--AGVYAVGAAVYHYLKDEDEVDEFLSKFLQ 83


>gnl|CDD|218262 pfam04782, DUF632, Protein of unknown function (DUF632).  This
           plant protein may be a leucine zipper, but there is no
           experimental evidence for this.
          Length = 309

 Score = 28.5 bits (64), Expect = 9.3
 Identities = 18/72 (25%), Positives = 27/72 (37%), Gaps = 14/72 (19%)

Query: 26  IHGFRETERS--KWNEENTKIIARVQNLAFPPNVTP---IQYVHVLDLEQK--------- 71
           IHG     +S  + +E    II   + L   P+  P         L LE +         
Sbjct: 172 IHGLTRMWKSMLECHEIQLHIIQEAKLLDSSPSGKPTSESHRQATLQLEAELREWHSSFS 231

Query: 72  GYIKAHKGYIKA 83
            +I A + Y+KA
Sbjct: 232 NWIDAQRSYVKA 243


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 28.5 bits (63), Expect = 9.6
 Identities = 31/158 (19%), Positives = 49/158 (31%), Gaps = 5/158 (3%)

Query: 231 FVPKNLQPQPEKQEEASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSNYHKAFS 290
              ++  P+   +   +      +  K+     +    +S     S P+ PS    ++  
Sbjct: 140 ICQRDYNPRDRPKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQS-- 197

Query: 291 FHGKSQDTSNSFDFLTSSSTHPTHTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNHSQPP 350
               +Q    +      SST P  T G TTSS      +EP PS +    S     S   
Sbjct: 198 -QPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEP-QTEPPPSQRGPSGSPQHPPSTTS 255

Query: 351 PLSPRKLNNTESKQNLPLNNLPVNAPRLPLVPTVGPPS 388
                  +  E  Q         N  R P      PP+
Sbjct: 256 QDQSTTGDGQEHTQRRKTPPATSN-RRSPHSTATPPPT 292


>gnl|CDD|237271 PRK13016, PRK13016, dihydroxy-acid dehydratase; Provisional.
          Length = 577

 Score = 28.5 bits (64), Expect = 9.9
 Identities = 11/32 (34%), Positives = 13/32 (40%)

Query: 277 PPSRPSSNYHKAFSFHGKSQDTSNSFDFLTSS 308
           P  R    Y   FS H +  D    FDFL + 
Sbjct: 535 PERRYERGYGWMFSQHVEQADKGCDFDFLETG 566


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.129    0.383 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 27,934,309
Number of extensions: 2610804
Number of successful extensions: 1909
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1857
Number of HSP's successfully gapped: 93
Length of query: 569
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 467
Effective length of database: 6,413,494
Effective search space: 2995101698
Effective search space used: 2995101698
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 62 (27.8 bits)