RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy949
(569 letters)
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal
structure and biogenesis].
Length = 777
Score = 42.8 bits (101), Expect = 4e-04
Identities = 47/242 (19%), Positives = 74/242 (30%), Gaps = 26/242 (10%)
Query: 166 RNTPDPSLIDSSSDTSTNNNPYASMFYIFFQPRLPEPDLIRLDSS-DDVLPTSSQCAGVE 224
N S I++ + +S+ P L L + ++ SS A
Sbjct: 162 TNQSGSSFINTPASSSSQPLTNLV-----VSSIKRFPYLTSLSPFFNYLIDPSSDSATAS 216
Query: 225 NPLYPYFVPKNLQPQPEKQEEASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSN 284
P F P S +N S S + ++ N+ S + +S
Sbjct: 217 ADTSPSFNPPP---------NLSPNNLFSTSDLSPLPDTQSVENNIILNSSSSINELTSI 267
Query: 285 YHKAFSFHGKSQDTSNSFDFLTSSSTHPTHTN-----GFTTSSISSTFPSEPLPSSQPKV 339
Y S S FL SS+ + T S S+ + LP S P
Sbjct: 268 YGSVPSIRNLRGLNSALVSFLNVSSSSLAFSALNGKEVSPTGSPSTRSFARVLPKSSPNN 327
Query: 340 LST-----FMNHSQP-PPLSPRKLNNTESKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYP 393
L T +N Q P L +T + +L + +N + T+ S L Y
Sbjct: 328 LLTEILTTGVNPPQSLPSLLNPVFLSTSTGFSLTNLSGYLNPNKNLKKNTLSSLSNLGYS 387
Query: 394 KP 395
Sbjct: 388 SN 389
Score = 33.2 bits (76), Expect = 0.43
Identities = 53/248 (21%), Positives = 86/248 (34%), Gaps = 22/248 (8%)
Query: 246 ASGSNSSSNSFKQNMDFF---SNLSNSSSSQLKSPPSRPSSNYHKAF-SFHGKSQDTSNS 301
S + SSS S Q++ S+ + S+SS+ KS + S+ A S S S+
Sbjct: 84 WSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSSTASSM 143
Query: 302 FDFLTSSSTHPTHTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNHSQPPPLSPRKLNNTE 361
F+ +P H+N TT+ S+F + P SS + +S K
Sbjct: 144 FNSNKLPLPNPNHSNSATTNQSGSSFINTPASSSSQPL--------TNLVVSSIKRFPYL 195
Query: 362 SKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYPKPPVLNNGFSHPVQVSRPPSYLSYPKPP 421
+ P N ++ P S P P + N +S P S
Sbjct: 196 T-SLSPFFNYLID-PSSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSVENNI 253
Query: 422 VLN------NGFSHPVQVSSNSSSSEFNSEAKDFFDSILNNNSTTYPSVAQRNGVFASKP 475
+LN S V S + NS F + +S + ++ + P
Sbjct: 254 ILNSSSSINELTSIYGSVPSIRNLRGLNSALVSFLNVSS--SSLAFSALNGKEVSPTGSP 311
Query: 476 STSTYASR 483
ST ++A
Sbjct: 312 STRSFARV 319
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 36.6 bits (84), Expect = 0.039
Identities = 37/132 (28%), Positives = 55/132 (41%), Gaps = 10/132 (7%)
Query: 314 HTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNHSQPPPLSPRKLNNTESKQNLPLNNLPV 373
+ T S S+ P +P P QP + M H +PPP +P +S ++ P P
Sbjct: 314 QPHSHTPPSQSALQPQQP-PREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPS 372
Query: 374 NAPR----LPLVPTVGPPSYLSYPKPPVLNNGFSHPVQVSRPPSYLSYP--KPPVLNNGF 427
P+ LP P + P S L PP + P+Q+ L +PPVL
Sbjct: 373 PFPQMPSNLPPPPALKPLSSLPTHHPP---SAHPPPLQLMPQSQPLQSVPAQPPVLTQSQ 429
Query: 428 SHPVQVSSNSSS 439
S P + S++ S
Sbjct: 430 SLPPKASTHPHS 441
Score = 30.0 bits (67), Expect = 4.8
Identities = 41/208 (19%), Positives = 68/208 (32%), Gaps = 20/208 (9%)
Query: 241 EKQEEASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSNYHKAFSFHGKSQDTSN 300
E + E G SS+S N + S+ + S PS PS +S S+
Sbjct: 115 EGEGEGEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQ-------DNESDSDSS 167
Query: 301 SFDFLTSSSTHPTHTNGFTTSSISSTFPSEP----LPSSQPKVLSTFMNHSQPPPLSPRK 356
+ L P+ + S P P +P + + Q P
Sbjct: 168 AQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLI 227
Query: 357 LNNTESKQNLPLNNLPV-------NAPRLPLVPTVGPPSYLSYPKPPVLNNGFSHPVQVS 409
+ Q LP + P+ +P+ P + P S P PP+ + PV +
Sbjct: 228 SAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFL- 286
Query: 410 RPPSYLSYPKPPVLNNGFSHPVQVSSNS 437
+ P+P L P+ + S +
Sbjct: 287 -QHPSSNPPQPFGLAQSQVPPLPLPSQA 313
>gnl|CDD|225687 COG3145, AlkB, Alkylated DNA repair protein [DNA replication,
recombination, and repair].
Length = 194
Score = 34.3 bits (79), Expect = 0.086
Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 9/84 (10%)
Query: 81 IKAHVD-SVRFCGNTIAGLSLLSDSVMKLVDEKTKTQEILVLLKQRSLYVMKDDARYKFT 139
I H D +A LSL + + +L + + + + L+ + VM +R +
Sbjct: 119 IGWHQDKDEEDDRPPVASLSLGAPCIFRLRGRRRRGPGLRLRLEHGDVVVMGGPSRLAWH 178
Query: 140 HEVLENERSYFGDLFVPRGRRISV 163
H + + R G+RI++
Sbjct: 179 HIIPKTSRL--------TGQRINL 194
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 33.5 bits (77), Expect = 0.23
Identities = 20/129 (15%), Positives = 41/129 (31%), Gaps = 5/129 (3%)
Query: 232 VPKNLQPQPEKQEEASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSNYHKAFSF 291
P + ++ + + +NS ++ ++ D S + S PS PS +
Sbjct: 149 NPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPS 208
Query: 292 HGKSQDTSNSF-----DFLTSSSTHPTHTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNH 346
+SF S T+P+ G PL +++P S
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATP 268
Query: 347 SQPPPLSPR 355
+ ++
Sbjct: 269 APIGGITLD 277
Score = 29.7 bits (67), Expect = 4.5
Identities = 15/110 (13%), Positives = 27/110 (24%)
Query: 328 PSEPLPSSQPKVLSTFMNHSQPPPLSPRKLNNTESKQNLPLNNLPVNAPRLPLVPTVGPP 387
P+ P + + + + + + P ++ P P
Sbjct: 148 PNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSP 207
Query: 388 SYLSYPKPPVLNNGFSHPVQVSRPPSYLSYPKPPVLNNGFSHPVQVSSNS 437
S S P P + P P + P P V S +
Sbjct: 208 SDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTA 257
Score = 28.5 bits (64), Expect = 9.3
Identities = 22/102 (21%), Positives = 30/102 (29%), Gaps = 1/102 (0%)
Query: 295 SQDTSNSFDFLTSSSTHPTHTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNHSQPPPLSP 354
S ++ S S P+ + G + PS P SS P + PPP
Sbjct: 171 SFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA-PSSFQSDTPPPSPE 229
Query: 355 RKLNNTESKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYPKPP 396
N + PV PPS + P P
Sbjct: 230 SPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPI 271
>gnl|CDD|217538 pfam03405, FA_desaturase_2, Fatty acid desaturase.
Length = 330
Score = 33.6 bits (77), Expect = 0.28
Identities = 13/26 (50%), Positives = 13/26 (50%)
Query: 173 LIDSSSDTSTNNNPYASMFYIFFQPR 198
LI S D T NNPY Y FQ R
Sbjct: 140 LIGSGMDPGTENNPYLGFIYTSFQER 165
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation
of crystallites during the secretory stage of tooth
enamel development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 31.8 bits (72), Expect = 0.53
Identities = 22/75 (29%), Positives = 30/75 (40%), Gaps = 6/75 (8%)
Query: 328 PSEPLPSSQPKVLSTFMNHSQPPP----LSPRKLNNTESKQNLPLNNLPVNAPRLPLVPT 383
P P+P P V T PP L P N + +Q P + P P+ P P
Sbjct: 82 PMMPVPGHHPMVPMTGQQPHLQPPAQHPLQPTYGQNPQPQQ--PTHTQPPVQPQQPADPQ 139
Query: 384 VGPPSYLSYPKPPVL 398
G P + P PP++
Sbjct: 140 PGQPMFPMQPLPPLV 154
>gnl|CDD|222204 pfam13532, 2OG-FeII_Oxy_2, 2OG-Fe(II) oxygenase superfamily.
Length = 190
Score = 31.1 bits (71), Expect = 1.1
Identities = 18/87 (20%), Positives = 32/87 (36%), Gaps = 5/87 (5%)
Query: 78 KGYIKAHVD-SVRFCGNTIAGLSLLSDSVMKLVDEKTKTQEILVLLKQRSLYVMKDDARY 136
+ H D G I +SL ++ + + I + L+ + VM +R
Sbjct: 105 GARMGLHQDKDELDFGAPIVSVSLGDPALFRFGGKTRDDPTIALPLESGDVLVMGGPSRL 164
Query: 137 KFTHEVLENERSYFGDLFVPRGRRISV 163
+ H V R G + G RI++
Sbjct: 165 AY-HGVP---RIKPGTHPLLGGGRINL 187
>gnl|CDD|227820 COG5533, UBP5, Ubiquitin C-terminal hydrolase [Posttranslational
modification, protein turnover, chaperones].
Length = 415
Score = 31.5 bits (71), Expect = 1.2
Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 10/80 (12%)
Query: 296 QDTSNSFDFLTSSSTHPTHTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNHSQPPPL--- 352
D+S +DFL S+ HP+ ++ S K+ S + P
Sbjct: 1 MDSSKIYDFLKSAPFHPSMVICYSLERYFEDV------SLAYKLYSMLRSLKLDPHFMEL 54
Query: 353 -SPRKLNNTESKQNLPLNNL 371
+P+K+ ++ K NLP N L
Sbjct: 55 ANPKKMVVSKRKDNLPPNGL 74
>gnl|CDD|218573 pfam05387, Chorion_3, Chorion family 3. This family consists of
several Drosophila chorion proteins S36 and S38. The
chorion genes of Drosophila are amplified in response to
developmental signals in the follicle cells of the
ovary.
Length = 277
Score = 31.2 bits (70), Expect = 1.2
Identities = 35/190 (18%), Positives = 55/190 (28%), Gaps = 16/190 (8%)
Query: 305 LTSSSTHPTHTNGFTTSSISS--TFPSEPLPSSQPKVLSTFMN----HSQPPPLSPRKLN 358
L + + +P + +++I+ P Q K+ H L P N
Sbjct: 84 LQALNANPPYQKLKNSNTIAESLAEPQLASNIRQGKIKVVSPQFVDQHLFRSLLVPSGHN 143
Query: 359 NTESKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYPKPPVLNNGFS----------HPVQV 408
N + PL + V P P V P + P P + S P +
Sbjct: 144 NHQVIATQPLPPIIVKQPGAPPKVLVNGPPLVVKPAPVIYKIKPSVIYQQEVINKVPTPL 203
Query: 409 SRPPSYLSYPKPPVLNNGFSHPVQVSSNSSSSEFNSEAKDFFDSILNNNSTTYPSVAQRN 468
S P Y+ KP P S S SE + ++ Y + +
Sbjct: 204 SLNPVYVKVYKPGKKIEAPLVPEVQQVYSQPSYGGSEYSQPREQASPSSQPGYGAAGAAS 263
Query: 469 GVFASKPSTS 478
S S
Sbjct: 264 SNAGGASSAS 273
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 31.2 bits (70), Expect = 1.7
Identities = 25/121 (20%), Positives = 37/121 (30%), Gaps = 16/121 (13%)
Query: 328 PSEPL--PSSQPKVLSTFMNHSQPPPLSPRKLNNTESKQNLPLNNL-PVNAPRLPLVPTV 384
P EPL P +P Q P++ + + P + P + P P
Sbjct: 740 PHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA 799
Query: 385 GPPSYLSYPKPPVLNNGFSHPVQVSRPPSYLSYPKPPV-------------LNNGFSHPV 431
P Y +P + P Q P P+ PV + NG S P+
Sbjct: 800 PQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPL 859
Query: 432 Q 432
Sbjct: 860 H 860
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 30.7 bits (69), Expect = 2.4
Identities = 27/129 (20%), Positives = 44/129 (34%), Gaps = 13/129 (10%)
Query: 225 NPLYPYFVPKNLQPQPEKQEEASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSN 284
P P P + A S+ S+ + + + + S S+S + S
Sbjct: 80 ALPMPNLAPI--TPPSSEPTPAPPSSESTATRTPDPNQQALESTESTST--TSADCNDSE 135
Query: 285 YHKAFSFHGKSQDTSNSFDFLTSSSTHPTH--TNGFTTSSISSTFPSEPLPSSQPKVLST 342
+ S DTS S SS P+ GF+ S ISS++ S + P +
Sbjct: 136 QSSTPN--LNSSDTSTS-----SSGALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKS 188
Query: 343 FMNHSQPPP 351
+ P
Sbjct: 189 AEPTAAPQA 197
>gnl|CDD|191511 pfam06409, NPIP, Nuclear pore complex interacting protein (NPIP).
This family consists of a series of primate specific
nuclear pore complex interacting protein (NPIP)
sequences. The function of this family is unknown but is
well conserved from African apes to humans.
Length = 350
Score = 30.2 bits (67), Expect = 3.4
Identities = 30/124 (24%), Positives = 44/124 (35%), Gaps = 6/124 (4%)
Query: 324 SSTFPSEPLPSSQPKVLSTFMNHSQPPPLSPRKL-NNTESKQNLP---LNNLPVNAPRLP 379
SS P P +++ L M H PPP + +N+ S + P L LP +A
Sbjct: 225 SSGLPYWPYLTAE--TLKNRMGHQPPPPTQQHSIIDNSLSLKTPPECLLTPLPPSALPSA 282
Query: 380 LVPTVGPPSYLSYPKPPVLNNGFSHPVQVSRPPSYLSYPKPPVLNNGFSHPVQVSSNSSS 439
P L P PP ++ +PP L L P+ S+ S
Sbjct: 283 DDNLKTPAECLLRPLPPSADDNIKECPLALKPPPPLPPSVDDNLKECLLAPLPPSALPPS 342
Query: 440 SEFN 443
+ N
Sbjct: 343 VDDN 346
>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
Length = 1213
Score = 30.3 bits (68), Expect = 3.7
Identities = 11/71 (15%), Positives = 19/71 (26%), Gaps = 3/71 (4%)
Query: 432 QVSSNSSSSEFNSEAKDFFDSILNNNSTTYPSVAQRNGVFASKPSTSTYASRAAFLNEQL 491
QVS +S S F+ + L + N ++ +
Sbjct: 1084 QVSPHSQISLFDRGLANLLSDTLQWVKLKKRKL---NDGIVKGLKEQGDSATNQISLDDE 1140
Query: 492 GPAVPSIAPKK 502
A + KK
Sbjct: 1141 LNAYQAEWSKK 1151
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 30.2 bits (68), Expect = 4.0
Identities = 34/137 (24%), Positives = 50/137 (36%), Gaps = 15/137 (10%)
Query: 246 ASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSNYHKAFSFHGKSQDTSNSFDFL 305
A+ S SS NS + D FS S+S S S SR S G + L
Sbjct: 98 ATSSTSSLNSN--DGDQFSPASDSLSFNPSSTQSRKDSGPG-----DGSPVQKRKNP-LL 149
Query: 306 TSSSTHPTHTNGFTTSSISSTFPS------EPLPSSQPKVLSTFMNHSQPPPLSPRKLNN 359
SSSTH TH T + S + + +PS + + H + S +N
Sbjct: 150 PSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSD-TSN 208
Query: 360 TESKQNLPLNNLPVNAP 376
S ++ N+ +
Sbjct: 209 GFSYPSIRKNSRHSSNS 225
>gnl|CDD|226880 COG4474, COG4474, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 180
Score = 28.9 bits (65), Expect = 4.8
Identities = 18/75 (24%), Positives = 29/75 (38%), Gaps = 7/75 (9%)
Query: 14 KLLYEYDHWDDAIHGFRETERSKWNEENTKIIARVQNLA-FPPNVTPIQYVHVLDLEQKG 72
+L EY H A+ E + WNE+N AR+ A F ++T Y E
Sbjct: 64 ELKEEYPHIKLAVITPFEEQGKNWNEDNQMKYARILKAADFVKSITERPY------ESPA 117
Query: 73 YIKAHKGYIKAHVDS 87
+ + ++ D
Sbjct: 118 QFRQYNQFLLEKSDG 132
>gnl|CDD|227785 COG5498, ACF2, Predicted glycosyl hydrolase [Cell envelope
biogenesis, outer membrane].
Length = 760
Score = 29.4 bits (66), Expect = 5.8
Identities = 27/117 (23%), Positives = 35/117 (29%), Gaps = 15/117 (12%)
Query: 358 NNTESKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYPKPPVLNNGFSHPVQVSRPPSYLSY 417
+ PL L V P + P S +S PK V PP
Sbjct: 14 ATMKGANVQPLFALTVIILLFPTGASASPHS-VSVPK---------GDSNVEFPPILSGT 63
Query: 418 PKPPVLNNGFSHPVQVSSNSSSSEFNSEAKDFFDSIL----NNNSTTYP-SVAQRNG 469
PP HPV S S + F+ ++L N +P SV R
Sbjct: 64 INPPSTFRSVQHPVPPPGVRSDSTGPIQTNKFYGNLLVGSRTNPQYPHPYSVWWRAS 120
>gnl|CDD|183966 PRK13309, ureC, urease subunit alpha; Reviewed.
Length = 572
Score = 29.5 bits (66), Expect = 6.2
Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 6/29 (20%)
Query: 202 PDLIRLDSSDDVLPTSSQCAGVENPLYPY 230
PD+I++ S +VLP+S+ NP PY
Sbjct: 287 PDIIKVASQTNVLPSST------NPTLPY 309
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
Provisional.
Length = 576
Score = 29.2 bits (65), Expect = 6.5
Identities = 17/85 (20%), Positives = 31/85 (36%), Gaps = 1/85 (1%)
Query: 328 PSEPLPSSQPKVLSTFMNHSQPPPLSPRKLNNTESKQNLPLNNLPVNAPRLPLVPTVGPP 387
P P + P ++ + P +P + E + N+P ++ +P+ P
Sbjct: 359 NVPPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPEDSNKEVPEDVPMEPEDDRD 418
Query: 388 SYLSYPKPPVLNNGFSHPVQVSRPP 412
+ + PK P N G V P
Sbjct: 419 NNFNEPKKPE-NKGDGQNEPVIPKP 442
>gnl|CDD|233930 TIGR02564, cas_Csy1, CRISPR type I-F/YPEST-associated protein Csy1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) is a widespread family of
prokaryotic direct repeats with spacers of unique
sequence between consecutive repeats. This protein
family, typified by YPO2465 of Yersinia pestis, is a
CRISPR-associated (Cas) family strictly associated with
the Ypest subtype of CRISPR/Cas locus. This family is
designated Csy1, for CRISPR/Cas Subtype Ypest protein 1
[Mobile and extrachromosomal element functions, Other].
Length = 384
Score = 29.0 bits (65), Expect = 7.6
Identities = 25/115 (21%), Positives = 38/115 (33%), Gaps = 23/115 (20%)
Query: 385 GPPSYLSYPKPPVLNNGFSHPVQVS------RPPSYLSYPKPPVLNNGFSHPVQVSSNSS 438
Y Y V G + P +S + SYL PPV N P++ SS
Sbjct: 188 HQKRYADYLDLAVQKLGGTKPQNISQLNSSRKGRSYLLPSLPPVWKNIDKPPIKFSS--- 244
Query: 439 SSEFNSEAKDFFDSILNNNSTTYPSVAQRNGVFAS-KPSTSTY---ASRAAFLNE 489
+S + S ++ Q + KP + Y R A L++
Sbjct: 245 ----------ILESRFFSKSLARRTLDQLKILLLVVKPKENNYDIRNMRLAALDQ 289
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 29.2 bits (66), Expect = 7.9
Identities = 24/110 (21%), Positives = 35/110 (31%), Gaps = 17/110 (15%)
Query: 354 PRKLN--------NTESKQNLPLNNLPVNAPRLPLVPTVGPPSYLSYPKPPVLNNGFSHP 405
KLN + +K P P P +P + P PP +
Sbjct: 505 SIKLNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASS 564
Query: 406 VQVSRPPSYLSYPKPPVLNNGFSHPVQVSSNSSSSEF-NSEAK---DFFD 451
++ P+ S P P P + SS E + AK DFF+
Sbjct: 565 NAPAQIPADSSPPPPIP-----EEPTPSPTKDSSPEEIDKAAKNLADFFN 609
>gnl|CDD|187806 cd09675, Csy1_I-F, CRISPR/Cas system-associated protein Csy1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Large proteins; Predicted subunit of the Cascade
complex; signature gene for I-F subtype; also known as
Csy1 family.
Length = 384
Score = 28.6 bits (64), Expect = 8.3
Identities = 25/115 (21%), Positives = 38/115 (33%), Gaps = 23/115 (20%)
Query: 385 GPPSYLSYPKPPVLNNGFSHPVQVS------RPPSYLSYPKPPVLNNGFSHPVQVSSNSS 438
Y Y V G + P +S + SYL PPV N P++ SS
Sbjct: 187 HQKRYADYLDLAVQKLGGTKPQNISQLNSSRKGRSYLLPSLPPVWKNIDKPPIKFSS--- 243
Query: 439 SSEFNSEAKDFFDSILNNNSTTYPSVAQRNGVFAS-KPSTSTY---ASRAAFLNE 489
+S + S ++ Q + KP + Y R A L++
Sbjct: 244 ----------ILESRFFSKSLARRTLDQLKILLLVVKPKENNYDIRNMRLAALDQ 288
>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
Length = 568
Score = 28.7 bits (64), Expect = 8.8
Identities = 12/53 (22%), Positives = 19/53 (35%)
Query: 433 VSSNSSSSEFNSEAKDFFDSILNNNSTTYPSVAQRNGVFASKPSTSTYASRAA 485
S + NSEA F + S+ + Q + YA++AA
Sbjct: 197 KQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAA 249
>gnl|CDD|240111 cd04794, euk_LANCL, eukaryotic Lanthionine synthetase C-like
protein. This family contains the lanthionine synthetase
C-like proteins 1 and 2 which are related to the
bacterial lanthionine synthetase components C (LanC).
LANCL1 and LANCL2 (testes-specific adriamycin
sensitivity protein) are thought to be peptide-modifying
enzyme components in eukaryotic cells. Both proteins are
produced in large quantities in the brain and testes and
may have role in the immune surveillance of these
organs.
Length = 343
Score = 28.8 bits (65), Expect = 9.1
Identities = 14/52 (26%), Positives = 21/52 (40%), Gaps = 2/52 (3%)
Query: 73 YIKAHKGYIKAHVDSVRFCGNTIAGLSLLSDSVMKLVDEKTKTQEILVLLKQ 124
YIKA I CGN AG+ + +V + ++ + E L Q
Sbjct: 34 YIKASLRAIPDDRRISFLCGN--AGVYAVGAAVYHYLKDEDEVDEFLSKFLQ 83
>gnl|CDD|218262 pfam04782, DUF632, Protein of unknown function (DUF632). This
plant protein may be a leucine zipper, but there is no
experimental evidence for this.
Length = 309
Score = 28.5 bits (64), Expect = 9.3
Identities = 18/72 (25%), Positives = 27/72 (37%), Gaps = 14/72 (19%)
Query: 26 IHGFRETERS--KWNEENTKIIARVQNLAFPPNVTP---IQYVHVLDLEQK--------- 71
IHG +S + +E II + L P+ P L LE +
Sbjct: 172 IHGLTRMWKSMLECHEIQLHIIQEAKLLDSSPSGKPTSESHRQATLQLEAELREWHSSFS 231
Query: 72 GYIKAHKGYIKA 83
+I A + Y+KA
Sbjct: 232 NWIDAQRSYVKA 243
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 28.5 bits (63), Expect = 9.6
Identities = 31/158 (19%), Positives = 49/158 (31%), Gaps = 5/158 (3%)
Query: 231 FVPKNLQPQPEKQEEASGSNSSSNSFKQNMDFFSNLSNSSSSQLKSPPSRPSSNYHKAFS 290
++ P+ + + + K+ + +S S P+ PS ++
Sbjct: 140 ICQRDYNPRDRPKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQS-- 197
Query: 291 FHGKSQDTSNSFDFLTSSSTHPTHTNGFTTSSISSTFPSEPLPSSQPKVLSTFMNHSQPP 350
+Q + SST P T G TTSS +EP PS + S S
Sbjct: 198 -QPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEP-QTEPPPSQRGPSGSPQHPPSTTS 255
Query: 351 PLSPRKLNNTESKQNLPLNNLPVNAPRLPLVPTVGPPS 388
+ E Q N R P PP+
Sbjct: 256 QDQSTTGDGQEHTQRRKTPPATSN-RRSPHSTATPPPT 292
>gnl|CDD|237271 PRK13016, PRK13016, dihydroxy-acid dehydratase; Provisional.
Length = 577
Score = 28.5 bits (64), Expect = 9.9
Identities = 11/32 (34%), Positives = 13/32 (40%)
Query: 277 PPSRPSSNYHKAFSFHGKSQDTSNSFDFLTSS 308
P R Y FS H + D FDFL +
Sbjct: 535 PERRYERGYGWMFSQHVEQADKGCDFDFLETG 566
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.129 0.383
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 27,934,309
Number of extensions: 2610804
Number of successful extensions: 1909
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1857
Number of HSP's successfully gapped: 93
Length of query: 569
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 467
Effective length of database: 6,413,494
Effective search space: 2995101698
Effective search space used: 2995101698
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 62 (27.8 bits)