RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9363
(421 letters)
>gnl|CDD|214997 smart01052, CAP_GLY, Cytoskeleton-associated proteins (CAPs) are
involved in the organisation of microtubules and
transportation of vesicles and organelles along the
cytoskeletal network. A conserved motif, CAP-Gly, has
been identified in a number of CAPs, including CLIP-170
and dynactins. The crystal structure of Caenorhabditis
elegans F53F4.3 protein CAP-Gly domain was recently
solved. The domain contains three beta-strands. The most
conserved sequence, GKNDG, is located in two consecutive
sharp turns on the surface, forming the entrance to a
groove.
Length = 68
Score = 86.1 bits (214), Expect = 4e-21
Identities = 30/68 (44%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 185 IGLRV-CDSEGHRGTIRYVGSVEGTQGVWYGIDWDSETRGKHDGSHNGVKYFWTHSTTSG 243
+G RV G RGT+RYVG GVW G++ D RGK+DGS GV+YF G
Sbjct: 1 VGDRVEVGGGGRRGTVRYVGPTPFAPGVWVGVELDEPLRGKNDGSVKGVRYF-ECPPKHG 59
Query: 244 SFMRRDKL 251
F+R K+
Sbjct: 60 IFVRPSKV 67
>gnl|CDD|216424 pfam01302, CAP_GLY, CAP-Gly domain. Cytoskeleton-associated
proteins (CAPs) are involved in the organisation of
microtubules and transportation of vesicles and
organelles along the cytoskeletal network. A conserved
motif, CAP-Gly, has been identified in a number of CAPs,
including CLIP-170 and dynactins. The crystal structure
of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain
was recently solved. The domain contains three
beta-strands. The most conserved sequence, GKNDG, is
located in two consecutive sharp turns on the surface,
forming the entrance to a groove.
Length = 67
Score = 85.3 bits (212), Expect = 8e-21
Identities = 31/68 (45%), Positives = 41/68 (60%), Gaps = 3/68 (4%)
Query: 185 IGLRV-CDSEGHRGTIRYVGSVEGTQGVWYGIDWDSETRGKHDGSHNGVKYFWTHSTTSG 243
+G RV G RGT+RYVG V G+W G++ D E RGK+DGS +GV+YF G
Sbjct: 1 VGDRVEVLGGGRRGTVRYVGPVPFAPGLWVGVELD-EPRGKNDGSVDGVRYFECPP-KYG 58
Query: 244 SFMRRDKL 251
F+R K+
Sbjct: 59 IFVRPSKV 66
>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat.
Length = 60
Score = 43.7 bits (104), Expect = 4e-06
Identities = 20/64 (31%), Positives = 32/64 (50%), Gaps = 10/64 (15%)
Query: 350 VTCLKLPSNRITTLDSVPGMF---SSLEELYLQENNI--VDWGEVNALGSLPNLKYLNLA 404
+ L L +NR+T + G F +L+ L L NN+ + A LP+L+ L+L+
Sbjct: 2 LKSLDLSNNRLTVIP--DGAFKGLPNLKVLDLSGNNLTSISP---EAFSGLPSLRSLDLS 56
Query: 405 STNL 408
NL
Sbjct: 57 GNNL 60
Score = 32.9 bits (76), Expect = 0.022
Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 5/42 (11%)
Query: 372 SLEELYLQENNI--VDWGEVNALGSLPNLKYLNLASTNLRNI 411
+L+ L L N + + G A LPNLK L+L+ NL +I
Sbjct: 1 NLKSLDLSNNRLTVIPDG---AFKGLPNLKVLDLSGNNLTSI 39
>gnl|CDD|227569 COG5244, NIP100, Dynactin complex subunit involved in mitotic
spindle partitioning in anaphase B [Cell division and
chromosome partitioning].
Length = 669
Score = 47.8 bits (113), Expect = 7e-06
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 194 GHRGTIRYVGSVEGTQGVWYGIDWDSETRGKHDGSHNGVKYFWTHSTTSGSFMRRDKLN 252
GT+R++G + G+W G++ D + GK+DGS NGV+YF G F+R D +
Sbjct: 14 DKFGTVRFIGKTKFKDGIWIGLELD-DPVGKNDGSVNGVRYFHCKKRH-GIFIRPDDDS 70
>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies). Leucine rich
repeats are short sequence motifs present in a number of
proteins with diverse functions and cellular locations.
These repeats are usually involved in protein-protein
interactions. Each Leucine Rich Repeat is composed of a
beta-alpha unit. These units form elongated non-globular
structures. Leucine Rich Repeats are often flanked by
cysteine rich domains.
Length = 43
Score = 42.1 bits (100), Expect = 8e-06
Identities = 14/41 (34%), Positives = 23/41 (56%), Gaps = 3/41 (7%)
Query: 372 SLEELYLQENNIVDWGEVNALGSLPNLKYLNLASTNLRNIK 412
+LE L L N I D + L +LPNL+ L+L+ + ++
Sbjct: 2 NLETLDLSNNQITD---LPPLSNLPNLETLDLSGNKITDLS 39
Score = 29.8 bits (68), Expect = 0.21
Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 1/33 (3%)
Query: 353 LKLPSNRITTLDSVPGMFSSLEELYLQENNIVD 385
L L +N+IT L + + +LE L L N I D
Sbjct: 6 LDLSNNQITDLPPLSNL-PNLETLDLSGNKITD 37
>gnl|CDD|227223 COG4886, COG4886, Leucine-rich repeat (LRR) protein [Function
unknown].
Length = 394
Score = 38.4 bits (89), Expect = 0.005
Identities = 26/76 (34%), Positives = 45/76 (59%), Gaps = 3/76 (3%)
Query: 337 IWADILKLLANFPVTCLKLPSNRITTLDSVPGMF-SSLEELYLQENNIVDWGEVNALGSL 395
+ ++I +LL +T L L +N IT + + G+ S+L+EL L +N I + L +L
Sbjct: 105 LRSNISELLELTNLTSLDLDNNNITDIPPLIGLLKSNLKELDLSDNKIES--LPSPLRNL 162
Query: 396 PNLKYLNLASTNLRNI 411
PNLK L+L+ +L ++
Sbjct: 163 PNLKNLDLSFNDLSDL 178
Score = 35.7 bits (82), Expect = 0.035
Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 325 HIHTLTLGNMGYIWADILKLLANFP-VTCLKLPSNRITTLDSVPGMFSSLEELYLQENNI 383
+ L L N I ++L L+N ++ L+L +N++ L G S+LE L L N I
Sbjct: 210 ALEELDLSNNSII--ELLSSLSNLKNLSGLELSNNKLEDLPESIGNLSNLETLDLSNNQI 267
Query: 384 VDWGEVNALGSLPNLKYLNLASTNLRNI 411
++LGSL NL+ L+L+ +L N
Sbjct: 268 SSI---SSLGSLTNLRELDLSGNSLSNA 292
Score = 30.7 bits (69), Expect = 1.6
Identities = 21/93 (22%), Positives = 40/93 (43%), Gaps = 2/93 (2%)
Query: 320 IFPQGHIHTLTLGNMGYIWADILKLLANFPVTCLKLPSNRITTLDSVPGMFSSLEELYLQ 379
+ ++ +L L N L L + L L N+I +L S +L+ L L
Sbjct: 112 LLELTNLTSLDLDNNNITDIPPLIGLLKSNLKELDLSDNKIESLPSPLRNLPNLKNLDLS 171
Query: 380 ENNIVDWGEVNALGSLPNLKYLNLASTNLRNIK 412
N++ D ++ + L NL L+L+ + ++
Sbjct: 172 FNDLSDLPKLLSN--LSNLNNLDLSGNKISDLP 202
Score = 29.9 bits (67), Expect = 2.3
Identities = 18/54 (33%), Positives = 33/54 (61%), Gaps = 2/54 (3%)
Query: 358 NRITTLDSVPGMFSSLEELYLQENNIVDWGEVNALGSLPNLKYLNLASTNLRNI 411
N+I+ L + S+LEEL L N+I++ +++L +L NL L L++ L ++
Sbjct: 196 NKISDLPPEIELLSALEELDLSNNSIIE--LLSSLSNLKNLSGLELSNNKLEDL 247
>gnl|CDD|235394 PRK05297, PRK05297, phosphoribosylformylglycinamidine synthase;
Provisional.
Length = 1290
Score = 35.5 bits (83), Expect = 0.066
Identities = 35/135 (25%), Positives = 46/135 (34%), Gaps = 28/135 (20%)
Query: 45 LKHLRHLNLRIRLKSSSAPAHSYLAEVVFFVHLGRTLCDEDFVKEGSLLHYLPKLSSLRF 104
L R L RL+++ P S AE V F L L E+ K LL Y P
Sbjct: 10 LSAFRLQKLLARLQAAVLPVTSIYAEYVHFADLSAPLSAEEQAKLERLLTYGP------- 62
Query: 105 TKNPILAEERVV----------SSREKTIARLGGLKLLNGSAIERQERQGSEYDYIKEFG 154
++ +V SS+ IA GL + I R ER G Y
Sbjct: 63 AEHEPAGRLFLVTPRPGTISPWSSKATDIAHNCGL-----AGIRRIER-GIAYYVEAA-- 114
Query: 155 AVWLDEKRRAEFLEA 169
L ++RA
Sbjct: 115 ---LSAEQRAALAAL 126
>gnl|CDD|205873 pfam13698, DUF4156, Domain of unknown function (DUF4156). The
function of this family is unknown but members are
annotated as putative lipoprotein outer membrane
proteins.
Length = 87
Score = 28.8 bits (65), Expect = 1.6
Identities = 8/21 (38%), Positives = 14/21 (66%)
Query: 200 RYVGSVEGTQGVWYGIDWDSE 220
+++G+V G+QG WY + S
Sbjct: 23 QFLGTVTGSQGNWYSGVFTSN 43
>gnl|CDD|218731 pfam05752, Calici_MSP, Calicivirus minor structural protein. This
family consists of minor structural proteins largely
from human calicivirus isolates. Human calicivirus
causes gastroenteritis. The function of this family is
unknown.
Length = 167
Score = 29.7 bits (67), Expect = 1.7
Identities = 18/79 (22%), Positives = 31/79 (39%), Gaps = 14/79 (17%)
Query: 259 EALHRKYVETDNELTVRENVEEVKASINAPFLELVGFDQVH--------EEQNTNKLPIP 310
EAL R+ + +L+V V+++++A GFD V E L P
Sbjct: 49 EALQRRGQDLSRDLSVNGPALRVQSAVDA------GFDPVSARRLAGSGERVIYGGLDRP 102
Query: 311 NDTSGVMEQIFPQGHIHTL 329
G + I H+ ++
Sbjct: 103 IRQRGTLPGIRETRHLQSM 121
>gnl|CDD|211380 cd10981, ZnPC_S1P1, Zinc dependent phospholipase C/S1-P1 nuclease.
This model describes both the bacterial and archeal
zinc-dependent phospholipase C, a domain found in the
alpha toxin of Clostridium perfringens, as well as S1/P1
nucleases, which predominantly act on single-stranded
DNA and RNA.
Length = 238
Score = 29.7 bits (67), Expect = 2.4
Identities = 29/118 (24%), Positives = 37/118 (31%), Gaps = 18/118 (15%)
Query: 121 KTIARLGGLKLLNGSAIERQERQGSEY-DYIKEFGAVWLDEKRRAEFLEANPRVGDNYFQ 179
IAR + L E D + E GAV D++R + EA ++
Sbjct: 6 HKIARDA-VFTLPPELAEFWSDLLKRNIDELLE-GAVAPDKRRYDDDGEAQ----NHVLD 59
Query: 180 VMEDDIGLRVCDSEGHRGTIRYVGSVEGTQGVWYGI--------DWDSETRGKHDGSH 229
V DD G D G T+ Y G WY DW SH
Sbjct: 60 V--DDYGDSALDKYGED-TLGYWGGAPWQIQEWYQRLVDAFRRKDWKDIAFAAGVLSH 114
>gnl|CDD|234080 TIGR02987, met_A_Alw26, type II restriction m6 adenine DNA
methyltransferase, Alw26I/Eco31I/Esp3I family. Members
of this family are the m6-adenine DNA methyltransferase
protein, or domain of a fusion protein that also carries
m5 cytosine methyltransferase activity, of type II
restriction systems of the Alw26I/Eco31I/Esp3I family. A
methyltransferase of this family is alway accompanied by
a type II restriction endonuclease from the
Alw26I/Eco31I/Esp3I family (TIGR02986) and by an
adenine-specific modification methyltransferase. Members
of this family are unusual in that regions of similarity
to homologs outside this family are circularly permuted
[DNA metabolism, Restriction/modification].
Length = 524
Score = 29.4 bits (66), Expect = 4.3
Identities = 27/122 (22%), Positives = 44/122 (36%), Gaps = 25/122 (20%)
Query: 91 SLLHYLPKLSSLRFTKNPILAEERVVSSREKTIARLGGLKLLNGSAIER--------QER 142
LL LP LSSL N + E + ++ G L+ G + R E
Sbjct: 298 KLLKKLPTLSSLEKIIN-LRGELDLTGQKDLLSKLGTGYPLIRGKHVGRFNLKNESSSED 356
Query: 143 QGSEYD-------YIKEFGAVWLD-----EKRRAEF--LEANPRVGD--NYFQVMEDDIG 186
++ +I+ VW +KRR F + N +G+ NY V +++
Sbjct: 357 IFKKFIKNSKKSRHIRMKRIVWRQCSYMKQKRRLIFCIVPPNYILGNSCNYISVKDNNEI 416
Query: 187 LR 188
Sbjct: 417 ED 418
>gnl|CDD|178354 PLN02753, PLN02753, triacylglycerol lipase.
Length = 531
Score = 29.3 bits (65), Expect = 4.6
Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 3/50 (6%)
Query: 138 ERQERQGSEYDYIKEFGAVWL---DEKRRAEFLEANPRVGDNYFQVMEDD 184
ER+ERQG D + G +WL D +RR + E R+ D + ++ +D
Sbjct: 65 ERRERQGLLIDEAEGAGELWLTAEDIRRRDKKTEEERRLRDTWRKIQGED 114
>gnl|CDD|132095 TIGR03051, PS_I_psaG_plant, photosystem I reaction center subunit
V, chloroplast.
Length = 88
Score = 27.1 bits (60), Expect = 4.7
Identities = 12/44 (27%), Positives = 18/44 (40%)
Query: 138 ERQERQGSEYDYIKEFGAVWLDEKRRAEFLEANPRVGDNYFQVM 181
+Q E + K GA D + A L++N G N V+
Sbjct: 23 RNVAKQVPEQNGPKTTGATDFDALQYASLLKSNDPAGFNLVDVL 66
>gnl|CDD|235756 PRK06259, PRK06259, succinate dehydrogenase/fumarate reductase
iron-sulfur subunit; Provisional.
Length = 486
Score = 28.4 bits (64), Expect = 8.2
Identities = 12/38 (31%), Positives = 16/38 (42%), Gaps = 5/38 (13%)
Query: 269 DNELTVRENVEEVKASI---NAPFLELVGFDQVHEEQN 303
L VRENV + S+ FLE V ++ N
Sbjct: 223 PAHLEVRENVLKTGRSVPKEKPSFLEEV--SDIYPYGN 258
>gnl|CDD|226839 COG4408, COG4408, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 431
Score = 28.2 bits (63), Expect = 8.3
Identities = 10/46 (21%), Positives = 22/46 (47%)
Query: 310 PNDTSGVMEQIFPQGHIHTLTLGNMGYIWADILKLLANFPVTCLKL 355
P + +++P+G I + +M +W + ++LL V + L
Sbjct: 233 PEQRPQYVYKLYPEGPITPALIRDMRGLWKEYMRLLNRLGVEEINL 278
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.136 0.403
Gapped
Lambda K H
0.267 0.0787 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,640,272
Number of extensions: 2108770
Number of successful extensions: 1614
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1603
Number of HSP's successfully gapped: 29
Length of query: 421
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 322
Effective length of database: 6,546,556
Effective search space: 2107991032
Effective search space used: 2107991032
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (26.8 bits)