RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6603
(724 letters)
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 469 bits (1210), Expect = e-159
Identities = 171/332 (51%), Positives = 233/332 (70%), Gaps = 5/332 (1%)
Query: 168 FEHSELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKH 227
+EHSELLLY++ +I++SGDLEEA++HL+ ++QI D+L V E L LKLG+ EA
Sbjct: 1 YEHSELLLYKNSLIEESGDLEEALEHLEEKEKQIVDRLAVMEMRADLLLKLGRKEEAEAT 60
Query: 228 YESLIERNQENTLYYNKLVEAKQLT----NNDDIFQLLTHYISKYPKATVPKRLSLNYVS 283
Y +L++RN EN YY L +A L + + + L KYP++ P+RL L+++
Sbjct: 61 YRALLDRNPENYDYYEGLQKALGLEISSGDIEALLALYDELAEKYPRSDAPRRLPLDFLE 120
Query: 284 GDQFRTEIDKYLRHGFHKGVPPLFVNLRSLYSNTEKCKIIQDLVESYVEALSKTGHFSLQ 343
GD+F+ D YLR KGVP LF +L+SLYS++EK I+++LVE YV +L +G FS
Sbjct: 121 GDEFKELADAYLRRMLRKGVPSLFADLKSLYSDSEKVDILEELVEGYVSSLKSSGSFSGS 180
Query: 344 DEG-EVEPASALLWVYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHA 402
D E EP S LLW ++LAQHYD+LG KAL YI+ AI+HTPTL+EL++TK RI KHA
Sbjct: 181 DSNSEKEPPSTLLWTLYFLAQHYDYLGSRDKALEYIDKAIEHTPTLVELYMTKARILKHA 240
Query: 403 GDVLEAYKWLDEAQSLDTADRYINSKCAKYMLRANLIKEAEETCSKFTREGVSAMENLNE 462
GD+ +A + ++EA+SLD ADRYINSKCAKYMLRAN +EAE+T S FTR+G + +LN+
Sbjct: 241 GDLQKAAEAMEEARSLDLADRYINSKCAKYMLRANENEEAEKTASLFTRDGTGPLADLND 300
Query: 463 MQCMWFQTECALAYQRLGRWGDTLKKCHEVDR 494
MQCMWF TE AYQR G++G LK+ H V++
Sbjct: 301 MQCMWFLTEDGEAYQRQGKYGLALKRFHAVEK 332
Score = 204 bits (521), Expect = 8e-58
Identities = 64/103 (62%), Positives = 82/103 (79%)
Query: 610 REGVSAMENLNEMQCMWFQTECALAYQRLGRWGDTLKKCHEVDRHFSEIIEDQFDFHTYC 669
R+G + +LN+MQCMWF TE AYQR G++G LK+ H V++HF EI EDQFDFH+YC
Sbjct: 289 RDGTGPLADLNDMQCMWFLTEDGEAYQRQGKYGLALKRFHAVEKHFDEIQEDQFDFHSYC 348
Query: 670 MRKMTLRSYVRLLRLEDVLRSHPFYYTAARCAIQVYLRLHDRP 712
+RKMTLR+YV +LR ED LRSHPFY+ AA AI++YL+LHD+P
Sbjct: 349 LRKMTLRAYVDMLRWEDRLRSHPFYFKAALGAIEIYLKLHDKP 391
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 46.6 bits (111), Expect = 5e-05
Identities = 105/592 (17%), Positives = 197/592 (33%), Gaps = 82/592 (13%)
Query: 42 GRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRD 101
R +EA + L D + + G L S + A+ YR A+ +NI ++
Sbjct: 173 NRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLA 232
Query: 102 LSLLQIQMRDLEGYRETRYQLFMLRPTQRASWIGFAMAYHLLHDFDMACNILEAFRKTQQ 161
L+ + I+ + E + L P + A+ +++ A R+T Q
Sbjct: 233 LATILIEAGEFEEAEKHADALLKKAPNSPLAHYLKALVDFQKKNYEDA-------RETLQ 285
Query: 162 VTNSYDFEHSELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQY 221
E+ LL G+LE+A ++L++ + + +++L+LG+
Sbjct: 286 DALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLLASIQLRLGRV 345
Query: 222 NEAMKHYESLIERNQENTLYYN----KLVEAKQLTNNDDIFQLLTHYISKYPKATVPKRL 277
+EA+ + + ++ + + + T + A +
Sbjct: 346 DEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQLGI 405
Query: 278 S-------------------LNYVSGDQFRTEIDKYLRHG-FHKGV-------------P 304
S L+ G I YLR G F K +
Sbjct: 406 SKLSQGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQPDNA 465
Query: 305 PLFVNLRSLYSNTEKCKIIQDLVESYVEALS-KTGHFSLQDEGEVEPASALLWVYHYLAQ 363
L L ++Y + E++ +ALS + F PA+A L
Sbjct: 466 SLHNLLGAIYLGKGD---LAKAREAFEKALSIEPDFF---------PAAANLARIDIQEG 513
Query: 364 HYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQSLDTADR 423
+ D A+ + P + + +Y G+ EA WL++A L+ +
Sbjct: 514 NPD------DAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEI 567
Query: 424 YINSKCAKYMLRANLIKEAE---ETCSKFTREGVSAMENLNEMQCMWFQTECALA-YQRL 479
A+Y L +K+A + + A L Q A++ +++L
Sbjct: 568 EPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKL 627
Query: 480 -GRWGDT------LKKCHEVDRKCYEHKQYKNGLKFAKQILTNPKYAEHGETLAMKGLTL 532
D+ L + V + Y + K+ L ++ E L
Sbjct: 628 LALQPDSALALLLLADAYAVMKN------YAKAITSLKRALELKP--DNTEAQIGLAQLL 679
Query: 533 NCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALK 584
R E A + + + K+ + + + G L K Y AI+ YR ALK
Sbjct: 680 LAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALK 731
Score = 45.1 bits (107), Expect = 1e-04
Identities = 123/616 (19%), Positives = 215/616 (34%), Gaps = 81/616 (13%)
Query: 3 YEHKQYKNGLKFAKQIL-TNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKS 61
++ K Y++ + + L + P E+ L + G + LG E+AY+Y+ + L+ S
Sbjct: 272 FQKKNYEDARETLQDALKSAP---EYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNS 328
Query: 62 HVCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLEGYRETRYQ 121
H + +Q + DEAI AL + D+ + L + + D E E +
Sbjct: 329 HQARRLLASIQLRLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAK 388
Query: 122 LFMLRPTQRASWIGFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVI 181
L P A+ ++ D A + Q+ + ++LLL S +
Sbjct: 389 ATELDPENAAARTQLGISKLSQGDPSEA---IADLETAAQL--DPELGRADLLLILSYL- 442
Query: 182 QDSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESLIERNQEN-TL 240
SG ++A+ + +++ D ++ GA+ L G +A + +E + +
Sbjct: 443 -RSGQFDKALAAAKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIEPDFFPA 501
Query: 241 YYNKLVEAKQLTNNDDIFQLLTHYISKYPKATVPKRLSLNYVSGDQFRTEIDKYLRHGFH 300
N Q N DD Q ++ PK + L+L + YLR G
Sbjct: 502 AANLARIDIQEGNPDDAIQRFEKVLTIDPK-NLRAILALAGL-----------YLRTG-- 547
Query: 301 KGVPPLFVNLRSLYSNTEKCKIIQDLVESYVEALSKTGHFSLQDEGEVEPASALLWVYHY 360
N E EA++ + + E+EPA AL
Sbjct: 548 ---------------NEE-------------EAVAWLEKAAELNPQEIEPALAL------ 573
Query: 361 LAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQSLDT 420
AQ+Y G KAL +N A D P E ++ GR AGD+ +A + +L
Sbjct: 574 -AQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQP 632
Query: 421 ADRYINSKCAKYMLRANLIKEAEETCSKFTREGVSAMENLNEMQCMWFQTECALAYQRLG 480
A +A + + + + E Q Q A
Sbjct: 633 DSALALLLLADAYAVMKNYAKAITS----LKRALELKPDNTEAQIGLAQLLLA-----AK 683
Query: 481 RWGDTLKKCHEVDRKCYEHKQ----------YKNGLKFAKQILTNPKYAEHGETLAMK-G 529
R K + ++ + + A Q A+K
Sbjct: 684 RTESAKKIAKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPSSQNAIKLH 743
Query: 530 LTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDN 589
L G EA + + L+ V L + K YD+AIK Y+ +K DN
Sbjct: 744 RALLASGNTAEAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDN 803
Query: 590 IQIMRDLSLLQIQMRD 605
++ +L+ L ++++D
Sbjct: 804 AVVLNNLAWLYLELKD 819
Score = 40.1 bits (94), Expect = 0.005
Identities = 98/587 (16%), Positives = 188/587 (32%), Gaps = 94/587 (16%)
Query: 76 KKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLEGYRETRYQLFMLRPTQRASWIG 135
+ + A K Y AL + ++ L+ L + + R ++ P + +
Sbjct: 139 GQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTADPGNVDALLL 198
Query: 136 FAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGDLEEAVKHLD 195
L + ++A L A+RK + + +LL + ++ ++G+ EEA KH D
Sbjct: 199 KGDLLLSLGNIELA---LAAYRKAIAL----RPNNIAVLLALATILIEAGEFEEAEKHAD 251
Query: 196 RFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESLIERNQE--NTLYYNKLVEAKQLTN 253
++ + + + Y +A + + ++ E L E QL N
Sbjct: 252 ALLKKAPNSPLAHYLKALVDFQKKNYEDARETLQDALKSAPEYLPALLLAGASEY-QLGN 310
Query: 254 NDDIFQLLTHYISKYPKATVPKR------LSLNYVSGDQFRTEIDKYLRHGFHKGVPPLF 307
+ +Q L + P + +R L L V D+ +
Sbjct: 311 LEQAYQYLNQILKYAPNSHQARRLLASIQLRLGRV--DEAIATLS--------------- 353
Query: 308 VNLRSLYSNTEKCKIIQDLVESYVEA-LSKTGHFSLQDEGEVEPASA------------- 353
+L + + + L E+Y+ + L E++P +A
Sbjct: 354 ---PALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQLGISKLSQ 410
Query: 354 -------------------LLWVYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVT 394
L L Y G KAL P L
Sbjct: 411 GDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQPDNASLHNL 470
Query: 395 KGRIYKHAGDVLEAYKWLDEAQSLDTADRYINSKCAKYMLRANLIKEAEETCSKFTR--- 451
G IY GD+ +A + ++A S++ + A+ ++ +A + K
Sbjct: 471 LGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDP 530
Query: 452 EGVSAMENLNEMQCMWFQTECALAYQRLGRWGDTLKKCHEVDRKCYE-----------HK 500
+ + A+ L + E A+A+ L+K E++ + E
Sbjct: 531 KNLRAILALAGLYLRTGNEEEAVAW---------LEKAAELNPQEIEPALALAQYYLGKG 581
Query: 501 QYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWH 560
Q K L + + E M G G +A ++ L S +
Sbjct: 582 QLKKALAILNEAADA--APDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALL 639
Query: 561 VYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLE 607
+ K Y +AI + AL+ + DN + L+ L + + E
Sbjct: 640 LLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTE 686
Score = 33.5 bits (77), Expect = 0.45
Identities = 27/115 (23%), Positives = 45/115 (39%), Gaps = 5/115 (4%)
Query: 42 GRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRD 101
G EA + + L+ V L + K YD+AIK Y+ +K DN ++ +
Sbjct: 750 GNTAEAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNN 809
Query: 102 LSLLQIQMRDLE--GYRETRYQLFMLRPTQRASWIGFAMAYHLLHDFDMACNILE 154
L+ L ++++D Y E + L P A + D A +L
Sbjct: 810 LAWLYLELKDPRALEYAERALK---LAPNIPAILDTLGWLLVEKGEADRALPLLR 861
Score = 31.6 bits (72), Expect = 1.6
Identities = 52/240 (21%), Positives = 83/240 (34%), Gaps = 38/240 (15%)
Query: 381 AIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQSLDTADRYINSKCAKYMLRAN--- 437
A+ P E G+IY GD A K L +A SL + A+ L
Sbjct: 48 ALQKDPNDAEARFLLGKIYLALGDYAAAEKELRKALSLGYPKNQVLPLLARAYLLQGKFQ 107
Query: 438 -LIKEAEETCSKFTREGVSAMENLNEMQCMWFQTECALAYQRLGRWGDTLKKCHEVDRKC 496
++ E + EG + + L LAY LG+ E+ +K
Sbjct: 108 QVLDELPGK-TLLDDEGAAELLALR-----------GLAYLGLGQ--------LELAQKS 147
Query: 497 YEHKQYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSH 556
YE + P+ LA L N R +EA + L D +
Sbjct: 148 YEQALAID-----------PRSLYAKLGLAQLALAEN---RFDEARALIDEVLTADPGNV 193
Query: 557 VCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLEGYREGVSAM 616
+ G L S + A+ YR A+ +NI ++ L+ + I+ + E + A+
Sbjct: 194 DALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEAGEFEEAEKHADAL 253
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 41.2 bits (97), Expect = 1e-04
Identities = 22/85 (25%), Positives = 36/85 (42%)
Query: 34 KGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWEH 93
G LG +EA EY + L D + ++ KY+EA++ Y AL+ +
Sbjct: 6 LGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDP 65
Query: 94 DNIQIMRDLSLLQIQMRDLEGYRET 118
DN + +L L ++ E E
Sbjct: 66 DNAKAYYNLGLAYYKLGKYEEALEA 90
Score = 41.2 bits (97), Expect = 1e-04
Identities = 22/84 (26%), Positives = 36/84 (42%)
Query: 528 KGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWEH 587
G LG +EA EY + L D + ++ KY+EA++ Y AL+ +
Sbjct: 6 LGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDP 65
Query: 588 DNIQIMRDLSLLQIQMRDLEGYRE 611
DN + +L L ++ E E
Sbjct: 66 DNAKAYYNLGLAYYKLGKYEEALE 89
Score = 38.1 bits (89), Expect = 0.002
Identities = 18/63 (28%), Positives = 27/63 (42%)
Query: 357 VYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQ 416
Y+ LA Y LG +AL A++ P + + G Y G EA + ++A
Sbjct: 36 AYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKAL 95
Query: 417 SLD 419
LD
Sbjct: 96 ELD 98
Score = 36.6 bits (85), Expect = 0.006
Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 4/91 (4%)
Query: 1 KCYEHKQYKNGLKFAKQILT-NPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDL 59
Y+ Y L++ ++ L +P A+ LG+ EEA E + L D
Sbjct: 9 LYYKLGDYDEALEYYEKALELDPDNAD---AYYNLAAAYYKLGKYEEALEDYEKALELDP 65
Query: 60 KSHVCWHVYGLLQRSDKKYDEAIKCYRNALK 90
+ ++ GL KY+EA++ Y AL+
Sbjct: 66 DNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96
Score = 36.6 bits (85), Expect = 0.006
Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 4/91 (4%)
Query: 495 KCYEHKQYKNGLKFAKQILT-NPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDL 553
Y+ Y L++ ++ L +P A+ LG+ EEA E + L D
Sbjct: 9 LYYKLGDYDEALEYYEKALELDPDNAD---AYYNLAAAYYKLGKYEEALEDYEKALELDP 65
Query: 554 KSHVCWHVYGLLQRSDKKYDEAIKCYRNALK 584
+ ++ GL KY+EA++ Y AL+
Sbjct: 66 DNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96
Score = 35.8 bits (83), Expect = 0.009
Identities = 21/95 (22%), Positives = 32/95 (33%)
Query: 357 VYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQ 416
L Y LGD +AL Y A++ P + + Y G EA + ++A
Sbjct: 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61
Query: 417 SLDTADRYINSKCAKYMLRANLIKEAEETCSKFTR 451
LD + + +EA E K
Sbjct: 62 ELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96
Score = 32.4 bits (74), Expect = 0.16
Identities = 24/93 (25%), Positives = 38/93 (40%), Gaps = 2/93 (2%)
Query: 172 ELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESL 231
E LL + GD +EA+++ ++ E D A KLG+Y EA++ YE
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60
Query: 232 IERNQENTLYYNKLVEAKQLTNNDDIFQLLTHY 264
+E + +N Y L + L Y
Sbjct: 61 LELDPDNAKAYYNL--GLAYYKLGKYEEALEAY 91
Score = 29.7 bits (67), Expect = 1.7
Identities = 24/131 (18%), Positives = 42/131 (32%), Gaps = 41/131 (31%)
Query: 69 GLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLEGYRETRYQLFMLRPT 128
G L YDEA++ Y AL+ L P
Sbjct: 7 GNLYYKLGDYDEALEYYEKALE----------------------------------LDPD 32
Query: 129 QRASWIGFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGDLE 188
++ A AY+ L ++ A LE + K ++ ++ + G E
Sbjct: 33 NADAYYNLAAAYYKLGKYEEA---LEDYEKALELDPDN----AKAYYNLGLAYYKLGKYE 85
Query: 189 EAVKHLDRFKE 199
EA++ ++ E
Sbjct: 86 EALEAYEKALE 96
Score = 28.1 bits (63), Expect = 5.0
Identities = 24/103 (23%), Positives = 44/103 (42%), Gaps = 7/103 (6%)
Query: 133 WIGFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGDLEEAVK 192
+ Y+ L D+D A LE + K ++ ++++ + G EEA++
Sbjct: 3 LLNLGNLYYKLGDYDEA---LEYYEKALELDP----DNADAYYNLAAAYYKLGKYEEALE 55
Query: 193 HLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESLIERN 235
++ E D G KLG+Y EA++ YE +E +
Sbjct: 56 DYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELD 98
>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 43.9 bits (104), Expect = 2e-04
Identities = 23/130 (17%), Positives = 47/130 (36%), Gaps = 7/130 (5%)
Query: 127 PTQRASWIGFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGD 186
A+ G A+ +L +D A +L+ Q Y ++LL
Sbjct: 303 RGGLAAQYGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEA-------NK 355
Query: 187 LEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESLIERNQENTLYYNKLV 246
+EA++ L + + ++ LK G+ EA++ + + E+ ++ L
Sbjct: 356 AKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLA 415
Query: 247 EAKQLTNNDD 256
+A N
Sbjct: 416 QAYAELGNRA 425
Score = 30.1 bits (68), Expect = 4.3
Identities = 27/186 (14%), Positives = 49/186 (26%), Gaps = 11/186 (5%)
Query: 15 AKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGLLQRS 74
A+Q + + + R+ + L
Sbjct: 260 AEQ-SPPYNKLDSPDFQLARARIRAKYEALPNQQAADLLAKRSKRGGLAAQYGRALQTYL 318
Query: 75 DKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRDLEGYRETRYQLFMLRPTQRASWI 134
+YDEA+K + + + DN + + ++ + E + L P +
Sbjct: 319 AGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPNSPLLQL 378
Query: 135 GFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGDLEEAVKHL 194
A A A IL N Y F E ++ Q +L + L
Sbjct: 379 NLAQALLKGGKPQEAIRIL----------NRYLFNDPEDPNGWDLLAQAYAELGNRAEAL 428
Query: 195 DRFKEQ 200
E
Sbjct: 429 LARAEG 434
>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
[Coenzyme metabolism].
Length = 400
Score = 38.1 bits (89), Expect = 0.015
Identities = 32/182 (17%), Positives = 68/182 (37%), Gaps = 11/182 (6%)
Query: 94 DNIQIMRDLSL--LQIQMRDLEGYRETRYQLFMLRPTQRASWIGFAMAYHLLHDFDMACN 151
+ + +L+ L + RD RE QL + P AY L +
Sbjct: 149 GDDTLAVELTRARLLLNRRDYPAARENVDQLLEMTPRHPEVLRLALRAYIRLGAWQALLA 208
Query: 152 ILEAFRKTQQVTNS----YDFEHSELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTV 207
IL RK +++ + + E LL Q+ D E ++ + +
Sbjct: 209 ILPKLRKAGLLSDEEAARLEQQAWEGLLQQAR---DDNGSEGLKTWWKNQPRKLRNDPEL 265
Query: 208 EETYGALKLKLGQYNEAMKHYESLIERNQENTLYYNKLVEAKQLTNNDDIFQLLTHYISK 267
Y ++LG ++EA + E ++R + L +L+ + + + + + ++ +
Sbjct: 266 VVAYAERLIRLGDHDEAQEIIEDALKRQWDPRLC--RLIPRLRPGDPEPLIKAAEKWLKQ 323
Query: 268 YP 269
+P
Sbjct: 324 HP 325
>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
Length = 291
Score = 37.5 bits (85), Expect = 0.018
Identities = 52/235 (22%), Positives = 91/235 (38%), Gaps = 6/235 (2%)
Query: 6 KQYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSH--V 63
+ L+ ++ L ++ L + L L LGR EEA E + + L +L +
Sbjct: 37 GELAEALELLEEALELLPNSDLAGLLLLLALALLKLGRLEEALELLEKALELELLPNLAE 96
Query: 64 CWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSL-LQIQMRDLEGYRETRYQL 122
GLL + KY+EA++ AL + D L+L ++ D E E +
Sbjct: 97 ALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALELYEKA 156
Query: 123 FMLRPTQRASWIGFAMAYHLLHDFDMACNILEAFRKTQQVTNSYDFEHSELLLYQSMVIQ 182
L P L + EA ++ + +E LL ++
Sbjct: 157 LELDPELNEL---AEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYL 213
Query: 183 DSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQYNEAMKHYESLIERNQE 237
G EEA+++ ++ E D L L+LG+Y EA++ E +E + +
Sbjct: 214 KLGKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPD 268
Score = 30.2 bits (66), Expect = 4.3
Identities = 42/219 (19%), Positives = 74/219 (33%), Gaps = 7/219 (3%)
Query: 432 YMLRANLIKEAEETCSKFTREGVSAMENLNEMQCMWFQTECALAYQRLGRWGDTLKKCHE 491
+ + + G++ +E L E+ E AL L
Sbjct: 8 LAILLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALELLPNSDLAGLLLLLAL 67
Query: 492 VDRKCYEHKQYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRN 551
K + + L+ ++ L E L GL L LG+ EEA E + + L
Sbjct: 68 ALLKL---GRLEEALELLEKALELELLPNLAEALLNLGLLLEALGKYEEALELLEKALAL 124
Query: 552 DLKSHVCWHVYGL-LQRSDKKYDEAIKCYRNALK---WEHDNIQIMRDLSLLQIQMRDLE 607
D + + L Y+EA++ Y AL+ ++ + + L L + E
Sbjct: 125 DPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEALLALGALLEALGRYE 184
Query: 608 GYREGVSAMENLNEMQCMWFQTECALAYQRLGRWGDTLK 646
E + LN L Y +LG++ + L+
Sbjct: 185 EALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALE 223
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 33.8 bits (78), Expect = 0.029
Identities = 13/49 (26%), Positives = 22/49 (44%)
Query: 63 VCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRD 111
G YDEAI+ Y AL+ + DN + +L+L +++
Sbjct: 4 EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGK 52
Score = 33.8 bits (78), Expect = 0.029
Identities = 13/49 (26%), Positives = 22/49 (44%)
Query: 557 VCWHVYGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDLSLLQIQMRD 605
G YDEAI+ Y AL+ + DN + +L+L +++
Sbjct: 4 EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGK 52
Score = 33.4 bits (77), Expect = 0.035
Identities = 17/64 (26%), Positives = 25/64 (39%), Gaps = 1/64 (1%)
Query: 357 VYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAG-DVLEAYKWLDEA 415
L LGD +A+ A++ P E + Y G D EA + L++A
Sbjct: 5 ALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKA 64
Query: 416 QSLD 419
LD
Sbjct: 65 LELD 68
Score = 30.4 bits (69), Expect = 0.42
Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 1/63 (1%)
Query: 29 ETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGL-LQRSDKKYDEAIKCYRN 87
E L G L LG +EA E + L D + ++ L + K Y+EA++
Sbjct: 4 EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEK 63
Query: 88 ALK 90
AL+
Sbjct: 64 ALE 66
Score = 30.4 bits (69), Expect = 0.42
Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 1/63 (1%)
Query: 523 ETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHVCWHVYGL-LQRSDKKYDEAIKCYRN 581
E L G L LG +EA E + L D + ++ L + K Y+EA++
Sbjct: 4 EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEK 63
Query: 582 ALK 584
AL+
Sbjct: 64 ALE 66
Score = 28.4 bits (64), Expect = 2.0
Identities = 15/44 (34%), Positives = 21/44 (47%)
Query: 212 GALKLKLGQYNEAMKHYESLIERNQENTLYYNKLVEAKQLTNND 255
G KLG Y+EA++ YE +E + +N Y L A D
Sbjct: 10 GNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKD 53
>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat.
Length = 78
Score = 33.9 bits (78), Expect = 0.032
Identities = 20/75 (26%), Positives = 31/75 (41%), Gaps = 7/75 (9%)
Query: 354 LLWVYHYLAQHYDHLGDTMKALNYINAAI-------DHTPTLIELFVTKGRIYKHAGDVL 406
L + LA LGD +AL + A+ + P R+Y GD
Sbjct: 4 LAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYD 63
Query: 407 EAYKWLDEAQSLDTA 421
EA ++L++A +L A
Sbjct: 64 EALEYLEKALALREA 78
Score = 28.1 bits (63), Expect = 3.8
Identities = 16/72 (22%), Positives = 32/72 (44%), Gaps = 7/72 (9%)
Query: 173 LLLYQSMVIQDSGDLEEAVKHLDR-------FKEQIHDKLTVEETYGALKLKLGQYNEAM 225
L ++V++ GD +EA++ L++ E + L L LG Y+EA+
Sbjct: 7 ALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEAL 66
Query: 226 KHYESLIERNQE 237
++ E + +
Sbjct: 67 EYLEKALALREA 78
Score = 26.9 bits (60), Expect = 8.9
Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 7/68 (10%)
Query: 30 TLAMKGLTLNCLGRKEEAYEYVRRGL-------RNDLKSHVCWHVYGLLQRSDKKYDEAI 82
L L L LG +EA E + + L + ++ + L + YDEA+
Sbjct: 7 ALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEAL 66
Query: 83 KCYRNALK 90
+ AL
Sbjct: 67 EYLEKALA 74
Score = 26.9 bits (60), Expect = 8.9
Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 7/68 (10%)
Query: 524 TLAMKGLTLNCLGRKEEAYEYVRRGL-------RNDLKSHVCWHVYGLLQRSDKKYDEAI 576
L L L LG +EA E + + L + ++ + L + YDEA+
Sbjct: 7 ALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEAL 66
Query: 577 KCYRNALK 584
+ AL
Sbjct: 67 EYLEKALA 74
>gnl|CDD|236624 PRK09782, PRK09782, bacteriophage N4 receptor, outer membrane
subunit; Provisional.
Length = 987
Score = 36.8 bits (85), Expect = 0.052
Identities = 15/62 (24%), Positives = 26/62 (41%), Gaps = 1/62 (1%)
Query: 358 YHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQS 417
Y +L G ALN + +++ P+ +V + IY+ +V A L A
Sbjct: 579 YWWLHAQRYIPGQPELALNDLTRSLNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALE 637
Query: 418 LD 419
L+
Sbjct: 638 LE 639
>gnl|CDD|215244 PLN02445, PLN02445, anthranilate synthase component I.
Length = 523
Score = 35.4 bits (82), Expect = 0.10
Identities = 23/99 (23%), Positives = 47/99 (47%), Gaps = 12/99 (12%)
Query: 168 FEHSELLLYQSMVIQDSG--DLEEA----VKHLDRFKEQIHDKLTVEETYGALKLKLGQY 221
F+H E Y ++ +EEA +K L+ ++ D + + G++KL Q+
Sbjct: 165 FDHVEKKAYVIHWVRLDRYSSVEEAYEDGMKRLEALVSRLQDINPPKLSPGSVKLSTNQF 224
Query: 222 NEAMKHYESLIERNQENTLYYNKLVEAKQLTNNDDIFQL 260
+++ +S + + Y N +++AK+ DIFQ+
Sbjct: 225 GPSLE--KSNMTSEE----YKNAVLQAKEHILAGDIFQI 257
>gnl|CDD|220676 pfam10288, DUF2392, Protein of unknown function (DUF2392). This is
a family of proteins conserved from plants to humans.
The function is not known. It carries a characteristic
GRG sequence motif.
Length = 104
Score = 32.2 bits (74), Expect = 0.18
Identities = 16/51 (31%), Positives = 21/51 (41%), Gaps = 4/51 (7%)
Query: 285 DQFRTEIDKYLRHGFHKGVPPLFVNLRSLY-SNTEKCKIIQDLVESYVEAL 334
D EI+ Y K +PPL +N S K I +L E Y + L
Sbjct: 39 DLLLKEIEAYCSL---KNLPPLLINSLIQIASKLSKNMSINELTEQYFDNL 86
>gnl|CDD|227731 COG5444, COG5444, Uncharacterized conserved protein [Function
unknown].
Length = 565
Score = 34.8 bits (80), Expect = 0.20
Identities = 34/123 (27%), Positives = 48/123 (39%), Gaps = 19/123 (15%)
Query: 309 NLRSLYSNTEKCKI--IQDLVESYVEALSKTGHFSLQDE----GEVEPASALLWVYHYLA 362
N+ S Y + I I+ + S A K+ L+DE G VE A ++ H L
Sbjct: 51 NIISFYQENQLPTIRSIRIFLISMKIAFEKSISAILEDEPLSNGYVEEA----FLKHELP 106
Query: 363 QHYDHLGDTM----KALNYINAAIDHTPTLIEL---FVTK--GRIYKHAGDVLEAYKWLD 413
YD + M KA+ I + + H L +L KHA D LE LD
Sbjct: 107 NGYDRYEEIMEEQKKAIKDILSDVSHILDLGKLDTEDFKDEVDSARKHAKDTLEKLYKLD 166
Query: 414 EAQ 416
+
Sbjct: 167 QEG 169
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 34.2 bits (78), Expect = 0.27
Identities = 43/233 (18%), Positives = 88/233 (37%), Gaps = 21/233 (9%)
Query: 162 VTNSYDFEHSELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTVEETYGALKLKLGQY 221
VT SY + ++ + + GD ++A + D+ + + + L G++
Sbjct: 364 VTQSY--------IKRASMNLELGDPDKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEF 415
Query: 222 NEAMKHYESLIERNQENTLYYNKL-VEAKQLTNNDDIFQLLTHYISKYPKA-TVPKRLSL 279
+A K Y+ I+ + + + +L V + + +P+A V
Sbjct: 416 AQAGKDYQKSIDLDPDFIFSHIQLGVTQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGE 475
Query: 280 NYVSGDQFRTEIDKY-LRHGFHKGVPPLFVNLRSLYSNTEKCKIIQDLVESYVEALSKTG 338
+ ++F I+K+ K P+++N+ L + K + + ++EA
Sbjct: 476 LLLDQNKFDEAIEKFDTAIELEKETKPMYMNVLPLIN---KALALFQWKQDFIEA-ENLC 531
Query: 339 HFSLQDEGEVEPASALLWVYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIEL 391
+L + E + A A +AQ GD +AL A + T EL
Sbjct: 532 EKALIIDPECDIAVAT------MAQLLLQQGDVDEALKLFERAAELARTEGEL 578
>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat.
Length = 34
Score = 29.7 bits (68), Expect = 0.32
Identities = 9/22 (40%), Positives = 12/22 (54%)
Query: 69 GLLQRSDKKYDEAIKCYRNALK 90
G KYDEA++ Y AL+
Sbjct: 8 GNAYLKLGKYDEALEYYEKALE 29
Score = 29.7 bits (68), Expect = 0.32
Identities = 9/22 (40%), Positives = 12/22 (54%)
Query: 563 GLLQRSDKKYDEAIKCYRNALK 584
G KYDEA++ Y AL+
Sbjct: 8 GNAYLKLGKYDEALEYYEKALE 29
Score = 27.0 bits (61), Expect = 3.1
Identities = 13/27 (48%), Positives = 19/27 (70%)
Query: 212 GALKLKLGQYNEAMKHYESLIERNQEN 238
G LKLG+Y+EA+++YE +E N N
Sbjct: 8 GNAYLKLGKYDEALEYYEKALELNPNN 34
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 30.0 bits (68), Expect = 0.53
Identities = 17/59 (28%), Positives = 24/59 (40%)
Query: 361 LAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRIYKHAGDVLEAYKWLDEAQSLD 419
LA+ GD +AL + AA+ P E + G G + EA L A + D
Sbjct: 3 LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAAD 61
>gnl|CDD|198356 cd07778, FGGY_L-RBK_like, L-ribulokinase-like proteins; a subfamily
of the FGGY family of carbohydrate kinases. This
subfamily is composed of a group of putative bacterial
L-ribulokinases (RBK; EC 2.7.1.16) and similar proteins.
L-RBK catalyzes the MgATP-dependent phosphorylation of a
variety of sugar substrates. Members of this subfamily
belong to the FGGY family of carbohydrate kinases, the
monomers of which contain two large domains, which are
separated by a deep cleft that forms the active site.
This model includes both the N-terminal domain, which
adopts a ribonuclease H-like fold, and the structurally
related C-terminal domain.
Length = 466
Score = 32.3 bits (74), Expect = 0.94
Identities = 31/125 (24%), Positives = 49/125 (39%), Gaps = 27/125 (21%)
Query: 105 LQIQMRDLEGYRETRYQLFMLRPTQRASWIGFAMAYHLLHDFDMACNILEAFRKTQQVTN 164
LQ DL + +Y + T + IG + H FD +L FR T + T
Sbjct: 56 LQACYADLANDVKEKYGV---TLTTIGA-IGISAMMHGYLAFDKNGKLLVPFR-TWRNTI 110
Query: 165 SYDF--EHSELL------------LYQSMVIQDSGDLEEAVKHLDRFKE---QIHDKLTV 207
+ + E +EL LYQ+++ + EE VK + +H +LT
Sbjct: 111 TEEAAAELTELFGFNIPQRWSIAHLYQAIL-----NKEEHVKEVAFLTTLAGYVHWQLTG 165
Query: 208 EETYG 212
E+ G
Sbjct: 166 EKVLG 170
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry
includes outlying Tetratricopeptide-like repeats (TPR)
that are not matched by pfam00515.
Length = 34
Score = 27.9 bits (63), Expect = 1.4
Identities = 9/31 (29%), Positives = 16/31 (51%)
Query: 65 WHVYGLLQRSDKKYDEAIKCYRNALKWEHDN 95
+ GL Y+EA++ Y AL+ + +N
Sbjct: 4 LYNLGLAYYKLGDYEEALEAYEKALELDPNN 34
Score = 27.9 bits (63), Expect = 1.4
Identities = 9/31 (29%), Positives = 16/31 (51%)
Query: 559 WHVYGLLQRSDKKYDEAIKCYRNALKWEHDN 589
+ GL Y+EA++ Y AL+ + +N
Sbjct: 4 LYNLGLAYYKLGDYEEALEAYEKALELDPNN 34
>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats. Repeats present in 4
or more copies in proteins. Contain a minimum of 34
amino acids each and self-associate via a "knobs and
holes" mechanism.
Length = 34
Score = 27.8 bits (63), Expect = 1.5
Identities = 8/22 (36%), Positives = 11/22 (50%)
Query: 69 GLLQRSDKKYDEAIKCYRNALK 90
G YDEA++ Y AL+
Sbjct: 8 GNAYLKLGDYDEALEYYEKALE 29
Score = 27.8 bits (63), Expect = 1.5
Identities = 8/22 (36%), Positives = 11/22 (50%)
Query: 563 GLLQRSDKKYDEAIKCYRNALK 584
G YDEA++ Y AL+
Sbjct: 8 GNAYLKLGDYDEALEYYEKALE 29
Score = 27.0 bits (61), Expect = 2.6
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 212 GALKLKLGQYNEAMKHYESLIERNQEN 238
G LKLG Y+EA+++YE +E + N
Sbjct: 8 GNAYLKLGDYDEALEYYEKALELDPNN 34
>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat.
Length = 34
Score = 27.8 bits (63), Expect = 1.6
Identities = 8/31 (25%), Positives = 17/31 (54%)
Query: 65 WHVYGLLQRSDKKYDEAIKCYRNALKWEHDN 95
+++ G + Y+EA + Y AL+ + +N
Sbjct: 4 YYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34
Score = 27.8 bits (63), Expect = 1.6
Identities = 8/31 (25%), Positives = 17/31 (54%)
Query: 559 WHVYGLLQRSDKKYDEAIKCYRNALKWEHDN 589
+++ G + Y+EA + Y AL+ + +N
Sbjct: 4 YYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34
>gnl|CDD|203842 pfam08032, SpoU_sub_bind, RNA 2'-O ribose methyltransferase
substrate binding. This domain is a RNA 2'-O ribose
methyltransferase substrate binding domain.
Length = 74
Score = 28.4 bits (64), Expect = 2.8
Identities = 9/33 (27%), Positives = 18/33 (54%)
Query: 378 INAAIDHTPTLIELFVTKGRIYKHAGDVLEAYK 410
+ A+ P + +L+VT+GR ++L+ K
Sbjct: 8 VEEALKSGPEIEKLYVTEGRRDPRLKELLKLAK 40
>gnl|CDD|223260 COG0182, COG0182, Predicted translation initiation factor 2B
subunit, eIF-2B alpha/beta/delta family [Translation,
ribosomal structure and biogenesis].
Length = 346
Score = 30.7 bits (70), Expect = 3.0
Identities = 17/99 (17%), Positives = 32/99 (32%), Gaps = 23/99 (23%)
Query: 180 VIQDSGDLEEAVKHLDRFKEQIHD--------KLTVEETYGALK-------LKLGQYNEA 224
++ EE ++ L++ E + ++ A K K EA
Sbjct: 66 ESKNDSKGEEFIEALEKAAETLKSTRPTAVNLFWALDRMLNAAKEAIEVKEPKESILQEA 125
Query: 225 MKHYESLIERNQENTLYYNKLVEAKQLTNNDDIFQLLTH 263
+ E +E N+ +L+ D +LTH
Sbjct: 126 EEIAEEDLEANRAIGENGAELLP--------DGDTVLTH 156
>gnl|CDD|215450 PLN02837, PLN02837, threonine-tRNA ligase.
Length = 614
Score = 30.6 bits (69), Expect = 3.7
Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 8/59 (13%)
Query: 151 NILEAFRKTQQVTNSYDFEHSELLLYQSMVIQDSGDLEEAVKHLDRFKEQIHDKLTVEE 209
+I+E K + YD LLY V + DL + HLD +KE ++D++ +E+
Sbjct: 251 HIIEDSWKKMHFEHGYD------LLYTPHVAK--ADLWKTSGHLDFYKENMYDQMDIED 301
>gnl|CDD|100115 cd05804, StaR_like, StaR_like; a well-conserved protein found in
bacteria, plants, and animals. A family member from
Streptomyces toyocaensis, StaR is part of a gene cluster
involved in the biosynthesis of glycopeptide antibiotics
(GPAs), specifically A47934. It has been speculated that
StaR could be a flavoprotein hydroxylating a tyrosine
sidechain. Some family members have been annotated as
proteins containing tetratricopeptide (TPR) repeats,
which may at least indicate mostly alpha-helical
secondary structure.
Length = 355
Score = 29.9 bits (68), Expect = 4.5
Identities = 31/127 (24%), Positives = 49/127 (38%), Gaps = 33/127 (25%)
Query: 7 QYKNGLKFAKQILT-NPK--YAEHGETLA----MKGLTLNCLGRKEEAYEYVRRGLR--- 56
QY + A++ L NP +A H +A M+G R +E ++
Sbjct: 129 QYDRAEEAARRALELNPDDAWAVH--AVAHVLEMQG-------RFKEGIAFMESWRDTWD 179
Query: 57 --NDLKSHVCWHV-YGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDL----SLLQIQM 109
+ L+ H WH+ L+R D Y+ A+ Y + DL SLL
Sbjct: 180 CSSMLRGHNWWHLALFYLERGD--YEAALAIYDTHI-APSAESDPALDLLDAASLL---W 233
Query: 110 R-DLEGY 115
R +L G+
Sbjct: 234 RLELAGH 240
Score = 29.9 bits (68), Expect = 4.5
Identities = 31/127 (24%), Positives = 49/127 (38%), Gaps = 33/127 (25%)
Query: 501 QYKNGLKFAKQILT-NPK--YAEHGETLA----MKGLTLNCLGRKEEAYEYVRRGLR--- 550
QY + A++ L NP +A H +A M+G R +E ++
Sbjct: 129 QYDRAEEAARRALELNPDDAWAVH--AVAHVLEMQG-------RFKEGIAFMESWRDTWD 179
Query: 551 --NDLKSHVCWHV-YGLLQRSDKKYDEAIKCYRNALKWEHDNIQIMRDL----SLLQIQM 603
+ L+ H WH+ L+R D Y+ A+ Y + DL SLL
Sbjct: 180 CSSMLRGHNWWHLALFYLERGD--YEAALAIYDTHI-APSAESDPALDLLDAASLL---W 233
Query: 604 R-DLEGY 609
R +L G+
Sbjct: 234 RLELAGH 240
>gnl|CDD|237283 PRK13106, ubiA, prenyltransferase; Reviewed.
Length = 300
Score = 29.7 bits (67), Expect = 4.7
Identities = 10/32 (31%), Positives = 13/32 (40%)
Query: 671 RKMTLRSYVRLLRLEDVLRSHPFYYTAARCAI 702
+ +R LR+E S P Y A AI
Sbjct: 11 TRSKFYIILRFLRIEQTFFSLPMAYLGAFVAI 42
>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat.
Length = 44
Score = 26.7 bits (59), Expect = 5.0
Identities = 11/44 (25%), Positives = 17/44 (38%)
Query: 355 LWVYHYLAQHYDHLGDTMKALNYINAAIDHTPTLIELFVTKGRI 398
LA+ LGD +AL + A+ P E + R+
Sbjct: 1 PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44
>gnl|CDD|221837 pfam12895, Apc3, Anaphase-promoting complex, cyclosome, subunit
3. Apc3, otherwise known as Cdc27, is one of the
subunits of the anaphase-promoting complex or
cyclosome. The anaphase-promoting complex is a
multiprotein subunit E3 ubiquitin ligase complex that
controls segregation of chromosomes and exit from
mitosis in eukaryotes. The protein members of this
family contain TPR repeats just as those of Apc7 do,
and it appears that these TPR units bind the C-termini
of the APC co-activators CDH1 and CDC20.
Length = 80
Score = 27.6 bits (62), Expect = 6.0
Identities = 21/85 (24%), Positives = 34/85 (40%), Gaps = 5/85 (5%)
Query: 4 EHKQYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHV 63
+ Y+N + A+++L E LA G+ + AYE +R+ + S
Sbjct: 1 DQGNYENAIFLAEKLLALTPSNEDAYLLA---QCYFLQGQYKRAYELLRKLKLD--NSSG 55
Query: 64 CWHVYGLLQRSDKKYDEAIKCYRNA 88
C ++ KYDEAI A
Sbjct: 56 CRYLLAQCLLKLGKYDEAIAVLEKA 80
Score = 27.6 bits (62), Expect = 6.0
Identities = 21/85 (24%), Positives = 34/85 (40%), Gaps = 5/85 (5%)
Query: 498 EHKQYKNGLKFAKQILTNPKYAEHGETLAMKGLTLNCLGRKEEAYEYVRRGLRNDLKSHV 557
+ Y+N + A+++L E LA G+ + AYE +R+ + S
Sbjct: 1 DQGNYENAIFLAEKLLALTPSNEDAYLLA---QCYFLQGQYKRAYELLRKLKLD--NSSG 55
Query: 558 CWHVYGLLQRSDKKYDEAIKCYRNA 582
C ++ KYDEAI A
Sbjct: 56 CRYLLAQCLLKLGKYDEAIAVLEKA 80
>gnl|CDD|151696 pfam11254, DUF3053, Protein of unknown function (DUF3053). Some
members in this family of proteins are annotated as the
membrane protein YiaF. No function is currently known.
Length = 229
Score = 28.9 bits (65), Expect = 7.1
Identities = 18/99 (18%), Positives = 39/99 (39%), Gaps = 19/99 (19%)
Query: 125 LRPTQRASWIGFAMAYHLLHDFD------MACNILEAFRKTQQVTNSYDFEHSELLLYQS 178
L Q+ ++ +A Y +L DF + ++L A + + ++ D
Sbjct: 49 LTADQKKAFGPYADDYAILTDFSQQMDQAVDGSLLPALDQIGSIRSAQD----------- 97
Query: 179 MVIQDSGDLEEAVKHLDRFKEQIHD-KLTVEETYGALKL 216
+ DL ++ L+ +Q+ + K + + ALK
Sbjct: 98 -YLTKRDDLRQSRGGLNLLGQQLQNAKAQADAAHSALKQ 135
>gnl|CDD|147082 pfam04740, LXG, LXG domain of WXG superfamily. This domain is
present is the N-terminal region of a group of
polymorphic toxin proteins in bacteria. It is predicted
to use Type VII secretion pathway to mediate export of
bacterial toxins.
Length = 204
Score = 28.8 bits (65), Expect = 8.5
Identities = 21/124 (16%), Positives = 43/124 (34%), Gaps = 19/124 (15%)
Query: 309 NLRSLYSNTEK--CKIIQDLVESYVEALSKTGHFSLQDEGEVEPAS-ALL---WVYHYLA 362
+++ Y + I + ++ Y+E L K F VE +S + ++ H L
Sbjct: 50 AIKAFYQDLHLPVLDSILEFIDEYIEFLEKIKSFLSD----VESSSNGYIDEEFLEHELK 105
Query: 363 QHYDHLGDTMKAL-NYINAAIDHTPTLIELFVTKGRIY--------KHAGDVLEAYKWLD 413
+ + L + IN+ + ++ L + K D LE D
Sbjct: 106 NGLKKAEEKTEELKDAINSILASISDIVHLPKFSTSAFQEQLDKAKKKRNDTLEKLYEFD 165
Query: 414 EAQS 417
+ +
Sbjct: 166 QELT 169
>gnl|CDD|218497 pfam05205, COMPASS-Shg1, COMPASS (Complex proteins associated with
Set1p) component shg1. The Shg1 subunit is one of the
eight subunits of the COMPASS complex, complex
associated with SET1, conserved in yeasts and in other
eukaryotes up to humans. It is associated with the
region of the Set1 protein that is N-terminal to the
C-terminus, ie Set1-560-900. The function of Shg1 seems
to be to slightly inhibit histone 3 lysine 4 (H3K4) di-
and tri-methylation, and it is a pioneer protein. The
COMPASS complex functions to methylate the fourth lysine
of Histone 3 and for silencing of genes close to the
telomeres of chromosomes.
Length = 106
Score = 27.6 bits (62), Expect = 9.2
Identities = 10/45 (22%), Positives = 22/45 (48%), Gaps = 1/45 (2%)
Query: 193 HLDRFKEQIHDKLTVEETYGALKLKLGQYNEAM-KHYESLIERNQ 236
H D+ ++ I ++ ET L L++ + E + L+ +N+
Sbjct: 12 HFDKLRKDILEEFEKSETEQNLTLRVQELVETEVEKDPQLLSKNR 56
>gnl|CDD|221398 pfam12056, DUF3537, Protein of unknown function (DUF3537). This
family of transmembrane proteins are functionally
uncharacterized. This protein is found in eukaryotes.
Proteins in this family are typically between 427 to 453
amino acids in length.
Length = 398
Score = 28.9 bits (65), Expect = 9.6
Identities = 10/45 (22%), Positives = 19/45 (42%)
Query: 463 MQCMWFQTECALAYQRLGRWGDTLKKCHEVDRKCYEHKQYKNGLK 507
+ C+ F+ C L RL + + +V+ EH + + L
Sbjct: 176 LVCVLFRLICHLQILRLDDFAKLFEGESDVESVLKEHLRIRRQLS 220
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.135 0.414
Gapped
Lambda K H
0.267 0.0748 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,562,733
Number of extensions: 3599180
Number of successful extensions: 3422
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3376
Number of HSP's successfully gapped: 101
Length of query: 724
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 620
Effective length of database: 6,324,786
Effective search space: 3921367320
Effective search space used: 3921367320
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.0 bits)