RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2915
(883 letters)
>gnl|CDD|216598 pfam01602, Adaptin_N, Adaptin N terminal region. This family
consists of the N terminal region of various alpha, beta
and gamma subunits of the AP-1, AP-2 and AP-3 adaptor
protein complexes. The adaptor protein (AP) complexes
are involved in the formation of clathrin-coated pits
and vesicles. The N-terminal region of the various
adaptor proteins (APs) is constant by comparison to the
C-terminal which is variable within members of the AP-2
family; and it has been proposed that this constant
region interacts with another uniform component of the
coated vesicles.
Length = 522
Score = 375 bits (965), Expect = e-120
Identities = 180/558 (32%), Positives = 271/558 (48%), Gaps = 119/558 (21%)
Query: 11 KKGEIFELKGELNS--DKKEKKREAVKKVIASMTVGKDVSALFPDVVNCMQTDNLELKKL 68
+K EL LNS D KK+EAVKK+I + +G+D+S LF +VV + +++ LK+L
Sbjct: 1 RKRIQQELARILNSFRDDPRKKKEAVKKLIYLIMLGEDISFLFFEVVKLVASNDFTLKRL 60
Query: 69 VYLYLMNYAKSHPDMAIMAFSTFFYQKSSSSFQCMIDIRSIPLFDLCSSVGVFVKDCEDS 128
YLYL A+ PD+AI+ ++ KD +
Sbjct: 61 GYLYLKLLAEESPDLAILVTNS------------------------------IKKDLQSP 90
Query: 129 NPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQLVED 188
NPLIR LA+RT+ CIRV ++ L ++K L D DPYVRK AA+ + KLY + LV D
Sbjct: 91 NPLIRGLALRTLSCIRVPELARDLAPDIKKLLVDRDPYVRKKAALAILKLYRKDPDLVRD 150
Query: 189 QGFLDQLKDLLSDSNPMVVANAVAAI--LLLPRKSYWQRNLSSRKKQIC--------WNL 238
+ +LK+LLSD +P VV+ AVA + + + Y + L +++C W
Sbjct: 151 F-LVPELKELLSDKDPGVVSAAVALLYEIRKNDRLYLNKLLPLLVRRLCNLLTVCNPWLQ 209
Query: 239 PYLMNLSVIY----PAWPLSTINPHTPLLK-----VLMK----LMEMLPGEGDFVSTLTK 285
++ L Y P P + LL+ VL + ++ + P L
Sbjct: 210 VKILRLLTRYAPQDPREPKELLEDILNLLQNSNNAVLYEAVKTIIHLDP-----EPELIV 264
Query: 286 KLAPPLVTLLSSEPE-VQYVALRNINLIVQKRPDILKH-EMKVFFVKYNDPIYVKLEKLD 343
L LLSS E ++YVALRN+N I++K P ++H ++ +F +K +D I ++L LD
Sbjct: 265 LAVNALGRLLSSPDENLRYVALRNLNKILEKHPPAVQHLDLIIFCLKTDDDISIRLRALD 324
Query: 344 IMIRLASQANIAQVLSELKEYATEV-DVDFVRKAVRAIGRCAIKVEQSAERCVSTLLDLI 402
++ +L ++N+ +++ EL +Y +E+ D +F K V+AIGR A K AE C+ LL+L+
Sbjct: 325 LLYKLVDESNVKEIVKELLKYVSEIADPEFKIKLVKAIGRLAEKFPTDAEWCIDVLLELL 384
Query: 403 QTK----------------------------------------------------YAERI 410
Y E I
Sbjct: 385 SLAGSYVVDEIVEVIRDIIRKYPELREYILEHLCELLEDIESPEARAAALWILGEYGELI 444
Query: 411 -DNADELLESFLEGFHDENTQVQLQLLTAIVKLFLKRPTDT--QELVQQVLSLATQDSDN 467
++ +LL S LE F E+ +V+L LLTA+VKL L P + +VQ VLSLATQDS +
Sbjct: 445 PNSPSDLLRSILEVFVLESLKVRLALLTALVKLSLTFPDEEVQNLIVQVVLSLATQDSSD 504
Query: 468 PDLRDRGFIYWRLLSTGN 485
+LRDR Y RLLS +
Sbjct: 505 LELRDRAVEYLRLLSLAD 522
Score = 366 bits (941), Expect = e-117
Identities = 149/440 (33%), Positives = 231/440 (52%), Gaps = 64/440 (14%)
Query: 497 VIASMTVGKDVSALFPDVVNCMQTDNLELKKLVYLYLMNYAKSHPDMAIMAVNTFVKDCE 556
+I + +G+D+S LF +VV + +++ LK+L YLYL A+ PD+AI+ N+ KD +
Sbjct: 29 LIYLIMLGEDISFLFFEVVKLVASNDFTLKRLGYLYLKLLAEESPDLAILVTNSIKKDLQ 88
Query: 557 DSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQLV 616
NPLIR LA+RT+ CIRV ++ L ++K L D DPYVRK AA+ + KLY + LV
Sbjct: 89 SPNPLIRGLALRTLSCIRVPELARDLAPDIKKLLVDRDPYVRKKAALAILKLYRKDPDLV 148
Query: 617 EDQGFLDQLKDLLSDSNPMVVANAVAALSEMNEASTSGVALIEMNAQTINKLLTALNECT 676
D + +LK+LLSD +P VV+ AVA L E+ + L ++ + +L L C
Sbjct: 149 RDF-LVPELKELLSDKDPGVVSAAVALLYEIRKNDRL--YLNKLLPLLVRRLCNLLTVCN 205
Query: 677 EWGQVFILDSLSNYSPKDDREAQSICERITPRLAHANAAVVLSAVKVLMKLMEMLPGEGD 736
W QV IL L+ Y+P+D RE + + E I L ++N AV+ AVK ++ L D
Sbjct: 206 PWLQVKILRLLTRYAPQDPREPKELLEDILNLLQNSNNAVLYEAVKTIIHL--------D 257
Query: 737 FVSTLTKKLAPPLVTLLSSEPE-VQYVALRNINLIVQKRPDILKH-EMKVFFVKYNDPIY 794
L L LLSS E ++YVALRN+N I++K P ++H ++ +F +K +D I
Sbjct: 258 PEPELIVLAVNALGRLLSSPDENLRYVALRNLNKILEKHPPAVQHLDLIIFCLKTDDDIS 317
Query: 795 VKLEKLDIMIRLASQANIAQV-----NYV--------VQEAIVVIKDIFRKYPNKYET-- 839
++L LD++ +L ++N+ ++ YV + + I + K+P E
Sbjct: 318 IRLRALDLLYKLVDESNVKEIVKELLKYVSEIADPEFKIKLVKAIGRLAEKFPTDAEWCI 377
Query: 840 -----------------------------------IISTLCENLDTLDEPEARASMIWII 864
I+ LCE L+ ++ PEARA+ +WI+
Sbjct: 378 DVLLELLSLAGSYVVDEIVEVIRDIIRKYPELREYILEHLCELLEDIESPEARAAALWIL 437
Query: 865 GEYAERI-DNADELLESFLE 883
GEY E I ++ +LL S LE
Sbjct: 438 GEYGELIPNSPSDLLRSILE 457
>gnl|CDD|240415 PTZ00429, PTZ00429, beta-adaptin; Provisional.
Length = 746
Score = 338 bits (869), Expect = e-104
Identities = 181/559 (32%), Positives = 287/559 (51%), Gaps = 108/559 (19%)
Query: 2 TDSKYFTTTKKGEIFELKGELNSDKKEKKREAVKKVIASMTVGKDVSALFPDVVNCMQTD 61
T SKYF T++GE EL+ +LN +K+ AVK++IA+MT+G+DVS LF DVV +
Sbjct: 21 TGSKYFAQTRRGEGAELQNDLNGTDSYRKKAAVKRIIANMTMGRDVSYLFVDVVKLAPST 80
Query: 62 NLELKKLVYLYLMNYAKSHPDMAIMAFSTFFYQKSSSSFQCMIDIRSIPLFDLCSSVGVF 121
+LELKKLVYLY+++ A+ P+ A++A +TF
Sbjct: 81 DLELKKLVYLYVLSTARLQPEKALLAVNTF------------------------------ 110
Query: 122 VKDCEDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDI 181
++D +S+P++RALAVRTM CIRV + EY EPLR+ + D DPYVRKTAA+ + KL+
Sbjct: 111 LQDTTNSSPVVRALAVRTMMCIRVSSVLEYTLEPLRRAVADPDPYVRKTAAMGLGKLFHD 170
Query: 182 NAQLVEDQGFLDQLKDLLSDSNPMVVANAVAAIL---------LLPRKSYWQRNLSSRKK 232
+ QL Q F L +LL+D+NP+V +NA A + + + R + +
Sbjct: 171 DMQLFYQQDFKKDLVELLNDNNPVVASNAAAIVCEVNDYGSEKIESSNEWVNRLVYHLPE 230
Query: 233 QICWNLPYLMNL-------------SVIYPAWP-LSTINPHTPL--LKVLMKLMEMLPGE 276
W Y++ L +++ P +S NP + +KV+ L E
Sbjct: 231 CNEWGQLYILELLAAQRPSDKESAETLLTRVLPRMSHQNPAVVMGAIKVVANLASRCSQE 290
Query: 277 GDFVSTLTKKLAPPLVTLLSSEPEVQYVALRNINLIVQKRPDILKHEMKVFFVKYNDPIY 336
+ T ++ L+TL + E QY+ +NI+ ++ P++L+ + F+V+Y+DP +
Sbjct: 291 --LIERCTVRVNTALLTLSRRDAETQYIVCKNIHALLVIFPNLLRTNLDSFYVRYSDPPF 348
Query: 337 VKLEKLDIMIRLASQANIAQVLSELKEYATEVDVDFVRKAVRAIGRCAIKVEQSAERCVS 396
VKLEKL ++++L + + ++L EL EYA+ VD+ FV + VRAI AIKV+ A C +
Sbjct: 349 VKLEKLRLLLKLVTPSVAPEILKELAEYASGVDMVFVVEVVRAIASLAIKVDSVAPDCAN 408
Query: 397 TLLDLIQTK--------------------------------------------------- 405
LL ++ +
Sbjct: 409 LLLQIVDRRPELLPQVVTAAKDIVRKYPELLMLDTLVTDYGADEVVEEEAKVSLLWMLGE 468
Query: 406 YAERIDNADELLESFLEGFHDENTQVQLQLLTAIVKLFLKRPTDTQELVQQVLSLATQDS 465
Y + I+N ++++ F++ + +VQL +L+A VK+FL+ P + + +VL T S
Sbjct: 469 YCDFIENGKDIIQRFIDTIMEHEQRVQLAILSAAVKMFLRDPQGMEPQLNRVLETVTTHS 528
Query: 466 DNPDLRDRGFIYWRLLSTG 484
D+PD+RDR F YWRLLS G
Sbjct: 529 DDPDVRDRAFAYWRLLSKG 547
Score = 329 bits (846), Expect = e-100
Identities = 155/437 (35%), Positives = 255/437 (58%), Gaps = 57/437 (13%)
Query: 497 VIASMTVGKDVSALFPDVVNCMQTDNLELKKLVYLYLMNYAKSHPDMAIMAVNTFVKDCE 556
+IA+MT+G+DVS LF DVV + +LELKKLVYLY+++ A+ P+ A++AVNTF++D
Sbjct: 56 IIANMTMGRDVSYLFVDVVKLAPSTDLELKKLVYLYVLSTARLQPEKALLAVNTFLQDTT 115
Query: 557 DSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQLV 616
+S+P++RALAVRTM CIRV + EY EPLR+ + D DPYVRKTAA+ + KL+ + QL
Sbjct: 116 NSSPVVRALAVRTMMCIRVSSVLEYTLEPLRRAVADPDPYVRKTAAMGLGKLFHDDMQLF 175
Query: 617 EDQGFLDQLKDLLSDSNPMVVANAVAALSEMNEASTSGVALIEMNAQTINKLLTALNECT 676
Q F L +LL+D+NP+V +NA A + E+N+ + IE + + +N+L+ L EC
Sbjct: 176 YQQDFKKDLVELLNDNNPVVASNAAAIVCEVNDYGSEK---IESSNEWVNRLVYHLPECN 232
Query: 677 EWGQVFILDSLSNYSPKDDREAQSICERITPRLAHANAAVVLSAVKVLMKLMEMLPGEGD 736
EWGQ++IL+ L+ P D A+++ R+ PR++H N AVV+ A+KV+ L E
Sbjct: 233 EWGQLYILELLAAQRPSDKESAETLLTRVLPRMSHQNPAVVMGAIKVVANLASRCSQE-- 290
Query: 737 FVSTLTKKLAPPLVTLLSSEPEVQYVALRNINLIVQKRPDILKHEMKVFFVKYNDPIYVK 796
+ T ++ L+TL + E QY+ +NI+ ++ P++L+ + F+V+Y+DP +VK
Sbjct: 291 LIERCTVRVNTALLTLSRRDAETQYIVCKNIHALLVIFPNLLRTNLDSFYVRYSDPPFVK 350
Query: 797 LEKLDIMIRLASQANIAQ--------------------------------------VNYV 818
LEKL ++++L + + + N +
Sbjct: 351 LEKLRLLLKLVTPSVAPEILKELAEYASGVDMVFVVEVVRAIASLAIKVDSVAPDCANLL 410
Query: 819 VQ----------EAIVVIKDIFRKYPNK--YETIISTLCENLDTLDEPEARASMIWIIGE 866
+Q + + KDI RKYP +T+++ D + E EA+ S++W++GE
Sbjct: 411 LQIVDRRPELLPQVVTAAKDIVRKYPELLMLDTLVTDY--GADEVVEEEAKVSLLWMLGE 468
Query: 867 YAERIDNADELLESFLE 883
Y + I+N ++++ F++
Sbjct: 469 YCDFIENGKDIIQRFID 485
>gnl|CDD|227427 COG5096, COG5096, Vesicle coat complex, various subunits
[Intracellular trafficking and secretion].
Length = 757
Score = 300 bits (770), Expect = 1e-89
Identities = 150/390 (38%), Positives = 230/390 (58%), Gaps = 31/390 (7%)
Query: 495 RLVIASMTVGKDVSALFPDVVNCMQTDNLELKKLVYLYLMNYAKSHPDMAIMAVNTFVKD 554
+ +IA M++G+D+S+LFPDV+ + T ++ELK+L+YLYL YAK P++A++AVNT KD
Sbjct: 41 KKIIAQMSLGEDMSSLFPDVIKNVATRDVELKRLLYLYLERYAKLKPELALLAVNTIQKD 100
Query: 555 CEDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQ 614
+D N IR A+RT+ +RV ++ + +P++K L D YVRKTAA+ VAKLY ++
Sbjct: 101 LQDPNEEIRGFALRTLSLLRVKELLGNIIDPIKKLLTDPHAYVRKTAALAVAKLYRLDKD 160
Query: 615 LVEDQGFLDQLKDLLSDSNPMVVANAVAALSEMNEASTSGVALIEMNAQTINKLLTALNE 674
L + G +D LK+L++DS+P+V+ANA+A+L+E++ G +E+ + L +L+
Sbjct: 161 LYHELGLIDILKELVADSDPIVIANALASLAEIDPELAHGY-SLEVILRIPQLDLLSLSV 219
Query: 675 CTEWGQVFILDSLSNYSPKDDREAQSICERITPRLAHANAAVVLSAVKVLMKLMEMLPGE 734
TEW + IL+ L+ P A+ ER++P L H NA V+L AVKV+++L+ LP
Sbjct: 220 STEWLLLIILEVLTERVPTTPDSAEDFEERLSPPLQHNNAEVLLIAVKVILRLLVFLP-- 277
Query: 735 GDFVSTLTKKLAPPLVTLLSS-EPEVQYVALRNINLIVQKRPDILKHEMKVFFVKYNDPI 793
+ L +PPLVTLL+ E +QYV RNI + ++ +L K+F ++YND I
Sbjct: 278 ---SNNLFLISSPPLVTLLAKPESLIQYVLRRNIQIDLEVCSKLLDKVKKLFLIEYNDDI 334
Query: 794 YVKLEKLDIMIRLASQANIAQ-----VNYV---------VQEAIVVIKDIFRKYPNKYET 839
Y+KLEKLD + RLA N++Q + Y+ V EAI + D+ K +
Sbjct: 335 YIKLEKLDQLTRLADDQNLSQILLELIYYIAENHIDAEMVSEAIKALGDLASKAESSVND 394
Query: 840 IISTLCENLDTLDEPEARASMIWIIGEYAE 869
IS L E L +WI G Y
Sbjct: 395 CISELLELL----------EGVWIRGSYIV 414
Score = 270 bits (693), Expect = 1e-78
Identities = 143/413 (34%), Positives = 223/413 (53%), Gaps = 65/413 (15%)
Query: 19 KGELNSDKKEKKREAVKKVIASMTVGKDVSALFPDVVNCMQTDNLELKKLVYLYLMNYAK 78
G L S KK +A+KK+IA M++G+D+S+LFPDV+ + T ++ELK+L+YLYL YAK
Sbjct: 25 SGRLESSNDYKKIDAMKKIIAQMSLGEDMSSLFPDVIKNVATRDVELKRLLYLYLERYAK 84
Query: 79 SHPDMAIMAFSTFFYQKSSSSFQCMIDIRSIPLFDLCSSVGVFVKDCEDSNPLIRALAVR 138
P++A++A +T KD +D N IR A+R
Sbjct: 85 LKPELALLAVNT------------------------------IQKDLQDPNEEIRGFALR 114
Query: 139 TMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQLVEDQGFLDQLKDL 198
T+ +RV ++ + +P++K L D YVRKTAA+ VAKLY ++ L + G +D LK+L
Sbjct: 115 TLSLLRVKELLGNIIDPIKKLLTDPHAYVRKTAALAVAKLYRLDKDLYHELGLIDILKEL 174
Query: 199 LSDSNPMVVANAVAAILLLPRKSY--WQRNLSSRKKQI---CWNLPYLMNLSVIY----- 248
++DS+P+V+ANA+A++ + + + + R Q+ ++ L +I
Sbjct: 175 VADSDPIVIANALASLAEIDPELAHGYSLEVILRIPQLDLLSLSVSTEWLLLIILEVLTE 234
Query: 249 --PAWPLSTI-------------NPHTPL--LKVLMKLMEMLPGEGDFVSTLTKKLAPPL 291
P P S N L +KV+++L+ LP + L +PPL
Sbjct: 235 RVPTTPDSAEDFEERLSPPLQHNNAEVLLIAVKVILRLLVFLP-----SNNLFLISSPPL 289
Query: 292 VTLLSS-EPEVQYVALRNINLIVQKRPDILKHEMKVFFVKYNDPIYVKLEKLDIMIRLAS 350
VTLL+ E +QYV RNI + ++ +L K+F ++YND IY+KLEKLD + RLA
Sbjct: 290 VTLLAKPESLIQYVLRRNIQIDLEVCSKLLDKVKKLFLIEYNDDIYIKLEKLDQLTRLAD 349
Query: 351 QANIAQVLSELKEYATE--VDVDFVRKAVRAIGRCAIKVEQSAERCVSTLLDL 401
N++Q+L EL Y E +D + V +A++A+G A K E S C+S LL+L
Sbjct: 350 DQNLSQILLELIYYIAENHIDAEMVSEAIKALGDLASKAESSVNDCISELLEL 402
Score = 63.6 bits (155), Expect = 3e-10
Identities = 29/76 (38%), Positives = 35/76 (46%), Gaps = 4/76 (5%)
Query: 415 ELLESFLEGFHDENTQVQLQLLTAIVKLFLKRPTDTQ----ELVQQVLSLATQDSDNPDL 470
ELL + F DE +VQ +L + VKL + EL Q VL PDL
Sbjct: 490 ELLRIAISNFVDETLEVQYTILMSSVKLIANSIRKAKQCNSELDQDVLRRCFDYVLVPDL 549
Query: 471 RDRGFIYWRLLSTGNT 486
RDR +Y RLLST
Sbjct: 550 RDRARMYSRLLSTPLP 565
Score = 36.6 bits (85), Expect = 0.070
Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 19/90 (21%)
Query: 800 LDIMIRLASQANIAQVNYVVQEA-----IVVIK---DIFRKYPNKYETIIST-LCENLDT 850
+ ++ L I +Y+VQE I VI+ + R PN+Y I+ L +T
Sbjct: 396 ISELLELLEGVWIRG-SYIVQEVRIVDCISVIRISVLVLRILPNEYPKILLRGLYALEET 454
Query: 851 LD----EPEARASM-----IWIIGEYAERI 871
L+ EP A++ W++GE+++ I
Sbjct: 455 LELQSREPRAKSVTDKYLGAWLLGEFSDII 484
>gnl|CDD|221731 pfam12717, Cnd1, non-SMC mitotic condensation complex subunit 1.
The three non-SMC (structural maintenance of
chromosomes) subunits of the mitotic condensation
complex are Cnd1-3. The whole complex is essential for
viability and the condensing of chromosomes in mitosis.
Length = 171
Score = 148 bits (375), Expect = 5e-41
Identities = 60/183 (32%), Positives = 95/183 (51%), Gaps = 24/183 (13%)
Query: 559 NPLIRALAVRTMG--CIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQLV 616
+PLIR AV +G CIR + E L CL+DEDPYVRKTA + + L + V
Sbjct: 1 DPLIRNNAVIALGDLCIRYPNLVEPYTPNLYACLRDEDPYVRKTALLVLTHLILNDMVKV 60
Query: 617 EDQGFLDQLKDLLSDSNPMVVANAVAALSEMNEASTSGVALIEMNAQTINKLLTALNECT 676
+ Q FL+ LK L D +P + A A + SE+ + + + + ++++ LN CT
Sbjct: 61 KGQLFLEMLK-CLVDEDPEIRALAKSFFSELLKKNPN------LIYNLFPEIISVLNSCT 113
Query: 677 EWGQV----------FILDSLSNYSPKDDREAQSICERITPRLAHANAAVVLSAVKVLMK 726
E GQV F+L+ + D++ +S+ E++ R AN+A VLS + ++
Sbjct: 114 EHGQVSEEKRKKIYKFLLEFI-----TKDKQKESLVEKLCQRFLAANSARVLSDILFILS 168
Query: 727 LME 729
L+E
Sbjct: 169 LLE 171
Score = 100 bits (251), Expect = 2e-24
Identities = 38/95 (40%), Positives = 52/95 (54%), Gaps = 3/95 (3%)
Query: 129 NPLIRALAVRTMG--CIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQLV 186
+PLIR AV +G CIR + E L CL+DEDPYVRKTA + + L + V
Sbjct: 1 DPLIRNNAVIALGDLCIRYPNLVEPYTPNLYACLRDEDPYVRKTALLVLTHLILNDMVKV 60
Query: 187 EDQGFLDQLKDLLSDSNPMVVANAVAAILLLPRKS 221
+ Q FL+ LK L D +P + A A + L +K+
Sbjct: 61 KGQLFLEMLK-CLVDEDPEIRALAKSFFSELLKKN 94
>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats. This family includes multiple
HEAT repeats.
Length = 88
Score = 53.9 bits (130), Expect = 4e-09
Identities = 31/91 (34%), Positives = 44/91 (48%), Gaps = 13/91 (14%)
Query: 556 EDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQL 615
D +P +RA A R +G + + L + LKD DP VR+ AA + KL D A
Sbjct: 10 SDPDPEVRAAAARALGELGDPEALP----ALLELLKDPDPEVRRAAAEALGKLGDPEA-- 63
Query: 616 VEDQGFLDQLKDLL-SDSNPMVVANAVAALS 645
L L +LL D + +V A A +AL+
Sbjct: 64 ------LPALLELLQDDDDAVVRAAAASALA 88
Score = 53.1 bits (128), Expect = 7e-09
Identities = 31/90 (34%), Positives = 42/90 (46%), Gaps = 12/90 (13%)
Query: 126 EDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQL 185
D +P +RA A R +G + + L + LKD DP VR+ AA + KL D A
Sbjct: 10 SDPDPEVRAAAARALGELGDPEALP----ALLELLKDPDPEVRRAAAEALGKLGDPEA-- 63
Query: 186 VEDQGFLDQLKDLLSDSNPMVVANAVAAIL 215
L L +LL D + VV A A+ L
Sbjct: 64 ------LPALLELLQDDDDAVVRAAAASAL 87
Score = 41.6 bits (98), Expect = 9e-05
Identities = 22/69 (31%), Positives = 30/69 (43%), Gaps = 9/69 (13%)
Query: 584 EPLRKCL-KDEDPYVRKTAAVCVAKLYDINAQLVEDQGFLDQLKDLLSDSNPMVVANAVA 642
E L + L D DP VR AA + +L D L L +LL D +P V A
Sbjct: 2 EALLEALLSDPDPEVRAAAARALGEL--------GDPEALPALLELLKDPDPEVRRAAAE 53
Query: 643 ALSEMNEAS 651
AL ++ +
Sbjct: 54 ALGKLGDPE 62
Score = 36.9 bits (86), Expect = 0.004
Identities = 21/62 (33%), Positives = 27/62 (43%), Gaps = 9/62 (14%)
Query: 154 EPLRKCL-KDEDPYVRKTAAVCVAKLYDINAQLVEDQGFLDQLKDLLSDSNPMVVANAVA 212
E L + L D DP VR AA + +L D L L +LL D +P V A
Sbjct: 2 EALLEALLSDPDPEVRAAAARALGEL--------GDPEALPALLELLKDPDPEVRRAAAE 53
Query: 213 AI 214
A+
Sbjct: 54 AL 55
Score = 32.3 bits (74), Expect = 0.15
Identities = 13/57 (22%), Positives = 26/57 (45%), Gaps = 4/57 (7%)
Query: 547 AVNTFVKDCEDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAV 603
A+ ++ +D +P +R A +G + + L + L+D+D V + AA
Sbjct: 32 ALPALLELLKDPDPEVRRAAAEALGKLGDPEA----LPALLELLQDDDDAVVRAAAA 84
Score = 32.3 bits (74), Expect = 0.17
Identities = 12/49 (24%), Positives = 22/49 (44%), Gaps = 4/49 (8%)
Query: 125 CEDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAV 173
+D +P +R A +G + + L + L+D+D V + AA
Sbjct: 40 LKDPDPEVRRAAAEALGKLGDPEA----LPALLELLQDDDDAVVRAAAA 84
>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
40 amino acid long tandemly repeated sequence motif
first identified in the Drosophila segment polarity gene
armadillo; these repeats were also found in the
mammalian armadillo homolog beta-catenin, the junctional
plaque protein plakoglobin, the adenomatous polyposis
coli (APC) tumor suppressor protein, and a number of
other proteins. ARM has been implicated in mediating
protein-protein interactions, but no common features
among the target proteins recognized by the ARM repeats
have been identified; related to the HEAT domain; three
consecutive copies of the repeat are represented by this
alignment model.
Length = 120
Score = 47.3 bits (113), Expect = 2e-06
Identities = 29/106 (27%), Positives = 41/106 (38%), Gaps = 19/106 (17%)
Query: 148 ITEYLCEPLRKCLKDEDPYVRKTAAVCVAKL---YDINAQLVEDQGFLDQLKDLLSDSNP 204
I L L D V++ AA ++ L + N Q V + G L L LL +
Sbjct: 4 IQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDE 63
Query: 205 MVVANAVAAILLLPRKSYWQRNLS---SRKKQICWN---LPYLMNL 244
VV A+ A+ RNL+ K I +P L+NL
Sbjct: 64 EVVKAALWAL----------RNLAAGPEDNKLIVLEAGGVPKLVNL 99
Score = 47.3 bits (113), Expect = 2e-06
Identities = 21/73 (28%), Positives = 30/73 (41%), Gaps = 3/73 (4%)
Query: 578 ITEYLCEPLRKCLKDEDPYVRKTAAVCVAKL---YDINAQLVEDQGFLDQLKDLLSDSNP 634
I L L D V++ AA ++ L + N Q V + G L L LL +
Sbjct: 4 IQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDE 63
Query: 635 MVVANAVAALSEM 647
VV A+ AL +
Sbjct: 64 EVVKAALWALRNL 76
Score = 43.1 bits (102), Expect = 6e-05
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 3/69 (4%)
Query: 584 EPLRKCLKDEDPYVRKTAAVCV---AKLYDINAQLVEDQGFLDQLKDLLSDSNPMVVANA 640
L + LK ED V K A + A + N +V + G + +L +LL SN + NA
Sbjct: 52 PALVQLLKSEDEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNA 111
Query: 641 VAALSEMNE 649
ALS +
Sbjct: 112 TGALSNLAS 120
>gnl|CDD|224331 COG1413, COG1413, FOG: HEAT repeat [Energy production and
conversion].
Length = 335
Score = 45.7 bits (108), Expect = 7e-05
Identities = 30/111 (27%), Positives = 45/111 (40%), Gaps = 13/111 (11%)
Query: 112 FDLCSSVGVFVKDCEDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLK-DEDPYVRKT 170
+V + + D +P +R A +G + PL + L+ DE+ VR
Sbjct: 70 LGSEEAVPLLRELLSDEDPRVRDAAADALGEL----GDPEAVPPLVELLENDENEGVRAA 125
Query: 171 AAVCVAKLYDINAQLVEDQGFLDQLKDLLSDSNPMVVANAVAAILLLPRKS 221
AA + KL D A LD L + L D + A A+ A LL R +
Sbjct: 126 AARALGKLGDERA--------LDPLLEALQDEDSGSAAAALDAALLDVRAA 168
Score = 35.7 bits (82), Expect = 0.10
Identities = 41/180 (22%), Positives = 66/180 (36%), Gaps = 21/180 (11%)
Query: 547 AVNTFVKDCEDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVA 606
A + +K ED + L+R A +G + +E LR+ L DEDP VR AA +
Sbjct: 44 AADELLKLLEDEDLLVRLSAAVALGELG----SEEAVPLLRELLSDEDPRVRDAAADALG 99
Query: 607 KLYDINAQLVEDQGFLDQLKDLLS-DSNPMVVANAVAALSEMNEASTSGVALIEMNAQTI 665
+L D + L +LL D N V A A AL ++ + + L+E
Sbjct: 100 EL--------GDPEAVPPLVELLENDENEGVRAAAARALGKLGDE-RALDPLLEALQDED 150
Query: 666 NKLLTALNECTEWGQVF-ILDSLSNYSPKDDREAQSICERITPRLAHANAAVVLSAVKVL 724
+ A + ++L + + L +A V +A L
Sbjct: 151 SGSAAAALDAALLDVRAAAAEALGELGDPEAIPL------LIELLEDEDADVRRAAASAL 204
Score = 31.0 bits (70), Expect = 3.0
Identities = 30/93 (32%), Positives = 41/93 (44%), Gaps = 13/93 (13%)
Query: 126 EDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQL 185
ED + L+R A +G + +E LR+ L DEDP VR AA + +L
Sbjct: 53 EDEDLLVRLSAAVALGELG----SEEAVPLLRELLSDEDPRVRDAAADALGEL------- 101
Query: 186 VEDQGFLDQLKDLLS-DSNPMVVANAVAAILLL 217
D + L +LL D N V A A A+ L
Sbjct: 102 -GDPEAVPPLVELLENDENEGVRAAAARALGKL 133
>gnl|CDD|227565 COG5240, SEC21, Vesicle coat complex COPI, gamma subunit
[Intracellular trafficking and secretion].
Length = 898
Score = 37.3 bits (86), Expect = 0.039
Identities = 64/319 (20%), Positives = 116/319 (36%), Gaps = 55/319 (17%)
Query: 479 RLLSTGNTFYILLHLTRLVIASMTVGKDVSALFPDVVNCMQTDNLELKKLVYLYLMNYAK 538
LLSTG F + LF ++ Q +L L++ VY + +K
Sbjct: 50 YLLSTGELFP---------------EATATNLFFAILKLFQHKDLYLRQCVYSAIKELSK 94
Query: 539 SHPDMAIMAVNTFVKDCEDSNP-LIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYV 597
D +M ++ +KD P ++ +A+R++ + +D T Y E
Sbjct: 95 LTED-VLMGTSSIMKDLNGGVPDDVKPMAIRSLFSV-IDGETVYDFERYLNQAFVSTSMA 152
Query: 598 RKTAAVCVA-KLYDINAQLVEDQGFLDQLKDLLSDSNPMVVANAVAALSEMNEASTSGVA 656
R++AA+ VA L N +Q K L+++ V+ N+ G
Sbjct: 153 RRSAALVVAYHLLPNN---------FNQTKRWLNETQEAVLDLK----QFPNQHGNEG-- 197
Query: 657 LIEMNAQTINK-----LLTALNECTEWGQVFILDSLSNYSPKDDREAQSICERITPRLAH 711
E N I++ LL + Q+ +++ + ++ A + R T L
Sbjct: 198 -YEPNGNPISQYHALGLLYQSKRTDKMAQLKLVEHFRGNASMKNQLAGVLLVRATVELLK 256
Query: 712 ANAAVVLSAVKVLMKLM----EMLPGEG--DFVSTLTKKLAPPLV--------TLLSSEP 757
N+ +L L + EM+ E + + + V T L S
Sbjct: 257 ENSQALLQLRPFLNSWLSDKFEMVFLEAARAVCALSEENVGSQFVDQTVSSLRTFLKSTR 316
Query: 758 EV-QYVALRNINLIVQKRP 775
V ++ A+R +N + K P
Sbjct: 317 VVLRFSAMRILNQLAMKYP 335
>gnl|CDD|214547 smart00185, ARM, Armadillo/beta-catenin-like repeats. Approx. 40
amino acid repeat. Tandem repeats form superhelix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. Involved in transducing
the Wingless/Wnt signal. In plakoglobin arm repeats bind
alpha-catenin and N-cadherin.
Length = 41
Score = 32.4 bits (75), Expect = 0.044
Identities = 15/36 (41%), Positives = 18/36 (50%)
Query: 612 NAQLVEDQGFLDQLKDLLSDSNPMVVANAVAALSEM 647
N Q V D G L L +LL + VV A ALS +
Sbjct: 4 NKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNL 39
Score = 29.3 bits (67), Expect = 0.54
Identities = 17/48 (35%), Positives = 20/48 (41%), Gaps = 10/48 (20%)
Query: 182 NAQLVEDQGFLDQLKDLLSDSNPMVVANAVAAILLLPRKSYWQRNLSS 229
N Q V D G L L +LL + VV A A+ NLSS
Sbjct: 4 NKQAVVDAGGLPALVELLKSEDEEVVKEAAWAL----------SNLSS 41
>gnl|CDD|202500 pfam02985, HEAT, HEAT repeat. The HEAT repeat family is related to
armadillo/beta-catenin-like repeats (see pfam00514).
Length = 31
Score = 30.6 bits (70), Expect = 0.16
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 152 LCEPLRKCLKDEDPYVRKTAAVCVAKLYDI 181
L L + L D DP VR+ AA + L ++
Sbjct: 1 LLPLLLELLNDPDPEVREAAAEALGALAEV 30
Score = 30.6 bits (70), Expect = 0.16
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 582 LCEPLRKCLKDEDPYVRKTAAVCVAKLYDI 611
L L + L D DP VR+ AA + L ++
Sbjct: 1 LLPLLLELLNDPDPEVREAAAEALGALAEV 30
Score = 27.1 bits (61), Expect = 3.0
Identities = 10/27 (37%), Positives = 14/27 (51%)
Query: 621 FLDQLKDLLSDSNPMVVANAVAALSEM 647
L L +LL+D +P V A AL +
Sbjct: 1 LLPLLLELLNDPDPEVREAAAEALGAL 27
>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat. The HEAT repeat family is
related to armadillo/beta-catenin-like repeats (see
pfam00514). These EZ repeats are found in subunits of
cyanobacterial phycocyanin lyase and other proteins and
probably carry out a scaffolding role.
Length = 55
Score = 31.3 bits (71), Expect = 0.18
Identities = 15/55 (27%), Positives = 22/55 (40%), Gaps = 2/55 (3%)
Query: 595 PYVRKTAAVCVAKLYDINAQLVED--QGFLDQLKDLLSDSNPMVVANAVAALSEM 647
VR+ AA+ + L +L+ L L LL D + V A AL +
Sbjct: 1 WEVREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55
Score = 29.0 bits (65), Expect = 1.2
Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 10/57 (17%)
Query: 130 PLIRALAVRTMGCIRVDKITEYLCEP--------LRKCLKDEDPYVRKTAAVCVAKL 178
+R A +G + L P L LKD+D VR+ AA + ++
Sbjct: 1 WEVREAAALALG--ALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55
Score = 29.0 bits (65), Expect = 1.2
Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 10/57 (17%)
Query: 560 PLIRALAVRTMGCIRVDKITEYLCEP--------LRKCLKDEDPYVRKTAAVCVAKL 608
+R A +G + L P L LKD+D VR+ AA + ++
Sbjct: 1 WEVREAAALALG--ALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55
Score = 27.8 bits (62), Expect = 2.9
Identities = 14/52 (26%), Positives = 21/52 (40%), Gaps = 2/52 (3%)
Query: 165 PYVRKTAAVCVAKLYDINAQLVED--QGFLDQLKDLLSDSNPMVVANAVAAI 214
VR+ AA+ + L +L+ L L LL D + V A A+
Sbjct: 1 WEVREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWAL 52
>gnl|CDD|201276 pfam00514, Arm, Armadillo/beta-catenin-like repeat. Approx. 40
amino acid repeat. Tandem repeats form super-helix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. CAUTION: This family does
not contain all known armadillo repeats.
Length = 41
Score = 30.5 bits (70), Expect = 0.21
Identities = 13/36 (36%), Positives = 17/36 (47%)
Query: 612 NAQLVEDQGFLDQLKDLLSDSNPMVVANAVAALSEM 647
N Q V + G + L LLS + V A ALS +
Sbjct: 4 NKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNL 39
Score = 28.2 bits (64), Expect = 1.4
Identities = 13/48 (27%), Positives = 19/48 (39%), Gaps = 10/48 (20%)
Query: 182 NAQLVEDQGFLDQLKDLLSDSNPMVVANAVAAILLLPRKSYWQRNLSS 229
N Q V + G + L LLS + V A A+ NL++
Sbjct: 4 NKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWAL----------SNLAA 41
>gnl|CDD|227429 COG5098, COG5098, Chromosome condensation complex Condensin, subunit
D2 [Chromatin structure and dynamics / Cell division and
chromosome partitioning].
Length = 1128
Score = 35.0 bits (80), Expect = 0.23
Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 12/81 (14%)
Query: 129 NPLIRALAVRTMGCIRV------DKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDIN 182
P IRA AV +G V D+ T YL L DED VR+T + + L I
Sbjct: 947 IPRIRANAVVGLGDFLVCFNTTADEHTHYLYRRL----GDEDADVRRTCLMTIHFL--IL 1000
Query: 183 AQLVEDQGFLDQLKDLLSDSN 203
A ++ +G L ++ LL+D +
Sbjct: 1001 AGQLKVKGQLGKMALLLTDED 1021
Score = 35.0 bits (80), Expect = 0.23
Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 12/81 (14%)
Query: 559 NPLIRALAVRTMGCIRV------DKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDIN 612
P IRA AV +G V D+ T YL L DED VR+T + + L I
Sbjct: 947 IPRIRANAVVGLGDFLVCFNTTADEHTHYLYRRL----GDEDADVRRTCLMTIHFL--IL 1000
Query: 613 AQLVEDQGFLDQLKDLLSDSN 633
A ++ +G L ++ LL+D +
Sbjct: 1001 AGQLKVKGQLGKMALLLTDED 1021
>gnl|CDD|233164 TIGR00877, purD, phosphoribosylamine--glycine ligase. Alternate
name: glycinamide ribonucleotide synthetase (GARS). This
enzyme appears as a monofunctional protein in
prokaryotes but as part of a larger, multidomain protein
in eukaryotes [Purines, pyrimidines, nucleosides, and
nucleotides, Purine ribonucleotide biosynthesis].
Length = 422
Score = 33.4 bits (77), Expect = 0.57
Identities = 20/101 (19%), Positives = 34/101 (33%), Gaps = 16/101 (15%)
Query: 324 MKVFFVKYNDPIYVKLEKLDIMIRLASQANIAQVLSELKEYATEVDVDFVRKAVRAIGRC 383
K F +Y I A E K Y E V KA
Sbjct: 108 AKDFMKRYG-------------IPTAEYEVFTDP-EEAKSYIQEKGAPIVVKADGLAAGK 153
Query: 384 AIKVEQSAERCVSTLLDLIQTKYAERIDNADELLESFLEGF 424
+ V ++ E + + D+++ K+ + ++E FL+G
Sbjct: 154 GVIVAKTNEEAIKAVEDILEQKFGD--AGERVVIEEFLDGE 192
>gnl|CDD|221409 pfam12074, DUF3554, Domain of unknown function (DUF3554). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 287 to 356 amino acids in length. This domain is
found associated with pfam02985.
Length = 329
Score = 32.4 bits (74), Expect = 0.91
Identities = 42/240 (17%), Positives = 88/240 (36%), Gaps = 46/240 (19%)
Query: 586 LRKCLKDEDPYVRKTAAVCVAK-LYDINAQLVEDQGFLDQLKDLLSD--SNPM------- 635
++K L ++ P +RK + + + L++ +A L + FL +L +L + SNP+
Sbjct: 66 IKKGLAEKKPPLRKIWLLSLGEALWNNSASLPLIEDFLPKLLKILKEAASNPLPAAQNGT 125
Query: 636 -VVANAVAALSEMNEASTSGVALIEMNA-------------------QTINKLLTAL--- 672
A + +L+ + ++ + + L AL
Sbjct: 126 LSGAYVLLSLTPSLSKKLIKFWSLVLDPKPSFLLSPKFYSKLSDEDLCWVLRALEALFSG 185
Query: 673 ------NECTEWGQVFILDSLS-NYSPKDDREAQSICERITPRLAHANAAVVLSAVKVLM 725
+ WGQ +I LS + + + A + +++ A ++SA+ +
Sbjct: 186 HPSELAEDKIAWGQAWIYVLLSPSLPWEVRKRALQLLKKLLSSNPGVLANSIISALWNWL 245
Query: 726 KLMEMLPGEGDFVSTLTKKLAPPLVTLLSSEPEVQYVALRN------INLIVQKRPDILK 779
+ E+ P E ST K L P L + E + ++L+V ++
Sbjct: 246 EAHELGPKESSIESTSLKFLTPVFSALSLALSEKDSADISIGLEKLLVSLLVLAHHPLIP 305
>gnl|CDD|222866 PHA02544, 44, clamp loader, small subunit; Provisional.
Length = 316
Score = 32.7 bits (75), Expect = 0.91
Identities = 11/36 (30%), Positives = 16/36 (44%)
Query: 832 KYPNKYETIISTLCENLDTLDEPEARASMIWIIGEY 867
Y N Y + + L + L P + +I IIGE
Sbjct: 253 NYANDYASFVGKLYDELYPQVTPPSIIRLIEIIGEN 288
>gnl|CDD|214846 smart00836, DALR_1, DALR anticodon binding domain. This all alpha
helical domain is the anticodon binding domain of
Arginyl tRNA synthetase. This domain is known as the
DALR domain after characteristic conserved amino acids.
Length = 122
Score = 30.2 bits (69), Expect = 1.5
Identities = 20/61 (32%), Positives = 28/61 (45%), Gaps = 1/61 (1%)
Query: 819 VQEAIVVIKDIFRKYPNKYETIISTLCENLDTLDEPEARASMIWIIGEYAERIDNADELL 878
VQ A I I RK ET+ +L L EPE A ++ + + E ++ A E L
Sbjct: 1 VQYAHARICSILRKAGEAGETLPDIADADLSLLTEPEEWA-LLLKLARFPEVLEAAAEQL 59
Query: 879 E 879
E
Sbjct: 60 E 60
>gnl|CDD|224141 COG1220, HslU, ATP-dependent protease HslVU (ClpYQ), ATPase subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 444
Score = 31.9 bits (73), Expect = 1.9
Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 11/66 (16%)
Query: 334 PIYVKLEKL--DIMIRLASQANIAQVLSELKEY----ATE-VDVDFVRKAVRAIGRCAIK 386
PI V+L+ L + R+ ++ S +K+Y TE V+++F A++ I A +
Sbjct: 328 PIRVELDALTKEDFERILTEPK----ASLIKQYKALLKTEGVELEFTDDAIKRIAEIAYQ 383
Query: 387 VEQSAE 392
V + E
Sbjct: 384 VNEKTE 389
>gnl|CDD|218811 pfam05918, API5, Apoptosis inhibitory protein 5 (API5). This
family consists of apoptosis inhibitory protein 5 (API5)
sequences from several organisms. Apoptosis or
programmed cell death is a physiological form of cell
death that occurs in embryonic development and organ
formation. It is characterized by biochemical and
morphological changes such as DNA fragmentation and cell
volume shrinkage. API5 is an anti apoptosis gene located
in human chromosome 11, whose expression prevents the
programmed cell death that occurs upon the deprivation
of growth factors.
Length = 543
Score = 31.6 bits (71), Expect = 2.2
Identities = 19/83 (22%), Positives = 33/83 (39%), Gaps = 2/83 (2%)
Query: 518 MQTDNLELKKLVYLYLMNYAKSHPDMAIMAVNTFVKDCEDSNPLIRALAVRTMG--CIRV 575
+ K+L + Y K P +A A + C+D + +R A+R + C
Sbjct: 31 ASKTTSKEKQLASQLIPRYFKFFPSLATEAFDAQFDLCDDDDTGVRVQAIRGLPLFCKDT 90
Query: 576 DKITEYLCEPLRKCLKDEDPYVR 598
T + + L + L E+P R
Sbjct: 91 PDATSKIGDVLVQLLNTEEPVER 113
>gnl|CDD|181879 PRK09464, pdhR, transcriptional regulator PdhR; Reviewed.
Length = 254
Score = 31.1 bits (71), Expect = 2.2
Identities = 14/30 (46%), Positives = 18/30 (60%), Gaps = 1/30 (3%)
Query: 275 GEGDFVST-LTKKLAPPLVTLLSSEPEVQY 303
G G FV + L + + PLV LLS PE Q+
Sbjct: 69 GGGTFVQSSLWQSFSDPLVELLSDHPESQF 98
Score = 31.1 bits (71), Expect = 2.2
Identities = 14/30 (46%), Positives = 18/30 (60%), Gaps = 1/30 (3%)
Query: 733 GEGDFVST-LTKKLAPPLVTLLSSEPEVQY 761
G G FV + L + + PLV LLS PE Q+
Sbjct: 69 GGGTFVQSSLWQSFSDPLVELLSDHPESQF 98
>gnl|CDD|131323 TIGR02270, TIGR02270, conserved hypothetical protein. Members are
found in Myxococcus xanthus (six members), Geobacter
sulfurreducens, and Pseudomonas aeruginosa; a short
protein homologous to the N-terminal region is found in
Mesorhizobium loti. All sequence are from
Proteobacteria. The function is unknown [Hypothetical
proteins, Conserved].
Length = 410
Score = 31.4 bits (71), Expect = 2.3
Identities = 22/101 (21%), Positives = 36/101 (35%), Gaps = 16/101 (15%)
Query: 122 VKDC-EDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTA-AVCVAKLY 179
V + + A +G + + +L EPL L +P R A A +
Sbjct: 91 VLAVLQAGPEGLCAGIQAALGWLGGRQAEPWL-EPL---LAASEPPGRAIGLAALGAHRH 146
Query: 180 DINAQLVEDQGFLDQLKDLLSDSNPMVVANAVAAILLLPRK 220
D L L+ + +V A A+ A+ LPR+
Sbjct: 147 DPGPALEA----------ALTHEDALVRAAALRALGELPRR 177
>gnl|CDD|237512 PRK13800, PRK13800, putative oxidoreductase/HEAT repeat-containing
protein; Provisional.
Length = 897
Score = 31.7 bits (72), Expect = 2.4
Identities = 31/104 (29%), Positives = 38/104 (36%), Gaps = 22/104 (21%)
Query: 556 EDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQL 615
E +P R LAV + V ++ YL +P DP VR+TA A L
Sbjct: 606 EPPSP--RILAVLALDAPSVAELAPYLADP--------DPGVRRTAV----------AVL 645
Query: 616 VED--QGFLDQLKDLLSDSNPMVVANAVAALSEMNEASTSGVAL 657
E GF L L D V A L E+ E AL
Sbjct: 646 TETTPPGFGPALVAALGDGAAAVRRAAAEGLRELVEVLPPAPAL 689
Score = 31.4 bits (71), Expect = 2.7
Identities = 22/89 (24%), Positives = 35/89 (39%), Gaps = 11/89 (12%)
Query: 557 DSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQLV 616
D +P +R AV + + L L D VR+ AA + +L ++ L
Sbjct: 632 DPDPGVRRTAVAVL----TETTPPGFGPALVAALGDGAAAVRRAAAEGLRELVEV---LP 684
Query: 617 EDQGFLDQLKDLLSDSNPMVVANAVAALS 645
L+D L +P+V A A+ L
Sbjct: 685 PAPA----LRDHLGSPDPVVRAAALDVLR 709
Score = 30.6 bits (69), Expect = 4.2
Identities = 21/85 (24%), Positives = 34/85 (40%), Gaps = 11/85 (12%)
Query: 127 DSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDINAQLV 186
D +P +R AV + + L L D VR+ AA + +L ++ L
Sbjct: 632 DPDPGVRRTAVAVL----TETTPPGFGPALVAALGDGAAAVRRAAAEGLRELVEV---LP 684
Query: 187 EDQGFLDQLKDLLSDSNPMVVANAV 211
L+D L +P+V A A+
Sbjct: 685 PAPA----LRDHLGSPDPVVRAAAL 705
>gnl|CDD|222605 pfam14223, UBN2, gag-polypeptide of LTR copia-type. This family is
found in Plants and fungi, and contains
LTR-polyproteins, or retrotransposons of the copia-type.
Length = 119
Score = 29.5 bits (67), Expect = 2.7
Identities = 13/33 (39%), Positives = 21/33 (63%), Gaps = 4/33 (12%)
Query: 825 VIKDIFRKYPNKYETIISTLCENLD----TLDE 853
V++ I R P KYE I++++ E+ D TL+E
Sbjct: 69 VVEKILRSLPEKYEQIVTSIEESKDLSTLTLEE 101
>gnl|CDD|236394 PRK09169, PRK09169, hypothetical protein; Validated.
Length = 2316
Score = 31.2 bits (71), Expect = 3.4
Identities = 35/154 (22%), Positives = 53/154 (34%), Gaps = 41/154 (26%)
Query: 540 HPDMAIM--AVNTFVKDCEDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYV 597
PD A + D L+++L + + L L K DE
Sbjct: 217 WPDSPRCRNAAERLAERLADEPGLLQSLRAQEVA---------LLLNALSKWPDDEAC-- 265
Query: 598 RKTAAVCVAKLYDINAQLVEDQGFLDQLKDLLSDSNPMVVANAVAALSEMNEASTSGVA- 656
+ AA +A A+L + G L +P VANA+ ALS+ + A
Sbjct: 266 -RQAAEALA------ARLAREPG-------LRLALDPQGVANALNALSKWPDTEACRQAA 311
Query: 657 -------------LIEMNAQTINKLLTALNECTE 677
L MNAQ + L AL++ +
Sbjct: 312 EALAERLAQERGLLQAMNAQAVANALNALSKWPD 345
Score = 29.7 bits (67), Expect = 8.9
Identities = 22/88 (25%), Positives = 34/88 (38%), Gaps = 27/88 (30%)
Query: 599 KTAAVCVAKLYDINAQLVEDQGFLDQLKDLLSDSNPMVVANAVAALSEM--NEASTSGVA 656
+ AA +A +L ++ + LL N VANA+ ALS+ EA +
Sbjct: 308 RQAAEALA------ERLAQE-------RGLLQAMNAQAVANALNALSKWPDEEACRAAAE 354
Query: 657 LI------------EMNAQTINKLLTAL 672
+ +NAQ + L AL
Sbjct: 355 ALAARLARDAGLRRALNAQELANALNAL 382
>gnl|CDD|218281 pfam04826, Arm_2, Armadillo-like. This domain contains
armadillo-like repeats. Proteins containing this domain
interact with numerous other proteins, through these
interactions they are involved in a wide variety of
processes including carcinogenesis, control of cellular
ageing and survival, regulation of circadian rhythm and
lysosomal sorting of G protein-coupled receptors.
Length = 254
Score = 29.7 bits (67), Expect = 6.4
Identities = 24/96 (25%), Positives = 43/96 (44%), Gaps = 7/96 (7%)
Query: 147 KITEYLCEP-LRKCL----KDEDPYVRKTA--AVCVAKLYDINAQLVEDQGFLDQLKDLL 199
K + L L+K L EDP++ + A + + Y N ++ D G + + +LL
Sbjct: 4 KSDDVLEPHDLQKLLALLESTEDPFIHEIALITLGNSAAYSFNQDIIRDLGGIPIIANLL 63
Query: 200 SDSNPMVVANAVAAILLLPRKSYWQRNLSSRKKQIC 235
S+ NP + A+ A+ L Q+ + Q+C
Sbjct: 64 SNPNPEIKEKALNALNNLSVNVENQKKIKVYVNQVC 99
Score = 29.3 bits (66), Expect = 9.0
Identities = 21/82 (25%), Positives = 39/82 (47%), Gaps = 7/82 (8%)
Query: 577 KITEYLCEP-LRKCL----KDEDPYVRKTA--AVCVAKLYDINAQLVEDQGFLDQLKDLL 629
K + L L+K L EDP++ + A + + Y N ++ D G + + +LL
Sbjct: 4 KSDDVLEPHDLQKLLALLESTEDPFIHEIALITLGNSAAYSFNQDIIRDLGGIPIIANLL 63
Query: 630 SDSNPMVVANAVAALSEMNEAS 651
S+ NP + A+ AL+ ++
Sbjct: 64 SNPNPEIKEKALNALNNLSVNV 85
>gnl|CDD|165867 PLN02223, PLN02223, phosphoinositide phospholipase C.
Length = 537
Score = 30.0 bits (67), Expect = 6.5
Identities = 16/53 (30%), Positives = 31/53 (58%), Gaps = 14/53 (26%)
Query: 79 SHPDMAIMAFSTFFYQKSSSSFQC---------MID-IRSIPLFD----LCSS 117
++PD+A+++F + Y+ S++ C +I+ IR++PL+D CSS
Sbjct: 475 TYPDLALISFEVYDYEVSTADAFCGQTCLPVSELIEGIRAVPLYDERGKACSS 527
>gnl|CDD|107372 cd06377, PBP1_iGluR_NMDA_NR3, N-terminal
leucine/isoleucine/valine-binding protein (LIVBP)-like
domain of the NR3 subunit of NMDA receptor family.
N-terminal leucine/isoleucine/valine-binding protein
(LIVBP)-like domain of the NR3 subunit of NMDA receptor
family. The ionotropic N-methyl-d-asparate (NMDA)
subtype of glutamate receptor serves critical functions
in neuronal development, functioning, and degeneration
in the mammalian central nervous system. The functional
NMDA receptor is a heterotetramer composed of two NR1
and two NR2 (A, B, C, and D) or of NR3 (A and B)
subunits. The receptor controls a cation channel that is
highly permeable to monovalent ions and calcium and
exhibits voltage-dependent inhibition by magnesium. Dual
agonists, glutamate and glycine, are required for
efficient activation of the NMDA receptor. Among NMDA
receptor subtypes, the NR2B subunit containing receptors
appear particularly important for pain perception; thus
NR2B-selective antagonists may be useful in the
treatment of chronic pain.
Length = 382
Score = 29.8 bits (67), Expect = 6.8
Identities = 11/31 (35%), Positives = 15/31 (48%)
Query: 32 EAVKKVIASMTVGKDVSALFPDVVNCMQTDN 62
E V + + S T+ + AL P VNCM
Sbjct: 277 ELVARAVGSATLVQPELALIPATVNCMDLPT 307
>gnl|CDD|237822 PRK14819, PRK14819, NADH dehydrogenase subunit B; Provisional.
Length = 264
Score = 29.6 bits (66), Expect = 7.1
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 282 TLTKKLAPPLVTLLSSEPEVQYV 304
T+TKK+AP +V L + PE +YV
Sbjct: 80 TVTKKMAPQVVRLYNQMPEPRYV 102
Score = 29.6 bits (66), Expect = 7.1
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 740 TLTKKLAPPLVTLLSSEPEVQYV 762
T+TKK+AP +V L + PE +YV
Sbjct: 80 TVTKKMAPQVVRLYNQMPEPRYV 102
>gnl|CDD|171504 PRK12445, PRK12445, lysyl-tRNA synthetase; Reviewed.
Length = 505
Score = 29.6 bits (66), Expect = 7.4
Identities = 56/245 (22%), Positives = 94/245 (38%), Gaps = 39/245 (15%)
Query: 262 LLKVLMKLMEMLPGEGDFVSTLTKKLAPPLVTLLSSEPEVQYVAL--------------- 306
++V +M+++PG +T A L L PE+ L
Sbjct: 203 FMEVETPMMQVIPGGASARPFITHHNALDLDMYLRIAPELYLKRLVVGGFERVFEINRNF 262
Query: 307 RNINLIVQKRPDILKHEMKVFFVKYNDPIYVKLEKLDIMIRLASQANIAQVLSELKEYAT 366
RN + V+ P+ E+ + + Y+D I E + + R +Q +VL K
Sbjct: 263 RNEGISVRHNPEFTMMELYMAYADYHDLI----ELTESLFRTLAQ----EVLGTTKVTYG 314
Query: 367 EVDVDFVRKAVRAIGRCAIKVEQSAERCVSTLLDLIQTKYAERIDNADELLESFLEGFHD 426
E DF + + R AIK R + + DL + D A L ES G
Sbjct: 315 EHVFDFGKPFEKLTMREAIK----KYRPETDMADL------DNFDAAKALAESI--GITV 362
Query: 427 ENT----QVQLQLLTAIVKLFLKRPTDTQELVQQVLSLATQDSDNPDLRDRGFIYWRLLS 482
E + ++ ++ + + L +PT E +V LA ++ NP++ DR +
Sbjct: 363 EKSWGLGRIVTEIFDEVAEAHLIQPTFITEYPAEVSPLARRNDVNPEITDRFEFFIGGRE 422
Query: 483 TGNTF 487
GN F
Sbjct: 423 IGNGF 427
>gnl|CDD|222206 pfam13535, ATP-grasp_4, ATP-grasp domain. This family includes a
diverse set of enzymes that possess ATP-dependent
carboxylate-amine ligase activity.
Length = 183
Score = 29.2 bits (66), Expect = 7.7
Identities = 19/87 (21%), Positives = 31/87 (35%), Gaps = 13/87 (14%)
Query: 345 MIRLASQANIAQV-------LSELKEYATEVDVDFVRKAVRAIG-RCAIKVEQSAERCVS 396
M L A + +L A E+ V K G +V+ +AE
Sbjct: 8 MRELLRAAGLPVPPFFLVDDEEDLDAAAEEIGFPVVLKPRDGAGSLGVFRVDSAAE---- 63
Query: 397 TLLDLIQTKYAERIDNADELLESFLEG 423
L + AE D + L+E +++G
Sbjct: 64 -LEAALAALAAEVEDTREYLVEEYIDG 89
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 29.6 bits (67), Expect = 8.9
Identities = 16/67 (23%), Positives = 26/67 (38%), Gaps = 11/67 (16%)
Query: 606 AKLYDINAQLVEDQGFLDQLKDLLSDSNPMVVANAVAALSEMNEASTSGVALIEMNAQTI 665
+L +L E D LK L P+++A + A SE ++ + +
Sbjct: 211 EELEAARLELNEAIAQRDALKRQLGGEEPVLLAGSSVANSE-----------LDGRIEAL 259
Query: 666 NKLLTAL 672
K L AL
Sbjct: 260 EKQLDAL 266
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.135 0.385
Gapped
Lambda K H
0.267 0.0791 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 44,158,581
Number of extensions: 4401866
Number of successful extensions: 4709
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4645
Number of HSP's successfully gapped: 90
Length of query: 883
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 778
Effective length of database: 6,280,432
Effective search space: 4886176096
Effective search space used: 4886176096
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (27.9 bits)