RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4168
(454 letters)
>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain.
Length = 372
Score = 252 bits (645), Expect = 5e-80
Identities = 107/394 (27%), Positives = 179/394 (45%), Gaps = 44/394 (11%)
Query: 66 TPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDTTPLWKELEEVPI---NNLEEFTELFS 122
P P + PL+W ++ D +W EL+E +L E ELFS
Sbjct: 4 EPKPKKKLK--------PLHWDKV----NPAQDRGTVWDELDEESFEKDLDLSELEELFS 51
Query: 123 RQVTAQRPVTRKRQQKSSKVQNVARLIDSKRSQNVGILAQSLHIEFSEIESAIFNMDASV 182
+ ++ +++ S K + ++D KRSQN+ IL + L + EI AI MD SV
Sbjct: 52 AKAKKKKSKKSEKKSSSKKKKKEISVLDPKRSQNIAILLRKLKLPPEEIVQAILEMDESV 111
Query: 183 VSLEALQQIYDVRATDEEINLIRAHLASNSDLQLDKPEQFLADLADIPHFSERIACFMFE 242
+ LE L+ + + T EE+ ++ + +L + EQFL +L+ IP ER+ +F+
Sbjct: 112 LGLELLENLLKMAPTKEELKKLKEY--KGDPSKLGRAEQFLLELSKIPRLEERLEALLFK 169
Query: 243 ADFSDSIALIDSKLNNIKSVCEFLMKSEHLKKVIAIILSLGNFMNGGNRLRGQADGFGLE 302
+ F + + + L +++ E L +S KK++ +IL+LGN+MN G R RG A GF L
Sbjct: 170 STFEEEVEELKPSLETLEAASEELRESRKFKKLLELILALGNYMNSGTR-RGNAKGFKLS 228
Query: 303 ILPKLKDVRSKDNSVTLLHFIVRTYLRNSENPLNESLPVPEPGDVDRAASVVFDDIHSQL 362
L KL D +S DN TLLH++V+ P E V++AA V + + +
Sbjct: 229 SLLKLSDTKSTDNKTTLLHYLVKIIREK--LPDLLDFS-SELSHVEKAAKVDLEQLEKDV 285
Query: 363 STLAKKLDAVTISMNKVVQESKPDHMEPFRTKMESCVKTGK----------------FKS 406
L K L + + + F KM+ ++ + FK
Sbjct: 286 KELEKGLKKLERELEL--SALDEHPDDKFVEKMKEFLEEAEEKLDKLESLLKEAMELFKE 343
Query: 407 VLKFYQYIPKGIPESDVTPKDLFMLWAPFCHDFK 440
+ +++ PK + +P++ F + F FK
Sbjct: 344 LTEYFGEDPK-----ETSPEEFFKILRDFLRMFK 372
>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain. FH proteins control
rearrangements of the actin cytoskeleton, especially in
the context of cytokinesis and cell polarisation.
Members of this family have been found to interact with
Rho-GTPases, profilin and other actin-assoziated
proteins. These interactions are mediated by the
proline-rich FH1 domain, usually located in front of FH2
(but not listed in SMART). Despite this cytosolic
function, vertebrate formins have been assigned
functions within the nucleus. A set of Formin-Binding
Proteins (FBPs) has been shown to bind FH1 with their WW
domain.
Length = 392
Score = 234 bits (598), Expect = 7e-73
Identities = 111/414 (26%), Positives = 194/414 (46%), Gaps = 52/414 (12%)
Query: 64 MGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSR 123
P P + PL+W + ++P S T +W +++E +L+E ELFS
Sbjct: 1 KKEPKPKKKLK--------PLHWDK--LNPSDLSGT--VWDKIDEESEGDLDELEELFSA 48
Query: 124 QVTAQRP---VTRKRQQKSSKVQNVARLIDSKRSQNVGILAQSLHIEFSEIESAIFNMDA 180
+ + V+ K+ K +++D KRSQN+ IL + LH+ + EI+ AI D
Sbjct: 49 KEKTKSASKDVSEKKSILKKKASQEFKILDPKRSQNLAILLRKLHMSYEEIKEAILEGDE 108
Query: 181 SVVSLEALQQIYDVRATDEEINLIRAHLASNSDL-QLDKPEQFLADLADIPHFSERIACF 239
V+S++ L+Q+ T EE+ +R + D +L + EQFL +++IP+ ER+
Sbjct: 109 DVLSVDLLEQLLKYAPTKEELKKLREY--KEEDPEELARAEQFLLLISNIPYLEERLNAL 166
Query: 240 MFEADFSDSIALIDSKLNNIKSVCEFLMKSEHLKKVIAIILSLGNFMNGGNRLRGQADGF 299
+F+A+F + + + ++ +++ CE L +S+ +K++ +IL++GN+MNGG+R RGQA GF
Sbjct: 167 LFKANFEEEVEDLKPQIEKVEAACEELRESKKFRKLLELILAIGNYMNGGSR-RGQAYGF 225
Query: 300 GLEILPKLKDVRSKDNSVTLLHFIVRT----YLRNSENPLNES---LPVPEPGDVDRAAS 352
L L KL DV+S DN TLLHF+V+ YL +P N + V +P +AA
Sbjct: 226 KLSSLLKLSDVKSADNKTTLLHFLVKIIRKKYLGGLSDPENLDDKFIEVMKP--FLKAAK 283
Query: 353 VVFDDIHSQLSTLAKKLDA---VTISMNKVVQESKPDHMEPFRTKMESCVKTGK------ 403
+D + LS L + + K S + + F ++ K +
Sbjct: 284 EKYDKLQKDLSDLKTRFEKLVEYYGEDPKDT--SPEEFFKDFNEFLKEFSKAAEENIKKE 341
Query: 404 ----------FKSVLKFYQYIPKGIPESDVTPKDLFMLWAPFCHDFKDFWKKEQ 447
K ++ Q + + + P F + F ++
Sbjct: 342 EEEEERRKKLVKETTEYEQSSSR---QKERNPSMDFEVERDFLGVLDSLLEELG 392
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This
family contains bacterial fibronectin-attachment
proteins (FAP). Family members are rich in alanine and
proline, are approximately 300 long, and seem to be
restricted to mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 53.4 bits (128), Expect = 8e-08
Identities = 21/64 (32%), Positives = 23/64 (35%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
+LP A P PPPPPP PPP PP P P PP P+
Sbjct: 31 IALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNA 90
Query: 81 LGPL 84
P
Sbjct: 91 PPPP 94
Score = 44.9 bits (106), Expect = 4e-05
Identities = 26/59 (44%), Positives = 28/59 (47%), Gaps = 4/59 (6%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGT----PPPPPPMMGTPPPPPPMMGTTPHVLG 82
AAPPPPPPP P P P PP PPPPP PPPP P G + +G
Sbjct: 58 AAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEPGRIDNAVG 116
Score = 33.7 bits (77), Expect = 0.16
Identities = 18/39 (46%), Positives = 20/39 (51%), Gaps = 4/39 (10%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
P+ A PPPPP P PPPP + PP PP P P
Sbjct: 74 PDDPNAAPPPPPADPNAPPPPPVDPNAPP----PPAPEP 108
Score = 32.2 bits (73), Expect = 0.55
Identities = 17/49 (34%), Positives = 17/49 (34%), Gaps = 12/49 (24%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P PPPPP P PP PPP P PP P P
Sbjct: 72 PQPDDPNAAPPPPPADPNAPP------------PPPVDPNAPPPPAPEP 108
Score = 28.3 bits (63), Expect = 7.4
Identities = 24/83 (28%), Positives = 26/83 (31%), Gaps = 30/83 (36%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMG--------------------------TPPPPP 61
A PPPP P PPP P G + G T PP
Sbjct: 90 APPPPPVDPNAPPPPAPEPGRIDNAVGGFSYVVPAGWVESDATHLDYGSALLSKTTGEPP 149
Query: 62 PMMGTPPPPPPMMGTTPHVLGPL 84
P PPP+ T VLG L
Sbjct: 150 M----PGQPPPVANDTRVVLGRL 168
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 54.2 bits (130), Expect = 1e-07
Identities = 33/95 (34%), Positives = 34/95 (35%), Gaps = 17/95 (17%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP---PMMGTT 77
F+LP Q PP P PP P P P PPP PPPPPP PP P P
Sbjct: 2898 FALPPDQPERPPQPQAPPPPQPQP---QPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE 2954
Query: 78 PHVLGPLYW-----------TRLIVSPEIPSDTTP 101
P P W R V PS P
Sbjct: 2955 PSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
Score = 49.9 bits (119), Expect = 2e-06
Identities = 19/49 (38%), Positives = 22/49 (44%), Gaps = 5/49 (10%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+ ++ PP P P P PPP P PPPP PPPPPP
Sbjct: 2893 RSTESFALPPDQPERPPQPQA-----PPPPQPQPQPPPPPQPQPPPPPP 2936
Score = 48.4 bits (115), Expect = 7e-06
Identities = 21/49 (42%), Positives = 22/49 (44%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P V + PP P PP PPP PPPPP PPPPPP
Sbjct: 2889 PAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937
Score = 46.5 bits (110), Expect = 3e-05
Identities = 18/69 (26%), Positives = 24/69 (34%), Gaps = 2/69 (2%)
Query: 6 KRAKR--RKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM 63
R +R R ++ + P P P P P P+ + PPP P P
Sbjct: 382 TRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPG 441
Query: 64 MGTPPPPPP 72
P PPP
Sbjct: 442 SDDGPAPPP 450
Score = 43.4 bits (102), Expect = 3e-04
Identities = 26/85 (30%), Positives = 34/85 (40%), Gaps = 17/85 (20%)
Query: 6 KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTP------------PPM 53
+R +RR + T ++ SL A PPPPPP P P P ++ TP P +
Sbjct: 2681 QRPRRRAARPTVGSLTSL-----ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL 2735
Query: 54 MGTPPPPPPMMGTPPPPPPMMGTTP 78
P PP G P P P
Sbjct: 2736 PAAPAPPAVPAGPATPGGPARPARP 2760
Score = 43.0 bits (101), Expect = 4e-04
Identities = 14/51 (27%), Positives = 19/51 (37%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
S ++ + P PP + P + P PPP PPPPP
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844
Score = 42.2 bits (99), Expect = 6e-04
Identities = 21/81 (25%), Positives = 24/81 (29%), Gaps = 4/81 (4%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P A P P PP P PPP P P P P PPP P P +
Sbjct: 2608 PRGPAPPSPLPPDTHAPDPPP----PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSR 2663
Query: 84 LYWTRLIVSPEIPSDTTPLWK 104
R + S +
Sbjct: 2664 PRRARRLGRAAQASSPPQRPR 2684
Score = 40.3 bits (94), Expect = 0.002
Identities = 22/90 (24%), Positives = 25/90 (27%), Gaps = 10/90 (11%)
Query: 22 SLPNVQAAPPPPPPP----------PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
S+P + AP P P PP P G PP P T P P
Sbjct: 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDP 2626
Query: 72 PMMGTTPHVLGPLYWTRLIVSPEIPSDTTP 101
P +P P V P P
Sbjct: 2627 PPPSPSPAANEPDPHPPPTVPPPERPRDDP 2656
Score = 39.9 bits (93), Expect = 0.003
Identities = 24/92 (26%), Positives = 27/92 (29%), Gaps = 17/92 (18%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMG------------TPPPP 70
L A PP P PLPPP T PPP G PPP P+ G P
Sbjct: 2812 LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSP 2871
Query: 71 PPMMGTTPHVLGPLYWTRLIVSPEIPSDTTPL 102
R + P + T
Sbjct: 2872 AAKPAAPARPP-----VRRLARPAVSRSTESF 2898
Score = 39.5 bits (92), Expect = 0.004
Identities = 27/83 (32%), Positives = 29/83 (34%), Gaps = 11/83 (13%)
Query: 22 SLPNVQAAPPPPPPPP-----PLPPPPPMMGTTPPPMMGTPPPP---PPMMGTPPPPPPM 73
V A+ P P P P P PP P+ P G PPP PP T P P
Sbjct: 407 PAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDP 466
Query: 74 MGTTPHVLGPLYWTRLIVSPEIP 96
T L L R PE P
Sbjct: 467 DDATRKALDALRERR---PPEPP 486
Score = 39.2 bits (91), Expect = 0.005
Identities = 17/60 (28%), Positives = 20/60 (33%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
+ ++ P P P P PP P PPP P PPP G P L
Sbjct: 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL 2852
Score = 38.4 bits (89), Expect = 0.010
Identities = 15/49 (30%), Positives = 16/49 (32%), Gaps = 5/49 (10%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPP-----PMMGTPPPPPPMMGTPPPPP 71
A PPPPP P P + G P P PPP P
Sbjct: 272 ARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAP 320
Score = 38.0 bits (88), Expect = 0.011
Identities = 22/76 (28%), Positives = 24/76 (31%), Gaps = 24/76 (31%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPM-----------MGTTPPPMMG-------------TPPP 59
P P PPPPPPP P PP G P P +G P P
Sbjct: 2923 PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP 2982
Query: 60 PPPMMGTPPPPPPMMG 75
P PP+ G
Sbjct: 2983 APSREAPASSTPPLTG 2998
Score = 38.0 bits (88), Expect = 0.011
Identities = 25/80 (31%), Positives = 29/80 (36%), Gaps = 7/80 (8%)
Query: 27 QAAPPPP----PPPPPLPPPPPMMGTTP--PPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
QA+P P PP P P P P PP PP P P PP + T P V
Sbjct: 2730 QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL-TRPAV 2788
Query: 81 LGPLYWTRLIVSPEIPSDTT 100
+ SP P+D
Sbjct: 2789 ASLSESRESLPSPWDPADPP 2808
Score = 37.2 bits (86), Expect = 0.018
Identities = 24/98 (24%), Positives = 34/98 (34%), Gaps = 11/98 (11%)
Query: 5 DKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPP-PPMMGTTPPPMMGTPPPPPPM 63
D A R + + ++PP P P + PP PPPP P
Sbjct: 2654 DDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPP----PPPPTP- 2708
Query: 64 MGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDTTP 101
P P ++ TP GP + SP +P+ P
Sbjct: 2709 ---EPAPHALVSATPLPPGPAAARQ--ASPALPAAPAP 2741
Score = 37.2 bits (86), Expect = 0.020
Identities = 20/64 (31%), Positives = 22/64 (34%), Gaps = 18/64 (28%)
Query: 27 QAAPPPPPPP-PPLP-----------PPPPMMGTTP------PPMMGTPPPPPPMMGTPP 68
++ PPP P P P P PP P P P P PP P
Sbjct: 2566 RSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
Query: 69 PPPP 72
PPPP
Sbjct: 2626 PPPP 2629
Score = 35.7 bits (82), Expect = 0.059
Identities = 15/49 (30%), Positives = 15/49 (30%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
P P P PP P P P P T PPP P P
Sbjct: 2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPG 2659
Score = 34.9 bits (80), Expect = 0.094
Identities = 14/64 (21%), Positives = 17/64 (26%), Gaps = 6/64 (9%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPM------MGTPPPPPPMMGTPPPPPPMMGTTPHV 80
+ P P P PP+ P + PP P P PPP
Sbjct: 2865 RPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP 2924
Query: 81 LGPL 84
P
Sbjct: 2925 PPPQ 2928
Score = 34.9 bits (80), Expect = 0.12
Identities = 19/82 (23%), Positives = 23/82 (28%), Gaps = 19/82 (23%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMM-------------GTTPPPMMGTPPPPPPMMGT- 66
PPP P P P + + PP P +G+
Sbjct: 2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
Query: 67 -----PPPPPPMMGTTPHVLGP 83
PPPPPP PH L
Sbjct: 2696 TSLADPPPPPPTPEPAPHALVS 2717
Score = 34.1 bits (78), Expect = 0.20
Identities = 19/80 (23%), Positives = 24/80 (30%), Gaps = 16/80 (20%)
Query: 6 KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP---PP 62
+R+ R F+ P P P P P P P+ + PPP P
Sbjct: 385 RRSARHAATP-----FARGPGGDDQTRPAAPVPASVPTP----APTPVPASAPPPPATPL 435
Query: 63 MMGTPP----PPPPMMGTTP 78
P P PP P
Sbjct: 436 PSAEPGSDDGPAPPPERQPP 455
Score = 33.4 bits (76), Expect = 0.30
Identities = 20/71 (28%), Positives = 25/71 (35%), Gaps = 9/71 (12%)
Query: 22 SLPNVQAAPPPPPPP-----PPLPPPPPMMGTTPPPMMGTPP----PPPPMMGTPPPPPP 72
+ P + AAP PP P P P P TT P PP PP T P
Sbjct: 2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
Query: 73 MMGTTPHVLGP 83
+ + + P
Sbjct: 2791 LSESRESLPSP 2801
Score = 33.4 bits (76), Expect = 0.34
Identities = 18/84 (21%), Positives = 25/84 (29%), Gaps = 4/84 (4%)
Query: 22 SLPNVQAAP--PPPPPPPPLPPPPPMMGTTPPPMMGTP--PPPPPMMGTPPPPPPMMGTT 77
+ P A P P PP L P + + +P P PP P +
Sbjct: 2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAS 2823
Query: 78 PHVLGPLYWTRLIVSPEIPSDTTP 101
P P + +P P P
Sbjct: 2824 PAGPLPPPTSAQPTAPPPPPGPPP 2847
Score = 32.2 bits (73), Expect = 0.67
Identities = 18/63 (28%), Positives = 22/63 (34%), Gaps = 18/63 (28%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMM--------------GTPPPPPPMMGTPPP 69
P+ A P P P + P P+ P M+ G PPPP P P
Sbjct: 2504 PDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLP----PAA 2559
Query: 70 PPP 72
PP
Sbjct: 2560 PPA 2562
Score = 32.2 bits (73), Expect = 0.74
Identities = 21/85 (24%), Positives = 23/85 (27%), Gaps = 7/85 (8%)
Query: 24 PNVQAAPPPPPPPP----PLPPPPPMMGTTPPPMMGTPPPPPPM---MGTPPPPPPMMGT 76
P P P PP P P PP PP T P + + P P
Sbjct: 2748 PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADP 2807
Query: 77 TPHVLGPLYWTRLIVSPEIPSDTTP 101
VL P SP P
Sbjct: 2808 PAAVLAPAAALPPAASPAGPLPPPT 2832
Score = 31.8 bits (72), Expect = 1.1
Identities = 14/71 (19%), Positives = 17/71 (23%), Gaps = 20/71 (28%)
Query: 22 SLPNVQAAPPPPPP------------------PPPLPPPPPMMGTTPPPM--MGTPPPPP 61
+ P PP PPP PP P PP+ + P
Sbjct: 2834 AQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR 2893
Query: 62 PMMGTPPPPPP 72
PP
Sbjct: 2894 STESFALPPDQ 2904
Score = 31.1 bits (70), Expect = 1.7
Identities = 21/78 (26%), Positives = 23/78 (29%), Gaps = 30/78 (38%)
Query: 24 PNVQAAPPPPPPPPPLP---------------PPPPMM---------------GTTPPPM 53
P+ PP P PP P P P M G PPP+
Sbjct: 2496 PDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPL 2555
Query: 54 MGTPPPPPPMMGTPPPPP 71
PP P PPP P
Sbjct: 2556 PPAAPPAAPDRSVPPPRP 2573
Score = 29.9 bits (67), Expect = 4.2
Identities = 15/57 (26%), Positives = 17/57 (29%), Gaps = 7/57 (12%)
Query: 23 LPNVQAAPPPPPPPPPLP---PPPPMMGTTPPPMMGTP----PPPPPMMGTPPPPPP 72
P AP P P P P +G P + P P P P P P
Sbjct: 2938 RPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994
Score = 29.5 bits (66), Expect = 4.4
Identities = 19/63 (30%), Positives = 22/63 (34%), Gaps = 18/63 (28%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP----PPPMM--------------GTPPP 69
AAP P PP P PP P ++ P P M+ G PPP
Sbjct: 2494 AAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPP 2553
Query: 70 PPP 72
P P
Sbjct: 2554 PLP 2556
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 48.6 bits (116), Expect = 5e-06
Identities = 34/130 (26%), Positives = 44/130 (33%), Gaps = 10/130 (7%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPP-----MMGTPPPPPPMMGTPPPP---PPM 73
LP AP PP PP LP PP + P M PP PP PP P PP
Sbjct: 219 FLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPP 278
Query: 74 MGTTPHVLGPLYWTRLIVSPEIPSDTTPLWKELEEVPINNL--EEFTELFSRQVTAQRPV 131
G + P PL ++ + E+ +L +Q A
Sbjct: 279 QNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQE 338
Query: 132 TRKRQQKSSK 141
KR ++ K
Sbjct: 339 EAKRAKRRHK 348
Score = 40.5 bits (95), Expect = 0.002
Identities = 15/71 (21%), Positives = 17/71 (23%), Gaps = 6/71 (8%)
Query: 19 NIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGT------PPPPPPMMGTPPPPPP 72
LP P P PP P P P PP PP
Sbjct: 176 PQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQ 235
Query: 73 MMGTTPHVLGP 83
+ P + P
Sbjct: 236 LPQQPPPLQQP 246
Score = 38.2 bits (89), Expect = 0.010
Identities = 18/61 (29%), Positives = 19/61 (31%), Gaps = 1/61 (1%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
L Q AP P PP + P P G PP PP PP P L
Sbjct: 163 LQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQG-PPEQPPGYPQPPQGHPEQVQPQQFLP 221
Query: 83 P 83
Sbjct: 222 A 222
Score = 37.4 bits (87), Expect = 0.015
Identities = 15/65 (23%), Positives = 16/65 (24%), Gaps = 7/65 (10%)
Query: 22 SLPNVQAAPPPPPPPPPLPPP----PPMMGTTPPPMMGTPPPPPPMMGTP---PPPPPMM 74
PP P +PP P PP PP P P P P
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQA 226
Query: 75 GTTPH 79
P
Sbjct: 227 PAQPP 231
Score = 35.9 bits (83), Expect = 0.043
Identities = 12/49 (24%), Positives = 13/49 (26%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P P P P P PP P PPP+ P
Sbjct: 204 PQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLS 252
Score = 35.9 bits (83), Expect = 0.046
Identities = 16/66 (24%), Positives = 17/66 (25%), Gaps = 11/66 (16%)
Query: 24 PNVQAAPPPPPPPPPL------PPPPPMMGTTPPPMMGTP-----PPPPPMMGTPPPPPP 72
P Q P PP P PP P P P PP PP
Sbjct: 175 PPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPP 234
Query: 73 MMGTTP 78
+ P
Sbjct: 235 QLPQQP 240
Score = 34.7 bits (80), Expect = 0.11
Identities = 16/60 (26%), Positives = 19/60 (31%), Gaps = 3/60 (5%)
Query: 20 IFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
+ SL V+A P LP PP + P M P G P PP
Sbjct: 153 MLSLEEVEAQLQQRQQAPQLPQPPQQV---LPQGMPPRQAAFPQQGPPEQPPGYPQPPQG 209
Score = 34.4 bits (79), Expect = 0.15
Identities = 18/69 (26%), Positives = 19/69 (27%), Gaps = 5/69 (7%)
Query: 21 FSLPNVQAAPPPPP--PPPPLPPPPPMMGTTPPPMMGTPPPPPPM---MGTPPPPPPMMG 75
P Q PP P PPP P P G PPP P + P
Sbjct: 259 PPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGP 318
Query: 76 TTPHVLGPL 84
L L
Sbjct: 319 QFREQLVQL 327
Score = 32.4 bits (74), Expect = 0.58
Identities = 14/81 (17%), Positives = 18/81 (22%), Gaps = 7/81 (8%)
Query: 20 IFSLPNVQAAPPPP-PPPPPLPPPP--PMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
++ PP P P P P M+ P P PP
Sbjct: 125 LWGTAPKPEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQ----LPQPPQQVL 180
Query: 77 TPHVLGPLYWTRLIVSPEIPS 97
+ PE P
Sbjct: 181 PQGMPPRQAAFPQQGPPEQPP 201
Score = 32.4 bits (74), Expect = 0.64
Identities = 14/77 (18%), Positives = 15/77 (19%), Gaps = 13/77 (16%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPM-------------MGTTPPPMMGTPPPPPPMMGTPP 68
SL P P PP P P P P PP +
Sbjct: 124 SLWGTAPKPEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQG 183
Query: 69 PPPPMMGTTPHVLGPLY 85
PP
Sbjct: 184 MPPRQAAFPQQGPPEQP 200
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 46.9 bits (111), Expect = 7e-06
Identities = 22/53 (41%), Positives = 24/53 (45%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
S N + P PPPPPP PP + G PPP P PPPPPM
Sbjct: 165 PSPWNPRFPPGPPPPPPGFGRHGEKPSGWPPFLSGWPPPFPLGPPMIPPPPPM 217
Score = 33.8 bits (77), Expect = 0.14
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMG 47
P + PPP P PP+ PPPP M
Sbjct: 195 PFLSGWPPPFPLGPPMIPPPPPMS 218
>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR. This region is found
to the N-terminus of the pfam00170 transcription factor
domain. It is between 150 and 200 amino acids in length.
The N-terminal half is rather rich in proline residues
and has been termed the PRD (proline rich domain),
whereas the C-terminal half is more polar and has been
called the MFMR (multifunctional mosaic region). It has
been suggested that this family is composed of three
sub-families called A, B and C, classified according to
motif composition. It has been suggested that some of
these motifs may be involved in mediating
protein-protein interactions. The MFMR region contains a
nuclear localisation signal in bZIP opaque and GBF-2.
The MFMR also contains a transregulatory activity in
TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
signals.
Length = 189
Score = 45.6 bits (108), Expect = 1e-05
Identities = 27/85 (31%), Positives = 28/85 (32%), Gaps = 24/85 (28%)
Query: 24 PNVQAAPPPPPPPPPL-------PPPPPMMGTTPPPMM---GTPPP-----PP------- 61
+QA P PPPP P P P M PMM GTPPP PP
Sbjct: 32 SAMQAYYGPRPPPPYFNSSVASSPQPHPYMWGPQQPMMPPYGTPPPYAAMYPPGGVYAHP 91
Query: 62 --PMMGTPPPPPPMMGTTPHVLGPL 84
P P P M PL
Sbjct: 92 SMPPGSHPFSPYAMPSAEVPGSTPL 116
Score = 37.9 bits (88), Expect = 0.005
Identities = 13/67 (19%), Positives = 17/67 (25%), Gaps = 8/67 (11%)
Query: 29 APPPPPPPPPLPPPPPMMGTTP--------PPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
P P + P P PP + + P P P M
Sbjct: 8 KPSKSSPKTSVQEDTPTPTVYPDWSAMQAYYGPRPPPPYFNSSVASSPQPHPYMWGPQQP 67
Query: 81 LGPLYWT 87
+ P Y T
Sbjct: 68 MMPPYGT 74
Score = 37.1 bits (86), Expect = 0.008
Identities = 13/63 (20%), Positives = 19/63 (30%), Gaps = 3/63 (4%)
Query: 24 PNVQAAPPPPPPPPPLPPPPP---MMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
P+ + P P P M P P + +P P P M G +
Sbjct: 9 PSKSSPKTSVQEDTPTPTVYPDWSAMQAYYGPRPPPPYFNSSVASSPQPHPYMWGPQQPM 68
Query: 81 LGP 83
+ P
Sbjct: 69 MPP 71
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 45.7 bits (108), Expect = 3e-05
Identities = 17/52 (32%), Positives = 17/52 (32%), Gaps = 3/52 (5%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P PP P T P GT P GTP PP P G P
Sbjct: 220 TTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAE---GTPAPPTPGGGEAP 268
Score = 41.9 bits (98), Expect = 5e-04
Identities = 16/48 (33%), Positives = 18/48 (37%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P+ +PP P P T P GTP PP P G PP
Sbjct: 224 PSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPAN 271
Score = 41.1 bits (96), Expect = 9e-04
Identities = 15/52 (28%), Positives = 17/52 (32%), Gaps = 1/52 (1%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
P P P P P T P +PP + P GTTP G
Sbjct: 205 PATPRPTPRTTASPET-TPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEG 255
Score = 33.8 bits (77), Expect = 0.20
Identities = 19/76 (25%), Positives = 24/76 (31%), Gaps = 8/76 (10%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP----PPMMGTPPPPPPMMGTTPHVLGPLY 85
P P P L P + TP P T P P T PP + + + P
Sbjct: 188 PALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQA 247
Query: 86 WTRLIVSPEIPSDTTP 101
T +PE P
Sbjct: 248 GT----TPEAEGTPAP 259
Score = 33.4 bits (76), Expect = 0.24
Identities = 15/64 (23%), Positives = 17/64 (26%), Gaps = 6/64 (9%)
Query: 38 PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPS 97
P P P T P T P P P P TT + P+
Sbjct: 205 PATPRPTPRTTASPE---TTPTPSTTTSPPSTTIPAPSTTIAAPQA---GTTPEAEGTPA 258
Query: 98 DTTP 101
TP
Sbjct: 259 PPTP 262
>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
(DUF2360). This is the conserved 140 amino acid region
of a family of proteins conserved from nematodes to
humans. One C. elegans member is annotated as a
Daf-16-dependent longevity protein 1 but this could not
be confirmed. The function is unknown.
Length = 147
Score = 43.5 bits (103), Expect = 3e-05
Identities = 18/60 (30%), Positives = 21/60 (35%), Gaps = 5/60 (8%)
Query: 18 SNIFSLPNVQAAPPPPPPPP-----PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
++I L +V PPPP PPPPP P P PP P P
Sbjct: 45 ASIPGLEDVTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAE 104
Score = 33.1 bits (76), Expect = 0.13
Identities = 20/62 (32%), Positives = 20/62 (32%), Gaps = 13/62 (20%)
Query: 24 PNVQAAPPPPP-------PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
VQ PPPP PPPP P PP P PP P P P T
Sbjct: 53 VTVQTTPPPPASAITNGGPPPPPP---ARAEAASPPPPEAPAEPPAE---PEPEAPAENT 106
Query: 77 TP 78
Sbjct: 107 VT 108
Score = 29.3 bits (66), Expect = 2.1
Identities = 19/69 (27%), Positives = 21/69 (30%), Gaps = 13/69 (18%)
Query: 34 PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSP 93
PPPP T G PPPPPP PPP P
Sbjct: 53 VTVQTTPPPPASAITN-----GGPPPPPPARAEAASPPP-PEAPAEPPAE-------PEP 99
Query: 94 EIPSDTTPL 102
E P++ T
Sbjct: 100 EAPAENTVT 108
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 43.4 bits (102), Expect = 4e-05
Identities = 39/81 (48%), Positives = 42/81 (51%), Gaps = 24/81 (29%)
Query: 22 SLPNVQAAPPPPP--------PPPPLP-------PPPPMMGTT----PPPM---MGTPPP 59
LP A PPPPP PPPPLP PPPP+ G PPP+ G PPP
Sbjct: 65 PLPGATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPP 124
Query: 60 PPPMMGTP--PPPPPMMGTTP 78
PPP G P PPPPP MG+ P
Sbjct: 125 PPPFPGAPGIPPPPPGMGSPP 145
Score = 42.2 bits (99), Expect = 9e-05
Identities = 25/57 (43%), Positives = 26/57 (45%), Gaps = 6/57 (10%)
Query: 23 LPNVQAAPPPPPPP------PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
LP A PPPPP P PP PP P G PPP PPPPP P P +
Sbjct: 103 LPGGAAVPPPPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPPFGFGVPAAPVL 159
Score = 40.7 bits (95), Expect = 4e-04
Identities = 32/67 (47%), Positives = 34/67 (50%), Gaps = 12/67 (17%)
Query: 23 LPNVQAAPPPPP---------PPPPLPPPPPMMGTTPPPMMGTP--PPPPPMMGTPPPPP 71
LP PPPPP PPPPLP P + PPP G P PPPPP MG+PPPPP
Sbjct: 90 LPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVP-PPPPPFPGAPGIPPPPPGMGSPPPPP 148
Query: 72 PMMGTTP 78
G
Sbjct: 149 FGFGVPA 155
Score = 38.4 bits (89), Expect = 0.002
Identities = 35/77 (45%), Positives = 38/77 (49%), Gaps = 23/77 (29%)
Query: 22 SLPNVQAAPPPP--------PPPPPL------PPPPPMMGTT----PPPMMGT-----PP 58
LP PPPP PPPPPL PPPPP+ G T PPP+ G PP
Sbjct: 29 PLPGGTGIPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLPGATAIPPPPPLPGAAGIPPPP 88
Query: 59 PPPPMMGTPPPPPPMMG 75
P P G PPPPPP+ G
Sbjct: 89 PLPGGAGIPPPPPPLPG 105
Score = 35.3 bits (81), Expect = 0.023
Identities = 32/71 (45%), Positives = 36/71 (50%), Gaps = 10/71 (14%)
Query: 23 LPNVQAAPPPPPPPPP--LPPPPPMMGTT----PPPMMG----TPPPPPPMMGTPPPPPP 72
LP V PPPPP P +PPPPP+ G PPP+ G PPPPP G PPPP
Sbjct: 54 LPGVAGIPPPPPLPGATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPPLPGGAAVPPPP 113
Query: 73 MMGTTPHVLGP 83
+ P V P
Sbjct: 114 PLPGGPGVPPP 124
Score = 34.5 bits (79), Expect = 0.039
Identities = 25/56 (44%), Positives = 28/56 (50%), Gaps = 10/56 (17%)
Query: 30 PPPPPPP-----PPLPPPPPMMGTTPPPMMG-----TPPPPPPMMGTPPPPPPMMG 75
PPPPP P PP PP P + PPP + PPPP P PPPPP+ G
Sbjct: 1 PPPPPLPGGVGIPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLPG 56
Score = 34.5 bits (79), Expect = 0.050
Identities = 33/69 (47%), Positives = 37/69 (53%), Gaps = 21/69 (30%)
Query: 30 PPPP-------PPPPPL------PPPPPMMGTT----PPPMMGT----PPPPPPMMGTPP 68
PPPP PPPPPL PPPPP+ G T PPP+ G PPPP P + P
Sbjct: 2 PPPPLPGGVGIPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLPGVAGIP 61
Query: 69 PPPPMMGTT 77
PPPP+ G T
Sbjct: 62 PPPPLPGAT 70
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 45.3 bits (108), Expect = 4e-05
Identities = 23/99 (23%), Positives = 34/99 (34%), Gaps = 4/99 (4%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
A+ PPPP PPP T P P P P PPPPP + + +
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPTPPLP---QPTATAPPPTPPPPPPTATQASSNAPAQIPAD 573
Query: 88 RLIVSPEIPSDTTPLWKELEEVPINNL-EEFTELFSRQV 125
P T K+ I+ + + F+ ++
Sbjct: 574 SSPPPPIPEEPTPSPTKDSSPEEIDKAAKNLADFFNGEI 612
Score = 35.3 bits (82), Expect = 0.071
Identities = 19/49 (38%), Positives = 20/49 (40%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P A PP PPPPPP P +PPPP P TP P
Sbjct: 543 PTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKD 591
Score = 34.6 bits (80), Expect = 0.11
Identities = 17/55 (30%), Positives = 19/55 (34%), Gaps = 4/55 (7%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTP----PPPPPMMGTPPPPPP 72
+ P PP P P PP PPP P P +PPPP P
Sbjct: 527 PQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIP 581
Score = 34.6 bits (80), Expect = 0.11
Identities = 15/49 (30%), Positives = 15/49 (30%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P P P P P PPP T P P PPP P
Sbjct: 534 PAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPE 582
Score = 33.4 bits (77), Expect = 0.24
Identities = 22/91 (24%), Positives = 29/91 (31%), Gaps = 10/91 (10%)
Query: 12 KKKETYSNIFSLPNVQAAPPPPPPPPPL-----PPPPPMMGTTPPPMMGTPPPPPPMMGT 66
P+ AP P P P PPP + P P+ P P P
Sbjct: 371 ASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSP---- 426
Query: 67 PPPPPPMMGTTPHVLGPLYWTRLIVSPEIPS 97
PP L L W +++ E+PS
Sbjct: 427 TPPANAANAPPSLNLEEL-WQQILAKLELPS 456
Score = 33.4 bits (77), Expect = 0.24
Identities = 28/117 (23%), Positives = 38/117 (32%), Gaps = 14/117 (11%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
LP+ + P P P P P P+ + P TP PPP V
Sbjct: 360 LPSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPA 419
Query: 83 PLYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQRPVTRK--RQQ 137
T +P + P P NLEE + ++ + P TR QQ
Sbjct: 420 EP--TEPSPTPPANAANAP--------PSLNLEELWQQILAKL--ELPSTRMLLSQQ 464
Score = 33.0 bits (76), Expect = 0.33
Identities = 14/50 (28%), Positives = 16/50 (32%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
P + PPP P PPPP + P P P P P
Sbjct: 538 PPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPS 587
Score = 32.6 bits (75), Expect = 0.42
Identities = 19/50 (38%), Positives = 21/50 (42%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
LP A PPP PPPP P P + PPPP+ P P P
Sbjct: 540 LPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPT 589
Score = 32.2 bits (74), Expect = 0.63
Identities = 11/49 (22%), Positives = 12/49 (24%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
PPPPPP P PP P +P
Sbjct: 545 ATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDSS 593
Score = 31.9 bits (73), Expect = 0.76
Identities = 17/61 (27%), Positives = 20/61 (32%)
Query: 12 KKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
+ S + PPP P P P TTP P PP P+ P P
Sbjct: 366 SEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPS 425
Query: 72 P 72
P
Sbjct: 426 P 426
Score = 31.5 bits (72), Expect = 1.2
Identities = 17/62 (27%), Positives = 21/62 (33%)
Query: 12 KKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
+K A PPP PPP PP + P + PPP + P P
Sbjct: 528 QKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPS 587
Query: 72 PM 73
P
Sbjct: 588 PT 589
Score = 29.5 bits (67), Expect = 4.7
Identities = 21/69 (30%), Positives = 27/69 (39%), Gaps = 1/69 (1%)
Query: 41 PPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSP-EIPSDT 99
TPPP +PPPP P P P TP P +P +IP+D+
Sbjct: 515 GSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADS 574
Query: 100 TPLWKELEE 108
+P EE
Sbjct: 575 SPPPPIPEE 583
>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region.
Length = 57
Score = 40.5 bits (95), Expect = 6e-05
Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 2/51 (3%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGT--PPPPPPMMGTPPPPPPMMGTTP 78
PPPP PPPP + PPP+ PPPP+ +PPPP + + P
Sbjct: 5 SPPPPVKQYSPPPPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPKYVYKSPP 55
Score = 40.1 bits (94), Expect = 6e-05
Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 3/50 (6%)
Query: 24 PNVQAAPPPPPP---PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
P V+ PPPP PP P P+ + PPP+ +PPPP + +PPPP
Sbjct: 8 PPVKQYSPPPPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPKYVYKSPPPP 57
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 42.5 bits (100), Expect = 8e-05
Identities = 19/59 (32%), Positives = 23/59 (38%), Gaps = 2/59 (3%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMG--TPPPPPPMMGTPPPPPPMMGTTP 78
+LP P P P P P PM P + P PP P M P PP++ P
Sbjct: 91 NLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLPDLP 149
Score = 35.9 bits (83), Expect = 0.016
Identities = 18/74 (24%), Positives = 24/74 (32%), Gaps = 3/74 (4%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
+P + P P LP P P PP P M PP P+ P
Sbjct: 76 VPGQHSMTPTQHHQPNLPQPAQQP---FQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPL 132
Query: 83 PLYWTRLIVSPEIP 96
P + + P +P
Sbjct: 133 PPMFPMQPLPPLLP 146
Score = 33.2 bits (76), Expect = 0.12
Identities = 18/60 (30%), Positives = 20/60 (33%), Gaps = 6/60 (10%)
Query: 18 SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
+Q PP P PP P PP PPM P PP + P P T
Sbjct: 106 QPPQPQQPMQPQPPVHPIPPLPPQPP------LPPMFPMQPLPPLLPDLPLEAWPATDKT 159
Score = 29.4 bits (66), Expect = 2.8
Identities = 17/61 (27%), Positives = 20/61 (32%), Gaps = 5/61 (8%)
Query: 23 LPNVQAAPPPPPPPP--PLPPPPPMMGT---TPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
+P + A P P P P+P M T P P P PP P M
Sbjct: 58 IPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQ 117
Query: 78 P 78
P
Sbjct: 118 P 118
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 41.6 bits (98), Expect = 8e-05
Identities = 21/46 (45%), Positives = 22/46 (47%), Gaps = 3/46 (6%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AAPPP PPPP P PP + TPPP PPPP P
Sbjct: 65 YAAPPPYGPPPPYYPAPPGVYPTPPPPN---SGYMADPQEPPPPYP 107
Score = 38.5 bits (90), Expect = 8e-04
Identities = 28/80 (35%), Positives = 34/80 (42%), Gaps = 9/80 (11%)
Query: 1 MEYMDKRAKRRKKKETYSNIFSLPNVQAAPPPPPP------PPPLPPPPPMMGTTPPPMM 54
ME + +RA+R + S + P P P PPP PPPP PP +
Sbjct: 27 MERLAERAQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYP-APPGVY 85
Query: 55 GTPPPP--PPMMGTPPPPPP 72
TPPPP M PPPP
Sbjct: 86 PTPPPPNSGYMADPQEPPPP 105
Score = 37.4 bits (87), Expect = 0.002
Identities = 22/72 (30%), Positives = 23/72 (31%), Gaps = 2/72 (2%)
Query: 14 KETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP--PPPMMGTPPPPP 71
E + PPP L P P PP G PPP P P P PPP
Sbjct: 31 AERAQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPP 90
Query: 72 PMMGTTPHVLGP 83
P G P
Sbjct: 91 PNSGYMADPQEP 102
Score = 34.3 bits (79), Expect = 0.026
Identities = 15/40 (37%), Positives = 16/40 (40%), Gaps = 1/40 (2%)
Query: 24 PNVQAAPPPPPPPPPLP-PPPPMMGTTPPPMMGTPPPPPP 62
P PPP P PP P PP + PPPP P
Sbjct: 68 PPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYP 107
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 42.1 bits (99), Expect = 9e-05
Identities = 18/56 (32%), Positives = 19/56 (33%), Gaps = 4/56 (7%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
P APP P PP P P + T PP PP P P P P
Sbjct: 81 PLTPPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAP----AAAVPAPAAAPPPS 132
Score = 38.7 bits (90), Expect = 0.002
Identities = 15/45 (33%), Positives = 15/45 (33%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
PP P P P PP T P P P PP PP
Sbjct: 91 VTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPP 135
Score = 36.7 bits (85), Expect = 0.007
Identities = 16/49 (32%), Positives = 17/49 (34%), Gaps = 2/49 (4%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P P P P +P PPP PP P P PPP P
Sbjct: 88 PEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPA--AAPPPSDP 134
Score = 34.4 bits (79), Expect = 0.040
Identities = 16/48 (33%), Positives = 16/48 (33%), Gaps = 4/48 (8%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
AP P PPP P P P PPP PP PP
Sbjct: 95 TAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSD----PPQPP 138
Score = 27.5 bits (61), Expect = 9.6
Identities = 12/36 (33%), Positives = 12/36 (33%), Gaps = 1/36 (2%)
Query: 28 AAPPPPPPPPPLP-PPPPMMGTTPPPMMGTPPPPPP 62
P P PP P P PPP PP P
Sbjct: 106 PPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPPRAP 141
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 44.8 bits (106), Expect = 9e-05
Identities = 14/56 (25%), Positives = 16/56 (28%), Gaps = 7/56 (12%)
Query: 24 PNVQAAPPPPPPPPPLP-------PPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+ A P P P P PPPP + P P P PP
Sbjct: 687 AHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPP 742
Score = 44.4 bits (105), Expect = 1e-04
Identities = 11/42 (26%), Positives = 12/42 (28%), Gaps = 1/42 (2%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
PPPP P P P + P P PP
Sbjct: 711 PPPPDLPAAAGPAPCGSSLIASPTAPPEPEPP-GAEQADGAE 751
Score = 42.9 bits (101), Expect = 3e-04
Identities = 15/59 (25%), Positives = 17/59 (28%), Gaps = 1/59 (1%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT-TPHVLGP 83
AA P P P PP G PPP + P P + P P
Sbjct: 688 HHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQ 746
Score = 42.1 bits (99), Expect = 5e-04
Identities = 14/55 (25%), Positives = 15/55 (27%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P A P P PP G PPP P G+ P P
Sbjct: 685 PTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEP 739
Score = 34.4 bits (79), Expect = 0.15
Identities = 19/75 (25%), Positives = 20/75 (26%), Gaps = 9/75 (12%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
A P P+ P P PP G PPPP P G
Sbjct: 672 ARPSRVARGDPVRPTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCG----- 726
Query: 88 RLIVSPEIPSDTTPL 102
S I S T P
Sbjct: 727 ----SSLIASPTAPP 737
Score = 28.6 bits (64), Expect = 7.9
Identities = 12/62 (19%), Positives = 13/62 (20%), Gaps = 17/62 (27%)
Query: 30 PPPPPPPPPLP---------------PPPPMM--GTTPPPMMGTPPPPPPMMGTPPPPPP 72
PP P P P P M T P + P P P
Sbjct: 636 PPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAALRAP 695
Query: 73 MM 74
Sbjct: 696 QA 697
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 40.4 bits (95), Expect = 2e-04
Identities = 21/52 (40%), Positives = 22/52 (42%), Gaps = 4/52 (7%)
Query: 24 PNVQAAPPPPPPP----PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P V A PPP PP P PP+ G PP PP P PPP P
Sbjct: 55 PPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRP 106
Score = 37.3 bits (87), Expect = 0.003
Identities = 18/47 (38%), Positives = 22/47 (46%), Gaps = 3/47 (6%)
Query: 29 APPPPP---PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+PP PPP+ P P + TPPP+ PPP PPP P
Sbjct: 30 SPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDP 76
Score = 35.8 bits (83), Expect = 0.008
Identities = 17/44 (38%), Positives = 20/44 (45%), Gaps = 2/44 (4%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
PP P PPP G PP++ TPPP + PPP P
Sbjct: 29 FSPPESAHPDDPPPV--GDPRPPVVDTPPPVSAVWVLPPPSEPA 70
Score = 32.0 bits (73), Expect = 0.17
Identities = 16/51 (31%), Positives = 19/51 (37%), Gaps = 4/51 (7%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
PP P PP+ PP + PPP P P P PP+ G
Sbjct: 39 DPPPVGDPRPPVVDTPPPVSAVWVL----PPPSEPAAPPPDPEPPVPGPAG 85
Score = 31.2 bits (71), Expect = 0.39
Identities = 17/52 (32%), Positives = 17/52 (32%), Gaps = 3/52 (5%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP---PPPMMGTPPPPP 71
P PPP PPP PPP P P PP PP P
Sbjct: 45 DPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPAPA 96
Score = 29.3 bits (66), Expect = 1.8
Identities = 12/23 (52%), Positives = 13/23 (56%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPP 44
+ P APP P PPLPPP P
Sbjct: 84 AGPPSPLAPPAPARKPPLPPPRP 106
Score = 28.5 bits (64), Expect = 3.0
Identities = 17/50 (34%), Positives = 18/50 (36%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P PP PPP + PPP PPP P P P P P
Sbjct: 43 VGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAP 92
Score = 28.5 bits (64), Expect = 3.3
Identities = 16/51 (31%), Positives = 17/51 (33%), Gaps = 2/51 (3%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGT--PPPPPPMMGTPPPPPP 72
P+ P PP PPP PP PPP P P PP
Sbjct: 37 PDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPP 87
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 41.7 bits (98), Expect = 2e-04
Identities = 19/55 (34%), Positives = 20/55 (36%), Gaps = 2/55 (3%)
Query: 26 VQAAPPPPPP--PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
VQ P P PP P P P PP G P PP + PP G P
Sbjct: 122 VQTTPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPPPGLLQPQGPPP 176
Score = 32.5 bits (74), Expect = 0.25
Identities = 19/60 (31%), Positives = 23/60 (38%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P + A PLP PP + P P P PP G P PP + P +L P
Sbjct: 112 PVMTRATSTTVQTTPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPPPGLLQP 171
Score = 30.2 bits (68), Expect = 1.4
Identities = 17/41 (41%), Positives = 18/41 (43%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
S P Q P PP P P PP PPP + P PPP
Sbjct: 136 SYPGPQYQGYHPMPPQPGMPAPPYSLQYPPPGLLQPQGPPP 176
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 42.5 bits (100), Expect = 4e-04
Identities = 22/116 (18%), Positives = 32/116 (27%), Gaps = 8/116 (6%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
P + P PP + T P P PP P+ P P
Sbjct: 379 VRPTPAPSTRPKAAAAANIPPKEPVRETATP---PPVPPRPVAPPVPHTPESAPKLTRAA 435
Query: 82 GPLYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQRPVTRKRQQ 137
P+ P P + K L + LE+ ++ Q+ P Q
Sbjct: 436 IPVDEKPKYTPPAPPKEE---EKALIA-DGDVLEQLEAIWK-QILRDVPPRSPAVQ 486
Score = 39.8 bits (93), Expect = 0.003
Identities = 22/89 (24%), Positives = 28/89 (31%), Gaps = 12/89 (13%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP--PPMMGTTPHVL 81
P Q A P P P+ P P PP P+ T PP PP P
Sbjct: 364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPP--- 420
Query: 82 GPLYWTRLIVSPEIPSDTTPLWKELEEVP 110
+ +PE T ++E P
Sbjct: 421 -------VPHTPESAPKLTRAAIPVDEKP 442
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 42.5 bits (100), Expect = 4e-04
Identities = 17/40 (42%), Positives = 17/40 (42%), Gaps = 6/40 (15%)
Query: 24 PNVQAAPPPPP-PPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
P AA PPP PP PP PPP PP PP
Sbjct: 906 PGGDAASAPPPGAGPPAPPQA-----VPPPRTTQPPAAPP 940
Score = 40.6 bits (95), Expect = 0.002
Identities = 14/40 (35%), Positives = 14/40 (35%), Gaps = 4/40 (10%)
Query: 33 PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P PPP G PP PPP PP PP
Sbjct: 905 APGGDAASAPPP--GAGPP--APPQAVPPPRTTQPPAAPP 940
Score = 28.7 bits (64), Expect = 8.8
Identities = 13/44 (29%), Positives = 13/44 (29%), Gaps = 4/44 (9%)
Query: 44 PMMGTTPPPMMGTPPPPPPMMGTPPPP----PPMMGTTPHVLGP 83
P G G PP PP P PP T P P
Sbjct: 897 PGTGRVATAPGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAAPP 940
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 42.2 bits (99), Expect = 5e-04
Identities = 16/70 (22%), Positives = 17/70 (24%), Gaps = 1/70 (1%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
P P P P P P P G P P P P P W
Sbjct: 442 RGPGGAPAPAPAPAAAPA-AAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWE 500
Query: 88 RLIVSPEIPS 97
L P+
Sbjct: 501 ELPPEFASPA 510
Score = 39.9 bits (93), Expect = 0.003
Identities = 20/107 (18%), Positives = 26/107 (24%), Gaps = 4/107 (3%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
A P P P P P PPP + P P P
Sbjct: 463 RPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAP----AQPDAAP 518
Query: 84 LYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQRP 130
W + +D ++ L P + V A RP
Sbjct: 519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRP 565
Score = 36.8 bits (85), Expect = 0.024
Identities = 26/135 (19%), Positives = 32/135 (23%), Gaps = 31/135 (22%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM---------------GTPPPPP 71
AA P P PP P P + P P G P P P
Sbjct: 393 AAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAP 452
Query: 72 -----PMMGTTPHVLGPLYWTRLIVSPEIPS----------DTTPLWKEL-EEVPINNLE 115
P P GP + + D P W+EL E
Sbjct: 453 APAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPA 512
Query: 116 EFTELFSRQVTAQRP 130
+ + V P
Sbjct: 513 QPDAAPAGWVAESIP 527
Score = 34.9 bits (80), Expect = 0.085
Identities = 8/50 (16%), Positives = 8/50 (16%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
P Q AP P P P P
Sbjct: 381 PVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430
Score = 33.7 bits (77), Expect = 0.24
Identities = 8/51 (15%), Positives = 9/51 (17%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM 74
V P P P P P + P P
Sbjct: 380 APVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430
Score = 32.5 bits (74), Expect = 0.46
Identities = 11/57 (19%), Positives = 11/57 (19%), Gaps = 4/57 (7%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
A P P P P P P P P P P
Sbjct: 371 GAGPATAAAAPVAQPAPA----AAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAP 423
Score = 31.4 bits (71), Expect = 1.1
Identities = 10/49 (20%), Positives = 10/49 (20%), Gaps = 3/49 (6%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P AA P P P P P P P
Sbjct: 374 PATAAAAPVAQPAPAAAAPAA---AAPAPAAPPAAPAAAPAAAAAARAV 419
Score = 28.7 bits (64), Expect = 7.7
Identities = 12/60 (20%), Positives = 15/60 (25%), Gaps = 3/60 (5%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPP---MMGTPPPPPPMMGTPPPPPPMMGTTP 78
V AA P P PPP + P P P + + P
Sbjct: 468 GPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIP 527
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 41.6 bits (98), Expect = 5e-04
Identities = 13/49 (26%), Positives = 17/49 (34%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P + PP P +P P P + PP P + PPP
Sbjct: 179 PASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPS 227
Score = 40.8 bits (96), Expect = 0.001
Identities = 18/52 (34%), Positives = 19/52 (36%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
F P + P PP P P P T P PPP PPPPP
Sbjct: 197 FPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPV 248
Score = 39.7 bits (93), Expect = 0.002
Identities = 16/59 (27%), Positives = 18/59 (30%), Gaps = 13/59 (22%)
Query: 24 PNVQAAPPPPPPPPP----------LPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P + P P PP LPP P + PP P P PP P
Sbjct: 187 PPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPP---PSPESPTNPSPPPGPAA 242
Score = 39.3 bits (92), Expect = 0.003
Identities = 14/51 (27%), Positives = 17/51 (33%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
A+P PP P P P P + PP P PPP +
Sbjct: 182 ASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPT 232
Score = 38.9 bits (91), Expect = 0.004
Identities = 19/57 (33%), Positives = 22/57 (38%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
S P+ P P P P TPPP +P P P G PPPP + P
Sbjct: 196 SFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVP 252
Score = 37.7 bits (88), Expect = 0.008
Identities = 21/76 (27%), Positives = 27/76 (35%), Gaps = 8/76 (10%)
Query: 17 YSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGT----TPPPMMGTPPPPPPMM----GTPP 68
++ PP P PPP + +PPP PPPPP T
Sbjct: 199 SPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAK 258
Query: 69 PPPPMMGTTPHVLGPL 84
P PP TP +G +
Sbjct: 259 PTPPSASATPAPIGGI 274
Score = 35.4 bits (82), Expect = 0.046
Identities = 17/59 (28%), Positives = 22/59 (37%)
Query: 16 TYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM 74
+ + P +P P PPP PPP PP+ P PP TP P +
Sbjct: 217 SSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT 275
Score = 34.7 bits (80), Expect = 0.078
Identities = 16/56 (28%), Positives = 18/56 (32%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
P P P PPP P PP P + T P PP P P +
Sbjct: 224 PPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDDD 279
Score = 30.8 bits (70), Expect = 1.4
Identities = 16/54 (29%), Positives = 18/54 (33%), Gaps = 4/54 (7%)
Query: 18 SNIFSLPNVQAAPPPPPPPPPLP----PPPPMMGTTPPPMMGTPPPPPPMMGTP 67
P + PP P PPP P PP TPP TP P +
Sbjct: 225 PPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDD 278
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 42.0 bits (99), Expect = 5e-04
Identities = 15/69 (21%), Positives = 19/69 (27%), Gaps = 5/69 (7%)
Query: 24 PNVQAAPPPP-PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
Q A P P + + P P PPP PP PP
Sbjct: 268 AAAQPATAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPPN----PPATPPEPPARRGRGS 323
Query: 83 PLYWTRLIV 91
W ++V
Sbjct: 324 AALWFVVVV 332
Score = 30.5 bits (69), Expect = 2.2
Identities = 7/40 (17%), Positives = 9/40 (22%), Gaps = 2/40 (5%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP 61
+ N + P P PP P P
Sbjct: 283 TDTNDSKSVTSQPAAAAAAPAPP--PNPPATPPEPPARRG 320
>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
(DUF2076). This domain, found in various hypothetical
prokaryotic proteins, has no known function. The domain,
however, is found in various periplasmic ligand-binding
sensor proteins.
Length = 234
Score = 41.2 bits (97), Expect = 6e-04
Identities = 13/45 (28%), Positives = 16/45 (35%)
Query: 18 SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
S +F + P P PP PP P G+ P G P
Sbjct: 84 SGMFGGGAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGY 128
Score = 36.2 bits (84), Expect = 0.023
Identities = 11/45 (24%), Positives = 11/45 (24%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
P PPP P P P P P P G
Sbjct: 90 GAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQPGPG 134
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 41.7 bits (98), Expect = 7e-04
Identities = 16/53 (30%), Positives = 17/53 (32%), Gaps = 3/53 (5%)
Query: 22 SLPNVQAAPPPPPPPPPLPPP---PPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P A P P P PPP PP+ P P P P PP
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPP 434
Score = 34.7 bits (80), Expect = 0.11
Identities = 15/48 (31%), Positives = 15/48 (31%), Gaps = 7/48 (14%)
Query: 29 APPPPPP----PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AP PP P P P P PP P P P P P
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPV---PPAAPARPAAARPAPAP 425
Score = 30.9 bits (70), Expect = 1.8
Identities = 13/48 (27%), Positives = 13/48 (27%), Gaps = 2/48 (4%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
AA PP PP P P P P PP P
Sbjct: 399 APPPAAAPPVPPAA--PARPAAARPAPAPAPPAAAAPPARSADPAAAA 444
Score = 30.1 bits (68), Expect = 3.1
Identities = 15/58 (25%), Positives = 16/58 (27%), Gaps = 18/58 (31%)
Query: 22 SLPNVQAAPPPPPPPP-------PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+ P AAPP PP P P P P P PP P
Sbjct: 398 AAPPPAAAPPVPPAAPARPAAARPAPAPAP-----------PAAAAPPARSADPAAAA 444
>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2. Arsenite is a
carcinogenic compound which can act as a co-mutagen by
inhibiting DNA repair. Arsenite-resistance protein 2 is
thought to play a role in arsenite resistance.
Length = 211
Score = 40.2 bits (94), Expect = 0.001
Identities = 28/78 (35%), Positives = 36/78 (46%), Gaps = 10/78 (12%)
Query: 5 DKRAKRRKKKETYSNIF------SLPNVQAAPPPPPPPPPLPPPPPMMG-TTPPPMMGTP 57
DK + RK+ E ++N +LP ++ PP P P L P P TP +M
Sbjct: 103 DKVEEVRKEVEFFNNYLTDAKRPALPEIKPLQPPKPDPGGLAPGLPGYPPQTPQALMPYG 162
Query: 58 PPPPPMMGTP---PPPPP 72
P PPMMG PP PP
Sbjct: 163 QPRPPMMGYGRGGPPFPP 180
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 40.6 bits (95), Expect = 0.001
Identities = 19/50 (38%), Positives = 23/50 (46%), Gaps = 7/50 (14%)
Query: 27 QAAPPPPPPPPPL----PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+A P PPPP P + PPP M +PP P P +PPP P
Sbjct: 139 EALPQPPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPAS---SPPPTPA 185
Score = 38.7 bits (90), Expect = 0.004
Identities = 25/72 (34%), Positives = 26/72 (36%), Gaps = 6/72 (8%)
Query: 11 RKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPP--PMMGTTPPPMMGTPPPPPPMMGT-- 66
RKK+ P V PPPP PP PP P TP PPP
Sbjct: 136 RKKEALPQPPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKS 195
Query: 67 --PPPPPPMMGT 76
PP PM GT
Sbjct: 196 SHPPLKSPMAGT 207
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 40.9 bits (96), Expect = 0.002
Identities = 16/44 (36%), Positives = 17/44 (38%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
PPPP PPP PPP P + P PPP P
Sbjct: 115 DPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGAS 158
Score = 39.0 bits (91), Expect = 0.006
Identities = 16/48 (33%), Positives = 19/48 (39%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P + PPPP PPP PPP + P PP +PP
Sbjct: 109 PGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAG 156
Score = 38.2 bits (89), Expect = 0.011
Identities = 14/55 (25%), Positives = 18/55 (32%), Gaps = 4/55 (7%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P A PPP P P L +G+ PP +PP P +
Sbjct: 119 PTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAG----ASPAAVASDAASS 169
Score = 37.1 bits (86), Expect = 0.021
Identities = 14/44 (31%), Positives = 15/44 (34%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P P T P P PPPP P +PPP P
Sbjct: 88 PTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPA 131
Score = 37.1 bits (86), Expect = 0.021
Identities = 15/46 (32%), Positives = 18/46 (39%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
++P PPPP PP PPP M+ P P PP
Sbjct: 111 PSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAG 156
Score = 36.7 bits (85), Expect = 0.032
Identities = 16/54 (29%), Positives = 20/54 (37%), Gaps = 3/54 (5%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM---MGTPPPPPP 72
+ N + P P P G+ PP +P PPPP PP P P
Sbjct: 79 APANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAP 132
Score = 36.3 bits (84), Expect = 0.043
Identities = 19/55 (34%), Positives = 21/55 (38%), Gaps = 6/55 (10%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP------PPPPMMGTPPPPPP 72
P AA PP P P P P P + P P G P PP + TP P
Sbjct: 889 PKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDPRGGFRRVPPGDLHTPAPSAA 943
Score = 35.9 bits (83), Expect = 0.052
Identities = 15/52 (28%), Positives = 17/52 (32%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
P P PPP PPP +P P + P G PP P
Sbjct: 108 PPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASP 159
Score = 35.1 bits (81), Expect = 0.083
Identities = 15/53 (28%), Positives = 19/53 (35%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
PPP PPP PP P + +G+P PPP P +
Sbjct: 115 DPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAA 167
Score = 34.4 bits (79), Expect = 0.13
Identities = 16/44 (36%), Positives = 18/44 (40%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+P PP P P PPPP +PPP P PPP
Sbjct: 105 SPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPP 148
Score = 34.4 bits (79), Expect = 0.14
Identities = 14/49 (28%), Positives = 18/49 (36%), Gaps = 4/49 (8%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
+ A P PP PP P +P P P P+ + P P G
Sbjct: 181 ETARAPSSPPAEPPPSTPPAAASPRP----PRRSSPISASASSPAPAPG 225
Score = 34.0 bits (78), Expect = 0.19
Identities = 12/49 (24%), Positives = 14/49 (28%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P +A P + TPP P PP PPP
Sbjct: 75 PGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP 123
Score = 34.0 bits (78), Expect = 0.20
Identities = 14/72 (19%), Positives = 18/72 (25%)
Query: 7 RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGT 66
R + + + P PP P P P PP P
Sbjct: 846 PPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRP 905
Query: 67 PPPPPPMMGTTP 78
P P +G P
Sbjct: 906 RPAPRVKLGPMP 917
Score = 34.0 bits (78), Expect = 0.22
Identities = 12/68 (17%), Positives = 17/68 (25%)
Query: 6 KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMG 65
+ ++ K K + + P PP P P P P P
Sbjct: 849 RSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPA 908
Query: 66 TPPPPPPM 73
PM
Sbjct: 909 PRVKLGPM 916
Score = 33.6 bits (77), Expect = 0.29
Identities = 15/52 (28%), Positives = 18/52 (34%), Gaps = 3/52 (5%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
S + PP P P PPP P + PP P M+ P P
Sbjct: 98 ASPAREGSPTPPGPSSPDPPPPTPPPASPPPS---PAPDLSEMLRPVGSPGP 146
Score = 33.2 bits (76), Expect = 0.34
Identities = 10/54 (18%), Positives = 13/54 (24%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
++P P PP P T P PP + P
Sbjct: 171 QAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAP 224
Score = 32.8 bits (75), Expect = 0.45
Identities = 10/34 (29%), Positives = 12/34 (35%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
+P P PPPP P P P +P
Sbjct: 350 SPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAG 383
Score = 32.8 bits (75), Expect = 0.48
Identities = 13/56 (23%), Positives = 15/56 (26%), Gaps = 1/56 (1%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
PPP PPP P P + P PPP P +
Sbjct: 118 PPTPPPASPPP-SPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQA 172
Score = 32.1 bits (73), Expect = 0.67
Identities = 13/57 (22%), Positives = 14/57 (24%), Gaps = 5/57 (8%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
S P P P P P PPP P P P P +
Sbjct: 331 SSSESSRGAAVSPGPSPSRSPSP--SRPPPP---ADPSSPRKRPRPSRAPSSPAASA 382
Score = 32.1 bits (73), Expect = 0.78
Identities = 9/49 (18%), Positives = 10/49 (20%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+ P PP P P P P PP
Sbjct: 142 GSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPP 190
Score = 31.7 bits (72), Expect = 0.90
Identities = 13/51 (25%), Positives = 15/51 (29%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
AP P + +P PP P PPP PP P
Sbjct: 78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPP 128
Score = 31.7 bits (72), Expect = 1.2
Identities = 16/51 (31%), Positives = 18/51 (35%), Gaps = 6/51 (11%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP--PPP 72
P +A P PP PP P P P P +G PP P P
Sbjct: 877 PEPRARPGAAAPPKAAAAAPPAGAPAPRP----RPAPRVKLGPMPPGGPDP 923
Score = 30.9 bits (70), Expect = 1.6
Identities = 8/50 (16%), Positives = 10/50 (20%), Gaps = 3/50 (6%)
Query: 28 AAPPPPPP---PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM 74
A+PPP P L P P +
Sbjct: 124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAA 173
Score = 30.9 bits (70), Expect = 1.8
Identities = 23/73 (31%), Positives = 26/73 (35%), Gaps = 3/73 (4%)
Query: 7 RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGT 66
A + T+S P A P PP P P PP TPPP PP P P +
Sbjct: 80 PANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPP--PPTPPP-ASPPPSPAPDLSE 136
Query: 67 PPPPPPMMGTTPH 79
P G P
Sbjct: 137 MLRPVGSPGPPPA 149
Score = 30.5 bits (69), Expect = 2.0
Identities = 11/45 (24%), Positives = 13/45 (28%), Gaps = 2/45 (4%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+P P P P P PP P P P +P
Sbjct: 341 VSPGPSPSRSPSPSRPPPPADPSSP--RKRPRPSRAPSSPAASAG 383
Score = 30.1 bits (68), Expect = 2.8
Identities = 10/49 (20%), Positives = 11/49 (22%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
V + P P PPP P P PP
Sbjct: 160 AAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPR 208
Score = 30.1 bits (68), Expect = 2.9
Identities = 13/55 (23%), Positives = 14/55 (25%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
V A PP PPP P + P P P TP
Sbjct: 54 TVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTP 108
Score = 29.8 bits (67), Expect = 3.6
Identities = 12/45 (26%), Positives = 13/45 (28%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
A PL P P PP PP +P PP
Sbjct: 165 DAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRR 209
Score = 29.4 bits (66), Expect = 4.8
Identities = 10/46 (21%), Positives = 12/46 (26%)
Query: 25 NVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
+ P P PP P P P + P PP
Sbjct: 873 RPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPP 918
Score = 29.4 bits (66), Expect = 5.0
Identities = 11/52 (21%), Positives = 15/52 (28%), Gaps = 3/52 (5%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+LP P P P + +P P P G+ P P
Sbjct: 263 ITLPTRIWEASGWNGPSSRPGPASSSSSPRER---SPSPSPSSPGSGPAPSS 311
Score = 29.0 bits (65), Expect = 6.1
Identities = 21/81 (25%), Positives = 27/81 (33%), Gaps = 5/81 (6%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP--PPPMMGTPPPPPPMMGTTPH 79
S P + P P P P ++P P TPPP PPP +P P M
Sbjct: 86 STPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPP---SPAPDLSEMLRPVG 142
Query: 80 VLGPLYWTRLIVSPEIPSDTT 100
GP + P+
Sbjct: 143 SPGPPPAASPPAAGASPAAVA 163
Score = 28.6 bits (64), Expect = 9.5
Identities = 13/79 (16%), Positives = 15/79 (18%), Gaps = 5/79 (6%)
Query: 7 RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGT 66
K+K P PP PPP +
Sbjct: 813 SRTASKRKSRSHTPDGGSESSG---PARPPGAAARPPPARSSESSKSKPAAAGGRARGKN 869
Query: 67 P--PPPPPMMGTTPHVLGP 83
P PP P P
Sbjct: 870 GRRRPRPPEPRARPGAAAP 888
>gnl|CDD|216368 pfam01213, CAP_N, Adenylate cyclase associated (CAP) N terminal.
Length = 313
Score = 39.8 bits (93), Expect = 0.002
Identities = 15/33 (45%), Positives = 19/33 (57%)
Query: 12 KKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPP 44
KK + +LP V ++ P PPPPP PPPP
Sbjct: 211 KKGPVAAAKSALPAVSSSAPSAPPPPPPPPPPS 243
Score = 32.5 bits (74), Expect = 0.41
Identities = 11/33 (33%), Positives = 11/33 (33%), Gaps = 4/33 (12%)
Query: 40 PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P PPPP PPPPPP
Sbjct: 214 PVAAAKSALPAVSSSAPSAPPPP----PPPPPP 242
Score = 32.1 bits (73), Expect = 0.59
Identities = 14/48 (29%), Positives = 16/48 (33%), Gaps = 9/48 (18%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
A P P PPPPP PPP PP P + +
Sbjct: 217 AAKSALPAVSSSAPSAPPPPP-----PPP----PPSVPTISNSVESAS 255
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 40.4 bits (95), Expect = 0.002
Identities = 17/78 (21%), Positives = 19/78 (24%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P P P P P P PP P P P P P T P
Sbjct: 425 PAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAP 484
Query: 84 LYWTRLIVSPEIPSDTTP 101
+P P+
Sbjct: 485 PAAPAPAAAPAAPAAPAA 502
Score = 37.3 bits (87), Expect = 0.017
Identities = 13/51 (25%), Positives = 13/51 (25%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
AAP P P P P P P PP P P
Sbjct: 408 AAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPP 458
Score = 37.3 bits (87), Expect = 0.018
Identities = 14/46 (30%), Positives = 14/46 (30%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AAP P P P P P P P P P P P
Sbjct: 459 AAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
Score = 37.3 bits (87), Expect = 0.019
Identities = 13/49 (26%), Positives = 13/49 (26%), Gaps = 1/49 (2%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGT-TPPPMMGTPPPPPPMMGTPPPPPPMM 74
PP P PP P P P P P P P M
Sbjct: 738 PVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
Score = 37.3 bits (87), Expect = 0.019
Identities = 14/57 (24%), Positives = 14/57 (24%), Gaps = 4/57 (7%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
AA P P P P P P P P P PP G
Sbjct: 401 AAAAAPAAAPAPAAAAPAAAAAPAPA----AAPQPAPAPAPAPAPPSPAGNAPAGGA 453
Score = 36.9 bits (86), Expect = 0.023
Identities = 14/55 (25%), Positives = 15/55 (27%), Gaps = 2/55 (3%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P+ AA P P P P P P P P PP P
Sbjct: 399 PSAAAAAPAAAPAPAAAAPAAAAAPAPAA--APQPAPAPAPAPAPPSPAGNAPAG 451
Score = 36.5 bits (85), Expect = 0.031
Identities = 14/58 (24%), Positives = 15/58 (25%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
AA P P P P P P P P P P + P P
Sbjct: 405 APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAP 462
Score = 36.5 bits (85), Expect = 0.033
Identities = 17/51 (33%), Positives = 18/51 (35%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+ P P PP P PP P PPP P P PP PP
Sbjct: 730 APSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
Score = 35.7 bits (83), Expect = 0.049
Identities = 15/57 (26%), Positives = 16/57 (28%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
+ P A PPP P P P P P PP P P P
Sbjct: 447 NAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAP 503
Score = 35.7 bits (83), Expect = 0.055
Identities = 16/53 (30%), Positives = 16/53 (30%), Gaps = 2/53 (3%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
P A P PPPPP P P PPP P M P
Sbjct: 747 DPPDPAGAPAQPPPPPAPAPAAAPAAAPPP--SPPSEEEEMAEDDAPSMDDED 797
Score = 35.7 bits (83), Expect = 0.056
Identities = 15/52 (28%), Positives = 16/52 (30%), Gaps = 5/52 (9%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
P+ Q AP P P P P P PP P P P P
Sbjct: 462 PSAQPAPAPAAAPEPTAAPAP-----APPAAPAPAAAPAAPAAPAAPAGADD 508
Score = 35.3 bits (82), Expect = 0.073
Identities = 15/55 (27%), Positives = 15/55 (27%), Gaps = 3/55 (5%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
AAP P P P P P P PP P P P P
Sbjct: 411 APAAAAPAAAAAPAPAAAPQPAPAPAPAP---APPSPAGNAPAGGAPSPPPAAAP 462
Score = 34.6 bits (80), Expect = 0.12
Identities = 17/78 (21%), Positives = 18/78 (23%), Gaps = 5/78 (6%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP---PPPMMGTPPPPPPMMGTTPHVLGP 83
AA P P P P P P P P P P P P G
Sbjct: 395 AAAAPSAAAAAPAAAPAP--AAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGG 452
Query: 84 LYWTRLIVSPEIPSDTTP 101
+P P
Sbjct: 453 APSPPPAAAPSAQPAPAP 470
Score = 34.6 bits (80), Expect = 0.14
Identities = 14/50 (28%), Positives = 14/50 (28%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
AA P P P P P P PP G P P P
Sbjct: 415 AAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSA 464
Score = 34.2 bits (79), Expect = 0.16
Identities = 13/46 (28%), Positives = 14/46 (30%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
Q AP P PP P + P P P PP P
Sbjct: 700 QPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEP 745
Score = 34.2 bits (79), Expect = 0.17
Identities = 10/52 (19%), Positives = 11/52 (21%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
+ P P P P P PP P P T
Sbjct: 461 APSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATL 512
Score = 33.4 bits (77), Expect = 0.29
Identities = 11/57 (19%), Positives = 13/57 (22%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P + P PPP + P P P P PP
Sbjct: 437 PAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAA 493
Score = 32.7 bits (75), Expect = 0.44
Identities = 6/43 (13%), Positives = 6/43 (13%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P P P P P P
Sbjct: 385 LGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAA 427
Score = 32.7 bits (75), Expect = 0.53
Identities = 16/80 (20%), Positives = 21/80 (26%), Gaps = 1/80 (1%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
S P +AA P P P P P G P + P P + P +
Sbjct: 606 SGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDA-SD 664
Query: 82 GPLYWTRLIVSPEIPSDTTP 101
G W +
Sbjct: 665 GGDGWPAKAGGAAPAAPPPA 684
Score = 32.3 bits (74), Expect = 0.62
Identities = 15/60 (25%), Positives = 16/60 (26%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P P PP + P P P P PP PP P P P
Sbjct: 705 PAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAP 764
Score = 32.3 bits (74), Expect = 0.65
Identities = 15/51 (29%), Positives = 15/51 (29%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
A P P PLPP P PPPP P P P
Sbjct: 730 APSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
Score = 32.3 bits (74), Expect = 0.66
Identities = 13/60 (21%), Positives = 15/60 (25%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P ++ PP P P P P P P P HV P
Sbjct: 601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP 660
Score = 31.9 bits (73), Expect = 0.88
Identities = 11/47 (23%), Positives = 11/47 (23%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P P PPP P G P P PP
Sbjct: 666 GDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG 712
Score = 31.5 bits (72), Expect = 1.0
Identities = 15/74 (20%), Positives = 19/74 (25%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
A PP P PP P P P P P P V P +
Sbjct: 595 AGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHP 654
Query: 88 RLIVSPEIPSDTTP 101
+ + P+
Sbjct: 655 KHVAVPDASDGGDG 668
Score = 31.1 bits (71), Expect = 1.4
Identities = 16/53 (30%), Positives = 17/53 (32%), Gaps = 4/53 (7%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
A P P PP P PP P PPPPP P + P
Sbjct: 731 PSPAADDPVPLPPEPDDPPDPAGAPAQ----PPPPPAPAPAAAPAAAPPPSPP 779
Score = 30.7 bits (70), Expect = 1.7
Identities = 14/60 (23%), Positives = 15/60 (25%), Gaps = 3/60 (5%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPP---MMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P P PP G + P P PP P P P P P
Sbjct: 707 ATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAP 766
Score = 30.7 bits (70), Expect = 1.8
Identities = 12/55 (21%), Positives = 13/55 (23%), Gaps = 2/55 (3%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P P PP P + PP P P P P P
Sbjct: 584 VEAVVGPAPGAAGGEGPPAPA--SSGPPEEAARPAAPAAPAAPAAPAPAGAAAAP 636
Score = 30.7 bits (70), Expect = 2.1
Identities = 10/51 (19%), Positives = 11/51 (21%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
S P P + P G P P P P P
Sbjct: 640 SAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAP 690
Score = 30.7 bits (70), Expect = 2.1
Identities = 13/49 (26%), Positives = 13/49 (26%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
A P PPP P P P P P P G P
Sbjct: 671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAA 719
Score = 30.0 bits (68), Expect = 3.3
Identities = 14/51 (27%), Positives = 15/51 (29%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
A PP P P P P P TPP PP G +
Sbjct: 680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASA 730
Score = 30.0 bits (68), Expect = 3.4
Identities = 14/49 (28%), Positives = 14/49 (28%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AAPPP P P P P P PP PP
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQ 723
Score = 30.0 bits (68), Expect = 3.7
Identities = 10/47 (21%), Positives = 10/47 (21%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P P P P P P P P PP
Sbjct: 679 AAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAA 725
Score = 29.6 bits (67), Expect = 3.9
Identities = 11/48 (22%), Positives = 11/48 (22%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P A P P PP P P G P P
Sbjct: 654 PKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQP 701
Score = 29.6 bits (67), Expect = 4.0
Identities = 14/49 (28%), Positives = 14/49 (28%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
AAP PPP P P G P P PP P
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQ 723
Score = 29.6 bits (67), Expect = 4.1
Identities = 15/80 (18%), Positives = 17/80 (21%), Gaps = 2/80 (2%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P PP P PP P P P G P +
Sbjct: 592 PGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPA--GAAAAPAEASAAPAPGVAA 649
Query: 84 LYWTRLIVSPEIPSDTTPLW 103
V+ SD W
Sbjct: 650 PEHHPKHVAVPDASDGGDGW 669
Score = 29.6 bits (67), Expect = 4.2
Identities = 12/49 (24%), Positives = 13/49 (26%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AAP P +P P G P PP P P
Sbjct: 644 APGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAA 692
Score = 28.8 bits (65), Expect = 7.4
Identities = 6/50 (12%), Positives = 6/50 (12%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
P PP P P P
Sbjct: 649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAP 698
Score = 28.4 bits (64), Expect = 8.9
Identities = 12/49 (24%), Positives = 12/49 (24%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P A P P P G P G P P P P
Sbjct: 643 PAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPA 691
Score = 28.4 bits (64), Expect = 9.6
Identities = 10/46 (21%), Positives = 10/46 (21%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
P PPP P P P P P P
Sbjct: 674 GGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAA 719
Score = 28.4 bits (64), Expect = 9.9
Identities = 8/50 (16%), Positives = 8/50 (16%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
A P P P P P P P
Sbjct: 640 SAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAA 689
>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
modification].
Length = 2365
Score = 40.3 bits (94), Expect = 0.002
Identities = 20/37 (54%), Positives = 20/37 (54%), Gaps = 11/37 (29%)
Query: 36 PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
PP PPPPP PPP G PP P PPPPPP
Sbjct: 5 PPGNPPPPP-----PPP--GFEPPSQP----PPPPPP 30
Score = 39.6 bits (92), Expect = 0.004
Identities = 18/30 (60%), Positives = 18/30 (60%), Gaps = 5/30 (16%)
Query: 33 PPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
PP PP PPPPP G PP PPPPPP
Sbjct: 5 PPGNPPPPPPPP--GFEPP---SQPPPPPP 29
Score = 39.2 bits (91), Expect = 0.005
Identities = 17/31 (54%), Positives = 18/31 (58%), Gaps = 9/31 (29%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP 60
PPPPPPPP PP + PPP PPPP
Sbjct: 9 PPPPPPPPGFEPP-----SQPPP----PPPP 30
Score = 36.5 bits (84), Expect = 0.030
Identities = 16/26 (61%), Positives = 16/26 (61%), Gaps = 3/26 (11%)
Query: 22 SLPNVQAAPPPPPP---PPPLPPPPP 44
SLP PPPPPP PP PPPPP
Sbjct: 3 SLPPGNPPPPPPPPGFEPPSQPPPPP 28
Score = 36.1 bits (83), Expect = 0.052
Identities = 16/35 (45%), Positives = 17/35 (48%), Gaps = 3/35 (8%)
Query: 46 MGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
M + PP G PPPPPP G PP P P V
Sbjct: 1 MASLPP---GNPPPPPPPPGFEPPSQPPPPPPPGV 32
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 39.9 bits (94), Expect = 0.003
Identities = 11/54 (20%), Positives = 11/54 (20%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
AP P P PP P P P P V
Sbjct: 64 AKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEV 117
Score = 39.1 bits (92), Expect = 0.005
Identities = 11/58 (18%), Positives = 12/58 (20%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
A P P P P P P P P + PL
Sbjct: 63 AAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPL 120
Score = 37.9 bits (89), Expect = 0.013
Identities = 12/49 (24%), Positives = 12/49 (24%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AAP P P P PP PP PP
Sbjct: 56 SAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAA 104
Score = 36.8 bits (86), Expect = 0.023
Identities = 10/55 (18%), Positives = 10/55 (18%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
A P P P P P P P PP
Sbjct: 53 AAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAA 107
Score = 35.6 bits (83), Expect = 0.053
Identities = 9/49 (18%), Positives = 9/49 (18%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AA P P PP P PP
Sbjct: 47 AAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAA 95
Score = 35.6 bits (83), Expect = 0.057
Identities = 11/45 (24%), Positives = 11/45 (24%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AA P P P PP PP P P
Sbjct: 52 AAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAP 96
Score = 34.9 bits (81), Expect = 0.11
Identities = 8/49 (16%), Positives = 9/49 (18%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P P + P P P P P P
Sbjct: 38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKP 86
Score = 34.5 bits (80), Expect = 0.14
Identities = 9/48 (18%), Positives = 10/48 (20%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
+ A P P P PP PP P
Sbjct: 39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKP 86
Score = 34.1 bits (79), Expect = 0.18
Identities = 7/46 (15%), Positives = 7/46 (15%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
P P PP P P P
Sbjct: 70 PAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
Score = 33.7 bits (78), Expect = 0.23
Identities = 7/49 (14%), Positives = 7/49 (14%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P P PP PP P
Sbjct: 40 STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAA 88
Score = 32.6 bits (75), Expect = 0.54
Identities = 9/51 (17%), Positives = 9/51 (17%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
AA P P P P P P P
Sbjct: 46 AAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAP 96
Score = 32.2 bits (74), Expect = 0.63
Identities = 9/58 (15%), Positives = 13/58 (22%)
Query: 15 ETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
++ F+ + P P P P P P PP
Sbjct: 27 PSWREFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP 84
Score = 32.2 bits (74), Expect = 0.65
Identities = 9/49 (18%), Positives = 9/49 (18%), Gaps = 1/49 (2%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P AA P P P P P P
Sbjct: 44 PTAAAAAAAAAASAPAAAPAAKAPAAPAP-APPAAAAPAAPPKPAAAAA 91
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This is
a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 37.6 bits (87), Expect = 0.003
Identities = 18/42 (42%), Positives = 19/42 (45%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
PPPP P+PPP P G P P PP P PP P
Sbjct: 78 PPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGP 119
Score = 36.0 bits (83), Expect = 0.010
Identities = 23/90 (25%), Positives = 32/90 (35%), Gaps = 12/90 (13%)
Query: 2 EYMDKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTT--------PPPM 53
+Y +R +++ ++E + P P P PP PP+ PPP
Sbjct: 22 KYSQRRERKKAERERERYRHDHDAYSDSYEEPYDPTPYPPSPPVSDPRYYPNSNYFPPPP 81
Query: 54 MGTPPPPPPMMGT----PPPPPPMMGTTPH 79
TP PPP PPPP P
Sbjct: 82 GSTPVPPPGPQPGYNPADYPPPPGAVPPPQ 111
Score = 34.9 bits (80), Expect = 0.023
Identities = 27/80 (33%), Positives = 32/80 (40%), Gaps = 18/80 (22%)
Query: 7 RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLP---------PPPPMMGTTPPPMMGTP 57
R + R + YS+ + P P PP PP P PPPP G+TP P G
Sbjct: 36 RERYRHDHDAYSDSYEEP-YDPTPYPPSPPVSDPRYYPNSNYFPPPP--GSTPVPPPGPQ 92
Query: 58 P------PPPPMMGTPPPPP 71
P PPP PPP
Sbjct: 93 PGYNPADYPPPPGAVPPPQN 112
Score = 33.3 bits (76), Expect = 0.082
Identities = 21/52 (40%), Positives = 21/52 (40%), Gaps = 5/52 (9%)
Query: 24 PNVQAAPPPP---PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
PN PPPP P PPP P P PPP PPP PP P
Sbjct: 72 PNSNYFPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQN--YPYPPGPGQ 121
Score = 32.9 bits (75), Expect = 0.11
Identities = 20/56 (35%), Positives = 20/56 (35%), Gaps = 3/56 (5%)
Query: 17 YSNIFSLPNVQAAPPPPPPPP---PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
SN F P PPP P P P PPP PP PP P P P
Sbjct: 73 NSNYFPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDPYAPRP 128
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 38.7 bits (90), Expect = 0.004
Identities = 16/52 (30%), Positives = 20/52 (38%), Gaps = 3/52 (5%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP---PPMMGTPPPPPP 72
++ PP P PP P PP +P +PP P PP P P
Sbjct: 4 ARSRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAP 55
Score = 37.6 bits (87), Expect = 0.011
Identities = 16/44 (36%), Positives = 18/44 (40%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
+A P PP PP P +P PP PP G PPP
Sbjct: 3 RARSRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPP 46
Score = 32.6 bits (74), Expect = 0.35
Identities = 24/87 (27%), Positives = 27/87 (31%), Gaps = 21/87 (24%)
Query: 22 SLPNVQAAPPPPPPPPPLP--------PPPPMMGTTPPPMMGTPP----------PPPPM 63
S + PP P PP P P PP G PPP P P
Sbjct: 8 SPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPAGVT 67
Query: 64 MGTPPPPPPMMGTTPHVLG---PLYWT 87
+ PP P +G PL WT
Sbjct: 68 FSSSAPPRPPLGLDDAPAATPPPLDWT 94
>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP. This family of
genes is always found in type III secretion operons,
althought its function in the processes of secretion
and virulence is unclear. Hpa stands for Hrp-associated
gene, where Hrp stands for hypersensitivity response
and virulence.
Length = 201
Score = 37.9 bits (88), Expect = 0.005
Identities = 17/53 (32%), Positives = 18/53 (33%), Gaps = 2/53 (3%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
A PP P PPPP P PPP P P PP + H
Sbjct: 36 YAPPPRPEPPPPCDEDRPEPRADTR--ASDPPPEAPTDADPAQPPDDPDASAH 86
Score = 29.1 bits (65), Expect = 4.0
Identities = 18/86 (20%), Positives = 19/86 (22%), Gaps = 15/86 (17%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGT---TPPPMMGTPPPPPPMMGTPPPPP-------PMMG 75
+Q A P P P P T PPP P PP P
Sbjct: 2 IQGARLARPAPAD-PARPARRRTPLAQLRRRDALAYAPPP----RPEPPPPCDEDRPEPR 56
Query: 76 TTPHVLGPLYWTRLIVSPEIPSDTTP 101
P P P D
Sbjct: 57 ADTRASDPPPEAPTDADPAQPPDDPD 82
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 39.0 bits (91), Expect = 0.005
Identities = 15/48 (31%), Positives = 17/48 (35%), Gaps = 2/48 (4%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
+ + P PPPPPPP PP P P T P P
Sbjct: 263 ATRLPEPEPQPPPPPPPPEPPEP--EEEPDEPDQTDPDDGEETDQIPE 308
Score = 36.3 bits (84), Expect = 0.036
Identities = 17/52 (32%), Positives = 18/52 (34%), Gaps = 6/52 (11%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
+A P P P P PPPPP P P P P T P P
Sbjct: 262 RATRLPEPEPQPPPPPPP-----PEPPEPEEEPDEPDQ-TDPDDGEETDQIP 307
Score = 32.1 bits (73), Expect = 0.62
Identities = 15/51 (29%), Positives = 16/51 (31%), Gaps = 4/51 (7%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM----GTPPPP 70
P PPPP PP P P T P T P +M P
Sbjct: 271 PQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEELMFDAVEADLPD 321
Score = 30.2 bits (68), Expect = 2.5
Identities = 15/43 (34%), Positives = 15/43 (34%), Gaps = 9/43 (20%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P P P PP PPP PPP PP P P
Sbjct: 260 LPRATRLPEPEPQPP-----PPP----PPPEPPEPEEEPDEPD 293
Score = 29.0 bits (65), Expect = 6.5
Identities = 16/43 (37%), Positives = 16/43 (37%), Gaps = 9/43 (20%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P P P P PPP PPP PP PP P P
Sbjct: 260 LPRATRLPEPEPQPPP-----PPP----PPEPPEPEEEPDEPD 293
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 38.7 bits (90), Expect = 0.006
Identities = 18/68 (26%), Positives = 20/68 (29%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYW 86
Q AP PP P PPPP P P P P P P
Sbjct: 16 QPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWKL 75
Query: 87 TRLIVSPE 94
+V P+
Sbjct: 76 EDFVVEPQ 83
Score = 31.4 bits (71), Expect = 0.91
Identities = 12/50 (24%), Positives = 12/50 (24%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
P AA P PPPP P P P P
Sbjct: 21 PPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPA 70
Score = 31.4 bits (71), Expect = 0.92
Identities = 14/60 (23%), Positives = 16/60 (26%), Gaps = 1/60 (1%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P APPPP P P P P P P + V+ P
Sbjct: 24 PAAAPAPPPPAKTAA-PATKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWKLEDFVVEP 82
Score = 31.4 bits (71), Expect = 0.98
Identities = 15/76 (19%), Positives = 15/76 (19%), Gaps = 13/76 (17%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTR 88
P P P P PP P P P P P
Sbjct: 12 GEAEQPAPAPPSPAA----APAPPPPAKTAAPATKAAAPAAAAPRAE-KPKKDKP----- 61
Query: 89 LIVSPEIPSDTTPLWK 104
E LWK
Sbjct: 62 ---RRERKPKPASLWK 74
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 38.9 bits (91), Expect = 0.006
Identities = 11/52 (21%), Positives = 11/52 (21%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
P A P P P P P PP P P
Sbjct: 381 PARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAA 432
Score = 37.8 bits (88), Expect = 0.013
Identities = 12/54 (22%), Positives = 14/54 (25%), Gaps = 3/54 (5%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP---PPPPPMMGTTP 78
AAP P P P + P PP P+ P P
Sbjct: 391 AAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAA 444
Score = 37.4 bits (87), Expect = 0.016
Identities = 11/51 (21%), Positives = 11/51 (21%), Gaps = 2/51 (3%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP--PPMMGTPPPPPP 72
P P P P P PP P P P P
Sbjct: 389 PAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPA 439
Score = 37.0 bits (86), Expect = 0.018
Identities = 17/76 (22%), Positives = 22/76 (28%), Gaps = 4/76 (5%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT-TPHVLG 82
AA P P PP T P+ P P P P TP G
Sbjct: 435 AAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
Query: 83 PLY---WTRLIVSPEI 95
++ +L + I
Sbjct: 495 DVWHATVQQLAAAEAI 510
Score = 36.6 bits (85), Expect = 0.028
Identities = 12/45 (26%), Positives = 12/45 (26%), Gaps = 3/45 (6%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AA P PP PP P P P P P
Sbjct: 410 AASAPAAPPAAAPPAPV---AAPAAAAPAAAPAAAPAAVALAPAP 451
Score = 35.1 bits (81), Expect = 0.075
Identities = 10/55 (18%), Positives = 11/55 (20%), Gaps = 4/55 (7%)
Query: 22 SLPNVQAAPPPPPP----PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+ AP P P PP P P P P
Sbjct: 395 AQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAP 449
Score = 34.7 bits (80), Expect = 0.12
Identities = 13/63 (20%), Positives = 15/63 (23%), Gaps = 8/63 (12%)
Query: 24 PNVQAAPPPPPPPPPL-PPPPPMMGTTPPPMMGTPPPPP-------PMMGTPPPPPPMMG 75
P A P P P P P + PP P+ P P
Sbjct: 417 PPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAA 476
Query: 76 TTP 78
P
Sbjct: 477 PAP 479
Score = 34.3 bits (79), Expect = 0.14
Identities = 12/57 (21%), Positives = 13/57 (22%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
A PP PP P P P P P P + P
Sbjct: 412 SAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAP 468
Score = 33.9 bits (78), Expect = 0.17
Identities = 13/55 (23%), Positives = 15/55 (27%), Gaps = 4/55 (7%)
Query: 22 SLPNVQAAPPP----PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+ AA P P PP PP + P P P PP
Sbjct: 398 AAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPP 452
Score = 32.8 bits (75), Expect = 0.40
Identities = 11/60 (18%), Positives = 11/60 (18%), Gaps = 2/60 (3%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPM--MGTPPPPPPMMGTPPPPPPMMGTTP 78
F A P P P P P P P P P
Sbjct: 364 FKPAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPP 423
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 38.7 bits (90), Expect = 0.006
Identities = 19/59 (32%), Positives = 23/59 (38%), Gaps = 6/59 (10%)
Query: 19 NIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM------MGTPPPPP 71
N F A P PP P PP PP++ PP + P P P+ G P P
Sbjct: 222 NPFLPGPSPAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQPIIPIQHIRAVTGETPANP 280
Score = 34.9 bits (80), Expect = 0.087
Identities = 19/51 (37%), Positives = 21/51 (41%), Gaps = 7/51 (13%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
LP V P P P P P PP + P PP PP + PPP P
Sbjct: 215 RLPRVSFNPFLPGPSPAQPSAPP---ASIP----APPIPPVIQYVAPPPVP 258
Score = 30.7 bits (69), Expect = 2.1
Identities = 18/72 (25%), Positives = 21/72 (29%), Gaps = 30/72 (41%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTP------------------------------PPP 60
PPPP LP PP + P TP PP
Sbjct: 186 IQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPI 245
Query: 61 PPMMGTPPPPPP 72
PP++ PPP
Sbjct: 246 PPVIQYVAPPPV 257
>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
Length = 94
Score = 35.5 bits (81), Expect = 0.007
Identities = 15/43 (34%), Positives = 16/43 (37%)
Query: 20 IFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
IF P P P PP P PPPP P P + P
Sbjct: 3 IFRAPKPPEPPAPLPPAPVPPPPPAPPAPVPEPTVKPVNAEAP 45
Score = 32.4 bits (73), Expect = 0.092
Identities = 13/35 (37%), Positives = 13/35 (37%)
Query: 35 PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
P PP PP P PPP P P P P
Sbjct: 7 PKPPEPPAPLPPAPVPPPPPAPPAPVPEPTVKPVN 41
Score = 32.0 bits (72), Expect = 0.13
Identities = 21/52 (40%), Positives = 23/52 (44%), Gaps = 10/52 (19%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
AP PP PP PLPP P PPP PP PP P P+ P +
Sbjct: 6 APKPPEPPAPLPPAP-----VPPP-----PPAPPAPVPEPTVKPVNAEAPKI 47
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 38.1 bits (89), Expect = 0.009
Identities = 16/45 (35%), Positives = 16/45 (35%), Gaps = 12/45 (26%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
A P PP PP PPP PPPPP PP
Sbjct: 255 ATRLPAPPQPPEEEPPP------------PPPPPEDDDDPPEDEE 287
Score = 36.5 bits (85), Expect = 0.027
Identities = 10/40 (25%), Positives = 10/40 (25%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
LP P PPPPP PP
Sbjct: 258 LPAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRAL 297
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 37.7 bits (87), Expect = 0.010
Identities = 11/48 (22%), Positives = 11/48 (22%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P VQ P P P P P P P P
Sbjct: 207 PTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254
Score = 37.7 bits (87), Expect = 0.010
Identities = 14/57 (24%), Positives = 18/57 (31%)
Query: 6 KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
+ A + ++ T N P VQ P P P P TP PP
Sbjct: 198 RHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254
Score = 37.0 bits (85), Expect = 0.017
Identities = 12/55 (21%), Positives = 13/55 (23%), Gaps = 4/55 (7%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P Q P P + P P P P TP P T
Sbjct: 203 PTQQPTVQNPAQQPTVQNPAQ----QPQQQPQQQPVQPAQQPTPQNPAQQPPQTE 253
Score = 34.6 bits (79), Expect = 0.10
Identities = 11/46 (23%), Positives = 12/46 (26%), Gaps = 2/46 (4%)
Query: 29 APPPPPPPPPLPPP--PPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AP P P P P + P P P P P
Sbjct: 200 APKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNP 245
Score = 31.2 bits (70), Expect = 1.2
Identities = 9/46 (19%), Positives = 10/46 (21%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
A P P P P + P P P P
Sbjct: 199 HAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQN 244
Score = 30.4 bits (68), Expect = 1.9
Identities = 11/51 (21%), Positives = 13/51 (25%), Gaps = 5/51 (9%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
++ N P P P P P TP P PP
Sbjct: 208 TVQNPAQQPTVQNPAQQ-PQQQPQQQPVQPAQQPTPQNPAQ----QPPQTE 253
Score = 30.4 bits (68), Expect = 1.9
Identities = 10/48 (20%), Positives = 10/48 (20%), Gaps = 4/48 (8%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P P P P T P P P P T
Sbjct: 200 APKPTQQPTVQNPAQQPTVQNP----AQQPQQQPQQQPVQPAQQPTPQ 243
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 37.1 bits (86), Expect = 0.010
Identities = 14/49 (28%), Positives = 18/49 (36%), Gaps = 1/49 (2%)
Query: 24 PNVQAAPPPPPPPPPLPPPP-PMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
V AAP PP + P P P+ + P P+ PP P
Sbjct: 140 VLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAP 188
Score = 32.5 bits (74), Expect = 0.31
Identities = 10/51 (19%), Positives = 11/51 (21%), Gaps = 5/51 (9%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P P L P P P P P P+ P
Sbjct: 124 DPIQALQPRPRPDVEEVLVPAAPE---PPSYEETIKPGPAPV--EEPVDSM 169
Score = 30.9 bits (70), Expect = 1.2
Identities = 13/72 (18%), Positives = 18/72 (25%), Gaps = 6/72 (8%)
Query: 27 QAAPPPPPPPPPLPP-PPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLY 85
+ P P P+ P P ++ P PP T GP
Sbjct: 107 EKVLGEEEPAPQEETVADPIQALQPRPRP---DVEEVLVPAAPEPPSYEETIK--PGPAP 161
Query: 86 WTRLIVSPEIPS 97
+ S I
Sbjct: 162 VEEPVDSMAIAV 173
Score = 30.5 bits (69), Expect = 1.5
Identities = 12/57 (21%), Positives = 15/57 (26%), Gaps = 6/57 (10%)
Query: 23 LPNVQAAPPPPPPPPPLPPPP------PMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
L V P P + P P + P PP P P P+
Sbjct: 106 LEKVLGEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPV 162
Score = 30.2 bits (68), Expect = 2.1
Identities = 10/55 (18%), Positives = 13/55 (23%), Gaps = 2/55 (3%)
Query: 26 VQAAPPPPPP--PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
+ P P PP P P P + P P+ P
Sbjct: 131 PRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELP 185
>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
Validated.
Length = 598
Score = 38.0 bits (89), Expect = 0.010
Identities = 8/45 (17%), Positives = 9/45 (20%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
A P P P + P PP P
Sbjct: 407 APGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPAA 451
Score = 37.2 bits (87), Expect = 0.019
Identities = 6/57 (10%), Positives = 7/57 (12%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
PP P P + P P
Sbjct: 391 PPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPP 447
Score = 36.0 bits (84), Expect = 0.042
Identities = 10/45 (22%), Positives = 11/45 (24%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AAP P P + P PP P P
Sbjct: 415 AAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPAAAPQPAVRL 459
Score = 36.0 bits (84), Expect = 0.045
Identities = 15/93 (16%), Positives = 22/93 (23%), Gaps = 11/93 (11%)
Query: 32 PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIV 91
PP P PP G P P P + P
Sbjct: 391 PPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPD-----------A 439
Query: 92 SPEIPSDTTPLWKELEEVPINNLEEFTELFSRQ 124
+ + P V +N+ E+ L +
Sbjct: 440 AAAAAAPPAPAAAPQPAVRLNSFEDIVALAEEK 472
Score = 34.1 bits (79), Expect = 0.15
Identities = 7/57 (12%), Positives = 8/57 (14%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P PP P P P + P
Sbjct: 392 PSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPA 448
Score = 33.7 bits (78), Expect = 0.23
Identities = 10/51 (19%), Positives = 10/51 (19%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AA P P P P PP P P
Sbjct: 408 PGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPAAAPQPAVR 458
>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 767
Score = 37.9 bits (88), Expect = 0.010
Identities = 10/36 (27%), Positives = 11/36 (30%)
Query: 25 NVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP 60
+ QA PP P P P P P P
Sbjct: 54 SAQAYAPPAPAPLPAALVAPAPAAASIAAPAAVPAP 89
Score = 35.5 bits (82), Expect = 0.065
Identities = 10/39 (25%), Positives = 12/39 (30%), Gaps = 2/39 (5%)
Query: 26 VQAAPPP--PPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
V A+ PP P PLP + P P
Sbjct: 51 VAASAQAYAPPAPAPLPAALVAPAPAAASIAAPAAVPAP 89
Score = 34.8 bits (80), Expect = 0.11
Identities = 11/57 (19%), Positives = 12/57 (21%), Gaps = 8/57 (14%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
A PP P P P P P G + G L
Sbjct: 51 VAASAQAYAPPAPAPLP----AALVAPAPAAASIAAPAAVPAP----GAIGDLRGEL 99
Score = 31.7 bits (72), Expect = 0.86
Identities = 6/30 (20%), Positives = 6/30 (20%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPP 52
P P P P P P
Sbjct: 60 PPAPAPLPAALVAPAPAAASIAAPAAVPAP 89
>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
Provisional.
Length = 464
Score = 38.0 bits (89), Expect = 0.010
Identities = 10/55 (18%), Positives = 11/55 (20%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
S + AAP P P P P GT
Sbjct: 83 SASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPDIPAGT 137
Score = 34.1 bits (79), Expect = 0.16
Identities = 12/66 (18%), Positives = 17/66 (25%), Gaps = 7/66 (10%)
Query: 15 ETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP--- 71
E+ S+ + P A P P P P + P P
Sbjct: 82 ESASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAA----PAAPAAEVAADPDIPAGT 137
Query: 72 PMMGTT 77
M+ T
Sbjct: 138 EMVTMT 143
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 36.9 bits (86), Expect = 0.011
Identities = 18/57 (31%), Positives = 19/57 (33%), Gaps = 6/57 (10%)
Query: 24 PNVQAAPPPPPPPPPLPPP--PPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
PN P P P PP P P P P + PP P P P P P
Sbjct: 93 PNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKP----EPKPVVEEKAAP 145
Score = 36.1 bits (84), Expect = 0.021
Identities = 17/55 (30%), Positives = 21/55 (38%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P++ A PP P P P P+ P P+ P P P PP P P
Sbjct: 83 PSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKP 137
Score = 35.4 bits (82), Expect = 0.043
Identities = 17/64 (26%), Positives = 20/64 (31%), Gaps = 2/64 (3%)
Query: 24 PNVQAAPPPP--PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
P A P P P P P+ PP P P P P P P++
Sbjct: 87 PATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKAAPT 146
Query: 82 GPLY 85
G Y
Sbjct: 147 GKAY 150
Score = 32.7 bits (75), Expect = 0.30
Identities = 15/49 (30%), Positives = 20/49 (40%), Gaps = 6/49 (12%)
Query: 26 VQAAPPPPPP--PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
V+A P P + PP + P P+ PP P P+ P P P
Sbjct: 75 VRAGDAAAPSLDPATVAPPNTPVEPEPAPV--EPPKPKPV--EKPKPKP 119
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 37.7 bits (87), Expect = 0.012
Identities = 21/59 (35%), Positives = 22/59 (37%), Gaps = 6/59 (10%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP--PPPPMMGTPPPPPPMMGTTPHVLGPL 84
P PPP PP P P TP P PP P M P P G T +L L
Sbjct: 774 PTPQPPPQAPPAPQQRPRGAPTPQP----PPQAGPTSMQLMPRAAPGQQGPTKQILRQL 828
Score = 36.6 bits (84), Expect = 0.026
Identities = 20/59 (33%), Positives = 20/59 (33%), Gaps = 3/59 (5%)
Query: 28 AAPPPPPPPPPLPPPPPMMGT---TPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
A PP P PP G PPP P P P PPP G T L P
Sbjct: 753 RARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMP 811
Score = 34.7 bits (79), Expect = 0.13
Identities = 17/55 (30%), Positives = 17/55 (30%), Gaps = 5/55 (9%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP----PPPPPMMGTTP 78
A PP P PP G PP P P P P PPP P
Sbjct: 733 RARPPAAAPGRARPPAAAPGRARPPA-AAPGRARPPAAAPGAPTPQPPPQAPPAP 786
Score = 33.5 bits (76), Expect = 0.24
Identities = 17/57 (29%), Positives = 19/57 (33%), Gaps = 4/57 (7%)
Query: 26 VQAAP----PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
+Q AP PPP P P+ PP G P T PP PP
Sbjct: 687 IQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGR 743
Score = 29.6 bits (66), Expect = 4.7
Identities = 16/49 (32%), Positives = 16/49 (32%), Gaps = 6/49 (12%)
Query: 28 AAPPPPPPP--PPLPPPPPMMGTTP-PPMMGTPP---PPPPMMGTPPPP 70
AP P PP PP P T P P PP G PP
Sbjct: 699 RAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPP 747
Score = 29.3 bits (65), Expect = 5.9
Identities = 11/51 (21%), Positives = 11/51 (21%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
A P PP G PP PP PP
Sbjct: 713 RAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGR 763
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins
is found in bacteria. Proteins in this family are
typically between 115 and 181 amino acids in length.
There are two completely conserved residues (F and G)
that may be functionally important.
Length = 110
Score = 35.1 bits (81), Expect = 0.014
Identities = 14/53 (26%), Positives = 15/53 (28%), Gaps = 4/53 (7%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
P Q PPPP PP M PP P P P +
Sbjct: 21 QQQPYHQQMPPPPYSPPQQQQGHFM----PPQPQPYPKQSPQQQQPPQFSSFL 69
Score = 33.9 bits (78), Expect = 0.039
Identities = 11/50 (22%), Positives = 11/50 (22%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
PP P M PP PP PP P
Sbjct: 5 QNTNQYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPK 54
Score = 33.2 bits (76), Expect = 0.072
Identities = 11/54 (20%), Positives = 11/54 (20%), Gaps = 5/54 (9%)
Query: 24 PNVQAAPPPPPPPPPL--PPPPPMMGTTPPPMMGTPPP---PPPMMGTPPPPPP 72
N PP P P M P PP P P
Sbjct: 5 QNTNQYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQ 58
>gnl|CDD|151322 pfam10873, DUF2668, Protein of unknown function (DUF2668). Members
in this family of proteins are annotated as Cysteine and
tyrosine-rich protein 1, however currently no function
is known.
Length = 154
Score = 35.6 bits (82), Expect = 0.019
Identities = 16/52 (30%), Positives = 18/52 (34%), Gaps = 13/52 (25%)
Query: 25 NVQAAPPPPPP---------PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP 67
N + P PPP P LPPP + P PPPP G
Sbjct: 105 NAISYPMAPPPYTYDHEMEYPTDLPPPY----SPAPQASAQRSPPPPYPGNS 152
Score = 29.5 bits (66), Expect = 2.2
Identities = 17/62 (27%), Positives = 21/62 (33%), Gaps = 11/62 (17%)
Query: 14 KETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
+ + N S P APPP + P + PPP P PPPP
Sbjct: 100 RAAHINAISYP---MAPPPYTYDHEMEYPTDL----PPPYSPA----PQASAQRSPPPPY 148
Query: 74 MG 75
G
Sbjct: 149 PG 150
>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 130
Score = 35.2 bits (81), Expect = 0.019
Identities = 18/56 (32%), Positives = 19/56 (33%), Gaps = 4/56 (7%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
QAA P P P+P P P P G P PM GT VL
Sbjct: 25 AQAAAPAQPASTPVPVPTEASPQVEAQ----APQPAAAAGADAMPSPMPGTILKVL 76
>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
Length = 1250
Score = 37.3 bits (86), Expect = 0.020
Identities = 16/44 (36%), Positives = 20/44 (45%), Gaps = 3/44 (6%)
Query: 18 SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP 61
+N + L + + PP PP P P P P P P P P P P
Sbjct: 908 NNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTP---DPEPTP 948
Score = 32.0 bits (72), Expect = 0.79
Identities = 17/38 (44%), Positives = 17/38 (44%), Gaps = 11/38 (28%)
Query: 34 PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
PP PP P P P P P TP P P TP P P
Sbjct: 920 PPSPPDPDPTP----DPDP---TPDPDP----TPDPEP 946
>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
terminal. This domain family is found in eukaryotes,
and is typically between 177 and 207 amino acids in
length. The family is found in association with
pfam00778, pfam02377, pfam00610, pfam00595. The segment
polarity gene dishevelled (dsh) is required for pattern
formation of the embryonic segments. It is involved in
the determination of body organisation through the
Wingless pathway (analogous to the Wnt-1 pathway).
Length = 202
Score = 36.1 bits (83), Expect = 0.022
Identities = 15/38 (39%), Positives = 19/38 (50%), Gaps = 2/38 (5%)
Query: 34 PPPPPLPPPPPMMGTTPPPMMGTPPPPPPM--MGTPPP 69
P P P PPM+ PPP PP PP+ + + PP
Sbjct: 151 APGVPPPYNPPMLMMKPPPPSPGPPGAPPVRELASVPP 188
Score = 29.5 bits (66), Expect = 2.4
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 20 IFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGT 56
++ P + PPPP P PP PP + + PP + +
Sbjct: 157 PYNPPMLMMKPPPPSPGPPGAPPVRELASVPPELTAS 193
Score = 28.0 bits (62), Expect = 8.6
Identities = 18/54 (33%), Positives = 21/54 (38%), Gaps = 5/54 (9%)
Query: 24 PNVQAAPPP--PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
P+ ++ PPP L P P G PPP P M PPPP G
Sbjct: 123 PSERSGPPPSEGSVRSSLSHPSSHSSYGAP---GVPPPYNPPMLMMKPPPPSPG 173
>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP). This entry
represents proteins encoded by genes which are always
found in type III secretion operons, although their
function in the processes of secretion and virulence is
unclear. Hpa stands for Hrp-associated gene, where Hrp
stands for hypersensitivity response and virulence. see
also PMID:18584024.
Length = 185
Score = 35.9 bits (83), Expect = 0.022
Identities = 15/51 (29%), Positives = 16/51 (31%), Gaps = 1/51 (1%)
Query: 23 LPNVQAAPPP-PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+A PP P PP P P P P PP G P P
Sbjct: 21 RRAARAGPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPPRATDGDRDPQPL 71
Score = 32.8 bits (75), Expect = 0.18
Identities = 11/49 (22%), Positives = 13/49 (26%), Gaps = 4/49 (8%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
++ A PP P PP P P P PP
Sbjct: 16 YARLMRRAARAGPPGTPAPPGPAEDAHPEFP----ERPRDAPAPPAPPR 60
Score = 28.2 bits (63), Expect = 6.5
Identities = 11/57 (19%), Positives = 11/57 (19%), Gaps = 4/57 (7%)
Query: 26 VQAAPPPP-PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP---PMMGTTP 78
AAP PP P P P P P P
Sbjct: 3 AGAAPRAARRSFDYARLMRRAARAGPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPP 59
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 36.7 bits (85), Expect = 0.023
Identities = 16/58 (27%), Positives = 17/58 (29%), Gaps = 3/58 (5%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP---PMMGTTP 78
P P +P PP P P PP PPP P G TP
Sbjct: 123 LGANPVTATGEPVPQMPASPPGPEGEPQPGNTPVSFPPQGSVAVPPPTAFYPGNGVTP 180
Score = 31.7 bits (72), Expect = 0.76
Identities = 19/59 (32%), Positives = 22/59 (37%), Gaps = 6/59 (10%)
Query: 24 PNVQAAPPPPPPP-PPLPPPPPMMGTTPPPMMGTPPPPP---PMMGTPPPPPPMMGTTP 78
P Q PP P P P P+ + PP PPP P G PPP + P
Sbjct: 134 PVPQMPASPPGPEGEPQPGNTPV--SFPPQGSVAVPPPTAFYPGNGVTPPPQVTYQSVP 190
Score = 29.0 bits (65), Expect = 5.2
Identities = 14/43 (32%), Positives = 16/43 (37%), Gaps = 3/43 (6%)
Query: 23 LPNVQAAPPPPPPPPPLPPPP---PMMGTTPPPMMGTPPPPPP 62
P PP +PPP P G TPPP + P P
Sbjct: 150 QPGNTPVSFPPQGSVAVPPPTAFYPGNGVTPPPQVTYQSVPVP 192
>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 484
Score = 36.5 bits (85), Expect = 0.027
Identities = 8/46 (17%), Positives = 9/46 (19%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
AAP P P P + P P
Sbjct: 105 AAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAA 150
Score = 31.5 bits (72), Expect = 1.1
Identities = 9/44 (20%), Positives = 11/44 (25%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
A P P PPP P P P ++
Sbjct: 52 APPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLT 95
Score = 31.1 bits (71), Expect = 1.2
Identities = 8/61 (13%), Positives = 9/61 (14%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
V A P P P P P V
Sbjct: 100 QLVARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVADAVKAR 159
Query: 84 L 84
+
Sbjct: 160 I 160
Score = 29.2 bits (66), Expect = 5.9
Identities = 17/99 (17%), Positives = 21/99 (21%), Gaps = 21/99 (21%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP---------------------PP 60
+L AA P PPP P + P P P
Sbjct: 50 ALAPPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPA 109
Query: 61 PPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDT 99
P P P P +P+D
Sbjct: 110 APEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADA 148
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as
a single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through
DAZ repeats.
Length = 136
Score = 34.4 bits (79), Expect = 0.032
Identities = 19/70 (27%), Positives = 24/70 (34%), Gaps = 4/70 (5%)
Query: 15 ETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP--PMMGTPPPPPP 72
E Y + +P P P P PM + PP P+ PP PP
Sbjct: 10 ELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPP 69
Query: 73 MMGTTPHVLG 82
G+T V G
Sbjct: 70 --GSTVLVQG 77
Score = 32.1 bits (73), Expect = 0.24
Identities = 10/41 (24%), Positives = 11/41 (26%)
Query: 40 PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
PP P + P P P P P M V
Sbjct: 3 DAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQV 43
Score = 31.7 bits (72), Expect = 0.35
Identities = 17/64 (26%), Positives = 19/64 (29%), Gaps = 4/64 (6%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM----MGTTPH 79
P P PP P P P M P +G PPM +G P
Sbjct: 6 PAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPP 65
Query: 80 VLGP 83
V P
Sbjct: 66 VYPP 69
Score = 29.4 bits (66), Expect = 1.7
Identities = 11/52 (21%), Positives = 13/52 (25%), Gaps = 3/52 (5%)
Query: 32 PPPPPPPLPPPPPMMGTTPPPMMGT---PPPPPPMMGTPPPPPPMMGTTPHV 80
P PP P P M P P P M + + H
Sbjct: 2 PDAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHP 53
Score = 29.4 bits (66), Expect = 1.8
Identities = 16/73 (21%), Positives = 22/73 (30%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRL 89
P PP L P ++ + P P P M P +G Y+
Sbjct: 2 PDAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIG 61
Query: 90 IVSPEIPSDTTPL 102
P P +T L
Sbjct: 62 APPPVYPPGSTVL 74
Score = 27.9 bits (62), Expect = 6.1
Identities = 16/70 (22%), Positives = 18/70 (25%), Gaps = 24/70 (34%)
Query: 27 QAAPPPPPP-PPPLPPPPPMMGTTPPPM---MGTPPPPPPMMGTP--------------- 67
P P P + PP +G PPP P T
Sbjct: 29 APYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPPGSTVLVQGGFDAGARFGPG 88
Query: 68 -----PPPPP 72
PPPPP
Sbjct: 89 TGSSIPPPPP 98
>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 584
Score = 36.0 bits (83), Expect = 0.035
Identities = 12/50 (24%), Positives = 14/50 (28%), Gaps = 8/50 (16%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
S+P P P P P P P P P + P P
Sbjct: 386 SIPANLLHNAPQAAPAPSAAAP-----EPKH---QPAPEPRPVLAPTPAS 427
Score = 34.5 bits (79), Expect = 0.12
Identities = 15/68 (22%), Positives = 18/68 (26%), Gaps = 16/68 (23%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
L N A P P P P P P P P+ P P G
Sbjct: 391 LLHNAPQAAPAPSAAAPEPKHQP------------APEPRPV----LAPTPASGEPNAAA 434
Query: 82 GPLYWTRL 89
W+ +
Sbjct: 435 VRSMWSTV 442
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 36.2 bits (83), Expect = 0.036
Identities = 21/79 (26%), Positives = 29/79 (36%), Gaps = 5/79 (6%)
Query: 18 SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
++I + V +A PPLP P + P PM P PPP + P P
Sbjct: 340 TDIGNKAVVSSAKKEAEEVPPLPQAAPAV-VKPGPMEIPTPVPPPGLAIPSLVAPPGLVA 398
Query: 78 PHVLGPLYWTRLIVSPEIP 96
P + P + SP
Sbjct: 399 PTEINP----SFLASPRKK 413
Score = 35.8 bits (82), Expect = 0.047
Identities = 30/138 (21%), Positives = 38/138 (27%), Gaps = 12/138 (8%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
+ P V P P P PP + PP + P P P P
Sbjct: 364 AAPAVVKPGPMEIPTPVPPPGLAIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVTF 423
Query: 82 GPLYWTRLIVSPEIPSDTTPLWKEL------------EEVPINNLEEFTELFSRQVTAQR 129
G L T P T+ K L E +E EL + V
Sbjct: 424 GALDDTLAWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMNSE 483
Query: 130 PVTRKRQQKSSKVQNVAR 147
+ Q+ S N AR
Sbjct: 484 DASLASQEGMSIRGNSAR 501
>gnl|CDD|237541 PRK13881, PRK13881, conjugal transfer protein TrbI; Provisional.
Length = 472
Score = 35.9 bits (83), Expect = 0.044
Identities = 12/46 (26%), Positives = 15/46 (32%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
++ A P P P P + P PP PP G P
Sbjct: 86 IEPASPLKVPEMPTGPASAPLPIARPDNPDAPPTPPANPGNPGQVN 131
Score = 31.7 bits (72), Expect = 0.75
Identities = 11/50 (22%), Positives = 13/50 (26%), Gaps = 4/50 (8%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+ P P P P P+ P PP PP P
Sbjct: 86 IEPASPLKVPEMPTGPASAPLPI----ARPDNPDAPPTPPANPGNPGQVN 131
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin
carboxyl carrier protein subunit; Validated.
Length = 153
Score = 34.5 bits (79), Expect = 0.045
Identities = 15/57 (26%), Positives = 21/57 (36%), Gaps = 1/57 (1%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP-PPPPMMGTPPPPPPMMGTTPHVL 81
+ + P P P P P + + P P+ P P P G PM G +L
Sbjct: 43 IDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENVVTAPMPGKILRIL 99
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 35.9 bits (83), Expect = 0.047
Identities = 18/79 (22%), Positives = 25/79 (31%), Gaps = 12/79 (15%)
Query: 5 DKRAKRRKKKETYSNIFS------LPNVQAAPPPPP------PPPPLPPPPPMMGTTPPP 52
D + R K+ +F+ P+ AA P P P P TPP
Sbjct: 367 DDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPT 426
Query: 53 MMGTPPPPPPMMGTPPPPP 71
+ PP P+ P
Sbjct: 427 VSVDPPAAVPVNPPSTAPQ 445
Score = 33.6 bits (77), Expect = 0.21
Identities = 18/81 (22%), Positives = 22/81 (27%), Gaps = 2/81 (2%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM--GTTPHVLGPL 84
Q+A P PP + PP PP P P P L P+
Sbjct: 415 QSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPI 474
Query: 85 YWTRLIVSPEIPSDTTPLWKE 105
+ I T KE
Sbjct: 475 QEKAEQATGNIKEAPTGTQKE 495
>gnl|CDD|227547 COG5222, COG5222, Uncharacterized conserved protein, contains RING
Zn-finger [General function prediction only].
Length = 427
Score = 35.5 bits (81), Expect = 0.050
Identities = 21/68 (30%), Positives = 28/68 (41%), Gaps = 7/68 (10%)
Query: 4 MDKRAKRRKKKET--YSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP 61
M ++ KR + +S + P ++A P P P G P PMM P PP
Sbjct: 357 MSEKRKREDPNSSAVFSKATAEPAFKSAMAIPMPSMPHV-----QGFPPFPMMPLPQMPP 411
Query: 62 PMMGTPPP 69
MM PP
Sbjct: 412 MMMIANPP 419
Score = 30.9 bits (69), Expect = 1.6
Identities = 14/35 (40%), Positives = 14/35 (40%), Gaps = 2/35 (5%)
Query: 47 GTTPPPMMGTPP--PPPPMMGTPPPPPPMMGTTPH 79
P P M PP PMM P PP MM P
Sbjct: 385 MAIPMPSMPHVQGFPPFPMMPLPQMPPMMMIANPP 419
>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
Length = 540
Score = 35.7 bits (82), Expect = 0.052
Identities = 18/57 (31%), Positives = 19/57 (33%), Gaps = 4/57 (7%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
V P P PP PP P PP PP P P P + HV G
Sbjct: 160 VGPRPVPTPPGGTPPPDDDEGDEAGAPATPAPPLHPP----PAPHPHPIAEVAHVRG 212
Score = 30.7 bits (69), Expect = 1.7
Identities = 14/49 (28%), Positives = 15/49 (30%), Gaps = 9/49 (18%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P PP PPP G P P P + PP P P
Sbjct: 162 PRPVPTPPGGTPPP---------DDDEGDEAGAPATPAPPLHPPPAPHP 201
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 35.5 bits (81), Expect = 0.052
Identities = 18/71 (25%), Positives = 25/71 (35%), Gaps = 3/71 (4%)
Query: 5 DKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTP---PPMMGTPPPPP 61
K A K ++ S + + P P PP P + + P P + PPPP
Sbjct: 160 TKVAIISKSEDAASQVTPSQKIPETTDPKPSPPAEDKQKPKVESAPVAEKPKAPSSPPPP 219
Query: 62 PMMGTPPPPPP 72
P PP
Sbjct: 220 KQSAKEPQLPP 230
Score = 33.2 bits (75), Expect = 0.30
Identities = 15/55 (27%), Positives = 18/55 (32%), Gaps = 1/55 (1%)
Query: 27 QAAPPPPPPPPPLP-PPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
Q P P P P PP P + P P + PPPP P +
Sbjct: 174 QVTPSQKIPETTDPKPSPPAEDKQKPKVESAPVAEKPKAPSSPPPPKQSAKEPQL 228
>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator. In
eukaryotes, this family of proteins induces
mitochondrial fission.
Length = 248
Score = 35.1 bits (81), Expect = 0.054
Identities = 14/28 (50%), Positives = 16/28 (57%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPM 53
V PPPPPPPPP PPP T+ +
Sbjct: 180 VLEVPPPPPPPPPPPPPSLQQSTSAIDL 207
Score = 34.7 bits (80), Expect = 0.071
Identities = 17/48 (35%), Positives = 19/48 (39%), Gaps = 9/48 (18%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P P PP T P++ PPPPPP PPPPP
Sbjct: 162 VPSSSTTSFPISPP----TEEPVLEVPPPPPP-----PPPPPPPSLQQ 200
Score = 33.2 bits (76), Expect = 0.19
Identities = 17/42 (40%), Positives = 21/42 (50%), Gaps = 9/42 (21%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM 63
S + +PP P +PPPPP PPP PPPPP +
Sbjct: 166 STTSFPISPPTEEPVLEVPPPPP-----PPP----PPPPPSL 198
Score = 31.6 bits (72), Expect = 0.61
Identities = 11/28 (39%), Positives = 12/28 (42%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPP 51
P + PPPPP PPPPP
Sbjct: 174 PPTEEPVLEVPPPPPPPPPPPPPSLQQS 201
Score = 30.1 bits (68), Expect = 1.9
Identities = 16/58 (27%), Positives = 17/58 (29%), Gaps = 21/58 (36%)
Query: 21 FSLPNVQAAPP-----PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
+ P PP P PPPPPP PPPPPP
Sbjct: 155 LLSSDESVPSSSTTSFPISPPTEEPVLE------------VPPPPPP----PPPPPPP 196
Score = 28.9 bits (65), Expect = 4.4
Identities = 16/44 (36%), Positives = 18/44 (40%), Gaps = 9/44 (20%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
PP P PPPP PPP PPPPPP + +
Sbjct: 173 SPPTEEPVLEVPPPP-----PPP----PPPPPPSLQQSTSAIDL 207
>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
Length = 209
Score = 34.4 bits (79), Expect = 0.067
Identities = 14/46 (30%), Positives = 15/46 (32%), Gaps = 3/46 (6%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
Q A P P TTP MG P PP + P P
Sbjct: 97 QKATSMSEPATENKPAEV---TTPVEPMGLPETPPAVPVPAPAPAV 139
Score = 30.6 bits (69), Expect = 1.2
Identities = 9/37 (24%), Positives = 11/37 (29%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
+ P P PM PP + P P P
Sbjct: 102 MSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPA 138
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 35.2 bits (82), Expect = 0.067
Identities = 10/49 (20%), Positives = 13/49 (26%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P A PPPP P P + T ++ G
Sbjct: 393 PPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKS 441
Score = 33.7 bits (78), Expect = 0.20
Identities = 15/62 (24%), Positives = 18/62 (29%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
LP + P P PP PPPP P P+ TT +
Sbjct: 363 APLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQL 422
Query: 81 LG 82
L
Sbjct: 423 LA 424
Score = 32.9 bits (76), Expect = 0.40
Identities = 10/56 (17%), Positives = 13/56 (23%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
A PP P +PPPP P + + T
Sbjct: 384 TAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAK 439
Score = 32.9 bits (76), Expect = 0.43
Identities = 13/55 (23%), Positives = 14/55 (25%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
P P PP P T P PP P + P P P
Sbjct: 362 AAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLP 416
Score = 32.1 bits (74), Expect = 0.68
Identities = 10/57 (17%), Positives = 14/57 (24%), Gaps = 2/57 (3%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
S +A P + PP PPP P P + +
Sbjct: 374 SAAPAASAQATAAPTAAVAPPQAPAV--PPPPASAPQQAPAVPLPETTSQLLAARQQ 428
Score = 32.1 bits (74), Expect = 0.73
Identities = 14/57 (24%), Positives = 16/57 (28%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
A P P PP P T P PP P + PP P +
Sbjct: 362 AAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPET 418
Score = 31.4 bits (72), Expect = 1.1
Identities = 12/58 (20%), Positives = 14/58 (24%), Gaps = 3/58 (5%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP---PPPMMGTPPPPPPMMGTTP 78
P P PP P P + P PPP P P +
Sbjct: 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPET 418
Score = 31.4 bits (72), Expect = 1.3
Identities = 9/87 (10%), Positives = 14/87 (16%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P A P P P + + + L
Sbjct: 402 PPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSEPAAASRARPVNSALERLAS 461
Query: 84 LYWTRLIVSPEIPSDTTPLWKELEEVP 110
+ + WK V
Sbjct: 462 VRPAPSALEKAPAKKEAYRWKATNPVE 488
Score = 28.7 bits (65), Expect = 8.3
Identities = 7/50 (14%), Positives = 11/50 (22%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
L Q A P + + + P P + P
Sbjct: 429 LQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEA 478
>gnl|CDD|237000 PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed.
Length = 547
Score = 35.2 bits (82), Expect = 0.069
Identities = 13/55 (23%), Positives = 13/55 (23%), Gaps = 3/55 (5%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
V AA P P P P P P P G PH
Sbjct: 194 VAAAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAA---AAPAAAAAPGKAPHA 245
Score = 29.8 bits (68), Expect = 3.2
Identities = 8/38 (21%), Positives = 8/38 (21%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP 61
AA P P P P P P
Sbjct: 202 AAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAAP 239
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 34.9 bits (80), Expect = 0.083
Identities = 18/57 (31%), Positives = 20/57 (35%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
Q+ PP P P + PP G P PP P P M P P G P
Sbjct: 189 QSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPP 245
Score = 34.5 bits (79), Expect = 0.10
Identities = 12/48 (25%), Positives = 13/48 (27%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P Q P PPP P + P PPP P
Sbjct: 131 PPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSG 178
Score = 34.1 bits (78), Expect = 0.13
Identities = 15/59 (25%), Positives = 17/59 (28%), Gaps = 1/59 (1%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
+ P Q PPP P P PP +PP P PPP
Sbjct: 121 YYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQ-SPPQQPQYQQNPPPQAQSAPQVSG 178
Score = 32.2 bits (73), Expect = 0.55
Identities = 14/48 (29%), Positives = 16/48 (33%), Gaps = 6/48 (12%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P Q+AP P P P + PP P P M PP
Sbjct: 168 PQAQSAPQVSGLYPEESPYQPQ---SYPP---NEPLPSSMAMQPPYSG 209
Score = 30.7 bits (69), Expect = 1.8
Identities = 15/59 (25%), Positives = 16/59 (27%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLY 85
+ P PPP P P P PP P PPP L P
Sbjct: 125 PSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEE 183
Score = 30.3 bits (68), Expect = 2.0
Identities = 16/67 (23%), Positives = 19/67 (28%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
+ A PP P P P P + PP P PP P P P+
Sbjct: 63 DAPLQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYY 122
Query: 81 LGPLYWT 87
P
Sbjct: 123 PPPSQPQ 129
Score = 28.4 bits (63), Expect = 9.6
Identities = 14/54 (25%), Positives = 16/54 (29%), Gaps = 5/54 (9%)
Query: 31 PPPPPPPPLPPPPPMMGTT-----PPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
P PP + P PPP PPP P PPP +
Sbjct: 99 PSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQ 152
>gnl|CDD|147144 pfam04834, Adeno_E3_14_5, Early E3 14.5 kDa protein. The E3B
14.5 kDa was first identified in Human adenovirus type
5. It is an integral membrane protein oriented with its
C terminus in the cytoplasm. It functions to
down-regulate the epidermal growth factor receptor and
prevent tumour necrosis factor cytolysis. It achieves
this through the interaction with E3 10.4 kDa protein.
Length = 97
Score = 32.4 bits (74), Expect = 0.088
Identities = 10/21 (47%), Positives = 11/21 (52%)
Query: 24 PNVQAAPPPPPPPPPLPPPPP 44
P+ Q P PPPPP P P
Sbjct: 63 PDPQHIPLQQPPPPPEPQPRA 83
Score = 27.0 bits (60), Expect = 6.5
Identities = 10/26 (38%), Positives = 13/26 (50%), Gaps = 1/26 (3%)
Query: 16 TYSNIFSLPNVQAAPPPPPPPPPLPP 41
Y + +P +Q PPPP P P P
Sbjct: 61 RYPDPQHIP-LQQPPPPPEPQPRAPS 85
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 35.0 bits (80), Expect = 0.099
Identities = 14/52 (26%), Positives = 16/52 (30%), Gaps = 3/52 (5%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
+ PP P P G P PPP P PP G+ P
Sbjct: 440 HSGLHSGPPQSPFAQHPFTSGGLPAI---GPPPSLPTSTPAAPPRASSGSQP 488
Score = 32.4 bits (73), Expect = 0.59
Identities = 24/107 (22%), Positives = 30/107 (28%), Gaps = 24/107 (22%)
Query: 24 PNVQAAPPPPPPPPPL----------------PPPP--PMMGTTPPPMMGTPPPPPPMMG 65
+ A P P P P P PP P + P P P
Sbjct: 208 SPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSS 267
Query: 66 TPPPPPPMMGTTPHVL--GPLYWTRLIVSPEIPSDTTPLWKELEEVP 110
P PPM PH L GP++ +P P +P
Sbjct: 268 HHGPGPPM----PHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLP 310
Score = 32.4 bits (73), Expect = 0.61
Identities = 15/46 (32%), Positives = 17/46 (36%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
A PPP P P PP + P P G PP PP+
Sbjct: 464 AIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPI 509
Score = 31.6 bits (71), Expect = 1.1
Identities = 20/62 (32%), Positives = 21/62 (33%), Gaps = 15/62 (24%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPP---------------PPPP 72
P PP PLPP P M PPP P P PP PPPP
Sbjct: 326 LQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPP 385
Query: 73 MM 74
+
Sbjct: 386 AL 387
Score = 31.2 bits (70), Expect = 1.5
Identities = 24/100 (24%), Positives = 34/100 (34%), Gaps = 3/100 (3%)
Query: 5 DKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPP--PPPMMGTTPPPMMGTPPPPPP 62
D A+++ + +P A P PPP P PP P P P
Sbjct: 165 DSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPL 224
Query: 63 MMGTPPPPPPMMGTTPH-VLGPLYWTRLIVSPEIPSDTTP 101
+ + P P +PH L P ++ P PS P
Sbjct: 225 SLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHP 264
Score = 30.4 bits (68), Expect = 2.2
Identities = 19/60 (31%), Positives = 25/60 (41%), Gaps = 7/60 (11%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPM-----MGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P++Q P P P LPPPP + + T PP PPP +M P + P
Sbjct: 366 PHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPP--SAHPPPLQLMPQSQPLQSVPAQPP 423
Score = 28.5 bits (63), Expect = 8.9
Identities = 16/69 (23%), Positives = 21/69 (30%), Gaps = 12/69 (17%)
Query: 28 AAPPPP----PPPPPLPPP---PPMMG---TTPPPMMGTPPPPPPMMGTPPPPP--PMMG 75
+A PPP P PL PP++ + PP P P P
Sbjct: 400 SAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTS 459
Query: 76 TTPHVLGPL 84
+GP
Sbjct: 460 GGLPAIGPP 468
Score = 28.5 bits (63), Expect = 9.4
Identities = 15/56 (26%), Positives = 19/56 (33%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
P P P PP + P P PPP +P P P ++ T H
Sbjct: 489 PGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPEPTVVNTPSH 544
>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 504
Score = 34.8 bits (80), Expect = 0.10
Identities = 12/57 (21%), Positives = 15/57 (26%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
PLP P P P PP++ P + VL L
Sbjct: 381 PTAAATAAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAPAAAPAGLALRWRDVLAAL 437
Score = 31.7 bits (72), Expect = 0.85
Identities = 11/45 (24%), Positives = 12/45 (26%), Gaps = 6/45 (13%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+AP P P P PP P P PP
Sbjct: 379 SAPTAAATAAGAPLP------DFDPRPRGPPAPEPARSAEAPPLV 417
Score = 31.0 bits (70), Expect = 1.3
Identities = 12/43 (27%), Positives = 12/43 (27%), Gaps = 5/43 (11%)
Query: 26 VQAAPPPPPPPP---PLPPPPPMMGTTPPPMMGTPPPPPPMMG 65
AA P P P P PP P PP P
Sbjct: 382 TAAATAAGAPLPDFDPRPRGPP--APEPARSAEAPPLVAPAAA 422
Score = 29.4 bits (66), Expect = 4.7
Identities = 16/70 (22%), Positives = 16/70 (22%), Gaps = 10/70 (14%)
Query: 19 NIFSLPNVQAAPPPPPP------PPPLPPPP---PMMGTTPPPMMGTPPPPP-PMMGTPP 68
S APP P P L T G P P P PP
Sbjct: 344 GAPSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPP 403
Query: 69 PPPPMMGTTP 78
P P
Sbjct: 404 APEPARSAEA 413
Score = 28.3 bits (63), Expect = 9.0
Identities = 12/67 (17%), Positives = 17/67 (25%), Gaps = 4/67 (5%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
V++ P P+ P P P P PP P
Sbjct: 372 KEVRSLRSAPTAAAT-AAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAPAAAPAG---LA 427
Query: 84 LYWTRLI 90
L W ++
Sbjct: 428 LRWRDVL 434
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 34.5 bits (78), Expect = 0.10
Identities = 25/84 (29%), Positives = 32/84 (38%), Gaps = 11/84 (13%)
Query: 29 APPPPPP----PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP------PPPPPMMGTTP 78
AP P P P P P PP T PP P P P P PP PP+ P
Sbjct: 99 APAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKP 158
Query: 79 H-VLGPLYWTRLIVSPEIPSDTTP 101
P++ + P+ P+ + P
Sbjct: 159 APAAKPIFLHNQLPPPDYPAASCP 182
Score = 28.3 bits (62), Expect = 8.0
Identities = 19/74 (25%), Positives = 20/74 (27%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
A P P P P P P P P P PPP + P P
Sbjct: 83 APSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACPP 142
Query: 88 RLIVSPEIPSDTTP 101
P P TP
Sbjct: 143 STRQCPPAPPLPTP 156
>gnl|CDD|221404 pfam12067, Sox_C_TAD, Sox C-terminal transactivation domain. This
domain is found at the C-terminus of the Sox family of
transcription factors. It is found associated with
pfam00505. It binds to the Armadillo repeats (pfam00514)
in Catenin beta-1 (CTNNB1), which is involved in
transcriptional regulation. It functions as a
transactivating domain (TAD).
Length = 197
Score = 33.9 bits (78), Expect = 0.10
Identities = 14/60 (23%), Positives = 18/60 (30%), Gaps = 10/60 (16%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMM--------GTPPPPPPMMGTPPPPP--PMMGTTPHV 80
P P P +MG PP M P +G PPP + T +
Sbjct: 74 AVTENIPQGSPAPSIMGCPTPPQMYYGQMYVPECAKHHPVQLGQLSPPPESQHLDTLDQL 133
Score = 29.7 bits (67), Expect = 2.5
Identities = 11/49 (22%), Positives = 15/49 (30%), Gaps = 3/49 (6%)
Query: 26 VQAAPPP--PPPPPPLPPPPPMMGTTP-PPMMGTPPPPPPMMGTPPPPP 71
Q P P P ++ + P P +MG P PP
Sbjct: 48 CQMMPYGYNSSYAPQNAHAPALLRQMAVTENIPQGSPAPSIMGCPTPPQ 96
>gnl|CDD|235309 PRK04596, minC, septum formation inhibitor; Reviewed.
Length = 248
Score = 34.2 bits (78), Expect = 0.11
Identities = 14/31 (45%), Positives = 15/31 (48%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGT 56
V PPPPPPP P PP+ P M T
Sbjct: 117 VSPPPPPPPPPARAEPAPPVARPAPGRMQRT 147
Score = 33.8 bits (77), Expect = 0.12
Identities = 12/26 (46%), Positives = 13/26 (50%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPP 52
+PPPPPPPPP P P P
Sbjct: 116 AVSPPPPPPPPPARAEPAPPVARPAP 141
Score = 28.8 bits (64), Expect = 5.3
Identities = 12/26 (46%), Positives = 13/26 (50%), Gaps = 4/26 (15%)
Query: 47 GTTPPPMMGTPPPPPPMMGTPPPPPP 72
+PPP PPPPPP P PP
Sbjct: 116 AVSPPP----PPPPPPARAEPAPPVA 137
Score = 28.8 bits (64), Expect = 5.6
Identities = 16/38 (42%), Positives = 17/38 (44%), Gaps = 7/38 (18%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM 64
+AA PPPPP PPPP PP P P M
Sbjct: 114 RAAVSPPPPP---PPPPARAEPAPP----VARPAPGRM 144
>gnl|CDD|165431 PHA03160, PHA03160, hypothetical protein; Provisional.
Length = 499
Score = 34.7 bits (79), Expect = 0.11
Identities = 20/71 (28%), Positives = 26/71 (36%), Gaps = 6/71 (8%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLI 90
P P PP+ + PM PM PPPM PHV P R++
Sbjct: 402 PYGAPKNDHHLLPPLACSQQLPMQPLHVQQAPMQAPHVAPPPM--QPPHVQQP----RVL 455
Query: 91 VSPEIPSDTTP 101
S + S+ P
Sbjct: 456 PSTDGASNEAP 466
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 34.0 bits (78), Expect = 0.12
Identities = 16/57 (28%), Positives = 17/57 (29%), Gaps = 1/57 (1%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPP-MMGTPPPPPPMMGTPPPPPPMMGTTP 78
P P PP P PP P P P P P P P P + P
Sbjct: 59 EPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQP 115
Score = 30.5 bits (69), Expect = 1.6
Identities = 16/55 (29%), Positives = 17/55 (30%), Gaps = 5/55 (9%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P + PP P PP P P P P P P P P P P
Sbjct: 56 PQPEPEPPEEQPKPPTEPETP-----PEPTPPKPKEKPKPEKKPKKPKPKPKPKP 105
Score = 29.8 bits (67), Expect = 2.3
Identities = 11/49 (22%), Positives = 12/49 (24%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P + P PP PP P P P P P
Sbjct: 58 PEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPK 106
Score = 28.2 bits (63), Expect = 8.9
Identities = 16/53 (30%), Positives = 19/53 (35%), Gaps = 1/53 (1%)
Query: 21 FSLPNVQAAPPPPPPPPPLP-PPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
++ + A P P P P PP PP TPP P P P P
Sbjct: 39 LAVFLLAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPE 91
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 34.2 bits (78), Expect = 0.14
Identities = 10/50 (20%), Positives = 10/50 (20%), Gaps = 2/50 (4%)
Query: 25 NVQAAPPPPPPPPPL--PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
N A P P P P PP P P
Sbjct: 204 NAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAP 253
Score = 32.7 bits (74), Expect = 0.47
Identities = 11/53 (20%), Positives = 13/53 (24%), Gaps = 1/53 (1%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
+A P P P P P P P P P P+
Sbjct: 211 AEEEAPAPSEAGSEPAPDPAARAPHAAPD-PPAPAPAPAKTAAPAAAAPVSSG 262
Score = 32.3 bits (73), Expect = 0.61
Identities = 14/56 (25%), Positives = 15/56 (26%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
A P P P P G+ P P P P P P P P
Sbjct: 203 ANAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAP 258
Score = 29.2 bits (65), Expect = 5.1
Identities = 12/38 (31%), Positives = 13/38 (34%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMG 65
+ P P P P P P P P P P P G
Sbjct: 84 SEPAPAAPEPEAAPEPEAPAPAPTPAAEAPAPAAPQAG 121
>gnl|CDD|215496 PLN02918, PLN02918, pyridoxine (pyridoxamine) 5'-phosphate
oxidase.
Length = 544
Score = 34.1 bits (78), Expect = 0.15
Identities = 11/40 (27%), Positives = 14/40 (35%)
Query: 32 PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P P+ PPPP + T P G+ PP
Sbjct: 15 QSLLPLPISPPPPHSSSLSSSPSPTQRFLTPSQGSRLPPR 54
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 34.1 bits (78), Expect = 0.16
Identities = 14/49 (28%), Positives = 15/49 (30%), Gaps = 5/49 (10%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P+ AAP P P PPP P P P PP
Sbjct: 202 PSSSAAPAAPKAKPSPPPPKEEEVEKP-----ASSPEPKASKPSAPPSS 245
Score = 29.4 bits (66), Expect = 5.0
Identities = 12/52 (23%), Positives = 15/52 (28%)
Query: 42 PPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSP 93
P P +PPPP P P + P R+ SP
Sbjct: 202 PSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSSGDRIFASP 253
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by
a conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 33.3 bits (76), Expect = 0.16
Identities = 9/40 (22%), Positives = 10/40 (25%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
P P+ P M P P M P P
Sbjct: 59 PRSAMSGPMGGGHPHQSMPAYMMFNPSSKPQPFMLVPGPQ 98
Score = 33.3 bits (76), Expect = 0.18
Identities = 16/75 (21%), Positives = 20/75 (26%), Gaps = 20/75 (26%)
Query: 24 PNVQAAPPPPPPPPPL-------------------PPPPPMMGTTPPPMMGTPPPPP-PM 63
P+ + P P P L P P G G PP
Sbjct: 84 PSSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAPAGHPGGGGPQQFRPGAGQPPGMQH 143
Query: 64 MGTPPPPPPMMGTTP 78
M P PP ++ T
Sbjct: 144 MPAPALPPNVIDTDL 158
Score = 32.9 bits (75), Expect = 0.22
Identities = 15/76 (19%), Positives = 17/76 (22%), Gaps = 16/76 (21%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMG--------TTPPPMMGTPP-------PPPPMMGTP 67
+ P P P L P P +M T G P P G
Sbjct: 77 PAYMMFNPSSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAPAGHPGGGGPQQFRPGAG 136
Query: 68 PPPP-PMMGTTPHVLG 82
PP M
Sbjct: 137 QPPGMQHMPAPALPPN 152
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 33.9 bits (78), Expect = 0.17
Identities = 15/53 (28%), Positives = 20/53 (37%)
Query: 18 SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
S++ + + PP P P P + P GTPPPP PP
Sbjct: 406 SHVGASSSKHHRLPPSVLPGPRLSSPSPSPSLPTRRPGTPPPPASPPTPSPPS 458
>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein. Members
of this family confer resistance to the metalloid
element tellurium and its salts.
Length = 98
Score = 31.6 bits (72), Expect = 0.18
Identities = 12/34 (35%), Positives = 12/34 (35%), Gaps = 11/34 (32%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
P PPP P PP P PP PP
Sbjct: 1 PAAPVPPPAPAPPAPA-----------PPPAAPP 23
Score = 30.8 bits (70), Expect = 0.35
Identities = 11/18 (61%), Positives = 11/18 (61%)
Query: 27 QAAPPPPPPPPPLPPPPP 44
AAP PPP P P P PP
Sbjct: 1 PAAPVPPPAPAPPAPAPP 18
Score = 30.4 bits (69), Expect = 0.46
Identities = 8/21 (38%), Positives = 8/21 (38%)
Query: 24 PNVQAAPPPPPPPPPLPPPPP 44
P P PP P PP P
Sbjct: 2 AAPVPPPAPAPPAPAPPPAAP 22
Score = 27.0 bits (60), Expect = 7.0
Identities = 8/17 (47%), Positives = 8/17 (47%)
Query: 28 AAPPPPPPPPPLPPPPP 44
P PP P P P PP
Sbjct: 7 PPAPAPPAPAPPPAAPP 23
Score = 27.0 bits (60), Expect = 7.0
Identities = 14/41 (34%), Positives = 14/41 (34%), Gaps = 18/41 (43%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P P PPP P PP P PP P PP
Sbjct: 1 PAAPVPPPAPAPPA--------------PAPP----PAAPP 23
>gnl|CDD|147759 pfam05782, ECM1, Extracellular matrix protein 1 (ECM1). This
family consists of several eukaryotic extracellular
matrix protein 1 (ECM1) sequences. ECM1 has been shown
to regulate endochondral bone formation, stimulate the
proliferation of endothelial cells and induce
angiogenesis. Mutations in the ECM1 gene can cause
lipoid proteinosis, a disorder which causes generalised
thickening of skin, mucosae and certain viscera.
Classical features include beaded eyelid papules and
laryngeal infiltration leading to hoarseness.
Length = 419
Score = 33.8 bits (77), Expect = 0.18
Identities = 30/91 (32%), Positives = 35/91 (38%), Gaps = 7/91 (7%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
AAPP PP LP P+ PP G PP +P P P L P
Sbjct: 48 AAPPSPPLSRRLPVDHPVTSQHDPPFEGQSEVQPP--PSPEDIPVYEEDWPTFLNP---N 102
Query: 88 RLIVSPEIPSDTTPLWKELE--EVPINNLEE 116
P +P + PL KEL +VPI E
Sbjct: 103 VDKAGPAVPQEAIPLQKELPPPQVPIEQKEV 133
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 32.6 bits (73), Expect = 0.19
Identities = 7/49 (14%), Positives = 9/49 (18%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P+ + P P P P P P
Sbjct: 60 PDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEP 108
Score = 31.1 bits (69), Expect = 0.54
Identities = 13/55 (23%), Positives = 14/55 (25%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P + P P P P P P P P P P P P
Sbjct: 64 PEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEP 118
Score = 31.1 bits (69), Expect = 0.58
Identities = 10/72 (13%), Positives = 16/72 (22%), Gaps = 4/72 (5%)
Query: 7 RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGT 66
+ K K + ++ + + P P P P P
Sbjct: 41 KGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEG----EEEPEPEETG 96
Query: 67 PPPPPPMMGTTP 78
P P P
Sbjct: 97 EEEPEPEPEPEP 108
Score = 31.1 bits (69), Expect = 0.59
Identities = 15/61 (24%), Positives = 16/61 (26%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
+ P P P P P P P P P P P P T V P
Sbjct: 74 EEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPEPEPGAATLKSVALP 133
Query: 84 L 84
Sbjct: 134 F 134
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 33.8 bits (77), Expect = 0.20
Identities = 14/64 (21%), Positives = 22/64 (34%)
Query: 8 AKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP 67
R++ + I ++ PP P P ++ P GT P P+ P
Sbjct: 378 PADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVP 437
Query: 68 PPPP 71
P P
Sbjct: 438 PQPT 441
Score = 33.4 bits (76), Expect = 0.26
Identities = 19/117 (16%), Positives = 27/117 (23%), Gaps = 11/117 (9%)
Query: 28 AAPPPPPPPPPLPPPPPMM---GTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
AA P P P + + P PM PP P
Sbjct: 362 AAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNP 421
Query: 85 YWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQRPVTRKRQQKSSK 141
PE P +PI+ + V P ++K +
Sbjct: 422 QSPGTSYGPEPVGPVPPQPTNPYVMPIS--------MANMVYPGHPQEHGHERKRKR 470
>gnl|CDD|177451 PHA02669, PHA02669, hypothetical protein; Provisional.
Length = 210
Score = 33.1 bits (75), Expect = 0.21
Identities = 10/47 (21%), Positives = 13/47 (27%), Gaps = 4/47 (8%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP----PPPPMMGTPPPP 70
+ PP PP+ P P + T P P P
Sbjct: 140 SGPPRQANIPPVTPYPDEVSVGVGSGPSTEHGHYEGDGPEQDLEPEP 186
>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
region.
Length = 295
Score = 33.5 bits (76), Expect = 0.21
Identities = 19/45 (42%), Positives = 19/45 (42%), Gaps = 3/45 (6%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMM---GTPPPPPPMMGTPPPPP 71
PPPPPP P P PM T PP G P P TP P
Sbjct: 227 APPPPPPMARPVPLPMPDTKPPTTSTEGGATSPTSPTYSTPSTSP 271
Score = 28.5 bits (63), Expect = 8.2
Identities = 18/46 (39%), Positives = 19/46 (41%)
Query: 33 PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P P PPPPPM P PM T PP G P +TP
Sbjct: 222 PTPMLAPPPPPPMARPVPLPMPDTKPPTTSTEGGATSPTSPTYSTP 267
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 33.6 bits (77), Expect = 0.22
Identities = 14/85 (16%), Positives = 20/85 (23%), Gaps = 2/85 (2%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMM--GTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
P P P + P PPM PP P +T
Sbjct: 428 PGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQG 487
Query: 82 GPLYWTRLIVSPEIPSDTTPLWKEL 106
G +++ P + E
Sbjct: 488 GQNKKLAQVLASATPQMQKQVLGER 512
Score = 29.4 bits (66), Expect = 4.8
Identities = 12/48 (25%), Positives = 14/48 (29%), Gaps = 1/48 (2%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM 74
P P P P P G P + P PP P+M
Sbjct: 417 PRMSMMPTPMGPGGPLRPN-GLAPMNAVRAPSRNAQNAAQKPPMQPVM 463
>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
Length = 931
Score = 33.6 bits (77), Expect = 0.23
Identities = 12/61 (19%), Positives = 16/61 (26%), Gaps = 2/61 (3%)
Query: 13 KKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMG--TTPPPMMGTPPPPPPMMGTPPPP 70
+ PPP PP P P + P P P P+ +
Sbjct: 61 EAAPLPAAAESIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPVEPSLAAN 120
Query: 71 P 71
P
Sbjct: 121 P 121
Score = 30.6 bits (69), Expect = 1.9
Identities = 17/94 (18%), Positives = 24/94 (25%), Gaps = 4/94 (4%)
Query: 7 RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTT---PPPMMGTPPPPPPM 63
R R E + + Q A P PLP + + PP
Sbjct: 33 RELRSLVAEGAAGPVAKAAEQMAAPEAAEAAPLPAAAESIASPEVPPPVPPAPAQEGEAP 92
Query: 64 MGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPS 97
P P P P+ L +P +
Sbjct: 93 AAEQPSAVPAPSAAPAPAEPV-EPSLAANPFAAA 125
>gnl|CDD|233154 TIGR00859, ENaC, sodium channel transporter. The Epithelial Na+
Channel (ENaC) Family (TC 1.A.06)The ENaC family
consists of sodium channels from animals and has no
recognizable homologues in other eukaryotes or bacteria.
The vertebrate ENaC proteins from epithelial cells
cluster tightly together on the phylogenetic tree:
voltage-insensitive ENaC homologues are also found in
the brain. Eleven sequenced C. elegans proteins,
including the degenerins, are distantly related to the
vertebrate proteins as well as to each other. At least
some ofthese proteins form part of a mechano-transducing
complex for touch sensitivity. Other members of the ENaC
family, the acid-sensing ion channels, ASIC1-3,are homo-
or hetero-oligomeric neuronal H+-gated channels that
mediate pain sensation in response to tissue acidosis.
The homologous Helix aspersa(FMRF-amide)-activated Na+
channel is the first peptide neurotransmitter-gated
ionotropic receptor to be sequenced.Mammalian ENaC is
important for the maintenance of Na+ balance and the
regulation of blood pressure. Three homologous ENaC
subunits, a, b and g, havebeen shown to assemble to form
the highly Na+-selective channel.This model is designed
from the vertebrate members of the ENaC family
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 595
Score = 33.5 bits (77), Expect = 0.24
Identities = 15/64 (23%), Positives = 21/64 (32%), Gaps = 6/64 (9%)
Query: 6 KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMG 65
R ++ ++ P +A PP PP P P P + G
Sbjct: 533 WRFRKWWQRRRGPPYAEPPEPVSADTPPSLQLDDPPTFP--SALPLP----HASGLSLPG 586
Query: 66 TPPP 69
TPPP
Sbjct: 587 TPPP 590
Score = 32.4 bits (74), Expect = 0.56
Identities = 11/43 (25%), Positives = 13/43 (30%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
A PP P P T P + P + PPP
Sbjct: 548 AEPPEPVSADTPPSLQLDDPPTFPSALPLPHASGLSLPGTPPP 590
Score = 31.2 bits (71), Expect = 1.3
Identities = 12/50 (24%), Positives = 15/50 (30%), Gaps = 5/50 (10%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
PP PP P+ P PP P + P + TP
Sbjct: 544 GPPYAEPPEPVSADTPP-SLQLD----DPPTFPSALPLPHASGLSLPGTP 588
Score = 28.9 bits (65), Expect = 6.4
Identities = 10/25 (40%), Positives = 12/25 (48%), Gaps = 1/25 (4%)
Query: 30 PPPPPPPPPLPPPPPM-MGTTPPPM 53
PP P PLP + + TPPP
Sbjct: 567 PPTFPSALPLPHASGLSLPGTPPPN 591
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 33.1 bits (76), Expect = 0.27
Identities = 13/75 (17%), Positives = 17/75 (22%)
Query: 25 NVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
+ P P P P TP P P P T P
Sbjct: 167 STTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPA 226
Query: 85 YWTRLIVSPEIPSDT 99
+ +P+D
Sbjct: 227 APATPDGAAPLPTDQ 241
Score = 29.6 bits (67), Expect = 3.4
Identities = 14/60 (23%), Positives = 15/60 (25%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P V AP P P P P P G P P G + P
Sbjct: 192 PAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADP 251
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 33.3 bits (76), Expect = 0.29
Identities = 10/46 (21%), Positives = 12/46 (26%), Gaps = 1/46 (2%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP-PPPPMMGTPPPPPP 72
AAP P P + P P + PPP
Sbjct: 583 AAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPPP 628
Score = 29.8 bits (67), Expect = 3.2
Identities = 10/45 (22%), Positives = 11/45 (24%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
A P P P P P + P P P G
Sbjct: 575 AKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQA 619
Score = 29.0 bits (65), Expect = 7.2
Identities = 8/56 (14%), Positives = 9/56 (16%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
AA P P P P P + P
Sbjct: 573 AAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPPP 628
>gnl|CDD|219241 pfam06958, Pyocin_S, S-type Pyocin. This family represents a
conserved region approximately 180 residues long within
bacterial S-type pyocins. Pyocins are polypeptide toxins
produced by, and active against, bacteria. S-type
pyocins cause cell death by DNA breakdown due to
endonuclease activity.
Length = 139
Score = 31.9 bits (73), Expect = 0.30
Identities = 16/60 (26%), Positives = 19/60 (31%), Gaps = 7/60 (11%)
Query: 24 PNVQAAPPPPPPPP-----PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
V P P PP + +T P PP P + GTP P TP
Sbjct: 54 TGVYEFTLDAVPGRTILWTPASPPGDIPSSTSSP--APPPDTPVITGTPITPVVDPVITP 111
Score = 27.7 bits (62), Expect = 6.8
Identities = 10/36 (27%), Positives = 13/36 (36%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM 63
P P P P P + GT P++ P P
Sbjct: 79 DIPSSTSSPAPPPDTPVITGTPITPVVDPVITPYPA 114
Score = 27.7 bits (62), Expect = 7.0
Identities = 14/44 (31%), Positives = 15/44 (34%), Gaps = 2/44 (4%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
A PP P P P P + GTP P P P P
Sbjct: 74 ASPPGDIPSSTSSPAP--PPDTPVITGTPITPVVDPVITPYPAP 115
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 33.1 bits (75), Expect = 0.32
Identities = 19/68 (27%), Positives = 24/68 (35%), Gaps = 6/68 (8%)
Query: 8 AKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP 67
AK +K + +A PP P PP PP+M P P +M TP
Sbjct: 530 AKPHRKVQDGFQRSGRRQKRATPPKVSPSDRGPP------KASPPVMAPPSTGPRVMATP 583
Query: 68 PPPPPMMG 75
P M
Sbjct: 584 STGPRDMA 591
Score = 31.9 bits (72), Expect = 0.93
Identities = 12/42 (28%), Positives = 16/42 (38%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P P P+ P ++ PP PP + P PPP
Sbjct: 455 PSDQPSVPVEPAHLTPVEHTTVILHQPPQSPPTVAIKPAPPP 496
Score = 30.0 bits (67), Expect = 2.9
Identities = 16/55 (29%), Positives = 24/55 (43%), Gaps = 5/55 (9%)
Query: 24 PNVQAAPPPPPPP----PPLPPPPPMMGTTPPPMMGTPPPPPPM-MGTPPPPPPM 73
P++Q+ PP P LP ++ ++ P P P + T PPPPM
Sbjct: 852 PHLQSETGPPRLQLSQVPQLPYSQTLVSSSAPSWSSPQPRAPIRPIPTRFPPPPM 906
Score = 29.6 bits (66), Expect = 4.2
Identities = 14/50 (28%), Positives = 18/50 (36%), Gaps = 1/50 (2%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
S P + P P P P + T + P PP + P PPP
Sbjct: 448 STPE-RPGPSDQPSVPVEPAHLTPVEHTTVILHQPPQSPPTVAIKPAPPP 496
>gnl|CDD|236692 PRK10431, PRK10431, N-acetylmuramoyl-l-alanine amidase II;
Provisional.
Length = 445
Score = 32.9 bits (75), Expect = 0.34
Identities = 16/56 (28%), Positives = 24/56 (42%), Gaps = 8/56 (14%)
Query: 7 RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
A +R+ Y+ +F+ + A PPPPPP PP + P + P P
Sbjct: 109 EAVKRQNGSNYTVVFT---INADVPPPPPP-----PPVVAKRVETPAVVAPRVSEP 156
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 33.1 bits (75), Expect = 0.35
Identities = 14/47 (29%), Positives = 16/47 (34%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
Q + P P P PP G P P P P P M +P
Sbjct: 428 QPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPASQ 474
Score = 33.1 bits (75), Expect = 0.39
Identities = 20/58 (34%), Positives = 22/58 (37%), Gaps = 5/58 (8%)
Query: 24 PNVQAAPPP--PPPPPPLPPPPPMMGTTPPPM-MGTPPPPPPMMGTPPPPPPMMGTTP 78
P Q P PP P P P G+ PP G P PP P P P M +P
Sbjct: 416 PVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPP--ALMPSPSPQMSQSP 471
Score = 29.6 bits (66), Expect = 4.2
Identities = 11/42 (26%), Positives = 15/42 (35%), Gaps = 1/42 (2%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
PP +P PP +M +P P M P + P
Sbjct: 444 PPQSVSGGMIPSPPALM-PSPSPQMSQSPASQRTIQQDMVSP 484
>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region. The
defective chorion-1 gene (dec-1) in Drosophila encodes
follicle cell proteins necessary for proper eggshell
assembly. Multiple products of the dec-1 gene are formed
by alternative RNA splicing and proteolytic processing.
Cleavage products include S80 (80 kDa) which is
incorporated into the eggshell, and further proteolysis
of S80 gives S60 (60 kDa).
Length = 407
Score = 32.8 bits (74), Expect = 0.36
Identities = 11/32 (34%), Positives = 14/32 (43%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMG 55
P AAPP P P P P+ P ++G
Sbjct: 114 PAPAAAPPAAPAPAADTPAAPIPDAVQPAILG 145
Score = 31.7 bits (71), Expect = 0.85
Identities = 19/61 (31%), Positives = 22/61 (36%), Gaps = 13/61 (21%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
S+P + A P P P P P P P P P TP P P P +L
Sbjct: 98 SMPGLLGAAAPVPAPAPAPAAAP------------PAAPAPAADTPAAPIP-DAVQPAIL 144
Query: 82 G 82
G
Sbjct: 145 G 145
Score = 30.2 bits (67), Expect = 2.4
Identities = 13/42 (30%), Positives = 16/42 (38%), Gaps = 4/42 (9%)
Query: 37 PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P +P P ++G P P P P PP P TP
Sbjct: 94 PAMPSMPGLLGAAAP----VPAPAPAPAAAPPAAPAPAADTP 131
>gnl|CDD|221170 pfam11696, DUF3292, Protein of unknown function (DUF3292). This
eukaryotic family of proteins has no known function.
Length = 641
Score = 32.8 bits (75), Expect = 0.37
Identities = 11/44 (25%), Positives = 14/44 (31%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
AAP PPPP L P + + + P P
Sbjct: 402 AAPLPPPPSSSLRKAPSSPASIDHKQLNLGASEEEIDQAAAPEP 445
>gnl|CDD|221827 pfam12881, NUT_N, NUT protein N terminus. This family includes
the NUT protein. The gene encoding for NUT protein
(Nuclear Testis protein) is found fused to BRD3 or BRD4
genes, in some aggressive types of carcinoma, due to
chromosomal translocations. Proteins of the BRD family
contain two bromodomains that bind transcriptionally
active chromatin through associations with acetylated
histones H3 and H4. Such proteins are crucial for the
regulation of cell cycle progression. On the other
hand, little is known about NUT protein. NUT is known
to have a Nuclear Export Sequence (NES) as well as a
Nuclear Localization Signal (NLS), both located towards
the C-terminal end of the protein. A fused NUT-GFP
protein showed either cytoplasmic or nuclear
localization, suggesting that it is subject to
nuclear/cytoplasmic shuttling. Consistent with this
possibility, treatment with leptomycin B an inhibitor
of CRM1-dependent nuclear export resulted in
re-distribution of NUT-GFP to the nucleus. Inspection
of NUT revealed a C-terminal sequence similar to known
nuclear export sequences (NES) which are often
regulated by phosphorylation.
Length = 328
Score = 32.6 bits (74), Expect = 0.37
Identities = 15/57 (26%), Positives = 21/57 (36%), Gaps = 4/57 (7%)
Query: 32 PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM----GTTPHVLGPL 84
P PP P PP P PP + P P++ + P P++ G G
Sbjct: 22 PFLPPAPQPPDQPFWEPPPPLVTAGFSPGNPLVLSALPSMPLVTEGGGPGLSGAGAG 78
Score = 28.4 bits (63), Expect = 8.0
Identities = 18/60 (30%), Positives = 23/60 (38%), Gaps = 10/60 (16%)
Query: 24 PNVQAAP----PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM-MGTPPPPPPMMGTTP 78
P++ P P P LPP P PP PPPP + G P P ++ P
Sbjct: 5 PDMTMNPGASLSPFAALPFLPPAP-----QPPDQPFWEPPPPLVTAGFSPGNPLVLSALP 59
Score = 28.0 bits (62), Expect = 9.6
Identities = 15/47 (31%), Positives = 18/47 (38%), Gaps = 2/47 (4%)
Query: 26 VQAAPPPPPPPP--PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
+ P PPPPPP L P + P P T P + P P
Sbjct: 150 ILGLPLPPPPPPVAQLVPIVSLENAWPGPQGATGEGGPAAIQKPSPG 196
>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 226
Score = 32.2 bits (73), Expect = 0.41
Identities = 14/49 (28%), Positives = 18/49 (36%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+V P PP P P+ TP P+ PP + P P P
Sbjct: 87 ASVDPQPVAQPPVESTPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKP 135
Score = 31.8 bits (72), Expect = 0.61
Identities = 15/53 (28%), Positives = 17/53 (32%), Gaps = 5/53 (9%)
Query: 24 PNVQAAPPPPP-----PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P V++ P P P P PP P G P P P P P
Sbjct: 97 PPVESTPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKPVAEPAAAPTG 149
Score = 31.0 bits (70), Expect = 0.92
Identities = 12/45 (26%), Positives = 13/45 (28%), Gaps = 3/45 (6%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P P P P+ PP P P P P P P
Sbjct: 99 VESTPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTP---KPEPKPV 140
>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
[Intracellular trafficking and secretion].
Length = 360
Score = 32.4 bits (74), Expect = 0.43
Identities = 13/69 (18%), Positives = 17/69 (24%), Gaps = 1/69 (1%)
Query: 3 YMDKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
+ KR + SN+ P PP PP P P + P P
Sbjct: 45 ALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPP-PLPVDLGAPVLPDQQVEEAKDQP 103
Query: 63 MMGTPPPPP 71
Sbjct: 104 RRLRAAELA 112
>gnl|CDD|219753 pfam08226, DUF1720, Domain of unknown function (DUF1720). This
domain is found in different combinations with cortical
patch components EF hand, SH3 and ENTH and is therefore
likely to be involved in cytoskeletal processes. This
family contains many hypothetical proteins.
Length = 73
Score = 29.7 bits (67), Expect = 0.47
Identities = 13/60 (21%), Positives = 13/60 (21%), Gaps = 2/60 (3%)
Query: 26 VQAAPPPPPPPPPLPPPP--PMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P PL P P M P G P M T L P
Sbjct: 9 QPPQQQQPQQQGPLQPQPTGFMQPQPTGFGQQQQGLQPQQTGFQPQAGQQMPTGTGPLQP 68
>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding. The F-actin binding
domain forms a compact bundle of four antiparallel
alpha-helices, which are arranged in a left-handed
topology. Binding of F-actin to the F-actin binding
domain may result in cytoplasmic retention and
subcellular distribution of the protein, as well as
possible inhibition of protein function.
Length = 179
Score = 31.6 bits (71), Expect = 0.48
Identities = 17/40 (42%), Positives = 21/40 (52%), Gaps = 1/40 (2%)
Query: 38 PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
P+PP P +T P+ GTPP P P+ T P P M T
Sbjct: 6 PVPPAVPKPQSTAKPV-GTPPSPVPLPSTSPSPSKMANGT 44
>gnl|CDD|221598 pfam12484, PE_PPE_C, Polymorphic PE/PPE proteins C terminal.
This domain family is found in bacteria, and is
approximately 90 amino acids in length. The family is
found in association with pfam00823. There is a
conserved SVP sequence motif. There is a single
completely conserved residue W that may be functionally
important. The proteins in this family are PE/PPE
proteins implicated in immunostimulation and virulence.
Length = 81
Score = 30.0 bits (68), Expect = 0.51
Identities = 10/57 (17%), Positives = 11/57 (19%)
Query: 40 PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIP 96
P P + P P P Y R V P P
Sbjct: 25 PAVSAAAAALPGAGVAAAAEAAAGAVLPGMPGAAAAGRGSAGFPRYGFRPTVMPRPP 81
Score = 27.7 bits (62), Expect = 3.3
Identities = 11/49 (22%), Positives = 14/49 (28%), Gaps = 2/49 (4%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTP--PPPPPMMGTPP 68
+LP A + P P G P P +M PP
Sbjct: 33 ALPGAGVAAAAEAAAGAVLPGMPGAAAAGRGSAGFPRYGFRPTVMPRPP 81
>gnl|CDD|219053 pfam06482, Endostatin, Collagenase NC10 and Endostatin. NC10
stands for Non-helical region 10 and is taken from human
COL15A1. A mutation in this region in human COL18A1 is
associated with an increased risk of prostrate cancer.
This domain is cleaved from the precursor and forms
endostatin. Endostatin is a key tumour suppressor and
has been used highly successfully to treat cancer. It is
a potent angiogenesis inhibitor. Endostatin also binds a
zinc ion near the N-terminus; this is likely to be of
structural rather than functional importance according
to.
Length = 291
Score = 32.0 bits (73), Expect = 0.59
Identities = 12/70 (17%), Positives = 15/70 (21%), Gaps = 5/70 (7%)
Query: 15 ETYSNIFSLPNVQAAPPP-PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
+ Q P P P P PP + P P P P
Sbjct: 50 GELVPLPGTTATQPPPVVLTPWSDPRLPDPPHLPDPQTHSATAHRNPHP----PLNSPAR 105
Query: 74 MGTTPHVLGP 83
+ P
Sbjct: 106 IAHVHQDFQP 115
>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
This family consists mainly of the potato leaf roll
virus readthrough protein. This is generated via a
readthrough of open reading frame 3 a coat protein
allowing transcription of open reading frame 5 to give
an extended coat protein with a large c-terminal
addition or read through domain. The readthrough
protein is thought to play a role in the circulative
aphid transmission of potato leaf roll virus. Also in
the family is open reading frame 6 from beet western
yellows virus and potato leaf roll virus both
luteovirus and an unknown protein from cucurbit
aphid-borne yellows virus a closterovirus.
Length = 460
Score = 32.0 bits (73), Expect = 0.67
Identities = 14/37 (37%), Positives = 14/37 (37%), Gaps = 8/37 (21%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
V P P P P P PPPP P P P P
Sbjct: 1 VDGEPGPKPGPTPTPPPP-----APTP---EPTPAKH 29
>gnl|CDD|217439 pfam03229, Alpha_GJ, Alphavirus glycoprotein J.
Length = 126
Score = 30.5 bits (68), Expect = 0.69
Identities = 15/42 (35%), Positives = 17/42 (40%), Gaps = 1/42 (2%)
Query: 44 PMMGTTPPPMMGTPPP-PPPMMGTPPPPPPMMGTTPHVLGPL 84
G+T G PP P TP P PP P V+G L
Sbjct: 52 AAAGSTIAATAGAPPTDSTPAGETPGPFPPTDFALPLVIGGL 93
>gnl|CDD|220093 pfam09030, Creb_binding, Creb binding. The Creb binding domain
assumes a structure comprising of three alpha-helices
which pack in a bundle, exposing a hydrophobic groove
between alpha-1 and alpha-3 within which complimentary
domains found in the protein 'activator for thyroid
hormone and retinoid receptors' (ACTR) can dock.
Docking of these domains is required for the
recruitment of RNA polymerase II and the basal
transcription machinery.
Length = 104
Score = 30.0 bits (67), Expect = 0.71
Identities = 19/70 (27%), Positives = 23/70 (32%), Gaps = 5/70 (7%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMM---GTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYW 86
P P PLP M P P+M P G P +P+ L L
Sbjct: 2 QPGQWPQGPLPQQQQMQPGMPRPVMQMVAQHAVAGPRPGLVQPGISRGIVSPNALQDLL- 60
Query: 87 TRLIVSPEIP 96
R + SP P
Sbjct: 61 -RTLKSPSSP 69
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 31.8 bits (72), Expect = 0.72
Identities = 24/101 (23%), Positives = 31/101 (30%), Gaps = 3/101 (2%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP---PPPPPMMGTTPHVLGPLYWT 87
PP P P P ++ + P P T PPP M +P+ +
Sbjct: 449 PPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKP 508
Query: 88 RLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQ 128
SP P E V + N T L Q AQ
Sbjct: 509 PTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQ 549
>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
Provisional.
Length = 418
Score = 32.0 bits (73), Expect = 0.73
Identities = 13/51 (25%), Positives = 14/51 (27%), Gaps = 4/51 (7%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
P P P P P + P P PP P P PP
Sbjct: 137 AEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPA----PAAKPPPTPVA 183
Score = 30.0 bits (68), Expect = 2.5
Identities = 15/55 (27%), Positives = 15/55 (27%), Gaps = 2/55 (3%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
P AA P P T PP P PP P P TP
Sbjct: 129 PAAAAAAKAEKTTPEKPKAAAP--TPEPPAASKPTPPAAAKPPEPAPAAKPPPTP 181
Score = 29.7 bits (67), Expect = 3.9
Identities = 17/63 (26%), Positives = 19/63 (30%), Gaps = 1/63 (1%)
Query: 8 AKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLP-PPPPMMGTTPPPMMGTPPPPPPMMGT 66
A + P A P P PP P PP P P PPP P+
Sbjct: 126 AAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTPVARA 185
Query: 67 PPP 69
P
Sbjct: 186 DPR 188
Score = 28.9 bits (65), Expect = 7.0
Identities = 14/48 (29%), Positives = 15/48 (31%), Gaps = 1/48 (2%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPP-PMMGTPPPPPPMMGTPPPPPP 72
+ PP P TTP P P P PP P PP
Sbjct: 118 IDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAA 165
>gnl|CDD|221459 pfam12200, DUF3597, Domain of unknown function (DUF3597). This
family of proteins is found in bacteria, eukaryotes and
viruses. Proteins in this family are typically between
126 and 281 amino acids in length. The function of this
domain is unknown. The structure of this domain has
been found to contain five helices with a long flexible
loop between helices one and two.
Length = 124
Score = 30.4 bits (69), Expect = 0.73
Identities = 9/37 (24%), Positives = 10/37 (27%), Gaps = 4/37 (10%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM 63
AA P P P P TPP +
Sbjct: 14 AAAAPAPAAAPATAPAAA----AAAAPAATPPAAASV 46
Score = 28.5 bits (64), Expect = 3.8
Identities = 9/46 (19%), Positives = 10/46 (21%), Gaps = 13/46 (28%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
AA P P P P TPP +
Sbjct: 14 AAAAPAPAAAPATAPAA-------------AAAAAPAATPPAAASV 46
Score = 27.7 bits (62), Expect = 6.7
Identities = 7/35 (20%), Positives = 7/35 (20%), Gaps = 3/35 (8%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
A P P P P PP
Sbjct: 12 EAAAAAPAPAAAPATAP---AAAAAAAPAATPPAA 43
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 31.6 bits (72), Expect = 0.75
Identities = 12/51 (23%), Positives = 13/51 (25%), Gaps = 3/51 (5%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P A P P PP P T P P P+ P
Sbjct: 128 QQPPATTAQPQPVTPPRQTTAPVQPQTPAPV---RTQPAAPVTQAVEAPKV 175
Score = 30.8 bits (70), Expect = 1.2
Identities = 14/57 (24%), Positives = 15/57 (26%), Gaps = 2/57 (3%)
Query: 18 SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP--PPPPMMGTPPPPPP 72
S + Q PP P P PP T P P P T P
Sbjct: 117 STVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAP 173
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
Provisional.
Length = 576
Score = 31.9 bits (72), Expect = 0.82
Identities = 12/42 (28%), Positives = 13/42 (30%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P PP PP P + PP PP P P
Sbjct: 355 PDESNVPPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPN 396
>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
(LMP2) protein. This family consists of several
Gammaherpesvirus latent membrane protein (LMP2)
proteins. Epstein-Barr virus is a human
Gammaherpesvirus that infects and establishes latency
in B lymphocytes in vivo. The latent membrane protein 2
(LMP2) gene is expressed in latently infected B cells
and encodes two protein isoforms, LMP2A and LMP2B, that
are identical except for an additional N-terminal 119
aa cytoplasmic domain which is present in the LMP2A
isoform. LMP2A is thought to play a key role in either
the establishment or the maintenance of latency and/or
the reactivation of productive infection from the
latent state. The significance of LMP2B and its role in
pathogenesis remain unclear.
Length = 489
Score = 31.4 bits (71), Expect = 0.96
Identities = 16/74 (21%), Positives = 19/74 (25%), Gaps = 19/74 (25%)
Query: 14 KETYSNIFSLPNVQAAPP-PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPP---- 68
Y + F + PP P P PPP G P+ P
Sbjct: 28 NPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPPYGG----SNGDRHGGYQPLGQQDPSLYA 83
Query: 69 ----------PPPP 72
PPPP
Sbjct: 84 GLGQNGGGGLPPPP 97
Score = 29.1 bits (65), Expect = 5.1
Identities = 10/49 (20%), Positives = 12/49 (24%), Gaps = 8/49 (16%)
Query: 31 PPPPPPPPLP--------PPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P P P P + G P P + PPP
Sbjct: 11 AGGPRSHGGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPP 59
>gnl|CDD|215605 PLN03153, PLN03153, hypothetical protein; Provisional.
Length = 537
Score = 31.4 bits (71), Expect = 0.96
Identities = 10/42 (23%), Positives = 16/42 (38%), Gaps = 2/42 (4%)
Query: 6 KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMG 47
K + ++ + P V P PPPP P P ++
Sbjct: 61 KDWEGSPDTLLWNKRYHHPIV--TPLPPPPSSPSLPSSLLLD 100
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 31.6 bits (71), Expect = 0.99
Identities = 10/50 (20%), Positives = 15/50 (30%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
A P P P G PP ++ + P M+ + P
Sbjct: 250 LPRQNSAGPQRRVDQPSPPGGGSHRGRIPPSLLSSLPSEGSMLSSEWPQS 299
>gnl|CDD|152353 pfam11918, DUF3436, Domain of unknown function (DUF3436). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is about 50
amino acids in length. This domain is found associated
with pfam03572. This domain has two conserved sequence
motifs: DPRL and SYEP.
Length = 55
Score = 28.3 bits (64), Expect = 0.99
Identities = 7/18 (38%), Positives = 8/18 (44%)
Query: 24 PNVQAAPPPPPPPPPLPP 41
P+ AP P P L P
Sbjct: 23 PSYLEAPQQAPALPNLTP 40
>gnl|CDD|234398 TIGR03921, T7SS_mycosin, type VII secretion-associated serine
protease mycosin. Members of this family are
subtilisin-related serine proteases, found strictly in
the Actinobacteria and associated with type VII
secretion operons. The designation mycosin is used for
members from Mycobacterium [Protein fate, Protein and
peptide secretion and trafficking, Protein fate, Protein
modification and repair].
Length = 350
Score = 31.1 bits (71), Expect = 1.0
Identities = 12/39 (30%), Positives = 13/39 (33%), Gaps = 11/39 (28%)
Query: 34 PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
PP P P P P P+ PPPPP
Sbjct: 286 LPPEDGRPLRP-----------APAPARPVAAPAPPPPP 313
Score = 30.0 bits (68), Expect = 2.7
Identities = 14/42 (33%), Positives = 16/42 (38%), Gaps = 6/42 (14%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP 67
+ PP P P P P P+ P PPPP TP
Sbjct: 282 LTGELPPEDGRPLRPAPAPA-----RPVAA-PAPPPPPDDTP 317
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 31.4 bits (71), Expect = 1.1
Identities = 12/65 (18%), Positives = 15/65 (23%), Gaps = 6/65 (9%)
Query: 11 RKKKETYSNIFSLPNVQAAPPPPPPPPPLPPP---PPMMGTTPPPMMGTPPPPPPMMGTP 67
+ K T +AP P P PP P +P P
Sbjct: 76 KPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEE---KSPSEESATATAP 132
Query: 68 PPPPP 72
P
Sbjct: 133 ESPST 137
>gnl|CDD|234797 PRK00575, tatA, twin arginine translocase protein A; Provisional.
Length = 92
Score = 29.3 bits (66), Expect = 1.2
Identities = 6/36 (16%), Positives = 7/36 (19%)
Query: 34 PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
P TP P+ P G
Sbjct: 52 AEASAAAAQAPYQVATPTPVQSQRVDPAAASGQDST 87
Score = 27.8 bits (62), Expect = 3.8
Identities = 5/36 (13%), Positives = 9/36 (25%), Gaps = 3/36 (8%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
+A+ P + P P+ P
Sbjct: 53 EASAAAAQAPYQVATPTPVQSQRVDP---AAASGQD 85
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 30.8 bits (70), Expect = 1.2
Identities = 14/47 (29%), Positives = 18/47 (38%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
V A PP PP P + P + PP P++ P P P
Sbjct: 53 VAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIPKPEPKP 99
Score = 29.7 bits (67), Expect = 3.1
Identities = 13/45 (28%), Positives = 15/45 (33%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
+ PPP P+ P P P P P P P P P
Sbjct: 59 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKP 103
>gnl|CDD|236511 PRK09429, mepA, penicillin-insensitive murein endopeptidase;
Reviewed.
Length = 275
Score = 31.0 bits (71), Expect = 1.2
Identities = 18/60 (30%), Positives = 19/60 (31%), Gaps = 25/60 (41%)
Query: 24 PNVQAAPPPPP-----------PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P + PPPP P PP P T P P P PPP PP
Sbjct: 221 PECEDQAPPPPGDGCGAELASWSDAPFEPPKP----------TTKPKPKP----PPPLPP 266
>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
Length = 416
Score = 31.1 bits (70), Expect = 1.3
Identities = 19/88 (21%), Positives = 21/88 (23%), Gaps = 18/88 (20%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPP----MMGTPPPPPPMM------GTPPPPPPM 73
P P PP P G P P G P P PP
Sbjct: 268 PAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPKPGPP- 326
Query: 74 MGTTPHVLGPLYWTRLIVSPEIPSDTTP 101
P P W P + + T P
Sbjct: 327 -RPAPDADRPEGW------PSLEAITFP 347
Score = 29.2 bits (65), Expect = 4.4
Identities = 13/42 (30%), Positives = 14/42 (33%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
A P P P P P P + PPP TP P
Sbjct: 316 AAGGEPKPGPPRPAPDADRPEGWPSLEAITFPPPTPATPAVP 357
>gnl|CDD|215379 PLN02708, PLN02708, Probable pectinesterase/pectinesterase
inhibitor.
Length = 553
Score = 31.0 bits (70), Expect = 1.3
Identities = 10/32 (31%), Positives = 14/32 (43%)
Query: 20 IFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPP 51
+F P+ + P PPPP +TPP
Sbjct: 14 LFHSPSSSSNRHHHHHTPSPSPPPPSSPSTPP 45
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 30.8 bits (70), Expect = 1.4
Identities = 10/38 (26%), Positives = 12/38 (31%)
Query: 32 PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
P P P P P + P P+ P P P
Sbjct: 136 PQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEP 173
Score = 30.4 bits (69), Expect = 1.9
Identities = 12/49 (24%), Positives = 14/49 (28%), Gaps = 3/49 (6%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P A P P P P+ P P P P+ P P
Sbjct: 121 PAQPAPQPVQQPAYQPQPEQPLQQPVSPQ---VAPAPQPVHSAPQPAQQ 166
Score = 29.2 bits (66), Expect = 4.5
Identities = 11/49 (22%), Positives = 13/49 (26%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
P + P P P P P+ P P P P P
Sbjct: 136 PQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAE 184
Score = 29.2 bits (66), Expect = 4.9
Identities = 14/53 (26%), Positives = 15/53 (28%), Gaps = 2/53 (3%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
A P P PP PP P P P P P P P+
Sbjct: 98 SAQPRQPVQQPPEAQVPP--QHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSP 148
Score = 28.8 bits (65), Expect = 5.3
Identities = 11/47 (23%), Positives = 15/47 (31%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
P P P + P P + + P P P P+ P P
Sbjct: 138 PEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAE 184
>gnl|CDD|233055 TIGR00617, rpa1, replication factor-a protein 1 (rpa1). All
proteins in this family for which functions are known
are part of a multiprotein complex made up of homologs
of RPA1, RPA2 and RPA3 that bind ssDNA and function in
the recognition of DNA damage for nucleotide excision
repairThis family is based on the phylogenomic analysis
of JA Eisen (1999, Ph.D. Thesis, Stanford University)
[DNA metabolism, DNA replication, recombination, and
repair].
Length = 608
Score = 31.2 bits (71), Expect = 1.4
Identities = 14/69 (20%), Positives = 19/69 (27%), Gaps = 8/69 (11%)
Query: 19 NIFSLPNVQAAPPPPPPPPPLPPPP-------PMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
+ S Q P P PP P G PP P G+
Sbjct: 123 YLDSWHEEQVLASKPATNPANPPNAKAPKNEVASYNNAANPERGN-APPAPNSGSTRRVM 181
Query: 72 PMMGTTPHV 80
P+ +P+
Sbjct: 182 PIASLSPYQ 190
>gnl|CDD|148844 pfam07469, DUF1518, Domain of unknown function (DUF1518). This
domain, which is usually found tandemly repeated, is
found various receptor co-activating proteins.
Length = 56
Score = 27.9 bits (62), Expect = 1.4
Identities = 15/53 (28%), Positives = 16/53 (30%), Gaps = 3/53 (5%)
Query: 22 SLPNVQAAPPPPPPPP---PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
+P P P PP P PP M P P PMM P
Sbjct: 1 GMPGQMGNPRMPQAPPQQFPYPPNYGMGQQPDPAFTSPFSPQSPMMSPRMGPS 53
>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
Length = 694
Score = 31.1 bits (70), Expect = 1.4
Identities = 12/27 (44%), Positives = 13/27 (48%)
Query: 18 SNIFSLPNVQAAPPPPPPPPPLPPPPP 44
S +FS P PPP P P PP P
Sbjct: 632 SPLFSPPPAARFPPPDPALSPEPPALP 658
Score = 28.8 bits (64), Expect = 8.4
Identities = 24/106 (22%), Positives = 25/106 (23%), Gaps = 29/106 (27%)
Query: 23 LPNVQAAPPPPPPPPPLPPP--------------------------PPMMGTTPPPMMGT 56
P PPPPP P P G PPP
Sbjct: 435 APRRDNDPPPPPRARPGSTPACARRARAQRARDAGPEYVDPLGALRRLPAGAAPPPEPAA 494
Query: 57 PPPPP---PMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDT 99
P P MG PP P L P + PE D
Sbjct: 495 APSPATYYTRMGGGPPRLPPRNRATETLRPDWGPPAAAPPEQMEDP 540
>gnl|CDD|219837 pfam08430, Fork_head_N, Forkhead N-terminal region. The region
described in this family is found towards the N-terminus
of various eukaryotic fork head/HNF-3-related
transcription factors (which contain the pfam00250
domain). These proteins play key roles in embryogenesis,
maintenance of differentiated cell states, and
tumorigenesis.
Length = 137
Score = 29.6 bits (66), Expect = 1.5
Identities = 12/44 (27%), Positives = 12/44 (27%)
Query: 33 PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
P MG MGT P MG P P G
Sbjct: 61 NGMSPGYGGAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGG 104
Score = 29.6 bits (66), Expect = 1.5
Identities = 11/61 (18%), Positives = 15/61 (24%), Gaps = 7/61 (11%)
Query: 25 NVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP-------PPPPMMGTPPPPPPMMGTT 77
+ + P P MMG + +P P P G P M
Sbjct: 56 SSSSMNGMSPGYGGAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGGSLSPNMSMSRA 115
Query: 78 P 78
Sbjct: 116 S 116
>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein. This is is a family of
Papillomavirus proteins, E4, coded for by ORF4. A
splice variant, E1--E4, exists but neither the function
of E4 or E1--E4 is known.
Length = 95
Score = 28.8 bits (65), Expect = 1.6
Identities = 9/15 (60%), Positives = 9/15 (60%)
Query: 30 PPPPPPPPPLPPPPP 44
PP PPPPP P P
Sbjct: 21 TPPRPPPPPPPWAPK 35
Score = 27.2 bits (61), Expect = 5.3
Identities = 9/15 (60%), Positives = 9/15 (60%)
Query: 30 PPPPPPPPPLPPPPP 44
PP PPPP PP P
Sbjct: 20 TTPPRPPPPPPPWAP 34
Score = 26.9 bits (60), Expect = 7.0
Identities = 8/16 (50%), Positives = 8/16 (50%)
Query: 29 APPPPPPPPPLPPPPP 44
PP PP PPPP
Sbjct: 17 HTYTTPPRPPPPPPPW 32
Score = 26.9 bits (60), Expect = 9.1
Identities = 10/20 (50%), Positives = 10/20 (50%)
Query: 23 LPNVQAAPPPPPPPPPLPPP 42
L P PPPPPPP P
Sbjct: 16 LHTYTTPPRPPPPPPPWAPK 35
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 30.8 bits (70), Expect = 1.6
Identities = 12/46 (26%), Positives = 21/46 (45%)
Query: 8 AKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPM 53
+ R++K + S++ P++ P P PP PP T P +
Sbjct: 32 PRPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAATTQTPQL 77
Score = 28.9 bits (65), Expect = 5.9
Identities = 10/29 (34%), Positives = 10/29 (34%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPP 58
P P P L P P T PP P
Sbjct: 47 TPSILPLPKLSSPSPPSVTLPPAATTQTP 75
Score = 28.9 bits (65), Expect = 6.9
Identities = 16/67 (23%), Positives = 22/67 (32%), Gaps = 1/67 (1%)
Query: 18 SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
S+ S + + P P P P P ++ P P + PP T
Sbjct: 15 SSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAATTQT 74
Query: 78 PHVLGPL 84
P L PL
Sbjct: 75 PQ-LNPL 80
>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
factor N terminal domain. The N terminus of the PEA3
transcription factors is implicated in transactivation
and in inhibition of DNA binding. Transactivation is
potentiated by activation of the Ras/MAP kinase and
protein kinase A signalling cascades. The N terminal
region contains conserved MAP kinase phosphorylation
sites.
Length = 336
Score = 30.6 bits (69), Expect = 1.6
Identities = 19/63 (30%), Positives = 20/63 (31%), Gaps = 5/63 (7%)
Query: 17 YSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMM-----GTPPPPPPMMGTPPPPP 71
YS P PP PP P P P P P P P+ P PP
Sbjct: 119 YSAYDRKPASGFKPPTPPSTPCSPVNPQETVRQLQPSGPLSNSSPPSPHTPLPNQSPLPP 178
Query: 72 PMM 74
PM
Sbjct: 179 PMS 181
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 30.9 bits (70), Expect = 1.6
Identities = 15/55 (27%), Positives = 16/55 (29%), Gaps = 7/55 (12%)
Query: 24 PNVQAAPPPPPPPP-------PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P+ PPPP P P P G P PP P P P
Sbjct: 90 PSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARP 144
Score = 30.6 bits (69), Expect = 2.3
Identities = 15/63 (23%), Positives = 18/63 (28%), Gaps = 4/63 (6%)
Query: 23 LPNVQAAPP---PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
LP + A P P P P G +G PP P P P+
Sbjct: 139 LPTARPAYPAYQQRPEPGAWPRAADDYGWQQQR-LGFPPRAPYASPASYAPEQERDREPY 197
Query: 80 VLG 82
G
Sbjct: 198 DAG 200
>gnl|CDD|181205 PRK08040, PRK08040, putative semialdehyde dehydrogenase;
Provisional.
Length = 336
Score = 30.4 bits (69), Expect = 1.6
Identities = 31/98 (31%), Positives = 43/98 (43%), Gaps = 22/98 (22%)
Query: 147 RLIDSKRS--QNVGILAQSLHIEFSEIESAIFNMDASVVSLEALQQIYDVRATDEEINLI 204
RL+D R Q+ G L I S ++S +F A +V EAL+ + A +E
Sbjct: 214 RLVDQVRKILQDEG-----LPISVSCVQSPVFYGHAQMVHFEALRPL----AAEE----A 260
Query: 205 RAHLASNSDLQL----DKPEQFLADLADIPHFSERIAC 238
R L D+ L D P Q + D + PH S I C
Sbjct: 261 RDALEQGEDIVLSEENDYPTQ-VGDASGNPHLS--IGC 295
>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
(DUF1750). This is a fungal domain of unknown function.
Length = 669
Score = 30.8 bits (69), Expect = 1.7
Identities = 13/36 (36%), Positives = 14/36 (38%), Gaps = 1/36 (2%)
Query: 37 PPLPPPPPMMGTTP-PPMMGTPPPPPPMMGTPPPPP 71
PP P P M+G P P G PP G P
Sbjct: 216 PPGPYPNAMVGRQPFYPQPGAVAGPPKRRGGHKAPR 251
>gnl|CDD|216078 pfam00716, Peptidase_S21, Assemblin (Peptidase family S21).
Length = 326
Score = 30.5 bits (69), Expect = 1.8
Identities = 14/51 (27%), Positives = 14/51 (27%), Gaps = 11/51 (21%)
Query: 18 SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPP 68
L QA PP P P PP PP P P P
Sbjct: 287 DAYNQLLAGQAFNQPPDPQGPAPPAELA-----------PPAPAPPPPANP 326
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 30.8 bits (69), Expect = 1.8
Identities = 19/57 (33%), Positives = 20/57 (35%), Gaps = 2/57 (3%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMM--GTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
AP P P P P P + PP PP G PP P P M V G
Sbjct: 408 TAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPG 464
Score = 30.8 bits (69), Expect = 1.8
Identities = 25/73 (34%), Positives = 28/73 (38%), Gaps = 10/73 (13%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTP---PPMMGTPPPP--PPMMGTPPPPPPMMG--- 75
P AAP P P PP PP G P P M P P P + + PP +G
Sbjct: 424 PAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPS 483
Query: 76 -TTPHV-LGPLYW 86
T H GP W
Sbjct: 484 DTAEHTPSGPRTW 496
Score = 29.3 bits (65), Expect = 5.2
Identities = 18/49 (36%), Positives = 18/49 (36%), Gaps = 8/49 (16%)
Query: 26 VQAAPPPPPPPPPLPP---PPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
A P P P P P PP PP G PP P P M P P
Sbjct: 420 APATPAPSAAPSPRVPWDDAPPA-----PPRSGIPPRPAPRMPEASPVP 463
>gnl|CDD|235166 PRK03837, PRK03837, transcriptional regulator NanR; Provisional.
Length = 241
Score = 30.4 bits (69), Expect = 1.8
Identities = 16/43 (37%), Positives = 21/43 (48%), Gaps = 4/43 (9%)
Query: 195 RATDEEINLIRAHLASNSDLQLDKPEQFLADLADIPHFSERIA 237
ATDE+I L+R L NS L F+ +D+ F IA
Sbjct: 125 HATDEQIALLRKALERNSQ-SLGDNAAFIR--SDM-EFHRVIA 163
>gnl|CDD|218602 pfam05478, Prominin, Prominin. The prominins are an emerging
family of proteins that among the multispan membrane
proteins display a novel topology. Mouse prominin and
human prominin (mouse)-like 1 (PROML1) are predicted to
contain five membrane spanning domains, with an
N-terminal domain exposed to the extracellular space
followed by four, alternating small cytoplasmic and
large extracellular, loops and a cytoplasmic C-terminal
domain. The exact function of prominin is unknown
although in humans defects in PROM1, the gene coding for
prominin, cause retinal degeneration.
Length = 807
Score = 30.8 bits (70), Expect = 1.8
Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 3/58 (5%)
Query: 346 DVDRAASVVFDDIHSQL-STLAKKLDAVTISMNKVVQESKPDHMEPFRTKMESCVKTG 402
D+D A + IH +L S + L+A+ + +V+QE K D +E ++ + G
Sbjct: 215 DLDGAGEHLGGRIHDELKSHVLPVLEAIV-ELGQVLQEMK-DGLESVNKTLKELREGG 270
>gnl|CDD|216325 pfam01140, Gag_MA, Matrix protein (MA), p15. The matrix protein,
p15, is encoded by the gag gene. MA is involved in
pathogenicity.
Length = 129
Score = 29.4 bits (66), Expect = 1.9
Identities = 12/42 (28%), Positives = 17/42 (40%), Gaps = 5/42 (11%)
Query: 32 PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
PPP P PP ++ + P +P P PP P +
Sbjct: 92 PPPWVKPFVDPPKVLLPSSTPKPVSPSPS-----APPRPSSL 128
>gnl|CDD|236886 PRK11249, katE, hydroperoxidase II; Provisional.
Length = 752
Score = 30.8 bits (70), Expect = 2.0
Identities = 13/52 (25%), Positives = 14/52 (26%), Gaps = 8/52 (15%)
Query: 30 PPPPPPPPPLPPP----PPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
P PL P M P P P P TPP P +
Sbjct: 4 KNPHQHQSPLHDSSEAKPGMDSLAPEDGSHRPAPEP----TPPGAQPTAPGS 51
>gnl|CDD|221518 pfam12301, CD99L2, CD99 antigen like protein 2. This family of
proteins is found in eukaryotes. Proteins in this
family are typically between 165 and 237 amino acids in
length. CD99L2 and CD99 are involved in
trans-endothelial migration of neutrophils in vitro and
in the recruitment of neutrophils into inflamed
peritoneum.
Length = 154
Score = 29.4 bits (66), Expect = 2.1
Identities = 10/49 (20%), Positives = 11/49 (22%), Gaps = 6/49 (12%)
Query: 28 AAPPPPPPPPPLPPPPPM----MGTTPPPMMGTPPPPPPMMGTPPPPPP 72
A P P P P + P P P P P
Sbjct: 12 AMKPTKKPATPKKPSKGDDDFDLEDALDGGNDGGAGPKP--KPRPQPDP 58
>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
Length = 445
Score = 30.4 bits (69), Expect = 2.1
Identities = 9/51 (17%), Positives = 14/51 (27%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
+ P PP LPP P ++ PP+ +
Sbjct: 13 SSDDSPSLEPPELPPKKKGRRRVSPVEEEEEEEEAEVVAVGFSYPPVRISR 63
>gnl|CDD|234012 TIGR02784, addA_alphas, double-strand break repair helicase AddA,
alphaproteobacterial type. AddAB, also called RexAB,
substitutes for RecBCD in several bacterial lineages.
These DNA recombination proteins act before synapse and
are particularly important for DNA repair of
double-stranded breaks by homologous recombination. The
term AddAB is used broadly, with AddA homologous between
the alphaproteobacteria (as modeled here) and the
Firmicutes, while the partner AddB proteins show no
strong homology across the two groups of species [DNA
metabolism, DNA replication, recombination, and repair].
Length = 1135
Score = 30.4 bits (69), Expect = 2.3
Identities = 7/35 (20%), Positives = 8/35 (22%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
P L P + P P P P
Sbjct: 917 GLPVEAAQTDTLEALPDWLRAPAPAEPALPRPLAP 951
Score = 29.7 bits (67), Expect = 4.1
Identities = 6/35 (17%), Positives = 8/35 (22%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
P P+ P + P P P
Sbjct: 909 TRRDWDPVGLPVEAAQTDTLEALPDWLRAPAPAEP 943
>gnl|CDD|226435 COG3921, COG3921, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 300
Score = 30.1 bits (68), Expect = 2.4
Identities = 10/44 (22%), Positives = 12/44 (27%)
Query: 32 PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
P P P P + P P+ PP PP
Sbjct: 71 PKGLPILALAGPVGELGQPMDLPAPANPGDPLALPEPPSPPTKP 114
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 29.8 bits (67), Expect = 2.4
Identities = 14/59 (23%), Positives = 16/59 (27%), Gaps = 13/59 (22%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP-------------PPPM 73
+P P PP P PP +T P P PPPM
Sbjct: 121 ISPATSPVPPVPLPDPPAFPSTLPANAAAAAAAQQQRDVPQNLINRHRMPPSFQQPPPM 179
>gnl|CDD|193414 pfam12941, HCV_NS5a_C, HCV NS5a protein C-terminal region. This is
a family of proteins found in the hepatitis C virus.
This family contains the C-terminal region of the NS5A
protein. CC The molecular function of the non-structural
5a protein is uncertain. The NS5a protein is
phosphorylated when expressed in mammalian cells. It is
thought to interact with the ds RNA dependent
(interferon inducible) kinase PKR.
Length = 244
Score = 29.7 bits (67), Expect = 2.5
Identities = 22/55 (40%), Positives = 25/55 (45%), Gaps = 15/55 (27%)
Query: 30 PPPPPPPPPLP----PP--PPMMGT------TPPPMMGTPPPPPPMMGTPPPPPP 72
PP P PP LP P PP++ T PP + G PP G PP PPP
Sbjct: 99 PPRPKFPPALPIWARPDYNPPLLETWKAPDYEPPTVHGCALPPA---GLPPVPPP 150
Score = 28.2 bits (63), Expect = 9.4
Identities = 9/19 (47%), Positives = 10/19 (52%)
Query: 24 PNVQAAPPPPPPPPPLPPP 42
P V PP PP+PPP
Sbjct: 132 PTVHGCALPPAGLPPVPPP 150
>gnl|CDD|179538 PRK03100, PRK03100, sec-independent translocase; Provisional.
Length = 136
Score = 28.8 bits (65), Expect = 2.6
Identities = 10/38 (26%), Positives = 11/38 (28%), Gaps = 6/38 (15%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP 59
N + P P P PP P G PP
Sbjct: 99 QTGNGRPTPSSPDAAGPAPPDA------TPLAPGERPP 130
>gnl|CDD|221759 pfam12764, Gly-rich_Ago1, Glycine-rich region of argonaut. This
domain is often found at the very N-terminal of
argonaut-like proteins.
Length = 102
Score = 28.4 bits (63), Expect = 2.6
Identities = 10/55 (18%), Positives = 13/55 (23%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
PP P +P + T P P + PP L
Sbjct: 31 STGGPPRPSVPELHQATQAPYQAAVSTQPAPSEASSSSQPPESSSLQVTQQFQQL 85
>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 155
Score = 29.0 bits (66), Expect = 2.7
Identities = 11/55 (20%), Positives = 12/55 (21%), Gaps = 5/55 (9%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP-----PPMMGT 76
AA P P P P P PM+GT
Sbjct: 35 AAAAPVAPVAQQAAAAPVAAAPAAAAAAAAAPAAAPAAAAAEAEGHVVTSPMVGT 89
Score = 28.6 bits (65), Expect = 4.5
Identities = 10/48 (20%), Positives = 11/48 (22%), Gaps = 5/48 (10%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP-----PPMMGT 66
P Q A P P P PM+GT
Sbjct: 42 PVAQQAAAAPVAAAPAAAAAAAAAPAAAPAAAAAEAEGHVVTSPMVGT 89
>gnl|CDD|219061 pfam06495, Transformer, Fruit fly transformer protein. This family
consists of transformer proteins from several Drosophila
species and also from Ceratitis capitata (Mediterranean
fruit fly). The transformer locus (tra) produces an RNA
processing protein that alternatively splices the
doublesex pre-mRNA in the sex determination hierarchy of
Drosophila melanogaster.
Length = 182
Score = 29.2 bits (65), Expect = 2.7
Identities = 21/72 (29%), Positives = 27/72 (37%), Gaps = 9/72 (12%)
Query: 9 KRRKKKETYSNIFSLPNVQAAPPPPPP---------PPPLPPPPPMMGTTPPPMMGTPPP 59
+RR++ + S P + P P P PPP P M G P M+ P P
Sbjct: 97 ERRRRSRSRSRYSRTPRIITVPVPVPAADYPYAYGWPPPAPQFSGMQGAFPYGMLPRPVP 156
Query: 60 PPPMMGTPPPPP 71
P PP P
Sbjct: 157 PYFAPYPRPPAP 168
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 29.5 bits (66), Expect = 2.7
Identities = 14/40 (35%), Positives = 15/40 (37%), Gaps = 2/40 (5%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
PP P P PPP P PP T + PPP
Sbjct: 53 PPTTPHPSSQPPPCPPSPGHPPQTNDTHEKDLAL--QPPP 90
Score = 28.4 bits (63), Expect = 7.3
Identities = 14/40 (35%), Positives = 15/40 (37%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP 60
F P PPP PP P PP T + PPP
Sbjct: 51 FCPPTTPHPSSQPPPCPPSPGHPPQTNDTHEKDLALQPPP 90
>gnl|CDD|178678 PLN03132, PLN03132, NADH dehydrogenase (ubiquinone) flavoprotein
1; Provisional.
Length = 461
Score = 30.2 bits (68), Expect = 2.8
Identities = 11/19 (57%), Positives = 11/19 (57%)
Query: 26 VQAAPPPPPPPPPLPPPPP 44
QAA P PP PPPPP
Sbjct: 7 TQAAATAATPQPPPPPPPP 25
Score = 29.5 bits (66), Expect = 4.7
Identities = 9/22 (40%), Positives = 9/22 (40%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGT 48
A P P PPPPP T
Sbjct: 7 TQAAATAATPQPPPPPPPPEKT 28
Score = 29.1 bits (65), Expect = 4.9
Identities = 11/21 (52%), Positives = 11/21 (52%)
Query: 21 FSLPNVQAAPPPPPPPPPLPP 41
FS A P PPPPP PP
Sbjct: 5 FSTQAAATAATPQPPPPPPPP 25
>gnl|CDD|190856 pfam04078, Rcd1, Cell differentiation family, Rcd1-like. Two of
the members in this family have been characterized as
being involved in regulation of Ste11 regulated sex
genes. Mammalian Rcd1 is a novel transcriptional
cofactor that mediates retinoic acid-induced cell
differentiation.
Length = 262
Score = 29.9 bits (68), Expect = 2.8
Identities = 12/34 (35%), Positives = 20/34 (58%)
Query: 301 LEILPKLKDVRSKDNSVTLLHFIVRTYLRNSENP 334
+L K+ + ++ S LL ++R YLR S+NP
Sbjct: 187 ATVLNKMVEQLTRQPSPRLLKHVIRCYLRLSDNP 220
>gnl|CDD|223067 PHA03380, PHA03380, transactivating tegument protein VP16;
Provisional.
Length = 432
Score = 29.6 bits (67), Expect = 3.2
Identities = 17/70 (24%), Positives = 22/70 (31%), Gaps = 17/70 (24%)
Query: 11 RKKKETYSNIFSLPNVQAAPPPPPPPPPL-------PPPPPMMGTTPPPMMGTPPPPPPM 63
R K E YS P P L P G+T ++ P P +
Sbjct: 332 RAKLEAYSEAH----------PSEPLSVLREHTYARRREKPNYGSTIEALLDPPSPSAVL 381
Query: 64 MGTPPPPPPM 73
G P PP +
Sbjct: 382 PGDPVPPLTV 391
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 29.7 bits (67), Expect = 3.2
Identities = 13/52 (25%), Positives = 15/52 (28%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM 74
LP PPP LP TP PP + P P +
Sbjct: 62 LPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTIQ 113
Score = 29.3 bits (66), Expect = 4.3
Identities = 13/49 (26%), Positives = 15/49 (30%), Gaps = 3/49 (6%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM---GTPPPPPPM 73
+P PPP LP P P T G+P PP
Sbjct: 51 PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPA 99
Score = 28.6 bits (64), Expect = 7.3
Identities = 11/60 (18%), Positives = 14/60 (23%), Gaps = 4/60 (6%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP----PPPMMGTPPPPPPMMGTTPH 79
+ PPP P P P P + P PP + P
Sbjct: 49 WDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPA 108
>gnl|CDD|177646 PHA03418, PHA03418, hypothetical E4 protein; Provisional.
Length = 230
Score = 29.3 bits (65), Expect = 3.3
Identities = 17/54 (31%), Positives = 18/54 (33%), Gaps = 5/54 (9%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP-----PMMGTPPPPPPMMGTTPH 79
P P P P G PP PP + G PPP PP PH
Sbjct: 132 PLHGDPDPPPGATGGQGEEPPEGGEESQPPLGEGEGAVEGHPPPLPPAPEPKPH 185
Score = 28.6 bits (63), Expect = 5.7
Identities = 25/97 (25%), Positives = 35/97 (36%), Gaps = 21/97 (21%)
Query: 27 QAAPPPPP---------PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP----- 72
P PPP PP P +G + G PPP PP P P P
Sbjct: 134 HGDPDPPPGATGGQGEEPPEGGEESQPPLGEGEGAVEGHPPPLPP----APEPKPHNGDA 189
Query: 73 ---MMGTTPHVLGPLYWTRLIVSPEIPSDTTPLWKEL 106
++GT +LG + + +I D W++L
Sbjct: 190 THGLLGTVACLLGTWEESFRQLVEDIQEDLDDYWRKL 226
>gnl|CDD|219490 pfam07627, PSCyt3, Protein of unknown function (DUF1588). A
region of similarity shared by several Rhodopirellula
baltica cytochrome-like proteins that are predicted to
be secreted. These proteins also match pfam07626 and
pfam07624.
Length = 102
Score = 27.9 bits (63), Expect = 3.3
Identities = 10/23 (43%), Positives = 10/23 (43%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTT 49
PPPPP P L P P T
Sbjct: 37 DPPPPPPPNVPALEPDDPGALTL 59
>gnl|CDD|237940 PRK15313, PRK15313, autotransport protein MisL; Provisional.
Length = 955
Score = 29.8 bits (66), Expect = 3.4
Identities = 28/102 (27%), Positives = 38/102 (37%), Gaps = 28/102 (27%)
Query: 11 RKKKETYSNIFSLPNVQAAPPPPPP--PPPLPPPP----PMMGTTPPPMMGTPPPPPPMM 64
+K K Y + P+ P P P P P+ P P P+ P P++ P P P+
Sbjct: 547 QKGKNWYLTSYIEPDEPIIPDPVDPVIPDPVIPDPVDPDPVDPVIPDPVIPDPVDPDPV- 605
Query: 65 GTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIP----SDTTPL 102
P P P +I P IP SDT P+
Sbjct: 606 -DPEPVDP----------------VIPDPTIPDIGQSDTPPI 630
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 29.9 bits (67), Expect = 3.5
Identities = 30/176 (17%), Positives = 47/176 (26%), Gaps = 12/176 (6%)
Query: 25 NVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTP-PPPPPMMGTPPPPPPM-MGTTPHVLG 82
+ P P P P TP + TP TP P M
Sbjct: 262 RQKNTPAQTPMPERSWQTPA---QTPARRISTPMTEEIKSWQTPLQTPAMYSSDYQAPKP 318
Query: 83 PLYWTRLIVSPEIPSDTTPLWKELEEVPIN--NLEEFTELFSRQVTAQRPVTRKRQQKSS 140
+T + E L + + + R+V + + S
Sbjct: 319 EPIYTWEELLRERFPSDLFAISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDS 378
Query: 141 KVQNVARLIDSKRSQNVGILAQSLHI---EFSEIESAIFNMDASVVSLEALQQIYD 193
V + KR + L + E S+ M+ V A+Q+ YD
Sbjct: 379 DEDEVDYEKERKRRREED--KNFLRLKALELSKYAGVNERMEKIVQVTRAMQRTYD 432
>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
Length = 306
Score = 29.6 bits (66), Expect = 3.5
Identities = 19/61 (31%), Positives = 23/61 (37%), Gaps = 6/61 (9%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP---PPPPPMMGTTPH 79
L + A PP P P P P + P+ P P + P P P P TPH
Sbjct: 12 LAGLVANVPPAAAPTPQPNP---VIQPQAPVQPGQPGAPQQLAIPTQQPQPVPTSAMTPH 68
Query: 80 V 80
V
Sbjct: 69 V 69
>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
Provisional.
Length = 376
Score = 29.5 bits (66), Expect = 3.6
Identities = 20/80 (25%), Positives = 21/80 (26%), Gaps = 29/80 (36%)
Query: 11 RKKKETYSNIFSLPNVQAAPPPP------------PPPPPLPPPPPMMGTTPPPMMGTPP 58
R KKE P PP P P +PP PP P
Sbjct: 52 RSKKEN------------EPAPPSTMIATNTKPFHPAPIDVPPDPP-----AAQEAVQPT 94
Query: 59 PPPPMMGTPPPPPPMMGTTP 78
PP P P TP
Sbjct: 95 APPSAQSEPERNEPRPEETP 114
Score = 28.3 bits (63), Expect = 8.5
Identities = 10/51 (19%), Positives = 14/51 (27%), Gaps = 7/51 (13%)
Query: 40 PPPPPMMGTTPPP-------MMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
PP M+ T P + PP + PP + P
Sbjct: 60 APPSTMIATNTKPFHPAPIDVPPDPPAAQEAVQPTAPPSAQSEPERNEPRP 110
>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
dihydrolipoamide acetyltransferase, long form. This
model describes a subset of pyruvate dehydrogenase
complex dihydrolipoamide acetyltransferase specifically
close by both phylogenetic and per cent identity (UPGMA)
trees. Members of this set include two or three copies
of the lipoyl-binding domain. E. coli AceF is a member
of this model, while mitochondrial and some other
bacterial forms belong to a separate model [Energy
metabolism, Pyruvate dehydrogenase].
Length = 546
Score = 29.5 bits (66), Expect = 3.9
Identities = 11/51 (21%), Positives = 13/51 (25%), Gaps = 3/51 (5%)
Query: 20 IFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
I +L + P P P P T P P P P
Sbjct: 186 ILTLSVAGSTPATAPAPASAQPAAQSPAATQPEPAAAPAAAK---AQAPAP 233
>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp).
Length = 305
Score = 29.3 bits (66), Expect = 3.9
Identities = 8/30 (26%), Positives = 10/30 (33%), Gaps = 5/30 (16%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGT 56
+P P P P P P P +G
Sbjct: 85 TPSPTTVATPSPSPTPVP-----SSPAVGN 109
Score = 28.5 bits (64), Expect = 6.3
Identities = 11/29 (37%), Positives = 12/29 (41%), Gaps = 3/29 (10%)
Query: 48 TTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
TP P P P P TP P P +G
Sbjct: 84 DTPSPTTVATPSPSP---TPVPSSPAVGN 109
Score = 28.5 bits (64), Expect = 7.2
Identities = 9/27 (33%), Positives = 12/27 (44%), Gaps = 3/27 (11%)
Query: 40 PPPPPMMGTTPPPMMGTPPPPPPMMGT 66
P P + +P P TP P P +G
Sbjct: 86 PSPTTVATPSPSP---TPVPSSPAVGN 109
>gnl|CDD|215915 pfam00429, TLV_coat, ENV polyprotein (coat polyprotein).
Length = 560
Score = 29.8 bits (67), Expect = 3.9
Identities = 20/72 (27%), Positives = 23/72 (31%), Gaps = 22/72 (30%)
Query: 32 PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIV 91
PP P PL PP P P PP GT +L L L +
Sbjct: 212 PPSQPHPLAPPLP----------------------RPTSPPGPGTGDRLLDLLQGAYLTL 249
Query: 92 SPEIPSDTTPLW 103
+ PS T W
Sbjct: 250 NATNPSLTQDCW 261
>gnl|CDD|218566 pfam05349, GATA-N, GATA-type transcription activator, N-terminal.
GATA transcription factors mediate cell differentiation
in a diverse range of tissues. Mutation are often
associated with certain congenital human disorders. The
six classical vertebrate GATA proteins, GATA-1 to
GATA-6, are highly homologous and have two tandem zinc
fingers. The classical GATA transcription factors
function transcription activators. In lower metazoans
GATA proteins carry a single canonical zinc finger. This
family represents the N-terminal domain of the family of
GATA transcription activators.
Length = 177
Score = 29.0 bits (65), Expect = 3.9
Identities = 12/60 (20%), Positives = 16/60 (26%)
Query: 18 SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
S + SLP +Q P P PP+ + PPM
Sbjct: 41 SMLPSLPYLQGCEASQQAHALAAHPGWSQAAAESSAFNPGSPHPPVGFSYSHSPPMSNGG 100
>gnl|CDD|180714 PRK06827, PRK06827, phosphoribosylpyrophosphate synthetase;
Provisional.
Length = 382
Score = 29.5 bits (67), Expect = 3.9
Identities = 11/36 (30%), Positives = 20/36 (55%)
Query: 241 FEADFSDSIALIDSKLNNIKSVCEFLMKSEHLKKVI 276
E D S IA I LN+ S+ + L ++ ++K++
Sbjct: 342 IEVDMSKLIARIIDALNHDVSLSKLLDPTDIIQKLL 377
>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein. This domain is
found in the aflatoxin regulatory protein (AflR) which
is involved in the regulation of the biosynthesis of
aflatoxin in the fungal genus Aspergillus. It occurs
together with the fungal Zn(2)-Cys(6) binuclear cluster
domain (pfam00172).
Length = 275
Score = 29.1 bits (65), Expect = 3.9
Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 1/52 (1%)
Query: 16 TYSNIFSLPNVQAAPPPPPPPPPLPPPP-PMMGTTPPPMMGTPPPPPPMMGT 66
T S+ P VQ+ P PP P P +P +PP P + G+
Sbjct: 21 TTSSSHPQPPVQSGPSSIQPPVATPHTPNGTSSPSPKFSHQSPPAEPELWGS 72
>gnl|CDD|150201 pfam09447, Cnl2_NKP2, Cnl2/NKP2 family protein. This family
includes the Cnl2 kinetochore protein.
Length = 67
Score = 26.9 bits (60), Expect = 4.0
Identities = 14/55 (25%), Positives = 28/55 (50%), Gaps = 11/55 (20%)
Query: 114 LEEFTELFSRQVTAQRPVTRKRQQKSSKVQNVARLIDSKRSQNVGILAQSLHIEF 168
LE+FTELF R+V + +++ + R + +R + + +A ++ EF
Sbjct: 22 LEQFTELFPRRV-----------RSEPQIRTLYRELQRQREKVLDKVAANIEQEF 65
>gnl|CDD|233381 TIGR01371, met_syn_B12ind,
5-methyltetrahydropteroyltriglutamate--homocysteine
S-methyltransferase. This model describes the
cobalamin-independent methionine synthase. A family of
uncharacterized archaeal proteins is homologous to the
C-terminal region of this family. That family is
excluded from this model but, along with this family,
belongs to Pfam model pfam01717 [Amino acid
biosynthesis, Aspartate family].
Length = 750
Score = 29.6 bits (67), Expect = 4.1
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 167 EFSEIESAIFNMDASVVSLEA 187
EF+EI +I ++DA V+S+EA
Sbjct: 642 EFNEIIESIADLDADVISIEA 662
>gnl|CDD|235709 PRK06132, PRK06132, hypothetical protein; Provisional.
Length = 359
Score = 29.3 bits (66), Expect = 4.2
Identities = 10/51 (19%), Positives = 12/51 (23%), Gaps = 3/51 (5%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
+ P + A P P P P P P P P
Sbjct: 179 VAHPLLLA---PKAAPAPAPASDAAANGRPAAPAPAPAEAAWADADAPTGP 226
>gnl|CDD|236507 PRK09424, pntA, NAD(P) transhydrogenase subunit alpha; Provisional.
Length = 509
Score = 29.4 bits (67), Expect = 4.2
Identities = 6/24 (25%), Positives = 6/24 (25%), Gaps = 2/24 (8%)
Query: 24 PNVQAAPPPPP--PPPPLPPPPPM 45
V AAP P P
Sbjct: 373 IQVSAAPAAAAAAPAAKEEEKKPA 396
>gnl|CDD|152747 pfam12312, NeA_P2, Nepovirus subgroup A polyprotein. This family
of proteins is found in viruses. Proteins in this family
are typically between 259 and 1110 amino acids in
length. The family is found in association with
pfam03688, pfam03689, pfam03391. This family is one of
the polyproteins expressed by Nepoviruses in subgroup A.
Length = 175
Score = 28.8 bits (64), Expect = 4.3
Identities = 11/19 (57%), Positives = 11/19 (57%)
Query: 24 PNVQAAPPPPPPPPPLPPP 42
P PPPPPPPP PP
Sbjct: 113 PGGPCLPPPPPPPPIQKPP 131
Score = 27.6 bits (61), Expect = 8.9
Identities = 10/13 (76%), Positives = 11/13 (84%)
Query: 31 PPPPPPPPLPPPP 43
PPPPPPPP+ PP
Sbjct: 119 PPPPPPPPIQKPP 131
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 29.2 bits (65), Expect = 4.3
Identities = 17/66 (25%), Positives = 20/66 (30%), Gaps = 6/66 (9%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPP------PPPPMMGTTPH 79
V+A PP P TTPP P P TP P P P
Sbjct: 38 VEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVETPQSPTTKQVPT 97
Query: 80 VLGPLY 85
+ P +
Sbjct: 98 EINPKF 103
>gnl|CDD|237859 PRK14908, PRK14908, glycyl-tRNA synthetase; Provisional.
Length = 1000
Score = 29.6 bits (67), Expect = 4.3
Identities = 6/23 (26%), Positives = 8/23 (34%)
Query: 46 MGTTPPPMMGTPPPPPPMMGTPP 68
+ PPP P P + P
Sbjct: 290 LLKVPPPPAAVTPKKTPDICGPE 312
Score = 29.2 bits (66), Expect = 5.8
Identities = 6/22 (27%), Positives = 8/22 (36%)
Query: 60 PPPMMGTPPPPPPMMGTTPHVL 81
PPP P P + +L
Sbjct: 294 PPPPAAVTPKKTPDICGPETLL 315
>gnl|CDD|177594 PHA03356, PHA03356, tegument protein UL11; Provisional.
Length = 93
Score = 27.5 bits (61), Expect = 4.4
Identities = 16/48 (33%), Positives = 19/48 (39%), Gaps = 3/48 (6%)
Query: 19 NIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP---PPPM 63
N + P ++ PP PP P PP T G P PPPM
Sbjct: 45 NFYISPELRVVTQPPGRQPPRPSGPPSRHTRHWSPSGARPAPTFPPPM 92
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 29.6 bits (66), Expect = 4.4
Identities = 13/40 (32%), Positives = 16/40 (40%)
Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
P P PPP P + PPPP + +PPP
Sbjct: 271 PRTSPKRFRSPPPRKARGRSPRRLIRSPPPPGRLRSPPPL 310
>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
Length = 580
Score = 29.3 bits (66), Expect = 4.6
Identities = 15/68 (22%), Positives = 21/68 (30%), Gaps = 2/68 (2%)
Query: 35 PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV--LGPLYWTRLIVS 92
PP + T+ + PP P P P P P R S
Sbjct: 55 YPPRETGSGGGVATSTIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASS 114
Query: 93 PEIPSDTT 100
P + +D+T
Sbjct: 115 PRLGADST 122
>gnl|CDD|99960 cd03784, GT1_Gtf_like, This family includes the Gtfs, a group of
homologous glycosyltransferases involved in the final
stages of the biosynthesis of antibiotics vancomycin and
related chloroeremomycin. Gtfs transfer sugar moieties
from an activated NDP-sugar donor to the oxidatively
cross-linked heptapeptide core of vancomycin group
antibiotics. The core structure is important for the
bioactivity of the antibiotics.
Length = 401
Score = 29.3 bits (66), Expect = 5.0
Identities = 16/55 (29%), Positives = 19/55 (34%)
Query: 17 YSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
S++ L A PPPP P G P G PPP + PP
Sbjct: 187 GSDVPELYGFSPAVLPPPPDWPRFDLVTGYGFRDVPYNGPPPPELWLFLAAGRPP 241
>gnl|CDD|219979 pfam08704, GCD14, tRNA methyltransferase complex GCD14 subunit.
GCD14 is a subunit of the tRNA methyltransferase complex
and is required for 1-methyladenosine modification and
maturation of initiator methionyl-tRNA.
Length = 309
Score = 29.0 bits (65), Expect = 5.1
Identities = 12/42 (28%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 181 SVVSLEALQQIYDVR-ATDEEINLIRAHLASNSDLQLDKPEQ 221
+ ++E L + YDVR + I+L R L N +++ P++
Sbjct: 228 EIETIEVLPRQYDVRTVSLPVIDLGRDTLEENERRRIEGPKE 269
>gnl|CDD|235367 PRK05222, PRK05222,
5-methyltetrahydropteroyltriglutamate--homocysteine
S-methyltransferase; Provisional.
Length = 758
Score = 29.3 bits (67), Expect = 5.1
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 167 EFSEIESAIFNMDASVVSLEA 187
EF++I AI +DA V+S+E
Sbjct: 648 EFNDIIDAIAALDADVISIET 668
>gnl|CDD|215382 PLN02712, PLN02712, arogenate dehydrogenase.
Length = 667
Score = 29.2 bits (65), Expect = 5.2
Identities = 15/44 (34%), Positives = 17/44 (38%), Gaps = 1/44 (2%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
+PPPPP PPP P TT P + T P P
Sbjct: 1 SPPPPPQPPPS-LFPTSKPTTMSPRLSLSIKSQSATATDKQPLP 43
>gnl|CDD|114474 pfam05750, Rubella_Capsid, Rubella capsid protein. Rubella virus
is an enveloped positive-strand RNA virus of the family
Togaviridae. Virions are composed of three structural
proteins: a capsid and two membrane-spanning
glycoproteins, E2 and E1. During virus assembly, the
capsid interacts with genomic RNA to form nucleocapsids.
It has been discovered that capsid phosphorylation
serves to negatively regulate binding of viral genomic
RNA. This may delay the initiation of nucleocapsid
assembly until sufficient amounts of virus glycoproteins
accumulate at the budding site and/or prevent
non-specific binding to cellular RNA when levels of
genomic RNA are low. It follows that at a late stage in
replication, the capsid may undergo dephosphorylation
before nucleocapsid assembly occurs.
Length = 300
Score = 29.1 bits (64), Expect = 5.2
Identities = 15/39 (38%), Positives = 17/39 (43%), Gaps = 5/39 (12%)
Query: 27 QAAPPPPPPPPPLPPPPPMM-----GTTPPPMMGTPPPP 60
Q P P PP P PP M G+ P P +G P P
Sbjct: 90 QTPAPKPSRAPPQQPQPPRMQTGRGGSAPRPELGPPTNP 128
>gnl|CDD|180801 PRK07033, PRK07033, hypothetical protein; Provisional.
Length = 427
Score = 29.2 bits (66), Expect = 5.3
Identities = 7/40 (17%), Positives = 7/40 (17%)
Query: 34 PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
P P TP P P P
Sbjct: 1 NTSSDPFSAGSGGFVPPNPGDRTPAAAPAAGAPFQPRPGR 40
Score = 28.1 bits (63), Expect = 9.9
Identities = 10/17 (58%), Positives = 10/17 (58%)
Query: 23 LPNVQAAPPPPPPPPPL 39
LP VQ A P P PPP
Sbjct: 268 LPVVQIARPAPQPPPAP 284
>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family.
Length = 424
Score = 29.0 bits (65), Expect = 5.6
Identities = 14/106 (13%), Positives = 23/106 (21%), Gaps = 23/106 (21%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
+++ P P + + + P
Sbjct: 137 SAQRSSAQHLFYIIPGGPSWR------TSIKPFSSSHYGSHNSSAYSDHLRPLQP----- 185
Query: 84 LYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQR 129
S E+P P W+ I LE FS V+
Sbjct: 186 -------YSGELPPPLGPNWQASNSKKIRGLE-----FSMWVSQPN 219
>gnl|CDD|182101 PRK09835, PRK09835, sensor kinase CusS; Provisional.
Length = 482
Score = 29.0 bits (65), Expect = 5.6
Identities = 15/34 (44%), Positives = 19/34 (55%)
Query: 87 TRLIVSPEIPSDTTPLWKELEEVPINNLEEFTEL 120
T LI EI + KELE+V +NLEE T +
Sbjct: 278 TNLITQTEIALSQSRSQKELEDVLYSNLEELTRM 311
>gnl|CDD|177556 PHA03195, PHA03195, tegument protein VP11/12; Provisional.
Length = 746
Score = 29.2 bits (65), Expect = 5.6
Identities = 17/54 (31%), Positives = 22/54 (40%), Gaps = 7/54 (12%)
Query: 32 PPPPPPPLPPPPPMMGTT----PPPMMGT---PPPPPPMMGTPPPPPPMMGTTP 78
PP P P PP GT P + G+ P PP + +P P G+T
Sbjct: 440 PPQGPLPAPPNSKTRGTFRRPRPGSVRGSRQLPASPPSNIVSPRTNPAAEGSTA 493
>gnl|CDD|239164 cd02763, MopB_2, The MopB_2 CD includes a group of related
uncharacterized bacterial molybdopterin-binding
oxidoreductase-like domains with a putative N-terminal
iron-sulfur [4Fe-4S] cluster binding site and
molybdopterin cofactor binding site. These members
belong to the molybdopterin_binding (MopB) superfamily
of proteins.
Length = 679
Score = 29.0 bits (65), Expect = 6.0
Identities = 17/43 (39%), Positives = 19/43 (44%), Gaps = 4/43 (9%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
PP P PPLP PP + P TP PP +G P P
Sbjct: 339 HKPPYPRHIPPLPKPPKIPSADKP---FTPLYGPP-LGWPASP 377
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 29.1 bits (65), Expect = 6.0
Identities = 12/53 (22%), Positives = 15/53 (28%), Gaps = 2/53 (3%)
Query: 22 SLPNVQAAPPPPPP--PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
S + AP P P P P P + +P P PP
Sbjct: 387 SPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARS 439
>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease,
PatA/PatG family. This model describes a protease
domain associated with the maturation of various members
of the cyanobactin family of ribosomally produced,
heavily modified bioactive metabolites. Members include
the PatA protein and C-terminal domain of the PatG
protein of Prochloron didemni, TenA and a region of TenG
from Nostoc spongiaeforme var. tenue, etc.
Length = 602
Score = 28.9 bits (65), Expect = 6.1
Identities = 8/57 (14%), Positives = 14/57 (24%), Gaps = 3/57 (5%)
Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTT---PPPMMGTPPPPPPMMGTPPPPPPMMGT 76
+ + + PP P P P+ P + P P+
Sbjct: 255 IESSTSVIPPGRAAEPAPVSIPVAAPGEGATPAAAQIELSAGVLPNAISPATPVRPA 311
>gnl|CDD|188060 TIGR00531, BCCP, acetyl-CoA carboxylase, biotin carboxyl carrier
protein. This model is designed to identify biotin
carboxyl carrier protein as a peptide of acetyl-CoA
carboxylase. Scoring below the trusted cutoff is a
related protein encoded in a region associated with
polyketide synthesis in the prokaryote
Saccharopolyspora hirsuta, and a reported
chloroplast-encoded biotin carboxyl carrier protein
that may be highly derived from the last common
ancestral sequence. Scoring below the noise cutoff are
biotin carboxyl carrier domains of other enzymes such
as pyruvate carboxylase.The gene name is accB or fabE
[Fatty acid and phospholipid metabolism, Biosynthesis].
Length = 156
Score = 28.3 bits (63), Expect = 6.2
Identities = 12/51 (23%), Positives = 16/51 (31%), Gaps = 10/51 (19%)
Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP----------PMMGT 66
++A P P P P P + P P PM+GT
Sbjct: 40 KKSAVQQAAAPVPAQVPAAPSAQAPAPAVCAPAPAKADKKGHFVRSPMVGT 90
>gnl|CDD|236063 PRK07586, PRK07586, hypothetical protein; Validated.
Length = 514
Score = 29.0 bits (66), Expect = 6.6
Identities = 8/19 (42%), Positives = 9/19 (47%)
Query: 26 VQAAPPPPPPPPPLPPPPP 44
+ A P PP P PP P
Sbjct: 315 LGAKPAAPPLAAPARPPLP 333
>gnl|CDD|227869 COG5582, COG5582, Uncharacterized conserved protein [Function
unknown].
Length = 182
Score = 28.2 bits (63), Expect = 6.6
Identities = 26/146 (17%), Positives = 55/146 (37%), Gaps = 14/146 (9%)
Query: 108 EVPINNLEEFTELFSRQVTAQRP----VTRKRQQKSSKVQNVARLIDSKRSQNVGILAQS 163
EV + ++F F Q +R V + ++NV ++ ++ +
Sbjct: 3 EVSVQEKKDFIRWFLNQYQLKRREAVWVLNYLMSHDALLENV-HFVEEAEGCPRTLIIST 61
Query: 164 LHIEFSEIESAIFNMDASVVSLEALQQIYDVRATDEEINLIRAHLASNSDLQLDKPEQFL 223
+ E ++ +++ + +D+RA E I+ H A S K Q+L
Sbjct: 62 HGVH----EVPFQFYKYNIKTMDPEKAFHDIRANPNEAIYIQLHFAGAS-----KNPQYL 112
Query: 224 ADLADIPHFSERIACFMFEADFSDSI 249
A L P E ++ + +D++
Sbjct: 113 AVLESNPQLPENLSEQTKLKELADAL 138
>gnl|CDD|217769 pfam03867, FTZ, Fushi tarazu (FTZ), N-terminal region. This region
contains the important motif (LXXLL) necessary for the
interaction of FTZ with the nuclear receptor FTZ-F1. FTZ
is thought to represents a category of LXXLL
motif-dependent co-activators for nuclear receptors.
Length = 264
Score = 28.6 bits (63), Expect = 6.6
Identities = 9/22 (40%), Positives = 10/22 (45%)
Query: 49 TPPPMMGTPPPPPPMMGTPPPP 70
TPPP TP + G PP
Sbjct: 204 TPPPAATTPVDGIGIEGISTPP 225
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 29.0 bits (66), Expect = 6.6
Identities = 21/174 (12%), Positives = 45/174 (25%), Gaps = 9/174 (5%)
Query: 5 DKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM 64
++ K K+E + AA P
Sbjct: 41 EEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAE 100
Query: 65 GTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQ 124
P P + P K+ + + R+
Sbjct: 101 DEAARPAEAAARRPKAKKA---------AKKKKGPKPKKKKPKRKAARGGKRGKGGKGRR 151
Query: 125 VTAQRPVTRKRQQKSSKVQNVARLIDSKRSQNVGILAQSLHIEFSEIESAIFNM 178
R RK+++K + + R + + V LA+ + ++ +E+ +F +
Sbjct: 152 RRRGRRRRRKKKKKQKPTEKIPREVVIPETITVAELAEKMAVKAAEVIKKLFKL 205
>gnl|CDD|219056 pfam06485, DUF1092, Protein of unknown function (DUF1092). This
family consists of several hypothetical proteins of
unknown function all from photosynthetic organisms
including plants and cyanobacteria.
Length = 270
Score = 28.4 bits (64), Expect = 6.7
Identities = 8/22 (36%), Positives = 8/22 (36%)
Query: 22 SLPNVQAAPPPPPPPPPLPPPP 43
P A PPP PP P
Sbjct: 110 QEPGYMALAPPPVALDKPPPQP 131
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which
are homologues of floricaula (FLO) and Leafy (LFY)
proteins which are floral meristem identity proteins.
Mutations in the sequences of these proteins affect
flower and leaf development.
Length = 382
Score = 28.8 bits (65), Expect = 7.0
Identities = 8/31 (25%), Positives = 8/31 (25%)
Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTP 57
PPPPP LPP
Sbjct: 15 PRPAVPPPPPRLLPPAAVPPPPLAAAAAARK 45
>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
Length = 246
Score = 28.4 bits (64), Expect = 7.5
Identities = 11/33 (33%), Positives = 11/33 (33%)
Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP 61
PPP PPPP PPP P
Sbjct: 25 DPPPEPPPPEPAAPPPEPPEPEEVPEEEAAPLT 57
>gnl|CDD|235931 PRK07092, PRK07092, benzoylformate decarboxylase; Reviewed.
Length = 530
Score = 28.8 bits (65), Expect = 7.7
Identities = 19/74 (25%), Positives = 23/74 (31%), Gaps = 14/74 (18%)
Query: 34 PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSP 93
PP P PP PPP P G P ++ L L IV
Sbjct: 335 LPPSARPAPPAR----------PMPPPAPAPGEPLSVAFVL----QTLAALRPADAIVVE 380
Query: 94 EIPSDTTPLWKELE 107
E PS + + L
Sbjct: 381 EAPSTRPAMQEHLP 394
Score = 28.4 bits (64), Expect = 9.4
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMM 46
P+ + APP P PPP P P +
Sbjct: 337 PSARPAPPARPMPPPAPAPGEPL 359
>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486). This
family is made up of members from various Burkholderia
spp. The function is unknown.
Length = 245
Score = 28.3 bits (63), Expect = 7.7
Identities = 12/52 (23%), Positives = 13/52 (25%), Gaps = 3/52 (5%)
Query: 23 LPNVQAAPPPPPPP---PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
LP+ AA P PP TPP PP
Sbjct: 116 LPSALAADDPQAPPAGATAADAGDAAPDATPPAAGDASPPAAAQAAASAAAA 167
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 27.5 bits (61), Expect = 7.7
Identities = 9/17 (52%), Positives = 10/17 (58%)
Query: 28 AAPPPPPPPPPLPPPPP 44
+AP PP P P P PP
Sbjct: 15 SAPAPPAGPGPGPNAPP 31
>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding. This region
tends to appear at the N-terminus of proteins also
containing DNA-binding HMG (high mobility group) boxes
(pfam00505) and appears to bind the armadillo repeat of
CTNNB1 (beta-catenin), forming a stable complex.
Signaling by Wnt through TCF/LCF is involved in
developmental patterning, induction of neural tissues,
cell fate decisions and stem cell differentiation.
Isoforms of HMG T-cell factors lacking the N-terminal
CTNNB1-binding domain cannot fulfill their role as
transcriptional activators in T-cell differentiation.
Length = 200
Score = 27.9 bits (62), Expect = 7.8
Identities = 18/71 (25%), Positives = 21/71 (29%), Gaps = 10/71 (14%)
Query: 22 SLPNVQAAPPPPPPP----------PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
P P P PPP P + PP PP + P P
Sbjct: 129 VPVVQPPHHVHPLTPLITYSNEHFSPGTPPPHLPYDVDPKTGIPRPPHPPDISPFYPLSP 188
Query: 72 PMMGTTPHVLG 82
+G PH LG
Sbjct: 189 GGVGQIPHPLG 199
Score = 27.9 bits (62), Expect = 8.1
Identities = 14/58 (24%), Positives = 17/58 (29%), Gaps = 9/58 (15%)
Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM---------GTPPPPPP 72
P + P P P PP P P++ GTPPP P
Sbjct: 105 PFLMLPNPYLPNGSLSPLPPSSNKVPVVQPPHHVHPLTPLITYSNEHFSPGTPPPHLP 162
Score = 27.9 bits (62), Expect = 8.6
Identities = 21/78 (26%), Positives = 29/78 (37%), Gaps = 16/78 (20%)
Query: 23 LPNVQAAPPPPP--------PPPPLPPPPPMM--------GTTPPPMMGTPPPPPPMMGT 66
LPN +P PP PP + P P++ TPPP + P +
Sbjct: 114 LPNGSLSPLPPSSNKVPVVQPPHHVHPLTPLITYSNEHFSPGTPPPHLPYDVDPKTGIPR 173
Query: 67 PPPPPPMMGTTPHVLGPL 84
PP PP + P G +
Sbjct: 174 PPHPPDISPFYPLSPGGV 191
>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).
This family consists of several MGC-24 (or Cd164
antigen) proteins from eukaryotic organisms.
MGC-24/CD164 is a sialomucin expressed in many normal
and cancerous tissues. In humans, soluble and
transmembrane forms of MGC-24 are produced by
alternative splicing.
Length = 187
Score = 28.0 bits (62), Expect = 7.9
Identities = 10/44 (22%), Positives = 10/44 (22%)
Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
A P P P T T P TP P
Sbjct: 110 VATTTPVPTNSTAKPTITPSPTTSHHHVTSEPKTNTTVTPTSQP 153
>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
Length = 171
Score = 27.8 bits (62), Expect = 8.0
Identities = 18/106 (16%), Positives = 30/106 (28%), Gaps = 8/106 (7%)
Query: 1 MEYMDKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP 60
ME +++AK+ ++ + P + P P + TTP
Sbjct: 39 MEQANEKAKKLEQSSD-----AKPAAASLPRLAEGNFRTMPVQTVTATTPSGSRPAVTAT 93
Query: 61 PPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSPE---IPSDTTPLW 103
P P P + P+ + P T LW
Sbjct: 94 PEQKLLAPRPLFTAAREVKTVVPVSSVTPVTPPRPLRTGEQTARLW 139
>gnl|CDD|177445 PHA02662, PHA02662, ORF131 putative membrane protein;
Provisional.
Length = 226
Score = 28.1 bits (62), Expect = 8.2
Identities = 9/14 (64%), Positives = 9/14 (64%)
Query: 30 PPPPPPPPPLPPPP 43
PP PP PP PP P
Sbjct: 2 PPRTPPTPPRPPEP 15
>gnl|CDD|111097 pfam02166, Androgen_recep, Androgen receptor.
Length = 423
Score = 28.6 bits (63), Expect = 8.4
Identities = 10/17 (58%), Positives = 12/17 (70%)
Query: 21 FSLPNVQAAPPPPPPPP 37
++ P A PPPPPPPP
Sbjct: 339 YNFPLALAGPPPPPPPP 355
>gnl|CDD|218017 pfam04305, DUF455, Protein of unknown function (DUF455).
Length = 250
Score = 28.3 bits (64), Expect = 8.7
Identities = 14/51 (27%), Positives = 23/51 (45%), Gaps = 8/51 (15%)
Query: 165 HIEFSEIESAIFNMDASVVSLEALQQIYD--VRATDEEI---NLIRAHLAS 210
HIE + I+ A+ DA ++ YD ++ D+E L+R L
Sbjct: 74 HIELNAIDLAL---DAVARFRGLPREFYDDWLKVADDEAKHFRLLRERLEE 121
>gnl|CDD|222296 pfam13660, DUF4147, Domain of unknown function (DUF4147). This
domain is frequently found at the N-terminus of proteins
carrying the glycerate kinase-like domain MOFRL,
pfam05161.
Length = 227
Score = 28.2 bits (64), Expect = 8.8
Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 4/40 (10%)
Query: 173 SAIFNMDASVVSLEALQQIYD--VR--ATDEEINLIRAHL 208
SA+ A ++LE Q + + A +EIN +R HL
Sbjct: 121 SALLPAPAEGITLEDKQAVTRALLASGADIQEINAVRKHL 160
>gnl|CDD|216111 pfam00772, DnaB, DnaB-like helicase N terminal domain. The
hexameric helicase DnaB unwinds the DNA duplex at the
Escherichia coli chromosome replication fork. Although
the mechanism by which DnaB both couples ATP hydrolysis
to translocation along DNA and denatures the duplex is
unknown, a change in the quaternary structure of the
protein involving dimerisation of the N-terminal domain
has been observed and may occur during the enzymatic
cycle. This N-terminal domain is required both for
interaction with other proteins in the primosome and for
DnaB helicase activity.
Length = 103
Score = 26.8 bits (60), Expect = 8.8
Identities = 9/51 (17%), Positives = 18/51 (35%), Gaps = 6/51 (11%)
Query: 184 SLEALQQIYDV----RATDEEINLI--RAHLASNSDLQLDKPEQFLADLAD 228
A + I++ + I+ + L L+ +LA+LA
Sbjct: 36 YDPAHRLIFEAILELYEKGKPIDPVTVAEELERKGKLEEVGGIAYLAELAA 86
>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
SFB2/subunit SFB3 [Intracellular trafficking and
secretion].
Length = 861
Score = 28.6 bits (64), Expect = 8.9
Identities = 15/66 (22%), Positives = 18/66 (27%), Gaps = 5/66 (7%)
Query: 40 PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL---GPLYWTRLIVSPEI- 95
P P +M P M P P P L P + L + P
Sbjct: 73 NPAPSVMSPAFQSQQKFSSPYGGSMADGTAPKPTNPLVPVDLFEDQPPPISDLFLPPPPI 132
Query: 96 -PSDTT 100
P TT
Sbjct: 133 VPPLTT 138
>gnl|CDD|223037 PHA03301, PHA03301, envelope glycoprotein L; Provisional.
Length = 226
Score = 27.9 bits (62), Expect = 9.0
Identities = 11/32 (34%), Positives = 14/32 (43%), Gaps = 1/32 (3%)
Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPP 52
S P+ +A+P P P P P TP P
Sbjct: 180 LSAPDDEASPQPKSLATPPPVAAP-SRRTPRP 210
>gnl|CDD|223689 COG0616, SppA, Periplasmic serine proteases (ClpP class)
[Posttranslational modification, protein turnover,
chaperones / Intracellular trafficking and secretion].
Length = 317
Score = 28.1 bits (63), Expect = 9.2
Identities = 31/143 (21%), Positives = 48/143 (33%), Gaps = 30/143 (20%)
Query: 239 FMFEADFSDSIALIDSKLNNIKSVCEFLMKSEHLKKVIAIILSLGNFMNGGNRLRGQADG 298
++ + L L + + + KVIA+I G + GG LR
Sbjct: 24 WLLSLILLLAPTLKAVGLLAKRLGKLEALLRKRGSKVIAVIHVEGAIVAGGGPLRFIG-- 81
Query: 299 FGLEILPKLKDVRSKDNSVTLLHFIVRTYLR-NSENPLNESLPVPEP-GDVDRAASVVFD 356
G +I L+ R+ + ++ LR NS P G V AS +
Sbjct: 82 -GDDIEEILRAARADPSVKAVV-------LRINS------------PGGSV--VAS---E 116
Query: 357 DIHSQLSTLAKKLDAVTISMNKV 379
I L L K V +S+
Sbjct: 117 LIARALKRLRAK-KPVVVSVGGY 138
>gnl|CDD|226708 COG4257, Vgb, Streptogramin lyase [Defense mechanisms].
Length = 353
Score = 28.3 bits (63), Expect = 9.3
Identities = 11/54 (20%), Positives = 16/54 (29%), Gaps = 3/54 (5%)
Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
P ++G P G+ P P P P G+ P + P
Sbjct: 19 PASSSAAANAVGANALIGEFATPAAGSSPAPDGSSAEFPVPN---GSAPFDVAP 69
>gnl|CDD|180315 PRK05933, PRK05933, type III secretion system protein; Validated.
Length = 372
Score = 28.3 bits (63), Expect = 9.6
Identities = 10/24 (41%), Positives = 11/24 (45%)
Query: 20 IFSLPNVQAAPPPPPPPPPLPPPP 43
I S PN+Q PP P P P
Sbjct: 280 ITSYPNLQPEDPPEEPSEGGPAAP 303
>gnl|CDD|226293 COG3770, MepA, Murein endopeptidase [Cell envelope biogenesis,
outer membrane].
Length = 284
Score = 28.2 bits (63), Expect = 9.6
Identities = 14/51 (27%), Positives = 14/51 (27%), Gaps = 12/51 (23%)
Query: 30 PPPPPPPPPLPPPPPMMGT--------TPPPMMGTPPPPPPMMGTPPPPPP 72
P PP G PPP P P PPP PP
Sbjct: 222 PAGSLECEDQALVPPGDGCGKELASWFEPPP----PGTTKPEPKAPPPLPP 268
>gnl|CDD|185156 PRK15244, PRK15244, virulence protein SpvB; Provisional.
Length = 591
Score = 28.5 bits (63), Expect = 9.8
Identities = 11/16 (68%), Positives = 11/16 (68%)
Query: 50 PPPMMGTPPPPPPMMG 65
P M PPPPPPMMG
Sbjct: 361 PVNNMMPPPPPPPMMG 376
Score = 28.5 bits (63), Expect = 9.8
Identities = 11/16 (68%), Positives = 11/16 (68%)
Query: 60 PPPMMGTPPPPPPMMG 75
P M PPPPPPMMG
Sbjct: 361 PVNNMMPPPPPPPMMG 376
>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
Provisional.
Length = 418
Score = 28.3 bits (63), Expect = 10.0
Identities = 7/34 (20%), Positives = 10/34 (29%)
Query: 36 PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
PPL + P + P PP +
Sbjct: 385 SPPLDLTTSLRHMPSPSVTSAPEPPSIPLTYLSD 418
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.135 0.401
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,076,429
Number of extensions: 2415394
Number of successful extensions: 14082
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8932
Number of HSP's successfully gapped: 993
Length of query: 454
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 354
Effective length of database: 6,502,202
Effective search space: 2301779508
Effective search space used: 2301779508
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.4 bits)