RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14534
(423 letters)
>gnl|CDD|238040 cd00087, FReD, Fibrinogen-related domains (FReDs); C terminal
globular domain of fibrinogen. Fibrinogen is involved in
blood clotting, being activated by thrombin to assemble
into fibrin clots. The N-termini of 2 times 3 chains
come together to form a globular arrangement called the
disulfide knot. The C termini of fibrinogen chains end
in globular domains, which are not completely
equivalent. C terminal globular domains of the gamma
chains (C-gamma) dimerize and bind to the GPR motif of
the N-terminal domain of the alpha chain, while the GHR
motif of N-terminal domain of the beta chain binds to
the C terminal globular domains of another beta chain
(C-beta), which leads to lattice formation.
Length = 215
Score = 239 bits (613), Expect = 9e-78
Identities = 89/216 (41%), Positives = 117/216 (54%), Gaps = 26/216 (12%)
Query: 230 KLPNDC-------EGHSGLQLISL-----GRSVLCDEHH----WMTIQRRYNGMQEFNLK 273
LP DC SG+ I V CD W IQRR +G +F
Sbjct: 1 PLPRDCSEVLQRGGRTSGVYTIQPPGSNEPFQVYCDMDTDGGGWTVIQRRGDGSVDFYRS 60
Query: 274 WTDYAQGFGSPESEFWIGNDALHRLTSQDNMSLRIEFVDIYGKAWYAEYDTFSVASESEG 333
W +Y GFG+ + EFW+G + +H LTSQ LRI+ D G YAEYD+F V SESEG
Sbjct: 61 WKEYKDGFGNLDGEFWLGLEKIHLLTSQGPYELRIDLEDWEGNTAYAEYDSFKVGSESEG 120
Query: 334 YRLNLSGYHGNASDAMTYQNNMKFSTIDRDNDLSNTHCASNYEGGWWFSHCLHANLNGKF 393
YRL L GY G A DA++Y N MKFST DRDND ++ +CA +Y GGWW++ C +NLNG++
Sbjct: 121 YRLTLGGYSGTAGDALSYHNGMKFSTFDRDNDGASGNCAESYSGGWWYNSCHASNLNGRY 180
Query: 394 N---------LGLTWFHSEKNEWMAVARSHMKIRRR 420
G+ W + K ++ + MKIR +
Sbjct: 181 YSGGHRNEYDNGINW-ATWKGSTYSLKFTEMKIRPK 215
>gnl|CDD|214548 smart00186, FBG, Fibrinogen-related domains (FReDs). Domain
present at the C-termini of fibrinogen beta and gamma
chains, and a variety of fibrinogen-related proteins,
including tenascin and Drosophila scabrous.
Length = 212
Score = 226 bits (579), Expect = 1e-72
Identities = 92/213 (43%), Positives = 119/213 (55%), Gaps = 24/213 (11%)
Query: 231 LPNDCEGH-------SGLQLISLGRS-----VLCD-EHH---WMTIQRRYNGMQEFNLKW 274
LP DC SGL I S V CD E W IQRR +G +F W
Sbjct: 1 LPRDCSDVLQNGGKTSGLYTIYPDGSSRPLKVYCDMETDGGGWTVIQRRMDGSVDFYRDW 60
Query: 275 TDYAQGFGSPESEFWIGNDALHRLTSQDNMSLRIEFVDIYGKAWYAEYDTFSVASESEGY 334
DY +GFG+ EFW+GN+ +H LTSQ LRI+ D G YA YD+F VA E++GY
Sbjct: 61 KDYKEGFGNLAGEFWLGNENIHLLTSQGKYELRIDLEDWEGNTAYALYDSFKVADEADGY 120
Query: 335 RLNLSGYHGNASDA-MTYQNNMKFSTIDRDNDLSNTHCASNYEGGWWFSHCLHANLNGKF 393
RL++ GY G A DA +TY N M+FST DRDND + +CA Y GGWW+++C ANLNG++
Sbjct: 121 RLHIGGYSGTAGDASLTYHNGMQFSTYDRDNDKYSGNCAEEYGGGWWYNNCHAANLNGRY 180
Query: 394 ------NLGLTWFHSEKNEWMAVARSHMKIRRR 420
+ G+ W K W ++ + MKIR
Sbjct: 181 YPNNNYDNGINWATW-KGSWYSLKFTEMKIRPL 212
>gnl|CDD|215748 pfam00147, Fibrinogen_C, Fibrinogen beta and gamma chains,
C-terminal globular domain.
Length = 221
Score = 187 bits (476), Expect = 3e-57
Identities = 89/221 (40%), Positives = 113/221 (51%), Gaps = 33/221 (14%)
Query: 232 PNDCEGH-------SGLQLISLGRS-----VLCD----EHHWMTIQRRYNGMQEFNLKWT 275
P DC SGL I + V CD W QRR +G F W
Sbjct: 2 PKDCSDILQNGGETSGLYTIQPDGANKPLRVYCDMETDGGGWTVFQRRQDGSVNFYRNWK 61
Query: 276 DYAQGFGSPES-EFWIGNDALHRLTSQDNMSLRIEFVDIYGKAWYAEYDTFSVASESEGY 334
DY GFG+ + EFW+GND +H LTSQ LR+E D G+ YA YD F+V E+ Y
Sbjct: 62 DYKAGFGNLATGEFWLGNDKIHLLTSQGPYELRVELEDWNGETVYALYDKFTVGDEANKY 121
Query: 335 RLNLSGYHGNASDA-------MTYQNNMKFSTIDRDNDLSNTHCASNYEGGWWFSHCLHA 387
RL++SGY G A +A MTY N M FST DRDND +CA +Y GGWW+++C A
Sbjct: 122 RLSVSGYSGTAGNALMTAGDSMTYHNGMFFSTYDRDNDSPRGNCALSYGGGWWYNNCHAA 181
Query: 388 NLNGKF--------NLGLTWFHSEKNEWMAVARSHMKIRRR 420
NLNG++ G+ W K W ++ + MKIR R
Sbjct: 182 NLNGRYYYGGTYSWQNGVVWMTW-KGSWYSMKFAEMKIRPR 221
>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein.
Length = 326
Score = 39.1 bits (91), Expect = 0.003
Identities = 42/170 (24%), Positives = 76/170 (44%), Gaps = 26/170 (15%)
Query: 45 EILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMELLESIETIENKVDYNLPTIQR 104
E++ I L +S N L I ++++ +E S+ L ES+ TI +++ +
Sbjct: 11 EVVGLILSLTSSVTINPGDLTPIYERLTNLE---SATASLSESVSTISSRLS----DLSA 63
Query: 105 EISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLATLQEKVEANNII---MQALSDA 161
++ + + +TL + T ++ + S S LA L V A++ +Q A
Sbjct: 64 DLQDMTTSLDDAKATLNGLITTVQALQTSVDSLSSELADLSLTVSAHDAAISDLQTTVHA 123
Query: 162 NY-----LKSTVS----TLATLQEKVEANNIIMQALRNGRSEGLSLSSQL 202
N LKS+VS + L+++V ++L +G S GLS S L
Sbjct: 124 NSTDISNLKSSVSANGLNITDLEQRV-------KSLESGSSHGLSFSPPL 166
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 38.5 bits (90), Expect = 0.007
Identities = 36/205 (17%), Positives = 83/205 (40%), Gaps = 9/205 (4%)
Query: 25 LEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMEL 84
LE++ + ++ L+ E++ R+ LE + + L + +++ +E S+ E
Sbjct: 714 LERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEA 773
Query: 85 L----ESIETIENKVDYN---LPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKST 137
L E IE +E K L ++ E+ + E + + L+ + + RE ++
Sbjct: 774 LAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEEL 833
Query: 138 VSTLATLQEKVEANNIIMQALSDANYLKSTVSTLATLQEKVEANNIIMQALRNGRSEGLS 197
+ L+EK++ ++ L L+ L L+ + E ++ L + E
Sbjct: 834 EEEIEELEEKLDELEEELEELEKE--LEELKEELEELEAEKEELEDELKELEEEKEELEE 891
Query: 198 LSSQLDDSKNLSPETIMQQMNKIHD 222
+L+ E I + ++ +
Sbjct: 892 ELRELESELAELKEEIEKLRERLEE 916
Score = 34.7 bits (80), Expect = 0.11
Identities = 32/157 (20%), Positives = 67/157 (42%), Gaps = 9/157 (5%)
Query: 23 HHLEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMM 82
LE+E + +++ LQ EE+ + + LE + L + ++I +E ++
Sbjct: 740 EELEEELEELEEELEELQERLEELEEELESLE-------EALAKLKEEIEELEEKRQALQ 792
Query: 83 ELLESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLA 142
E LE +E + + L ++RE+ LE ++ ++ + E E L L
Sbjct: 793 EELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELE 852
Query: 143 TLQEKVEANNIIMQALSDANYLKSTVSTLATLQEKVE 179
L++++E ++ L + L L+E+ E
Sbjct: 853 ELEKELEELKEELEELEAE--KEELEDELKELEEEKE 887
Score = 32.4 bits (74), Expect = 0.54
Identities = 25/138 (18%), Positives = 63/138 (45%), Gaps = 7/138 (5%)
Query: 18 FLKTKHHLEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHL 77
+ LE+ + + +++ L+ EE+ +++ LE + K L + +++ E L
Sbjct: 812 LERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEEL---EEL 868
Query: 78 HSSMMELLESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKST 137
+ EL + ++ +E + + ++ E+ +LE +A++ ++ + E E L+
Sbjct: 869 EAEKEELEDELKELEEEKE----ELEEELRELESELAELKEEIEKLRERLEELEAKLERL 924
Query: 138 VSTLATLQEKVEANNIIM 155
L L+E++E
Sbjct: 925 EVELPELEEELEEEYEDT 942
Score = 30.5 bits (69), Expect = 1.9
Identities = 37/195 (18%), Positives = 76/195 (38%), Gaps = 21/195 (10%)
Query: 19 LKTKHH-LEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHL 77
LK++ L +E + ++ L+ EE+ IS L + L + +++ ++
Sbjct: 272 LKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEK 331
Query: 78 HSSMMELLESIETI------------------ENKVDYNLPTIQREISKLEFGVAQINST 119
++ E LE ET+ E K+ L ++ L +A++ +
Sbjct: 332 IEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAE 391
Query: 120 LQVVNETRESDANYLKSTVSTLATLQEKVEANNIIMQALSDANYLKSTVSTLATLQEKVE 179
L + E ++S L L E++E ++ L L+ + L L E++E
Sbjct: 392 LAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAE--LEELQTELEELNEELE 449
Query: 180 ANNIIMQALRNGRSE 194
++ LR+ E
Sbjct: 450 ELEEQLEELRDRLKE 464
Score = 28.5 bits (64), Expect = 7.8
Identities = 32/190 (16%), Positives = 78/190 (41%), Gaps = 12/190 (6%)
Query: 26 EKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMELL 85
E E ++ K+K L+ EE+ + +S LE + + L K+I ++ + E L
Sbjct: 224 ELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREEL 283
Query: 86 ESIET-------IENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTV 138
E ++ +++ + ++ + +LE + ++ L+ + E E+ L+
Sbjct: 284 EELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERE 343
Query: 139 STLATLQEKVEANNIIMQALSDAN--YLKSTVSTLATLQEKVEANNIIMQALRNGRSEGL 196
+ L L++ + + L + L+ L+E++ + L R+E
Sbjct: 344 TLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAE---LEAELAEIRNELE 400
Query: 197 SLSSQLDDSK 206
L +++ +
Sbjct: 401 ELKREIESLE 410
>gnl|CDD|179329 PRK01742, tolB, translocation protein TolB; Provisional.
Length = 429
Score = 34.5 bits (79), Expect = 0.10
Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 15/68 (22%)
Query: 178 VEANNIIMQALRNGRSEGLSLSSQLDDSKNLSPETIMQQMNKIHDSYLQVVGKLPNDCEG 237
+ +N++ Q L +G +E LS S+ LD+S ++SP IM I S Q +GK+
Sbjct: 350 INGDNVVKQDLTSGSTEVLS-STFLDESPSISPNGIM-----IIYSSTQGLGKV------ 397
Query: 238 HSGLQLIS 245
LQL+S
Sbjct: 398 ---LQLVS 402
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 34.6 bits (80), Expect = 0.10
Identities = 37/209 (17%), Positives = 84/209 (40%), Gaps = 10/209 (4%)
Query: 25 LEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMEL 84
L KE + +++ L+ EE+ ++IS L + + ++I+ + + +
Sbjct: 703 LRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAE 762
Query: 85 LESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLATL 144
+E +E + + L + EI +LE + Q+ L+ + E + L A L
Sbjct: 763 IEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANL 822
Query: 145 QEKVEA--NNIIMQALSDANYLKSTVSTLATLQEKVEANNIIMQALRNGRSEGLSLSSQL 202
+E++E+ I L+ + L E +E+ ++ L E L S+L
Sbjct: 823 RERLESLERRIAATERR----LEDLEEQIEELSEDIESLAAEIEELEELIEE---LESEL 875
Query: 203 DDSKNLSPETIMQQMNKIHDSYLQVVGKL 231
+ N ++ + + + ++ +L
Sbjct: 876 EALLNER-ASLEEALALLRSELEELSEEL 903
Score = 31.6 bits (72), Expect = 0.92
Identities = 35/186 (18%), Positives = 84/186 (45%), Gaps = 6/186 (3%)
Query: 25 LEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMEL 84
E+E + +++ L+ EE+ +S LE + K L+ + +IS +E + E
Sbjct: 251 AEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRER 310
Query: 85 LESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLATL 144
L ++E +++ L ++ ++ +L +A++ L+ + E ES L+ + L L
Sbjct: 311 LANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEEL 370
Query: 145 QEKVEANNIIMQALSDANYLKSTVSTLATLQEKVEANNIIMQAL----RNGRSEGLSLSS 200
+ ++E ++ L + +A+L ++E ++ L + E L
Sbjct: 371 ESRLEELEEQLETLRSK--VAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLK 428
Query: 201 QLDDSK 206
+L++++
Sbjct: 429 KLEEAE 434
>gnl|CDD|112109 pfam03279, Lip_A_acyltrans, Bacterial lipid A biosynthesis
acyltransferase.
Length = 295
Score = 33.4 bits (77), Expect = 0.19
Identities = 20/58 (34%), Positives = 26/58 (44%), Gaps = 2/58 (3%)
Query: 165 KSTVSTLATLQEKVEANNIIMQALRNGRSEGLSLS--SQLDDSKNLSPETIMQQMNKI 220
K+T ++ A L K A I + RNG G + LD + E I Q MNKI
Sbjct: 217 KATTTSPAKLARKTGAAVIPVFPWRNGDGSGYTAIVHPALDLTPTEDVEQIAQAMNKI 274
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 33.5 bits (77), Expect = 0.22
Identities = 18/96 (18%), Positives = 40/96 (41%), Gaps = 7/96 (7%)
Query: 35 KMKTLQAINEEILQRISHLETSSQTNVK--TLFNITKQISGVEHLHSSMMELLESIETIE 92
++ L I +E Q++ L + + L ++ G E L L E IE
Sbjct: 185 NIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKE-KEALERQKEAIE 243
Query: 93 NKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRE 128
++ +++ E+ KL ++++ L+ + + E
Sbjct: 244 RQLA----SLEEELEKLTEEISELEKRLEEIEQLLE 275
Score = 32.3 bits (74), Expect = 0.52
Identities = 31/131 (23%), Positives = 66/131 (50%), Gaps = 7/131 (5%)
Query: 25 LEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMEL 84
+EKE + + + L+ EE+ + +S LE NVK+ + + + +E L + +L
Sbjct: 721 IEKEIEQLEQEEEKLKERLEELEEDLSSLE-QEIENVKS--ELKELEARIEELEEDLHKL 777
Query: 85 LESIETIENKVDYN-LPTIQREISKLEFGVAQINSTLQVVN---ETRESDANYLKSTVST 140
E++ +E ++ ++ +P IQ E+SKLE V++I + L+ + + YL+ +
Sbjct: 778 EEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQE 837
Query: 141 LATLQEKVEAN 151
L + ++
Sbjct: 838 LQEQRIDLKEQ 848
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 31.6 bits (72), Expect = 0.84
Identities = 32/159 (20%), Positives = 63/159 (39%), Gaps = 13/159 (8%)
Query: 25 LEKEQHVINHKMKTLQAINEEILQRISHLETSS---QTNVKTLFNITKQISGVEHLHSSM 81
LEKE + + L+ E+ +RI L+ + VK L K++ + +
Sbjct: 243 LEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKEL----KELKEKAEEYIKL 298
Query: 82 MELLESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTL 141
E E +++ L ++ EI+ +E + ++ + + E ++ LK L
Sbjct: 299 SEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKK----LKELEKRL 354
Query: 142 ATLQEKVEANNIIMQALSDANYLKSTVSTLATLQEKVEA 180
L+E+ E + LK ++ L EK+E
Sbjct: 355 EELEERHELYEEAKAKKEELERLKKRLTGLTP--EKLEK 391
Score = 30.0 bits (68), Expect = 2.5
Identities = 27/173 (15%), Positives = 64/173 (36%), Gaps = 26/173 (15%)
Query: 19 LKTKHHLEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLH 78
L+ L+K+ + K+ L+ E+L+ + L S ++ +++ +E +
Sbjct: 548 LEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELE------ERLKELEPFY 601
Query: 79 SSMMEL----------LESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVN-ETR 127
+ +EL + ++ +E ++D + +LE ++ + + E
Sbjct: 602 NEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEY 661
Query: 128 ESDANYLKSTVSTLATLQEKVEANNIIMQALSDANYLKSTVSTLATLQEKVEA 180
E LA L+ ++E + TL L+E++E
Sbjct: 662 EELREEYLELSRELAGLRAELEELE---------KRREEIKKTLEKLKEELEE 705
>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
Length = 780
Score = 30.6 bits (69), Expect = 1.8
Identities = 24/116 (20%), Positives = 46/116 (39%), Gaps = 5/116 (4%)
Query: 93 NKVDYNLPTIQREISKLEFGVAQINSTLQVVN---ETRESDANYLKSTVSTLATLQEKVE 149
N++D L T+ I +L+ G Q+N + + S L + V L +
Sbjct: 237 NQLDSGLGTLAAGIGELKQGAEQLNEGIGEFSSGLSELNSGVQDLAAGVPQLNQGISALA 296
Query: 150 ANNIIMQALSD--ANYLKSTVSTLATLQEKVEANNIIMQALRNGRSEGLSLSSQLD 203
A + +L D ++ ++ L++K ++ Q + L+LS LD
Sbjct: 297 AGLSLPDSLGDQFSSLQEALTQIAQGLKQKTSSSLEAAQGSLSSLQSMLALSKSLD 352
>gnl|CDD|226823 COG4388, COG4388, Mu-like prophage I protein [General function
prediction only].
Length = 357
Score = 30.2 bits (68), Expect = 1.9
Identities = 30/124 (24%), Positives = 47/124 (37%), Gaps = 10/124 (8%)
Query: 85 LESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLATL 144
L ET+ D P + E+SK + + + LKS V+ L+
Sbjct: 207 LTLAETLAAPKDEKDPQVA-ELSKGDAAKPTDKDLTKYAPD---GTVAELKSQVAALSAK 262
Query: 145 QEKVEANNIIMQALSDANYLKSTVSTLATL--QEKVEANNIIMQALRNGRSEGLSLSSQL 202
+E + +I ALS+ L + L Q+ V A + AL +GR +LS
Sbjct: 263 VIAMEVDGLIRAALSEGRLLPAQEEYAKQLGKQDGVAA----LSALIDGRQPVAALSKTQ 318
Query: 203 DDSK 206
K
Sbjct: 319 TGGK 322
>gnl|CDD|177949 PLN02315, PLN02315, aldehyde dehydrogenase family 7 member.
Length = 508
Score = 30.2 bits (68), Expect = 2.3
Identities = 23/94 (24%), Positives = 41/94 (43%), Gaps = 7/94 (7%)
Query: 76 HLHSSMMELLESIETIEN---KVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDAN 132
H S + IE I++ K+ I+ E + ++ + +I+ VV E
Sbjct: 343 HTPESKKNFEKGIEIIKSQGGKILTGGSAIESEGNFVQPTIVEISPDADVVKEELFGPVL 402
Query: 133 YLKSTVSTLATLQEKVEANNIIMQALSDANYLKS 166
Y V TL+E +E NN + Q LS + + ++
Sbjct: 403 Y----VMKFKTLEEAIEINNSVPQGLSSSIFTRN 432
>gnl|CDD|218408 pfam05062, RICH, RICH domain. This presumed domain is about 85
residues in length and very rich in charged residues,
hence the name RICH (Rich In CHarged residues). It is
found in secreted proteins such as PspC, SpsA, and IgA
FC receptor from Streptococcus agalactiae. This domain
could be involved in bacterial adherence or cell wall
binding.
Length = 81
Score = 27.7 bits (62), Expect = 2.7
Identities = 13/45 (28%), Positives = 21/45 (46%), Gaps = 1/45 (2%)
Query: 16 KLFLKTKHHLEKEQHVINHKM-KTLQAINEEILQRISHLETSSQT 59
K+ + L+K +H N + K L AI E L + L+T +
Sbjct: 17 KILSEIYKQLDKRKHTQNVALIKKLSAIKTEYLYELDVLKTKVEL 61
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 29.6 bits (67), Expect = 2.8
Identities = 28/127 (22%), Positives = 54/127 (42%), Gaps = 9/127 (7%)
Query: 59 TNVKTLF---NITKQISGVEHLHSSMMELLESIETIENKVDYNLPTIQREISKLEFGVAQ 115
N+K+L + + +E + + L+ + ++ L ++ E+ LE V+Q
Sbjct: 4 NNLKSLLAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQ 63
Query: 116 INSTLQVVNETRESDANYLKSTVSTLATLQEKVEANNIIMQAL-SDANYLKSTVSTLATL 174
+ S +Q + E R A S V + ++ A NI +Q N L+ ++ L
Sbjct: 64 LESEIQEIRE-RIKRAEEKLSAVKD----ERELRALNIEIQIAKERINSLEDELAELMEE 118
Query: 175 QEKVEAN 181
EK+E
Sbjct: 119 IEKLEKE 125
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 29.3 bits (66), Expect = 4.3
Identities = 27/137 (19%), Positives = 55/137 (40%), Gaps = 9/137 (6%)
Query: 24 HLEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMME 83
E + ++LQA+N E++Q ++ L+ Q + + +E L E
Sbjct: 49 AFRAEAEQLRTFARSLQALNLELIQELNELKARLQQQLLQS--REQLQLLIESLAQLSSE 106
Query: 84 LLESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNET-RESDANYLKSTVSTLA 142
E I +++ L + ++ K Q+ L+ V E RE + + +
Sbjct: 107 FQELANEIFEELNRRLAELNQQNLK------QLLKPLREVLEKFREQLEQRIHESAEERS 160
Query: 143 TLQEKVEANNIIMQALS 159
TL E+++ +Q L+
Sbjct: 161 TLLEEIDRLLGEIQQLA 177
>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain. This alpha
helical domain is found in a set of bacterial plasmid
replication proteins. The domain is found to the
C-terminus of the primase/polymerase domain. Mutants of
this domain are defective in template binding,
dinucleotide formation and conformation change prior to
DNA extension.
Length = 135
Score = 28.2 bits (63), Expect = 4.4
Identities = 11/37 (29%), Positives = 19/37 (51%)
Query: 35 KMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQI 71
K KT++AI EE+ ++I + K + N K +
Sbjct: 27 KGKTVEAIREEVCKKIKKSLEELKEKKKAILNTAKGV 63
>gnl|CDD|212500 cd11641, Precorrin-4_C11-MT, Precorrin-4 C11-methyltransferase
(CbiF/CobM). Precorrin-4 C11-methyltransferase
participates in the pathway toward the biosynthesis of
cobalamin (vitamin B12). There are two distinct
cobalamin biosynthetic pathways in bacteria. The aerobic
pathway requires oxygen, and cobalt is inserted late in
the pathway; the anaerobic pathway does not require
oxygen, and cobalt insertion is the first committed step
towards cobalamin synthesis. In the aerobic pathway,
CobM catalyzes the methylation of precorrin-4 at C-11 to
yield precorrin-5. In the anaerobic pathway, CibF
catalyzes the methylation of cobalt-precorrin-4 to
cobalt-precorrin-5. Both CibF and CobM, which are
homologous, are included in this model. There are about
30 enzymes involved in vitamin B12 synthetic pathway.
The enzymes involved in the aerobic pathway are prefixed
Cob and those of the anaerobic pathway Cbi. Most of the
enzymes are shared in both pathways and several of these
enzymes are pathway-specific.
Length = 228
Score = 28.6 bits (65), Expect = 5.7
Identities = 9/19 (47%), Positives = 11/19 (57%)
Query: 140 TLATLQEKVEANNIIMQAL 158
TLA + EKV+ I AL
Sbjct: 202 TLADIAEKVKEAGITRTAL 220
Score = 28.6 bits (65), Expect = 5.7
Identities = 9/19 (47%), Positives = 11/19 (57%)
Query: 170 TLATLQEKVEANNIIMQAL 188
TLA + EKV+ I AL
Sbjct: 202 TLADIAEKVKEAGITRTAL 220
>gnl|CDD|218403 pfam05055, DUF677, Protein of unknown function (DUF677). This
family consists of AT14A like proteins from Arabidopsis
thaliana. At14a has a small domain that has sequence
similarities to integrins from fungi, insects and
humans. Transcripts of At14a are found in all
Arabidopsis tissues and localises partly to the plasma
membrane.
Length = 336
Score = 28.6 bits (64), Expect = 6.2
Identities = 24/70 (34%), Positives = 38/70 (54%), Gaps = 7/70 (10%)
Query: 44 EEILQR----ISHLETSSQTNVKTLFNITKQISGVEHLHSSMMELLE-SIETIENKVDYN 98
EE L+R IS +E Q NVK + NI+ + +E +SM++ +E ++E EN+V
Sbjct: 235 EEALKRQKEIISSMEKGIQVNVKDMENISILVDRLESEITSMLKTVEFAVEHEENEVAVR 294
Query: 99 LPTIQREISK 108
+ EI K
Sbjct: 295 I--AMDEIKK 302
>gnl|CDD|224476 COG1560, HtrB, Lauroyl/myristoyl acyltransferase [Cell envelope
biogenesis, outer membrane].
Length = 308
Score = 28.4 bits (64), Expect = 6.6
Identities = 13/58 (22%), Positives = 22/58 (37%), Gaps = 2/58 (3%)
Query: 165 KSTVSTLATLQEKVEANNIIMQALRNGRSEG--LSLSSQLDDSKNLSPETIMQQMNKI 220
+T + A L A + + +RN G L + + D + E Q+MN
Sbjct: 220 AATTTGPAKLARLTGAAVVPVFPVRNPDGSGYTLHIHPPMTDDPSEDVEADAQRMNDF 277
>gnl|CDD|237582 PRK14011, PRK14011, prefoldin subunit alpha; Provisional.
Length = 144
Score = 27.8 bits (62), Expect = 6.8
Identities = 16/52 (30%), Positives = 30/52 (57%), Gaps = 7/52 (13%)
Query: 42 INEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMELLESIETIEN 93
+NEE+ + LE +Q K + +++S ++ + MELL+SIE++E
Sbjct: 1 MNEELQNQFMALEVYNQQVQK----LQEELSSIDMM---KMELLKSIESMEG 45
>gnl|CDD|224414 COG1497, COG1497, Predicted transcriptional regulator
[Transcription].
Length = 260
Score = 28.1 bits (63), Expect = 7.8
Identities = 22/78 (28%), Positives = 35/78 (44%), Gaps = 9/78 (11%)
Query: 23 HHLEKEQHVINHKMK-TLQAINEEI--LQRISHLETSSQTNVKTLFNITKQISGVEHLHS 79
+Q I K+ TLQA++E I L + +E + + ITK+ G E L
Sbjct: 22 RQPRVKQKEIAKKLGITLQAVSEHIKELVKEGLIEKEGRGE----YEITKK--GAEWLLE 75
Query: 80 SMMELLESIETIENKVDY 97
+ +L E +E +DY
Sbjct: 76 QLSDLRRFSEEVELVLDY 93
>gnl|CDD|112953 pfam04163, Tht1, Tht1-like nuclear fusion protein.
Length = 554
Score = 28.5 bits (63), Expect = 8.2
Identities = 41/147 (27%), Positives = 55/147 (37%), Gaps = 24/147 (16%)
Query: 98 NLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLATLQEKVEANNIIMQA 157
N I I +LE + INS N ES + LA E N I + A
Sbjct: 244 NQDEILETIEQLESEIFDINS--FFANFIEESAG-VHNHIIEDLA--DEFASENAIALSA 298
Query: 158 LSD-----ANYLKSTVSTLAT-----LQEKVEAN----NIIMQALRN---GRSEGL-SLS 199
+ D +L+ + L L+E V+ N N Q L + G E L L
Sbjct: 299 IGDLKDDFLAFLEDRIKNLIEMSEDSLKESVQRNIDFVNSGFQELEDFSIGLKEELGGLK 358
Query: 200 SQLDDSKNLSPETIMQQMNKI-HDSYL 225
L + +NL E I+Q N HDS
Sbjct: 359 KDLSEQQNLEAEEILQGFNDFLHDSKD 385
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 28.0 bits (62), Expect = 9.9
Identities = 23/132 (17%), Positives = 52/132 (39%), Gaps = 18/132 (13%)
Query: 22 KHHLEKEQHVINHKMKTLQAINEEILQ---RISHLETSSQTNVKTLFNITKQISGVEHLH 78
++ LE Q N MK A+ ++ RI LET+ + + + + +Q +
Sbjct: 142 QNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSS 201
Query: 79 SSMMELLESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTV 138
SM + N + + ++ + + ++ +Q + + ++ + L+ T
Sbjct: 202 ESMSA------------ERNA--LLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQ 247
Query: 139 STLATL-QEKVE 149
A L + K E
Sbjct: 248 DIEAELERMKGE 259
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.128 0.373
Gapped
Lambda K H
0.267 0.0746 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,465,612
Number of extensions: 1881489
Number of successful extensions: 1907
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1884
Number of HSP's successfully gapped: 86
Length of query: 423
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 323
Effective length of database: 6,502,202
Effective search space: 2100211246
Effective search space used: 2100211246
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (26.9 bits)