From PneumoWiki
Jump to navigation Jump to search
PangenomeTIGR4
serotype 4
D39
serotype 2
D39V
serotype 2
Hungary19A-6
serotype 19A
EF3030
serotype 19F
670-6B
serotype 6B
6A-10
serotype 6A
70585
serotype 5
A026
serotype 19F
A66
serotype 3
AP200
serotype 11A
ASP0581
serotype 12F
ATCC 49619
serotype 19F
ATCC 700669
serotype 23F
BM6001
serotype 19F
BVJ1JL
serotype 1
CGSP14
serotype 14
G54
serotype 19F
HU-OH
serotype 3
Hu15
serotype 19A
Hu17
serotype 19A
INV104
serotype 1
INV200
serotype 14
JJA
serotype 14
MDRSPN001
serotype 19F
NCTC7465
serotype 1
NCTC7466
serotype 2
NU83127
serotype 4
OXC141
serotype 3
P1031
serotype 1
R6
serotype 2
SP49
serotype 19A
SPN032672
serotype 1
SPN034156
serotype 3
SPN034183
serotype 3
SPN994038
serotype 3
SPN994039
serotype 3
SPNA45
serotype 3
ST556
serotype 19F
TCH8431/19A
serotype 19A
Taiwan19F-14
serotype 19F
Xen35
serotype 4
gamPNI0373
serotype 1

NCBI: 30-NOV-2024

Summary[edit | edit source]

  • organism: Streptococcus pneumoniae EF3030
  • locus tag: EF3030_RS12230 [old locus tag: EF3030_08420 ]
  • pan locus tag?: PNEUPAN003239000
  • symbol: EF3030_RS12230
  • pan gene symbol?:
  • synonym:
  • product: accessory Sec-dependent serine-rich glycoprotein adhesin

Genome View[edit | edit source]

Gene[edit | edit source]

General[edit | edit source]

  • type: CDS, pseudogene
  • locus tag: EF3030_RS12230 [old locus tag: EF3030_08420 ]
  • symbol: EF3030_RS12230
  • product: accessory Sec-dependent serine-rich glycoprotein adhesin
  • replicon: chromosome
  • strand: -
  • coordinates: <1616192..1623542
  • length: 7351
  • essential: unknown

Accession numbers[edit | edit source]

  • Location: NZ_CP035897 <1616192..1623542 NCBI
  • BioCyc:
  • MicrobesOnline:
  • PneumoBrowse for strain D39V:

Phenotype[edit | edit source]

Share your knowledge and add information here. [edit]

DNA sequence[edit | edit source]

  • 1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    4261
    4321
    4381
    4441
    4501
    4561
    4621
    4681
    4741
    4801
    4861
    4921
    4981
    5041
    5101
    5161
    5221
    5281
    5341
    5401
    5461
    5521
    5581
    5641
    5701
    5761
    5821
    5881
    5941
    6001
    6061
    6121
    6181
    6241
    6301
    6361
    6421
    6481
    6541
    6601
    6661
    6721
    6781
    6841
    6901
    6961
    7021
    7081
    7141
    7201
    7261
    7321
    ATATTTTTTAGACGTCAGAAGGGTGAATATAGAGAAACAGACCGAGTAACTCGGTTCAAA
    TTAATCAAATCAGGGAAGCATTGGCTACGGGCCTCGACTTCTCTTTTTGGCTTGTTTAAG
    GTCATGCGTGGTGGTGTTGATACTACTCAGGTCATGACCGAAACGGTAGAAGATAAAGTA
    AGTCATTCAATTACTGGGCTTGATATCCTCAAGGGGATAGTTGCTGCGGGAACTGTCATA
    AGTGGAACCGTTGCAACTCAAACGAAGGTATTTACAAATGAGTCGGCAGTACTTGAAAAA
    ACTGTAGAGAAAACGGATGCTTTGGCAACAAATGGTACAGTAGTTCTAGGTACGATATCT
    ACAAGTAATTCAGCGAGTTCAACTAGTTTGTCAGCTTCAGAGTCGGCAAGTACATCTGCA
    TCTGAGTCAGCCTCAACCAGCGCTTCGACCTCAGCGAGTACAAGTGCATCAGAATCAGCA
    AGTACATCGGCTTCGACAAGTATTTCTGCATCATCTACTGTAGTAGGTTCACAAACAGCT
    GCCGCTACAGAAGCAACTGCTAAGAAGGTCGAAGAAGATCGTAAGAAACCAGCTAGTGAT
    TATGTAGCATCAGTTACAAATGTCAATCTCCAATCTTATGCTAAGCGACGCAAGCGTTCA
    GTAGATTCCATCGAGCAATTGCTGGCTTCTATAAATGCTGCTGTTTTTTCTGGCAATACG
    ATTGTAAATGGCGCCCCTGCAATTAATGCAAGTCTAAACATTGCTAAAAGTGAGACAAAA
    GTTTATACAGGTGAAGGTGTAGATTCGGTATATCATGTTCCAATTTACTATGAATTGACA
    GTGAAAAATGATGGTTCAGAATTGACCTTTACCTATACGGTTACGTATGTGAATCCTACA
    ACAAAGGCTCTTGGTAGTATATCAAGGATGAGTAATGGATATTCTATCTATAATTCAGGT
    ACTTCAATACAAACAATGTTAACCCTTGGCAGTGGTCTTGGTACCCCTTCAGGTGTAAAG
    AACTCCATTACTGACAAAAATGGTGTACAGGTTAAATACTATAATATATCTACAATGACG
    ACGCAGGGTAGTGGGTATACTTGGGCAAATGGTGCCCAAATGAATGGTTGGTATGCTAAG
    CAAGGATATGGATTAACATCATCTTGGACTGTATCAATTACTGGAACGGATACATCCTTT
    ACATTTACCCCTTACGCTGCTAAAACAGATAACATTAGAACTAACTACTTCAATGGTAGA
    GGAAAGGTAGTTGAATCTAGCACGACCAGTCAGTCACTTTCACAGTCTAAGTCACTCTCA
    GTAAGTGCTAGTCAAAGCGCCTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCA
    ACCAGCGCATCAGCCTCAGCAAGTACAAGTGCCTCAGCCTCAGCATCAACGAGTGCGTCA
    GCTTCCGCATCAACGAGTGCGTCAGCTTCAGCAAGTACCTCAGCGTCTGAATCTGCATCA
    ACGAGTGCATCAGCTTCCGCATCAACGAGTGCGTCAGCTTCCGCATCAACGAGTGCGTCG
    GCTTCAGCAAGCACTAGCGCCTCGGCTTCAGCAAGTACTAGCGCCTCAGCATCAGCGTCA
    ACCAGTGCATCAGCCTCAGCAAGTACCAGTGCTTCAGCTTCCGCATCAACGAGTGCGTCG
    GCTTCAGCGTCAACCAGCGCCTCAGCCTCAGCAAGTACCAGTGCCTCGGCTTCAGCAAGC
    ACCAGTGCCTCGGCTTCAGCAAGCACCAGCGCCTCAGCTTCAGCATCAACGAGTGCTTCA
    GCTTCAGCATCAACGAGTGCTTCAGCTTCAGCAAGCACATCAGCTTCTGAGTCAGCGTCA
    ACCAGTGCGTCAGCTTCCGCATCAACAAGTGCCTCAGCTTCAGCAAGTACCAGTGCTTCA
    GCCTCAGCAAGCACCAGTGCTTCGGCTTCAGCAAGTACAAGTGCCTCAGCCTCAGCATCA
    ACCAGTGCCTCGGCTTCAGCAAGCACCAGCGCCTCAGCTTCAGCATCAACAAGTGCATCG
    GCTTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCAAGT
    ATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCA
    GCTTCCGCATCAACGAGTGCATCGGCTTCAGCGTCAACCAGTGCATCAGTCTCAGCAAGC
    ACCAGTGCGTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCCGCATCAACCAGTGCCTCG
    GCTTCAGCAAGCACCAGCGCCTCAGCTTCAGCATCAACAAGTGCCTCGGCTTCAGCAAGC
    ACCAGTGCTTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCATCAACCAGTGCCTCA
    GCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCAGCCTCAGCAAGT
    ACTAGTGCATCAGCTTCCGCATCAACGAGTGCATCGGCTTCAGCGTCAACCAGTGCATCA
    GTCTCAGCAAGCACCAGTGCGTCGGCTTCAGCATCAACCAGTGCCTCAGCCTCAGCAAGT
    ACCAGTGCATCAGCTTCAGCATCAACGAGTGCGTCGGCTTCCGCAAGCACCAGCGCCTCA
    GCCTCAGCAAGCACCAGTGCCTCAGCTTCAGCAAGCACCAGCGCCAGCCTCAGCAAGTAC
    CTCAGCGTCTGAATCGGCCTCAACCAGTGCTTCGGCTTCAGCGTCAACGAGTGCGTCTGA
    GTCAGCATCAACGAGTGCGTCAGCCTCAGCATCAACGAGTGCCTCGGCTTCCGCAAGTAC
    TAGTGCCTCTGAATCCGCATCAACAAGTGCATCAGCTTCAGCAAGCACCAGTGCTTCGGC
    TTCAGCAAGTACTAGTGCATCAGCTTCCGCATCAACGAGTGCGTCGGCTTCAGCGTCAAC
    CAGCGCCTCAGCCTCAGCAAGTACAAGTGCATCAGAGTCAGCAAGTACCAGTGCTTCAGC
    TTCAGCGTCAACCAGTGCCTCAGCCTCAGCCTCAACGAGTGCGTCAGCATCAGCGTCAAC
    CAGTGCTTCAGCATCAGCGTCAACCAGTGCATCAGTCTCAGCAAGCACCAGTGCGTCGGC
    TTCAGCATCAACCAGTGCATCAGCCTCAGCAAGTACCAGTGCATCAGCTTCAGCATCAAC
    GAGTGCGTCGGCTTCCGCAAGTACCAGTGCATCGGCTTCAGCAAGCACCAGCGCCTCAGC
    TTCAGCAAGCACCAGCGCCTCAGCCTCAGCAAGTACCTCAGCGTCTGAATCGGCCTCAAC
    CAGTGCTTCGGCTTCAGCGTCAACGAGTGCGTCTGAGTCAGCATCAACGAGTGCGTCAGC
    CTCAGCATCAACGAGTGCCTCGGCTTCCGCAAGTACTAGTGCTTCAGCTTCAGCAAGCAC
    ATCAGCTTCTGAGTCAGCGTCAACCAGTGCGTCAGCTTCCGCATCAACAAGTGCCTCAGC
    TTCAGCAAGCACCAGCGCCTCAGCTTCAGCAAGTACTAGTGCATCAGCTTCCGCATCAAC
    GAGTGCGTCGGCTTCAGCGTCAACCAGCGCCTCAGCCTCAGCAAGTACCAGTGCCTCGGC
    TTCAGCAAGCACCAGTGCCTCGGCTTCAGCAAGCACCAGCGCCTCAGCTTCAGCAAGTAC
    CAGCGCGTCAGCCTCAGCATCAACCAGTGCCTCAGCCTCAGCAAGCACCTCAGCGTCTGA
    ATCTGCATCAACAAGCGCGTCAGCTTCAGCGTCAACCAGTGCTTCAGCCTCAGCAAGTAC
    AAGTGCCTCAGCCTCAGCATCAACCAGTGCCTCGGCTTCAGCAAGCACCAGCGCCTCAGC
    TTCAGCATCAACAAGTGCATCGGCTTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTAC
    TAGCGCCTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCATCAGC
    CTCAGCAAGTACAAGTGCCTCAGCCTCAGCATCAACGAGTGCGTCAGCTTCCGCATCAAC
    AAGTGCCTCAGCTTCAGCAAGTACCTCAGCGTCTGAATCTGCATCAACGAGTGCATCAGC
    TTCCGCATCAACGAGTGCTTCAGCTTCCGCATCAACGAGTGCGTCGGCTTCAGCAAGCAC
    TAGCGCCTCGGCTTCAGCAAGTACTAGCGCCTCAGCATCAGCGTCAACCAGTGCATCAGC
    CTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCAAGTAT
    CTCAGCGTCTGAATCGGCATCAACGAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCAGC
    TTCCGCATCAACCAGTGCATCGGCTTCAGCGTCAACCAGTGCATCAGTCTCAGCAAGCAC
    CAGTGCGTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCCGCATCAACCAGTGCCTCGGC
    TTCAGCAAGCACCAGCGCCTCAGCTTCAGCATCAACAAGTGCCTCGGCTTCAGCAAGCAC
    CAGTGCTTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCATCAACCAGTGCATCAGC
    CTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCAGCCTCAGCAAGTAC
    TAGTGCATCAGCTTCCGCATCAACGAGTGCATCGGCTTCAGCGTCAACCAGTGCATCAGT
    CTCAGCAAGCACCAGTGCATCGGCTTCAGCGTCAACCAGTGCATCAGTCTCAGCAAGCAC
    CAGTGCGTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCCGCATCAACCAGTGCCTCGGC
    TTCAGCAAGCACCAGCGCCTCAGCTTCAGCATCAACAAGTGCCTCGGCTTCAGCAAGCAC
    CAGTGCTTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCATCAACCAGTGCATCAGC
    CTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCAGCCTCAGCAAGTAC
    TAGTGCATCAGCTTCCGCATCAACGAGTGCATCGGCTTCAGCGTCAACCAGTGCATCAGT
    CTCAGCAAGCACCAGTGCGTCGGCTTCAGCATCAACCAGTGCCTCAGCCTCAGCAAGCAC
    CTCAGCGTCTGAATCTGCATCAACAAGCGCGTCAGCTTCAGCGTCAACCAGTGCTTCAGC
    CTCAGCAAGCACAAGCGCCTCAACTTCAGCATCAACCAGTGCGTCAGCTTCAGCAAGCAC
    CTCAGCGTCTGAATCGGCATCAACGAGTGCTTCGGCCTCAGCAAGCACAAGCGCCTCAGC
    CTCAGCAAGCACCAGTGCTTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCATCAAC
    CAGTGCATCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCAGC
    CTCAGCAAGTACTAGTGCATCAGCTTCCGCATCAACGAGTGCATCGGCTTCAGCGTCAAC
    CAGTGCATCAGTCTCAGCAAGCACCAGTGCATCGGCTTCAGCGTCAACCAGTGCATCAGT
    CTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCCGCATCAAC
    CAGTGCCTCGGCTTCAGCAAGCACCAGCGCCTCAGCTTCAGCATCAACAAGTGCCTCGGC
    TTCAGCAAGCACCAGTGCTTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCATCAAC
    CAGTGCATCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCAGC
    CTCAGCAAGCACAAGCGCCTCAACTTCAGCATCAACCAGTGCGTCAGCTTCAGCAAGCAC
    CTCAGCGTCTGAATCGGCATCAACGAGTGCTTCGGCCTCAGCAAGCACAAGCGCCTCAGC
    CTCAGCAAGTACCAGTGCTTCAGCTTCAGCATCAACAAGTGCATCAGCTTCAGCAAGCAC
    CAGTGCTTCGGCTTCAGCAAGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCGTCGGC
    TTCAGCGTCAACCAGCGCCTCAGCCTCAGCAAGTACCAGTGCCTCGGCTTCAGCAAGCAC
    CAGTGCCTCGGCTTCAGCAAGCACATCAGCTTCTGAGTCAGCGTCAACCAGTGCATCAGC
    TTCAGCATCAACGAGTGCGTCGGCTTCCGCAAGTACCAGTGCATCGGCTTCAGCAAGCAC
    CAGCGCCTCAGCTTCAGCAAGTACCAGCGCGTCAGCCTCAGCATCAACCAGTGCCTCAGC
    CTCAGCAAGCACCTCAGCGTCTGAATCTGCATCAACAAGCGCGTCAGCTTCAGCGTCAAC
    CAGTGCTTCAGCCTCAGCAAGCACAAGCGCCTCAACTTCAGCATCAACCAGTGCGTCAGC
    CTCAGCAAGCACCAGTGCTTCGGCTTCAGCAAGTACAAGTGCCTCAGCCTCAGCATCAAC
    CAGTGCCTCGGCTTCAGCAAGCACCAGCGCCTCAGCTTCAGCATCAACAAGTGCATCGGC
    TTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCAAGTAT
    CTCAGCGTCTGAATCGGCATCAACGAGTGCATCAGCCTCAGCAAGTACAAGTGCCTCAGC
    CTCAGCATCAACGAGTGCGTCAGCTTCCGCATCAACGAGTGCGTCAGCTTCAGCAAGTAC
    CTCAGCGTCTGAATCTGCATCAACGAGTGCATCAGCTTCAGCATCAACGAGTGCTTCAGC
    TTCCGCATCAACGAGTGCGTCGGCTTCAGCAAGCACTAGCGCCTCGGCTTCAGCAAGTAC
    TAGCGCCTCAGCATCAGCGTCAACCAGTGCATCAGCCTCAGCAAGTACCAGTGCTTCAGC
    TTCCGCATCAACGAGTGCGTCGGCATCAGCGTCAACCAGCGCCTCAGCCTCAGCAAGTAC
    CAGTGCCTCGGCTTCAGCAAGCACCAGTGCCTCGGCTTCAGCAAGCACCAGCGCCTCAGC
    TTCAGCATCAACGAGTGCTTCAGCTTCAGCAAGCACATCAGCTTCTGAGTCAGCGTCAAC
    CAGTGCGTCAGCTTCCGCATCAACAAGTGCCTCAGCTTCAGCAAGTACCAGTGCTTCAGC
    CTCAGCAAGCACCAGTGCTTCGGCTTCAGCAAGTACAAGTGCCTCAGCCTCAGCATCAAC
    CAGTGCCTCGGCTTCAGCAAGCACCAGCGCCTCAGCTTCAGCATCAACAAGTGCATCGGC
    TTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCAAGTAT
    CTCAGCGTCTGAATCGGCATCAACGAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCAGC
    TTCCGCATCAACGAGTGCATCGGCTTCAGCG
    60
    120
    180
    240
    300
    360
    420
    480
    540
    600
    660
    720
    780
    840
    900
    960
    1020
    1080
    1140
    1200
    1260
    1320
    1380
    1440
    1500
    1560
    1620
    1680
    1740
    1800
    1860
    1920
    1980
    2040
    2100
    2160
    2220
    2280
    2340
    2400
    2460
    2520
    2580
    2640
    2700
    2760
    2820
    2880
    2940
    3000
    3060
    3120
    3180
    3240
    3300
    3360
    3420
    3480
    3540
    3600
    3660
    3720
    3780
    3840
    3900
    3960
    4020
    4080
    4140
    4200
    4260
    4320
    4380
    4440
    4500
    4560
    4620
    4680
    4740
    4800
    4860
    4920
    4980
    5040
    5100
    5160
    5220
    5280
    5340
    5400
    5460
    5520
    5580
    5640
    5700
    5760
    5820
    5880
    5940
    6000
    6060
    6120
    6180
    6240
    6300
    6360
    6420
    6480
    6540
    6600
    6660
    6720
    6780
    6840
    6900
    6960
    7020
    7080
    7140
    7200
    7260
    7320
    7351

Protein[edit | edit source]

General[edit | edit source]

  • locus tag: EF3030_RS12230 [old locus tag: EF3030_08420 ]
  • symbol: EF3030_RS12230
  • description: accessory Sec-dependent serine-rich glycoprotein adhesin
  • length:
  • theoretical pI:
  • theoretical MW:
  • GRAVY:

Function[edit | edit source]

  • TIGRFAM:
  • TheSEED:
  • PFAM:

Structure, modifications & cofactors[edit | edit source]

  • domains:
  • modifications:
  • cofactors:
  • effectors:

Localization[edit | edit source]

  • PSORTb:
  • DeepLocPro:
  • SignalP:
  • predicted transmembrane helices (TMHMM):

Accession numbers[edit | edit source]

  • GI:
  • RefSeq:
  • UniProt:

Protein sequence[edit | edit source]

Experimental data[edit | edit source]

  • protein localization:
  • interaction partners:

Expression & Regulation[edit | edit source]

Operon[edit | edit source]

Regulation[edit | edit source]

  • regulator:

Transcription[edit | edit source]

  • transcription start site:

Expression data[edit | edit source]

  • PneumoExpress for strain D39V:

Biological Material[edit | edit source]

Mutants[edit | edit source]

Expression vector[edit | edit source]

lacZ fusion[edit | edit source]

GFP fusion[edit | edit source]

two-hybrid system[edit | edit source]

FLAG-tag construct[edit | edit source]

Antibody[edit | edit source]

Other Information[edit | edit source]

You can add further information about the gene and protein here. [edit]

Literature[edit | edit source]

References[edit | edit source]

Relevant publications[edit | edit source]