From PneumoWiki
Jump to navigation Jump to search
PangenomeTIGR4
serotype 4
D39
serotype 2
D39V
serotype 2
Hungary19A-6
serotype 19A
EF3030
serotype 19F
670-6B
serotype 6B
6A-10
serotype 6A
70585
serotype 5
A026
serotype 19F
A66
serotype 3
AP200
serotype 11A
ASP0581
serotype 12F
ATCC 49619
serotype 19F
ATCC 700669
serotype 23F
BM6001
serotype 19F
BVJ1JL
serotype 1
CGSP14
serotype 14
G54
serotype 19F
HU-OH
serotype 3
Hu15
serotype 19A
Hu17
serotype 19A
INV104
serotype 1
INV200
serotype 14
JJA
serotype 14
MDRSPN001
serotype 19F
NCTC7465
serotype 1
NCTC7466
serotype 2
NU83127
serotype 4
OXC141
serotype 3
P1031
serotype 1
R6
serotype 2
SP49
serotype 19A
SPN032672
serotype 1
SPN034156
serotype 3
SPN034183
serotype 3
SPN994038
serotype 3
SPN994039
serotype 3
SPNA45
serotype 3
ST556
serotype 19F
TCH8431/19A
serotype 19A
Taiwan19F-14
serotype 19F
Xen35
serotype 4
gamPNI0373
serotype 1

NCBI: 26-AUG-2017

Summary[edit | edit source]

  • organism: Streptococcus pneumoniae OXC141
  • locus tag: SPNOXC00540 [new locus tag: SPNOXC_RS00315 ]
  • pan locus tag?: PNEUPAN000462000
  • symbol: SPNOXC00540
  • pan gene symbol?: smc_1
  • synonym:
  • product: phage protein

Genome View[edit | edit source]

Gene[edit | edit source]

General[edit | edit source]

  • type: CDS
  • locus tag: SPNOXC00540 [new locus tag: SPNOXC_RS00315 ]
  • symbol: SPNOXC00540
  • product: phage protein
  • replicon: chromosome
  • strand: +
  • coordinates: 43984..47130
  • length: 3147
  • essential: unknown

Accession numbers[edit | edit source]

  • Location: FQ312027 (43984..47130) NCBI
  • BioCyc: see SPNOXC_RS00315
  • MicrobesOnline:
  • PneumoBrowse for strain D39V:

Phenotype[edit | edit source]

Share your knowledge and add information here. [edit]

DNA sequence[edit | edit source]

  • 1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    GTGGTAGAAGTTCTCTCAGCGCTTTTCTATTTTTTTGAGAAAGGAGGAAATATGGCAGGA
    AATATCAAAGGTATCAAAATTGAAATCGATGGCGACACGCAACCCTTGCAGAAGGCGTTA
    AAAGCTATTAATAAAGAGTCTGTTAATACTACAAATGAACTAAAACAAATTGATAAGGCT
    TTAAAGTTTGACACTGGGAACGTTATTTTACTAACCCAAAAACAAGAAGTCTTACAGAAA
    CAAATAGGTATAACCAGAGACAAACTAGAAACTTTAAGACAAGCCCAATCTAAAGTAGAT
    GAGGAATTTAAAAAGGGGAATATTGGTTCTGAACAGTATCGCGCTTTCCAGCGTGAAGTA
    GAAGTGACTCAAAATGTCCTAAAAGGATATGAGGGAAAGCTTGCCAGTCTCACTCAAGCT
    CTTGAAGGAAATGGTGATGCAGCCAAGAACAATCAAGCTCAACTAAAAGAATTGCAGAAT
    GAACAAAAATTGCTTGCTAGTGAATCTGAAAAAGTAGTTAGTTCATTTAAACTGCAAGAA
    AGCCAGATGGGTGCCAATGCTAGCGAAGCAGACAAGTTAGCATTAGCCGAAAAAAAGATT
    GGCGCACAGTCTGAAATCGTCACTCGTCAAATCGAAAACCTTGAGAAGCAGTTAAGCCTA
    ACTAAAGAACAGTATGGCGAAAACTCAGCCGAAGCTAACAAGATGGAAGCAGAGCTAAAT
    CAAGCTAAGACCGCTTACGCTAATCTTAATCAGGAATTAGGAAAACTTGGTAGTACAGCT
    AAGAGCAACCAAACGCAACTGAAAGAATTGCAGAATGAGCAGAGTCAACTTGCTTCAGAG
    ATGAGTAAGGTGACAAGTTCATTCAAACTGCAAGAAAGTGCTTTGGGTTCAAATGCTAGT
    GAAGCCGAGAGAAATGCTCTTGCCCAGAAAAAGATTGGTGCCCAGTCTGAGATTGTAAGT
    AAACAGATTTCAAATCTAGAACAGCAATTGGAAATCACTAAAAAAGAATTTGGTGAGAAC
    TCCACACAAGCCAACAAGATGGAATCTGAGCTAAATCAGGCTAAGACTGCTTTTAATCAT
    CTCAATGATGAGATGAAGGGAACAAAGTCTGCTGCTGATAGCACTCAAGAAAGTTTAAGT
    GAAATCTCAAGAAATTTAAGAGCAGAACTACTTCAACAGTTTAGTGAGAAGTTGAGTGCT
    ATTTCAGAAAAGCTTGTGGAAGTAGGAAAAGAAGCGTTAGAAGCAGCTGCTCAAATGCAA
    GCTAGTAATGCTCAATTTACTACCGTTTTCGGAGATATGGAAACCCAAGCAAGAGAAGCG
    TTGAATGCTATTGGTCAGGAAATGGATATTGTCCCAGAGCGTTTACAAGGGTCATTTACT
    CAGATGGCTTCATTTGCAAAGACTTCAGGATTGGATACAGCAGAAGCTTTGGATCTTACT
    TCTCGTGCAACTAGGGCAGCAGCAGACGGTGCAGCCTTCTATGACAAATCTATTGAGAGC
    GTGACAGAGAGCTTACAATCTTTTTTGAAGGGAAACTTTGCTAACGATGCGGCTCTTGGC
    ATTTCTGCAACAGAAACGACCAGGAATGCCGCTGCAAATAAATTGTACGGAAAGTCATTC
    AAGGACTTGAGCGAAGCGCAGAAGCAATTGACCTTGCTTCAGATGGTCGAAGACGGAAAT
    AAACTCTCAGGAGCTCTTGGACAGGCTGCAAGAGAATCAGACGGCCTAGAAAACGTGATG
    GGGAATCTGAAACAAGCTGGGACCAATGCATTATCTGCTATTGGTCAACCTCTTCTAGAA
    ATGATGATTCCTGTTTTTCAAACCTTGGCAACGATTGTGAAAGGTGTAGCTGAGCTGTTC
    AGTTCCTTACCTGCTCCAGTAAAAGATTTTGTTGTTATTTTAGGAACAGTTGTGACTGCT
    GTAGGGGTCATAGCCCCCATATTCTTATCGTTGCAAGCCCTTGCTGAGTTTTTAAAAATA
    TCTATTGGAGAAATGATAATTGCCGCATTGCCAATTATTGGAACAGCTATTGCAATTGCT
    GCTGCAGTTGCTGCAATTGTTGCTATTGTGAAATATCTCTGGGAAACTAACGAAGGTTTT
    CGAGATGCGGTCACGACCGTTTGGAATGCGATTCTTGAAGTTATCAATGCAGTCGTATCA
    GAGATTTCTAATTTTGTCATGAGTATCTTTGGAACGGTTGTTACTTGGTGGACGGAGAAC
    CAGGAACTTATCAGGACAAGTGCTGAGACTGTCTGGAATGCCATTTATACGGTCATCAGT
    ACAATACTGGATATACTTGGCCCCTTGCTCCAAGCTGGCTGGGATAACATTCAACTGATC
    ATTACAACAACTTGGGAAATCATCAAGATCGTTGTTGAGACTGCAATCAATGTTGTCCTT
    GGTGTTATCCAAGCAGTTATGCAGATCATTACTGGTGATTGGTCAGGAGCTTGGGAAACT
    ATCAAGGGAGTGTTTTCTACTGTATGGCAAGCTATTCAAAGCATTGTTCAGACTATTTTC
    TCAGCTATCCAGAGTTATATTTCAAATATTCTCAACGGCATTTCAGGAACTGTATCAAAT
    ATCTGGAACAGCATCAAGGACACTGTCTCAAATGTGTTAAATGCTATATCTAGTACTGTA
    TCAAGTGTTTGGGAAGGTATCAAGAGTACCATTTCAAGTGCTATCAATGGTGCAAGGGAT
    GCTGTTTCTTCAGCTATTGAAGCCATCAAAGGATTGTTTAACTTCAACATCAGCTGGCCA
    CATATCCCACTACCTCACTTTTACGTGAGTGGTTCGGCCAATCCATTAGATTGGTTGAGT
    CAAGGTGTTCCAAGTATTGGAATCGAATGGTATGCCAAAGGCGGTATCATGACGAAACCA
    ACTATCTTTGGAATGAATGGTAATAACATAATGGTTGGTGGTGAAGCTGGGAATGAAGCA
    GTATTGCCACTTAACGACAAAACGCTTGGAGCCATCGGTCGAGGTATCGCTCAAACTATG
    GGTGGAACTTCACCAACCATTAACATTACCATTACTGGCAACACTGTCAGAGAAGAAGCC
    GACATCAGTCGTATTGCTGATGAGGTGGCTCAGCGTATTGCTGACGAGTTGCAACGTAAG
    ACACAATTGAGAGGAGGGTTTACATGA
    60
    120
    180
    240
    300
    360
    420
    480
    540
    600
    660
    720
    780
    840
    900
    960
    1020
    1080
    1140
    1200
    1260
    1320
    1380
    1440
    1500
    1560
    1620
    1680
    1740
    1800
    1860
    1920
    1980
    2040
    2100
    2160
    2220
    2280
    2340
    2400
    2460
    2520
    2580
    2640
    2700
    2760
    2820
    2880
    2940
    3000
    3060
    3120
    3147

Protein[edit | edit source]

General[edit | edit source]

  • locus tag: SPNOXC00540 [new locus tag: SPNOXC_RS00315 ]
  • symbol: SPNOXC00540
  • description: phage protein
  • length: 1048
  • theoretical pI: 4.62764
  • theoretical MW: 113118
  • GRAVY: -0.0902672

Function[edit | edit source]

  • TIGRFAM:
    Cellular processes Cellular processes Cell division chromosome segregation protein SMC (TIGR02168; HMM-score: 39.4)
    Genetic information processing DNA metabolism Chromosome-associated proteins chromosome segregation protein SMC (TIGR02168; HMM-score: 39.4)
    Cellular processes Cellular processes Cell division chromosome segregation protein SMC (TIGR02169; HMM-score: 31.6)
    Genetic information processing DNA metabolism Chromosome-associated proteins chromosome segregation protein SMC (TIGR02169; HMM-score: 31.6)
    and 3 more
    CXXX repeat peptide modification system protein (TIGR04116; HMM-score: 6.1)
    MSMEG_0570 family protein (TIGR04042; HMM-score: 5.5)
    helix-rich protein (TIGR04523; HMM-score: 4.8)
  • TheSEED  :
    • Phage tail length tape-measure protein
    Phages, Prophages, Transposable elements, Plasmids Phages, Prophages Phage tail proteins 2  Phage tail length tape-measure protein
  • PFAM:
    no clan defined DUF424; Protein of unknown function (DUF424) (PF04242; HMM-score: 16.3)
    DisA; DisA glycoprotein (PF19226; HMM-score: 15.9)
    DUF5517; Family of unknown function (DUF5517) (PF17639; HMM-score: 13.4)
    CCDC-167; Coiled-coil domain-containing protein 167 (PF15188; HMM-score: 13.1)
    and 9 more
    DUF1664; Protein of unknown function (DUF1664) (PF07889; HMM-score: 12.9)
    DUF6616; Family of unknown function (DUF6616) (PF20321; HMM-score: 11.7)
    Fez1; Fez1 (PF06818; HMM-score: 11.3)
    Aha1_BPI (CL0648) JHBP; Haemolymph juvenile hormone binding protein (JHBP) (PF06585; HMM-score: 11.1)
    Phage_tail (CL0348) HK97-gp10_like; Bacteriophage HK97-gp10, putative tail-component (PF04883; HMM-score: 11)
    no clan defined Occludin_ELL; Occludin homology domain (PF07303; HMM-score: 7.8)
    IL32; Interleukin 32 (PF15225; HMM-score: 7.7)
    Prefoldin (CL0200) Prefoldin_3; Prefoldin subunit (PF13758; HMM-score: 7.5)
    no clan defined EzrA; Septation ring formation regulator, EzrA (PF06160; HMM-score: 6.8)

Structure, modifications & cofactors[edit | edit source]

  • domains:
  • modifications:
  • cofactors:
  • effectors:

Localization[edit | edit source]

  • PSORTb: Cytoplasmic Membrane
    • Cytoplasmic Score: 0.32
    • Cytoplasmic Membrane Score: 9.55
    • Cellwall Score: 0.12
    • Extracellular Score: 0.01
    • Internal Helices: 2
  • DeepLocPro: Extracellular
    • Cytoplasmic Score: 0.0002
    • Cytoplasmic Membrane Score: 0.3665
    • Cell wall & surface Score: 0.0091
    • Extracellular Score: 0.6242
  • SignalP: no predicted signal peptide
    • SP(Sec/SPI): 0.013574
    • TAT(Tat/SPI): 0.001061
    • LIPO(Sec/SPII): 0.002466
  • predicted transmembrane helices (TMHMM): 6

Accession numbers[edit | edit source]

  • GI:
  • RefSeq: CBW31715 NCBI
  • UniProt:

Protein sequence[edit | edit source]

  • MVEVLSALFYFFEKGGNMAGNIKGIKIEIDGDTQPLQKALKAINKESVNTTNELKQIDKALKFDTGNVILLTQKQEVLQKQIGITRDKLETLRQAQSKVDEEFKKGNIGSEQYRAFQREVEVTQNVLKGYEGKLASLTQALEGNGDAAKNNQAQLKELQNEQKLLASESEKVVSSFKLQESQMGANASEADKLALAEKKIGAQSEIVTRQIENLEKQLSLTKEQYGENSAEANKMEAELNQAKTAYANLNQELGKLGSTAKSNQTQLKELQNEQSQLASEMSKVTSSFKLQESALGSNASEAERNALAQKKIGAQSEIVSKQISNLEQQLEITKKEFGENSTQANKMESELNQAKTAFNHLNDEMKGTKSAADSTQESLSEISRNLRAELLQQFSEKLSAISEKLVEVGKEALEAAAQMQASNAQFTTVFGDMETQAREALNAIGQEMDIVPERLQGSFTQMASFAKTSGLDTAEALDLTSRATRAAADGAAFYDKSIESVTESLQSFLKGNFANDAALGISATETTRNAAANKLYGKSFKDLSEAQKQLTLLQMVEDGNKLSGALGQAARESDGLENVMGNLKQAGTNALSAIGQPLLEMMIPVFQTLATIVKGVAELFSSLPAPVKDFVVILGTVVTAVGVIAPIFLSLQALAEFLKISIGEMIIAALPIIGTAIAIAAAVAAIVAIVKYLWETNEGFRDAVTTVWNAILEVINAVVSEISNFVMSIFGTVVTWWTENQELIRTSAETVWNAIYTVISTILDILGPLLQAGWDNIQLIITTTWEIIKIVVETAINVVLGVIQAVMQIITGDWSGAWETIKGVFSTVWQAIQSIVQTIFSAIQSYISNILNGISGTVSNIWNSIKDTVSNVLNAISSTVSSVWEGIKSTISSAINGARDAVSSAIEAIKGLFNFNISWPHIPLPHFYVSGSANPLDWLSQGVPSIGIEWYAKGGIMTKPTIFGMNGNNIMVGGEAGNEAVLPLNDKTLGAIGRGIAQTMGGTSPTINITITGNTVREEADISRIADEVAQRIADELQRKTQLRGGFT

Experimental data[edit | edit source]

  • protein localization:
  • interaction partners:

Expression & Regulation[edit | edit source]

Operon[edit | edit source]

Regulation[edit | edit source]

  • regulator:

Transcription[edit | edit source]

  • transcription start site:

Expression data[edit | edit source]

  • PneumoExpress for strain D39V:

Biological Material[edit | edit source]

Mutants[edit | edit source]

Expression vector[edit | edit source]

lacZ fusion[edit | edit source]

GFP fusion[edit | edit source]

two-hybrid system[edit | edit source]

FLAG-tag construct[edit | edit source]

Antibody[edit | edit source]

Other Information[edit | edit source]

You can add further information about the gene and protein here. [edit]

Literature[edit | edit source]

References[edit | edit source]

Relevant publications[edit | edit source]