ºÝºÝߣ

ºÝºÝߣShare a Scribd company logo
M ICROBIAL G ENOMICS TOOL
     B ETA V ERSION 1.0




               By
               Mr. Rajendra kumar Verma
                               &
               Mr. Kishore Shende Sir
TAKE   INPUT FASTA FILE


        ?   In bioinformatics, FASTA format is a text-
            based format for representing either
            nucleotide sequences or peptide sequences, in
            which nucleotides or amino acids are
            represented using single-letter codes. The
            format also allows for sequence names and
            comments to precede the sequences. The
            format originates from the FASTA software
            package, but has now become a standard in
            the field of bioinformatics.

        ?   A sequence in FASTA format begins with a single-line
            description, followed by lines of sequence data. The
            description line is distinguished from the sequence data
            by a greater-than (">") symbol in the first column. The
            word following the ">" symbol is the identifier of the
            sequence, and the rest of the line is the description
            (both are optional). There should be no space between
            the ">" and the first letter of the identifier. It is
            recommended that all lines of text be shorter than 80
            characters. The sequence ends if another line starting
            with a ">" appears; this indicates the start of another
            sequence. A simple example of one sequence in
            FASTA format:
E XAMPLE   OF FASTA FILE
T HIS    IS A TOOL


        It gives output in text
        format and M.S Excel
        format in Frame and M.S
        Excel file.
Nucleotide Composition
                  400
                  300                                                                80
                  200                             Nucleotide
                                                                                     60
                  100                             Compositio
                                                                                     40
                                                  n                                                                   Series1
                    0                                                                20

                         A   T   G        C                                          0
                                                                                          (A+T)%         (G+C)%




             20
Axis Title




             15                                                       30
             10                                                       25
              5                                                       20
              0                                                       15
                                              1                       10                                          Series1
                                                                            Amino Acid
                                           0.8                         5
                                                                       0
                                           0.6
                                                                                Nc                 ENc
                                           0.4                             Series1
                                           0.2
                                     Axis Title
                                              0
                                                  GC3s         AT3s
Microbial genomics tool beta version 1

More Related Content

Microbial genomics tool beta version 1

  • 1. M ICROBIAL G ENOMICS TOOL B ETA V ERSION 1.0 By Mr. Rajendra kumar Verma & Mr. Kishore Shende Sir
  • 2. TAKE INPUT FASTA FILE ? In bioinformatics, FASTA format is a text- based format for representing either nucleotide sequences or peptide sequences, in which nucleotides or amino acids are represented using single-letter codes. The format also allows for sequence names and comments to precede the sequences. The format originates from the FASTA software package, but has now become a standard in the field of bioinformatics. ? A sequence in FASTA format begins with a single-line description, followed by lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column. The word following the ">" symbol is the identifier of the sequence, and the rest of the line is the description (both are optional). There should be no space between the ">" and the first letter of the identifier. It is recommended that all lines of text be shorter than 80 characters. The sequence ends if another line starting with a ">" appears; this indicates the start of another sequence. A simple example of one sequence in FASTA format:
  • 3. E XAMPLE OF FASTA FILE
  • 4. T HIS IS A TOOL It gives output in text format and M.S Excel format in Frame and M.S Excel file.
  • 5. Nucleotide Composition 400 300 80 200 Nucleotide 60 100 Compositio 40 n Series1 0 20 A T G C 0 (A+T)% (G+C)% 20 Axis Title 15 30 10 25 5 20 0 15 1 10 Series1 Amino Acid 0.8 5 0 0.6 Nc ENc 0.4 Series1 0.2 Axis Title 0 GC3s AT3s