groupe-16

Project : Genome assembly

Author : Gautier COUTURE

Pre-requisites

Requires Biopython to function correctly :

$ pip install Biopython

Usage :

  • Windows :
    $ py Assembly.py -h
  • Linux :
    $ python3 Assembly.py -h

Program Settings :

-h, --help show this help message and exit
-d , --dataset Path to a FASTA file containing the reads to assemble
-e , --errors Indicates whether the given dataset contains sequencing errors (0: No, 1: Yes)
-o , --out Output file name (and path)
-k , --kmersize K-mer size to use (Optional, Default : 10)

Example:

$ python3 Assembly.py -d DatasetAssembly/Sequencage1.fa -e 1 -o output.fa -k 25
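
For readers who want to see how options like these can be declared, here is a minimal sketch using Python's standard argparse module. It is not taken from Assembly.py: the function name build_parser, the choice to make -d, -e and -o required, and the restriction of -e to 0 or 1 are assumptions based only on the settings listed above.

```python
import argparse


def build_parser():
    # Hypothetical reconstruction of the documented options, not the author's code.
    parser = argparse.ArgumentParser(description="Genome assembly from sequencing reads")
    parser.add_argument("-d", "--dataset", required=True,
                        help="Path to a FASTA file containing the reads to assemble")
    # Assumption: the flag only accepts 0 (no errors) or 1 (errors).
    parser.add_argument("-e", "--errors", type=int, choices=[0, 1], required=True,
                        help="Whether the dataset contains sequencing errors (0: No, 1: Yes)")
    parser.add_argument("-o", "--out", required=True,
                        help="Output file name (and path)")
    # The default of 10 comes from the settings table above.
    parser.add_argument("-k", "--kmersize", type=int, default=10,
                        help="K-mer size (optional, default: 10)")
    return parser


# Parse the same arguments as the example command, passed as an explicit list
# so the sketch runs without a real command line.
args = build_parser().parse_args(
    ["-d", "DatasetAssembly/Sequencage1.fa", "-e", "1", "-o", "output.fa", "-k", "25"]
)
print(args.dataset, args.errors, args.out, args.kmersize)
```

Passing the argument list explicitly keeps the sketch self-contained; a real script would call parse_args() with no arguments to read sys.argv.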

Implemented features :

  • K-mer Index
  • Reconstruction of the reference using sequencing data without errors
  • K-mer filter (Low occurrences)
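
As an illustration of the first and last items above, the following is a minimal sketch of a k-mer index and a low-occurrence filter. It is not the code from Assembly.py: the function names build_kmer_index and filter_low_occurrence, the (read, offset) index layout, and the min_count threshold are all assumptions chosen for the example.

```python
from collections import defaultdict


def build_kmer_index(reads, k):
    """Map each k-mer to the list of (read_id, offset) positions where it occurs."""
    index = defaultdict(list)
    for read_id, seq in enumerate(reads):
        # A read of length n contains n - k + 1 overlapping k-mers.
        for i in range(len(seq) - k + 1):
            index[seq[i:i + k]].append((read_id, i))
    return index


def filter_low_occurrence(index, min_count):
    """Keep only the k-mers that occur at least min_count times (hypothetical threshold)."""
    return {kmer: pos for kmer, pos in index.items() if len(pos) >= min_count}


# Toy reads; the last one ends with a variant that creates rare k-mers.
reads = ["ACGTAC", "CGTACG", "ACGTTC"]
index = build_kmer_index(reads, k=4)
solid = filter_low_occurrence(index, min_count=2)
print(sorted(solid))  # ['ACGT', 'CGTA', 'GTAC']
```

The idea behind such a filter is that k-mers produced by a sequencing error tend to be rare, so dropping k-mers below a count threshold removes many of them before assembly. The right threshold depends on coverage and is not specified here.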

Unfinished features :

  • Graph cleaning

Unimplemented features :

  • Reconstruction of the reference using sequencing data with errors