forked from refresh-bio/KMC

Fast and frugal disk based k-mer counter

This branch is 17 commits ahead of, 198 commits behind refresh-bio/KMC:master.

KMC (modified by dfguan)

This is a modified version of KMC that compares k-mers from short-read data against k-mers from an assembly and plots the result.

Installation

git clone https://github.com/dfguan/KMC.git && cd KMC
make -j 16 

If the source code compiles successfully, there will be a bin directory containing all the executable files you need. Otherwise, please refer to the old README.

Quick Start

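Given an assembly in fasta/fasta.gz format and a list of short-read files in fastq/fastq.gz format, you can use the following commands to make a comparison plot. Here $asm is the assembly file, $reads is the read file list, and $output is the prefix for the output files.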

bin/kmc -k21 -ci0 -fm -t12 -m20 -sm $asm $asm.prefix tmp
bin/kmc -k21 -ci0 -t12 -m20 -sm @$reads $reads.prefix tmp
bin/kmc_tools analyze $reads.prefix $asm.prefix $output.matrix
python3 spectra.py $output.matrix $output.png

When all the commands have finished, you will see a figure like this: kmc_plot.png
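
The four Quick Start commands share the same k-mer size, thread count, and memory limit, so it can help to build them in one place. The following is a minimal sketch, not part of this repository: the function name build_commands and its parameters are hypothetical, and it only assembles the argument lists shown above without running anything.

```python
import shlex


def build_commands(asm, reads_list, output, k=21, threads=12, mem_gb=20, tmp="tmp"):
    """Return the four Quick Start commands as argument lists (nothing is run).

    Hypothetical helper: only the flags shown in the Quick Start are used.
    The tmp working directory is assumed to exist already.
    """
    asm_db = f"{asm}.prefix"
    reads_db = f"{reads_list}.prefix"
    matrix = f"{output}.matrix"
    shared = [f"-t{threads}", f"-m{mem_gb}", "-sm"]
    return [
        # Count k-mers in the assembly (-fm: multi-FASTA input).
        ["bin/kmc", f"-k{k}", "-ci0", "-fm", *shared, asm, asm_db, tmp],
        # Count k-mers in the reads; "@" marks a file that lists the inputs.
        ["bin/kmc", f"-k{k}", "-ci0", *shared, f"@{reads_list}", reads_db, tmp],
        # Compare the two k-mer databases and write the matrix.
        ["bin/kmc_tools", "analyze", reads_db, asm_db, matrix],
        # Plot the matrix.
        ["python3", "spectra.py", matrix, f"{output}.png"],
    ]


if __name__ == "__main__":
    # Example file names are placeholders.
    for cmd in build_commands("asm.fa.gz", "reads.list", "out"):
        print(shlex.join(cmd))
```

Each list can be passed to subprocess.run(cmd, check=True) from the repository root if you prefer to run the steps from Python.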

Instructions

  1. How to make a list of short-read files (fastq/fastq.gz)

     The read file list is a tab-delimited text file with one read file per line, following a simple syntax: <READ_FILE_PATH><tab>[TRIM_NUMBER]. Note that if TRIM_NUMBER is not set, it is treated as 0. If you only have one read file, you can run the KMC command directly without a read file list and use -d to set the number of bases to trim off.
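
The read file list format described above can be generated with a few lines of Python. This is a minimal sketch under stated assumptions: the function name format_read_list and the example file names are hypothetical, and a missing TRIM_NUMBER is written as a path-only line, which the text above says is treated as 0.

```python
def format_read_list(entries):
    """Format (path, trim) pairs as <READ_FILE_PATH><tab>[TRIM_NUMBER] lines.

    Hypothetical helper. A trim of None omits the optional second column.
    """
    lines = []
    for path, trim in entries:
        if trim is None:
            lines.append(path)
        else:
            if trim < 0:
                raise ValueError(f"trim number must be non-negative: {trim}")
            lines.append(f"{path}\t{trim}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Placeholder file names; the second file has no trim number.
    text = format_read_list([("reads_1.fastq.gz", 5), ("reads_2.fastq.gz", None)])
    with open("reads.list", "w") as handle:
        handle.write(text)
```

The resulting reads.list is the file passed as @$reads in the Quick Start.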

Notice:

This plot covers only a small part of what the K-mer Analysis Toolkit (KAT), from which it was learned, can do. If you'd like to know more, please visit their website: kat-web.
