icm2 data processing pipeline and analysis codes

icm2 (in-cell mutate-and-map) is a method to characterize RNA structure inside cells. The experiment generates two-dimensional accessibility mapping data under cellular conditions. This code demonstrates how such data can be used to model RNA secondary structure ensembles. The input here is the .fastq files from Illumina sequencing run for an icm2 experiment. Outputs are a set of visualizations of the data and a set of secondary structures and their weights fitted by REEFFIT. The repository accompanies [PAPER CITATION HERE] and reproduces the analysis presented in the paper.

Dependencies

Utilities

cutdapt
bowtie2
bbmap
samtools
bamtools
shapemapper2
viennarna

Python packages

numpy
rdatkit
reeffit

R packages

data.table
cowplot
tidyverse
scales
Biostrings
edgeR
limma
hues
viridis
impute
ggrepel

Python packages rdatkit and reeffit should be installed and properly set up so that they can be called in the working environment. (see https://github.com/ribokit/RDATKit and https://github.com/ribokit/REEFFIT)
Same goes for ViennaRNA package. Its standalone programs should be set up to be called from the environment.

Usage

The set of scripts here are used to do the following:

Pre-process and align the sequencing reads (p0.sh)
Make correlated mutation counts matrix (m2matrix.py)
Data visualization; clustering; output constraints (icm2.R)
Ensemble model fits (reeffit_bootstrap_run.sh)

wrapper.sh and reeffit_bootstrap_run.sh contain slurm job execution commands
bonus_combined.dot is the set of 500 suboptimals sampled by RNAsubopt used as input to REEFFIT exactly for the paper.
The parameters within each script were used to produce the analysis presented in the paper. The index are available under index/reference*. The raw sequencing data can be downloaded from GEO under accession code GSE155656.

Reference

[CITATION GOES HERE]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

icm2 data processing pipeline and analysis codes

Dependencies

Usage

Reference

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
REEFFIT		REEFFIT
index		index
shapemapper2		shapemapper2
README.md		README.md
bonus_combined.dot		bonus_combined.dot
bootstrap_slurm_header.txt		bootstrap_slurm_header.txt
icm2.R		icm2.R
m2matrix.py		m2matrix.py
p0.sh		p0.sh
reeffit_bootstrap_run.sh		reeffit_bootstrap_run.sh
shapemapper_mutation_parser		shapemapper_mutation_parser
submit.sh		submit.sh
wrapper.sh		wrapper.sh

barnalab/icm2p

Folders and files

Latest commit

History

Repository files navigation

icm2 data processing pipeline and analysis codes

Dependencies

Usage

Reference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages