• Welcome
  • Research
  • CV
  • Publications
  • Software
  • Blogs
  • Travel
  • Contact
  • Links

iAmphioxus

~ It's a long way from Amphioxus

iAmphioxus

Tag Archives: SMRTanalysis

Custom installation of PacBio’s GenomicConsensus (Quiver)

04 Thursday May 2017

Posted by iAmphioxus in Bioinformatics, Blogs

≈ Leave a comment

Tags

GenomicConsensus, PacBio, Quiver, SMRTanalysis

It has always been a tricky experience for me to install individual PacBio’s SMRTanalysis tools on isolated machines that do not have the SMRTanalysis package pre-installed. Below is a summary of my recent custom installation of the GenomicConsensus (Quiver) tool, which is used for “polishing” (error-correction) PacBio assemblies.

First of all, I installed Conda to set up an independent python environment that is isolated from the python environment pre-configured for the server. To keep compatible with the SMRTanalysis tools,  I chose python2.7.9 for my conda python environment.

conda install python=2.7.9

Then log out and re-log in.

Install required Python libraries NumPy and h5py, which can be installed via:

conda install numpy=1.12.1 conda install h5py=2.7.0

Now we are ready to install required PacBio libraries

pip install git+https://github.com/PacificBiosciences/pbcore
git clone https://github.com/PacificBiosciences/ConsensusCore
cd ConsensusCore
python setup.py install --swig=$SWIG --boost=$BOOST

Here you should replace $SWIG with the path to your swig executable and $BOOST with the path to your boost installation (the top level directory). (Note that if SWIG is in your $PATH and boost is in /usr/local or /usr/include/, you do not need to specify these flags on the command line. setup.py will find them automatically).

Now, we can install GenomicConsensus (Quiver). Given the significant revision made for GenomicConsensus recently, here I install an older version (v1.1.0) to make sure the installation can succeed. ( I also tried the most recent version but it didn’t work.)

git clone https://github.com/PacificBiosciences/GenomicConsensus.git
cd GenomicConsensus
git reset --hard 654d0276d4a03f269cd1a14ddd7dfd0f54bede45
python setup.py install

Now, GenomicConsensus (Quiver) should have been successfully installed. Let’s test!

quiver -h

usage: variantCaller.py --referenceFilename REFERENCEFILENAME -o
                        OUTPUTFILENAMES [-j NUMWORKERS]
                        [--minConfidence MINCONFIDENCE]
                        [--minCoverage MINCOVERAGE]
                        [--noEvidenceConsensusCall {nocall,reference,lowercasereference}]
                        [--coverage COVERAGE] [--minMapQV MINMAPQV]
                        [--referenceWindow REFERENCEWINDOWSASSTRING]
                        [--alignmentSetRefWindows]
                        [--referenceWindowsFile REFERENCEWINDOWSASSTRING]
                        [--barcode _BARCODE] [--readStratum READSTRATUM]
                        [--algorithm ALGORITHM]
                        [--parametersFile PARAMETERSFILE]
...

References

http://bioinfo-master.ird.fr:8080/smrtanalysis/doc/bioinformatics-tools/GenomicConsensus/doc/HowToQuiver.html

https://github.com/PacificBiosciences/GenomicConsensus/tree/654d0276d4a03f269cd1a14ddd7dfd0f54bede45

http://pb-falcon.readthedocs.io/en/latest/quick_start.html#quick-start

Share this:

  • Email
  • Twitter
  • Facebook

Like this:

Like Loading...

Installing blasr on the the CentOS/RHEL system

13 Monday Jul 2015

Posted by iAmphioxus in Bioinformatics

≈ Leave a comment

Tags

blasr, PacBio, SMRTanalysis

I am working heavily with PacBio data recently, so it will be nice to have PacBio’s blasr (https://github.com/PacificBiosciences/blasr) installed on our local server.

First, make sure the library glibc, glibc-devel and glibc-static have been pre-installed on the system. I got errors when compiling blasr without glibc-devl and glibc-static libraries.

Second, we need to install the hdf5 library. It is a bit tricky with regard to this. I firstly installed the latest version hdf5-1.8.15-patch1 but I got errors when compiling blasr with this version. I did a bit  research online and found out this is due to an API change of the hdf5 library happened since the version hdf5-1.8.12. So I removed the hdf5-1.8.15-patch1 and installed hdf5-1.8.11 instead. Here I recorded the commands that I used for installing the hdf501.8.11 library and blasr as follows.

#!/usr/bin/sh

# download and install hdf5-1.8.11
wget http://www.hdfgroup.org/ftp/HDF5/releases/hdf5-1.8.11/src/hdf5-1.8.11.tar.gz
tar -xvzf hdf5-1.8.11.tar.gz
cd hdf5-1.8.11
./configure --enable-cxx --prefix=/home/yjx/local/lib/libhdf5 --with-zlib=/home/yjx/local/lib/libhdf5
# I don't have the root permission so I installed this library locally under my home directory (/home/yjx). 
# You can use "--prefix=/usr/local/lib/libhdf5 --with-zlib=/usr/local/lib/libhdf5" instead if you have root permission.

make
make check > check.out
# Please check the content of the check.out file generated here to make sure everything is OK
make install > install.log
cd ..


# download and install blasr
wget https://github.com/PacificBiosciences/blasr/archive/smrtanalysis-2.2.tar.gz
tar -xvzf smrtanalysis-2.2.tar.gz
cd blasr-smrtanalysis-2.2
# edit the common.mk file to set the path to your hdf library like this:
HDF5INCLUDEDIR ?= /home/yjx/local/lib/libhdf5/include
HDF5LIBDIR     ?= /home/yjx/local/lib/libhdf5/lib

# Now we are ready to compile blasr. Just type "make":
make
# We probably want to put the executable files that we compiled into a dedicated directory.
make install ASSEMBLY_HOME=/home/yjx/Programs/smrtanalysis/analysis 
# Now you should see all the executable files were copied to the following directory:
"/home/yjx/Programs/smrtanalysis/analysis/bin" 
# Then, add this path to your .bashrc or .bash_profile file.
PATH=$PATH:/home/yjx/Programs/smrtanalysis/analysis/bin
export PATH
# also add the following line:
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/yjx/local/lib/libhdf5/lib
# now restart a login session and type "blasr -h" to test. You should see the help information for 
# blasr correctly displayed.

 blasr -h
   Options for blasr 
   Basic usage: 'blasr reads.{fasta,bas.h5} genome.fasta [-options] 
 option    Description (default_value).

 Input Files.
   reads.fasta is a multi-fasta file of reads.  While any fasta file is valid input, 
               it is preferable to use pls.h5 or bas.h5 files because they contain
               more rich quality value information.

   reads.bas.h5|reads.pls.h5 Is the native output format in Hierarchical Data Format of 
               SMRT reads. This is the preferred input to blasr because rich quality
               value (insertion,deletion, and substitution quality values) information is 
               maintained.  The extra quality information improves variant detection and mapping
               speed.
...

Voilà!

References
https://github.com/PacificBiosciences/blasr/wiki/Blasr-Installation-Qs-&-As
http://www.vcru.wisc.edu/simonlab/bioinformatics/programs/install/blasr.htm

Share this:

  • Email
  • Twitter
  • Facebook

Like this:

Like Loading...

Recent Posts

  • Using anaconda to manage local python environment
  • Local installation of ncbi-blast+ together with the nr and taxonomy database
  • Installing Augustus with manual bamtools installation
  • Custom installation of PacBio’s GenomicConsensus (Quiver)
  • Compiling 64-bit MUMmer

Archives

  • January 2018 (2)
  • May 2017 (2)
  • February 2016 (1)
  • January 2016 (1)
  • August 2015 (1)
  • July 2015 (1)
  • April 2015 (1)
  • June 2013 (1)
  • April 2013 (1)
  • October 2012 (1)
  • May 2012 (1)
  • April 2012 (1)
  • March 2012 (1)
  • February 2012 (2)
  • January 2012 (2)

Categories

  • Blogs (19)
    • Bioinformatics (14)
    • Journal club (3)

2012 annotation arabidopsis Augustus bamtools BioPerl blasr blast C. elegans Centennial CMake conda CPAN developmental biology embryonic genesis Ensembl evolution genewise Genome assembly GenomicConsensus Git GitHub Kent Utilities LiftOver Macports maker maternal effect multicellularity mummer MySQL ncbi new year nr PacBio parallel evolution Perl plant biology Python Quiver resolution Rice rmath SMRTanalysis snoscan Stampy synteny taxonomy tRNAscan-SE UCSC Genome Browser wise2 yeast

Meta

  • Register
  • Log in
  • Entries feed
  • Comments feed
  • WordPress.com

Enter your email address to follow this blog and receive notifications of new posts by email.

Join 320 other followers

Blog Stats

  • 34,727 hits
Locations of visitors to this page
Map

Blog at WordPress.com.

loading Cancel
Post was not sent - check your email addresses!
Email check failed, please try again
Sorry, your blog cannot share posts by email.
%d bloggers like this: