Software
NemaScan
To enable genome-wide assiciation mappings and simulations using Caenorhabditis data, we build a wrapper for the Genome-wide Complex Trait Analysis package GCTA called NemaScan. Please follow the vignettes for use cases. Also, the Caenorhabditis Natural Diversity Resource CaeNDR uses this mapping package for GWAS.
EasyXpress
Our lab has developed an R package to manage, process, and plot high-throughput imaging data obtained using the Molecular Devices ImageXpress platform.
EasyFulcrum
For labs interested in field sampling wild strains, we built an R package to read-in, process, analyze, and plot sampling data collected using Fulcrum (apps are below). A procotol for nematode sampling is here.
Fulcrum Apps
Our lab has developed Fulcrum forms/applications for gathering data when collecting wild Caenorhabditis species.
VCF-kit - Documentation
VCF-kit is a command-line based collection of utilities for performing analysis on Variant Call Format (VCF) files. A summary of the commands is provided below. See documentation for details on installation and usage.
Command | Description |
---|---|
calc | Obtain frequency/count of genotypes and alleles. |
call | Compare variants identified from sequences obtained through alternative methods against a VCF. |
filter | Filter variants with a minimum or maximum number of REF, HET, ALT, or missing calls. |
geno | Various operations at the genotype level. |
genome | Reference genome processing and management. |
hmm | Hidden-markov model for use in imputing genotypes from parental genotypes in linkage studies. |
phylo | Generate dendrograms from a VCF. |
primer | Generate primers for variant validation. |
rename | Add a prefix, suffix, or substitute a string in sample names. |
tajima | Calculate Tajima’s D. |
vcf2tsv | Convert a VCF to TSV. |
Cegwas
A set of functions to process phenotype data, perform GWAS, and perform post-mapping data processing for C. elegans.
Install
devtools::install_github("AndersenLab/cegwas")
Usage
pheno <- spike(snps, c(80, 1020))
processed_phenotypes = process_pheno(pheno)
mapping_df = gwas_mappings(processed_phenotypes, cores = 4, only_sig = FALSE)
processed_mapping_df = process_mappings(mapping_df, phenotype_df = processed_phenotypes, CI_size = 50, snp_grouping = 200)
manplot(processed_mapping_df)
linkagemapping
This package includes all data and functions necessary to complete a mapping for the phenotype of your choice using the recombinant inbred lines from Andersen, et al. 2015 (G3). Included with this package are the cross and map objects for this strain set as well a markers.rds file containing a lookup table for the physical positions of all markers used for mapping. See the github page for more information on usage.
Install
devtools::install_github("AndersenLab/linkagemapping")
library("linkagemapping")
Usage
# Get the cross object
data("N2xCB4856cross")
cross <- N2xCB4856cross
# Get the phenotype data
pheno <- readRDS("~/Dropbox/AndersenLab/LabFolders/PastMembers/Tyler/ForTrip/RIAILs2_processed.rds")
# Merge the cross object and the phenotype data
cross <- mergepheno(cross, pheno)
# Perform a mapping with only 10 iterations of the phenotype data for FDR calc
map <- fsearch(cross, permutations = 10)
# Annotate the LOD scores
annotatedlods <- annotate_lods(map, cross)
easysorter
easysorter
is effectively version 2 of the COPASutils package (Shimko and Andersen, 2014). This package is specialized for use with worms and includes additional functionality on top of that provided by COPASutils, including division of recorded objects by larval stage and the ability to regress out control phenotypes from those recorded in experimental conditions. The package is rather specific to use in the Andersen Lab and, therefore, is not available from CRAN. To install you will need the devtools package. You can install both the devtools package and easy sorter using the commands below:
Install
install.packages("devtools")
devtools::install_github("AndersenLab/easysorter")
COPASutils
An R package that presents a logical workflow for the reading, processing, and visualization of data obtained from the Union Biometrica Complex Object Parametric Analyzer and Sorter (COPAS) platform large-particle flow cytometers and a powerful suite of functions for the rapid processing and analysis of large high-throughput screening data sets. It combines the speed of dplyr with the elegance of ggplot2 to make analysis of COPAS data fast and painless.
Install
install.packages("COPASutils")
Additional Resources
liftover-utils
Liftover is a python script that wraps the remap_gff_between_releases.pl
script by Gary Williams. It expands upon the number of filetypes you can liftover:
- VCF/BCF (Requires bcftools)
- GFF
- BED
Additionally, custom file formats can be lifted over by specifying chromosome, start position column, and optionally an end position column.
Install
pip install https://github.com/AndersenLab/liftover-utils/archive/v0.1.tar.gz
Usage
Note that the end_pos_column parameter is optional, meaning you only need to specify a chromosome and base pair location to be lifted over.
liftover <file> <release1> <release2> (bcf|vcf|gff|bed)
liftover <file> <release1> <release2> <chrom_col> <start_pos_column> [<end_pos_column>] [options]
Options:
-h --help Show this screen.
--delim=<delim> File Delimiter; Default is a tab [default: TAB].