BRC Tools and Services

The BRCs include a wide array of tools and services to enable researchers to locate, filter, process, analyze, and explore data. Many of these tools require logging in to the BRC before they can be used with private data.

Tools for Viruses at IRD/ViPR

Identify Similar Sequences (BLAST)	Uses BLAST algorithms to identify similar nucleotide or amino acid sequences in a variety of custom BRC databases.
Identify Short Peptides in Proteins	Identifies short amino acid strings in target proteins using exact, fuzzy, or pattern matching.
Generate Phylogenetic Tree	Generates phylogenetic trees based on nucleotide or amino acid sequences and displays the results in an interactive tree viewer that supports annotating tree nodes.
Identify Point Mutations	Identifies protein sequences that have a specific amino acid at a specific position.
Pandemic H1N1 Classification	Identifies nucleotide sequences closely related to the 2009 pandemic H1N1 strain.
HPAI H5N1 Clade Classification	Predicts clade for highly pathogenic H5 HA sequences.
Swine H1 Clade Classification	Predicts clade for Global and US Swine H1 viruses.
Sequence Format Conversion	Performs simple sequence format conversions.
HA Subtype Numbering Conversion	Renumbers HA sequences according to a cross-subtype numbering scheme proposed by Burke and Smith.
Align Sequences	Aligns nucleotide or amino acid sequences.
Visualize Aligned Sequences	Displays aligned nucleotide or amino acid sequences in an interactive alignment viewer.
Annotate Sequences	Identifies segment, type, CDS location, subtype (for segments 4 and 6), and possible sequencing errors for a given set of FASTA sequences using the IRD influenza annotation protocol.
VIGOR4 Genome Annotator (ViPR)	Annotates given genome sequence using VIGOR4 (Viral Genome ORF Reader).
GATU Genome Annotator (ViPR)	Annotates given genome sequence using a well-annotated reference genome.
Analyze Sequence Variation (SNP)	Computes the extent of variation at each position of nucleotide or amino acid sequences uploaded or selected from the BRC database.
Metadata-driven Comparative Analysis Tool for Sequences (meta-CATS)	Performs a customized automated comparative analysis to identify positions in the sequence that significantly differ between groups of sequences.
PCR Primer Design	Predicts the optimal primer set(s) for DNA sequences.
Antiviral Resistance Risk Assessment	Determines the presence of amino acid changes associated with altered response to antiviral drugs in user input sequence.
Identify Sequence Features in Segments	Identifies experimentally characterized sequence features or phenotypic variant types.
SARS-CoV-2 Genome Assembly and Annotation (via BV-BRC)	Provides a meta-service that performs tailored genome assembly, annotation, and variation analysis of SARS-CoV-2 genome reads.
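
As an illustration of the three matching modes named in the "Identify Short Peptides in Proteins" entry (exact, fuzzy, and pattern), the following is a minimal Python sketch. It is not the BRC implementation: the function names, the use of mismatch counting (Hamming distance) for "fuzzy", and the use of regular expressions for "pattern" are all assumptions made for illustration, and the example protein string is made up.

```python
import re

def find_exact(protein, peptide):
    """Return 0-based start positions of every exact occurrence (overlaps allowed)."""
    k = len(peptide)
    return [i for i in range(len(protein) - k + 1) if protein[i:i + k] == peptide]

def find_fuzzy(protein, peptide, max_mismatches=1):
    """Return (start, matched_window, mismatches) for windows within max_mismatches.

    Assumption: "fuzzy" is modeled here as substitutions only (Hamming distance).
    """
    k = len(peptide)
    hits = []
    for i in range(len(protein) - k + 1):
        window = protein[i:i + k]
        mismatches = sum(a != b for a, b in zip(window, peptide))
        if mismatches <= max_mismatches:
            hits.append((i, window, mismatches))
    return hits

def find_pattern(protein, pattern):
    """Return (start, matched_text) for a regular-expression pattern.

    The lookahead wrapper lets overlapping matches be reported as well.
    """
    return [(m.start(), m.group(1)) for m in re.finditer(f"(?=({pattern}))", protein)]

# Hypothetical protein fragment used only for demonstration.
protein = "MKVLAAGIVKLAAG"
exact_hits = find_exact(protein, "LAAG")
fuzzy_hits = find_fuzzy(protein, "LAAC", max_mismatches=1)
pattern_hits = find_pattern(protein, "[KV]LA")
```

Reporting 0-based positions keeps the sketch simple; a user-facing tool would more likely report 1-based residue positions.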

Tools for Bacteria at PATRIC

BLAST	Performs searches against public or private genomes in the BRC or other reference databases using DNA or protein sequences.
Codon Tree	Constructs custom phylogenetic trees built from up to 100 genomes using the Codon Tree method.
Comparative Pathway	Identifies a set of pathways based on taxonomy, EC number, pathway ID, pathway name and/or specific annotation type across a set of genomes.
Comprehensive Genome Analysis	Provides a meta-service including genome assembly, annotation, nearest neighbors, subsystem summary, phylogenetic tree, and distinguishing features.
Differential Expression	Integrates user-provided differential expression datasets for comparison with other data sets or analysis using annotations.
FastqUtils	Performs operations on FASTQ files including base call quality reports, read alignment to genomes, quantity/quality assessment, and trimming.
Genome Alignment	Produces a whole-genome alignment of two or more genomes and presents a visualization showing homologous regions and rearrangements.
Genome Annotation	Annotates prokaryotic genomes with gene calls, protein features, and a variety of other genomic features.
Genome Assembly	Assembles sequence reads into contigs with a variety of assembly algorithms tuned to fit certain data types or analysis criteria.
ID Mapper	Maps BRC identifiers to those from other prominent external databases such as GenBank, RefSeq, EMBL, UniProt, KEGG, etc., and vice versa.
Metagenome Binning	Bins reads or contigs from environmental samples into a set of genomes to reconstruct constituent bacterial and archaeal genomes.
Metagenomic Read Mapping	Aligns reads against antibiotic resistance genes, virulence factors, or other custom sets of genes.
Model Reconstruction	Generates draft genome-scale metabolic models using genome annotation and a collection of other analytical methods.
Protein Family Sorter	Displays the distribution of protein families across a set of selected genomes via an interactive heatmap with clustering and filter controls.
Proteome Comparison	Compares up to eight genomes against a reference by identifying corresponding protein sequences, displayed in an interactive circular genome view.
RNASeq Analysis	Provides services for aligning, assembling, and testing differential expression on RNA-Seq data from bacteria or hosts (e.g., human).
Similar Genome Finder	Finds similar public genomes in the BRC database or computes genome distance between genomes.
Taxonomic Classification	Assigns reads or contigs from metagenomic samples to taxonomic bins, providing an initial profile of the possible constituent organisms present in the sample.
TnSeq Analysis	Facilitates determination of essential regions in bacterial genomes from transposon insertion sequencing (Tn-Seq) data.
Variation Analysis	Identifies and annotates sequence variations in short read samples and compares them to a closely related reference genome.
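
To make the idea of a "genome distance" in the Similar Genome Finder entry more concrete, here is a minimal Python sketch of one common family of approaches: comparing the sets of k-mers two sequences contain and taking the Jaccard distance. This is an assumption chosen for illustration, not a description of the algorithm the service actually runs; the function names, the default k, and the toy sequences are hypothetical.

```python
def kmers(seq, k=4):
    """Return the set of all length-k substrings of seq (case-insensitive)."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def jaccard_distance(seq_a, seq_b, k=4):
    """Return 1 - |A ∩ B| / |A ∪ B| over the k-mer sets of the two sequences.

    0.0 means identical k-mer content; 1.0 means no shared k-mers.
    """
    a, b = kmers(seq_a, k), kmers(seq_b, k)
    union = a | b
    if not union:
        # Both sequences are shorter than k, so there is nothing to compare.
        return 0.0
    return 1.0 - len(a & b) / len(union)

# Toy sequences for demonstration only; real genomes are millions of bases long.
same = jaccard_distance("ACGTACGT", "ACGTACGT")
disjoint = jaccard_distance("AAAAAA", "CCCCCC")
partial = jaccard_distance("ACGTAC", "ACGTTT")  # shares 1 of 5 distinct 4-mers
```

At genome scale, exact k-mer sets become large, which is why practical tools typically estimate this quantity from small sketches of the k-mer set rather than computing it directly.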

Tools for Eukaryotes/Vectors at VEuPathDB

BLAST	Finds genes, genomic sequences, ESTs, or PopSet Sequences that have sequence similarity to given input sequence.
Companion	Provides parasite genome annotation as a service using a reference-based approach.
EuPaGDT (Eukaryotic Pathogen CRISPR gRNA Design Tool)	Identifies guide RNA (gRNA) in input gene(s) to guide appropriate gRNA design for many eukaryotic pathogens.
Galaxy	Analyzes RNA-Seq, ChIP-Seq, Variants, and many other data sets with preconfigured workflows and preloaded genomes.
PubMed and Entrez	Provides targeted search results from PubMed and Entrez.
Sequence Retrieval	Retrieves FASTA sequences based on provided identifiers.
Web Services	Provides programmatic access to searches via REST Web Services, returning a list of records (genes, compounds, etc.) in selected formats.
Enrichment Analyses	Provides a tool to determine statistical enrichment in gene lists of gene ontology (GO) terms, metabolic pathways and text in gene product descriptions.
Boolean operations	Provides a tool to combine search results using Boolean operations like intersect, union and minus.
Apollo	Provides a tool to enable the annotation of both functional and structural aspects of genes.
Site Search	Provides a tool to search the entire site for any keyword. Search results are organized in filterable categories.
Genome Browser	Provides a tool to explore genomes and aligned genomic data in an interactive browsable way.
User Comments	Provides a mechanism for users to quickly share their knowledge about genes with the community.
Multiple sequence alignment (isolates)	Provides a multiple sequence alignment tool for nucleotide sequences retrieved from PopSet.
Results downloads	Provides a tool to customize downloads with any available information.
Data analysis searches	Specialized searches enabling sophisticated mining of underlying data.
--- Annotation	Provides a set of searches that enable the identification of genes based on lists of IDs, user comments and annotation updates.
--- Epigenomics	Provides searches to identify genes based on location of epigenetic markers.
--- Function prediction	Provides searches to identify genes based on predicted function using enzyme commission (EC) numbers or GO terms.
--- Gene models	Provides a tool to identify genes based on their type and/or structural features.
--- Genetic variation	Provides a tool to identify genes based on genetic variation in the population, including single nucleotide polymorphisms or copy number variation.
--- Genomic Location	Provides a tool to identify genes based on their location on nuclear or non-nuclear genomes.
--- Immunology	Provides a tool to identify genes whose protein products are predicted to contain epitopes identified by the Immune Epitope Database and Analysis Resource (IEDB).
--- Orthology and synteny	Provides tools to identify genes based on their phyletic pattern across the tree of life.
--- Pathways and interactions	Provides tools to identify genes based on their involvement in metabolic pathways or their interaction with other genes.
--- Phenotype	Provides a tool to identify genes based on available phenotype datasets.
--- Protein features and properties	Provides tools to identify genes based on the properties of the proteins they code for such as molecular weight, isoelectric point or presence of InterPro domains.
--- Protein targeting and localization	Provides tools to identify genes based on predicted elements that may determine their subcellular localization such as signal peptides and transmembrane domains.
--- Proteomics	Provides tools to identify genes based on experimental proteomic data including mass spectrometry and quantitative data.
--- Sequence analysis	Provides tools to identify genes based on specified analyses such as regular expressions to define motifs.
--- Structure analysis	Provides tools to identify genes based on structural analyses.
--- Taxonomy	Provides a tool to identify genes based on their taxonomy.
--- Transcriptomics	Provides tools to identify genes based on experimental transcriptomic data including expressed sequence tags, microarrays or RNA sequencing.
--- PopSet Isolate Sequences	Provides tools to identify isolates retrieved from the PopSet database using information about the isolates or by sequence similarity.
--- Genomic Sequences	Provides tools to retrieve genomic sequences such as contigs, scaffolds or chromosomes.
--- Genomic Segments	Provides tools to identify specific segments in the genomic sequence by location or regular expressions.
--- Single Nucleotide Polymorphisms (SNPs)	Provides tools to find SNPs identified from next generation sequencing data that distinguish specified populations.
--- Expressed Sequence Tags (ESTs)	Provides tools to identify ESTs retrieved from dbEST.
--- Metabolic Pathways	Provides tools to identify metabolic pathways and to dynamically interact with pathways.
--- Compounds	Provides tools to identify substrates and reactants found in metabolic pathways.
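
The "Boolean operations" entry in the table above describes combining search results with intersect, union, and minus. The Python sketch below shows what those three operations mean on lists of record identifiers. It is only an illustration using built-in sets: the function name, the operation labels as strings, and the gene IDs are hypothetical and do not reflect the site's actual interface or identifiers.

```python
def combine_results(result_a, result_b, operation):
    """Combine two lists of record IDs and return a sorted list.

    operation is one of:
      "intersect" - IDs present in both results
      "union"     - IDs present in either result
      "minus"     - IDs in result_a that are not in result_b
    """
    a, b = set(result_a), set(result_b)
    if operation == "intersect":
        combined = a & b
    elif operation == "union":
        combined = a | b
    elif operation == "minus":
        combined = a - b
    else:
        raise ValueError(f"unknown operation: {operation!r}")
    return sorted(combined)

# Hypothetical results of two searches, e.g. "has signal peptide" and
# "upregulated in an RNA-Seq experiment".
signal_peptide = ["GENE_0001", "GENE_0002", "GENE_0005"]
upregulated = ["GENE_0002", "GENE_0003", "GENE_0005"]

both = combine_results(signal_peptide, upregulated, "intersect")
either = combine_results(signal_peptide, upregulated, "union")
only_signal = combine_results(signal_peptide, upregulated, "minus")
```

Note that "minus" is the only one of the three operations where the order of the two inputs matters.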