| BLAST Algorithms |
| FASTA3 | The FASTA3 program compares protein or DNA sequence to a sequence database using using an improved version of the rapid sequence comparison algorithm described by Lipman and Pearson. |
| WU-BLAST2 |
The Washington University BLAST 2.0 program performs rapid similarity searches of protein and nucleotide sequence databases. |
| Web PSI-BLAST | This is the (unfinished) driver to NCBI web site for PSI-BLAST.
|
| Database Search |
| ACeDB | The ACeDB software performs general purpose queries against the underlying ACeDB database that is popular amongst biologists to store genomic data (e.g. C. elegans, rice and grass sequence databases). |
| FASTA3 | The FASTA3 program compares protein or DNA sequence to a sequence database using using an improved version of the rapid sequence comparison algorithm described by Lipman and Pearson. |
| NCBI Taxonomy | The NCBI Taxonomy database is a hierarchical classification of
organisms whose nucleotide sequences can be found in the GenBank/EMBL/DDBJ databases.
|
| Oracle | Oracle is a relational database management system (RDBMS) that is popular amongst biological scientists for storing genomic, clinical trial, and other data (e.g. Incyte, HGS databases).
|
| SCOP | The SCOP (Structural Classification of Proteins) database provides a comprehensive description of the structural and evolutionary relationships of proteins with known structures including all protein entries in the Protein Data Bank (PDB).
|
| SeqIndex | SeqIndex is a suite of programs for indexing and matching long strings like amino acid and DNA sequences. |
| SeqIndex for Data | SeqIndex for Data can improve search access speeds to very large data files using pre-indexed key fields. |
| Sybase |
Sybase is a relational database management system that is popular amongst biological scientists for storing genomic and other data (e.g. GDB and GSDB).
|
| Web DBEST | Web dbEST, a web site located at NCBI (National Center for Biotechnology Information) is a database of Expressed Sequence Tags (dbEST) that contains nucleic acid sequence data and other information on single-pass cDNA sequences or Expressed Sequence Tags (ESTs) from different organisms.
|
| Web Entrez | Web Entrez, located at the NCBI (National Center for Biotechnology Information) web site, allows the broad querying of several integrated protein, nulceic acid and published literature databases, including GenBank/EMBL/DDBJ, SwissProt, PDB, PIR, PRF, PubMED, OMIM.
|
| Web PDB | Web PDB, a web site located at the Protein Data Bank (PDB) of the Brookhaven National Laboratory, is a database of protein and nucleic acid structures determined using X-ray or NMR techniques that contains the coordinates of the structures along with information about the molecules and about the techniques used to determine the structures.
|
| Web Patent | WebPatent, two web sites located at the US Patent Office and IBM Corporation, allows searching of the US patent database for both claims and figures.
|
| Web TIGR | Web TIGR, a web site located at The Institute for Genome Research (TIGR), provides access to several databases including the Human Gene Index (HGI) and the Expressed Gene Anatomy Database (EGAD).
|
| Web UniGene |
Web UniGene, a web site located at the National Center for Biotechnology Information (NCBI), is an experimental system for automatically partitioning GenBank sequences into a nonredundant set of geneoriented clusters.
|
| WebEnzyme | WebEnzyme, a web site located at the ExPASy web site at the Swiss Institute of Bioinformatics (SIB), is a repository of nomenclature of enzymes and allows the search of enzymes by EC (Enzyme Commission) number, enzyme class and other relevant descriptions.
|
| Web Search |
| HTTP |
HTTP uses the standard http protocol to evaluate Uniform Resource Locators (URLs) and execute native http requests.
|
| Text Analysis Algorithms |
| KEX | KEX (Knowledge EXtraction tool) is a program that locates protein names in MEDLINE abstracts or other English biomedical text.
|
| XML reader | XML Reader allows reading of documents in the eXtended Markup Language (XML) without consulting a Data Type Definition (DTD) file |
| Pattern Recognition Algorithms |
| FASTA3 | The FASTA3 program compares protein or DNA sequence to a sequence database using using an improved version of the rapid sequence comparison algorithm described by Lipman and Pearson. |
| HMMer | Sean Eddy's HMMER program suite (version 1.0) can be used to perform multiple sequence alignment or sensitive database searching using statistical hidden Markov model descriptions of a consensus sequence derived from a family of sequences.
|
| HMMer 2 |
The HMMER2 program uses profile Hidden Markov Models (profile HMMs) to perform nucleotide or amino acid database searches and alignments.
|
| PSORT | Kenta Nakai's PSORT II program predicts the sub-cellular localization sites (e.g. transmembrane, mitochodria, chloroplast) of proteins from an analysis of their amino acid sequences. |
| Prosite Scan | ISREC's (Swiss Institute of Experimental Cancer Research) PrositeScan program can compare a query protein sequence against a locally installed protein profile database to detect common sequence motifs.
|
| Web BLOCKS | The Henikoffs' Web BLOCKS web site performs sequence similarity searches against the BLOCKS database, which is a collection of the most conserved regions in groups of related protein and nucleotide sequences.
|
| Web HMM Search |
Web HMM Search, a web site centered at Washington University in St. Louis, searches a Pfam (protein families grouped by sequence similarity) database located in St. Louis for sequences similar to the input query sequence.
|
| Web pfscan | Web pfscan, Philipp Bucher's pfscan 2.1 protein family recognition program, allows remote searching of a query protein sequence for common motifs or domains against a pre-curated database of protein family profiles at the ISREC (Swiss Institute for Experimental Cancer Research) web site.
|
| hmmPfam |
hmmPfam uses the HMMER software package to search a locally installed Pfam (protein families grouped by sequence similarity) database for sequences similar to the input query sequence.
|
| pfscan 2.1 | Philipp Buscher's pfscan 2.1 protein family recognition program, allows remote searching of a query protein sequence for common motifs or domains against a pre-curated database of protein family profiles at the ISREC (Swiss Institute for Experimental Cancer Research) web site.
|
| Sequence Analysis Algorithms |
| BLAST 2.0 | The BLAST 2.0 program performs rapid similarity searches of protein and nucleotide sequence databases. |
| ClustalW | ClustalW is a general purpose multiple sequence alignment program for nucleic acid or protein sequences.
|
| Codon Frequency Usage |
Codon Usage Frequency calculates simple codon frequencies, amino acid frequencies, and nucleotide frequencies while allowing for translation between nucleic acid and protein sequence. It can also be used for translating protein sequences to nucleic acid sequences and vice-versa using user-specified codon mapping tables.
|
| FASTA3 | The FASTA3 program compares protein or DNA sequence to a sequence database using using an improved version of the rapid sequence comparison algorithm described by Lipman and Pearson. |
| HLA | Human
Leucocyte Antigen).
|
| HMMer | Sean Eddy's HMMER program suite (version 1.0) can be used to perform multiple sequence alignment or sensitive database searching using statistical hidden Markov model descriptions of a consensus sequence derived from a family of sequences.
|
| HMMer 2 |
The HMMER2 program uses profile Hidden Markov Models (profile HMMs) to perform nucleotide or amino acid database searches and alignments.
|
| MinAlig | Zhang Louxin's (Minimum Alignment) is a simple pairwise sequence alignment program for nucleic acid or protein sequences. |
| ORF Extraction | Jean-Michel Claveries JMC ORF Extraction program finds/predicts candidatereading frames in a supplied DNA sequence and subsequently translates them into protein sequences. |
| Prosite Scan | ISREC's (Swiss Institute of Experimental Cancer Research) PrositeScan program can compare a query protein sequence against a locally installed protein profile database to detect common sequence motifs.
|
| SEG filter | Wootton and Federhen's SEG program identifies and masks segments of low compositional complexity in amino acid sequences.
|
| Six Frame Translation | Six Frame Transaltion translates a DNA sequence into protein sequences using all three possiblereading frames from both strands of the nucleic acid sequence.
|
| Ssearch3 | W. Pearson's Ssearch3 program performs similarity searches against databases of protein and nucleotide sequences using the Smith-Waterman algorithm and generates globally optimal pairwise sequence alignments.
|
| WU-BLAST2 |
The Washington University BLAST 2.0 program performs rapid similarity searches of protein and nucleotide sequence databases. |
| Web BLAST |
Web BLAST performs similarity searches against databases of protein and nucleotide sequences using the BLAST website at NCBI (The National Center for Biotechnology Information). |
| Web BLAST 2 | Web BLAST 2 performs similarity searches against sequence databases of protein and nucleotide sequences using the BLAST 2.0 web site located at NCBI (The National Center for Biotechnology Information). |
| Web BLOCKS | The Henikoffs' Web BLOCKS web site performs sequence similarity searches against the BLOCKS database, which is a collection of the most conserved regions in groups of related protein and nucleotide sequences.
|
| Web COG | Web COG, a web site located at NCBI (The National Center for Biotechnology Information), can be used to find whether a given protein or nucleic acid sequence is a member of a known COG (Clusters of Orthologous Group) which are groupings of functionally similar genes from different species that contain the same, common, ancient, conserved domains.
|
| Web DBEST | Web dbEST, a web site located at NCBI (National Center for Biotechnology Information) is a database of Expressed Sequence Tags (dbEST) that contains nucleic acid sequence data and other information on single-pass cDNA sequences or Expressed Sequence Tags (ESTs) from different organisms.
|
| Web HMM Search |
Web HMM Search, a web site centered at Washington University in St. Louis, searches a Pfam (protein families grouped by sequence similarity) database located in St. Louis for sequences similar to the input query sequence.
|
| Web NetStart | NetStart predicts translation initiation sites in vertebrates and
Arabidopsis using neural nets. |
| Web PSI-BLAST | This is the (unfinished) driver to NCBI web site for PSI-BLAST.
|
| Web Patent | WebPatent, two web sites located at the US Patent Office and IBM Corporation, allows searching of the US patent database for both claims and figures.
|
| Web VGC ORF Translation | Web VGC ORF, located at the Virtual Genome Center web site, predicts candidatereading frames (ORFs) in very long, multi-gene, genomic DNA sequences and translates them into candidate protein sequences.
|
| Web nnPredict | Web nnPredict, a web site located at UCSF (University of California San Francsico), predicts secondary structure from a protein sequence.
|
| Web pfscan | Web pfscan, Philipp Bucher's pfscan 2.1 protein family recognition program, allows remote searching of a query protein sequence for common motifs or domains against a pre-curated database of protein family profiles at the ISREC (Swiss Institute for Experimental Cancer Research) web site.
|
| hmmPfam |
hmmPfam uses the HMMER software package to search a locally installed Pfam (protein families grouped by sequence similarity) database for sequences similar to the input query sequence.
|
| pfscan 2.1 | Philipp Buscher's pfscan 2.1 protein family recognition program, allows remote searching of a query protein sequence for common motifs or domains against a pre-curated database of protein family profiles at the ISREC (Swiss Institute for Experimental Cancer Research) web site.
|