Genomalysis

Genomalysis User Guide


PDF Version (250 KB).

Table of Contents


A Note

This is the user guide is for the final Java version of Genomalysis. The Genomalysis project is being migrated to Python and will likely be available in a Django web interface. The final Java version of Genomalysis has been focused on the core functions of mining and viewing proteomic and genomic sequences. A legacy version of Genomalysis, which includes more features (many of them not working), is available from the downloads section of this web site.

Introduction

Genomalysis is a graphical user interface for mining proteomes and genomes for sequences of particular interest to the user. It is essentially a graphical wrapper that allows for the execution, in rapid succession, of third party algorithms. These are algorithms that test sequences in silico for specific properties: transmembrane segments, sequence patterns, secretion signals, etc. By automating the execution of such algorithms on an entire genome or proteome, it becomes much easier and faster for the user to extract sequences that have specific features. Genomalysis allows for the user to construct multiple rules using multiple algorithms in succession to mine for sequences that have multiple properties of interest. In addition to the mining functions, there is a viewing function that allows the user to open input and output files to view the sequences within. Genomalysis is currently implemented for Windows, is written in Java/Swing, and can mine genomic and proteomic information that is in the FASTA format. FASTA is a text only format that contains sequences with headers that describe what they are. These files can be opened by any text editor. The current filters available in Genomalysis are as follows: Sequences are read from a FASTA formatted sequence file into the user configured filter set and are passed or failed by each filter in succession. If a sequence passes all filters, then it is written to an output file in FASTA format.

Top of page Next page