UGENE integrates dozens of well-known biological tools, algorithms, and original tools in the context of genomics, evolutionary biology, virology, and other branches of life science. UGENE provides a graphical user interface (GUI) for the pre-built tools so biologists with no computer programming skills can access those tools more easily.
Using UGENE Workflow Designer, it is possible to streamline a multi-step analysis. The workflow consists of blocks such as data readers, blocks executing embedded tools and algorithms, and data writers. Blocks can be created with command line tools or a script. A set of sample workflows is available in the Workflow Designer, to annotate sequences, convert data formats, analyze NGS data, etc.
Beside the graphical interface, UGENE also has a command-line interface. Workflows may also be executed thereby.
The Sequence View is used to visualize, analyze and modify nucleic acid or protein sequences. Depending on the sequence type and the options selected, the following views can be present in the Sequence View window:
The Alignment Editor allows working with multiple nucleic acid or protein sequences - aligning them, editing the alignment, analyzing it, storing the consensus sequence, building a phylogenetic tree, and so on.
Phylogenetic Tree Viewer
The Phylogenetic Tree Viewer helps to visualize and edit phylogenetic trees. It is possible to synchronize a tree with the corresponding multiple alignment used to build the tree.
Assembly Browser
Assembly Browser
The Assembly Browser project was started in 2010 as an entry for Illumina iDEA Challenge 2011.[19] The browser allows users to visualize and browse large (up to hundreds of millions of short reads) next generation sequence assemblies. It supports SAM,[20] BAM (the binary version of SAM), and ACE formats. Before browsing assembly data in UGENE, an input file is converted to a UGENE database file automatically. This approach has its pros and cons. The pros are that this allows viewing the whole assembly, navigating in it, and going to well-covered regions rapidly. The cons are that a conversion may take time for a large file, and needs enough disk space to store the database.
The distinguishing feature of Workflow Designer, relative to other bioinformatics workflow management systems is that workflows are executed on a local computer. It helps to avoid data transfer issues, whereas other tools’ reliance on remote file storage and internet connectivity does not.
The elements that a workflow consists of correspond to the bulk of algorithms integrated into UGENE. Using Workflow Designer also allows creating custom workflow elements. The elements can be based on a command-line tool or a script.
Workflows are stored in a special text format. This allows their reuse, and transfer between users.
A workflow can be run using the graphical interface or launched from the command line. The graphical interface also allows controlling the workflow execution, storing the parameters, and so on.
There is an embedded library of workflow samples to convert, filter, and annotate data, with several pipelines to analyze NGS data developed in collaboration with NIH NIAID.[22] A wizard is available for each workflow sample.
Other formats: Bairoch (enzymes info), HMM (HMMER profiles), PWM and PFM (position matrices), SNP and VCF4 (genome variations)
Release cycle
UGENE is primarily developed by Unipro LLC[23] with headquarters in Akademgorodok of Novosibirsk, Russia. Each iteration lasts about 1–2 months, followed by a new release. Development snapshots may also be downloaded.
The features to include in each release are mostly initiated by users.
↑ Efremov, I. E.; Fursov, M. Y; Danilova, Yu. E. (2009). "UGENE: high performance genome analysis suite". Proceedings of the Fifth Moscow International Congress on Biotechnology. 2: 405–406. ISBN978-5-7237-0372-8.
↑ Vaskin, Y.; Khomicheva, I.; Ignatieva, E.; Vityaev, E. (2012). "ExpertDiscovery and UGENE integrated system for intelligent analysis of regulatory regions of genes". In Silico Biology. 11 (3–4): 97–108. doi:10.3233/ISB-2012-0448. PMID22935964.
This page is based on this Wikipedia article Text is available under the CC BY-SA 4.0 license; additional terms may apply. Images, videos and audio are available under their respective licenses.