LiSiCA

Last updated
LiSiCA
Developer(s) Insilab (National Institute of Chemistry Slovenia)
Initial release2015;8 years ago (2015)
Written in Python
Operating system Unix-like, Windows
Type plugin
Website insilab.org/lisica/

LiSiCA (Ligand Similarity using Clique Algorithm) is a ligand-based virtual screening software that searches for 2D and 3D similarities between a reference compound and a database of target compounds which should be represented in a Mol2 format. The similarities are expressed using the Tanimoto coefficients and the target compounds are ranked accordingly. LiSiCA is also available as LiSiCA PyMOL plugin both on Linux and Windows operating systems.

Contents

Description

As an input LiSiCA requires at least one reference compound and database of target compounds. For 3D screening this database has to be a pregenerated database of conformations of target and for 2D screening a topology, that is, a list of atoms and bonds, for each target compound. On each step the algorithm compares reference compound to one of the compounds from target compounds based on their 2D or 3D representation. Both compounds(molecules) are converted to molecular graphs. In 2D and 3D screening the molecular graph vertices represent atoms. In 2D screening edges of molecular graph represent covalent bonds while in 3D screening edges are drawn between every pair of vertices and have no chemical meaning. A product graph generated from molecular graphs is then searched using fast maximum clique algorithm [1] [2] to find the largest substructure common to both compounds. The similarity between compounds is calculated using Tanimoto coefficients and target compounds are ranked according to their Tanimoto coefficients.

Feature overview

An example of 2D screening with LiSiCA PyMOL plugin. LiSiCA 2d output.png
An example of 2D screening with LiSiCA PyMOL plugin.
An example of 3D screening with LiSiCA PyMOL plugin. LiSiCA 3d output.png
An example of 3D screening with LiSiCA PyMOL plugin.

LiSiCA can search 2D and 3D similarities between a reference compound and a database of target compounds. It takes as an input at least one reference compound and a database of target compounds. By default it returns only the compound most similar to the reference compound out of all compounds in database of target compounds. Other optional parameters LiSiCA uses are:

In addition LiSiCA PyMOL plugin also offers to load saved results.

History

Interesting fact

The Slovene word lisica means 'fox', which is why the logo of LiSiCA software is a fox holding two molecules.

Related Research Articles

In chemistry, the oxidation state, or oxidation number, is the hypothetical charge of an atom if all of its bonds to other atoms were fully ionic. It describes the degree of oxidation of an atom in a chemical compound. Conceptually, the oxidation state may be positive, negative or zero. While fully ionic bonds are not found in nature, many bonds exhibit strong ionicity, making oxidation state a useful predictor of charge.

<span class="mw-page-title-main">Clique problem</span> Task of computing complete subgraphs

In computer science, the clique problem is the computational problem of finding cliques in a graph. It has several different formulations depending on which cliques, and what information about the cliques, should be found. Common formulations of the clique problem include finding a maximum clique, finding a maximum weight clique in a weighted graph, listing all maximal cliques, and solving the decision problem of testing whether a graph contains a clique larger than a given size.

In theoretical computer science, the subgraph isomorphism problem is a computational task in which two graphs G and H are given as input, and one must determine whether G contains a subgraph that is isomorphic to H. Subgraph isomorphism is a generalization of both the maximum clique problem and the problem of testing whether a graph contains a Hamiltonian cycle, and is therefore NP-complete. However certain other cases of subgraph isomorphism may be solved in polynomial time.

<span class="mw-page-title-main">Structural alignment</span> Aligning molecular sequences using sequence and structural information

Structural alignment attempts to establish homology between two or more polymer structures based on their shape and three-dimensional conformation. This process is usually applied to protein tertiary structures but can also be used for large RNA molecules. In contrast to simple structural superposition, where at least some equivalent residues of the two structures are known, structural alignment requires no a priori knowledge of equivalent positions. Structural alignment is a valuable tool for the comparison of proteins with low sequence similarity, where evolutionary relationships between proteins cannot be easily detected by standard sequence alignment techniques. Structural alignment can therefore be used to imply evolutionary relationships between proteins that share very little common sequence. However, caution should be used in using the results as evidence for shared evolutionary ancestry because of the possible confounding effects of convergent evolution by which multiple unrelated amino acid sequences converge on a common tertiary structure.

<span class="mw-page-title-main">Structural bioinformatics</span> Bioinformatics subfield

Structural bioinformatics is the branch of bioinformatics that is related to the analysis and prediction of the three-dimensional structure of biological macromolecules such as proteins, RNA, and DNA. It deals with generalizations about macromolecular 3D structures such as comparisons of overall folds and local motifs, principles of molecular folding, evolution, binding interactions, and structure/function relationships, working both from experimentally solved structures and from computational models. The term structural has the same meaning as in structural biology, and structural bioinformatics can be seen as a part of computational structural biology. The main objective of structural bioinformatics is the creation of new methods of analysing and manipulating biological macromolecular data in order to solve problems in biology and generate new knowledge.

<span class="mw-page-title-main">Clique (graph theory)</span> Subset of the vertices of a node-link graph that are all adjacent to each other

In the mathematical area of graph theory, a clique is a subset of vertices of an undirected graph such that every two distinct vertices in the clique are adjacent. That is, a clique of a graph is an induced subgraph of that is complete. Cliques are one of the basic concepts of graph theory and are used in many other mathematical problems and constructions on graphs. Cliques have also been studied in computer science: the task of finding whether there is a clique of a given size in a graph is NP-complete, but despite this hardness result, many algorithms for finding cliques have been studied.

A chemical database is a database specifically designed to store chemical information. This information is about chemical and crystal structures, spectra, reactions and syntheses, and thermophysical data.

In mathematics, computer science and especially graph theory, a distance matrix is a square matrix containing the distances, taken pairwise, between the elements of a set. Depending upon the application involved, the distance being used to define this matrix may or may not be a metric. If there are N elements, this matrix will have size N×N. In graph-theoretic applications, the elements are more often referred to as points, nodes or vertices.

<span class="mw-page-title-main">OpenEye Scientific Software</span> American molecular modelling software company

OpenEye Scientific Software is an American software company founded by Anthony Nicholls in 1997. It develops large-scale molecular modelling applications and toolkits. Following OpenEye's acquisition by Cadence Design Systems for $500 million in September 2022, the company was rebranded to OpenEye Cadence Molecular Sciences and operates as a business unit under Cadence.

<span class="mw-page-title-main">Uranyl</span> Oxycation of uranium

The uranyl ion is an oxycation of uranium in the oxidation state +6, with the chemical formula UO2+
2
. It has a linear structure with short U–O bonds, indicative of the presence of multiple bonds between uranium and oxygen. Four or more ligands may be bound to the uranyl ion in an equatorial plane around the uranium atom. The uranyl ion forms many complexes, particularly with ligands that have oxygen donor atoms. Complexes of the uranyl ion are important in the extraction of uranium from its ores and in nuclear fuel reprocessing.

This page describes mining for molecules. Since molecules may be represented by molecular graphs this is strongly related to graph mining and structured data mining. The main problem is how to represent molecules while discriminating the data instances. One way to do this is chemical similarity metrics, which has a long tradition in the field of cheminformatics.

Protein–ligand docking is a molecular modelling technique. The goal of protein–ligand docking is to predict the position and orientation of a ligand when it is bound to a protein receptor or enzyme. Pharmaceutical research employs docking techniques for a variety of purposes, most notably in the virtual screening of large databases of available chemicals in order to select likely drug candidates. There has been rapid development in computational ability to determine protein structure with programs such as AlphaFold, and the demand for the corresponding protein-ligand docking predictions is driving implementation of software that can find accurate models. Once the protein folding can be predicted accurately along with how the ligands of various structures will bind to the protein, the ability for drug development to progress at a much faster rate becomes possible.

<span class="mw-page-title-main">Virtual screening</span>

Virtual screening (VS) is a computational technique used in drug discovery to search libraries of small molecules in order to identify those structures which are most likely to bind to a drug target, typically a protein receptor or enzyme.

This list of structural comparison and alignment software is a compilation of software tools and web portals used in pairwise or multiple structural comparison and structural alignment.

<span class="mw-page-title-main">Chemical similarity</span> Chemical term

Chemical similarity refers to the similarity of chemical elements, molecules or chemical compounds with respect to either structural or functional qualities, i.e. the effect that the chemical compound has on reaction partners in inorganic or biological settings. Biological effects and thus also similarity of effects are usually quantified using the biological activity of a compound. In general terms, function can be related to the chemical activity of compounds.

Computer Atlas of Surface Topography of Proteins (CASTp) aims to provide comprehensive and detailed quantitative characterization of topographic features of protein, is now updated to version 3.0. Since its release in 2006, the CASTp server has ≈45000 visits and fulfills ≈33000 calculation requests annually. CASTp has been proven as a confident tool for a wide range of researches, including investigations of signaling receptors, discoveries of cancer therapeutics, understanding of mechanism of drug actions, studies of immune disorder diseases, analysis of protein–nanoparticle interactions, inference of protein functions and development of high-throughput computational tools. This server is maintained by Jie Liang's lab in University of Illinois at Chicago.

Early twenty-first century pesticide research has focused on developing molecules that combine low use rates and that are more selective, safer, resistance-breaking and cost-effective. Obstacles include increasing pesticide resistance and an increasingly stringent regulatory environment.

Lisica may refer to:

<span class="mw-page-title-main">MaxCliqueDyn algorithm</span>

The MaxCliqueDyn algorithm is an algorithm for finding a maximum clique in an undirected graph.

ProBiS is a computer software which allows prediction of binding sites and their corresponding ligands for a given protein structure. Initially ProBiS was developed as a ProBiS algorithm by Janez Konc and Dušanka Janežič in 2010 and is now available as ProBiS server, ProBiS CHARMMing server, ProBiS algorithm and ProBiS plugin. The name ProBiS originates from the purpose of the software itself, that is to predict for a given Protein structure Binding Sites and their corresponding ligands.

References

  1. Janez Konc; Dusanka Janezic (2007). "An improved branch and bound algorithm for the maximum clique problem" (PDF). MATCH Communications in Mathematical and in Computer Chemistry. 58 (3): 569–590.
  2. Matjaz Depolli; Janez Konc; Kati Rozman; Roman Trobec; Dušanka Janežič (2013). "Exact parallel maximum clique algorithm for general and protein graphs". Journal of Chemical Information and Modeling . 53 (9): 2217–2228. doi:10.1021/ci4002525. PMID   23965016.
  3. Samo Lešnik; Tanja Štular; Boris Brus; Damijan Knez; Stanislav Gobec; Dušanka Janežič; Janez Konc (2015). "LiSiCA: A Software for Ligand-Based Virtual Screening and Its Application for the Discovery of Butyrylcholinesterase Inhibitors". Journal of Chemical Information and Modeling . 55 (8): 1521–1528. doi:10.1021/acs.jcim.5b00136. PMID   26158767.
  4. Athira Dilip; Samo Lešnik; Tanja Štular; Dušanka Janežič; Janez Konc (2016). "Ligand-based virtual screening interface between PyMOL and LiSiCA". Journal of Cheminformatics . 8 (46): 46. doi: 10.1186/s13321-016-0157-z . PMC   5013575 . PMID   27606012.