ChemDiff

Overview

ChemDiff is an Indigo-based utility for finding duplicates in, and visually comparing, two files containing multiple structures. The SDF, SMILES, CML, and MOLFILE input formats are supported. Files can contain a large number of molecules; ChemDiff was tested on files with up to 1 million molecules.

There are at least 3 possible use cases for ChemDiff:

  1. Compare 2 sets of molecules.

  2. Find duplicates in a single set of molecules.

  3. Test whether a specified molecule (or some set of molecules) belongs to another set of molecules.
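All three use cases reduce to set operations once every molecule has a comparable key. The sketch below is only an illustration, not ChemDiff's implementation, which this text does not describe. It assumes each molecule has already been turned into a canonical string (for example, a canonical SMILES produced by a chemistry toolkit) and is passed in as an `(identifier, canonical_key)` pair. The function names are hypothetical.

```python
from collections import defaultdict


def group_by_key(molecules):
    """Map each canonical key to the identifiers of the molecules sharing it."""
    groups = defaultdict(list)
    for mol_id, key in molecules:
        groups[key].append(mol_id)
    return dict(groups)


def compare_sets(first, second):
    """Use case 1: split two sets into unique-to-first, common, unique-to-second."""
    a, b = group_by_key(first), group_by_key(second)
    return {
        "only_first": {k: a[k] for k in a.keys() - b.keys()},
        "common": {k: (a[k], b[k]) for k in a.keys() & b.keys()},
        "only_second": {k: b[k] for k in b.keys() - a.keys()},
    }


def find_duplicates(molecules):
    """Use case 2: keep only the keys that occur more than once in one set."""
    return {k: ids for k, ids in group_by_key(molecules).items() if len(ids) > 1}


def contains(query, reference):
    """Use case 3: for each query molecule, report whether the reference set has it."""
    reference_keys = {key for _, key in reference}
    return {mol_id: key in reference_keys for mol_id, key in query}
```

Grouping by key first keeps each operation close to linear in the number of molecules, which matters for inputs on the scale mentioned in the overview.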

Screenshots

image0 image1

Features

The ChemDiff window contains two tabs: the first is for the input molecules, and the second contains the comparison result. To load a set of molecules, click the corresponding button for the first or the second set. After a set is loaded, ChemDiff asks which property should be used as the identifier for each molecule. An example of this dialog is given below. The table of possible properties contains the name, the serial number in the file, and all properties associated with the molecule.

image2
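For SDF input, the candidate identifiers listed above (name, serial number, and data properties) can be collected with a small parser. This is a minimal sketch, not ChemDiff's code, which relies on the Indigo library. It assumes the standard SDF layout: the molecule name is the first line of each record, data items start with a `> <FIELD>` header followed by value lines and a blank line, and records end with `$$$$`. The function names are hypothetical.

```python
import re

# Data item header, e.g. "> <CAS>" or ">  <ID> (1)"; group 1 is the field name.
FIELD_HEADER = re.compile(r"^>.*<([^>]+)>")


def describe_record(lines, serial):
    """Collect the candidate identifiers of one SDF record."""
    info = {
        "name": lines[0].strip() if lines else "",
        "serial": serial,
        "properties": {},
    }
    i = 0
    while i < len(lines):
        match = FIELD_HEADER.match(lines[i])
        if match is None:
            i += 1
            continue
        # The value is every following line up to the next blank line.
        i += 1
        values = []
        while i < len(lines) and lines[i].strip():
            values.append(lines[i])
            i += 1
        info["properties"][match.group(1)] = "\n".join(values)
    return info


def identifier_candidates(sdf_text):
    """Return one dict of candidate identifiers per record, in file order."""
    candidates = []
    record = []
    for line in sdf_text.splitlines():
        if line.strip() == "$$$$":
            candidates.append(describe_record(record, len(candidates) + 1))
            record = []
        else:
            record.append(line)
    # Tolerate a final record that lacks the "$$$$" terminator.
    if any(line.strip() for line in record):
        candidates.append(describe_record(record, len(candidates) + 1))
    return candidates
```

Properties are kept in a nested dictionary so that a data field that happens to be called `name` or `serial` cannot overwrite the built-in candidates.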

When the “Compare” button is pressed, ChemDiff compares the specified sets of molecules and shows the results on the second tab:

image3

Any table cell can be opened by double-clicking it or by clicking it with the right mouse button:

image4

If a table cell contains multiple molecules, ChemDiff shows a separate window with the molecules from this group:

image5

If you double-click a molecule, you will see a separate window with this molecule:

image6

Several comparison options are available:

image7

If a molecule has a problem, such as an atom with an invalid valence, a red mark is shown. Details can be found by double-clicking the molecule:

image8

The ChemDiff distribution includes a set of test files that demonstrate the molecule comparison features.

More screenshots

image9 image10 image11 image12

Portability

ChemDiff is written in Java and supports Linux and Windows operating systems, both 32-bit and 64-bit versions of each system. Mac OS X 10.5 and 10.6 are supported as well. ChemDiff depends on the Indigo library. The binaries of Indigo for all supported systems are included in the distribution.

Download and Install

Look at the Downloads page for a package suitable for your system. For Windows, an installer executable is provided. On Linux and Mac OS X, you just need to unpack the provided zip file and run the following command:

/path/to/chemdiff-1.0-xxxx/chemdiff

You can also create a symlink in /usr/local/bin:

sudo ln -s /path/to/chemdiff-1.0-xxxx/chemdiff /usr/local/bin

and then you will be able to run ChemDiff by typing chemdiff on the command line.

License

Licensed under the Apache License, Version 2.0 (the “License”); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Commercial Availability

If ChemDiff under the license above does not fit your needs, please contact us to discuss the purchase of a commercial license. You may need a commercial license if you want to:

  • Receive ongoing support and maintenance

  • Include ChemDiff as a component in your proprietary software product