Semantic Universe Website with Chemical/Polymer Informatics Contributions now Live

Over on Twitter, Semantic Universe has just announced the relaunch of their website. The purpose of the site is “to educate the world about semantic technologies and applications.”

To quote from the website:

“Semantic Universe and Cerebra today announced the launch of the “Semantic Universe Network”, a vibrant educational and networking hub for the global semantic technology marketplace. Semantic Universe Network will be the educational and information resource for the people and companies within the high-growth semantics sector, covering the latest news, opinions, events, announcements, products, solutions, promotions and research in the industry.”

As part of the re-launch, both Lezan Hawizy and I have written two short contributions reviewing the state of Semantic Chemistry and showcasing our work on how semantification of chemistry can happen. The contributions were intended to be short “how to..s” and as such are written in a somewhat chatty style. Here are the links:

Semantic Chemistry

The Semantification of Chemistry

Feedback is welcome.

Reblog this post [with Zemanta]

The Cambridge Polymer Builder.

The first “proof of concept” product of the Cambridge Polymer Informatics Group is up on the web. It is a demo application of a polymer builder, which uses Chemical Markup Language, Polymer Markup Language and Jumbo to build various types of polymers.

The polymers are constructed from small fragments, such as CH, CH2, CO etc. groups with the associated connection table defined in CML. Polymer Markup Language (PML) then contains a set of instructions in terms of how these fragments are “glued” together, how torsions (in 3D representations) are dealt with etc and it can also deal with distributions (of torsional angles just as much as molecular weights) and probabilities (e.g. for random compolymerizations etc.). (More details in a forthcoming paper).
The polymer builder subsequently takes the fragments and the relating PML document as an input to enumerate a full connection table for a macromolecule in CML. (We have not implemented ensemble building in this demonstrator).

polymer-builder.gif

Right now we can build most structural motives, such as homopolymers, block- and random copolymers, dendrimers and branched systems. In this demonstrator, we have not currently implemented bump checking and a number of other controls, but we are working on them as I write. The demonstrator app is available as a webservice here. Please go out and take it for a spin. And, by the way, we are grateful for feedback….so let me have your thoughts and comments via the comments function on the blog.

Polymer Informatics Lecture

A while ago, I gave a lecture about our beginning polymer informatics work here at the Unilever Centre. We videoed it and have now permission to put it online from our sponsors, so here it is:

Thanks to Jim Downing and Dr Andrew Walkingshaw who were behind he camera.

The talk covers our vision for polymer informatics, namely being able to solve the “inverse structure-property relationship problem” and to develop technologies which allow the rapid development of design-rules for polymers.

Subsequently, there is a discussion of what, at the moment, is preventing us from achieving that ambition. The nature of current polymer information systems is discussed together with a brief discussion of the quality of polymer data.
The talk then goes on to introduce the notion of the semantic web and illustrates how semantic web technologies can be used to address some of the problems that were previously discussed. As a part of this, a markup language for polymers and a polymer ontology are briefly discussed.

The lecture also showcases a polymer builder which makes use of the markup language, together with an example of reasoning over the polymer ontology.

You can download the talk to your iPod directly from the Google Website (clicking on the little Google Video Icon on the bottom right of the player will take you straight there).

Polymer Informatics and The Semantic Web – The Solution, Part 1: Adding Structure: Chemical Markup Language

In my last post concerning our work on polumer informatics, I started to discuss how one can add structure to documents in the form of metadata, in order to help correct information retrieval. In particular, I introduced the notion of markup languages to structure information and used an example of a bread recipe, to discuss some general features of XML. So having been through all of that, how can we hold chemical information in a marked-up way.

Being chemists, one of the assumptions that is fundamentally engrained into all of our thinking, is that the structure of a molecule is related to the physical properties of that molecule. Therefore, the most important information a chemist might wish to hold in a marked-up way is probably structural information about a molecule. Well, fortunately, over the past decade or so, Peter Murray-Rust, Henry Rzepa and others have worked on an XML dialect called CML – Chemical Markup Language. Let’s have a look at a small molecule, styrene in our case, and see what some basic CML looks like.

Here’s a representation of styrene (InChI=1/C8H8/c1/h2-7H,1H2) that every chemist will be familiar with:

Styrene

and here’s how the same molecule would be represented in CML:

StyreneInCML

As was the case for our bread recipe, you can see that we have three containers here, namely “atomArray” and “bondArray” enclosed by the container “molecule”. Both arrays are essentially lists of atoms (with attributes specifying which element we are talking about, what id that particular atom has and what it’s 2D coordinates are) and bonds (with attributes telling us between which atom IDs the bond was formed, what the ID of the bond is and also what the bond order is). All of this taken together is what computational chemists call a “connection table”.

Neither hard, nor scary, is it? And the simplest way of holding chemical information in a semantically rich format. In future posts I will delve somewhat deeper into the bowels of CML and show you what else it is capable of.