[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
--
Peter Murray-Rust
Reader in Molecular Informatics
Unilever Centre, Dep. Of Chemistry
University of Cambridge
CB2 1EW, UK
+44-1223-763069
Reply to: [list | sender only]
Re: Updating COMCIFS' approach to dictionaries
- To: "Discussion list of the IUCr Committee for the Maintenance of the CIFStandard (COMCIFS)" <comcifs@iucr.org>
- Subject: Re: Updating COMCIFS' approach to dictionaries
- From: Peter Murray-Rust <pm286@cam.ac.uk>
- Date: Mon, 20 Feb 2012 13:48:23 +0000
- Authentication-Results: mr.google.com; spf=pass (google.com: domain ofpeter.murray.rust@googlemail.com designates 10.216.131.221 aspermitted sender) smtp.mail=peter.murray.rust@googlemail.com;dkim=pass header.i=peter.murray.rust@googlemail.com
- In-Reply-To: <4F42418D.3020206@bernstein-plus-sons.com>
- References: <CAM+dB2dPr0J4u1KHh1x=v4SUgt+di_fMBvWQZCKKJzyBELGetQ@mail.gmail.com><4F41B112.30000@bernstein-plus-sons.com><472620FF2D2FBB4BB62FD1285C58A04FA48E77C1A5@mail01.ccdc.cam.ac.uk><4F42418D.3020206@bernstein-plus-sons.com>
On Mon, Feb 20, 2012 at 12:50 PM, Herbert J. Bernstein <yaya@bernstein-plus-sons.com> wrote:
+1
We are also developing/using tools in this space such as a dictionary browser/search tool. We're making use of tools such as RDF/SPARQL endpoints and also NoSQL databases especially faceted indexing and search which is very powerful for this sort of thing. We use the CIF core dictionary, have translated it into RDF and use the terms in SPARQL queries.
There is relatively little need to deploy these tools beyond the dictionary creators and my guess is that for the next (say) five years there won't be any performance problems. The evolving web technologies now make it easy to create browse/display/search tools and CIF won't need to re-implement everything from scratch.
Perhaps one should consider an automated web tool similar to a dns
registry that would provide
seni-automatic dispensing of dictionary identifiers, prefixes, category
names and tag names to
make the avoidance of naming collisions simpler to achieve.
+1
We are also developing/using tools in this space such as a dictionary browser/search tool. We're making use of tools such as RDF/SPARQL endpoints and also NoSQL databases especially faceted indexing and search which is very powerful for this sort of thing. We use the CIF core dictionary, have translated it into RDF and use the terms in SPARQL queries.
There is relatively little need to deploy these tools beyond the dictionary creators and my guess is that for the next (say) five years there won't be any performance problems. The evolving web technologies now make it easy to create browse/display/search tools and CIF won't need to re-implement everything from scratch.
One related syntactic/semantic issue to consider would be to move from
the current underscore
separation of prefixes from category names and tag names to the use of a
hierarchical ":" or "."
notation to allow for dns-like arbitrary nesting of naming responsibilities.
+1
Essential in the medium future. CIF names have to be machine parsable without lookup--
Peter Murray-Rust
Reader in Molecular Informatics
Unilever Centre, Dep. Of Chemistry
University of Cambridge
CB2 1EW, UK
+44-1223-763069
Reply to: [list | sender only]
- Follow-Ups:
- Re: Updating COMCIFS' approach to dictionaries (James Hester)
- References:
- Updating COMCIFS' approach to dictionaries (James Hester)
- Re: Updating COMCIFS' approach to dictionaries (Herbert J. Bernstein)
- RE: Updating COMCIFS' approach to dictionaries (Matthew Towler)
- Re: Updating COMCIFS' approach to dictionaries (Herbert J. Bernstein)
- Prev by Date: Re: Updating COMCIFS' approach to dictionaries
- Next by Date: Re: Updating COMCIFS' approach to dictionaries
- Prev by thread: Re: Updating COMCIFS' approach to dictionaries
- Next by thread: Re: Updating COMCIFS' approach to dictionaries
- Index(es):