[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
From: SIMON WESTRIP <simonwestrip@btinternet.com>
To: Group finalising DDLm and associated dictionaries <ddlm-group@iucr.org>
Sent: Tuesday, 7 July 2015, 17:07
Subject: Re: [ddlm-group] Semantics of whitespace-delimited values
From: James Hester <jamesrhester@gmail.com>
To: ddlm-group <ddlm-group@iucr.org>
Sent: Tuesday, 7 July 2015, 15:17
Subject: [ddlm-group] Semantics of whitespace-delimited values
I have no issue with question mark or period, as these are necessary for semantic completeness.
What I would like to discuss for CIF2.0 is the following:
(i) The interpretation of a data value as numeric is determined solely by the dictionary with no regard to the particular delimiters used in the CIF file;
(ii) A convention is encouraged for CIF writers whereby numeric values are not enclosed by delimiters.
The advantage of this simpler scheme is a clean separation between syntax and human-relevant semantics. The only CIF applications that can have a use for the CIF1 scheme are those that are written without reference to a dictionary, most obviously pretty-printers that might want to tabulate numbers by lining up decimal points instead of left-justifying. Even if such formatting applications get it wrong, they will not change the meaning of the file and so I would view point (ii) as sufficient support for such applications. Conversely, any application that wishes to operate on a number as opposed to operating on the textual representation of the number will of necessity need to know what this number means and will therefore be written with reference to a dictionary, making it unnecessary to signal "numericness" using whitespace deliimited datavalues.
What do others think? If there is a body of CIF1 applications out there that have been designed to raise errors when values expected to be numeric are enclosed by delimiters, this proposal would represent a further annoying change from CIF1, and it would be good to have some idea of how many such applications there are. I speculate that many applications ignore the delimiter status, for reasons both of laziness, the authority of the dictionary definitions, and the philosophy of writing liberal parsers.
all the best,
James.
_______________________________________________
ddlm-group mailing list
ddlm-group@iucr.org
http://mailman.iucr.org/cgi-bin/mailman/listinfo/ddlm-group
Reply to: [list | sender only]
Re: [ddlm-group] Semantics of whitespace-delimited values
- To: SIMON WESTRIP <simonwestrip@btinternet.com>, Group finalising DDLm and associated dictionaries <ddlm-group@iucr.org>
- Subject: Re: [ddlm-group] Semantics of whitespace-delimited values
- From: SIMON WESTRIP <simonwestrip@btinternet.com>
- Date: Tue, 7 Jul 2015 17:09:00 +0000 (UTC)
- DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=btinternet.com; s=s2048;t=1436289153; bh=Kd/PVq+VDoKj8nYeXLN+JqBIfIhQyGmgDy1l2cpgN+k=;h=Date:From:Reply-To:To:In-Reply-To:References:Subject:From:Subject;b=LsBXFi0UAeBfZeeEhHn1x2IeSM88fs6FtaNFtMaknImDfxjxVN6kTofhRzGrEWwhFmd1+AvaZBuxndGXYtcQvh9pR+X/TV4X+cZUWZlA5PZ163rSVw6qyOnJINi7Yg91gDY4xwl/FHQvBRpDaA1b1p31KcreirWynjiny8czLCuL3mUW24IfUCzuOAHGOQmXwvujcsOIvTDQtXnoI45yTlzAkoxKGSvVyoEbw8B0GfeZuvga6ChR9A4r3oSA9wdg8TxwyNMAH9/q6bJG7JZr7NhjoAU5ZiS3BDDrrOe8uGKEDuhXlN/0P2dmy6hi7PA/wSpkoHy9Nf078jsJ7Bpf4g==
- In-Reply-To: <1961330759.802510.1436285262273.JavaMail.yahoo@mail.yahoo.com>
- References: <1961330759.802510.1436285262273.JavaMail.yahoo@mail.yahoo.com>
OLEX2, JANA, OpenBabel and Avogadro also seem not to care that the numbers are delimited by apostrophes, while enCIFer correctly warns that the values are not correctly formatted.
From: SIMON WESTRIP <simonwestrip@btinternet.com>
To: Group finalising DDLm and associated dictionaries <ddlm-group@iucr.org>
Sent: Tuesday, 7 July 2015, 17:07
Subject: Re: [ddlm-group] Semantics of whitespace-delimited values
A quick test of some programs I have readily available with an 'invalid' CIF1.1 cif that contains delimited site coordinates:
checkCIF (powered by PLATON) - issues alerts but nevertheless processes the CIF using the delimited values as numbers
publCIF - warns that they should not have delimiters but reads the value as a number anyway (according to the dictionary)
Jmol - renders models as expected.
I'll test a few others in due course, but am pleased to see that these programs would not be scuppered by reading 'delimited numbers'. (NB obviously checkCIF/publCIF could fairly easily drop the alerts for CIF2, which are annoying in any case)
Cheers
Simon
From: James Hester <jamesrhester@gmail.com>
To: ddlm-group <ddlm-group@iucr.org>
Sent: Tuesday, 7 July 2015, 15:17
Subject: [ddlm-group] Semantics of whitespace-delimited values
Dear All,
One issue that has not been discussed in the context of the CIF2 syntax is the special interpretation of whitespace-delimited values. In CIF1.1 as recorded in Volume G, a whitespace-delimited question mark and a whitespace-delimited period have a special interpretation as "unknown" and "default/not applicable/null" respectively. Furthermore, only a whitespace-delimited value matching a specified syntax (which includes optional appended esd values) may be interpreted as a numeric value, and it would strictly speaking be a semantic error for a CIF processor to interpret as a number a numeric value enclosed in delimiters.(iii) The precise construction of numeric values is moved into the DDLm attribute dictionary.
--
T +61 (02) 9717 9907
F +61 (02) 9717 3145
M +61 (04) 0249 4148
F +61 (02) 9717 3145
M +61 (04) 0249 4148
_______________________________________________
ddlm-group mailing list
ddlm-group@iucr.org
http://mailman.iucr.org/cgi-bin/mailman/listinfo/ddlm-group
_______________________________________________
ddlm-group mailing list
ddlm-group@iucr.org
http://mailman.iucr.org/cgi-bin/mailman/listinfo/ddlm-group
ddlm-group mailing list
ddlm-group@iucr.org
http://mailman.iucr.org/cgi-bin/mailman/listinfo/ddlm-group
_______________________________________________ ddlm-group mailing list ddlm-group@iucr.org http://mailman.iucr.org/cgi-bin/mailman/listinfo/ddlm-group
Reply to: [list | sender only]
- Follow-Ups:
- Re: [ddlm-group] Semantics of whitespace-delimited values (James Hester)
- References:
- Re: [ddlm-group] Semantics of whitespace-delimited values (SIMON WESTRIP)
- Prev by Date: Re: [ddlm-group] Semantics of whitespace-delimited values
- Next by Date: Re: [ddlm-group] Semantics of whitespace-delimited values
- Prev by thread: Re: [ddlm-group] Semantics of whitespace-delimited values
- Next by thread: Re: [ddlm-group] Semantics of whitespace-delimited values
- Index(es):