On Tuesday, February 22, 2011 7:51 AM, Herbert J. Bernstein wrote:

>   From the point of view of writing a pure "CIF2" application that is not aware of the whitespace, particular quote marks, comments, etc., those two strings are identical.
>   From the point of view of a more general CIF API, in which comments, magic numbers, and particular quote marks matter, those two strings are different in precisely the same way that the strings 'ABC' and "ABC" are different, and 13.4 and 1.34e1 are different.
>   This is _not_ an ambiguity.  It is a matter of whether we are looking for the information in a file or looking for the representations of the data in the file.

Herbert is right about this.  It doesn't matter which syntactic variant was used to express a data value in an input CIF.  Once the value is parsed, the result is the value.  In particular, under proposal P, """C\"""" expresses a different value than does r"""C\"""", whereas """C\\\"""" and r"""C\"""" express the same value.  The fact that the character sequence C" cannot be expressed via Python raw string format is irrelevant.  An application receiving these values does not need to know and should not care in which form the value was expressed in a CIF, if indeed it was ever expressed in CIF format at all.
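
As an illustration, the distinction between spelling and value can be sketched in Python. This is only a sketch under an assumption: it treats proposal P's triple-quoted and r-prefixed forms as if they followed Python's own string-literal rules, and it uses Python's parser as a stand-in for a CIF parser. The names `spellings` and `values` are hypothetical and exist only for this example.

```python
import ast

# Source-level spellings, held as raw text exactly as they would be typed.
# Assumption: proposal P's forms behave like Python string literals.
spellings = {
    "plain":   r'''"""C\""""''',    # """C\""""
    "raw":     r'''r"""C\""""''',   # r"""C\""""
    "escaped": r'''"""C\\\""""''',  # """C\\\""""
}

# Parse each spelling into the value it denotes.
values = {name: ast.literal_eval(src) for name, src in spellings.items()}

# The plain form denotes C followed by a double quote.
assert values["plain"] == 'C"'
# The raw form keeps the backslash, so it denotes a different value ...
assert values["plain"] != values["raw"]
# ... which the escaped plain form also denotes.
assert values["raw"] == values["escaped"] == 'C\\"'

# The same holds for the examples in the quoted message:
# different spellings, identical parsed values.
assert ast.literal_eval("'ABC'") == ast.literal_eval('"ABC"')
assert float("13.4") == float("1.34e1")
```

Once parsing is done, nothing in `values` records which spelling produced each entry, which is the sense in which the receiving application need not care.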

However, although there is no technical issue here, the fact that an experienced and successful Python and CIF practitioner such as James raised the question is illuminating.  It demonstrates that the complexity of the syntax and semantics provided by proposal P would likely be a source of confusion for developers and users alike.



