
RE: Treatment of Greek characters in CIF2

Subject: RE: Treatment of Greek characters in CIF2
From: "Bollinger, John C" <John.Bollinger@xxxxxxxxxx>
Date: Thu, 20 Apr 2017 15:56:39 +0000

On Thursday, April 20, 2017 9:26 AM, Robert Hanson wrote:
> Now that John is with us, let's summarize where we are. Feel free to disagree!
> - CIF-JSON is a great idea

One of my concerns is that it is *several* great ideas.

> - COD is already using something like this
> - Jmol is already creating something like this for internal use or export (undocumented)
> - we're talking about the future, not the present
> - no programs as of yet are implementing CIF-JSON, including COD and Jmol
> - as long as we don't cause an incompatibility, we can do whatever we want
> Agreed to so far -  maybe?

Agreed so far.

> - all CIF keys will be made lower case, since in the CIF format it doesn't matter, and in JSON it does
>  and this also allows us to

I'm fine with that, provided that "lower case" is understood as shorthand for a string being in a form that is reproduced unchanged by the following sequence: convert to Unicode normalization form NFD, apply the Unicode case-folding algorithm to the result, and convert the case-folded result to NFD again (case folding does not necessarily preserve normalization).  The resulting form is the basis for Unicode canonical caseless matching, on which CIF2 relies for data name, block code, and frame code matching.  It will indeed require all Latin letters to be presented in lower case, but it will put certain letters in other scripts in upper case, and it has additional effects on characters that have canonical decompositions.
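
As a rough illustration of the three-step form described above, here is a minimal Python sketch. The function names are hypothetical, and it assumes that Python's built-in `str.casefold()` is an acceptable stand-in for Unicode full case folding (its behavior depends on the Unicode data shipped with the interpreter). It is not taken from any CIF or CIF-JSON specification.

```python
import unicodedata


def canonical_caseless_key(name: str) -> str:
    """Return NFD(casefold(NFD(name))), the form discussed above.

    Hypothetical helper: the name is illustrative only.
    """
    decomposed = unicodedata.normalize("NFD", name)
    folded = decomposed.casefold()
    # Case folding does not necessarily preserve normalization,
    # so normalize a second time.
    return unicodedata.normalize("NFD", folded)


def is_in_key_form(name: str) -> bool:
    """True if the string is reproduced unchanged by the transformation."""
    return canonical_caseless_key(name) == name


# Latin letters end up in lower case.
print(canonical_caseless_key("_Cell_Length_A"))

# A precomposed and a decomposed spelling of the same letter get the same key.
print(canonical_caseless_key("\u00c9") == canonical_caseless_key("e\u0301"))

# Greek capital sigma and final sigma fold to the same key.
print(canonical_caseless_key("\u03a3") == canonical_caseless_key("\u03c2"))
```

Under this reading, a CIF-JSON key would be acceptable as "lower case" exactly when `is_in_key_form` returns true for it.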

> - upper-case keys will be non-CIF metadata or other application-specific or translation-specific keys,
>   including CIF1/2 compatibility information

I can accept that.

> - UTF-8 character encoding; \uFFFF for CIF <?> and JSON standard null for <.>

That's ok with me, if indeed we agree that we want CIF-JSON to preserve the distinction.  However, I offer for consideration the proposition that JSON null fits CIF <?> better than it fits CIF <.>, so perhaps we want to flip those assignments.
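
To make the two candidate assignments concrete, here is a small Python sketch that encodes the same values both ways. The sentinel objects, the `encode_value` helper, and its `null_means_unknown` flag are all hypothetical names invented for illustration; only the `\uFFFF` marker and JSON null come from the proposal under discussion.

```python
import json

# Hypothetical in-memory stand-ins for the two special CIF values.
UNKNOWN = object()       # CIF <?>
INAPPLICABLE = object()  # CIF <.>

MARKER = "\uffff"        # the marker string from the proposal


def encode_value(value, null_means_unknown=False):
    """Map one CIF value to a JSON-serializable Python value.

    null_means_unknown=False: <.> becomes null and <?> becomes the marker
    (the assignment as proposed).
    null_means_unknown=True: the flipped assignment suggested above.
    """
    if null_means_unknown:
        as_null, as_marker = UNKNOWN, INAPPLICABLE
    else:
        as_null, as_marker = INAPPLICABLE, UNKNOWN
    if value is as_null:
        return None
    if value is as_marker:
        return MARKER
    return value


values = {"_cell_length_a": "5.43", "_cell_length_b": UNKNOWN, "_cell_length_c": INAPPLICABLE}

as_proposed = {k: encode_value(v) for k, v in values.items()}
flipped = {k: encode_value(v, null_means_unknown=True) for k, v in values.items()}

print(json.dumps(as_proposed))
print(json.dumps(flipped))
```

Either way the distinction between the two special values survives a round trip; the choice only affects which one a generic JSON consumer sees as null.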

> - some question about whether top level should be [] or {}

I agree that consensus has not been reached on that question.

Personally, I'm not much swayed by arguments that CIF-JSON must be able to encode invalid CIF constructs (i.e. duplicate block codes), or to preserve details of the native CIF serialization format that are not actually significant in CIF (i.e. data block order).  I'm not thinking in terms of transforming CIF *files* to JSON, but rather in terms of serializing data that are structured according to the CIF data model.

> - some question about what to do with CIF1 non-latin characters

I wasn't sure that was part of the same conversation, but OK.  It bears discussion either way.

John
