[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Reply to: [list | sender only]
Re: CIF-JSON draft 2017-05-08
- Subject: Re: CIF-JSON draft 2017-05-08
- From: Robert Hanson <hansonr@xxxxxxxxxx>
- Date: Mon, 8 May 2017 16:39:26 -0500
- DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=stolaf.edu; s=stolaf;h=mime-version:in-reply-to:references:from:date:message-id:subject:to;bh=JRS0SaQj5Fv2gbRrYLpBbisw9p1cTVyd4PvsmbgGR8E=;b=Juc7s7sJUabf3bmA0wXEpx/SOIHhon0xb2NqPds1q8VMK+SfA8XfpzdaXd+J5SZWTh7Hj20GQlDb9CFzxRGR8q/Y9YiW5LS3siYOPV800M0FXlnBj/YBXgDaotEt5W1/dFfhjpzEGZUnc8U6IEmHPiYIAoErIewkj7KgCvmoO00=
- In-Reply-To: <MWHPR04MB0512E4FA07327A5E9103106CE0EE0@MWHPR04MB0512.namprd04.prod.outlook.com>
- References: <CAM+dB2cwoCG6LhPUePRup_hQtM9mXqwL4tULTPf-WGwJGtKrOA@mail.gmail.com><MWHPR04MB051220BFF5C7093CD86702CCE0EE0@MWHPR04MB0512.namprd04.prod.outlook.com><CAF_YUvW=i0XjfzmgA=m03a=X4Y03+FfH8_TAZ2dNPVAhCmuy5w@mail.gmail.com><MWHPR04MB0512E4FA07327A5E9103106CE0EE0@MWHPR04MB0512.namprd04.prod.outlook.com>
On Mon, May 8, 2017 at 1:13 PM, Bollinger, John C <John.Bollinger@stjude.org> wrote:
On Monday, May 08, 2017 11:46 AM, Robert Hanson wrote:
> John, I'm still not clear what you mean by this:
>
>> It is in service to both points that I raised the issue of the ambiguity between CIF lists as values on one hand and multiple values in a loop on the other.
>> I am satisfied to have loop tags be required so as to enable that to be disambiguated, but the more I think about it, the more convinced I become that the best solution would be to present every item’s values, whether one or many, in a JSON array.
>
> Could you give an example?
There are two overlapping issues here. I sent an example of the first to the group on May 4th, showing how the same CIF-JSON could be interpreted in two different ways. That depends on the fact that as (still) specified, JSON arrays are used both as containers for the multiple values that a looped item takes, and also directly to represent individual values that are CIF2 lists. That can be disambiguated under the latest draft by referring to the relevant "loop tags" to determine whether a given item is presented as part of a loop. Of course, that disambiguation depends on the values items so identified being presented in an array, even when there is only one loop packet, and on List values that do not belong to a looped item being presented directly. James remarked that it could also be disambiguated by the program having prior knowledge of the expected data type for a given item. I won't repeat the whole message, but here's the ambiguous JSON:
{
"block": {
"_xyz": [ 0.1, 0.2, 0.3 ]
}
}
I don't think you can have a CIF loop for a subkey. Can you? How would you represent that as a CIF loop?
As you say, the current CIF-JSON allows both of these:
"_chem_comp_atom.model_Cartn_x" : "0.000"
"_chem_comp_atom.model_Cartn_x":["-23.107","-22.157","-23.42 Moreover, it *also* allows this:4"]
"_chem_comp_atom.model_Cartn_x" : ["0.000"]
You mean, specifically:
"_chem_comp_atom.model_Cartn_x" : ["0.000"]
"_chem_comp_atom.model_Cartn_y":["-23.107"]
"_chem_comp_atom.model_Cartn_x
"_chem_comp_atom.model_Cartn_y
"_chem_comp_atom.model_Cartn_z":["-23.107"]
...
...
as opposed to
"_chem_comp_atom.model_Cartn_x" : "0.000"
"_chem_comp_atom.model_Cartn_y": "-23.107"
"_chem_comp_atom.model_Cartn_z": "-23.107"
"_chem_comp_atom.model_Cartn_x
"_chem_comp_atom.model_Cartn_y
...
in
Yes, that's exactly what I'm suggesting. No option for an item's values being presented outside an array. That solves both problems: we have only one representation for item values (contained in an array), and therefore don't have to perform an array test, AND there is no longer any ambiguity about whether the outermost array is a container for values, or the value itself.
And you seem indeed to have gotten it, as your magCIF example is on target. It seems clear that this:
> loop_
> _parent_propagation_vector.id
> _parent_propagation_vector.kxkykz is intended to be represented like so:
> k1 [-0.75 0.75 -0.75]
> "_parent_propagation_vector.id": ["k1"] but consider this alternative CIF:
> "_parent_propagation_vector.kxkykz": [[-0.75 0.75 -0.75]]
_parent_propagation_vector.id k1
_parent_propagation_vector.kxkykz [-0.75 0.75 -0.75] Since it is semantically equivalent to the preceding one, it should be acceptable to transform it to the same CIF-JSON representation. But suppose we instead we translate it to this:
> "_parent_propagation_vector.id": "k1" ... then we have just the kind of ambiguity I've been going on about. I am proposing that we allow only the first form.
> "_parent_propagation_vector.kxkykz": [-0.75 0.75 -0.75]
I like John's elegant proposal. All items as arrays, single or not. It's very clever. Clears up the loop names issue, results in very efficient processing.
Bob
_______________________________________________ cif-developers mailing list cif-developers@iucr.org http://mailman.iucr.org/cgi-bin/mailman/listinfo/cif-developers
Reply to: [list | sender only]
- References:
- CIF-JSON draft 2017-05-08 (James Hester)
- RE: CIF-JSON draft 2017-05-08 (Bollinger, John C)
- Re: CIF-JSON draft 2017-05-08 (Robert Hanson)
- RE: CIF-JSON draft 2017-05-08 (Bollinger, John C)
- Prev by Date: RE: CIF-JSON draft 2017-05-08
- Next by Date: Re: CIF-JSON draft 2017-05-08
- Prev by thread: Re: CIF-JSON draft 2017-05-08
- Next by thread: CIF-JSON new draft
- Index(es):