[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Reply to: [list | sender only]
Re: [ddlm-group] Relationship asmong CIF2, STAR, CIF1 and Python
- To: Group finalising DDLm and associated dictionaries <[email protected]>
- Subject: Re: [ddlm-group] Relationship asmong CIF2, STAR, CIF1 and Python
- From: "Herbert J. Bernstein" <[email protected]>
- Date: Tue, 18 Jan 2011 05:44:09 -0500 (EST)
- In-Reply-To: <[email protected]>
- References: <[email protected]><[email protected]><[email protected]><[email protected]><8F77913624F7524AACD2A92EAF3BFA54166D7D1EA8@SJMEMXMBS11.stjude.sjcrh.local><[email protected]><[email protected]><[email protected]><[email protected]><[email protected]><[email protected]><[email protected]><[email protected]>
I respectfully disagree. I will wait on (re)explaining in
more detail until we settle on our goals, at which point
I suspect much of this discussion will become moot.
=====================================================
Herbert J. Bernstein, Professor of Computer Science
Dowling College, Kramer Science Center, KSC 121
Idle Hour Blvd, Oakdale, NY, 11769
+1-631-244-3035
[email protected]
=====================================================
On Tue, 18 Jan 2011, James Hester wrote:
> I strongly reject Herbert's suggestion for a zero-based review or for
> an intermediate "DDLm-2011" version that will help us iron out
> supposed problems. Nick, Syd and Ian developed a coherent, working
> system 10 years ago, including rewriting the core dictionary in DDLm
> and developing attendant software. Everything worked as promised. I
> was shown this system in Florence in 2005, then asked to review it in
> 2007, in the process producing my own proof-of-concept system which
> operated in a different way to Nick's using the same dictionaries. A
> number of modifications resulted from my review. David Brown has
> produced a draft coreCIF dictionary written in DDLm and has not
> encountered any show-stopping issues. In other words, this system has
> been reviewed and tested extensively and to propose yet another review
> shows a cavalier disregard for the amount of time that has already
> been put into DDLm. The system works.
>
> Furthermore, we do not need a DDLm to DDL1/2 translator, as pointed
> out tirelessly by David Brown last year and this year. The alias
> mechanism used in DDL2 is adequate to the task.
>
> A CIF1 to CIF2 data converter is trivial, involving at most reading in
> a file as CIF1 and then outputting it as CIF2.
>
> In short, I have yet to see any justification for returning CIF2 to committee.
>
> James.
> On Sun, Jan 16, 2011 at 12:30 AM, Herbert J. Bernstein
> <[email protected]> wrote:
>> Dear Colleagues,
>>
>> � I have read both Brian's latest messages, and find them very helpful.
>> It now appears that we have several distinct, but related languages to
>> understand, define or refine (please pardon the notational changes)
>>
>> � STAR1
>> � CIF1
>> � dREL
>> � DDLm
>> � STAR2
>> � CIF2
>>
>> in such a way that the existing bases of data and software for STAR1 and
>> CIF1 can, in some sense be brought forward into a world of STAR2 and CIF2
>> and in which CIF2 works "well" with dREL and DDLm. �I think (or at least
>> hope) that we agree on that objective.
>>
>> The point at which I disagree with some of Brian's remarks is whether it
>> is best to go forward from this point with a bias towards accepting the
>> changes to make CIF2 that were previously agreed, or whether it is best to
>> go forward from this point without such a presumption, reviewing the
>> entire stucture with an eye towards best functionality. �My view does not
>> mean we ignore what has been done. �Some or all of the earlier decisions
>> may well turn out to be the best final decisions, but they may well turn
>> out not to be. �We won't know until and unless we make such a zero-based
>> review.
>>
>> Certainly we have a lot to gain in that process from working with what has
>> been done. �I use James's parser and it is very useful, but I think, in
>> view the fact that there will be major changes to exsiting datasets and
>> aoftware involved in going to the currently proposed version of CIF2, we
>> have an obligation to try to make that part of the overall transition as
>> close to "right" as we can make it.
>>
>> On the other-hand, the DDLm-based dictionaries themselves impact a much
>> smaller community, provided we adopt one very important dictum, the one
>> already posted on the IUCr web site:
>>
>> "No changes are required in exsiting archival data files in order to apply
>> domain dictionaries written in DDLm"
>>
>> We need to have a chunk of software that will allow exsiting coreCIF and
>> mmCIF and imgCIF and other CIF1 data files to be validated using DDLm/dREL
>> based dictionaries. �There are many possible ways in which to skin that
>> particular cat:
>>
>> 1. �We can make DDLm to DDL1 and DDL2 dictionary converters
>>
>> 2. �We can make CIF1 to CIF2 data converters
>>
>> 3. �We can make APIs that will allow exsiting CIF1-based applications to
>> become CIF2 and DDLm-aware.
>>
>> ...
>>
>> I hope and expect that we will do all of the above, but item 1, while the
>> least powerful, is a critical necessity to truly meet the promise on the
>> IUCr web page, and, because a DDLm dictionary will be in a strong sense a
>> CIF2 document, will help us to really work with CIF2 and get it right.
>>
>> Therefore, I suggest we _not_ put CIF2 forward for general use at this
>> time, but try to pull together just enough of CIF2, under the name
>> DDLm-2011 to be able to prototype the first round of DDLm dictionaries
>> along with a DDLm-2011 to DDL1 translator and a DDLm-2011 to DDL2
>> translator and explcitly tell the commnity that the DDLm-2011 format is
>> _not_ yet recommended for general use for data files because it is subject
>> to possibly significant changes in the future. �This will help us to gain
>> experience with a CIF2-candidate within a limited community and to try to
>> get it "right," but will also allow the entire community to start gaining
>> a benefit from the work done thus far without a major conversion of
>> existing data sets to a format that seems highly likely to change.
>>
>> Regards,
>> � � Herbert
>>
>>
>>
>> =====================================================
>> �Herbert J. Bernstein, Professor of Computer Science
>> � �Dowling College, Kramer Science Center, KSC 121
>> � � � � Idle Hour Blvd, Oakdale, NY, 11769
>>
>> � � � � � � � � �+1-631-244-3035
>> � � � � � � � � �[email protected]
>> =====================================================
>>
>> On Sat, 15 Jan 2011, Brian McMahon wrote:
>>
>>> OK, the nature of my particular misunderstanding about the STAR/CIF
>>> relationship that came to light in our offline discussions is roughly
>>> the following:
>>>
>>> CIF1 is essentially a proper subset of the STAR format published as:
>>> �Hall, S. R. (1991). The STAR File: a new format for electronic data
>>> � � �transfer and archiving. J. Chem. Inf. Comput. Sci. 31, 326-333;
>>> �Hall, S. R. & Spadaccini, N. (1994). The STAR File: detailed
>>> � � �specifications. J. Chem. Inf. Comput. Sci. 34, 505-508
>>> and described in detail in Chapter 2.1 of International Tables Volume G.
>>>
>>> This version of STAR is used in the molecular information file, also
>>> documented in Volume G ("used" is probably overstating the case; the
>>> only application I know that outputs MIF content is the CCDC, which
>>> uses tokens from the MIF and CIF core dictionaries but ignores
>>> saveframe pointers and nested loops to create files that are
>>> syntactically perfectly valid CIFs). There is also nmrSTAR used
>>> extensively by BioMagResBank that has supporting libraries and database
>>> applications. Also some small-scale experiments in the botanical field
>>> (Syd's association with FloraBase) and a couple of demonstrator
>>> applications that, so far as I am aware, were never developed (e.g.
>>> in quantum chemistry).
>>>
>>> In prototyping dREL and a DDLm, Syd, Nick and Ian Castleden made
>>> ad hoc changes to the STAR syntax to get a workable implementation.
>>> (Since their prototyping engine used Jython, they achieved runtime
>>> efficiencies by implementing changes that were practicable with
>>> Python, echoes of which we're seeing and actively discussing today.
>>> Whether their choice was farsighted or purely accidental I don't
>>> know.) Let us call this ad hoc version STAR+1 - it was a set of
>>> practical syntactic features that would be used mostly in dREL methods
>>> but also in proto-DDLm dictionaries, proto-dREL and appropriately
>>> modified data files to test the novel methods approach. Most of this
>>> work dates back about 10 years. The syntactic changes were not
>>> formally published - they were practical "work in progress", though ny
>>> the end of this cycle it was conceivable that they could have been
>>> systematised and written up as a proper "STAR+1".
>>>
>>> Since COMCIFS took on the task of developing CIF2/DDLm for
>>> crystallography (i.e. the work of this group), we have discussed and
>>> agreed many further changes from the original STAR syntax, much of
>>> this with active involvement from Nick. When, some time back, Nick
>>> said (whether just to me or on the list I don't now remember) that
>>> he was focussing on writing up for publication a revised STAR paper, I
>>> took that to mean that he wanted to freeze the further modifications
>>> that had been agreed to that point as a "STAR+2". From that point I
>>> was reluctant to see CIF diverge further from the then-current syntax,
>>> and was looking forward to Nick's preprint which would document
>>> clearly what that was. I was mistaken - Nick's current project is to
>>> write up "STAR+1", leaving open the prospect of further changes to
>>> "STAR+2" as required.
>>>
>>> Note that even "STAR+1" never existed - Nick's paper will be a
>>> retrospective consolidation of one set of changes adopted for practical
>>> prototyping. In the same way, "STAR+2" need not exist until we
>>> actually have a satisfactory CIF2 format that we can retrofit -
>>> if that's actually required - to a second-generation STAR complete
>>> with saveframes and the rest. Such a "requirement", in my mind, would
>>> have to do with an actual need to retain compatibility with those
>>> other STAR applications (MIF, FloraBase etc.) that I mentioned before.
>>> Realistically, that's probably not going to happen.
>>>
>>> I think that most people on this list have been much quicker than me
>>> to see that demonstrably useful syntax changes should still be made
>>> without undue conservatism. The result is that we have been pulling
>>> together roughly in the same direction (not always *exactly* in the
>>> same direction) and have made real progress.
>>>
>>> I'm embarrassed by my misunderstanding, and were we to revisit some of
>>> our discussions I might now take another view (but only "might").
>>> But as I argue elsewhere I think we're better moving on to test the
>>> consequences of the solutions we've agreed to adopt, and being open to
>>> future revisions in the light of experience, rather than re-running
>>> past hypotheticals.
>>>
>>> Best wishes
>>> Brian
>>>
>>> On Thu, Jan 13, 2011 at 12:17:41PM -0500, Herbert J. Bernstein wrote:
>>>> James has requeested that I formally send a message to this list
>>>> about a matter discussed recently in independent email in order
>>>> to ensure a record. �At first I declined to do so, but after
>>>> reflection, I have decided to do as James has asked.
>>>>
>>>> I have withdrawn my vote in COMCIFS in support of CIF2 going
>>>> forward at this time. �I have done so because, after emails
>>>> from Nick and Brian, it has become clear to me that I was
>>>> making false assumptions about the relationship between
>>>> CIF2 and STAR. �I believe that a zero-based discussion is
>>>> now needed on what the relationship should be among CIF2,
>>>> STAR, CIF1 and Python to best serve the interests
>>>> of the crystallographic community. �I do not know what
>>>> is best and do not know how long such a discussion may take.
>>>> I leave it to James, Nick and Brian to decide if Nick's and
>>>> Brian's messages should be posted on this list for the record.
>>>>
>>>> =====================================================
>>>> � Herbert J. Bernstein, Professor of Computer Science
>>>> � � Dowling College, Kramer Science Center, KSC 121
>>>> � � � � �Idle Hour Blvd, Oakdale, NY, 11769
>>>>
>>>> � � � � � � � � � +1-631-244-3035
>>>> � � � � � � � � � [email protected]
>>>> =====================================================
>>>>
>>>> _______________________________________________
>>>> ddlm-group mailing list
>>>> [email protected]
>>>> http://scripts.iucr.org/mailman/listinfo/ddlm-group
>>> _______________________________________________
>>> ddlm-group mailing list
>>> [email protected]
>>> http://scripts.iucr.org/mailman/listinfo/ddlm-group
>>>
>> _______________________________________________
>> ddlm-group mailing list
>> [email protected]
>> http://scripts.iucr.org/mailman/listinfo/ddlm-group
>>
>
>
>
> --
> T +61 (02) 9717 9907
> F +61 (02) 9717 3145
> M +61 (04) 0249 4148
> _______________________________________________
> ddlm-group mailing list
> [email protected]
> http://scripts.iucr.org/mailman/listinfo/ddlm-group
>
_______________________________________________ ddlm-group mailing list [email protected] http://scripts.iucr.org/mailman/listinfo/ddlm-group
Reply to: [list | sender only]
- References:
- [ddlm-group] Simon's elide proposal (James Hester)
- Re: [ddlm-group] Simon's elide proposal (Herbert J. Bernstein)
- Re: [ddlm-group] Simon's elide proposal (James Hester)
- Re: [ddlm-group] Simon's elide proposal (Herbert J. Bernstein)
- Re: [ddlm-group] Simon's elide proposal (Bollinger, John C)
- Re: [ddlm-group] Simon's elide proposal (Herbert J. Bernstein)
- Re: [ddlm-group] Simon's elide proposal (Herbert J. Bernstein)
- Re: [ddlm-group] Simon's elide proposal (SIMON WESTRIP)
- Re: [ddlm-group] Simon's elide proposal (Herbert J. Bernstein)
- [ddlm-group] Relationship asmong CIF2, STAR, CIF1 and Python (Herbert J. Bernstein)
- Re: [ddlm-group] Relationship asmong CIF2, STAR, CIF1 and Python (Brian McMahon)
- Re: [ddlm-group] Relationship asmong CIF2, STAR, CIF1 and Python (Herbert J. Bernstein)
- Re: [ddlm-group] Relationship asmong CIF2, STAR, CIF1 and Python (James Hester)
- Prev by Date: Re: [ddlm-group] Relationship asmong CIF2, STAR, CIF1 and Python. .
- Next by Date: Re: [ddlm-group] Simon's elide proposal
- Prev by thread: Re: [ddlm-group] Relationship asmong CIF2, STAR, CIF1 and Python
- Next by thread: Re: [ddlm-group] Simon's elide proposal
- Index(es):

