[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Reply to: [list | sender only]
Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings.
- To: Nick.Spadaccini@uwa.edu.au, Group finalising DDLm and associated dictionaries <ddlm-group@iucr.org>
- Subject: Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings.
- From: James Hester <jamesrhester@gmail.com>
- Date: Thu, 15 Oct 2009 11:10:41 +0300
- In-Reply-To: <C6FAA2E0.12093%nick@csse.uwa.edu.au>
- References: <279aad2a0910130536u346b90f0g4949108ee20f959a@mail.gmail.com><C6FAA2E0.12093%nick@csse.uwa.edu.au>
Regarding whitespace: 1. Nick detects a contradiction with Simon on the one hand saying that CIF files aren't directly read by humans much at all, and me insisting on them remaining readable. I agree that there is not much point trying to read and/or edit a 500K mmCIF file directly. But let us not forget the small molecule people. For example, a CIF delivered by the Powder Data Base can be only 10 lines long, and eminently readable/cut-and-pastable in any text editor. In addition, at least one and probably more of my instrument scientist colleagues routinely look over raw CIF files in the course of preparing publications and checking other people's work. I believe the differing perspectives here are more to do with the different areas in which Simon and I encounter CIF files. 2. If all we are concerned about is simplifying the formal syntax, then that has been done already when we agreed on removing delimiters from within delimited strings. The present discussion is exactly equivalent to deciding on using either "<whitespace>?" or "<whitespace>+" in the grammar description. After a review of the formal 1.1 spec, I see no other opportunities for simplification arising from making whitespace optional. So I ask once again, what other benefits are claimed for making whitespace optional, beyond changing a plus sign to a question mark in the specification? On 10/13/09, Nick Spadaccini <nick@csse.uwa.edu.au> wrote: > There is a difference between insisting in a formal grammar that a value > token is treated differently at one level than it is at another level, as > opposed to requiring CIF writers to pad whitespace between value tokens at > one level, but not at another level. > > My reading of the previous mail was that the balance of opinion was to > formally terminate with the single token (irrespective of whitespace) and > then requiring/asking/pleading/whatever-verb writers to pad token, which > they and we all do anyway. I repeat again the formal specification of the > language needs to be strict and consistent (Brian's maximally disruptive), > and the parsers can be more loosely (deprecatingly?) implemented. > > However I detect a certain level of inconsistency in arguments here. What > does "human readability" have to do with it? We just had a discussion on > UTF-8 where it was argued in the near future no-one is going to be > vim-img/emacs-ing/grep-ping these files and it will all be driven by > applications. What happened to human readability then? > > > On 13/10/09 8:36 PM, "James Hester" <jamesrhester@gmail.com> wrote: > >> I, for one, do not agree with dropping the requirement for whitespace >> between tokens outside compound structures. Is the only justification >> avoiding a second production rule in the formal grammar? I would like >> to think we are getting more than this in return for sacrificing human >> readability: see previous email somewhere long ago in this thread. > > cheers > > Nick > > -------------------------------- > Associate Professor N. Spadaccini, PhD > School of Computer Science & Software Engineering > > The University of Western Australia t: +61 (0)8 6488 3452 > 35 Stirling Highway f: +61 (0)8 6488 1089 > CRAWLEY, Perth, WA 6009 AUSTRALIA w3: www.csse.uwa.edu.au/~nick > MBDP M002 > > CRICOS Provider Code: 00126G > > e: Nick.Spadaccini@uwa.edu.au > > > > > > _______________________________________________ > ddlm-group mailing list > ddlm-group@iucr.org > http://scripts.iucr.org/mailman/listinfo/ddlm-group > -- T +61 (02) 9717 9907 F +61 (02) 9717 3145 M +61 (04) 0249 4148 _______________________________________________ ddlm-group mailing list ddlm-group@iucr.org http://scripts.iucr.org/mailman/listinfo/ddlm-group
Reply to: [list | sender only]
- Follow-Ups:
- References:
- Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings. (James Hester)
- Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings. (Nick Spadaccini)
- Prev by Date: Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings.
- Next by Date: Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings.
- Prev by thread: Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings.
- Next by thread: Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings.
- Index(es):