Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings.

To: Nick.Spadaccini@uwa.edu.au, Group finalising DDLm and associated dictionaries <ddlm-group@iucr.org>
Subject: Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings.
From: James Hester <jamesrhester@gmail.com>
Date: Thu, 15 Oct 2009 11:10:41 +0300
In-Reply-To: <C6FAA2E0.12093%nick@csse.uwa.edu.au>
References: <279aad2a0910130536u346b90f0g4949108ee20f959a@mail.gmail.com><C6FAA2E0.12093%nick@csse.uwa.edu.au>

Regarding whitespace:

1. Nick detects a contradiction between Simon saying, on the one hand,
that CIF files are rarely read directly by humans, and me insisting, on
the other, that they remain readable.  I agree that there is not much point
trying to read and/or edit a 500K mmCIF file directly.  But let us not
forget the small molecule people.  For example, a CIF delivered by the
Powder Data Base can be only 10 lines long, and eminently
readable/cut-and-pastable in any text editor.  In addition, at least
one and probably more of my instrument scientist colleagues routinely
look over raw CIF files in the course of preparing publications and
checking other people's work.  I believe the differing perspectives
here are more to do with the different areas in which Simon and I
encounter CIF files.

2. If all we are concerned about is simplifying the formal syntax,
then that was already achieved when we agreed to remove delimiters
from within delimited strings.  The present discussion is exactly
equivalent to deciding on using either "<whitespace>?" or
"<whitespace>+" in the grammar description.  After a review of the
formal 1.1 spec, I see no other opportunities for simplification
arising from making whitespace optional.  So I ask once again, what
other benefits are claimed for making whitespace optional, beyond
changing a plus sign to a question mark in the specification?
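
To make the "+" versus "?" comparison concrete, here is a minimal
sketch in Python.  It is an illustration only, not the CIF grammar:
VALUE is a hypothetical, simplified stand-in for a quote-delimited
string with no embedded delimiter, and WS is assumed to be a run of
spaces, tabs and newlines.  The two patterns differ only in whether
the separator between values is mandatory.

```python
import re

# Assumption: a simplified delimited value, with no delimiter allowed inside.
VALUE = r"'[^'\n]*'"
# Assumption: whitespace is a run of spaces, tabs or newlines.
WS = r"[ \t\n]+"

# Whitespace required between consecutive values.
REQUIRED = re.compile(rf"{VALUE}(?:{WS}{VALUE})*")
# Whitespace optional between consecutive values.
OPTIONAL = re.compile(rf"{VALUE}(?:(?:{WS})?{VALUE})*")


def accepts(pattern, text):
    """Return True if the whole text is a valid sequence of values."""
    return pattern.fullmatch(text) is not None


if __name__ == "__main__":
    for text in ("'a' 'b'", "'a''b'"):
        print(repr(text), accepts(REQUIRED, text), accepts(OPTIONAL, text))
```

Under these assumptions both patterns accept 'a' 'b', and only the
optional-whitespace pattern accepts 'a''b'.  That single extra class
of accepted input is the whole difference under discussion.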

On 10/13/09, Nick Spadaccini <nick@csse.uwa.edu.au> wrote:
> There is a difference between insisting in a formal grammar that a value
> token is treated differently at one level than it is at another level, as
> opposed to requiring CIF writers to pad whitespace between value tokens at
> one level, but not at another level.
>
> My reading of the previous mail was that the balance of opinion was to
> formally terminate with the single token (irrespective of whitespace) and
> then requiring/asking/pleading/whatever-verb writers to pad tokens, which
> they and we all do anyway. I repeat again the formal specification of the
> language needs to be strict and consistent (Brian's maximally disruptive),
> and the parsers can be more loosely (deprecatingly?) implemented.
>
> However I detect a certain level of inconsistency in arguments here. What
> does "human readability" have to do with it? We just had a discussion on
> UTF-8 where it was argued in the near future no-one is going to be
> vim-ing/emacs-ing/grep-ping these files and it will all be driven by
> applications. What happened to human readability then?
>
>
> On 13/10/09 8:36 PM, "James Hester" <jamesrhester@gmail.com> wrote:
>
>> I, for one, do not agree with dropping the requirement for whitespace
>> between tokens outside compound structures.  Is the only justification
>> avoiding a second production rule in the formal grammar?  I would like
>> to think we are getting more than this in return for sacrificing human
>> readability: see previous email somewhere long ago in this thread.
>
> cheers
>
> Nick
>
> --------------------------------
> Associate Professor N. Spadaccini, PhD
> School of Computer Science & Software Engineering
>
> The University of Western Australia    t: +61 (0)8 6488 3452
> 35 Stirling Highway                    f: +61 (0)8 6488 1089
> CRAWLEY, Perth,  WA  6009 AUSTRALIA   w3: www.csse.uwa.edu.au/~nick
> MBDP  M002
>
> CRICOS Provider Code: 00126G
>
> e: Nick.Spadaccini@uwa.edu.au
>
>
>
>
>
> _______________________________________________
> ddlm-group mailing list
> ddlm-group@iucr.org
> http://scripts.iucr.org/mailman/listinfo/ddlm-group
>

-- 
T +61 (02) 9717 9907
F +61 (02) 9717 3145
M +61 (04) 0249 4148
