[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Reply to: [list | sender only]
Re: [ddlm-group] [THREAD 4] UTF8
- To: Group finalising DDLm and associated dictionaries <ddlm-group@iucr.org>
- Subject: Re: [ddlm-group] [THREAD 4] UTF8
- From: James Hester <jamesrhester@gmail.com>
- Date: Fri, 16 Oct 2009 14:35:05 +0300
- In-Reply-To: <20091013055314.F86319@epsilon.pair.com>
- References: <C6F976F1.1206C%nick@csse.uwa.edu.au><504270.84370.qm@web87013.mail.ird.yahoo.com><20091013055314.F86319@epsilon.pair.com>
Some thoughts on the first part of Herbert's proposals: Herbert proposes: C1: that the character set for a "new cif" be unicode, and C2: that the default encoding be UTF-8; and C3: that other encodings be permitted as an optional system-dependent feature when an explicit encoding has been specified by C3.1: a unicode BOM (byte-order-mark) (see http://en.wikipedia.org/wiki/Byte-order_mark) has been introduced into a character stream, or C3.2. the first or second line being a comment of the form: # -*- coding: <encoding-name> -*- as recognized by GNU Emacs, or C3.3. the first or second line being a comment of the form: # vim:fileencoding=<encoding-name> as recognized by Bram Moolenaar's VIM (see section 2.1.4 of http://docs.python.org/reference/lexical_analysis.html for a more information). (James again:) I agree with C1 and C2. Regarding C3, I don't see the need for other encodings at all. Furthermore, I want to run screaming from the room when I see the words 'system dependent'. As a file transfer standard, we care most about the (possibly different) sending and receiving systems agreeing on the contents, and so 'system-dependent' is completely unacceptable. In contrast to CIF, system-independence is a lower priority for a programming language, as a programmer who does not wish to distribute their program widely can usefully take advantage of system-dependent features. _______________________________________________ ddlm-group mailing list ddlm-group@iucr.org http://scripts.iucr.org/mailman/listinfo/ddlm-group
Reply to: [list | sender only]
- Follow-Ups:
- Re: [ddlm-group] [THREAD 4] UTF8 (Herbert J. Bernstein)
- References:
- Re: [ddlm-group] [THREAD 4] UTF8 (Nick Spadaccini)
- Re: [ddlm-group] [THREAD 4] UTF8 (SIMON WESTRIP)
- Re: [ddlm-group] [THREAD 4] UTF8 (Herbert J. Bernstein)
- Prev by Date: Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings.
- Next by Date: Re: [ddlm-group] THREAD 3: The alphabet of non-delimited strings.
- Prev by thread: Re: [ddlm-group] [THREAD 4] UTF8
- Next by thread: Re: [ddlm-group] [THREAD 4] UTF8
- Index(es):