[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Reply to: [list | sender only]
Re: [ddlm-group] Recommended character set and use restrictions. .
- To: Group finalising DDLm and associated dictionaries <[email protected]>
- Subject: Re: [ddlm-group] Recommended character set and use restrictions. .
- From: David Brown <[email protected]>
- Date: Mon, 21 Jun 2010 11:27:46 -0400
- In-Reply-To: <8F77913624F7524AACD2A92EAF3BFA54166122951A@SJMEMXMBS11.stjude.sjcrh.local>
- References: <[email protected]> <8F77913624F7524AACD2A92EAF3BFA541661229515@SJMEMXMBS11.stjude.sjcrh.local > <[email protected]> <8F77913624F7524AACD2A92EAF3BFA541661229518@SJMEMXMBS11.stjude.sjcrh.local > <8F77913624F7524AACD2A92EAF3BFA541661229519@SJMEMXMBS11.stjude.sjcrh.local > <a0624080ac84197c8f154@[192.168.2.104]><8F77913624F7524AACD2A92EAF3BFA54166122951A@SJMEMXMBS11.stjude.sjcrh.local>
|
I can see the advantages of using Unicode in data values where one
may wish to render text is some non-ascii formmat, but is there any
reason why data names should not be restricted (at least for the
forseeable future) to ASCII characters? These names are assigned by
COMCIFS and we are in no real danger of running out of ASCII data
names. One day we may need to write our dictionaries in Arabic, but I
doubt that any of us will be around wheb that happens. If we only
allowed non-ASCII characters in delimited strings we would meet all the
needs of the community for many years to come, and save ourselves a lot
of grief trying to sort out which code points to allow. David Bollinger, John C wrote: On Friday, June 18, 2010 5:08 PM, Herbert J. Bernstein wrote:The current specification in in terms of what is included, rather than what is excluded: #x9 #xA #xD #x20 - #xD7FF #xE000 - #xFFFD #x10000 - #x10FFFYes, but do note that the spec contains an apparent typo in the last line: the upper bound should almost certainly be U+10FFFF, the upper bound of the Unicode code point range. There are already many Unicode characters assigned to code points greater than U+10FFF (roughly 20% of all assignments). Also, XML, which the CIF spec references as its guide in this regard, allows characters up to U+10FFFF. [...]I think John is proposing that the included set become: #x9 #xA #xD #x20 - #x7E #xA0 - #xD7FF #xE000 - #xFDCF #xFDF0 - #xFFFD #x10000 - #x10FFD |
begin:vcard fn:I.David Brown n:Brown;I.David org:McMaster University;Brockhouse Institute for Materials Research adr:;;King St. W;Hamilton;Ontario;L8S 4M1;Canada email;internet:[email protected] title:Professor Emeritus tel;work:+905 525 9140 x 24710 tel;fax:+905 521 2773 version:2.1 end:vcard
_______________________________________________ ddlm-group mailing list [email protected] http://scripts.iucr.org/mailman/listinfo/ddlm-group
Reply to: [list | sender only]
- Follow-Ups:
- Re: [ddlm-group] Recommended character set and use restrictions. . (Bollinger, John C)
- References:
- [ddlm-group] Vote on BOM (James Hester)
- Re: [ddlm-group] Vote on BOM (SIMON WESTRIP)
- Re: [ddlm-group] Recommended character set and use restrictions (Herbert J. Bernstein)
- Re: [ddlm-group] Recommended character set and use restrictions. . (Bollinger, John C)
- Prev by Date: Re: [ddlm-group] Recommended character set and use restrictions. .
- Next by Date: Re: [ddlm-group] Recommended character set and use restrictions. .
- Prev by thread: Re: [ddlm-group] Recommended character set and use restrictions. .
- Next by thread: Re: [ddlm-group] Recommended character set and use restrictions. .
- Index(es):

