Words and spaces

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

Words and spaces

Chris Lilley

Hello public-tt,

In section 8.3.7 <flowFunction>

 The dynamic flow unit word must be interpreted as being dependent upon
 the language or writing system of the affected content. If the language
 or writing system is unknown or unspecified, then word is interpreted
 as follows:

   1. If the affected content consists solely or mostly of Unified CJK
   Ideographic characters or of characters of another Unicode character
   block that are afforded similar treatment to that of Unified CJK
   Ideographic characters, then word is to be interpreted as if
   character were specified.
   
   2. Otherwise, word is to be interpreted as denoting a sequence of one
   or more characters that are not interpreted as an XML whitespace
   character.

Noting the "must" which is a testable conformance requirement, do the
following paragraphs contain one word or two?

<p>Hello&#x3000;World</p>
<p xml:lang="en">Hello&#x3000;World</p>
<p xml:lang="en">Hello&#x2004;World</p>
<p xml:lang="ja">Hello&#x3000;World</p>
<p xml:lang="ja">Hello&#x2004;World</p>
<p xml:lang="ja">Masayasu Ishikawa</p>

For a list of Unicode space characters, see for example
http://www.cs.tut.fi/~jkorpela/chars/spaces.html


--
 Chris Lilley                    mailto:[hidden email]
 Interaction Domain Leader
 Co-Chair, W3C SVG Working Group
 W3C Graphics Activity Lead
 Co-Chair, W3C Hypertext CG