- Commit
- c818df9888d452f0ae54b3a504eefdd970fd73d8
- Parent
- 48f0df56d24de06343bd6a52167f3f210c6c101d
- Author
- John MacFarlane <jgm@berkeley.edu>
- Date
Spec: say explicitly that a character is a unicode code point.
My personal build of CMark ✏️
Spec: say explicitly that a character is a unicode code point.
1 file changed, 3 insertions, 3 deletions
Status | File Name | N° Changes | Insertions | Deletions |
Modified | spec.txt | 6 | 3 | 3 |
diff --git a/spec.txt b/spec.txt @@ -192,10 +192,10 @@ In the examples, the `→` character is used to represent tabs. # Preprocessing A [line](#line) <a id="line"></a> -is a sequence of zero or more characters followed by a line -ending (CR, LF, or CRLF) or by the end of -file. +is a sequence of zero or more [characters](#character) followed by a +line ending (CR, LF, or CRLF) or by the end of file. +A [character](#character)<a id="character"></a> is a unicode code point. This spec does not specify an encoding; it thinks of lines as composed of characters rather than bytes. A conforming parser may be limited to a certain encoding.