Your browser may have trouble rendering this page. See supported browsers for more information.

|<<>>|153 of 208 Show listMobile Mode

Lesser Known Punctuation

Published by marco on

Updated by marco on

The Trouble With EM ‘n EN… (A List Apart) discusses how to create proper punctuation characters in online documents—specifically HTML. Of most interest are the rules of use for “em” and “en” dashes:

em dash
“…used to indicate a sudden break in thought, a parenthetical statement that deserves more attention than parentheses indicate, or instead of a colon or semicolon to link clauses.”
en dash
“used to indicate a range of just about anything with numbers … also used instead of the word “to” or a hyphen to indicate a connection between things … (such as the New York–Boston commuter train) … [or] used to hyphenate compounds of compounds, where at least one pair is already hyphenated (as in ‘Netscape 6.1 is an Open-Source–based browser.’)”

The article also includes rules for all the different hyphens, quotes and spaces available (15 space characters in all!). Though these characters are all found in Unicode, most are not defined in HTML and there isn’t much else to say or do about them when typesetting in HTML.

More interesting is the section on quotes, which gives the decimal codes for curly quote characters as well as rules for using them properly. The <q> tag, though defined since early versions of HTML, still doesn’t work reliably in any modern browser. Browsers that support CSS-added content may set up rules to generate the correct quote characters when needed (not supported in IE 6 or 7).

Even the oft-misused ellipses character gets the full treatment: in most cases, it should accompany a full stop (period) character and placement depends on the intent being conveyed.

The earthli Munger

Text formatting at earthli[1] supports automatic conversion for both en- and em-dashes, as well as the ellipses character, since version 2.7.1 released earlier this year. Automatic replacement of single and double-quote characters is not yet supported, but may follow in a subsequent release as a reliable algorithm has recently been published at Smart Quotes. The algorithm there is 20 years old and was first implemented on the Macintosh and implemented in old stand-bys like WriteNow. It’s a bit smarter than that found in more modern word processors in that it avoids converting to curly quotes when inch or feet marks were intended.

[1] See HTML_MUNGER in the documentation for more details