Scala Language Reference as Pandoc Markdown - Notes

Prerequisites

In order to build the scala reference, you will require the following software packages:

Pandoc v1.10.1 or higher (http://johnmacfarlane.net/pandoc/)
TeX-Live (https://www.tug.org/texlive/)
The luximono font - this does not ship with TeX-Live by default due to license restrictions, but it can be easily installed using the "getnonfreefonts" script. A short guide on using this to get luximono can be found on the TeX Stackexchange here.
The Heuristica font - this is an extension of the free version of the Adobe Utopia font. This must be installed as a system font for the PDF to build, and you can find the appropriate font package for your system here: https://code.google.com/p/evristika/

General Advice for editors

All files must be saved as UTF-8: ensure your editors are configured appropriately.
Leave two empty lines between each section, regardless of level of nesting. Leave two empty lines at the end of every markdown file that forms a part of the main specification when compiled.
Use of the appropriate unicode characters instead of the latex modifiers for accents, etc. is necessary. For example, é instead of 'e. Make use of the fact that the content is unicode, google the necessary characters if you don't know how to type them directly.

Conversion from LaTeX - Guidelines

Chapter conversion Checklist

Convert all \section{...}
Convert all \subsection{...}
Convert all \subsubsection{...}
Convert all {\em ...}
Convert all \lstlisting
Convert all \lstinline
Convert all \code
Convert all \sref{sec:...}
Convert all \begin{itemize}
Convert all \begin{enumerate}
Convert all \example
Convert all \footnote
Convert all \paragraph
Convert all \begin{quote}
Delete all \comment{...}
Convert all single quote pairs
Convert all double quote pairs
Look for manually defined enumerated lists (1. 2. 3. etc)
Remove %@M comments
Convert all extra macros (\commadots, etc)

Code

Code blocks using the listings package of form

\begin{lstlisting}
val x = 1
val y = x + 1
x + y
\end{lstlisting}

can be replaced with pandoc code blocks of form

~~~~~~~~~~~~~~{#ref-identifier .scala .numberLines}
val x = 1
val y = x + 1
x + y
~~~~~~~~~~~~~~

Where #ref-identifier is an identifier that can be used for producing links to the code block, while .scala and .numberLines are classes that get applied to the code block for formatting purposes. At present we propose to use the following classes:

.scala for scala code.
.grammar for EBNF grammars.

It is important to note that while math mode is supported in pandoc markdown using the usual LaTeX convention, i.e. $x^y + z$, this does not work within code blocks. In most cases the usages of math mode I have seen within code blocks are easily replaced with unicode character equivalents. If a more complex solution is required this will be investigated at a later stage.

Inline Code

Inline code, usually ~\lstinline@...some code...@ can be replaced with the pandoc equivalent of

`...some code...`{<type>}

where <type> is one of the classes representing the language of the code fragment.

Definitions

Pandoc supports definition lists, however these do not seem to be a good semantic match for the numbered definitions in the reference. The only compromise came up with here was to treat definitions like quotations:

> **Definition**
> Let $C$ be a class with template ...

Macro replacements:

While MathJAX just support LaTeX style command definition, it is recommended to not use this as it will likely cause issues with preparing the document for PDF or ebook distribution.
\SS (which I could not find defined within the latex source) seems to be closest to \mathscr{S}
\TYPE is equivalent to `\boldsymbol{type}'
As MathJAX has no support for slanted font (latex command \sl), so in all instances this should be replaced with \mathit{}
The macro \U{ABCD} used for unicode character references can be replaced with \uABCD.
The macro \URange{ABCD}{DCBA} used for unicode character ranges can be replaced with \uABCD-\uDBCA.
The macro \commadots can be replaced with , … ,.
There is no adequate replacement for \textsc{...} (small caps) in pandoc markdown. While unicode contains a number of small capital letters, it is notably missing Q and X as these glyphs are intended for phonetic spelling, therefore these cannot be reliably used. For now, the best option is to use underscore emphasis and capitalise the text manually, _LIKE THIS_.
\code{...} can be replaced with standard in-line verbatim markdown, `like this`.
\paragraph (typically used for a non-numbered header) can be replaced by a hard line break, which is a \ followed immediately by a newline.
\TODO can be replaced by a markdown comment

Unicode Character replacements

The unicode left and right single quotation marks (‘ and ’) have been used in place of ` and ', where the quotation marks are intended to be paired. These can be typed on a mac using Option+] for a left quote and Option+Shift+] for the right quote.
Similarly for left and right double quotation marks (“ and ”) in place of ". These can be typed on a mac using Option+[ and Option+Shift+].

Enumerations

Latex enumerations can be replaced with markdown ordered lists, which have syntax

#. first entry
#. ...
#. last entry

Finding rendering errors

MathJAX errors will appear within the rendered DOM as span elements with class mtext and style attribute color: red applied.

folone / scala-ref-markdown