Extending the Lore Documentation System

  1. Overview
  2. Inputs and Outputs
  3. Other Uses for Lore Extensions

Overview

The Lore Documentation System, out of the box, is specialized for documenting Twisted. Its markup includes CSS classes for Python, HTML, filenames, and other Twisted-focused categories. But don't think this means Lore can't be used for other documentation tasks! Lore is designed to allow extensions, giving any Python programmer the ability to customize Lore for documenting almost anything.

There are several reasons why you would want to extend Lore. You may want to attach file formats Lore does not understand to your documentation. You may want to create callouts that have special meanings to the reader, to give a memorable appearance to text such as, WARNING: This software was written by a frothing madman! You may want to create color-coding for a different programming language, or you may find that Lore does not provide you with enough structure to mark your document up completely. All of these situations can be solved by creating an extension.

Inputs and Outputs

Lore works by reading the HTML source of your document, and producing whatever output the user specifies on the command line. If the HTML document is well-formed XML that meets a certain minimum standard, Lore will be able to to produce some output. All Lore extensions will be written to redefine the input, and most will redefine the output in some way. The name of the default input is lore. When you write your extension, you will come up with a new name for your input, telling Lore what rules to use to process the file.

Lore can produce XHTML, LaTeX, and DocBook document formats, which can be displayed directly if you have a user agent capable of viewing them, or processed into a third form such as PostScript or PDF. Another output is called lint, after the static-checking utility for C, and is used for the same reason: to statically check input files for problems. The lint output is just a stream of error messages, not a formatted document, but is important because it gives users the ability to validate their input before trying to process it. For the first example, the only output we will be concerned with is LaTeX.

Creating New Inputs

Create a new input to tell Lore that your document is marked up differently from a vanilla Lore document. This gives you the power to define a new tag class, for example:

<p>The Frabjulon <span class="productname">Limpet 2000</span>
is the <span class="marketinglie">industry-leading</span> aquatic 
mollusc counter, bar none.</p>

The above HTML is an instance of a new input to Lore, which we will call MyHTML, to differentiate it from the lore input. We want it to have the following markup:

Note that I chose class names that are valid Python identifiers. You will see why shortly. To get these two effects in Lore's HTML output, all we have to do is create a cascading stylesheet (CSS), and use it in the Lore XHTML Template. However, we also want these effects to work in LaTeX, and we want the output of lint to produce no warnings when it sees lines with these 2 classes. To make LaTeX and lint work, we start by creating a plugin.

register('MyHTML', "myhtml.factory", description="My Lore Plugin",
         type="lore", tapname="myhtml")
Listing 1: The Plugin File - listings/lore/plugins.tml

Name this file plugins.tml, and put it in a new package directory named myhtml. Also create an __init__.py file in your new package. Note that the __init__ file can contain nothing but a doc string, but it must exist. If __init__.py is empty, you will have problems with certain unzip programs that don't extract empty files.

The combination of plugin file and __init__ file causes the new package to be treated like a Twisted plugin, one that Lore knows how to make use of. The first three arguments to register() are the human-readable name, module name, and description of your plugin. The type parameter makes this plugin visible to the Lore system (rather than some other part of Twisted). The tapname parameter is an arbitrary filename with no extension; by convention you should use the lowercase version of your first argument (the human-readable name). Users of your extension will pass this argument to lore with the --input parameter on the command line. (For more details on plugins, read Writing a New Plug-In for Twisted.)

Let's look at that module name more closely: myhtml.factory. We will be creating this file next, in the package named myhtml. The purpose of the factory module is to tell Lore how to process your input.

from twisted.lore import default
import spitters

class MyProcessingFunctionFactory(default.ProcessingFunctionFactory):
    latexSpitters={None: spitters.MyLatexSpitter,
                   }

# initialize the global variable factory with an instance of your new factory
factory=MyProcessingFunctionFactory()
Listing 2: The Input Factory - listings/lore/factory.py-1

In Listing 2, we create a subclass of ProcessingFunctionFactory. This class provides a hook for you, a class variable named latexSpitters. This variable tells Lore what new class will be generating LaTeX from your input format. We redefine latexSpitters to MyLatexSpitter in the subclass because this class knows what to do with the new input we have already defined. Last, you must define the module-level variable factory. It should be an instance with the same interface as ProcessingFunctionFactory (e.g. an instance of a subclass, in this case, MyProcessingFunctionFactory).

Now let's actually write some code to generate the LaTeX. Doing this requires at least a familiarity with the LaTeX language. Search Google for latex tutorial and you will find any number of useful LaTeX resources.

from twisted.lore import latex
from twisted.lore.latex import processFile
import os.path

class MyLatexSpitter(latex.LatexSpitter):
    def visitNode_span_productname(self, node):
        # start an underline section in LaTeX
        self.writer('\\underline{')
        # process the node and its children
        self.visitNodeDefault(node)
        # end the underline block
        self.writer('}')

    def visitNode_span_marketinglie(self, node):
        # this example turns on more than one LaTeX effect at once
        self.writer('\\begin{bf}\\begin{Large}')
        self.visitNodeDefault(node)
        self.writer('\\end{Large}\\end{bf}')
Listing 3: spitters.py - listings/lore/spitters.py-1

The method visitNode_span_productname is our handler for <span> tags with the class="productname" identifier. Lore knows to try methods visitNode_span_* and visitNode_div_* whenever it encounters a new class in one of these tags. This is why the class names have to be valid Python identifiers.

Now let's see what Lore does with these new classes with the following input file:

<html>
  <head>
    <title>My First Example</title>
  </head>
  <body>
    <h1>My First Example</h1>
    <p>The Frabjulon <span class="productname">Limpet 2000</span>
    is the <span class="marketinglie">industry-leading</span> aquatic 
    mollusc counter, bar none.</p>
  </body>
</html>

Listing 4: 1st_example.html - listings/lore/1st_example.html

First, verify that your package is laid out correctly. Your directory structure should look like this:

1st_example.html
myhtml/
       __init__.py
       factory.py
       plugins.tml
       spitters.py

In the parent directory of myhtml (that is, myhtml/..), run lore and pdflatex on the input:

$ lore --input myhtml --output latex 1st_example.html 
[########################################] (*Done*)

$ pdflatex 1st_example.tex
[ . . . latex output omitted for brevity . . . ]
Output written on 1st_example.pdf (1 page, 22260 bytes).
Transcript written on 1st_example.log.

And here's what the rendered PDF looks like:

What happens when we run lore on this file using the lint output?

$ lore --input myhtml --output lint 1st_example.html
1st_example.html:7:47: unknown class productname
1st_example.html:8:38: unknown class marketinglie
[########################################] (*Done*)

Lint reports these classes as errors, even though our spitter knows how to process them. To fix this problem, we must add to factory.py.

from twisted.lore import default
import spitters

class MyProcessingFunctionFactory(default.ProcessingFunctionFactory):
    latexSpitters={None: spitters.MyLatexSpitter,
                   }

    # redefine getLintChecker to validate our classes
    def getLintChecker(self):
        # use the default checker from parent
        checker = lint.getDefaultChecker()
        checker.allowedClasses = checker.allowedClasses.copy()
        oldSpan = checker.allowedClasses['span']
        checkfunc=lambda cl: oldSpan(cl) or cl in ['marketinglie',
                                                   'productname']
        checker.allowedClasses['span'] = checkfunc
        return checker

# initialize the global variable factory with an instance of your new factory
factory=MyProcessingFunctionFactory()
Listing 5: Input Factory with Lint Support - listings/lore/factory.py-2

The method getLintChecker is called by Lore to produce the lint output. This modification adds our classes to the list of classes lint ignores:

$ lore --input myhtml --output lint 1st_example.html
[########################################] (*Done*)
$ # Hooray!

Finally, there are two other sub-outputs of LaTeX, for a total of three different ways that Lore can produce LaTeX: the default way, which produces as output an entire, self-contained LaTeX document; with --config section on the command line, which produces a LaTeX \section; and with --config chapter, which produces a LaTeX \chapter. To support these options as well, the solution is to make the new spitter class a mixin, and use it with the SectionLatexSpitter and ChapterLatexSpitter, respectively. Comments in the following listings tell you everything you need to know about making these simple changes:

Creating New Outputs

write some stuff

Other Uses for Lore Extensions

write some stuff

Color-Code Programming Languages

write some stuff

Add New Structural Elements

write some stuff

Support New File Formats

write some stuff

Index

Version: 1.3.0