A markdown extension for converting Leiden+ epigraphic text to TEI XML/HTML

André van Delft

Last update: Aug 4, 2021


Overview

LeidenMark

$ pip install leidenmark

A Python Markdown extension for converting Leiden+ epigraphic text to TEI XML/HTML. It is inspired by the Brill plain text (BPT) format, which aims to incorporate Leiden+ into a Markdown-based syntax.

>>> from leidenmark import leiden_plus
>>> content = """\
<D=.r<=
1. Lorem ipsum dolor
vac.1lin
2. sit amet, con[ca.3]c
3.-etur adipiscing
=>=D>
<D=.v<=
lost.2lin
6. ut labore et dol
7.-ore magna aliqua
=>=D>"""
>>> leiden_plus(content, indent=True)

The output of the above lines is the following XML snippet:

<div n="r" type="textpart">
  <ab>
    <l n="1">Lorem ipsum dolor</l>
    <space quantity="1" unit="line"/>
    <l n="2">sit amet, con<gap precision="low" quantity="3" reason="lost" unit="character"/>c</l>
    <l break="no" n="3">etur adipiscing</l>
  </ab>
</div>
<div n="v" type="textpart">
  <ab>
    <gap quantity="2" unit="line"/>
    <l n="6">ut labore et dol</l>
    <l break="no" n="7">ore magna aliqua</l>
  </ab>
</div>
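
Note that the result is an XML fragment with more than one top-level element, so a standard XML parser will reject it as-is. The following is a minimal sketch, assuming the output has exactly the shape shown above: it wraps the fragment in a temporary root element and collects the numbered lines per text part using only the standard library. The element name `root` and the helper `lines_by_textpart` are illustrative and not part of leidenmark.

```python
import xml.etree.ElementTree as ET

# Output of leiden_plus(content, indent=True), copied from the example above.
xml_fragment = """\
<div n="r" type="textpart">
  <ab>
    <l n="1">Lorem ipsum dolor</l>
    <space quantity="1" unit="line"/>
    <l n="2">sit amet, con<gap precision="low" quantity="3" reason="lost" unit="character"/>c</l>
    <l break="no" n="3">etur adipiscing</l>
  </ab>
</div>
<div n="v" type="textpart">
  <ab>
    <gap quantity="2" unit="line"/>
    <l n="6">ut labore et dol</l>
    <l break="no" n="7">ore magna aliqua</l>
  </ab>
</div>"""


def lines_by_textpart(fragment):
    """Map each textpart's n attribute to a list of (line number, text) pairs.

    Hypothetical helper for illustration; not part of leidenmark.
    """
    # Wrap in a temporary root because the fragment has several top-level elements.
    root = ET.fromstring(f"<root>{fragment}</root>")
    result = {}
    for div in root.iter("div"):
        # itertext() joins the text on both sides of inline elements such as <gap/>.
        result[div.get("n")] = [
            (line.get("n"), "".join(line.itertext())) for line in div.iter("l")
        ]
    return result


parts = lines_by_textpart(xml_fragment)
print(parts["r"])
```

Joining with `itertext()` discards the `<gap/>` marker itself, so this is only suitable when a plain reading text is wanted; keep the element tree if the editorial markup matters.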

leiden_plus() is syntactic sugar for the registered Markdown extension, and equivalent to:

>>> import leidenmark
>>> from markdown import markdown
>>> markdown(content, extensions=['leiden_plus']) # Other extensions can be added to this list

Configuration

Because this is a Markdown extension, conventions like *italics* and **bold** are also recognized and converted (these two in particular are additionally transformed to the TEI element <hi>). Although they are in principle not part of the Leiden+ syntax, italics and boldface are still encountered often in practice. They are therefore supported by default; this can be switched off by passing strict=True:

>>> leiden_plus(content, strict=True)

NB: The block processors for paragraphs and ordered lists are always switched off, because they interfere too much with Leiden+.

Comments

`enable_paragraphs` option not working as intended
When paragraphs are enabled, the following unwanted behaviour occurs:

>>> leiden_plus('<= Leiden+ section =>', enable_paragraphs=True)
'<ab><p>Leiden+ section </p></ab>'

This should just be <ab>Leiden+ section</ab>.

The problem is that <= and => are handled as separate blocks by preprocessors and stitched together again by postprocessors, so the regular Markdown paragraph processor renders the text in between as a single paragraph. It is probably better to use a mechanism in which <= ... => and <D= ... =D> are interpreted as separate blocks.
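
Until the block handling is reworked, the unwanted wrapper could be stripped after conversion. This is a minimal sketch of such a stopgap, assuming the output has the shape shown in the report (a single <p> directly inside <ab>). The helper `unwrap_ab_paragraphs` is hypothetical and not part of leidenmark.

```python
import re

# Matches an <ab> whose only content is one <p> wrapper.
# The negative lookahead keeps the body from spanning several <p> elements.
_AB_SINGLE_P = re.compile(
    r"<ab>\s*<p>((?:(?!</?p>).)*)</p>\s*</ab>",
    re.DOTALL,
)


def unwrap_ab_paragraphs(xml):
    """Drop a lone <p> wrapper directly inside <ab> and trim its whitespace.

    Hypothetical stopgap for illustration; not part of leidenmark.
    """
    return _AB_SINGLE_P.sub(lambda m: f"<ab>{m.group(1).strip()}</ab>", xml)


fixed = unwrap_ab_paragraphs("<ab><p>Leiden+ section </p></ab>")
print(fixed)  # <ab>Leiden+ section</ab>
```

An <ab> that contains several paragraphs is deliberately left untouched, since removing only some of the tags would produce malformed XML.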
Opened by andredelft

Releases(v0.2.2)

v0.2.2(Aug 4, 2021)

Glyph processor added for *chiro*, *slanting-stroke*, *tripunct* and *leaf*. Cf.: http://papyri.info/docs/leiden_plus#special-characters
v0.2.1(Jul 28, 2021)
This release includes several new processors for broader Leiden+ support, as well as some bugfixes to old ones. The new processors are listed below, linked to the corresponding sections on http://papyri.info

Special characters

Paragraphos: ----

Marginalia

Text inserted / added above line: \ὅλων/

Text added between lines: ||interlin: ὧν||

Other editorial conventions

Handshift: $m4

Extras (Not yet on papyri.info)

Diples:
To get this PN preview: (diple)
Use this Leiden+: ((diple))
To create this XML: <milestone rend="diple" unit="undefined"/>

Eisthesis (indentation):
To get this PN preview: line in eisthesis
Use this Leiden+: (1, indent)
To create this XML: <lb n="1" rend="indent"/>

v0.2.0(Jul 28, 2021)
This release includes:

A pytest integration and some first basic tests

A new, decentralized processor registration system. Each submodule now has a register function (e.g. register_divisions in divisions.py) that registers the corresponding processors. These functions can also be imported when building a custom extension using this package. Note that individual processors should be imported from their submodules (e.g. from leidenmark.divisions import DivisionsPreproc instead of the former from leidenmark import DivisionsPreproc).

v0.1.31(May 11, 2021)

v0.1.30(May 1, 2021)

v0.1.29(May 1, 2021)

v0.1.28(Apr 29, 2021)


Owner

André van Delft

Python & front-end web developer, passionate about digital humanities
