Diplomatic Transcription Expander

Turn abbreviated medieval Latin text (diplomatic transcriptions) into full, readable form. You give it an XML file with the original text; it gives back the same file with abbreviations expanded.

You need: Python 3.10+ and (for online expansion) a free Gemini API key.

Quick start (easiest way)

Install Python
If you don’t have it, download from python.org. On install, check “Add Python to PATH.”
Get an API key (free)
Go to Google AI Studio and create an API key. You’ll paste it into the app.
Open the app
In a terminal/command prompt, go to the project folder and run:
```
python gui.py
```
Use it
- Click Open… and choose your XML file
- Paste your API key when asked (or put it in a .env file once)
- Click Expand
- Click Save… when done

Step-by-step setup

1. Install dependencies

Open a terminal (or Command Prompt on Windows) in the project folder and run:

python -m venv .venv

On Mac/Linux:

source .venv/bin/activate

On Windows:

.venv\Scripts\activate

Then:

pip install -r requirements.txt

(These commands create an isolated environment and install the required libraries.)

2. Set your API key

You have two options:

Option A – Paste when asked
The app will ask for your key the first time you expand. You can choose to save it in a .env file so you don’t have to enter it again.

Option B – Save it beforehand
Copy .env.example to .env in the project folder, open .env in a text editor, and add:

GEMINI_API_KEY=your-api-key-here

Get a key at Google AI Studio.

Important: Never share or upload .env — it contains your secret key.

Using the graphical interface (GUI)

Run:

python gui.py

Main actions

Button	What it does
Open…	Load an XML file
Expand	Expand abbreviations (online with Gemini or locally). During expansion, changes to Queued (N) showing queue count. Click again to toggle the current file in/out of queue.
Save…	Save the expanded result
◀ / ▶	Previous/next XML file in the same folder
Re-expand	Re-expand from the original file; keeps original on left, new result on right. Uses updated examples and learned pairs.
Batch…	Expand multiple XML files in a folder in parallel

Keyboard shortcuts: Ctrl+O (Open), Ctrl+S (Save), Ctrl+E (Expand), Ctrl+← / Ctrl+→ (prev/next file in folder)

Expansion Queue: When an expansion is running, the Expand button shows Queued (N) with the number of queued files. Click it to add the current file to the queue, or click again to remove it. The tool automatically processes queued expansions one after another. You can see "Queue: N" in the status bar and click Clear Q to empty the entire queue.

Settings

Backend – Use Gemini (online, needs API key) or Local (no key, uses rules or Ollama).
Model – Which Gemini model to use (hidden when Backend is Local). Default: gemini-2.5-flash (best value). Speed shown with tick marks (······ = fastest). Click ⟳ to refresh.
Whole doc – When checked (default), expand the entire document in one API call. Uncheck for block-by-block expansion (e.g. for very long documents or progress display).
Modality – How much to expand manuscript transcriptions: conservative, normalize, full, aggressive, or local (tuned for non-Gemini models like Ollama; not the default).
Simul. – How many blocks to process at once. Lower this (e.g. 1) if you see rate limit errors.
Learn – When on, the app saves new abbreviation pairs from each expansion to improve future runs (staged in Review learned for you to accept or reject).
Layered Training – When on, includes learned examples in the expansion prompt (curated + learned).
Max ex – Cap on how many examples are sent in each prompt (blank = use all). Helps with long lists.
Strategy – When Max ex is set: longest-first (prefer longer diplomatic forms) or most-recent (last N in list).

If expansion fails

No API key – Paste your key when prompted, or add it to .env.
Rate limit (429) – Lower Parallel to 1, wait a minute, then try again.
Timeout / hangs – Try the Local backend, or adjust GEMINI_TIMEOUT in .env (increase for large docs with Whole doc mode, e.g. 180).
Want to avoid the API – Switch Backend to Local; the app will use rules and (if installed) Ollama.

Extra features

Diff – Show a unified diff between input and output (like diff input.xml output.xml), so you can see exactly what the expansion changed.
Input→TXT / Output→TXT – Export text blocks to plain .txt files.
Click to sync – Click a block in input or output to jump to the matching block in the other panel.
Double-click companion line – Double-click a block to select it in both panels and show the matching line in the companion XML. The program opens the companion file in the other panel if needed (e.g. filename_expanded.xml when you double-click in input, or filename.xml when you double-click in output), then scrolls to the same block index so you always see input and output from the same pair of files.
Image panel – Click the 🖼▶ strip on the right to expand and upload an image for reference.
Passes – Run expansion more than once in a row to refine the text.
Persistent preferences – Your selections (Backend, Model, Modality, Whole doc, etc.) and last-opened directory are saved when you quit and restored when you reopen.
Status bar – Appears when you first click Expand (progress, elapsed time, format, queue) and stays visible for the rest of the session.
Max ex / Strategy – In settings, cap how many examples are sent in each prompt (Max ex) and choose longest-first (prefer longer forms) or most-recent. Helps stay within token limits on long example lists.

Review learned (staged pairs)

When Learn is on and you expand with Gemini, the app can stage new abbreviation pairs for you to review instead of adding them straight to your examples.

Review learned panel (bottom) – Click ▶ to expand. Lists staged pairs as “diplomatic → full” (one per line). Pairs that already exist in your rules (project + learned + personal, per Layered Training) are not suggested again.
Search – Filter the list by typing in the search box.
Accept – Add the selected pair to your personal learned set (stored in your config directory). Use Promote to add it to the project examples.json instead.
Reject – Remove from the list; that form won’t be suggested again for the next few documents.
Edit – Change diplomatic or full text in the list; Save edits (or autosave when you leave the field) writes changes to the queue.
Accept all / Reject all – Process the whole list in one go.
Export… – Save the current staged pairs to a JSON file.

Personal learned pairs are used when Layered Training is on; project examples and project learned_examples.json take precedence. See Teaching the app below.

Teaching the app (examples)

The app learns from example pairs: “this abbreviation” → “this full form”.

In the GUI

Use the Train section at the bottom:

Type the abbreviated form in Diplomatic (or click From input to copy from a block).
Type the full form in Full (or click From output).
Click Add pair.

Or edit the examples file

Open examples.json in a text editor. Add pairs like this:

[
  { "diplomatic": "grã", "full": "gratia" },
  { "diplomatic": "tempꝰ", "full": "tempus" }
]

More examples = better results. The app also saves learned pairs from expansions when Learn is on. Use Layered Training to include those learned pairs in the prompt.

Where learned pairs live:

Project: examples.json (curated) and learned_examples.json (auto-learned, same folder as your examples file). These are shared with the project.
Personal: A “learned” file in your config directory (e.g. ~/.config/expand_diplomatic/ on Linux). Used when Layered Training is on; project examples always take precedence. Use Accept in Review learned to add to personal, or Promote to add to project examples.json.

Suggested workflow: Open an XML file → add a few pairs from selection (In/Out) → Expand → Save. Use Diff to spot missed forms, add more pairs, then Re-expand. For many files, use Batch… once settings and examples are good. See Speed & efficiency for more workflow notes.

Command-line usage

For single files:

python -m expand_diplomatic --file document.xml

Output is saved as document_expanded.xml next to the original.

For many files:

python -m expand_diplomatic --batch-dir ./my_xml_folder --out-dir ./expanded

For parallel processing of multiple files (faster):

python -m expand_diplomatic --batch-dir ./my_xml_folder --parallel-files 4

Use the local backend (no API key):

python -m expand_diplomatic --backend local --file document.xml

Rules-only (no Ollama, no API): expand using only your example pairs:

python -m expand_diplomatic --backend rules --file document.xml

Limit how many examples are sent in each prompt (useful for long lists):

python -m expand_diplomatic --file document.xml --max-examples 50 --example-strategy longest-first

Evaluation harness – Compare rules-only, local (Ollama), and Gemini on a corpus and write a report:

python -m expand_diplomatic eval --corpus-dir ./test_xml --out-dir ./eval_report

See python -m expand_diplomatic eval --help for options.

File types and format

Input: XML files (TEI or PAGE XML).
Output: Same structure and format, with only the text inside elements changed.
Blocks: Paragraphs, lines, and similar elements (e.g. p, l, Unicode in PAGE) are expanded. Structure, namespaces, and attributes stay the same.
Pairing: Input file.xml → Output file_expanded.xml. When you open a file, if file_expanded.xml exists in the same folder, it's loaded into the output panel.
Batch mode: Files ending in _expanded.xml are skipped to avoid re-expanding.
Parallel files: Use Batch… button (GUI) or --parallel-files N (CLI) to process multiple files simultaneously.
Format detection: The status bar shows whether the loaded file is PAGE or TEI format.

Advanced options

Container (Docker): See the Container section below if you prefer to run in Docker.
Modality: full (default), conservative, normalize, aggressive — control how much abbreviations/superscripts are expanded while staying faithful to the manuscript.
Environment variables:
GEMINI_MODEL, GEMINI_TIMEOUT, EXPANDER_MAX_CONCURRENT and others can be set in .env or your system environment. See .env.example for details.

Container (Docker)

If you use Docker, you can run the app in a container. Builds default to your detected hardware (Apple Silicon → arm64, Intel/AMD → amd64).

export GEMINI_API_KEY="your-api-key"
./run-container.sh --build -- --file sample.xml --out sample_expanded.xml

For local expansion only (no API key):

./run-container.sh --build -- --backend local --file sample.xml --out sample_expanded.xml

Build options:

./scripts/build-container-installs.sh — Build for detected host (native arch)
./scripts/build-container-installs.sh --all — Build linux/amd64 and linux/arm64
./scripts/build-docker.sh --load — Same (native only); --skip-ollama for faster build

On Apple Silicon, uses arm64 (native) by default, not amd64 (emulated).

You need Docker installed and running.

Windows

Running from source on Windows

Install Python 3.10+ from python.org. During setup, check “Add Python to PATH” so you can use python from Command Prompt or PowerShell.
Open Command Prompt (cmd) or PowerShell and go to the project folder, e.g. cd C:\Users\You\expand-diplomatic.

Create and activate a virtual environment:

python -m venv .venv
.venv\Scripts\activate

Install dependencies and run the GUI:
```
pip install -r requirements.txt
python gui.py
```
When the venv is active, your prompt will usually show (.venv).

Getting a Windows executable (no Python required on the target PC)

You can build a portable ZIP that contains ready-to-run .exe files. No MSI installer is required.

Recommended: portable ZIP
One script builds everything. You get a single ZIP file; users extract it anywhere and run expand-diplomatic-gui.exe (or expand-diplomatic.exe for the CLI). No installation step, no Start Menu—just extract and run.
Where to build
The build must run on Windows or WSL2 (Windows Subsystem for Linux), because it produces Windows executables. Use Python 3.10+ in that environment.
How to build the portable ZIP
On Windows (Command Prompt or PowerShell): run the .bat wrapper so Git (bash) is detected or installed first:
```
scripts\build-windows-zip.bat
```
The .bat looks for Git for Windows (bash). If it’s missing, it tries to install it via winget; otherwise it tells you to install from git-scm.com. Do not double‑click the .sh file—Windows may open it in an editor. Use the .bat or, in Git Bash or WSL2, run:
```
./scripts/build-windows-zip.sh
```
The build script installs requirements.txt and cx_Freeze if needed, then produces dist/expand-diplomatic-portable.zip.
Using the ZIP
Copy expand-diplomatic-portable.zip to any Windows machine. Extract it to a folder (e.g. C:\Tools\expand-diplomatic). The contents appear directly in that folder (no extra subfolder). Double‑click expand-diplomatic-gui.exe to start the GUI, or run expand-diplomatic.exe from a command prompt for the CLI. No Python or installer is required on that machine.
Optional: MSI installer
If you need a traditional installer (Start Menu, Add/Remove Programs), run scripts\build-windows-msi.bat on Windows (same Git/bash behaviour as above), or ./scripts/build-windows-msi.sh in Git Bash or WSL2. The MSI build is more environment-dependent; if it fails, use the portable ZIP build above. See Building and Packaging for details.
Antivirus or “Windows protected your PC”
The built .exe files are not signed. Windows or antivirus may warn the first time you run them. You can choose “More info” → “Run anyway” or add an exception for the folder where you extracted the ZIP.

Run after install

From source (in the project folder, with venv activated):

Action	Command
GUI	`python gui.py`
CLI (one file)	`python -m expand_diplomatic --file document.xml`
CLI (folder)	`python -m expand_diplomatic --batch-dir ./my_xml_folder`

After pip install (e.g. pip install dist/expand_diplomatic-*.whl):

Action	Command
GUI	`expand-diplomatic-gui`
CLI (one file)	`expand-diplomatic --file document.xml`
CLI (folder)	`expand-diplomatic --batch-dir ./my_xml_folder`

Other installs:

Windows (portable ZIP) — Extract expand-diplomatic-portable.zip to a folder; run expand-diplomatic-gui.exe or expand-diplomatic.exe. See Windows section above.
Windows (MSI) — Start Menu → “Expand Diplomatic”, or run expand-diplomatic-gui / expand-diplomatic in a terminal.
macOS .app — Open Expand-Diplomatic.app from Applications (or double‑click in dist/).
RPM / DEB — Run expand-diplomatic-gui or expand-diplomatic from any terminal.
Docker — ./run-container.sh -- --file sample.xml (see Container section).

Distribution packages

Build native packages for different platforms:

Python packages (wheel + source)

./scripts/build-packages.sh
# Output: dist/*.whl and dist/*.tar.gz
# Install: pip install dist/expand_diplomatic-*.whl

Windows executable (portable ZIP recommended)

See the Windows section above for full explanation. Summary:

Portable ZIP (recommended): On Windows (cmd/PowerShell) run scripts\build-windows-zip.bat (detects/installs Git); in Git Bash or WSL2 run ./scripts/build-windows-zip.sh. Builds dist/expand-diplomatic-portable.zip. Extract on Windows and run expand-diplomatic-gui.exe or expand-diplomatic.exe. No installer.
MSI (optional): On Windows run scripts\build-windows-msi.bat, or in Git Bash/WSL2 run ./scripts/build-windows-msi.sh. Produces an installer plus the same portable ZIP; more environment-dependent. If MSI fails, use the ZIP build.

RPM (Red Hat, Fedora, CentOS, Rocky Linux)

./scripts/build-rpm.sh
# Requires: rpm-build, python3-devel
# Output: rpmbuild/RPMS/noarch/*.rpm
# Install: sudo dnf install rpmbuild/RPMS/noarch/expand-diplomatic-*.rpm

DEB (Debian, Ubuntu)

./scripts/build-deb.sh
# Requires: dpkg-dev
# Output: dist/*.deb
# Install: sudo apt install ./dist/expand-diplomatic_*.deb

macOS Application Bundle

./scripts/build-macos-app.sh
# macOS only; optionally uses py2app if installed
# Output: dist/Expand-Diplomatic.app
# Install: cp -r dist/Expand-Diplomatic.app /Applications/

Build all formats

./scripts/build-all.sh
# Builds everything available for your platform
# Or specify: ./scripts/build-all.sh --rpm --deb --app --msi --docker

Troubleshooting

Problem	What to try
“No module named …”	Run `pip install -r requirements.txt` (with your venv activated).
“API key” error	Add your key to `.env` or paste it when the app asks.
429 / rate limit	Lower Parallel to 1 and wait before retrying.
Slow or stuck	Use Local backend or set `GEMINI_TIMEOUT=60` in `.env`.
“Ollama not reachable”	Either start Ollama (`ollama serve`) or ignore it — the app falls back to rule-based expansion.
Wrong expansions	Add more example pairs in Train or `examples.json`.

More options (CLI)

Useful flags when running from the command line:

--examples PATH — Use a different examples file
--model ID — Change Gemini model (default: gemini-2.5-flash)
--backend {gemini,local,rules} — gemini (API), local (Ollama + rules), or rules (examples only, no API/Ollama)
--block-by-block — Expand each block separately instead of whole document in one call
--modality {full,conservative,normalize,aggressive,local} — Manuscript expansion mode (local is tuned for non-Gemini models)
--max-examples N — Cap number of examples in each prompt (default: use all)
--example-strategy {longest-first,most-recent} — Which examples to pick when capped
--passes N — Run expansion multiple times (1–5)
--files-api — Upload the full file to Gemini for extra context
train subcommand — Add pairs from the CLI: python -m expand_diplomatic train --add "diplomatic" "full" (see train --help)
eval subcommand — Run the evaluation harness: python -m expand_diplomatic eval --corpus-dir PATH --out-dir PATH (see eval --help)

See .env.example for environment variables (timeouts, retries, etc.).

Credits

The Gemini integration follows the approach from ideasrule/latin_documents.

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
.cursor/rules		.cursor/rules
.github/workflows		.github/workflows
docs		docs
expand_diplomatic		expand_diplomatic
scripts		scripts
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Dockerfile		Dockerfile
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
demo_latin.xml		demo_latin.xml
demo_latin_ollama.xml		demo_latin_ollama.xml
demo_latin_rules.xml		demo_latin_rules.xml
docker-entrypoint.sh		docker-entrypoint.sh
examples.json		examples.json
gui.py		gui.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
run-container.sh		run-container.sh
run_gemini.py		run_gemini.py
sample.xml		sample.xml
sample_expanded.xml		sample_expanded.xml
sample_page.xml		sample_page.xml
sample_page_expanded.xml		sample_page_expanded.xml
stretch_armstrong_icon.png		stretch_armstrong_icon.png

Folders and files

Latest commit

History

Repository files navigation

Diplomatic Transcription Expander

Quick start (easiest way)

Step-by-step setup

1. Install dependencies

2. Set your API key

Using the graphical interface (GUI)

Main actions

Settings

If expansion fails

Extra features

Review learned (staged pairs)

Teaching the app (examples)

In the GUI

Or edit the examples file

Command-line usage

File types and format

Advanced options

Container (Docker)

Windows

Running from source on Windows

Getting a Windows executable (no Python required on the target PC)

Run after install

Distribution packages

Python packages (wheel + source)

Windows executable (portable ZIP recommended)

RPM (Red Hat, Fedora, CentOS, Rocky Linux)

DEB (Debian, Ubuntu)

macOS Application Bundle

Build all formats

Troubleshooting

More options (CLI)

Credits

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages