History of Madoc

Omeka + PHP
Introduction of Python
Introduction of Typescript
Replacing frontend with Typescript
Removal of Omeka + PHP

Omeka-S + Microservices

Madoc began development in early 2017 in partnership with the National Library of Wales (NLW) and the Indigenous Digital Archives (IDA). The goal was to use IIIF and Linked Data to create a platform for the crowdsourcing of metadata, structural data, transcriptions and points of interest on images from digital collections.

There was a split between the "Frontend" side and backend services. The frontend was built using Omeka-S, a PHP based content management system. The backend was built using a collection of many different microservices written in Java and Python and heavily utilising Amazon AWS services. The frontend and backend communicated using many different REST APIs.

The frontend of Omeka was built using Omeka-S theme system, but utilised a custom build-step to compile the frontend assets. At the time this was done using Grunt, babel, Sass and a few other tools.

We identified a few issues with this approach:

The frontend was difficult to develop and test locally
Deployment was difficult and required a lot of manual steps
Omeka-S was not designed for the volume of data we were working with

Consolidation

During the autumn of 2019 we took a step back and look at the platform and where we wanted to go with it. We identified a few key areas that we wanted to improve:

Independent frontend development
Easier deployments
Less moving parts

And put together a plan to address these issues:

Single database for all services
Single API endpoint for all services
Single method of authorisation for all services
Site sandboxing from the start
Single method of deployment for all services (Docker)
Single method of configuration for all services (config api + environment variables)
No requirement of a commercial integration (e.g. AWS or Azure)

These changes would allow us to build a platform that was easier to develop, easier to deploy and easier to maintain. During the first half of 2020 we began to implement these changes and move the platform forward. During this time we were working with Ghent Centre for Digital Humanities (opens in a new tab) and Staatsbibliothek Berlin (opens in a new tab).

Building a foundation

We decided to rewrite the backend services in Python and to use Django as the framework. We started building some of the foundational services that we would need to build the platform. These included:

Configuration API
Tasks API
Search API
Model API (now merged into Madoc)

The Configuration and Search APIs were built in Python using Django. The Tasks API and Model API was built in Typescript using Node.js with the Koa framework.

Madoc

After the initial development, we started working on a set of APIs and new Admin Frontend that would replace the Omeka-S Admin UI. This was built using Typescript and React. We began with replacing the IIIF storage APIs, utilising the new Tasks API to handle much larger volumes of data.

Once we replaced the Admin side, we moved to replacing the public side of Omeka-S and eventually removed the PHP dependency entirely. However, we still used the Omeka-S database which was stored in MySQL. Towards the end of development we migrated the final tables over to Postgres and removed the MySQL dependency.

Build & Tooling Code base