Skip to content

FAIRChemistry/md-models

Repository files navigation

MD-Models

Crates.io Version NPM Version PyPI - Version Build Status

Welcome to Markdown Models (MD-Models), a powerful framework for research data management that prioritizes narrative and readability for data models.

With an adaptable markdown-based schema language, MD-Models automatically generates schemas and programming language representations. This markdown schema forms the foundation for object-oriented models, enabling seamless cross-format compatibility and simplifying modifications to data structures.

Core Philosophy

The primary motivation behind MD-Models is to reduce cognitive overhead and maintenance burden by unifying documentation and structural definition into a single source of truth. Traditional approaches often require maintaining separate artifacts:

  1. Technical schemas (JSON Schema, XSD, ShEx, SHACL)
  2. Programming language implementations
  3. Documentation for domain experts
  4. API documentation

This separation frequently leads to documentation drift and increases the cognitive load on both developers and domain experts.

Check out the documentation and graph editor for more information.

Example

The schema syntax uses Markdown to define data models in a clear and structured way. Each object is introduced with a header, followed by its attributes. Attributes are described with their type, a brief explanation, and optional metadata like terms. Nested or related objects are represented using array types or references to other objects.

---
prefixes:
  schema: http://schema.org/
---

### Person

- **name**
  - Type: string
  - Description: The name of the person
  - Term: schema:name
- age
  - Type: integer
  - Description: The age of the person
  - Term: schema:age
- addresses
  - Type: Address[]
  - Description: The address of the person

### Address

- street
  - Type: string
  - Description: The street of the address

Installation

In order to install the command line tool, you can use the following command:

git clone https://github.com/JR-1991/md-models
cd md-models
cargo install --path .

Command line usage

The command line tool can be used to convert markdown files to various formats. The following command will convert a markdown file to Python code:

md-models convert -i model.md -o lib.py -l python-dataclass

This will read the input file model.md and write the output to lib.py using the Python dataclass template. Alternatively, you can also pass a URL as input to fetch the model remotely. For an overview of all available templates, you can use the following command:

md-models --help

Available templates

The following templates are available:

  • python-dataclass: Python dataclass implementation with JSON-LD support
  • python-pydantic: PyDantic implementation with JSON-LD support
  • python-pydantic-xml: PyDantic implementation with XML support
  • typescript: TypeScript interface definitions with JSON-LD support
  • typescript-zod: TypeScript Zod schema definitions
  • rust: Rust struct definitions with serde support
  • golang: Go struct definitions
  • protobuf: Protocol Buffer schema definition
  • graphql: GraphQL schema definition
  • xml-schema: XML schema definition
  • json-schema: JSON schema definition
  • json-schema-all: Multiple JSON schema definitions (one per object)
  • shacl: SHACL shapes definition
  • shex: ShEx shapes definition
  • compact-markdown: Compact markdown representation
  • mkdocs: MkDocs documentation format
  • linkml: LinkML schema definition

Installation options

The main Rust crate is compiled to Python and WebAssembly, allowing the usage beyond the command line tool. These are the main packages:

  • Core Python Package: Install via pip:

    # Mainly used to access the core functionality of the library
    pip install mdmodels-core
  • Python Package: Install via pip:

    # Provides in-memory data models, database support, LLM support, etc.
    pip install mdmodels
  • NPM Package: Install via npm:

    # Mainly used to access the core functionality of the library
    npm install mdmodels-core

Development

This project uses GitHub Actions for continuous integration. The tests can be run using the following command:

cargo test
cargo clippy

Using pre-commit hooks

This project uses pre-commit to run the rustfmt and clippy commands on every commit. To install the pre-commit hooks, you can use the following command:

pip install pre-commit
pre-commit install

Once the pre-commit hooks are installed, they will run on every commit. This will ensure that the code is formatted and linted correctly. And the clippy CI will not complain about warnings.