Skip to content

Formal software contract, shared data models, sample messages, and compliance tests required for integrating new Entity Resolution Engines (EREs) into the system.

Notifications You must be signed in to change notification settings

meaningfy-ws/entity-resolution-spec

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

entity-resolution-spec

Formal software contract, shared data models, sample messages, and compliance tests required for integrating new Entity Resolution Engines (EREs) into the system.

Installation

To get started, you need a UNIX-compatible environment (Mac/Linux/WSL2) with Make. You can then use the following command:

make

This will run the first and default Make target make install, which installs the necessary user dependencies with the uv package manager.

To install the development dependencies, you can run:

make install-dev

This will install the additional dependencies required for development, such as testing and linting tools, including LinkML for codegen (see below).

Development

This project uses principles of model-driven development (MDD) and domain-driven design (DDD). The core model is defined in the resources/linkml directory, and the Python (Pydantic) models (pluralized to refer to all the classes as is the practice in the programming community) are generated using the LinkML framework.

The generated Python models can be found in the src/models directory. You can regenerate them by running:

make generate_models

Once you are happy, you can also regenerate the documentation by running:

make generate_docs

Running and Testing

TODO: this will be added in future. Right now, this repository contains specifications only and does not have runnable unit tests.

Test data

Deduplicated notices

This repository contains manual deduplication for organizations and procedures from RDF tender notices. The duplication was done using fuzzy string matching with manual checking of the results.

Details here

About

Formal software contract, shared data models, sample messages, and compliance tests required for integrating new Entity Resolution Engines (EREs) into the system.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 75.1%
  • Gherkin 18.2%
  • Makefile 6.7%