Prediction and Optimisation Run-time
General Description
The Prediction and Optimisation Run-time component encapsulates data processing algorithms and seamlessly integrates them into the ZDMP platform. It uses the data management tools from the ZDMP environment and provides an easy-to-use API through which other components or zApps may use the embedded algorithms. Deploying an algorithm as a Prediction and Optimisation Run-time (PO Run-time) hosts it in a scalable manner and makes it available to all components/zApps that support the PO Run-time API. In contrast to most ZDMP components, a single instance of a ZDMP platform may host multiple instances of the PO Run-time component; more precisely, each user can start multiple instances of the PO Run-time.
A potential workflow to create and run a Prediction and Optimisation Run-time is as follows:
Identify data processing problem
Use the Prediction and Optimisation Run-time Designer to search for available algorithms that solve the problem
Adapt the data processing algorithm to needs or write a new algorithm from scratch. This step may also include training the model using test data
Use the Prediction and Optimisation Run-time Designer to embed the finished algorithm/model inside a PO Run-time
Run and use the PO Run-time through the AI Analytics Run-time component in the ZDMP platform
The component consists of three Docker containers:
A Gunicorn server (WSGI) that deploys the Python Flask application implementing the component's main functionality (API, Layer, …)
A Redis database that serves as a cache for specific message bus topics
A Python script that manages message bus subscriptions and updates the local cache (Redis database). In the current version, a MongoDB database is also deployed to persist configurations. Later the MongoDB container will be replaced by the ZDMP Storage component
The data processing algorithms are developed independently and are later injected into and used by the Prediction and Optimisation Run-time. For this to work, each data processing algorithm needs to implement the ComputeUnit sub-component, which has a predefined interface. On the implementation level, the injection is achieved using a Docker parent image (PO-base image) which provides core functionalities as a service (eg API and IO). The Dockerfile of the data processing algorithm (ComputeUnit) is based on this PO-base image, ie starts by importing it. Using this system, the data processing algorithm can be provided as a Python package to the main Flask application.
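To make the ComputeUnit contract concrete, the following minimal Python sketch shows what such a package could look like. The `compute` entry point, its argument names (matching the input ids of the HSD spindle example later in this document), and the returned keys are illustrative assumptions, not the definitive interface:

```python
# compute_unit.py -- illustrative sketch of a ComputeUnit, not the actual interface.
# Assumption: the runtime calls `compute` with keyword arguments named after the
# input ids of the request body and expects a dictionary keyed by the output ids.

def compute(time, vibration, rotation, temperature):
    """Toy anomaly check over the buffered spindle signals (placeholder logic)."""
    mean = sum(vibration) / len(vibration)
    std = (sum((v - mean) ** 2 for v in vibration) / len(vibration)) ** 0.5
    # Flag the latest sample when it deviates strongly from the window mean
    is_anomaly = std > 0 and abs(vibration[-1] - mean) > 3 * std
    return {"result": {"timestamp": time[-1], "anomaly": is_anomaly}}
```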
Resource | Location |
---|---|
Source Code | Link |
Latest Release (v1.0.0) | Download |
Open API Spec | Link |
Video | Link |
Further Guidance | None |
Related Datasets | None |
Additional Links | None |
Generation date of this content | 26 May 2021 |
Screenshots
This component does not have a graphical user interface.
Component Authors
Company Name | ZDMP Acronym | Website | Logo |
---|---|---|---|
Profactor GmbH | PROF | www.profactor.at | |
Universidad Politècnica de València | UPV | https://cigip.webs.upv.es | |
Commercial Information
Resource | Location |
---|---|
IPR Link | Link |
Price | [For determination at end of project] |
License | [For determination at end of project] |
Privacy Policy | [For determination at end of project] |
Volume License | [For determination at end of project] |
Architecture Diagram
The following diagram shows the position of this component in the ZDMP architecture.
Figure 1: Position of Component in ZDMP Architecture
Benefits
The main advantage of deploying a data processing algorithm as a PO Run-time is its seamless integration into the ZDMP platform, which brings the following benefits:
Efficient path from development to deployment in the ZDMP platform
Easy access to data published on the ZDMP platform
Sharing of data processing results with components/zApps on the ZDMP platform
A standardised API which allows existing components/zApps to use newly published algorithms
Features
This component offers the following features through the API:
Data source selection
Output source selection
Continuous or one-time calculation
Modification of algorithm parameters
Query the settings
System Requirements
The component may be run on any platform that supports Docker images. To run the Flask application locally (ie without Docker), the following software requirements exist:
Python 3.7
Pip
Furthermore, to use all features, the following dependencies inside the ZDMP platform exist:
Message Bus: Used to get real-time data
Storage: Used for persistence and historic data
Associated ZDMP services
Required
Optional
Installation
Installing from Source in Local Environment
This component is designed to be used with the AI Analytics component. However, it can also be installed and used as a standalone component. The installation is based on Docker and is achieved by running a docker-compose command. For each data processing algorithm, a separate Docker Compose file is provided. For example, the following commands start an instance of the PO Run-time component with an embedded anomaly detection algorithm for the HSD spindle data.
- Example start-up using Docker Compose (in a Linux command line):

```bash
docker network create zdmp
cd orchestration
export MQTT_USERNAME=<username>
export MQTT_PASSWORD=<password>
docker-compose -f docker-compose.hsd_anomaly_detection.yaml -f docker-compose.mqtt-hsd-spindle-zdmpMB.yaml up --build --remove-orphans
```
Setting the environment variables MQTT_USERNAME and MQTT_PASSWORD allows the component to connect to the ZDMP message bus
- docker-compose.hsd_anomaly_detection.yaml: This Docker Compose file builds the data processing algorithm and embeds it into the main PO Run-time Docker image, which includes a Python Flask application, a MongoDB database, a Redis database, and a Python script:
```yaml
---
version: '3.4'
services:
  hsd_anomaly_detection_backend:
    build:
      context: ../subsystems/layers/Prediction/hsd_anomaly_detection
    container_name: hsd_anomaly_detection
    environment:
      - LOG_LEVEL=info
      - LAYER_MODULE_NAME=hsd_anomaly_detection
      - MQTT_BROKER_REQUIRES_PW=True
      - MQTT_BROKER=zdmp.digitalbusinessplatform.de:18083
      - MQTT_USERNAME=upv
      - MQTT_PASSWORD=ZDMP2020!
    ports:
      - '8082:3000'
networks:
  default:
    external:
      name: zdmp
```
- docker-compose.mqtt-hsd-spindle-zdmpMB.yaml: This Docker Compose file runs a Python script which publishes pre-recorded spindle data on the message bus:
```yaml
---
version: '3.4'
services:
  mqtt-publisher:
    container_name: mqtt-pub-container
    build: ../subsystems/data/mqtt-hsd-spindle
    environment:
      - LOG_LEVEL=debug
      - MQTT_BROKER_REQUIRES_PW=True
      - MQTT_BROKER=zdmp.digitalbusinessplatform.de:18083
      - MQTT_USERNAME=upv
      - MQTT_PASSWORD=ZDMP2020!
```
The Dockerfile of the anomaly detection backend service is defined as:
```dockerfile
FROM zdmp-gitlab.ascora.eu:8443/zdmp_code/platform-tier/t7.1-t7.2-t7.3-prediction-and-optimization-run-time/backend:1.5

# Set up working directory
WORKDIR /usr/local/app/layers

# Install package
COPY ./hsd_anomaly_detection ./hsd_anomaly_detection
COPY README.md setup.py ./
RUN pip install .
```
In summary, the Docker Compose file deploys the anomaly detection backend service together with other required services, such as the MQTT publisher that simulates the data. The Dockerfile builds a container image by first pulling the PO Run-time base image from the ZDMP container registry and then installing the Python package with the anomaly detector into the runtime. The environment variable LAYER_MODULE_NAME controls the name of the module that implements the data processing logic in the runtime. Following these steps, any module implementing a data processing algorithm can be installed in the runtime using a specific image available in the registry. Note that the PO Designer provides development tools to facilitate this process.
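The role of LAYER_MODULE_NAME can be pictured with the following Python sketch. Only the variable name itself comes from this document; the function name and the `compute` entry point are assumptions made for illustration:

```python
# Sketch of how the backend might resolve the injected layer at start-up.
import importlib
import os

def load_compute_unit():
    module_name = os.environ["LAYER_MODULE_NAME"]  # eg "hsd_anomaly_detection"
    layer = importlib.import_module(module_name)   # package added via `pip install .`
    return layer.compute                           # callable invoked per request
```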
It is also possible to use Docker and Docker Compose to build the PO Run-time base image from source code, instead of using one of the official releases in the ZDMP Docker registry. Docker Compose allows building and naming an image and reusing it in another service. Based on this feature, the following alternative docker-compose.build_base_image.yaml builds the image from source:
```yaml
---
version: '3.4'
services:
  base_backend:
    build: ../subsystems/backend
    image: po_backend:0.1
```
Now, the Dockerfile of the anomaly detection backend service is:
```dockerfile
FROM po_backend:0.1

# Set up working directory
WORKDIR /usr/local/app/layers

# Install package
COPY ./hsd_anomaly_detection ./hsd_anomaly_detection
COPY README.md setup.py ./
RUN pip install .
```
This again installs the package with the data processing modules on top of the base image, except that the base image is now built from source with a separate Docker Compose file instead of being pulled from the ZDMP registry, allowing developers to modify or debug the PO Run-time backend locally.
Installing from Source in the AI Analytics Runtime
Developers may wish to install a new data processing unit in the AI Analytics Runtime from source. This guide covers the steps developers need to follow to install a new PO Run-time in the AI Analytics Runtime:
- Create a manifest.json file following the AI Analytics Runtime manifest file specifications. The following example shows the manifest file of the anomaly detector used in the examples above:
```json
{
  "name": "electrospindle_simulation",
  "description": "This Electrospindle simulator emulates the power consumption and the torque (rotation force) of a electrospindle based on the rotation speeding performed",
  "tags": [
    "simulator",
    "electrospindle"
  ],
  "version": "1.0",
  "input": {
    "runtime": [
      {
        "type": "REST_API"
      }
    ]
  },
  "output": {},
  "modelData": {
    "type": "DOCKER_LAYERS",
    "payload": {
      "envVars": {
        "MQTT_BROKER_REQUIRES_PW": "True",
        "MQTT_BROKER": "zdmp.digitalbusinessplatform.de:18083",
        "MQTT_USERNAME": "upv",
        "MQTT_PASSWORD": "ZDMP2020!"
      },
      "layers": [
        {
          "name": "electrospindle_simulation"
        }
      ]
    }
  }
}
```
- Create a ZIP file: Once the manifest file is defined, create a ZIP file with the files and file structure expected by the AI Analytics Runtime to load a new PO Run-time Python layer. For instance:
```text
| Anomaly_detection_layer
|--> hsd_anomaly_detection
|--> Dockerfile
|--> manifest.json
```
The ZIP file must contain the files used in the Dockerfile (in the example above, the files of the Python packages implementing the data processing functions), the Dockerfile, and the manifest.json. Note that the Dockerfile must build the docker image from the PO Runtime base image in the registry, and not from an image built from source. As in the previous section, it is important to note that the PO Designer provides development tools to facilitate this process. More specifically, the PO Designer provides project templates with the required structure and CI/CD pipelines to automate this process.
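As a convenience, the archive could be assembled with a short Python script like the hypothetical sketch below; the directory and archive names follow the example layout above:

```python
# Hypothetical packaging helper: zips the layer sources, Dockerfile, and
# manifest.json in the structure expected by the AI Analytics Runtime.
import zipfile
from pathlib import Path

root = Path("Anomaly_detection_layer")
with zipfile.ZipFile("anomaly_detection_layer.zip", "w", zipfile.ZIP_DEFLATED) as zf:
    for path in root.rglob("*"):
        zf.write(path, path.relative_to(root))
```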
How to use
The API functionality can easily be tested using the Swagger UI, which is currently deployed with the Flask application. For the hsd_anomaly_detection layer, the following steps provide a working example:
- Click on POST /subscription, then 'Try it out', and copy the following JSON into the request body field:
```json
{
  "inputs": [
    {
      "format": "mqtt",
      "id": "time",
      "mqtt": {
        "topic": "t7_1-t7_2-t7_3-prediction-and-optimization-run-time/hsd/spindle/time[sec]",
        "historyLength": 60
      }
    },
    {
      "format": "mqtt",
      "id": "vibration",
      "mqtt": {
        "topic": "t7_1-t7_2-t7_3-prediction-and-optimization-run-time/hsd/spindle/vibration-speed-[mms]",
        "historyLength": 60
      }
    },
    {
      "format": "mqtt",
      "id": "rotation",
      "mqtt": {
        "topic": "t7_1-t7_2-t7_3-prediction-and-optimization-run-time/hsd/spindle/rotation-speed-[rpm]",
        "historyLength": 60
      }
    },
    {
      "format": "mqtt",
      "id": "temperature",
      "mqtt": {
        "topic": "t7_1-t7_2-t7_3-prediction-and-optimization-run-time/hsd/spindle/temperature-Statore-[C]",
        "historyLength": 60
      }
    }
  ],
  "module_parameters": [],
  "outputs": [
    {
      "format": "mqtt",
      "id": "result",
      "mqtt": {
        "topic": "t7_1-t7_2-t7_3-prediction-and-optimization-run-time/poResult"
      }
    }
  ]
}
```
- Click 'Execute'
- The result is published on the message bus and can be viewed with any MQTT client, for example the short subscriber sketch below
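A minimal subscriber for the result topic might look as follows, using the paho-mqtt 1.x client API (install with `pip install paho-mqtt`); the broker address, port, and topic are taken from the compose files above:

```python
# Minimal MQTT subscriber to inspect the published results (sketch).
import os
import paho.mqtt.client as mqtt

def on_message(client, userdata, msg):
    print(msg.topic, msg.payload.decode())

client = mqtt.Client()
client.username_pw_set(os.environ["MQTT_USERNAME"], os.environ["MQTT_PASSWORD"])
client.on_message = on_message
client.connect("zdmp.digitalbusinessplatform.de", 18083)
client.subscribe("t7_1-t7_2-t7_3-prediction-and-optimization-run-time/poResult")
client.loop_forever()
```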
API specification
The following table contains the main details of the API specification:
Resource | POST | GET | PUT | DELETE |
---|---|---|---|---|
/metadata | Not supported | Provides a description of the Run-time component, including a description of inputs and outputs | Not supported | Not supported |
/result | Runs the model with the current input and output configuration | Not supported | Not supported | Not supported |
/subscription | Adds a "subscription" to the list of subscriptions | Returns the list of subscriptions | Updates the subscription with ID idx | Deletes the subscription with ID idx |
The term "subscription" is used in the context of continuous (as opposed to one-time) calculations. A "subscription" is a configuration (similar to the body of a POST request to /result) where at least one input is of format "mqtt".
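For illustration, the subscription endpoints can also be called programmatically, for example with the Python requests package. The base URL assumes the 8082:3000 port mapping from the compose example above, the (shortened) body follows the Swagger walk-through, and the shape of the responses is an assumption:

```python
# Sketch: creating and listing subscriptions through the REST API.
import requests

BASE_URL = "http://localhost:8082"  # host port from the docker-compose example

body = {
    "inputs": [{
        "format": "mqtt",
        "id": "time",
        "mqtt": {
            "topic": "t7_1-t7_2-t7_3-prediction-and-optimization-run-time/hsd/spindle/time[sec]",
            "historyLength": 60,
        },
    }],
    "module_parameters": [],
    "outputs": [{
        "format": "mqtt",
        "id": "result",
        "mqtt": {"topic": "t7_1-t7_2-t7_3-prediction-and-optimization-run-time/poResult"},
    }],
}

print(requests.post(f"{BASE_URL}/subscription", json=body).json())  # create
print(requests.get(f"{BASE_URL}/subscription").json())              # list all
```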
Input / Output data configuration parameters specification
Simplified schema of the body of the POST requests to /subscription and /result:
```json
{
  "inputs": [
    {
      "id": "string",
      "format": "string",
      "<format>": "string"
    }
  ],
  "module_parameters": [
    {
      "id": "string",
      "format": "string",
      "<format>": "string"
    }
  ],
  "outputs": [
    {
      "id": "string",
      "format": "string",
      "<format>": "string"
    }
  ]
}
```
The JSON body of the POST request to /result or /subscription consists of three keys (‘inputs’, ‘outputs’ and ‘module_parameters’) which are lists of JSON objects with the following structure:
- id: Identifier of the input or output parameter, specified in the metadata of the component
  - For input/module_parameters: The name of the argument of the "compute" function (of the ComputeUnit) to which the input is passed
  - For output: The ComputeUnit returns its results as a Python dictionary; the "id" refers to the key of the returned dictionary
- format: Specifies the format of the input or output data parameter. The values of this key can be "filename", "mongo-collection", "collection", or "mqtt"

Depending on the value of the key "format", the key <format> in the schema above should be replaced with one of the following keys:

- filename: (Optional) For file/CSV inputs, specifies the name of the file containing the input data. Currently the files must be available in a Docker volume made available using Docker Compose. Later, the CSV files will be in the Storage component
- mongo-collection: (Optional) For database/mongo inputs. This config is a JSON that specifies the 'mongo_uri' and the query details:
  - For input:
    - mongo_uri: Used to connect to the MongoDB
    - (Optional) operation: Either 'find' or 'aggregate'; defines which operation is used to retrieve the data. Default is 'find' if not present
    - collection_name: The name of the Mongo collection on which the operation is applied
    - If `operation==aggregate`: pipeline: The aggregation pipeline. Has to be a string containing a Python dictionary, eg `"pipeline": "[{ '$match': {'IdTool':4} }]"`
    - (Optional) If `operation==find`: query: The query; if no query is given, '{}' is used instead. Has to be a string containing a valid Python dictionary, eg `"query": "{ 'IdPart': {'$gt':3} }"`
  - For output:
    - mongo_uri: Used to connect to the MongoDB
    - collection_name: The name of the Mongo collection under which the data is stored. Note: The data is stored using MongoDB's 'insert_many'
- collection: (Optional) For collection data, contains the collection (data) in JSON format
- mqtt: (Optional) For message bus data. This config is a JSON that contains the message bus topic to subscribe/publish to, as well as an optional ring buffer length:
  - For input:
    - topic: The name of the message bus topic
    - (Optional) historyLength: The length of the ring buffer which stores the stream data. Default is 1
  - For output:
    - topic: The name of the message bus topic
Example JSON body:
```json
{
  "inputs": [
    {
      "collection": {"A1": 1.131, "B2": "C"},
      "format": "collection",
      "id": "collection-input"
    },
    {
      "format": "mongo",
      "mongo": {
        "mongo_uri": "mongodb://cigip:cigip2020@mongo-container:27017/testzdmp?authSource=admin",
        "operation": "aggregate",
        "collection_name": "part_tools",
        "pipeline": "[{ '$match': {'IdTool':4} }]"
      },
      "id": "tools"
    },
    {
      "format": "mqtt",
      "id": "temperature",
      "mqtt": {
        "topic": "t7_1-t7_2-t7_3-prediction-and-optimization-run-time/hsd/spindle/temperature-Statore-[C]",
        "historyLength": 60
      }
    },
    {
      "format": "mongo",
      "mongo": {
        "mongo_uri": "mongodb://cigip:cigip2020@mongo-container:27017/testzdmp?authSource=admin",
        "operation": "find",
        "collection_name": "parts",
        "query": "{ 'IdPart': {'$gt':3} }"
      },
      "id": "parts"
    }
  ],
  "outputs": [
    {
      "format": "collection",
      "id": "result"
    }
  ]
}
```