Installation and startup
The following section provides a step-by-step guide to the installation process.
Two options are available for running the HMC Toolbox for Data Mining on your local machine. Although the step-by-step-instructions are given for Linux operating systems, similar ways exist for installation on Windows and MacOS:
- Option 1: Run the HMC Toolbox for Data Mining as a stand-alone application using docker compose.
- Option 2: Run the HMC Toolbox for Data Mining as a stand-alone CLI-application using poetry.
- Option 3: Joint deployment of the HMC Toolbox for Data Mining together with the HMC FAIR Data Dashboard using docker compose.
The installation instructions for these options can be found below.
Option 1: Run the HMC Toolbox for Data Mining as a stand-alone application using docker compose.
The HMC Toolbox for Data Mining can be deployed as a standalone product using docker images.
Install docker and docker compose
This installation requires docker and docker compose to be installed on your machine. To do so, please follow the official installation instructions for Docker (opens in a new tab) as well as for Docker Compose V2 (opens in a new tab).
Clone the source code from the GitLab repository
Clone the HMC Toolbox for Data Mining (opens in a new tab) from the public GitLab repository:
git clone https://codebase.helmholtz.cloud/hmc/hmc-public/FAIR-dashboard/hmc-toolbox-for-data-mining.gitor
git clone git@codebase.helmholtz.cloud:hmc/hmc-public/FAIR-dashboard/hmc-toolbox-for-data-mining.gitConfiguration of environment variables
Change path to the new directory:
cd hmc-toolbox-for-data-miningCreate .env file and fill it as .env.example
touch .envor copy the .env.example and then fill it with your desire variables
cp .env.example .envRun the HMC Toolbox for Data Mining using docker compose
docker compose upPlease note: A prerequisite for using the HMC FAIR Data Dashboard is a
local instance of the F-UJI Server (opens in a new tab) as well as a preconfigured MariaDB-database.
With the instructions given in docker-compose.yml, both the database as well as an instance of the F-UJI Server should be launched in parallel to the HMC Toolbox for Data Mining.
Option 2: Run the HMC Toolbox for Data Mining as a stand-alone CLI-application using poetry.
Alternatively you can also use the HMC Toolbox for Data Mining without Docker. In this case please
make sure, that you have installed all dependencies.
Install Poetry
Please install poetry (opens in a new tab) first.
Clone the source code from the GitLab repository
Clone the HMC Toolbox for Data Mining (opens in a new tab) from the public GitLab repository:
git clone https://codebase.helmholtz.cloud/hmc/hmc-public/FAIR-dashboard/hmc-toolbox-for-data-mining.gitor
git clone git@codebase.helmholtz.cloud:hmc/hmc-public/FAIR-dashboard/hmc-toolbox-for-data-mining.gitChange into the project directory
cd hmc-toolbox-for-data-miningInstall project dependencies using Poetry
poetry installActivate the virtual environment:
poetry shellRun the Toolbox as a CLI application
The HMC Toolbox for Data Mining can now be run as a CLI-application using the following command on the project directory:
hmc-toolbox runFor more detaile information on available commands:
hmc-toolbox --helpIt is also possible to execute only parts of the functionalities.
For a developer setup, install additional dependencies:
To run the project in a developer setup, dependencies used for development (e.g., pre-commit or pytest) can also be installed in addition using
poetry install --with dev, test(Optional) To deactivate the virtual environment when finished:
exitOption 3: Joint deployment of the HMC Toolbox for Data Mining together with the HMC FAIR Data Dashboard using docker compose (Linux)
The following instructions address a joint deployment of the HMC Toolbox for Data Mining together with the HMC FAIR Data Dashboard using docker compose on Linux machines. This allows you to show and explore the data harvested by the HMC Toolbox for Data Mining in the HMC FAIR Data Dashboard. For this purpose, we create another public GitLab repository Production setup HMC Toolbox and HMC Dashboard (opens in a new tab).
Install docker and docker compose
The Production setup HMC Toolbox and HMC Dashboard comes with an easy-to-use Docker container.
First you need to install Docker (opens in a new tab) as well as Docker Compose V2 (opens in a new tab).
Clone the source code from the GitLab repository
Clone the Production setup HMC Toolbox and HMC Dashboard (opens in a new tab) public GitLab repository, by
git clone https://codebase.helmholtz.cloud/hmc/hmc-public/FAIR-dashboard/production-setup-hmc-toolbox-and-hmc-dashboard.gitor
git clone git@codebase.helmholtz.cloud:hmc/hmc-public/FAIR-dashboard/production-setup-hmc-toolbox-and-hmc-dashboard.gitConfiguration of environment variables
Change path to the new directory:
cd production-setup-hmc-toolbox-and-hmc-dashboardCreate .env file and fill it as .env.example
touch .envor copy the .env.example and then fill it with your desire variables
cp .env.example .envAn example of the environment variable file can be found here (opens in a new tab).
Run the dashboard using docker compose
docker compose upPlease Note: Upon completion of the pull and create process of the corresponding docker containers, the HMC FAIR Data Dashboard will show you the emply graphs. This is because the database is emplyed in a first step. HMC Toolbox for Data Mining container will harvest the data, enrich it, score it and finally store it in the database.
After finishing the process of HMC Toolbox for Data Mining jobs, HMC FAIR Data Dashboard will have the data to show. (at this stage it is necessary to restart HMC FAIR Data Dashboard)