Versioning
Glossary
    Workflow Annotation
    Node Annotation
    Workflow Description
    Server Repository
    Job
    (Shared) Component
    Data App
    Schedule
    Workflow Service
KNIME Best Practices Guide
If you are new to KNIME, you can download KNIME Analytics Platform via your internal
catalog or directly from the KNIME website. We suggest you take a look at our getting
started guide to build your first workflow.
KNIME Analytics Platform is our open source software for creating data science. Intuitive,
open, and continuously integrating new developments, it makes understanding data and
designing workflows and reusable components accessible to everyone.
As you start your data project, you will design your data process as a workflow in KNIME
Analytics Platform. Once your workflow is ready, it can easily be automated or deployed by
uploading it to KNIME Server.
☐ What data sources are required to reach the goal? Do you have access to these
sources?
☐ Who should consume your deployed data science application, and how?
☐ What hardware and software do you need to meet all these requirements?
To make sure your workflow can access its data sources whether it is executed on your
machine or on any other machine, it is important to avoid local file paths. Instead, use
relative paths or, if your data is already available in a remote file system, manage your files
and folders there directly. To do so, you can use a dynamic file system connection port and
one of the many different connector nodes, e.g. the SMB Connector or the S3 Connector node.
It is not recommended to save any credentials or confidential data in your nodes and
workflows. Instead, use the Credentials Configuration or Credentials Widget node to let the
user specify credentials at runtime.
If the workflow is shared via KNIME Server, the person who executes it can provide the
credentials when starting or scheduling the execution, either via the WebPortal or the
“Configuration” options in the execution window. (Right-click a workflow in the KNIME
Explorer and select “Execute” to open it).
Components can help you and your team to follow the “Don’t Repeat Yourself” principle and
implement best practices. So, for example, if you often connect to a certain database, you
can create a connector component including all the setting options and share the component
to reuse it in your workflows or allow others to use it.
Via configuration nodes, you can make your component more flexible. For instance, you can
change it so that you don’t have to save the credentials inside the component, or allow
another user to change certain settings. If your goal is to deploy your workflow as a Data App
on the KNIME WebPortal, you can use widget nodes to make it configurable.
The KNIME Components Guide gives you more information about how to create and share
components.
To make your workflow reusable, you and your team need to be able to quickly understand
what it’s doing. It is therefore important to document your workflow and, if it’s large, to
structure it as well.
To structure a large workflow, you can use metanodes and components for the different
parts and nest them. To document the workflows, you can:
• Change the node labels to define what individual nodes do by double-clicking on their
labels.
• Create an annotation box. Just right-click anywhere in the workflow editor and select
"New Workflow Annotation." Then type in your explanatory comment, resize the
annotation window to make clear which group of nodes it refers to, and format the text
using the context menu.
This workflow is an example of how to document workflows and add descriptions and
annotations to them.
• Exclude superfluous columns before reading. If not all columns of a dataset are
actually used, you can avoid reading these columns by excluding them via the
“Transformation” tab of the reader node.
• To make the data access part of your workflow efficient, you should avoid reading the
same file multiple times, but instead connect one reader node to multiple nodes.
• Use the “Files in Folder” option to read multiple files with the same structure.
• Remove redundant rows and columns early before performing expensive operations
like joining or data aggregation.
• Only use loops if absolutely necessary. Executing loops is normally expensive. Try
to avoid them by using other nodes, e.g. the String Manipulation (Multi Column) or Math
Formula (Multi Column) node.
• Push as much computation as possible to the database. If you are working with
databases, you can speed up the execution of your workflow by pushing as much of the
processing as possible into the database, as the sketch below illustrates.
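To make the pushdown idea concrete, here is a minimal Python sketch using sqlite3 from the
standard library (the database file, table, and column names are invented for illustration).
KNIME’s DB nodes assemble SQL in a similar spirit: the filtering and aggregation happen
inside the database, so only the small result set is transferred.

import sqlite3

conn = sqlite3.connect("sales.db")  # hypothetical database file

# Anti-pattern: fetch the whole table, then filter and aggregate client-side.
# all_rows = conn.execute("SELECT * FROM orders").fetchall()

# Better: push filtering and aggregation into the query so the database
# does the heavy lifting and only the aggregated result travels over the wire.
query = """
    SELECT customer_id, SUM(amount) AS total
    FROM orders
    WHERE order_date >= '2022-01-01'
    GROUP BY customer_id
"""
for customer_id, total in conn.execute(query):
    print(customer_id, total)

conn.close()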
To further improve the performance of your workflow, you can use the Timer Info node to find
out how long it takes to execute individual nodes.
Versioning
When working in a team, building and modifying workflows together, it is easy to lose
progress by overwriting a colleague’s work. Sometimes you also make changes that you later
discover to be wrong, and you’ll want to roll back your work to a previous version. When
writing code, you usually use a version control system like Git for this task. Such a system
boosts productivity by tracking changes and ensuring that no important work is lost. A video
about the versioning functionality of KNIME Server is available on YouTube.
Snapshots
KNIME Server provides its own version control mechanism (see our documentation) that is
based on Snapshots.
Additionally, a user can choose to create a snapshot when overwriting an item. If necessary,
the server administrator can also make snapshots mandatory for every overwrite action.
An important part of a snapshot is the comment you write for it. When creating the snapshot,
KNIME asks you to provide a small snippet of text that describes it. Here KNIME differs a bit
from other version control systems, which is why you need to be careful what to put there:
the text should describe not the changes you are about to make, but the actual state of the
workflow preserved in the snapshot, i.e. before the changes.
Version History
In addition to the snapshots, a repository item always has exactly one current state. This is
the item that you see in the KNIME Explorer. If you want to access the snapshots, you need to
view the item’s history.
Here you see a list of snapshots, with the dates and times they were created and their
comments. From here you can compare two workflow snapshots by selecting them, then
right-clicking and choosing “Compare.”
If you select only a single snapshot of a workflow for comparison, it is compared to the
current version. Apart from comparison, you can also download a snapshot into your local
repository, delete snapshots, and overwrite the current state with a snapshot.
Not every colleague should be able to see what you are working on; some colleagues may
view your workflows, but you do not want them to change anything; and yet others should
only be able to execute your workflows without seeing what exactly is going on underneath.
KNIME Server comes with comprehensive access control, based on groups and individual
users.
Permissions
Every item in the repository, be it workflows, workflow groups, or data files, can be assigned
permissions. For workflows and workflow groups, those permissions cover reading, writing,
and executing rights. For data files, only the former two are relevant.
You might wonder why anyone would need execute permissions on a workflow group. This is
because by default, items in a workflow group inherit the permissions set on that group.
So if you give a person execute permissions on a certain workflow group and then upload a
workflow into that group, the workflow will be executable by that person. Permission
inheritance can be disabled on a per-item basis and replaced by permissions specific to that
item.
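The inheritance rule is easiest to see as a lookup that walks up the folder hierarchy. The
following Python sketch is purely an illustrative model of that rule, not KNIME Server’s
actual implementation; the class and field names are invented.

from dataclasses import dataclass
from typing import Optional

@dataclass
class Item:
    name: str
    parent: Optional["Item"] = None
    # None means "inherit from the parent"; a dict means inheritance is
    # disabled and these item-specific permissions apply instead.
    permissions: Optional[dict] = None

def effective_permissions(item: Item) -> dict:
    """Walk up the hierarchy until an item defines its own permissions."""
    while item.permissions is None and item.parent is not None:
        item = item.parent
    return item.permissions or {}

root = Item("/", permissions={"read": True, "write": False, "execute": True})
group = Item("/projects/churn", parent=root)            # inherits from root
workflow = Item("/projects/churn/score", parent=group)  # inherits transitively

print(effective_permissions(workflow))  # {'read': True, 'write': False, 'execute': True}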
Reading, writing, and executing can be controlled not only for every user at once, but also for
individual users and groups. For that, the permissions dialog has multiple options: “Owner,”
“Users & Groups,” and “World.”
The owner of an item is the original uploader, but this can be changed by the owner or an
administrator later. The owner is the person who is in charge of the item and can always
change the permissions. Even if you disable reading and writing permissions for the owner,
they will still be able to perform these actions. However, when execution permissions are
denied, the owner won’t be able to execute the workflow.
The World permissions apply to every user who is able to access the KNIME Server. If you
want to tightly control who has access, it is best to disable all checkboxes in this row and
then enable access for individual users and groups. You can do this by clicking the “View
rights…” button, which opens a new window where you can enter custom users and groups
(in the corresponding tab) and their permissions. Simply enter a user or group on a new line,
then click the columns to grant read, write, and execute permissions.
Usually it makes sense to set the permissions as narrowly as possible, but the permission to
guard most closely is the write permission. With that, a user can change and overwrite your
data and workflows, easily causing havoc if they have no business doing so.
Ideally your team has a folder in the server repository that only they have write access for.
Permission inheritance will then make sure that by default, no one else can change anything.
But sometimes you also want to protect your IP, or you want to avoid people copying your
workflows and deploying them with modifications elsewhere on the server, leading to a
fragmentation of your project. In that case it is better to completely close down your project
folder and even disable read access for people outside of your team: unselect all checkboxes
for “World” and then explicitly enable access for your team in the “Users & Groups” settings.
If the workflow is built perfectly and never fails, there’s not much reason to check the node
status or the intermediate output of a node. Alas, nodes do fail occasionally, for a variety of
reasons — be it unexpected data, network failures, or a million other things. To quickly
alleviate these problems, KNIME provides a way of looking into a running job on KNIME
Server as if it were running locally. The extension to allow this is called Remote Workflow
Editor.
The Remote Workflow Editor needs to be installed manually via “File → Install KNIME
Extension…” Just enter “remote workflow editor” in the search box and check the
corresponding extension for installation. Once the editor is installed, you can double-click on
any job (an executing workflow) in the KNIME Explorer to open it in the workflow editor.
You will note a banner at the top of the editor informing you that this is a job running on
KNIME Server. You can now look at node settings and outputs, as well as error and warning
messages. If the job was not started from KNIME WebPortal, you can even reset and re-
execute nodes, change connections, and add completely new nodes. For WebPortal jobs this
is not possible, because it would interfere with a user who accesses the same job in their
browser.
KNIME customers use the Remote Workflow Editor for a variety of things. Some do not have
access to databases and file shares on their local devices, so they upload a partially done
workflow to the server, execute a job via right-click → “Open” → “As new job on Server,” and
then make the necessary configuration changes to let it read the data. Once the data is
loaded, the job, including the data, can be persisted on the server by right-clicking on it in the
KNIME Explorer and selecting “Save as Workflow…” This saved workflow can then be
downloaded for further editing.
When working on a larger project, you might want multiple team members to work on the
same project and/or workflow at the same time. To do so, you can split the overall workflow
into multiple parts, where each part is implemented within a component, for which you define
the task/scope as well as the expected input and output columns.
For example, if you work on a prediction project (e.g. churn prediction or lead scoring) where
you have multiple data sources (e.g. an activity protocol, interaction on your website, email
data, or contract information), you might want to generate some features based on each data
table. In this case, you can split the preprocessing into multiple components with a clear
description — which features should be created based on an input dataset, and which output
is expected (“ID column + generated features,” or similar).
Then each team member can work on one of these tasks individually, encapsulate the work
into a component, and share it with the team. In the end, the different steps can be combined
into one workflow that uses the shared components in the right order.
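To make this contract idea concrete, here is a hedged Python analogy (the function, column
names, and features are invented): each component behaves like a function with a
documented signature, stating which input columns it requires and which output columns it
produces.

import pandas as pd

def engineer_activity_features(activity: pd.DataFrame) -> pd.DataFrame:
    """Component contract (illustrative): the input must contain the columns
    'customer_id' and 'timestamp'; the output is the ID column plus the
    generated features 'n_events' and 'last_seen'."""
    features = activity.groupby("customer_id").agg(
        n_events=("timestamp", "count"),
        last_seen=("timestamp", "max"),
    )
    return features.reset_index()

With such contracts in place, the final workflow simply chains the team’s components, just
as this analogy would chain the corresponding functions.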
Another nice side effect is that the individual components can be reused by other workflows.
So if you have shared components that prepare your customer data for a churn prediction
model, you can reuse them when working on a next best step model.
There are a few things to keep in mind when building shared components to make them
easily reusable. It is best to build your shared component so that it behaves like you would
expect a KNIME node to. This means it should be configurable via the configuration window,
have an informative description, and give meaningful error messages when something fails.
By using configuration nodes, you can create a configuration window for your component.
This allows you to control setting options of individual nodes inside the component via the
configuration window of the component.
So if one of the nodes inside the component has a setting option that you or a user of your
component might want to change in the future, you can use a configuration node to allow
this setting option to be defined via the configuration window of the component.
This part of the components guide introduces the different configuration nodes and how they
can be used.
When configuring a KNIME node, the description gives you information about the general
functionality of the node, the input and output ports, and the different setting options. You
can create something similar for your component.
The Breakpoint node allows you to halt execution and output a custom error message if the
incoming data fulfills a specified condition. So, for example, if the input table is empty or
some selected settings lead to a table that would make your component fail, you can stop
the execution beforehand and inform the user of your component via the custom message.
In the configuration window, you can define when the execution of your component should
be stopped, e.g. if the input table is empty, the node is on an active or inactive branch, or a
flow variable matches a defined value. Optionally, in the lower part you can define a custom
message to be displayed.
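For readers who think in code, the node’s role corresponds roughly to a guard clause like the
following Python/pandas analogy (illustrative only; the Breakpoint node itself is configured
in its dialog, not programmed):

import pandas as pd

def breakpoint_check(table: pd.DataFrame) -> pd.DataFrame:
    """Halt with a custom message if the incoming table is empty."""
    if table.empty:
        raise RuntimeError("Input table is empty - check the upstream filter settings.")
    return table  # otherwise pass the data through unchanged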
The Table Validator node allows you to check whether all necessary columns are in a table.
In a component, this node can be used to check whether all necessary input columns are
present, as well as to output a custom error message via a Breakpoint node, in case
necessary columns are missing.
In the configuration window, you can specify the necessary columns and the specification
each column needs to fulfill: whether it is sufficient that the column exists, whether it
additionally needs a certain data type, whether it is allowed to contain any new values, or
whether the range must be the same as in the template table used when setting up the node.
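Again as a rough Python/pandas analogy (illustrative only; the required columns and types
below are invented), the combination of Table Validator and Breakpoint corresponds to a
check like this:

import pandas as pd

# Hypothetical specification: column name -> expected dtype
REQUIRED = {"customer_id": "int64", "signup_date": "datetime64[ns]"}

def validate_table(table: pd.DataFrame) -> pd.DataFrame:
    missing = [c for c in REQUIRED if c not in table.columns]
    if missing:
        raise ValueError(f"Missing required columns: {missing}")
    wrong = [c for c, t in REQUIRED.items() if str(table[c].dtype) != t]
    if wrong:
        raise ValueError(f"Columns with unexpected type: {wrong}")
    return table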
Workflow services are a newly introduced feature in KNIME Analytics Platform. Like a
component, a workflow service has clearly defined inputs and outputs. Unlike a component,
however, a workflow service stands alone, and each input and output is defined by placing a
Workflow Service Input or Workflow Service Output node into the workflow. The nodes have
dynamic ports, so you can change their type by clicking on the three dots inside the node
body. To make use of a workflow service, you can use a Call Workflow Service node. Once
you point it to a workflow that has Workflow Service Input and Output nodes, the Call
Workflow Service node adapts to the present inputs and outputs and adjusts its own ports
correspondingly. Now you just need to connect it, and when you run it, the workflow it points
to will be executed. Data from your workflow is transferred to the called workflow, which in
turn sends back its results.
So when should you use workflow services, and when should you use components? A
component is copied into a workflow when it is added to it. All the nodes the component is
composed of are then inside your workflow with their own configuration. Deleting the component from the
repository does not change the behavior of your workflow at all. A workflow service, on the
other hand, is merely referenced — the Call Workflow Service node tells the workflow what to
call, but the exact nodes that will be executed are hidden within the workflow service. This
has advantages and disadvantages: your workflow will be smaller because it contains fewer
nodes, but the publisher of the workflow service can change their implementation without
you noticing at all. When using shared components, you can decide whether you want to
update them or not if something has changed in the referenced component. This is not
possible with workflow services.
A component is also executed as part of the workflow that contains it. A workflow service,
on the other hand, is executed where it is located: a workflow service in your local repository
will be executed locally, but if it resides on a KNIME Server, that is where the service will run.
This means that you can offload work from your local workflow to a server, or you can trigger
multiple workflow services at the same time, and if the server has distributed executors for
running workflows in parallel, those workflow services can process your data very quickly.
Workflow Services vs. Call Workflow and Container In- and Outputs
In addition to workflow services, the Call Workflow and various Container Input and Output
nodes have been around for a while. The difference between those and the new workflow
services is that the latter are meant for usage within KNIME, while the former can be called
from third-party applications via the KNIME Server’s REST API. So why not use Call Workflow
for everything? Those nodes transfer data by first converting it into JSON format, a text-
based format that cannot deal with all KNIME data types and is rather large. Workflow
services, on the other hand, use the proprietary binary format KNIME uses internally to
represent its data. This means no conversion is necessary and data size is very small. This
leads to faster execution at the expense of shutting third-party applications out, as they do
not understand the data formats.
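As a sketch of what calling a workflow from a third-party application can look like, here is a
hedged Python example using the requests library. The server URL, workflow path, parameter
name, and endpoint shape are assumptions for illustration; consult your KNIME Server’s REST
API documentation for the exact contract.

import requests

BASE = "https://knime-server.example.com/knime/rest/v4"  # hypothetical server
WORKFLOW = "/Projects/scoring/predict"                   # hypothetical workflow path

# Call a workflow that uses Container Input/Output nodes. The JSON body maps
# each Container Input's parameter name to the data it should receive.
# The endpoint and payload shape are assumptions here - check your server's
# interactive REST documentation for the exact contract.
response = requests.post(
    f"{BASE}/repository{WORKFLOW}:execution",
    auth=("user", "password"),
    json={"input-data": {"rows": [{"customer_id": 42}]}},  # invented parameter name
)
response.raise_for_status()
print(response.json())  # outputs from the Container Output nodes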
Workflow services are always a good idea when you want to split a project up into smaller
reusable workflows. When using many components inside a workflow, this can make the
workflow rather huge, resulting in slow workflow loading. Putting the logic into workflows
that are called can decrease the workflow size and therefore loading time considerably.
A particular use case for workflow services is providing other users of KNIME Analytics
Platform with the ability to offload compute-intensive tasks to a KNIME Server. If someone
building a workflow locally would like to train something like a deep learning model but does
not have a powerful GPU to do so efficiently, they can send their data to a workflow service
that receives the data and hyperparameters as input, trains the model, and outputs the final
result.
But workflow services can also be used to provide access to additional data sources. Data
only available from the server may be exposed to clients by a workflow service. Because the
service can do checks and transformations within the workflow, it is easy to build it in a way
that does not send out any confidential data. You could, for example, anonymize data before
giving it to clients, or you could filter out certain rows or columns. The workflow service
essentially acts as an abstraction layer between the actual data source and the client.
Glossary
Workflow Annotation
A box with colored borders and optional text that can be placed in a workflow to highlight an
area and provide additional information. Right-click on an empty spot in a workflow and
select “New Workflow Annotation” to create a new workflow annotation at that position.
Node Annotation
The text underneath a node. By default this only shows “Node <id>,” where <id> is a running
number given to nodes placed in a workflow. By clicking on the text, the user can edit it and
add a description of what the node does.
Workflow Description
A description of a workflow that can be edited by the user. Shown in place of the node
description when no node is selected. Click on the pencil icon at the top-right to edit it. The
workflow description is also shown in the KNIME Server WebPortal when a user opens the
workflow’s page.
Server Repository
The place on the server where workflows, components, and data files are stored. Items in the
repository can be protected from unauthorized access using permissions.
Job
A copy of a workflow meant for execution. A job can only be accessed by its owner, the user
who started it. More information about jobs can be found here.
(Shared) Component
A workflow snippet wrapped into a single node. Can be shared with other users by saving it in
a server repository. If a component contains view or widget nodes, it provides its own view
composed of all the views within it. If it contains configuration nodes (e.g. String
Configuration), a user can pass in values via the component’s configuration dialog. More
information can be found in the KNIME Components Guide.
Data App
A workflow comprising one or more components that contain view and widget nodes a user
can interact with in the browser. A Data App is deployed on KNIME Server and started via the
WebPortal. A guide on building Data Apps can be found here.
Schedule
A plan of when to execute a particular workflow. Once set up, KNIME Server will make sure
that a workflow is executed at the configured times, either only once or repeatedly. More
information is available here.
REST API
An interface for access to KNIME Server functionality using HTTP calls (GET, POST, PUT,
DELETE, etc.). REST stands for Representational State Transfer and constrains how an
interface in the world wide web should behave. API stands for Application Programming
Interface and is generally used to allow machines to communicate with each other. On
KNIME Server, the REST API can be used to manage workflows and other files (up- and
download, move, delete), create schedules, and trigger the execution of workflows. KNIME
Analytics Platform also communicates with KNIME Server via the REST API, so essentially
everything you can do in the AP, you can also do via REST. More information can be found
here.
Workflow Service
A workflow with Workflow Service Input and Workflow Service Output nodes that can be
called from other workflows using the Call Workflow Service node.
The KNIME® trademark and logo and OPEN FOR INNOVATION® trademark are used by KNIME AG under license
from KNIME GmbH, and are registered in the United States. KNIME® is also registered in Germany.