mFlow

Advanced automation in the data warehouse

mFlow enables fast implementation, efficient control, management and monitoring of the ETL processes execution

ETL process involves collecting a complete, consistent set of data from multiple source systems. It is necessary to ensure proper ordering and control of all the interdependencies between individual transformations in order to load the data warehouse successfully.

Usual pitfalls in the design and management of ETL processes are:

Process development time takes up significant developer resources
Non-transparency of processes, their interdependence and performance results
Lack of adaptive parallelism
No support for process origin and impact reports (Lineage / Impact Analysis)
Problematic migration between environments and the inability to gradually replace ETL tools

Increased productivity

Creating an entire process tree of several hundred processes in a couple of hours
Adaptive changes are measured in minutes
Process creation: Individual (GUI), Bulk (SQL)
There is no deploy
Testing of process variants: selection of series / parallels is reduced to changing one attribute (update)

Flexibility

Modular development and testing
Easy implementation of changes
Independence of the ETL tool
Flexible API addition – (combining ETL tools, gradual migration enabled)

Performance

Intelligent adaptive parallelism
The maximum number of simultaneous processes can be defined
Queuing
Maximum utilization of resources
It is possible to define the order of entering the Queue

Supervision and maintenance

Transparency (Lineage & Impact analysis)
Load process monitoring via web browser
Automation – (Re) start load with one command
Unified logging (processed lines, errors)
Interactive ad hoc reports