A fully automated data ingestion and syndication process for transferring data from SAP to an Azure data lake
Velocity is an SAP-certified, rapid data ingestion solution built on Microsoft Azure. It lets you apply the power of Azure to your data and build a modern data platform.
Key Features:
Transfer data from SAP and non-SAP sources to Azure rapidly and securely.
Fully managed change data capture (CDC) built in.
Batch data ingestion that keeps the impact on source systems low.
100% PaaS (Platform-as-a-Service) Product.
Full GUI, no-code solution that empowers citizen data scientists who have no coding skills.
Unlimited data ingestion included within the license.
Supported Data Sources:
SAP ECC
SAP S/4HANA
SAP BW
SAP BW/4HANA
SAP BW on HANA
HANA Sidecar
SuccessFactors
Oracle
SQL
Storage Account
Serverless Solution
Velocity is a serverless solution, so no additional servers are needed to deploy and operate the source-to-Azure integration. Velocity connects SAP and non-SAP source systems directly to the target Azure system(s), reducing infrastructure, support costs, and solution complexity while improving security and deployment speed.
Source Reporting
Velocity keeps the storage in sync with your source data, so your latest data is always available. No other ETL tools, jobs, or batch file generation processes are required, which simplifies and shortens the processing of deltas and change data capture (CDC) into the target lake. Velocity feeds near real-time data into the target storage, so you can generate actionable insights and react immediately, limiting potential damage.
Deployment
Deployment consists of source transports and Azure code deployments. A Proof of Value (PoV) or full implementation is deployed alongside your team so that knowledge of the solution is transferred in-house. An installation guide is provided for your specific source add-on to ensure that the solution meets security standards.
Data Storage
Velocity aligns with data management best practices for staging and managing data within your storage account. This is achieved through user-defined storage areas based on the type of data being stored. For example, finance data and customer data can be configured to be stored automatically in separate areas within your storage target. Velocity supports multiple storage accounts as targets, enabling segregation of data as required.
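The idea of user-defined storage areas can be pictured with a minimal Python sketch. It is an illustration only, not Velocity's configuration format: the StorageArea class, the account, container, and prefix names, and the target_path helper are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StorageArea:
    """Hypothetical description of one storage area (not a Velocity type)."""
    account: str
    container: str
    prefix: str


# Illustrative mapping from data category to storage area.
# All account, container, and prefix names are made up for this sketch.
AREAS = {
    "finance": StorageArea("examplefinance", "raw", "sap/finance"),
    "customer": StorageArea("examplecustomer", "raw", "sap/customer"),
}
DEFAULT_AREA = StorageArea("examplegeneral", "raw", "sap/other")


def target_path(category: str, table: str, file_name: str) -> str:
    """Build the destination path for one extracted file."""
    area = AREAS.get(category, DEFAULT_AREA)
    return f"{area.account}/{area.container}/{area.prefix}/{table}/{file_name}"


if __name__ == "__main__":
    print(target_path("finance", "BKPF", "part-0001.parquet"))
    print(target_path("customer", "KNA1", "part-0001.parquet"))
```

Keeping the category-to-area mapping in one place means finance and customer data land in separate accounts without any per-table logic.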
Security
The Velocity integration has only two endpoints: the source and Azure. No intermediary servers or intermediate databases are required, so data is never at rest outside the security of the source or Azure systems, which reduces exposure to security vulnerabilities. Data is encrypted during transfer and, once in Azure, is at rest only within the security of your data lake. Users can also encrypt the at-rest data in the lake if desired. Velocity leverages the source authorization model to control data access from source to Azure. Within Azure, Active Directory permissions provide role-based access to Velocity, controlling who can configure it.
Management
Users can add source systems, table extractions, and custom logical views in the Velocity portal. Administrative users can configure the following:
System connections
Objects to ingest
Change data capture (CDC) method
Data compression
Maximum batch size
Destination file format
Target data encryption
Job concurrency limit on source systems
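To make these settings concrete, here is a hedged Python sketch of how one extraction's configuration might be modeled and checked. Velocity is configured through its portal, so this is purely illustrative: the ExtractionConfig class, its field names, the default values, the allowed CDC method labels, and the reading of the last setting as a job concurrency limit are all assumptions.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ExtractionConfig:
    """Hypothetical model of the administrative settings listed above."""
    system_connection: str
    objects_to_ingest: List[str]
    cdc_method: str = "application"        # assumed labels: "application" or "database_log"
    data_compression: bool = True
    max_batch_size: int = 50000            # assumed unit: rows per batch
    destination_file_format: str = "parquet"  # illustrative value
    target_data_encryption: bool = False
    max_concurrent_jobs: int = 2           # assumed meaning of the job limit on the source

    def validate(self) -> List[str]:
        """Return a list of problems; an empty list means the config is usable."""
        errors = []
        if not self.objects_to_ingest:
            errors.append("objects_to_ingest must not be empty")
        if self.cdc_method not in ("application", "database_log"):
            errors.append("cdc_method must be 'application' or 'database_log'")
        if self.max_batch_size <= 0:
            errors.append("max_batch_size must be positive")
        if self.max_concurrent_jobs <= 0:
            errors.append("max_concurrent_jobs must be positive")
        return errors


if __name__ == "__main__":
    config = ExtractionConfig("ECC_PROD", ["BKPF", "KNA1"])
    print(config.validate())
```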
Schema Definition
Velocity also syndicates the source data schema definition. Processes that consume the data from storage can use this definition to identify the structure of the data when required, and it can also be used for further downstream transformations.
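As a rough illustration of how a downstream process might use a syndicated schema definition, the Python sketch below casts raw string values to typed values and reads the key fields. The JSON layout, the type names, and the helper functions are assumptions made for this example; the format Velocity actually writes may differ.

```python
import json
from decimal import Decimal

# Hypothetical schema definition content; the real layout may differ.
SCHEMA_JSON = """
{
  "table": "BKPF",
  "fields": [
    {"name": "BUKRS", "type": "string", "key": true},
    {"name": "BELNR", "type": "string", "key": true},
    {"name": "GJAHR", "type": "integer", "key": true},
    {"name": "KURSF", "type": "decimal", "key": false}
  ]
}
"""

# Assumed type names mapped to Python constructors.
CASTERS = {"string": str, "integer": int, "decimal": Decimal}


def key_fields(schema: dict) -> list:
    """Return the names of the key fields in schema order."""
    return [f["name"] for f in schema["fields"] if f["key"]]


def cast_row(raw: dict, schema: dict) -> dict:
    """Convert raw string values to the types named in the schema."""
    return {f["name"]: CASTERS[f["type"]](raw[f["name"]]) for f in schema["fields"]}


if __name__ == "__main__":
    schema = json.loads(SCHEMA_JSON)
    raw = {"BUKRS": "1000", "BELNR": "0100000001", "GJAHR": "2024", "KURSF": "1.25"}
    print(key_fields(schema))
    print(cast_row(raw, schema))
```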
CDC/Deltas
Change data capture (CDC) is the process of identifying changes (deltas) in the source system and replicating them to the target, so that the target always reflects the data in the source. Velocity's CDC works at either the application level or the database log level to identify changes in the source system and apply the same changes to the target storage dataset. This is effective with all source table types, including pool and cluster tables.
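The general principle of applying deltas to a target can be sketched in a few lines of Python. This is a generic illustration of CDC replay, not Velocity's implementation: the change-record shape (operation, key, row) and the operation codes "I", "U", and "D" are assumptions for the example.

```python
from typing import Dict, Iterable, Optional, Tuple

# Assumed change-record shape: (operation, key, row), where operation is
# "I" (insert), "U" (update), or "D" (delete). Not Velocity's actual format.
Change = Tuple[str, tuple, Optional[dict]]


def apply_deltas(target: Dict[tuple, dict], deltas: Iterable[Change]) -> Dict[tuple, dict]:
    """Replay changes in order so the target ends up matching the source."""
    for operation, key, row in deltas:
        if operation in ("I", "U"):
            target[key] = row          # insert or overwrite the row for this key
        elif operation == "D":
            target.pop(key, None)      # deleting a missing key is a no-op
        else:
            raise ValueError(f"unknown operation: {operation!r}")
    return target


if __name__ == "__main__":
    target = {("1000", "0100000001"): {"amount": 100}}
    deltas = [
        ("U", ("1000", "0100000001"), {"amount": 120}),
        ("I", ("1000", "0100000002"), {"amount": 50}),
        ("D", ("1000", "0100000001"), None),
    ]
    print(apply_deltas(target, deltas))
```

Changes must be replayed in the order they occurred; in this example the update to the first document is superseded by its later delete.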
Performance
Velocity optimizes the data extraction and replication process and provides numerous adjustable parameters for tuning your extractions.