StreamSets Data Collector is an easy-to-use data pipeline engine for streaming, CDC and batch ingest from any source to Azure. With StreamSets, you spend your time building data pipelines, enabling self-service, and innovating, and minimize the time you spend maintaining, rewriting and fixing pipelines.
Ingest data from a broad variety of sources including Kafka, HDFS, databases, files, applications and more into Azure Storage, Azure Event Hub, Azure Synapse, Snowflake and Databricks. Integrated with Azure Key Vault for seamless security.
Ittiakes a few minutes to deploy the Azure VM. Once the VM is available, it takes about one minute to start the Data Collector service. The StreamSets Data Collector web based UI will be available on port 18630 . To access Data Collector, enter the following URL in the address bar of your browser:
http://[Public DNS of Azure VM]:18630
For example if your Public DNS is 123.123.123.123, enter http://123.123.123.123:18630 in the browser.