https://106c4.wpc.azureedge.net/80106C4/Gallery-Prod/cdn/2015-02-24/prod20161101-microsoft-windowsazure-gallery/streamsets.streamsets-data-collectorstreamsets-data-collector-core-hour-1_0.1.0.2/Icons/Large.png

StreamSets Data Collector

StreamSets
Execution engine for fast data ingestion-- streaming, batch or CDC
https://gallery.azure.com/artifact/20151001/streamsets.streamsets-data-collectorstreamsets-data-collector-core-hour-1_0.1.0.2/Artifacts/Thumbnails/c81e40a0-f9e3-4513-96d0-367d810e8f07.png
/images/videoOverlay.png
https://gallery.azure.com/artifact/20151001/streamsets.streamsets-data-collectorstreamsets-data-collector-core-hour-1_0.1.0.2/Artifacts/Thumbnails/c81e40a0-f9e3-4513-96d0-367d810e8f07.png
/images/videoOverlay.png
https://gallery.azure.com/artifact/20151001/streamsets.streamsets-data-collectorstreamsets-data-collector-core-hour-1_0.1.0.2/Artifacts/Thumbnails/3d4acc75-97fb-4289-ae66-3a800dbf55fc.png
/images/videoOverlay.png
https://gallery.azure.com/artifact/20151001/streamsets.streamsets-data-collectorstreamsets-data-collector-core-hour-1_0.1.0.2/Artifacts/Thumbnails/74271d06-3371-4087-bff5-8ff3b3e4507e.png
/images/videoOverlay.png
https://106c4.wpc.azureedge.net/80106C4/Gallery-Prod/cdn/2015-02-24/prod20161101-microsoft-windowsazure-gallery/streamsets.streamsets-data-collectorstreamsets-data-collector-core-hour-1_0.1.0.2/Screenshots/Screenshot1.png
Support
Support

StreamSets Data Collector

StreamSets

5.0 (1)

Execution engine for fast data ingestion-- streaming, batch or CDC

StreamSets Data Collector is an easy-to-use modern execution engine for fast data ingestion and light transformations. Data Collector's easy-to-use visual tools let you design, deploy and operate streaming, CDC (change data capture) and batch data pipelines data without hand coding, from the full variety of data sources such as Kafka, S3, Snowflake, Databricks, JDBC, Hive, Salesforce, Oracle and many, many more. Fully instrumented ""smart"" data pipelines let you monitor data in-flight, and are designed to handle data drift with built-in detection and handling. Data Collector pipelines are designed to be platform-agnostic, so you can adapt as needed and avoid vendor lock-in.

    Easiest to use tool for data ingestion
  • abstracts away details
  • single platform for all patterns
  • full lifecycle, from design to operations
    Pipelines that can be ported across on-premise & cloud platforms
    Drift resilient, fully-instrumented ""smart"" pipelines
    Enterprise functionality and performance




Usage Instructions

It takes a few minutes to deploy the Azure VM. Once the VM is available, it takes about one minute to start the Data Collector service. The StreamSets Data Collector web based UI will be available on port 18630 . To access Data Collector, enter the following URL in the address bar of your browser:

http://[Public DNS of Azure VM]:18630

For example if your Public DNS is 123.123.123.123, enter http://123.123.123.123:18630 in the browser.

https://gallery.azure.com/artifact/20151001/streamsets.streamsets-data-collectorstreamsets-data-collector-core-hour-1_0.1.0.2/Artifacts/Thumbnails/c81e40a0-f9e3-4513-96d0-367d810e8f07.png
/images/videoOverlay.png
https://gallery.azure.com/artifact/20151001/streamsets.streamsets-data-collectorstreamsets-data-collector-core-hour-1_0.1.0.2/Artifacts/Thumbnails/c81e40a0-f9e3-4513-96d0-367d810e8f07.png
/images/videoOverlay.png
https://gallery.azure.com/artifact/20151001/streamsets.streamsets-data-collectorstreamsets-data-collector-core-hour-1_0.1.0.2/Artifacts/Thumbnails/3d4acc75-97fb-4289-ae66-3a800dbf55fc.png
/images/videoOverlay.png
https://gallery.azure.com/artifact/20151001/streamsets.streamsets-data-collectorstreamsets-data-collector-core-hour-1_0.1.0.2/Artifacts/Thumbnails/74271d06-3371-4087-bff5-8ff3b3e4507e.png
/images/videoOverlay.png
https://106c4.wpc.azureedge.net/80106C4/Gallery-Prod/cdn/2015-02-24/prod20161101-microsoft-windowsazure-gallery/streamsets.streamsets-data-collectorstreamsets-data-collector-core-hour-1_0.1.0.2/Screenshots/Screenshot1.png