More Information
Special Product: Yes
Key Note: Agilitics courses information
Course features: Lifetime Access, CloudLabs, 24x7 Support, Real-time code analysis and feedback, 100% Money Back Guarantee
Interested Audience: You learn about, and compare, many of the computing and storage services available in Google Cloud Platform, including Google App Engine, Google Compute Engine, Google Kubernetes Engine, Google Cloud Storage, Google Cloud SQL, and BigQuery. You learn about important resource and policy management tools, such as the Google Cloud Resource Manager hierarchy and Google Cloud Identity and Access Management.

Course Description

This two-day hands-on training course provides a comprehensive introduction to StreamSets Data Collector. Participants will learn how to create complex pipelines that ingest data from a variety of sources, manipulate that data, and then export it to destinations including Apache Kafka, relational database management systems, and Apache Hadoop. Throughout the course, hands-on exercises reinforce the concepts being discussed.

Target Audience

The course is designed for those who will be designing, building, and running data flow pipelines, including data engineers, data developers, data analysts, data scientists, ETL developers, and data architects. No prior knowledge of StreamSets Data Collector is required.

Prerequisites

Students should preferably have a general knowledge of operating systems, networking, programming concepts, and databases.

Key Objectives

Get a Peek at Our Success Stories

Featured Review

Puli

Developer

One of the best I have encountered in my life. Allowing the freedom to interact, and responding candidly and with courage to every question, is not an easy task for trainers, but they did it exceptionally well.

Chun Ngee

Developer

The course is well structured. The timing is also right. The trainer, Mr Raj, is professional, and he answered all my questions and doubts.

Sarbojit Bose

Developer

The course is one of the two in the track of Agile Professional Coach. It is designed to provide both wide and deep knowledge to become a competent Coach with the additional skills of a Trainer and a Mentor. The two trainers, Preeth Panday and Naveen K Singh, are excellent Facilitators and Coaches with patience and promptness. Their mastery in this area stands out while their mode of delivery captures the interest of the trainees. They demonstrated professionalism with a personal touch.

Training FAQ

Course Outline

Overview of the StreamSets DataOps Platform

  • DataOps Platform Overview
  • StreamSets DataOps Architecture and Use Cases
  • Custom Examples

An Introduction to StreamSets Data Collector

  • Getting Started with Data Collector
  • SDC Overview
  • The SDC User Interface
  • Building Pipelines
  • Previewing Data
  • Running the Pipeline

Pipeline Events, Rules, and Alerts

  • Generating and Handling Events
  • Metric Rules
  • Data Rules

Reading, Writing and Transforming Data

  • Flat Files
  • Relational Databases: MySQL, Oracle, and Change Data Capture
  • Messaging Broker Systems: Kafka
  • Event Based: APIs
  • Distributed Storage: HDFS
  • Lookups: Relational Databases

Administration and Monitoring

  • Monitoring your SDC instances

Data Collector Security

  • Securing your Data Collector
  • Kerberos

Troubleshooting and Tuning Your StreamSets Environment

  • Identifying issues
  • Troubleshooting issues
  • Working with Support

Handling Data Drift

  • Data Drift Rules
  • Hive