Data Engineering using Databricks features on AWS and Azure

seeders: 4
leechers: 0
updated:
Added by cg3780 in Other > Tutorials

Download Fast Safe Anonymous
movies, software, shows...

Files

Data Engineering using Databricks features on AWS and Azure
  • !!! More Courses !!!.txt (1.1 KB)
  • 01 Introduction to Data Engineering using Databricks
    • 001 Overview of the course - Data Engineering using Databricks.en.srt (17.6 KB)
    • 001 Overview of the course - Data Engineering using Databricks.mp4 (75.0 MB)
    • 002 Where are the resources that are used for this course_.html (1.7 KB)
    • 003 [Must Watch] 30 Day Money Back Guarantee, Feedback and Rating.en.srt (4.2 KB)
    • 003 [Must Watch] 30 Day Money Back Guarantee, Feedback and Rating.mp4 (6.0 MB)
    02 Getting Started
    • 001 Signing Up For Databricks Community Edition.en.srt (10.7 KB)
    • 001 Signing Up For Databricks Community Edition.mp4 (40.3 MB)
    • 002 Create Azure Databricks Service.en.srt (9.7 KB)
    • 002 Create Azure Databricks Service.mp4 (39.1 MB)
    • 003 Signup For Databricks Full Trial.en.srt (12.9 KB)
    • 003 Signup For Databricks Full Trial.mp4 (57.6 MB)
    • 004 Overview Of Databricks UI.en.srt (3.1 KB)
    • 004 Overview Of Databricks UI.mp4 (9.3 MB)
    • 005 Upload Data In Files Into Databricks.en.srt (12.3 KB)
    • 005 Upload Data In Files Into Databricks.mp4 (49.9 MB)
    • 006 Create Cluster In Databricks Platform.en.srt (6.1 KB)
    • 006 Create Cluster In Databricks Platform.mp4 (20.4 MB)
    • 007 Managing File System Using Notebooks.en.srt (9.5 KB)
    • 007 Managing File System Using Notebooks.mp4 (45.0 MB)
    03 Setup Local Development Environment
    • 001 Setup Single Node Databricks Cluster.en.srt (8.2 KB)
    • 001 Setup Single Node Databricks Cluster.mp4 (27.6 MB)
    • 002 Install Databricks Connect.en.srt (5.1 KB)
    • 002 Install Databricks Connect.mp4 (23.9 MB)
    • 003 Configure Databricks Connect.en.srt (6.7 KB)
    • 003 Configure Databricks Connect.mp4 (34.6 MB)
    • 004 Integrating Pycharm with Databricks Connect.en.srt (6.2 KB)
    • 004 Integrating Pycharm with Databricks Connect.mp4 (22.2 MB)
    • 005 Code - Integrating Pycharm with Databricks Connect.html (1.1 KB)
    • 006 Integrate Databricks Cluster with Glue Catalog.en.srt (12.7 KB)
    • 006 Integrate Databricks Cluster with Glue Catalog.mp4 (71.9 MB)
    • 007 Setup s3 Bucket and Grant Permissions.en.srt (5.1 KB)
    • 007 Setup s3 Bucket and Grant Permissions.mp4 (22.2 MB)
    • 008 Mounting s3 Buckets into Databricks Clusters.en.srt (6.1 KB)
    • 008 Mounting s3 Buckets into Databricks Clusters.mp4 (24.7 MB)
    • 009 Using dbutils from IDEs such as Pycharm.en.srt (3.8 KB)
    • 009 Using dbutils from IDEs such as Pycharm.mp4 (18.4 MB)
    • 010 Code - Using dbutils from IDEs such as Pycharm.html (1.2 KB)
    04 Using Databricks CLI
    • 001 Introduction.en.srt (2.1 KB)
    • 001 Introduction.mp4 (6.2 MB)
    • 002 Install and Configure Databricks CLI.en.srt (4.4 KB)
    • 002 Install and Configure Databricks CLI.mp4 (21.2 MB)
    • 003 Interacting with File System using CLI.en.srt (14.0 KB)
    • 003 Interacting with File System using CLI.mp4 (70.4 MB)
    • 004 Getting Cluster Details using CLI.en.srt (7.2 KB)
    • 004 Getting Cluster Details using CLI.mp4 (36.7 MB)
    05 Spark Application Development Life Cycle
    • 001 Setup Virtual Environment and Install Pyspark.en.srt (7.7 KB)
    • 001 Setup Virtual Environment and Install Pyspark.mp4 (37.8 MB)
    • 002 [Commands] - Setup Virtual Environment and Install Pyspark.html (1.1 KB)
    • 003 Getting Started with Pycharm.en.srt (7.8 KB)
    • 003 Getting Started with Pycharm.mp4 (29.7 MB)
    • 004 [Code and Instructions] - Getting Started with Pycharm.html (1.8 KB)
    • 005 Passing Run Time Arguments.en.srt (8.8 KB)
    • 005 Passing Run Time Arguments.mp4 (29.9 MB)
    • 006 Accessing OS Environment Variables.en.srt (6.8 KB)
    • 006 Accessing OS Environment Variables.mp4 (23.9 MB)
    • 007 Getting Started with Spark.en.srt (4.0 KB)
    • 007 Getting Started with Spark.mp4 (18.8 MB)
    • 008 Create Function for Spark Session.en.srt (8.2 KB)
    • 008 Create Function for Spark Session.mp4 (41.3 MB)
    • 009 [Code and Instructions] - Create Function for Spark Session.html (2.1 KB)
    • 010 Setup Sample Data.en.srt (3.8 KB)
    • 010 Setup Sample Data.mp4 (18.6 MB)
    • 011 Read data from files.en.srt (12.8 KB)
    • 011 Read data from files.mp4 (63.6 MB)
    • 012 [Code and Instructions] - Read data from files.html (2.4 KB)
    • 013 Process data using Spark APIs.en.srt (9.5 KB)
    • 013 Process data using Spark APIs.mp4 (43.7 MB)
    • 014 [Code and Instructions] - Process data using Spark APIs.html (2.3 KB)
    • 015 Write data to files.en.srt (11.2 KB)
    • 015 Write data to files.mp4 (44.7 MB)
    • 016 [Code and Instructions] - Write data to files.html (2.8 KB)
    • 017 Validating Writing Data to Files.en.srt (10.8 KB)
    • 017 Validating Writing Data to Files.mp4 (50.5 MB)
    • 018 Productionizing the Code.en.srt (7.4 KB)
    • 018 Productionizing the Code.mp4 (25.5 MB)
    • 019 [Code and Instructions] - Productionizing the code.html (4.3 KB)
    • 020 Setting up Data for Production Validation.en.srt (6.6 KB)
    • 020 Setting up Data for Production Validation.mp4 (41.2 MB)
    06 Databricks Jobs and Clusters
    • 001 Introduction to Jobs and Clusters.en.srt (4.5 KB)
    • 001 Introduction to Jobs and Clusters.mp4 (15.9 MB)
    • 002 Creating Pools in Databricks Platform.en.srt (5.3 KB)
    • 002 Creating Pools in Databricks Platform.mp4 (19.5 MB)
    • 003 Create Cluster on Azure Databricks.en.srt (9.2 KB)
    • 003 Create Cluster on Azure Databricks.mp4 (36.6 MB)
    • 004 Request to Increase CPU Quota on Azure.en.srt (4.2 KB)
    • 004 Request to Increase CPU Quota on Azure.mp4 (21.7 MB)
    • 005 Creating Job on Databricks.en.srt (13.2 KB)
    • 005 Creating Job on Databricks.mp4 (63.9 MB)
    • 006 Submitting Jobs using Job Cluster.en.srt (8.9 KB)
    • 006 Submitting Jobs using Job Cluster.mp4 (33.4 MB)
    • 007 Create Pool in Databricks.en.srt (4.4 KB)
    • 007 Create Pool in Databricks.mp4 (15.3 MB)
    • 008 Running Job using Interactive Cluster Attached to Pool.en.srt (5.0 KB)
    • 008 Running Job using Interactive Cluster Attached to Pool.mp4 (21.5 MB)
    • 009 Running Job Using Job Cluster Attached to

Description


Data Engineering using Databricks features on AWS and Azure
MP4 | Video: h264, 1280x720 | Audio: AAC, 44.1 KHz, 2 Ch
Genre: eLearning | Language: English + srt | Duration: 114 lectures (8h 55m) | Size: 3.42 GB

Build Data Engineering Pipelines using Databricks core features such as Spark, Delta Lake, cloudFiles, etc
What you'll learn:
Data Engineering leveraging Databricks features
Databricks CLI to manage files, Data Engineering jobs and clusters for Data Engineering Pipelines
Deploying Data Engineering applications developed using PySpark on job clusters
Deploying Data Engineering applications developed using PySpark using Notebooks on job clusters
Perform CRUD Operations leveraging Delta Lake using Spark SQL for Data Engineering Applications or Pipelines
Perform CRUD Operations leveraging Delta Lake using Pyspark for Data Engineering Applications or Pipelines
Setting up development environment to develop Data Engineering applications using Databricks
Building Data Engineering Pipelines using Spark Structured Streaming on Databricks Clusters

Requirements
Programming experience using Python
Data Engineering experience using Spark
Ability to write and interpret SQL Queries
This course is ideal for experience data engineers to add Databricks as one of the key skill as part of the profile

Description
As part of this course, you will learn all the Data Engineering using cloud platform-agnostic technology called Databricks.

About Data Engineering

Data Engineering is nothing but processing the data depending upon our downstream needs. We need to build different pipelines such as Batch Pipelines, Streaming Pipelines, etc as part of Data Engineering. All roles related to Data Processing are consolidated under Data Engineering. Conventionally, they are known as ETL Development, Data Warehouse Development, etc.

About Databricks

Databricks is the most popular cloud platform-agnostic data engineering tech stack. They are the committers of the Apache Spark project. Databricks run time provide Spark leveraging the elasticity of the cloud. With Databricks, you pay for what you use. Over a period of time, they came up with an idea of Lakehouse by providing all the features that are required for traditional BI as well as AI & ML. Here are some of the core features of Databricks.

Spark - Distributed Computing

Delta Lake - Perform CRUD Operations. It is primarily used to build capabilities such as inserting, updating, and deleting the data from files in Data Lake.

cloudFiles - Get the files in an incremental fashion in the most efficient way leveraging cloud features.

Course Details

As part of this course, you will be learning Data Engineering using Databricks.

Getting Started with Databricks

Setup Local Development Environment to develop Data Engineering Applications using Databricks

Using Databricks CLI to manage files, jobs, clusters, etc related to Data Engineering Applications

Spark Application Development Cycle to build Data Engineering Applications

Databricks Jobs and Clusters

Deploy and Run Data Engineering Jobs on Databricks Job Clusters as Python Application

Deploy and Run Data Engineering Jobs on Job Cluster using Notebooks

Deep Dive into Delta Lake using Dataframes

Deep Dive into Delta Lake using Spark SQL

Building Data Engineering Pipelines using Spark Structured Streaming on Databricks Clusters

We will be adding few more modules related to Pyspark, Spark with Scala, Spark SQL, Streaming Pipelines in the coming weeks.

Desired Audience

Here is the desired audience for this advanced course.

Experienced application developers to gain expertise related to Data Engineering with prior knowledge and experience of Spark.

Experienced Data Engineers to gain enough skills to add Databricks to their profile.

Testers to improve their testing capabilities related to Data Engineering applications using Databricks.

Prerequisites

Logistics

Computer with decent configuration (At least 4 GB RAM, however 8 GB is highly desired)

Dual Core is required and Quad-Core is highly desired

Chrome Browser

High-Speed Internet

Valid AWS Account

Valid Databricks Account (free Databricks Account is not sufficient)

Experience as Data Engineer especially using Apache Spark

Knowledge about some of the cloud concepts such as storage, users, roles, etc.

Associated Costs

As part of the training, you will only get the material. You need to practice on your own or corporate cloud account and Databricks Account.

You need to take care of the associated AWS or Azure costs.

You need to take care of the associated Databricks costs.

Training Approach

Here are the details related to the training approach.

It is self-paced with reference material, code snippets, and videos provided as part of Udemy.

One needs to sign up for their own Databricks environment to practice all the core features of Databricks.

We would recommend completing 2 modules every week by spending 4 to 5 hours per week.

It is highly recommended to take care of all the tasks so that one can get real experience of Databricks.

Support will be provided through Udemy Q&A.

Who this course is for
Beginner or Intermediate Data Engineers who want to learn Databricks for Data Engineering
Intermediate Application Engineers who want to explore Data Engineering using Databricks
Data and Analytics Engineers who want to learn Data Engineering using Databricks
Testers who want to learn Databricks to test Data Engineering applications built using Databricks



Download torrent
3.7 GB
seeders:4
leechers:0
Data Engineering using Databricks features on AWS and Azure


Trackers

tracker name
udp://opentor.org:2710/announce
udp://tracker.torrent.eu.org:451/announce
udp://open.stealth.si:80/announce
udp://ipv4.tracker.harry.lu:80/announce
udp://tracker.uw0.xyz:6969/announce
udp://tracker.dler.org:6969/announce
udp://9.rarbg.com:2870/announce
udp://www.torrent.eu.org:451/announce
udp://tracker2.dler.com:80/announce
µTorrent compatible trackers list

Download torrent
3.7 GB
seeders:4
leechers:0
Data Engineering using Databricks features on AWS and Azure


Torrent hash: 8E059D4695F968516D846844926731187A4CAE5C