Azure Data Engineer

   1. SQL

      a) SQL Introduction

  • What is RDBMS
  • What is SQL
  • What is a Database
  • What is a Schema
  • What is a Table
  • What are Data Types
  • Types – DDL, DML, DCL
  • What are tables
  • Constraints on tables: Primary Key, Foreign Key, Unique Key, Not Null
  • Auto Increment
  • What is a relation
  • SQL Syntax: SELECT, INSERT, UPDATE and DELETE
  • Data types and Type casting
  • Sorting Data – Order by
  • Filtering data – Where condition
  • Distinct Data
  • Limiting Data
  • Top n rows
  • Aliases (column and Table aliases)

      b) SQL Operations

  • Operators
  • Aliases
  • Case statement
  • Datetime conversions
  • String functions
  • Aggregations: AVG, COUNT, MIN, MAX, SUM
  • Group by
  • Having
  • Where vs Having
  • Subqueries
  • Correlated subqueries

      c) SQL Scripting

  • Indexes: Clustered and Non-clustered
  • What is a View
  • Variable declaration
  • If statement and While loop
  • Print statement
  • Stored Procedures: creating and calling stored procedures
  • Functions: Scalar and Table-valued functions
  • Using functions in queries
  • Using Functions in Stored Procedures
  • Using Stored Procedures in Functions
  • Triggers
  • Transactions
  • Merge Statements
  • Unwrapping Large queries

      d) Advanced SQL

  • Operations between two tables
  • EXISTS and NOT EXISTS
  • Window Functions
  • Group by vs Partition by
  • CTE example
  • Pivot and Unpivot example
  • Merge
  • Temp Tables

   2. Python

  • Python Intro
  • Python Setup
  • Pip
  • Google Colab
  • Data Types and Variables
  • String Operations
  • Loops and Conditional Statements
  • Collections
  • Python Functions
  • Anonymous Functions
  • Logging
  • Multithreading
  • Multiprocessing
  • Flatmap
  • OOP with Python
  • Magic (Dunder) Methods
  • Positional/inline parameters
  • os and sys modules
  • Beautiful Soup & lxml
  • Datetime operations
  • File operations


   3. Spark Core

  • Spark Introduction
  • Spark Core
  • PySpark Architecture
  • Spark SQL
  • Different types of File Formats
  • Spark UI
  • DataFrame
  • DataFrame Operations
  • Optimizations in Spark

   4. Databricks

  • Delta Lake Introduction
  • Creation of Delta Lake
  • Merge, Update and Deletes using PySpark

   5. Scala

  • Variables (mutable and immutable)
  • Control Statements (loops and decisions)
  • Collections (lists, sets, maps, tuples, arrays)
  • Methods (Scala methods, anonymous functions, lambdas)
  • Scala OOP (classes and objects)
  • Strings
  • Traits
  • Maven build

   6. Cloud Computing

  • What is Cloud Computing
  • On-prem vs Cloud
  • IaaS
  • PaaS
  • SaaS

   7. Azure

  • What is Azure
  • Services in Azure
  • Azure Accounts
  • Azure Tenants and Subscriptions
  • Azure Regions and Resource Manager
  • Azure Active Directory
  • Azure Storage
  • Azure Compute Services
  • Azure Data Lake Gen2
  • Azure Security Services
  • Azure Data Factory
  • Azure Databricks
  • Azure SQL
  • Azure VM
  • Azure Synapse (Introduction)
  • Azure Cosmos DB (Introduction)
  • Azure Functions
  • Azure Logic Apps

   8. AWS

  • Security with IAM
  • Storage and Messaging (S3, SNS, SQS)
  • Compute (EC2, EMR, ECR, ECS, Lambda functions)
  • Databases (RDS,Redshift)


About the Author : ABrilliants