Home, About, Blog, Certifications, Contact

Data

  1. Types of Data
  2. Delta Lake
  3. Delta Lake Protocol
  4. Delta table: Insert
  5. Delta table: Update
  6. Delta table: Delete

Data Modeling

  1. Conceptual Data Model
  2. Logical Data Model
  3. Physical Data Model
  4. Dimension Table
  5. Fact Table
  6. Star Schema
  7. Snowflake Schema
  8. Inmon Data modeling approach
  9. Kimball Data modeling approach

Spark

  1. Spark: Overview
  2. Spark: DataFrames
  3. Spark: Dataset
  4. Spark: Resilient Distributed Datasets (RDDs)
  5. Spark: Datasets vs DataFrames
  6. Spark: Transformations vs Actions
  7. Spark: Collect vs Take
  8. Spark: Data Types
  9. Spark: Shuffle Join
  10. Create DataFrames in Spark
  11. How Spark Executes Your Program?

Spark Optimization

  1. Predicate Pushdown

SQL

  1. SQL: Overview
  2. SQL Standards
  3. SQL Query Order of Execution
  4. Qualify Clause in SQL

SQL-101

  1. Essential Concepts in SQL
  2. Read data in SQL
  3. Aggregations in SQL
  4. Understanding Joins in SQL
  5. Exploring Subqueries in SQL
  6. Working with Data in SQL
  7. Creating and Modify Tables in SQL
  8. Relationships Between Tables in SQL

Python

  1. Python: Guide
  2. Python: List Examples
  3. Python: Tuple Examples
  4. Python: Dictionary Examples
  5. Python: Function Examples
  6. Python: Simple Programs - Part 01

AI

  1. How to install DeepSeek in Windows and run locally?

Data Factory

  1. Azure Data Factory: Parameters vs. Variables
  2. SSIS to ADF Migration Questions

Certifications

  1. Badges
  2. Microsoft Certifications
  3. Tableau Certifications
  4. Google Certifications
  5. GitHub Certifications
  6. Oracle Certifications

Notes

DeepLearning.ai Data Engineering

Introduction to Data Engineering

  1. Introduction to Data Engineering: Overview
  2. Data Engineering Lifecycle
  3. Data Engineering Undercurrents
  4. Data Architecture
  5. Choose the Right Technology
  6. Requirements Gathering

Source Systems, Ingestion, and Data Pipelines

  1. Source Systems
  2. Data Ingestion

Glossary

  1. ACID Properties
  2. Availability
  3. CAP Theorem
  4. Data Lake
  5. Data Lakehouse
  6. Data Transformation
  7. Data Warehouse
  8. Distributed System
  9. ELT
  10. ETL
  11. Fault Tolerance
  12. Lazy Evaluation
  13. OLAP
  14. OLTP
  15. Reliability
  16. Scalability