← Back to my website

Quick Links


Updates

2023-10-25

You can now follow content release progress from the Series topics table.

2023-10-24

I've decided to give Arabic video content another go. This time I'll trim the process from any long video editing. It will be a Powerpoint presentation recorded in Arabic. First video is coming out soon!

2023-10-15

First articles are out 🎉🎉🎉🎊
I won't announce every new article here, but I thought this is something worth celebrating 😀

2023-10-09

As I found it is currently unfeasible for me to produce a video for each topic given the limited time I have to work on the series, I'll put videos production on hold for now and will substitute Arabic videos with Arabic articles.

Hello! 👋

Are you a data beginner? Do you want to break into the data world? Do you feel overwhelmed and don't know from where to start? 😕

Yup! You are not alone! I've been there too! ✋

When I started to seriously learn about data around in early 2018, I felt like I was swimming in the middle of a storm in the ocean without a life jacket! Okay, I'm exaggerating a bit but that more or less I've felt specially that I don't come from an IT background as I'm a telecommunications professional.

I've struggled my way through and looked into every resource I can get my hand on to raise my data awareness. I managed to be a decent data engineer (yay!) and I successfully passed a number of cloud certs regarding Data Engineering and Data Analytics (You can know all about me from my LindedIn Profile).

"Data Analytics Made Simpler" is the guide I wish I had when I was learning about data. You will have all the basic knowledge about Data Engineering and AWS to help you start your Data career.

My Motivation

Like I've said, when I started learning about data, I felt overwhelmed! I wished I had a comprehensive guide or a roadmap to help me. And now that I know a thing or two about data, I want to share that with all the data beginners out there so they have a better start than me!

Also, I'll be creating videos in Arabic as I don't find many Arabic content covering Data Analytics. In my opinion, this will break the language barrier that some may have and help spread Data Analytics between Arabic speaker in the Arab world and Egypt.

Aaaand one last thing! I've told you that I love learning, I've found out that the best way for me to get better at Data Analytics (or at anything) is to learn it to other people! So, you are actually helping me here 😉

The series structure

Mainly, I've split what I want to share into 6 categories that spans over the Data Engineering landscape:

In each category, I'll talk about 4 topics: I'll produce weekly content in the form of a video in Arabic on my Yet Another Data Guy YouTube Channel
I'll write weekly articles both in English and in Arabic at dev.to in a series with the name "Data analytics made Simpler - Ar" for articles in Arabic and "data analytics made Simpler - En" for articles in English. So, make sure that you follow me there as well. Each category may have one or more mini-projects. The source code of those mini-projects will be in this Github repo.

Series Topics

I've drafted my roadmap for the series in the following table but I may add or remove topics as I go 🤷‍♂️

Basics Collection Storage Processing Analysis Visualization
Theory
  • Batch Processing
  • Stream Processing
  • Lambda Architecture
  • Data Lakes
  • NoSQL Databases
  • Distributed Systems
  • Delta Lakes
  • Data Warehousing
  • Types of visuals
  • Dashboards
OSS and Tools
  • SQL
  • Python
  • Jupyter Notebooks
  • Pandas
  • Flat file formats (CSV, XML, JSON, YAML, TOML)
  • Analytical file formats (Parquet, ORC, AVRO)
  • Linux
  • Git
  • Docker
  • Kafka
  • Bash
  • AWS CLI
  • Apache Iceberg
  • Apache Spark
  • Apache Airflow
  • Apache Presto
  • Apache Hive
  • OpenSearch
  • Ploty
  • D3.js
  • OpenSearch Dashboards
AWS
  • Account Setup
  • IAM
  • VPC
  • EC2
  • Lambda
  • Kinesis Data Streams
  • Kinesis Data Firehose
  • Database Migration Service
  • MSK
  • S3
  • DynamoDB
  • RDS
  • Glue
  • Lake Formation
  • EMR
  • LStep Functions
  • Athena
  • Redshift
  • Kinesis Analytics
  • QuckSight
Mini-projects
  • Analyzing a Postgres DB data with Pandas.
  • Analyzing flat files with Pandas.
  • Analyzing a remote Postgres DB data on an EC2 with Pandas.
  • Running Kafka locally
  • Using Kinesis Data Firehose to stream data into AWS.
  • Using DMS to store data on S3
  • Automating files backup with bash and the AWS CLI
  • Storing and retrieving data from DynamoDB
  • S3 lifecycle management
  • Creating a local data pipeline with Apache Airflow
  • Creating a Glue data pipeline to move and process data from a DB to S3
  • Create a data pipeline with Step Functions
  • Setup and Secure your Data Lake with Lake Formation
  • Run a local distribution of OpenSearch.
  • Run a local Hive Data Warehouse
  • Analyze your data lake with Athena
  • Visualize data with Flask and Plotly
  • Visualize data with React.js and D3.jsy
  • Using OpenSearch Dashboards

Have fun and Good luck 😉