stepbystepdatascience
2.14K subscribers
9:52
Modern Financial Dashboard from Scratch in Tableau - Day 1
stepbystepdatascience
105 views • 5 months ago
9:02
#10- How catalyst optimizer chooses join strategy in Databricks?
stepbystepdatascience
69 views • 6 months ago
5:31
#9- What is Catalyst Optimizer in Databricks?| Physical & Logical Plan | Demo in Databricks
stepbystepdatascience
158 views • 6 months ago
5:11
#8 - Parquet vs Delta | Fix tiny file problem in Databricks
stepbystepdatascience
674 views • 6 months ago
4:43
#7 - What is tiny file problem in Databricks | Effects and Solution
stepbystepdatascience
297 views • 6 months ago
5:15
#6 - What is Lazy Evaluation and DAG with simple example in databricks?
stepbystepdatascience
105 views • 6 months ago
3:52
#5- Difference between Resilient Distributed Dataset(RDD) and Data frame with simple example
stepbystepdatascience
49 views • 6 months ago
1:39
#4 - What is Resilient Distributed Data Set (RDD) with simple example?
stepbystepdatascience
103 views • 6 months ago
4:29
#3- What is shuffling/Exchange and its inner mechanism in Databricks?
stepbystepdatascience
74 views • 6 months ago
14:58
#2 - How Apache Spark breaks up a single job into multiple stages| Practical Example in Databricks
stepbystepdatascience
199 views • 7 months ago
6:04
Data Alerts in Databricks| Nicely Conditionally Formatted table in Databricks
stepbystepdatascience
2.4K views • 7 months ago
11:56
#1 - What is Apache Spark and its key concept in a simple terms?
stepbystepdatascience
75 views • 7 months ago
7:32
Top 7 techniques of writing better queries in PostgreSQL
stepbystepdatascience
191 views • 9 months ago
1:46
De-identifying PII in Databricks
stepbystepdatascience
119 views • 10 months ago
5:41
Handling PII in Databricks
stepbystepdatascience
538 views • 10 months ago
3:24
Data Warehouse Vs Data Lake Vs Delta Lake Vs Lakehouse in simple terms
stepbystepdatascience
973 views • 10 months ago
2:43
What is Databricks in simple terms?
stepbystepdatascience
767 views • 10 months ago
15:30
Handling Duplication- Case VII-XI
stepbystepdatascience
24 views • 10 months ago
12:34
Handling Duplication Based on Address- Case VI
stepbystepdatascience
13 views • 10 months ago
18:25
Handling Duplication in SQL Server - Case I - V
stepbystepdatascience
31 views • 11 months ago
15:47
11 Ways to Handle Duplication in SQL Server- Introduction
stepbystepdatascience
81 views • 11 months ago
35:59
Into to EDA: Baby Step for Data Science
stepbystepdatascience
167 views • 2 years ago
27:09
Data Cleaning using Regex in Pandas Data Frame
stepbystepdatascience
6.6K views • 2 years ago
34:31
Manipulating text using Regular Expression in python
stepbystepdatascience
734 views • 2 years ago
16:19
Configure, Import, Create Pipeline to Auto-Ingest S3 data to Snowflake from Scratch-Day 2
stepbystepdatascience
274 views • 2 years ago
27:30
Data Warehouse| Why Snowflake| CSV file Import | S3 Access- Day 1
stepbystepdatascience
184 views • 2 years ago
15:13
Handling text in python
stepbystepdatascience
230 views • 2 years ago
21:10
Cohort Retention Rate Analysis in Python
stepbystepdatascience
2K views • 2 years ago
18:24
Filters, Annotations, Icons, Collapsible containers in Tableau
stepbystepdatascience
209 views • 3 years ago
36:21
Basic Numpy in Python (Difference between Array vs List vs Numpy)
stepbystepdatascience
240 views • 3 years ago
Load More