Data Engineers

Open in Telegram

Free Data Engineering Ebooks & Courses

Network:Free Courses with Certificate - Python Programming, Data Science, Java Coding, SQL, Web Development, AI, ML, ChatGPT Expert India40 072 Education19 346...

📈 Analytical overview of Telegram channel Data Engineers

Channel Data Engineers (@sql_engineer) in the English language segment is an active participant. Currently, the community unites 10 375 subscribers, ranking 19 346 in the Education category and 40 072 in the India region.

📊 Audience metrics and dynamics

Since its creation on невідомо, the project has demonstrated rapid growth, gathering an audience of 10 375 subscribers.

According to the latest data from 09 June, 2026, the channel demonstrates stable activity. Although there has been a change in the number of participants by 243 over the last 30 days and by 11 over the last 24 hours, overall reach remains high.

Verification status: Not verified
Engagement rate (ER): The average audience engagement rate is 10.19%. Within the first 24 hours after publication, content typically collects N/A% reactions from the total number of subscribers.
Post reach: On average, each post receives 1 057 views. Within the first day, a publication typically gains 0 views.
Reactions and interaction: The audience actively supports content: the average number of reactions per post is 7.
Thematic interests: Content is focused on key topics such as sql, learning, analytic, engineer, link:-.

📝 Description and content policy

The author describes the resource as a platform for expressing subjective opinions:
“Free Data Engineering Ebooks & Courses”

Thanks to the high frequency of updates (latest data received on 10 June, 2026), the channel maintains relevance and a high level of publication reach. Analytics show that the audience actively interacts with content, making it an important point of influence in the Education category.

10 375

Subscribers

+1124 hours

+587 days

+24330 days

1 057

Post views

No data24 hours

No data48 hours

10.19%

Engagement rate

No data

Posts per day

Ads index

beta

Posts Archive

10 375

Roadmap for becoming an Azure Data Engineer in 2024: - SQL - Basic python - Cloud Fundamental - ADF - Databricks/Spark/Pyspark - Azure Synapse - Azure Functions, Logic Apps, - Azure Storage, Key Vault - Dimensional Modelling - Azure Fabric - End-to-End Project - Resume Preparation - Interview Prep Here, you can find Data Engineering Resources 👇 https://topmate.io/analyst/910180 All the best 👍👍

10 375

Unlock your full potential as a Data Engineer with this detailed career path Step 1: Fundamentals Step 2: Data Structures & Algorithms Step 3: Databases (SQL / NoSQL) & Data Modeling Step 4: Data Ingestion & Data Storage Techniques Step 5: Data warehousing tools & Data analytics techniques Step 6: Major cloud providers and their services related to Data Engineering Step 7: Tools required for real-time data and batch data pipelines Step 8: Data Engineering Deployments & ops

10 375

Git commands for Data Engineers 𝟭. 𝗴𝗶𝘁 𝗱𝗶𝗳𝗳: Show file differences not yet staged. 𝟮. 𝗴𝗶𝘁 𝗰𝗼𝗺𝗺𝗶𝘁 -𝗮 -𝗺 "𝗰𝗼𝗺𝗺𝗶𝘁 𝗺𝗲𝘀𝘀𝗮𝗴𝗲": Commit all tracked changes with a message. 𝟯. 𝗴𝗶𝘁 𝘀𝘁𝗮𝘁𝘂𝘀: Show the state of your working directory. 𝟰. 𝗴𝗶𝘁 𝗮𝗱𝗱 𝗳𝗶𝗹𝗲_𝗽𝗮𝘁𝗵:Add file(s) to the staging area. 𝟱. 𝗴𝗶𝘁 𝗰𝗵𝗲𝗰𝗸𝗼𝘂𝘁 -𝗯 𝗯𝗿𝗮𝗻𝗰𝗵_𝗻𝗮𝗺𝗲: Create and switch to a new branch. 𝟲. 𝗴𝗶𝘁 𝗰𝗵𝗲𝗰𝗸𝗼𝘂𝘁 𝗯𝗿𝗮𝗻𝗰𝗵_𝗻𝗮𝗺𝗲: Switch to an existing branch. 𝟳. 𝗴𝗶𝘁 𝗰𝗼𝗺𝗺𝗶𝘁 --𝗮𝗺𝗲𝗻𝗱:Modify the last commit. 𝟴. 𝗴𝗶𝘁 𝗽𝘂𝘀𝗵 𝗼𝗿𝗶𝗴𝗶𝗻 𝗯𝗿𝗮𝗻𝗰𝗵_𝗻𝗮𝗺𝗲: Push a branch to a remote. 𝟵. 𝗴𝗶𝘁 𝗽𝘂𝗹𝗹: Fetch and merge remote changes. 𝟭𝟬. 𝗴𝗶𝘁 𝗿𝗲𝗯𝗮𝘀𝗲 -𝗶: Rebase interactively, rewrite commit history. 𝟭𝟭. 𝗴𝗶𝘁 𝗰𝗹𝗼𝗻𝗲: Create a local copy of a remote repo. 𝟭𝟮. 𝗴𝗶𝘁 𝗺𝗲𝗿𝗴𝗲: Merge branches together. 𝟭𝟯. 𝗴𝗶𝘁 𝗹𝗼𝗴 --𝘀𝘁𝗮𝘁: Show commit logs with stats. 𝟭𝟰. 𝗴𝗶𝘁 𝘀𝘁𝗮𝘀𝗵: Stash changes for later. 𝟭𝟱. 𝗴𝗶𝘁 𝘀𝘁𝗮𝘀𝗵 𝗽𝗼𝗽: Apply and remove stashed changes. 𝟭𝟲. 𝗴𝗶𝘁 𝘀𝗵𝗼𝘄 𝗰𝗼𝗺𝗺𝗶𝘁_𝗶𝗱: Show details about a commit. 𝟭𝟳. 𝗴𝗶𝘁 𝗿𝗲𝘀𝗲𝘁 𝗛𝗘𝗔𝗗~𝟭: Undo the last commit, preserving changes locally. 𝟭𝟴. 𝗴𝗶𝘁 𝗳𝗼𝗿𝗺𝗮𝘁-𝗽𝗮𝘁𝗰𝗵 -𝟭 𝗰𝗼𝗺𝗺𝗶𝘁_𝗶𝗱: Create a patch file for a specific commit. 𝟭𝟵. 𝗴𝗶𝘁 𝗮𝗽𝗽𝗹𝘆 𝗽𝗮𝘁𝗰𝗵_𝗳𝗶𝗹𝗲_𝗻𝗮𝗺𝗲: Apply changes from a patch file. 𝟮𝟬. 𝗴𝗶𝘁 𝗯𝗿𝗮𝗻𝗰𝗵 -𝗗 𝗯𝗿𝗮𝗻𝗰𝗵_𝗻𝗮𝗺𝗲: Delete a branch forcefully. 𝟮𝟭. 𝗴𝗶𝘁 𝗿𝗲𝘀𝗲𝘁: Undo commits by moving branch reference. 𝟮𝟮. 𝗴𝗶𝘁 𝗿𝗲𝘃𝗲𝗿𝘁: Undo commits by creating a new commit. 𝟮𝟯. 𝗴𝗶𝘁 𝗰𝗵𝗲𝗿𝗿𝘆-𝗽𝗶𝗰𝗸 𝗰𝗼𝗺𝗺𝗶𝘁_𝗶𝗱: Apply changes from a specific commit. 𝟮𝟰. 𝗴𝗶𝘁 𝗯𝗿𝗮𝗻𝗰𝗵: Lists branches. 𝟮𝟱. 𝗴𝗶𝘁 𝗿𝗲𝘀𝗲𝘁 --𝗵𝗮𝗿𝗱: Resets everything to a previous commit, erasing all uncommitted changes. Here, you can find Data Engineering Resources 👇 https://topmate.io/analyst/910180 All the best 👍👍

10 375

Free 𝗿𝗲𝘀𝗼𝘂𝗿𝗰𝗲𝘀 𝘁𝗼 𝗹𝗲𝗮𝗿𝗻 Apache 𝘀𝗽𝗮𝗿𝗸 𝗳𝗼𝗿 𝗳𝗿𝗲𝗲 𝟭. 𝗙𝗶𝗿𝘀𝘁 𝗶𝗻𝘀𝘁𝗮𝗹𝗹 𝘀𝗽𝗮𝗿𝗸 𝗳𝗿𝗼𝗺 𝗵𝗲𝗿𝗲 - https://lnkd.in/gx_Dc8ph https://lnkd.in/gg6-8xDz 𝟮. 𝗟𝗲𝗮𝗿𝗻 𝗕𝗮𝘀𝗶𝗰 𝘀𝗽𝗮𝗿𝗸 𝗳𝗿𝗼𝗺 𝗵𝗲𝗿𝗲 - https://lnkd.in/ddThYxAS 𝟯. 𝗟𝗲𝗮𝗿𝗻 𝗔𝗱𝘃𝗮𝗻𝗰𝗲 𝘀𝗽𝗮𝗿𝗸 𝗳𝗿𝗼𝗺 𝗵𝗲𝗿𝗲 - https://lnkd.in/dvZUiJZT 𝟰. 𝗔𝗽𝗮𝗰𝗵𝗲 𝗦𝗽𝗮𝗿𝗸 𝗺𝘂𝘀𝘁 𝗿𝗲𝗮𝗱 𝗯𝗼𝗼𝗸 - https://lnkd.in/d5-KiHHd 𝟱. 𝗦𝗽𝗮𝗿𝗸 𝗣𝗿𝗼𝗷𝗲𝗰𝘁 𝘆𝗼𝘂 𝗺𝘂𝘀𝘁 𝗱𝗼 - https://lnkd.in/gE8hsyZx https://lnkd.in/gwWytS-Q https://lnkd.in/gR7DR6_5 𝟲. 𝗙𝗶𝗻𝗮𝗹𝗹𝘆 𝘀𝗽𝗮𝗿𝗸 𝗶𝗻𝘁𝗲𝗿𝘃𝗶𝗲𝘄 𝗾𝘂𝗲𝘀𝘁𝗶𝗼𝗻𝘀 - https://lnkd.in/dFP5yiHT https://lnkd.in/dweZX3RA Here, you can find Data Engineering Resources 👇 https://topmate.io/analyst/910180 All the best 👍👍

10 375

🔺 Data engineering Free Courses 1️⃣ Data Engineering Course : Learn the basics of data engineering. 2️⃣ Data Engineer Learning Path course : a comprehensive road map to become a data engineer. 3️⃣ The Data Eng Zoomcamp course : a practical course to learn data engineering

10 375

Big Data

10 375

Interviewer: You have 2 minutes. Explain the difference between Caching and Persisting in Spark. ➤ 𝗖𝗮𝗰𝗵𝗶𝗻𝗴: Caching in Apache Spark involves storing RDDs in memory temporarily. When an RDD is cached, its partitions are kept in memory across multiple operations, allowing for faster access and reuse of intermediate results. ➤ 𝗣𝗲𝗿𝘀𝗶𝘀𝘁𝗶𝗻𝗴: Persisting in Apache Spark is similar to caching but offers more flexibility in terms of storage options. When you persist an RDD, you can specify different storage levels such as MEMORY_ONLY, MEMORY_AND_DISK, or DISK_ONLY, depending on your requirements ➤ 𝗞𝗲𝘆 𝗱𝗶𝗳𝗳𝗲𝗿𝗲𝗻𝗰𝗲𝘀 𝗯𝗲𝘁𝘄𝗲𝗲𝗻 𝗰𝗮𝗰𝗵𝗶𝗻𝗴 𝗮𝗻𝗱 𝗽𝗲𝗿𝘀𝗶𝘀𝘁𝗶𝗻𝗴: - While caching stores RDDs in memory by default, persisting allows you to choose different storage levels, including disk storage. Caching is suitable for scenarios where RDDs need to be reused in subsequent operations within the same Spark job. - whereas persisting is more versatile and can be used to store RDDs across multiple jobs or even persist them to disk for fault tolerance. ➤ 𝗘𝘅𝗮𝗺𝗽𝗹𝗲 𝗼𝗳 𝘄𝗵𝗲𝗻 𝘆𝗼𝘂 𝘄𝗼𝘂𝗹𝗱 𝘂𝘀𝗲 𝗰𝗮𝗰𝗵𝗶𝗻𝗴 𝘃𝗲𝗿𝘀𝘂𝘀 𝗽𝗲𝗿𝘀𝗶𝘀𝘁𝗶𝗻𝗴 - Let's say we have an iterative algorithm where the same RDD is accessed multiple times within a loop. In this case, caching the RDD would be beneficial as it would avoid recomputation of the RDD's partitions in each iteration, resulting in significant performance gains. - On the other hand, if we need to persist RDDs across multiple Spark jobs or need fault tolerance, persisting would be more appropriate. ➤ 𝗛𝗼𝘄 𝗱𝗼𝗲𝘀 𝗦𝗽𝗮𝗿𝗸 𝗵𝗮𝗻𝗱𝗹𝗲 𝗰𝗮𝗰𝗵𝗶𝗻𝗴 𝗮𝗻𝗱 𝗽𝗲𝗿𝘀𝗶𝘀𝘁𝗶𝗻𝗴 𝘂𝗻𝗱𝗲𝗿 𝘁𝗵𝗲 𝗵𝗼𝗼𝗱 Spark employs a lazy evaluation strategy, so RDDs are not actually cached or persisted until an action is triggered. When an action is called on a cached or persisted RDD, Spark checks if the data is already in memory or on disk. If not, it calculates the RDD's partitions and stores them accordingly based on the specified storage level. That’s the difference between Caching and Persisting in Spark.

10 375

Frequently asked SQL interview for Data Analyst/Data Engineer 1 What is SQL and what are its main features? 2 Order of writing SQL query? 3Order of execution of SQL query? 4 What are some of the most common SQL commands? 5 What’s a primary key & foreign key? 6 All types of joins and questions on their outputs? 7 Explain all window functions and difference between them? 8 What is stored procedure? 9 Difference between stored procedure & Functions in SQL? 10 What is trigger in SQL?

10 375

Roadmap to crack product-based companies for Big Data Engineer role: 1. Master Python, Scala/Java 2. Ace Apache Spark, Hadoop ecosystem 3. Learn data storage (SQL, NoSQL), warehousing 4. Expertise in data streaming (Kafka, Flink/Storm) 5. Master workflow management (Airflow) 6. Cloud skills (AWS, Azure or GCP) 7. Data modeling, ETL/ELT processes 8. Data viz tools (Tableau, Power BI) 9. Problem-solving, communication, attention to detail 10. Projects, certifications (AWS, Azure, GCP) 11. Practice coding, system design interviews Here, you can find Data Engineering Resources 👇 https://topmate.io/analyst/910180 All the best 👍👍

10 375

Most asked Python interview questions for Data Engineer jobs with answers! 𝟭. 𝗘𝘅𝗽𝗹𝗮𝗶𝗻 𝘁𝗵𝗲 𝗱𝗶𝗳𝗳𝗲𝗿𝗲𝗻𝗰𝗲 𝗯𝗲𝘁𝘄𝗲𝗲𝗻 𝗹𝗶𝘀𝘁𝘀 𝗮𝗻𝗱 𝘁𝘂𝗽𝗹𝗲𝘀 𝗶𝗻 𝗣𝘆𝘁𝗵𝗼𝗻. Lists are mutable, meaning their elements can be changed but Tuples are immutable. 𝟮. 𝗪𝗵𝗮𝘁 𝗶𝘀 𝗮 𝗗𝗮𝘁𝗮𝗙𝗿𝗮𝗺𝗲 𝗶𝗻 𝗽𝗮𝗻𝗱𝗮𝘀? A DataFrame is a 2-dimensional labelled data structure, similar to a spreadsheet. 𝟯. 𝗥𝗲𝘃𝗲𝗿𝘀𝗶𝗻𝗴 𝘁𝗵𝗲 𝘄𝗼𝗿𝗱𝘀 𝗶𝗻 𝗮 𝘀𝘁𝗿𝗶𝗻𝗴 𝗶𝗻 𝗣𝘆𝘁𝗵𝗼𝗻 def reverse_words(s: str) -> str: words = s.split() reversed_words = reversed(words) return ' '.join(reversed_words) 𝟰. 𝗪𝗿𝗶𝘁𝗲 𝗮 𝗣𝘆𝘁𝗵𝗼𝗻 𝗳𝘂𝗻𝗰𝘁𝗶𝗼𝗻 𝘁𝗼 𝗰𝗼𝘂𝗻𝘁 𝘁𝗵𝗲 𝗻𝘂𝗺𝗯𝗲𝗿 𝗼𝗳 𝘃𝗼𝘄𝗲𝗹𝘀 𝗶𝗻 𝗮 𝗴𝗶𝘃𝗲𝗻 𝘀𝘁𝗿𝗶𝗻𝗴? def count_vowels(string: str) -> int: vowels = "aeiouAEIOU" vowel_count = 0 for char in string: if char in vowels: vowel_count += 1 return vowel_count I’ve listed 4 but there are many questions you’d need to prepare to succeed in interviews. Here, you can find Data Engineering Interview Resources 👇 https://topmate.io/analyst/910180 All the best 👍👍

10 375

Repost from Machine Learning & Artificial Intelligence | Data Science Free Courses

10 375

Thinking about becoming a Data Engineer? Here's the roadmap to avoid pitfalls & master the essential skills for a successful career. 📊Introduction to Data Engineering ✅Overview of Data Engineering & its importance ✅Key responsibilities & skills of a Data Engineer ✅Difference between Data Engineer, Data Scientist & Data Analyst ✅Data Engineering tools & technologies 📊Programming for Data Engineering ✅Python ✅SQL ✅Java/Scala ✅Shell scripting 📊Database System & Data Modeling ✅Relational Databases: design, normalization & indexing ✅NoSQL Databases: key-value stores, document stores, column-family stores & graph database ✅Data Modeling: conceptual, logical & physical data model ✅Database Management Systems & their administration 📊Data Warehousing and ETL Processes ✅Data Warehousing concepts: OLAP vs. OLTP, star schema & snowflake schema ✅ETL: designing, developing & managing ETL processe ✅Tools & technologies: Apache Airflow, Talend, Informatica, AWS Glue ✅Data lakes & modern data warehousing solution 📊Big Data Technologies ✅Hadoop ecosystem: HDFS, MapReduce, YARN ✅Apache Spark: core concepts, RDDs, DataFrames & SparkSQL ✅Kafka and real-time data processing ✅Data storage solutions: HBase, Cassandra, Amazon S3 📊Cloud Platforms & Services ✅Introduction to cloud platforms: AWS, Google Cloud Platform, Microsoft Azure ✅Cloud data services: Amazon Redshift, Google BigQuery, Azure Data Lake ✅Data storage & management on the cloud ✅Serverless computing & its applications in data engineering 📊Data Pipeline Orchestration ✅Workflow orchestration: Apache Airflow, Luigi, Prefect ✅Building & scheduling data pipelines ✅Monitoring & troubleshooting data pipelines ✅Ensuring data quality & consistency 📊Data Integration & API Development ✅Data integration techniques & best practices ✅API development: RESTful APIs, GraphQL ✅Tools for API development: Flask, FastAPI, Django ✅Consuming APIs & data from external sources 📊Data Governance & Security ✅Data governance frameworks & policies ✅Data security best practices ✅Compliance with data protection regulations ✅Implementing data auditing & lineage 📊Performance Optimization & Troubleshooting ✅Query optimization techniques ✅Database tuning & indexing ✅Managing & scaling data infrastructure ✅Troubleshooting common data engineering issues 📊Project Management & Collaboration ✅Agile methodologies & best practices ✅Version control systems: Git & GitHub ✅Collaboration tools: Jira, Confluence, Slack ✅Documentation & reporting Resources for Data Engineering 1️⃣Python: https://t.me/pythonanalyst 2️⃣SQL: https://t.me/sqlanalyst 3️⃣Excel: https://t.me/excel_analyst 4️⃣Free DE Courses: https://t.me/free4unow_backup/569 Data Engineering Interview Preparation Resources: https://topmate.io/analyst/910180 All the best 👍👍

10 375

10 Data Engineering architectures asked in Interviews. 1. Hadoop 2. Hive 3. Hbase 4. Kafka 5. Spark 6. Airflow 7. Bigquery 8. Snowflake 9. Databricks 10. MongoDB Data Engineering Interview Preparation Resources: https://topmate.io/analyst/910180 All the best 👍👍

10 375

Here are top 40 commonly asked pyspark questions that you can prepare for interviews. 𝗥𝗗𝗗𝘀 - 1. What is an RDD in Apache Spark? Explain its characteristics. 2. How are RDDs fault-tolerant in Apache Spark? 3. What are the different ways to create RDDs in Spark? 4. Explain the difference between transformations and actions in RDDs. 5. How does Spark handle data partitioning in RDDs? 6. Can you explain the lineage graph in RDDs and its significance? 7. What is lazy evaluation in Apache Spark RDDs? 8. How can you persist RDDs in memory for faster access? 9. Explain the concept of narrow and wide transformations in RDDs. 10. What are the limitations of RDDs compared to DataFrames and Datasets? 𝗗𝗮𝘁𝗮𝗳𝗿𝗮𝗺𝗲 𝗮𝗻𝗱 𝗗𝗮𝘁𝗮𝘀𝗲𝘁𝘀 - 1. What are DataFrames and Datasets in Apache Spark? 2. What are the differences between DataFrame and RDD? 3. Explain the concept of a schema in a DataFrame. 4. How are DataFrames and Datasets fault-tolerant in Spark? 5. What are the advantages of using DataFrames over RDDs? 6. Explain the Catalyst optimizer in Apache Spark. 7. How can you create DataFrames in Apache Spark? 8. What is the significance of Encoders in Datasets? 9. How does Spark SQL optimize the execution plan for DataFrames? 10. Can you explain the benefits of using Datasets over DataFrames? 𝗦𝗽𝗮𝗿𝗸 𝗦𝗤𝗟 - 1. What is Spark SQL, and how does it relate to Apache Spark? 2. How does Spark SQL leverage DataFrame and Dataset APIs? 3. Explain the role of the Catalyst optimizer in Spark SQL. 4. How can you run SQL queries on DataFrames in Spark SQL? 5. What are the benefits of using Spark SQL over traditional SQL queries? 𝗢𝗽𝘁𝗶𝗺𝗶𝘇𝗮𝘁𝗶𝗼𝗻 - 1. What are some common performance bottlenecks in Apache Spark applications? 2. How can you optimize the shuffle operations in Spark? 3. Explain the significance of data skew and techniques to handle it in Spark. 4. What are some techniques to optimize Spark job execution time? 5. How can you tune memory configurations for better performance in Spark? 6. What is dynamic allocation, and how does it optimize resource usage in Spark? 7. How can you optimize joins in Spark? 8. What are the benefits of partitioning data in Spark? 9. How does Spark leverage data locality for optimization? Data Engineering Interview Preparation Resources: https://topmate.io/analyst/910180 All the best 👍👍

10 375

ETL Using Pyspark.pdf2.23 MB

10 375

PySpark Cheatsheet.pdf0.48 KB

10 375

5 most asked SQL Interview Questions for Data Engineer jobs. 𝟭. 𝗙𝗶𝗻𝗱 𝘁𝗵𝗲 𝗦𝗲𝗰𝗼𝗻𝗱 𝗛𝗶𝗴𝗵𝗲𝘀𝘁 𝗦𝗮𝗹𝗮𝗿𝘆 𝗶𝗻 𝗮 𝗧𝗮𝗯𝗹𝗲 SELECT MAX(salary) AS SecondHighestSalary FROM Employee WHERE salary < (SELECT MAX(salary) FROM Employee); 𝟮 . 𝗙𝗶𝗻𝗱 𝗼𝘂𝘁 𝗲𝗺𝗽𝗹𝗼𝘆𝗲𝗲𝘀 𝗲𝗮𝗿𝗻𝗶𝗻𝗴 𝗺𝗼𝗿𝗲 𝘁𝗵𝗮𝗻 𝘁𝗵𝗲𝗶𝗿 𝗺𝗮𝗻𝗮𝗴𝗲𝗿𝘀 SELECT e2.name as Employee FROM employee e1 INNER JOIN employee e2 ON e1.id = e2.managerID WHERE e1.salary < e2.salary 𝟯. 𝗙𝗶𝗻𝗱 𝗰𝘂𝘀𝘁𝗼𝗺𝗲𝗿𝘀 𝘄𝗵𝗼 𝗻𝗲𝘃𝗲𝗿 𝗼𝗿𝗱𝗲𝗿 SELECT name as Customers FROM Customers WHERE id not in ( SELECT customerId FROM Orders); 𝟰. 𝗗𝗲𝗹𝗲𝘁𝗲 𝗱𝘂𝗽𝗹𝗶𝗰𝗮𝘁𝗲 𝗲𝗺𝗮𝗶𝗹𝘀 DELETE p1 FROM Person p1, Person p2 WHERE p1.Email = p2.Email AND p1.Id > p2.Id 𝟱. 𝗖𝗼𝘂𝗻𝘁 𝘁𝗵𝗲 𝗻𝘂𝗺𝗯𝗲𝗿 𝗼𝗳 𝗼𝗿𝗱𝗲𝗿𝘀 𝗽𝗹𝗮𝗰𝗲𝗱 𝗶𝗻 𝘁𝗵𝗲 𝗽𝗿𝗲𝘃𝗶𝗼𝘂𝘀 𝘆𝗲𝗮𝗿 𝗮𝗻𝗱 𝗺𝗼𝗻𝘁𝗵. SELECT COUNT(*) AS order_count FROM orders WHERE EXTRACT(YEAR_MONTH FROM order_date) = EXTRACT(YEAR_MONTH FROM CURDATE() - INTERVAL 1 MONTH); 💡 Note: SQL interview questions vary widely based on the specific role and company. So you also need to practice questions your target companies ask. Data Engineering Interview Preparation Resources: https://topmate.io/analyst/910180 All the best 👍👍