Data Analytics
Perfect channel to learn Data Analytics Learn SQL, Python, Alteryx, Tableau, Power BI and many more For Promotions: @coderfun @love_data
Больше📈 Аналитический обзор Telegram-канала Data Analytics
Канал Data Analytics (@sqlspecialist) языкового сегмента Английский является активным участником. Сейчас сообщество объединяет 109 740 подписчиков, занимая 1 113 место в категории Технологии и приложения и 2 324 место в регионе Индия.
📊 Показатели аудитории и динамика
С момента создания невідомо проект демонстрирует стремительный рост, собрав аудиторию из 109 740 подписчиков.
Согласно последним данным от 27 июня, 2026, канал показывает стабильную активность. За последние 30 дней изменение числа участников составило 610, а за последние 24 часа — 45, при этом общий охват остаётся высоким.
- Статус верификации: Не верифицирован
- Уровень вовлечённости (ER): Средний показатель вовлечённости аудитории составляет 2.51%. В первые 24 часа после публикации контент обычно набирает 1.12% реакций от общего числа подписчиков.
- Охват публикаций: В среднем каждый пост получает 2 753 просмотров. В течение первых суток публикация набирает 1 230 просмотров.
- Реакции и взаимодействия: Аудитория активно поддерживает контент: среднее количество реакций на один пост — 7.
- Тематические интересы: Контент сосредоточен на ключевых темах, таких как row, sql, analytic, analyst, visualization.
📝 Описание и контентная политика
Автор описывает ресурс как площадку для выражения субъективного мнения:
“Perfect channel to learn Data Analytics
Learn SQL, Python, Alteryx, Tableau, Power BI and many more
For Promotions: @coderfun @love_data”
Благодаря высокой частоте обновлений (последние данные получены 28 июня, 2026) канал поддерживает актуальность и высокий уровень охвата публикаций. Аналитика показывает, что аудитория активно взаимодействует с контентом, что делает его важной точкой влияния в категории Технологии и приложения.
SELECT AVG(salary) AS average_salary FROM employees;
Find Median (Using Window Functions) SELECT salary FROM ( SELECT salary, ROW_NUMBER() OVER (ORDER BY salary) AS row_num, COUNT(*) OVER () AS total_rows FROM employees ) subquery WHERE row_num = (total_rows / 2);
Find Mode (Most Frequent Value)
SELECT department, COUNT(*) AS count FROM employees GROUP BY department ORDER BY count DESC LIMIT 1;
Calculate Variance & Standard Deviation
SELECT VARIANCE(salary) AS salary_variance, STDDEV(salary) AS salary_std_dev FROM employees;
In Python (Pandas):
Mean, Median, Mode
df['salary'].mean() df['salary'].median() df['salary'].mode()[0]
Variance & Standard Deviation
df['salary'].var() df['salary'].std()
2️⃣ Data Visualization
Visualizing data helps identify trends, outliers, and patterns.
In SQL (For Basic Visualization in Some Databases Like PostgreSQL):
Create Histogram (Approximate in SQL)
SELECT salary, COUNT(*) FROM employees GROUP BY salary ORDER BY salary;
In Python (Matplotlib & Seaborn):
Bar Chart (Category-Wise Sales)
import matplotlib.pyplot as plt
import seaborn as sns
df.groupby('category')['sales'].sum().plot(kind='bar')
plt.title('Total Sales by Category')
plt.xlabel('Category')
plt.ylabel('Sales')
plt.show()
Histogram (Salary Distribution)
sns.histplot(df['salary'], bins=10, kde=True)
plt.title('Salary Distribution')
plt.show()
Box Plot (Outliers in Sales Data)
sns.boxplot(y=df['sales'])
plt.title('Sales Data Outliers')
plt.show()
Heatmap (Correlation Between Variables)
sns.heatmap(df.corr(), annot=True, cmap='coolwarm') plt.title('Feature Correlation Heatmap') plt.show()
3️⃣ Detecting Anomalies & Outliers
Outliers can skew results and should be identified.
In SQL:
Find records with unusually high salaries
SELECT * FROM employees WHERE salary > (SELECT AVG(salary) + 2 * STDDEV(salary) FROM employees);
In Python (Pandas & NumPy):
Using Z-Score (Values Beyond 3 Standard Deviations)
from scipy import stats df['z_score'] = stats.zscore(df['salary']) df_outliers = df[df['z_score'].abs() > 3]
Using IQR (Interquartile Range)
Q1 = df['salary'].quantile(0.25)
Q3 = df['salary'].quantile(0.75)
IQR = Q3 - Q1
df_outliers = df[(df['salary'] < (Q1 - 1.5 * IQR)) | (df['salary'] > (Q3 + 1.5 * IQR))]
4️⃣ Key EDA Steps
Understand the Data → Check missing values, duplicates, and column types
Summarize Statistics → Mean, Median, Standard Deviation, etc.
Visualize Trends → Histograms, Box Plots, Heatmaps
Detect Outliers & Anomalies → Z-Score, IQR
Feature Engineering → Transform variables if needed
Mini Task for You: Write an SQL query to find employees whose salaries are above two standard deviations from the mean salary.
Here you can find the roadmap for data analyst: https://t.me/sqlspecialist/1159
Like this post if you want me to continue covering all the topics! ❤️
Share with credits: https://t.me/sqlspecialist
Hope it helps :)
#sqlSELECT id, name, COALESCE(salary, 0) AS salary FROM employees;
IFNULL(): Works similarly to COALESCE (MySQL) SELECT id, name, IFNULL(salary, 0) AS salary FROM employees;
In Python (Pandas):
dropna(): Removes rows with missing values
df.dropna(inplace=True)
fillna(): Fills missing values with a specified value
df['salary'].fillna(0, inplace=True)
interpolate(): Fills missing values using interpolation
df.interpolate(method='linear', inplace=True)
2️⃣ Removing Duplicates
In SQL:
Remove duplicate rows using DISTINCT
SELECT DISTINCT name, department FROM employees;
Delete duplicates while keeping only one row
DELETE FROM employees WHERE id NOT IN (SELECT MIN(id) FROM employees GROUP BY name, department);
In Python (Pandas):
Remove duplicate rows
df.drop_duplicates(inplace=True)
Keep only the first occurrence
df.drop_duplicates(subset=['name', 'department'], keep='first', inplace=True)
3️⃣ Standardizing Formats (Data Normalization)
Standardizing Text Case:
SQL: Convert text to uppercase or lowercase
SELECT UPPER(name) AS name_upper FROM employees;
Python: Convert text to lowercase
df['name'] = df['name'].str.lower()
Date Formatting:
SQL: Convert string to date format SELECT
CONVERT(DATE, '2024-02-26', 120);
Python: Convert string to datetime
df['date'] = pd.to_datetime(df['date'], format='%Y-%m-%d')
4️⃣ ETL Process (Extract, Transform, Load)
Extract:
SQL: Retrieve data from databases
SELECT * FROM sales_data;
Python: Load data from CSV
df = pd.read_csv('data.csv')
Transform:
SQL: Modify data (cleaning, aggregations)
SELECT category, SUM(sales) AS total_sales FROM sales_data GROUP BY category;
Python: Apply transformations
df['total_sales'] = df.groupby('category')['sales'].transform('sum')
Load:
SQL: Insert cleaned data into a new table
INSERT INTO clean_sales_data (category, total_sales)
SELECT category, SUM(sales) FROM sales_data GROUP BY category;
Python: Save cleaned data to a new CSV file
df.to_csv('cleaned_data.csv', index=False)
Mini Task for You: Write an SQL query to remove duplicate customer records, keeping only the first occurrence.
Here you can find the roadmap for data analyst: https://t.me/sqlspecialist/1159
Like this post if you want me to continue covering all the topics! ❤️
Share with credits: https://t.me/sqlspecialist
Hope it helps :)
#sqlCREATE INDEX idx_email ON users(email);
✔ Speeds up searches for users by email
3️⃣ Creating a Unique Index
🔹 Ensure that no two users have the same email
CREATE UNIQUE INDEX idx_unique_email ON users(email);
✔ Prevents duplicate emails from being inserted
4️⃣ Composite Index for Multiple Columns
🔹 Optimize queries that filter by first name and last name
CREATE INDEX idx_name ON users(first_name, last_name);
✔ Faster lookups when filtering by both first name and last name
5️⃣ Clustered vs. Non-Clustered Index
Clustered Index → Physically rearranges table data (only one per table)
Non-Clustered Index → Stores a separate lookup table for faster access
🔹 Create a clustered index on the "id" column
CREATE CLUSTERED INDEX idx_id ON users(id);
🔹 Create a non-clustered index on the "email" column
CREATE NONCLUSTERED INDEX idx_email ON users(email);
✔ Clustered indexes speed up searches when retrieving all columns
✔ Non-clustered indexes speed up searches for specific columns
6️⃣ Checking Indexes on a Table
🔹 Find all indexes on the "users" table
SELECT * FROM sys.indexes WHERE object_id = OBJECT_ID('users');
7️⃣ When to Use Indexes?
✅ Columns frequently used in WHERE, JOIN, ORDER BY
✅ Large tables that need faster searches
✅ Unique columns that should not allow duplicates
❌ Avoid indexing on columns with highly repetitive values (e.g., boolean columns)
❌ Avoid too many indexes, as they slow down INSERT, UPDATE, DELETE operations
Mini Task for You: Write an SQL query to create a unique index on the "phone_number" column in the "customers" table.
You can find free SQL Resources here
👇👇
https://t.me/mysqldata
Like this post if you want me to continue covering all the topics! ❤️
Share with credits: https://t.me/sqlspecialist
Hope it helps :)
#sqlSELECT column1, column2, window_function() OVER (PARTITION BY column ORDER BY column) AS alias FROM table_name;
✔ PARTITION BY groups rows before applying the function
✔ ORDER BY determines the ranking or sequence
3️⃣ Using ROW_NUMBER()
🔹 Assign a unique row number to each employee based on salary (highest first)
SELECT name, department, salary, ROW_NUMBER() OVER (PARTITION BY department ORDER BY salary DESC) AS row_num FROM employees;
✔ Each employee gets a unique row number within their department.
4️⃣ Using RANK() and DENSE_RANK()
🔹 Rank employees by salary within each department
SELECT name, department, salary, RANK() OVER (PARTITION BY department ORDER BY salary DESC) AS rank, DENSE_RANK() OVER (PARTITION BY department ORDER BY salary DESC) AS dense_rank FROM employees;
✔ RANK() skips numbers when there’s a tie
✔ DENSE_RANK() does not skip numbers
5️⃣ Using NTILE() for Distribution
🔹 Divide employees into 4 salary groups per department
SELECT name, department, salary, NTILE(4) OVER (PARTITION BY department ORDER BY salary DESC) AS salary_quartile FROM employees;
✔ Useful for dividing salaries into percentiles (e.g., top 25%, bottom 25%)
6️⃣ Running Total with SUM() OVER()
🔹 Calculate cumulative salary per department
SELECT name, department, salary, SUM(salary) OVER (PARTITION BY department ORDER BY salary DESC) AS running_total FROM employees;
✔ Useful for tracking cumulative totals
7️⃣ Using LAG() and LEAD()
🔹 Compare an employee’s salary with the previous and next employee’s salary
SELECT name, department, salary, LAG(salary) OVER (PARTITION BY department ORDER BY salary DESC) AS previous_salary, LEAD(salary) OVER (PARTITION BY department ORDER BY salary DESC) AS next_salary FROM employees;
✔ LAG() gets the previous row’s value
✔ LEAD() gets the next row’s value
Mini Task for You: Write an SQL query to assign a unique rank to employees based on their salary within each department using RANK().SELECT column1, column2, window_function() OVER (PARTITION BY column ORDER BY column) AS alias FROM table_name;
✔ PARTITION BY groups rows before applying the function
✔ ORDER BY determines the ranking or sequence
3️⃣ Using ROW_NUMBER()
🔹 Assign a unique row number to each employee based on salary (highest first)
SQL
SELECT name, department, salary, ROW_NUMBER() OVER (PARTITION BY department ORDER BY salary DESC) AS row_num FROM employees;
✔ Each employee gets a unique row number within their department. 4️⃣ Using RANK() and DENSE_RANK() 🔹 Rank employees by salary within each departmentSQL
SELECT name, department, salary, RANK() OVER (PARTITION BY department ORDER BY salary DESC) AS rank, DENSE_RANK() OVER (PARTITION BY department ORDER BY salary DESC) AS dense_rank FROM employees;
✔ RANK() skips numbers when there’s a tie ✔ DENSE_RANK() does not skip numbers 5️⃣ Using NTILE() for Distribution 🔹 Divide employees into 4 salary groups per departmentSQL
SELECT name, department, salary, NTILE(4) OVER (PARTITION BY department ORDER BY salary DESC) AS salary_quartile FROM employees;
✔ Useful for dividing salaries into percentiles (e.g., top 25%, bottom 25%) 6️⃣ Running Total with SUM() OVER() 🔹 Calculate cumulative salary per departmentSQL
SELECT name, department, salary, SUM(salary) OVER (PARTITION BY department ORDER BY salary DESC) AS running_total FROM employees;
✔ Useful for tracking cumulative totals 7️⃣ Using LAG() and LEAD() 🔹 Compare an employee’s salary with the previous and next employee’s salarySQL
SELECT name, department, salary, LAG(salary) OVER (PARTITION BY department ORDER BY salary DESC) AS previous_salary, LEAD(salary) OVER (PARTITION BY department ORDER BY salary DESC) AS next_salary FROM employees; `
✔ LAG() gets the previous row’s value
✔ LEAD() gets the next row’s value
Mini Task for You: Write an SQL query to assign a unique rank to employees based on their salary within each department using RANK().WITH cte_name AS ( SELECT column1, column2 FROM table_name WHERE condition ) SELECT * FROM cte_name;
✔ The CTE cte_name is defined and then used in the main SELECT query.
2️⃣ Simple CTE Example
🔹 Find employees earning more than $70,000
WITH high_earners AS ( SELECT name, salary, department_id FROM employees WHERE salary > 70000 ) SELECT * FROM high_earners;
✔ The CTE high_earners filters employees with high salaries before selecting all columns from it.
3️⃣ CTE with Aggregation
🔹 Find departments where the average salary is above $80,000
WITH department_salary AS ( SELECT department_id, AVG(salary) AS avg_salary FROM employees GROUP BY department_id ) SELECT department_id, avg_salary FROM department_salary WHERE avg_salary > 80000;
✔ The CTE department_salary calculates the average salary per department and filters out low-paying ones.
4️⃣ CTE for Recursive Queries (Hierarchy Example)
🔹 Find an employee hierarchy (who reports to whom)
WITH RECURSIVE employee_hierarchy AS ( SELECT employee_id, name, manager_id FROM employees WHERE manager_id IS NULL -- Start with top-level manager UNION ALL SELECT e.employee_id, e.name, e.manager_id FROM employees e INNER JOIN employee_hierarchy eh ON e.manager_id = eh.employee_id ) SELECT * FROM employee_hierarchy;
✔ This recursive CTE finds an employee hierarchy starting from the top-level manager.
5️⃣ Why Use CTEs Instead of Subqueries?
✅ Better Readability – Makes complex queries easier to understand
✅ Reusability – Can be referenced multiple times in the main query
✅ Performance – Some databases optimize CTEs better than nested subqueries
Mini Task for You: Write an SQL query using a CTE to find departments with more than 5 employees.
You can find free SQL Resources here
👇👇
https://t.me/mysqldata
Like this post if you want me to continue covering all the topics! ❤️
Share with credits: https://t.me/sqlspecialist
Hope it helps :)
#sql
Уже доступно! Исследование Telegram 2025 — ключевые инсайты года 
