Data science/ML/AI
Data science and machine learning hub Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources. For beginners, data scientists and ML engineers 👉 https://rebrand.ly/bigdatachannels DMCA: @disclosure_bds Contact: @mldatascientist
Больше📈 Аналитический обзор Telegram-канала Data science/ML/AI
Канал Data science/ML/AI (@datascience_bds) языкового сегмента Английский является активным участником. Сейчас сообщество объединяет 13 674 подписчиков, занимая 9 377 место в категории Технологии и приложения и 31 635 место в регионе Индия.
📊 Показатели аудитории и динамика
С момента создания невідомо проект демонстрирует стремительный рост, собрав аудиторию из 13 674 подписчиков.
Согласно последним данным от 09 июня, 2026, канал показывает стабильную активность. За последние 30 дней изменение числа участников составило 155, а за последние 24 часа — 5, при этом общий охват остаётся высоким.
- Статус верификации: Не верифицирован
- Уровень вовлечённости (ER): Средний показатель вовлечённости аудитории составляет 8.03%. В первые 24 часа после публикации контент обычно набирает 2.25% реакций от общего числа подписчиков.
- Охват публикаций: В среднем каждый пост получает 1 098 просмотров. В течение первых суток публикация набирает 308 просмотров.
- Реакции и взаимодействия: Аудитория активно поддерживает контент: среднее количество реакций на один пост — 5.
- Тематические интересы: Контент сосредоточен на ключевых темах, таких как panda, learning, row, api, ethic.
📝 Описание и контентная политика
Автор описывает ресурс как площадку для выражения субъективного мнения:
“Data science and machine learning hub
Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources.
For beginners, data scientists and ML engineers
👉 https://rebrand.ly/bigdatachannels
DMCA: @disclosure_bds
Contact: @mldatasci...”
Благодаря высокой частоте обновлений (последние данные получены 10 июня, 2026) канал поддерживает актуальность и высокий уровень охвата публикаций. Аналитика показывает, что аудитория активно взаимодействует с контентом, что делает его важной точкой влияния в категории Технологии и приложения.
import pandas as pd
# Data: [Successes, Total Attempts]
data = {
'Hospital': ['A', 'A', 'B', 'B'],
'Case_Type': ['Easy', 'Hard', 'Easy', 'Hard'],
'Survived': [95, 10, 90, 70],
'Total': [100, 100, 100, 1000]
}
df = pd.DataFrame(data)
# 1. Check rates per group
df['Rate'] = df['Survived'] / df['Total']
print("--- Rates by Group ---")
print(df[['Hospital', 'Case_Type', 'Rate']])
# 2. Check overall rates
overall = df.groupby('Hospital').sum()
overall['Overall_Rate'] = overall['Survived'] / overall['Total']
print("\n--- Overall Rates (The Paradox!) ---")
print(overall['Overall_Rate'])
The Result:
• A is better at Easy (95% vs 90%).
• A is better at Hard (10% vs 7%).
• BUT... Overall, B wins (14% vs 52%) because B mostly did "Easy" cases.
🛠 How to avoid being fooled?
1. Don't trust the aggregate: When analyzing data, always try to "segment" or "drill down" into sub-groups.
2. Look for the Weight: Ask yourself: "Is one group disproportionately represented in the total?"
3. Identify the Lurking Variable: What context is missing? (e.g., Age, Severity, Time of Day).
🎯 The Takeaway
In Data Science, the "Big Picture" can sometimes be a big lie. If your analysis produces a result that defies logic, you might be looking at a Simpson’s Paradox. Always slice your data before you trust it.Goal: one place for everything a developer needs (free courses, tech news, job offers, manually written blogs. best github repos etc)A lot of you contributed by writing code or adding courses and knowledge along the way. This is as much yours as it is mine 🙌 And I’m already working on: • Personalized roadmaps • Live chat • Better job search & placement Try it and please tell me: What would you add next? Reminder that if you want early access to new features, Join our beta testers group. Looking for people who will explore, break things, and share honest feedback.
# Example Use Case: Monthly Website Traffic
# Chart: Line Chart
2. To Compare Categories 📊
Best For: Showing differences in size or value across distinct groups.
Chart Types:
- Bar Chart (Vertical/Column): Most common. Great for comparing quantities across groups. Easy to read exact values.
- Bar Chart (Horizontal): Better when you have many categories or long category names.
- Grouped Bar Chart: Compares sub-categories within main categories.
- Stacked Bar Chart: Shows total for a category AND how it's made up of sub-categories.
# Example Use Case: Sales per Region
# Chart: Horizontal Bar Chart
3. To Show Composition (Part-to-Whole) 🍕
Best For: Displaying how a total is divided into parts. Use with caution!
Chart Types:
- Pie Chart: Only use if you have few categories (max 5-6) and you want to show proportions of a whole. The *largest* slice is easiest to read.
- Donut Chart: Similar to pie, but the center is cut out (can sometimes display a total value).
- Stacked Bar Chart (100%): Shows proportions across categories, but as bars, which are often easier to compare than pie slices.
# Example Use Case: Market Share (if only 3 companies)
# Chart: Pie Chart (if few companies) or 100% Stacked Bar
Warning: Humans are bad at comparing slice angles. Bar charts are usually better for precise comparisons.
4. To Show Relationships (Correlation) 🔗
Best For: Seeing if two numerical variables are connected and how strongly.
Chart Types:
- Scatter Plot: The go-to. Each dot is an observation, showing the values of two variables. Look for patterns (linear, curved, clusters).
- Bubble Chart: A scatter plot where the size of the "bubble" (dot) represents a third numerical variable.
# Example Use Case: Does Experience correlate with Salary?
# Chart: Scatter Plot
5. To Show Distribution 📦
Best For: Understanding the range, spread, and central tendency of a single numerical variable.
Chart Types:
- Histogram: Shows frequency counts within bins (ranges) of your data. Great for spotting skewness or multi-modal distributions.
- Box Plot (Whisker Plot): Shows median, quartiles, and potential outliers. Excellent for comparing distributions across categories.
# Example Use Case: Distribution of customer ages
# Chart: Histogram or Box Plot (if comparing age by product)
💡 The Ultimate Rule:
Keep it simple. The chart should tell the story quickly. If your audience has to stare at it for five minutes to figure out what's going on, it's not working.
🎯 Today's Goal(What you should do)
✔️ Know which chart excels at showing trends vs. comparisons vs. relationships.
✔️ Use bar charts for categories and line charts for time.
✔️ Be very cautious with pie charts!
✔️ Use scatter plots to find connections.
Уже доступно! Исследование Telegram 2025 — ключевые инсайты года 
