Data science/ML/AI
Data science and machine learning hub Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources. For beginners, data scientists and ML engineers 👉 https://rebrand.ly/bigdatachannels DMCA: @disclosure_bds Contact: @mldatascientist
Show more📈 Analytical overview of Telegram channel Data science/ML/AI
Channel Data science/ML/AI (@datascience_bds) in the English language segment is an active participant. Currently, the community unites 13 674 subscribers, ranking 9 380 in the Technologies & Applications category and 31 607 in the India region.
📊 Audience metrics and dynamics
Since its creation on невідомо, the project has demonstrated rapid growth, gathering an audience of 13 674 subscribers.
According to the latest data from 10 June, 2026, the channel demonstrates stable activity. Although there has been a change in the number of participants by 143 over the last 30 days and by 2 over the last 24 hours, overall reach remains high.
- Verification status: Not verified
- Engagement rate (ER): The average audience engagement rate is 8.09%. Within the first 24 hours after publication, content typically collects 2.22% reactions from the total number of subscribers.
- Post reach: On average, each post receives 1 106 views. Within the first day, a publication typically gains 304 views.
- Reactions and interaction: The audience actively supports content: the average number of reactions per post is 5.
- Thematic interests: Content is focused on key topics such as panda, learning, row, api, ethic.
📝 Description and content policy
The author describes the resource as a platform for expressing subjective opinions:
“Data science and machine learning hub
Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources.
For beginners, data scientists and ML engineers
👉 https://rebrand.ly/bigdatachannels
DMCA: @disclosure_bds
Contact: @mldatasci...”
Thanks to the high frequency of updates (latest data received on 11 June, 2026), the channel maintains relevance and a high level of publication reach. Analytics show that the audience actively interacts with content, making it an important point of influence in the Technologies & Applications category.
import pandas as pd
# Data: [Successes, Total Attempts]
data = {
'Hospital': ['A', 'A', 'B', 'B'],
'Case_Type': ['Easy', 'Hard', 'Easy', 'Hard'],
'Survived': [95, 10, 90, 70],
'Total': [100, 100, 100, 1000]
}
df = pd.DataFrame(data)
# 1. Check rates per group
df['Rate'] = df['Survived'] / df['Total']
print("--- Rates by Group ---")
print(df[['Hospital', 'Case_Type', 'Rate']])
# 2. Check overall rates
overall = df.groupby('Hospital').sum()
overall['Overall_Rate'] = overall['Survived'] / overall['Total']
print("\n--- Overall Rates (The Paradox!) ---")
print(overall['Overall_Rate'])
The Result:
• A is better at Easy (95% vs 90%).
• A is better at Hard (10% vs 7%).
• BUT... Overall, B wins (14% vs 52%) because B mostly did "Easy" cases.
🛠 How to avoid being fooled?
1. Don't trust the aggregate: When analyzing data, always try to "segment" or "drill down" into sub-groups.
2. Look for the Weight: Ask yourself: "Is one group disproportionately represented in the total?"
3. Identify the Lurking Variable: What context is missing? (e.g., Age, Severity, Time of Day).
🎯 The Takeaway
In Data Science, the "Big Picture" can sometimes be a big lie. If your analysis produces a result that defies logic, you might be looking at a Simpson’s Paradox. Always slice your data before you trust it.Goal: one place for everything a developer needs (free courses, tech news, job offers, manually written blogs. best github repos etc)A lot of you contributed by writing code or adding courses and knowledge along the way. This is as much yours as it is mine 🙌 And I’m already working on: • Personalized roadmaps • Live chat • Better job search & placement Try it and please tell me: What would you add next? Reminder that if you want early access to new features, Join our beta testers group. Looking for people who will explore, break things, and share honest feedback.
# Example Use Case: Monthly Website Traffic
# Chart: Line Chart
2. To Compare Categories 📊
Best For: Showing differences in size or value across distinct groups.
Chart Types:
- Bar Chart (Vertical/Column): Most common. Great for comparing quantities across groups. Easy to read exact values.
- Bar Chart (Horizontal): Better when you have many categories or long category names.
- Grouped Bar Chart: Compares sub-categories within main categories.
- Stacked Bar Chart: Shows total for a category AND how it's made up of sub-categories.
# Example Use Case: Sales per Region
# Chart: Horizontal Bar Chart
3. To Show Composition (Part-to-Whole) 🍕
Best For: Displaying how a total is divided into parts. Use with caution!
Chart Types:
- Pie Chart: Only use if you have few categories (max 5-6) and you want to show proportions of a whole. The *largest* slice is easiest to read.
- Donut Chart: Similar to pie, but the center is cut out (can sometimes display a total value).
- Stacked Bar Chart (100%): Shows proportions across categories, but as bars, which are often easier to compare than pie slices.
# Example Use Case: Market Share (if only 3 companies)
# Chart: Pie Chart (if few companies) or 100% Stacked Bar
Warning: Humans are bad at comparing slice angles. Bar charts are usually better for precise comparisons.
4. To Show Relationships (Correlation) 🔗
Best For: Seeing if two numerical variables are connected and how strongly.
Chart Types:
- Scatter Plot: The go-to. Each dot is an observation, showing the values of two variables. Look for patterns (linear, curved, clusters).
- Bubble Chart: A scatter plot where the size of the "bubble" (dot) represents a third numerical variable.
# Example Use Case: Does Experience correlate with Salary?
# Chart: Scatter Plot
5. To Show Distribution 📦
Best For: Understanding the range, spread, and central tendency of a single numerical variable.
Chart Types:
- Histogram: Shows frequency counts within bins (ranges) of your data. Great for spotting skewness or multi-modal distributions.
- Box Plot (Whisker Plot): Shows median, quartiles, and potential outliers. Excellent for comparing distributions across categories.
# Example Use Case: Distribution of customer ages
# Chart: Histogram or Box Plot (if comparing age by product)
💡 The Ultimate Rule:
Keep it simple. The chart should tell the story quickly. If your audience has to stare at it for five minutes to figure out what's going on, it's not working.
🎯 Today's Goal(What you should do)
✔️ Know which chart excels at showing trends vs. comparisons vs. relationships.
✔️ Use bar charts for categories and line charts for time.
✔️ Be very cautious with pie charts!
✔️ Use scatter plots to find connections.
Available now! Telegram Research 2025 — the year's key insights 
