Data science/ML/AI
Data science and machine learning hub Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources. For beginners, data scientists and ML engineers 👉 https://rebrand.ly/bigdatachannels DMCA: @disclosure_bds Contact: @mldatascientist
Show more📈 Analytical overview of Telegram channel Data science/ML/AI
Channel Data science/ML/AI (@datascience_bds) in the English language segment is an active participant. Currently, the community unites 13 674 subscribers, ranking 9 377 in the Technologies & Applications category and 31 635 in the India region.
📊 Audience metrics and dynamics
Since its creation on невідомо, the project has demonstrated rapid growth, gathering an audience of 13 674 subscribers.
According to the latest data from 09 June, 2026, the channel demonstrates stable activity. Although there has been a change in the number of participants by 155 over the last 30 days and by 5 over the last 24 hours, overall reach remains high.
- Verification status: Not verified
- Engagement rate (ER): The average audience engagement rate is 8.03%. Within the first 24 hours after publication, content typically collects 2.25% reactions from the total number of subscribers.
- Post reach: On average, each post receives 1 098 views. Within the first day, a publication typically gains 308 views.
- Reactions and interaction: The audience actively supports content: the average number of reactions per post is 5.
- Thematic interests: Content is focused on key topics such as panda, learning, row, api, ethic.
📝 Description and content policy
The author describes the resource as a platform for expressing subjective opinions:
“Data science and machine learning hub
Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources.
For beginners, data scientists and ML engineers
👉 https://rebrand.ly/bigdatachannels
DMCA: @disclosure_bds
Contact: @mldatasci...”
Thanks to the high frequency of updates (latest data received on 10 June, 2026), the channel maintains relevance and a high level of publication reach. Analytics show that the audience actively interacts with content, making it an important point of influence in the Technologies & Applications category.
Will this model work on unseen data?A single train/test split is unreliable, especially with small datasets. So K-Fold simulates multiple “future tests” using the same data. 🧠 What It Really Does Instead of one split, we: 🔀 Divide data into K folds 🔁 Train the model K times 📦 Each time: one fold validates, the rest train 📊 Average the scores Every sample gets validated once, which reduces evaluation noise and gives a more trustworthy estimate. Important: It improves evaluation, not the model itself. ⚠️ What People Often Miss 🚫 Do NOT use K-Fold as your final test. Keep a separate test set ⚖️ Use Stratified K-Fold for imbalanced classification. ⏳ Do NOT use standard K-Fold for time series. 📊 K = 5 or 10 is usually enough. ✅ In short K-Fold is just: A smart way to reuse limited data to simulate multiple real-world tests. No magic. Just careful evaluation.
How complex is your model’s decision boundary?VC dimension measures the largest number of points a model can shatter (perfectly classify in all labelings). Why this is important❔ Two models with similar parameter counts can have very different capacities. For example: 📦 k-NN → very high effective capacity 📐 Linear classifier → limited capacity 🌳 Deep trees → extremely high capacity What you need to understand Generalization depends on capacity relative to data size. Too much capacity with little data leads to overfitting. ✅ VC dimension is about expressive power, not just number of parameters.
df["age_scaled"] = (df["age"] - df["age"].mean()) / df["age"].std()
Why it is useful:
• Quick experiments
• Better intuition
• No pipeline overheadimport numpy as np
z = (df["salary"] - df["salary"].mean()) / df["salary"].std()
outliers = df[np.abs(z) > 3]
Why this matters:
• Clean data
• Better models
• Fewer surprises in production
Small code. Big impact.
Available now! Telegram Research 2025 — the year's key insights 
