Data science/ML/AI
Data science and machine learning hub Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources. For beginners, data scientists and ML engineers 👉 https://rebrand.ly/bigdatachannels DMCA: @disclosure_bds Contact: @mldatascientist
Mostrar más📈 Análisis del canal de Telegram Data science/ML/AI
El canal Data science/ML/AI (@datascience_bds) en el segmento lingüístico de Inglés es un actor destacado. Actualmente la comunidad reúne a 13 674 suscriptores, ocupando la posición 9 377 en la categoría Tecnologías y Aplicaciones y el puesto 31 635 en la región India.
📊 Métricas de audiencia y dinámica
Desde su creación el невідомо, el proyecto ha mostrado un crecimiento acelerado, reuniendo a 13 674 suscriptores.
Según los últimos datos del 09 junio, 2026, el canal mantiene una actividad estable. En los últimos 30 días la variación de miembros fue de 155, y en las últimas 24 horas de 5, conservando un alto alcance.
- Estado de verificación: No verificado
- Tasa de interacción (ER): El promedio de interacción de la audiencia es 8.03%. Durante las primeras 24 horas tras publicar, el contenido suele obtener 2.25% de reacciones respecto al total de suscriptores.
- Alcance de las publicaciones: Cada publicación recibe en promedio 1 098 visualizaciones. En el primer día suele acumular 308 visualizaciones.
- Reacciones e interacción: La audiencia responde de forma activa: el promedio de reacciones por publicación es 5.
- Intereses temáticos: El contenido se centra en temas clave como panda, learning, row, api, ethic.
📝 Descripción y política de contenido
El autor describe el recurso como un espacio para expresar opiniones subjetivas:
“Data science and machine learning hub
Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources.
For beginners, data scientists and ML engineers
👉 https://rebrand.ly/bigdatachannels
DMCA: @disclosure_bds
Contact: @mldatasci...”
Gracias a la alta frecuencia de actualizaciones (últimos datos recibidos el 10 junio, 2026), el canal mantiene la vigencia y un amplio alcance. La analítica demuestra que la audiencia interactúa activamente con el contenido, lo que lo convierte en un punto de referencia dentro de la categoría Tecnologías y Aplicaciones.
import pandas as pd
# Data: [Successes, Total Attempts]
data = {
'Hospital': ['A', 'A', 'B', 'B'],
'Case_Type': ['Easy', 'Hard', 'Easy', 'Hard'],
'Survived': [95, 10, 90, 70],
'Total': [100, 100, 100, 1000]
}
df = pd.DataFrame(data)
# 1. Check rates per group
df['Rate'] = df['Survived'] / df['Total']
print("--- Rates by Group ---")
print(df[['Hospital', 'Case_Type', 'Rate']])
# 2. Check overall rates
overall = df.groupby('Hospital').sum()
overall['Overall_Rate'] = overall['Survived'] / overall['Total']
print("\n--- Overall Rates (The Paradox!) ---")
print(overall['Overall_Rate'])
The Result:
• A is better at Easy (95% vs 90%).
• A is better at Hard (10% vs 7%).
• BUT... Overall, B wins (14% vs 52%) because B mostly did "Easy" cases.
🛠 How to avoid being fooled?
1. Don't trust the aggregate: When analyzing data, always try to "segment" or "drill down" into sub-groups.
2. Look for the Weight: Ask yourself: "Is one group disproportionately represented in the total?"
3. Identify the Lurking Variable: What context is missing? (e.g., Age, Severity, Time of Day).
🎯 The Takeaway
In Data Science, the "Big Picture" can sometimes be a big lie. If your analysis produces a result that defies logic, you might be looking at a Simpson’s Paradox. Always slice your data before you trust it.Goal: one place for everything a developer needs (free courses, tech news, job offers, manually written blogs. best github repos etc)A lot of you contributed by writing code or adding courses and knowledge along the way. This is as much yours as it is mine 🙌 And I’m already working on: • Personalized roadmaps • Live chat • Better job search & placement Try it and please tell me: What would you add next? Reminder that if you want early access to new features, Join our beta testers group. Looking for people who will explore, break things, and share honest feedback.
# Example Use Case: Monthly Website Traffic
# Chart: Line Chart
2. To Compare Categories 📊
Best For: Showing differences in size or value across distinct groups.
Chart Types:
- Bar Chart (Vertical/Column): Most common. Great for comparing quantities across groups. Easy to read exact values.
- Bar Chart (Horizontal): Better when you have many categories or long category names.
- Grouped Bar Chart: Compares sub-categories within main categories.
- Stacked Bar Chart: Shows total for a category AND how it's made up of sub-categories.
# Example Use Case: Sales per Region
# Chart: Horizontal Bar Chart
3. To Show Composition (Part-to-Whole) 🍕
Best For: Displaying how a total is divided into parts. Use with caution!
Chart Types:
- Pie Chart: Only use if you have few categories (max 5-6) and you want to show proportions of a whole. The *largest* slice is easiest to read.
- Donut Chart: Similar to pie, but the center is cut out (can sometimes display a total value).
- Stacked Bar Chart (100%): Shows proportions across categories, but as bars, which are often easier to compare than pie slices.
# Example Use Case: Market Share (if only 3 companies)
# Chart: Pie Chart (if few companies) or 100% Stacked Bar
Warning: Humans are bad at comparing slice angles. Bar charts are usually better for precise comparisons.
4. To Show Relationships (Correlation) 🔗
Best For: Seeing if two numerical variables are connected and how strongly.
Chart Types:
- Scatter Plot: The go-to. Each dot is an observation, showing the values of two variables. Look for patterns (linear, curved, clusters).
- Bubble Chart: A scatter plot where the size of the "bubble" (dot) represents a third numerical variable.
# Example Use Case: Does Experience correlate with Salary?
# Chart: Scatter Plot
5. To Show Distribution 📦
Best For: Understanding the range, spread, and central tendency of a single numerical variable.
Chart Types:
- Histogram: Shows frequency counts within bins (ranges) of your data. Great for spotting skewness or multi-modal distributions.
- Box Plot (Whisker Plot): Shows median, quartiles, and potential outliers. Excellent for comparing distributions across categories.
# Example Use Case: Distribution of customer ages
# Chart: Histogram or Box Plot (if comparing age by product)
💡 The Ultimate Rule:
Keep it simple. The chart should tell the story quickly. If your audience has to stare at it for five minutes to figure out what's going on, it's not working.
🎯 Today's Goal(What you should do)
✔️ Know which chart excels at showing trends vs. comparisons vs. relationships.
✔️ Use bar charts for categories and line charts for time.
✔️ Be very cautious with pie charts!
✔️ Use scatter plots to find connections.
¡Ya disponible! Investigación de Telegram 2025 — los principales insights del año 
