Data science/ML/AI
Data science and machine learning hub
Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources.
For beginners, data scientists and ML engineers
👉 https://rebrand.ly/bigdatachannels
DMCA: @disclosure_bds
Contact: @mldatascientist
📈 Analytical overview of Telegram channel Data science/ML/AI
The channel Data science/ML/AI (@datascience_bds) is active in the English-language segment. The community currently has 13 660 subscribers and ranks 9 391 in the Technologies & Applications category and 31 743 in the India region.
📊 Audience metrics and dynamics
Since its creation (date unknown), the project has demonstrated rapid growth, gathering an audience of 13 660 subscribers.
As of 07 June, 2026, the channel shows stable activity: the number of participants changed by 151 over the last 30 days and by -5 over the last 24 hours, and overall reach remains high.
- Verification status: Not verified
- Engagement rate (ER): The average audience engagement rate is 7.92%. Within the first 24 hours after publication, content typically collects 2.33% reactions from the total number of subscribers.
- Post reach: On average, each post receives 1 082 views. Within the first day, a publication typically gains 318 views.
- Reactions and interaction: The audience actively supports content: the average number of reactions per post is 5.
- Thematic interests: Content is focused on key topics such as panda, learning, row, api, ethic.
📝 Description and content policy
The author describes the resource as follows:
“Data science and machine learning hub
Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources.
For beginners, data scientists and ML engineers
👉 https://rebrand.ly/bigdatachannels
DMCA: @disclosure_bds
Contact: @mldatasci...”
Thanks to the high frequency of updates (latest data received on 08 June, 2026), the channel maintains relevance and a high level of publication reach. Analytics show that the audience actively interacts with content, making it an important point of influence in the Technologies & Applications category.
Would this value truly exist at the moment of prediction? If the answer is no, the model isn’t learning. It’s cheating.
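A minimal sketch of that check in pandas. The table and its column names (`basket_value`, `refund_issued`, `returned`) are made up for illustration; the point is only that a feature recorded after the outcome gets dropped before training.

```python
import pandas as pd

# Hypothetical order data; every column name here is an assumption.
orders = pd.DataFrame({
    "order_id": [1, 2, 3],
    "basket_value": [40.0, 15.5, 99.0],  # known when the order is placed
    "refund_issued": [0, 1, 0],          # only recorded weeks later
    "returned": [0, 1, 0],               # target we want to predict
})

# Features that would NOT exist at the moment of prediction.
not_available_at_prediction = ["refund_issued"]

# Keep only features that are known at prediction time.
X = orders.drop(columns=["order_id", "returned"] + not_available_at_prediction)
y = orders["returned"]

# A suspiciously perfect correlation with the target is a typical leakage symptom.
leaky_corr = orders["refund_issued"].corr(orders["returned"])
```

In this toy data the leaky column tracks the target exactly, which is what "cheating" tends to look like in practice.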
# Inner join (default)
merged = pd.merge(df_sales, df_customers, on='customer_id')
# Left join
pd.merge(df_sales, df_customers, on='customer_id', how='left')
# Concatenate vertically
all_data = pd.concat([df_2023, df_2024], ignore_index=True)
# Join df1's 'date' column against df2's index
df1.join(df2, on='date')
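To see how the inner and left joins above differ, here is a small sketch with toy data. The frames and their values are assumptions for illustration; `indicator=True` is a standard `pd.merge` option that adds a `_merge` column showing where each row came from.

```python
import pandas as pd

# Toy data: customer 4 has sales but no customer record.
df_sales = pd.DataFrame({"customer_id": [1, 2, 4], "sales": [100, 250, 75]})
df_customers = pd.DataFrame({"customer_id": [1, 2, 3], "name": ["Ann", "Bob", "Cy"]})

# Inner join keeps only customer_ids present in both frames.
inner = pd.merge(df_sales, df_customers, on="customer_id")

# Left join keeps every sales row; unmatched names become NaN.
left = pd.merge(df_sales, df_customers, on="customer_id", how="left", indicator=True)

# Rows from df_sales that found no match in df_customers.
unmatched = left[left["_merge"] == "left_only"]
```

Checking `unmatched` after a left join is a quick way to spot keys that silently failed to match.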
This wraps up our Data Manipulation Using Pandas Series.
Hit ❤️ if you liked this series. It will help us tailor more content based on what you like.
👉Join @datascience_bds for more
Part of the @bigdataspecialist family
# Sort by one column
df.sort_values('sales', ascending=False)
# Sort by multiple columns
df.sort_values(['region', 'sales'], ascending=[True, False])
# Reset index after sorting
df = df.sort_values('sales', ascending=False).reset_index(drop=True)
# Add rank
df['sales_rank'] = df['sales'].rank(ascending=False)
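One thing worth knowing about `rank`: by default tied values share the average of their positions, so ranks can be fractional. A short sketch with made-up sales figures, comparing the default with the `min` and `dense` methods:

```python
import pandas as pd

# Toy data with a tie at 200.
df = pd.DataFrame({"sales": [300, 200, 200, 100]})

# Default method='average': tied rows share the mean of their positions.
df["rank_average"] = df["sales"].rank(ascending=False)

# method='min': tied rows get the lowest rank in the group, then a gap follows.
df["rank_min"] = df["sales"].rank(ascending=False, method="min")

# method='dense': like 'min', but with no gaps after ties.
df["rank_dense"] = df["sales"].rank(ascending=False, method="dense")
```

Which method fits depends on how you want ties reported; `dense` is often the easiest to read in a leaderboard.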
Next up 👉 Merging and Joining Data
# Total sales by region
df.groupby('region')['sales'].sum()
# Multiple aggregations
df.groupby('region').agg({
'sales': 'sum',
'customer_id': 'nunique',
'order_date': 'max'
})
# Group by multiple columns
df.groupby(['region', 'product'])['sales'].mean()
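As a variation on the multiple-aggregation example, named aggregation lets you pick the output column names yourself. A minimal sketch with toy data (the values and the output names `total_sales` and `unique_customers` are assumptions):

```python
import pandas as pd

# Toy data for illustration only.
df = pd.DataFrame({
    "region": ["West", "West", "East", "West"],
    "sales": [100, 200, 50, 25],
    "customer_id": [1, 3, 2, 1],
})

# Named aggregation: output_name=(input_column, function).
# as_index=False keeps 'region' as a regular column instead of the index.
summary = df.groupby("region", as_index=False).agg(
    total_sales=("sales", "sum"),
    unique_customers=("customer_id", "nunique"),
)
```

The result is a flat DataFrame that is ready to sort, merge, or export.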
Next up 👉 Sorting and Ranking
# Check for nulls
df.isnull().sum()
# Drop rows with any missing values
df_clean = df.dropna()
# Fill missing values
# Assign the result back; inplace=True on a selected column is unreliable in recent pandas
df['age'] = df['age'].fillna(df['age'].median())
df['category'] = df['category'].fillna('Unknown')
# Forward or backward fill (great for time series)
df['value'].ffill()
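The snippet above shows only the forward fill. Here is a small sketch of both directions on a made-up series, plus the `limit` argument for capping how far a value is carried:

```python
import pandas as pd

# Toy time series with a two-step gap.
readings = pd.Series([1.0, None, None, 4.0])

# Forward fill: carry the last known value forward.
forward = readings.ffill()

# Backward fill: pull the next known value backward.
backward = readings.bfill()

# Carry a value forward by at most one step; longer gaps stay missing.
limited = readings.ffill(limit=1)
```

Both methods return a new Series, so assign the result back if you want to keep it.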
Next up 👉 Using GroupBy
# Add new column
df['revenue'] = df['sales'] * df['price']
# From existing columns
df['full_name'] = df['first_name'] + ' ' + df['last_name']
# Drop columns
df.drop(columns=['temp_col'], inplace=True)
# Or create a new DF without modifying original
clean_df = df.drop(columns=['old_col1', 'old_col2'])
Next up 👉 Dealing with Missing Values
# Multiple conditions
high_sales = df[(df['sales'] > 1000) & (df['region'] == 'West')]
# Using .query() – cleaner syntax!
high_performers = df.query("sales > 1000 and region == 'West'")
# Find missing values
df[df['email'].isna()]
# Contains substring (na=False treats missing values as non-matches)
df[df['product'].str.contains('Pro', case=False, na=False)]
Next up 👉 Adding and Removing Columns
# Single column (Series)
df['name']
# Multiple columns (DataFrame)
df[['name', 'age', 'sales']]
# Row selection with .loc (label-based)
df.loc[0:5] # Labels 0 through 5 (end label included)
df.loc[df['sales'] > 1000] # Conditional
# .iloc (position-based)
df.iloc[0:5, 1:4] # Rows 0-4, columns 1-3
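The slicing difference between `.loc` and `.iloc` is easy to trip over, so here is a quick sketch on a toy DataFrame (the data is made up): `.loc` includes the end label, `.iloc` excludes the end position.

```python
import pandas as pd

# Toy data with the default 0..7 integer index.
df = pd.DataFrame({"name": list("abcdefgh"), "sales": range(8)})

# Label-based: both endpoints are included, so this returns 6 rows.
by_label = df.loc[0:5]

# Position-based: the end is excluded, so this returns 5 rows.
by_position = df.iloc[0:5]

# With a non-integer index, .loc slices by those labels (still inclusive).
by_name = df.set_index("name").loc["b":"d"]
```

With the default integer index the two look interchangeable, which is exactly why the off-by-one surprise shows up later.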
Next up 👉 Filtering and Querying
import pandas as pd
# Load CSV
df = pd.read_csv('sales_data.csv')
# Quick look
df.head() # First 5 rows
df.info() # Structure & data types
df.describe() # Basic stats
Next up 👉 Selecting Columns & Rows