Data science/ML/AI
Data science and machine learning hub Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources. For beginners, data scientists and ML engineers 👉 https://rebrand.ly/bigdatachannels DMCA: @disclosure_bds Contact: @mldatascientist
Show more📈 Analytical overview of Telegram channel Data science/ML/AI
Channel Data science/ML/AI (@datascience_bds) in the English language segment is an active participant. Currently, the community unites 13 660 subscribers, ranking 9 391 in the Technologies & Applications category and 31 743 in the India region.
📊 Audience metrics and dynamics
Since its creation on невідомо, the project has demonstrated rapid growth, gathering an audience of 13 660 subscribers.
According to the latest data from 07 June, 2026, the channel demonstrates stable activity. Although there has been a change in the number of participants by 151 over the last 30 days and by -5 over the last 24 hours, overall reach remains high.
- Verification status: Not verified
- Engagement rate (ER): The average audience engagement rate is 7.92%. Within the first 24 hours after publication, content typically collects 2.33% reactions from the total number of subscribers.
- Post reach: On average, each post receives 1 082 views. Within the first day, a publication typically gains 318 views.
- Reactions and interaction: The audience actively supports content: the average number of reactions per post is 5.
- Thematic interests: Content is focused on key topics such as panda, learning, row, api, ethic.
📝 Description and content policy
The author describes the resource as a platform for expressing subjective opinions:
“Data science and machine learning hub
Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources.
For beginners, data scientists and ML engineers
👉 https://rebrand.ly/bigdatachannels
DMCA: @disclosure_bds
Contact: @mldatasci...”
Thanks to the high frequency of updates (latest data received on 08 June, 2026), the channel maintains relevance and a high level of publication reach. Analytics show that the audience actively interacts with content, making it an important point of influence in the Technologies & Applications category.
df.isnull().sum() # Check missing values
df.dropna() # Remove rows with missing values
df.fillna(0) # Replace missing values
Removing Duplicate Data
df.duplicated() # Identify duplicates
df.drop_duplicates() # Remove duplicates
Correcting Data Types
df.dtypes #identify data types
df["age"] = df["age"].astype(int) #convert age column to integer data type
df["date"] = pd.to_datetime(df["date"]) #convert date column to date data type
Renaming Columns
df.columns = df.columns.str.lower().str.replace(" ", "_")
Handling Inconsistent Data
df["gender"] = df["gender"].str.lower() #convert to lower case
df["name"] = df["name"].str.strip()
Clean data leads to more accurate analysis and reliable models. Python’s pandas library simplifies cleaning tasks such as handling missing values, duplicates, incorrect types, and inconsistencies.import pandas as pd
# Read Parquet file into a DataFrame
df = pd.read_parquet("data.parquet")
ORC (Optimized Row Columnar)
ORC is a columnar format optimized for high-performance analytics and commonly used in Hadoop-based systems.
import pandas as pd
# Read ORC file into a DataFrame
df = pd.read_orc("data.orc")
Feather
Feather is a lightweight binary format designed for fast data exchange between Python and other languages like R.
import pandas as pd
# Read Feather file into a DataFrame
df = pd.read_feather("data.feather")
✅ This concludes our Data Importing Series.
👉Join @datascience_bds for more
Part of the @bigdataspecialist family ❤️import pandas as pd
# URL of the webpage containing HTML tables
url = "https://example.com/page"
# Read all tables from the webpage
tables = pd.read_html(url)
# Select the first table
df = tables[0]
Next up ➡️ Big Data Formatsimport pickle # Library for object serialization
# Open the pickle file in read-binary mode
with open("data.pkl", "rb") as file:
data = pickle.load(file) # Load the stored Python object
Using Pickle with Pandas
import pandas as pd
# Load a pickled pandas DataFrame
df = pd.read_pickle("data.pkl")
Next up ➡️ Importing HTML Tablesimport requests
# API endpoint
url = "https://api.example.com/data"
# Parameters including the API key for authentication
params = {
"api_key": "YOUR_API_KEY" # Replace with your actual API key
}
# Send GET request with parameters
response = requests.get(url, params=params)
# Convert JSON response to Python object
data = response.json()
# Print the data
print(data)
Next up ➡️ Importing Pickle files in pythonimport requests # Library for making HTTP requests
import pandas as pd # Library for data manipulation and analysis
# API endpoint
url = "https://api.example.com/users"
# Send request to API
response = requests.get(url)
# Convert JSON response to Python object
data = response.json()
# Convert the JSON data into a pandas DataFrame
df = pd.DataFrame(data)
# Display the first five rows of the DataFrame
print(df.head())
Next up ➡️ API Key Authentication# Import json module (built-in, no install needed!)
import json
# Or import pandas if you want it directly as a DataFrame
import pandas as pd
# Your JSON file path
filename = "data.json"
# Load JSON file into a Python dictionary/list
with open(filename, "r", encoding="utf-8") as file:
data = json.load(file)
# Quick look at structure and first few items
print(type(data)) # usually dict or list
print(data.keys() if isinstance(data, dict) else len(data))
# Load the json file
df = pd.read_json(filename)
df.head()
👉Join @datascience_bds for more
Part of the @bigdataspecialist family# Loading a text file in Python
filename = 'huck_finn.txt' # Name of the file to open
file = open(filename, mode='r') # Open file in read mode ('r')
# Use encoding='utf-8' if needed
text = file.read() # Read entire content into a string
print(file.closed) # False → file is still open
file.close() # Always close the file when done!
# Prevents memory leaks & file locks
print(file.closed) # Now True → file is safely closed
print(text) # Display the full text content
Next up ➡️ Loading a JSON file in Python
👉Join @datascience_bds for more
Part of the @bigdataspecialist family# Import the pandas library
import pandas as pd
# Specify the path to your Excel file (.xlsx or .xls)
filename = "data.xlsx"
# Read the Excel file into a DataFrame
# Common options you'll use all the time:
df = pd.read_excel(
filename,
sheet_name=0, # 0 = first sheet
header=0, # Row (0-indexed) to use as column names
skiprows=4, # Skip first 4 rows
nrows=1000, # Load only first 1000 rows
)
# Check the first five rows
df.head()
Next up ➡️ Loading a text file in Python
👉Join @datascience_bds for more
Part of the @bigdataspecialist family# Import the pandas library
import pandas as pd
# Specify the path to your CSV file
filename = "data.csv"
# Read the CSV file into a DataFrame
df = pd.read_csv(filename)
#Checking the first five rows
df.head()
Next up ➡️ Loading an Excel file in Python
👉Join @datascience_bds for more
Part of the @bigdataspecialist family
Available now! Telegram Research 2025 — the year's key insights 
