Data science/ML/AI
Data science and machine learning hub Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources. For beginners, data scientists and ML engineers 👉 https://rebrand.ly/bigdatachannels DMCA: @disclosure_bds Contact: @mldatascientist
Mostrar más📈 Análisis del canal de Telegram Data science/ML/AI
El canal Data science/ML/AI (@datascience_bds) en el segmento lingüístico de Inglés es un actor destacado. Actualmente la comunidad reúne a 13 660 suscriptores, ocupando la posición 9 391 en la categoría Tecnologías y Aplicaciones y el puesto 31 743 en la región India.
📊 Métricas de audiencia y dinámica
Desde su creación el невідомо, el proyecto ha mostrado un crecimiento acelerado, reuniendo a 13 660 suscriptores.
Según los últimos datos del 07 junio, 2026, el canal mantiene una actividad estable. En los últimos 30 días la variación de miembros fue de 151, y en las últimas 24 horas de -5, conservando un alto alcance.
- Estado de verificación: No verificado
- Tasa de interacción (ER): El promedio de interacción de la audiencia es 7.92%. Durante las primeras 24 horas tras publicar, el contenido suele obtener 2.33% de reacciones respecto al total de suscriptores.
- Alcance de las publicaciones: Cada publicación recibe en promedio 1 082 visualizaciones. En el primer día suele acumular 318 visualizaciones.
- Reacciones e interacción: La audiencia responde de forma activa: el promedio de reacciones por publicación es 5.
- Intereses temáticos: El contenido se centra en temas clave como panda, learning, row, api, ethic.
📝 Descripción y política de contenido
El autor describe el recurso como un espacio para expresar opiniones subjetivas:
“Data science and machine learning hub
Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources.
For beginners, data scientists and ML engineers
👉 https://rebrand.ly/bigdatachannels
DMCA: @disclosure_bds
Contact: @mldatasci...”
Gracias a la alta frecuencia de actualizaciones (últimos datos recibidos el 08 junio, 2026), el canal mantiene la vigencia y un amplio alcance. La analítica demuestra que la audiencia interactúa activamente con el contenido, lo que lo convierte en un punto de referencia dentro de la categoría Tecnologías y Aplicaciones.
df.isnull().sum() # Check missing values
df.dropna() # Remove rows with missing values
df.fillna(0) # Replace missing values
Removing Duplicate Data
df.duplicated() # Identify duplicates
df.drop_duplicates() # Remove duplicates
Correcting Data Types
df.dtypes #identify data types
df["age"] = df["age"].astype(int) #convert age column to integer data type
df["date"] = pd.to_datetime(df["date"]) #convert date column to date data type
Renaming Columns
df.columns = df.columns.str.lower().str.replace(" ", "_")
Handling Inconsistent Data
df["gender"] = df["gender"].str.lower() #convert to lower case
df["name"] = df["name"].str.strip()
Clean data leads to more accurate analysis and reliable models. Python’s pandas library simplifies cleaning tasks such as handling missing values, duplicates, incorrect types, and inconsistencies.import pandas as pd
# Read Parquet file into a DataFrame
df = pd.read_parquet("data.parquet")
ORC (Optimized Row Columnar)
ORC is a columnar format optimized for high-performance analytics and commonly used in Hadoop-based systems.
import pandas as pd
# Read ORC file into a DataFrame
df = pd.read_orc("data.orc")
Feather
Feather is a lightweight binary format designed for fast data exchange between Python and other languages like R.
import pandas as pd
# Read Feather file into a DataFrame
df = pd.read_feather("data.feather")
✅ This concludes our Data Importing Series.
👉Join @datascience_bds for more
Part of the @bigdataspecialist family ❤️import pandas as pd
# URL of the webpage containing HTML tables
url = "https://example.com/page"
# Read all tables from the webpage
tables = pd.read_html(url)
# Select the first table
df = tables[0]
Next up ➡️ Big Data Formatsimport pickle # Library for object serialization
# Open the pickle file in read-binary mode
with open("data.pkl", "rb") as file:
data = pickle.load(file) # Load the stored Python object
Using Pickle with Pandas
import pandas as pd
# Load a pickled pandas DataFrame
df = pd.read_pickle("data.pkl")
Next up ➡️ Importing HTML Tablesimport requests
# API endpoint
url = "https://api.example.com/data"
# Parameters including the API key for authentication
params = {
"api_key": "YOUR_API_KEY" # Replace with your actual API key
}
# Send GET request with parameters
response = requests.get(url, params=params)
# Convert JSON response to Python object
data = response.json()
# Print the data
print(data)
Next up ➡️ Importing Pickle files in pythonimport requests # Library for making HTTP requests
import pandas as pd # Library for data manipulation and analysis
# API endpoint
url = "https://api.example.com/users"
# Send request to API
response = requests.get(url)
# Convert JSON response to Python object
data = response.json()
# Convert the JSON data into a pandas DataFrame
df = pd.DataFrame(data)
# Display the first five rows of the DataFrame
print(df.head())
Next up ➡️ API Key Authentication# Import json module (built-in, no install needed!)
import json
# Or import pandas if you want it directly as a DataFrame
import pandas as pd
# Your JSON file path
filename = "data.json"
# Load JSON file into a Python dictionary/list
with open(filename, "r", encoding="utf-8") as file:
data = json.load(file)
# Quick look at structure and first few items
print(type(data)) # usually dict or list
print(data.keys() if isinstance(data, dict) else len(data))
# Load the json file
df = pd.read_json(filename)
df.head()
👉Join @datascience_bds for more
Part of the @bigdataspecialist family# Loading a text file in Python
filename = 'huck_finn.txt' # Name of the file to open
file = open(filename, mode='r') # Open file in read mode ('r')
# Use encoding='utf-8' if needed
text = file.read() # Read entire content into a string
print(file.closed) # False → file is still open
file.close() # Always close the file when done!
# Prevents memory leaks & file locks
print(file.closed) # Now True → file is safely closed
print(text) # Display the full text content
Next up ➡️ Loading a JSON file in Python
👉Join @datascience_bds for more
Part of the @bigdataspecialist family# Import the pandas library
import pandas as pd
# Specify the path to your Excel file (.xlsx or .xls)
filename = "data.xlsx"
# Read the Excel file into a DataFrame
# Common options you'll use all the time:
df = pd.read_excel(
filename,
sheet_name=0, # 0 = first sheet
header=0, # Row (0-indexed) to use as column names
skiprows=4, # Skip first 4 rows
nrows=1000, # Load only first 1000 rows
)
# Check the first five rows
df.head()
Next up ➡️ Loading a text file in Python
👉Join @datascience_bds for more
Part of the @bigdataspecialist family# Import the pandas library
import pandas as pd
# Specify the path to your CSV file
filename = "data.csv"
# Read the CSV file into a DataFrame
df = pd.read_csv(filename)
#Checking the first five rows
df.head()
Next up ➡️ Loading an Excel file in Python
👉Join @datascience_bds for more
Part of the @bigdataspecialist family
¡Ya disponible! Investigación de Telegram 2025 — los principales insights del año 
