Data science/ML/AI
Data science and machine learning hub Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources. For beginners, data scientists and ML engineers 👉 https://rebrand.ly/bigdatachannels DMCA: @disclosure_bds Contact: @mldatascientist
Ko'proq ko'rsatish📈 Telegram kanali Data science/ML/AI analitikasi
Data science/ML/AI (@datascience_bds) Ingliz til segmentidagi kanali faol ishtirokchi. Hozirda hamjamiyat 13 667 obunachidan iborat bo'lib, Texnologiyalar & Aralashmalar toifasida 9 391-o'rinni va Hindiston mintaqasida 31 743-o'rinni egallagan.
📊 Auditoriya ko‘rsatkichlari va dinamika
невідомо sanasidan buyon loyiha tez o‘sib, 13 667 obunachiga ega bo‘ldi.
08 Iyun, 2026 dagi oxirgi ma’lumotlarga ko‘ra kanal barqaror faollikka ega. Oxirgi 30 kunda obunachilar soni 150 ga, so‘nggi 24 soatda esa 4 ga o‘zgardi va umumiy qamrov yuqori darajada qolmoqda.
- Tasdiqlash holati: Tasdiqlanmagan
- Jalb etish (ER): Auditoriya o‘rtacha 7.97% darajada jalb etiladi. Nashrdan keyingi dastlabki 24 soatda kontent odatda umumiy obunachilar sonining 2.27% ini tashkil etuvchi reaksiyalarni to‘playdi.
- Post qamrovi: Har bir post o‘rtacha 1 089 marta ko‘riladi; birinchi sutkada odatda 310 ta ko‘rish yig‘iladi.
- Reaksiyalar va o‘zaro ta’sir: Auditoriya faol: har bir postga o‘rtacha 5 ta reaksiya keladi.
- Tematik yo‘nalishlar: Kontent panda, learning, row, api, ethic kabi asosiy mavzularga jamlangan.
📝 Tavsif va kontent siyosati
Muallif resursni shaxsiy fikrni ifoda etish maydoni sifatida ta’riflaydi:
“Data science and machine learning hub
Python, SQL, stats, ML, deep learning, projects, PDFs, roadmaps and AI resources.
For beginners, data scientists and ML engineers
👉 https://rebrand.ly/bigdatachannels
DMCA: @disclosure_bds
Contact: @mldatasci...”
Yuqori yangilanish chastotasi (oxirgi ma’lumot 09 Iyun, 2026 da olingan) sababli kanal doimo dolzarb va katta qamrovli bo‘lib qoladi. Analitika auditoriya kontent bilan faol hamkorlik qilishini, uni Texnologiyalar & Aralashmalar toifasidagi muhim ta’sir nuqtasiga aylantirishini ko‘rsatadi.
df.isnull().sum() # Check missing values
df.dropna() # Remove rows with missing values
df.fillna(0) # Replace missing values
Removing Duplicate Data
df.duplicated() # Identify duplicates
df.drop_duplicates() # Remove duplicates
Correcting Data Types
df.dtypes #identify data types
df["age"] = df["age"].astype(int) #convert age column to integer data type
df["date"] = pd.to_datetime(df["date"]) #convert date column to date data type
Renaming Columns
df.columns = df.columns.str.lower().str.replace(" ", "_")
Handling Inconsistent Data
df["gender"] = df["gender"].str.lower() #convert to lower case
df["name"] = df["name"].str.strip()
Clean data leads to more accurate analysis and reliable models. Python’s pandas library simplifies cleaning tasks such as handling missing values, duplicates, incorrect types, and inconsistencies.import pandas as pd
# Read Parquet file into a DataFrame
df = pd.read_parquet("data.parquet")
ORC (Optimized Row Columnar)
ORC is a columnar format optimized for high-performance analytics and commonly used in Hadoop-based systems.
import pandas as pd
# Read ORC file into a DataFrame
df = pd.read_orc("data.orc")
Feather
Feather is a lightweight binary format designed for fast data exchange between Python and other languages like R.
import pandas as pd
# Read Feather file into a DataFrame
df = pd.read_feather("data.feather")
✅ This concludes our Data Importing Series.
👉Join @datascience_bds for more
Part of the @bigdataspecialist family ❤️import pandas as pd
# URL of the webpage containing HTML tables
url = "https://example.com/page"
# Read all tables from the webpage
tables = pd.read_html(url)
# Select the first table
df = tables[0]
Next up ➡️ Big Data Formatsimport pickle # Library for object serialization
# Open the pickle file in read-binary mode
with open("data.pkl", "rb") as file:
data = pickle.load(file) # Load the stored Python object
Using Pickle with Pandas
import pandas as pd
# Load a pickled pandas DataFrame
df = pd.read_pickle("data.pkl")
Next up ➡️ Importing HTML Tablesimport requests
# API endpoint
url = "https://api.example.com/data"
# Parameters including the API key for authentication
params = {
"api_key": "YOUR_API_KEY" # Replace with your actual API key
}
# Send GET request with parameters
response = requests.get(url, params=params)
# Convert JSON response to Python object
data = response.json()
# Print the data
print(data)
Next up ➡️ Importing Pickle files in pythonimport requests # Library for making HTTP requests
import pandas as pd # Library for data manipulation and analysis
# API endpoint
url = "https://api.example.com/users"
# Send request to API
response = requests.get(url)
# Convert JSON response to Python object
data = response.json()
# Convert the JSON data into a pandas DataFrame
df = pd.DataFrame(data)
# Display the first five rows of the DataFrame
print(df.head())
Next up ➡️ API Key Authentication# Import json module (built-in, no install needed!)
import json
# Or import pandas if you want it directly as a DataFrame
import pandas as pd
# Your JSON file path
filename = "data.json"
# Load JSON file into a Python dictionary/list
with open(filename, "r", encoding="utf-8") as file:
data = json.load(file)
# Quick look at structure and first few items
print(type(data)) # usually dict or list
print(data.keys() if isinstance(data, dict) else len(data))
# Load the json file
df = pd.read_json(filename)
df.head()
👉Join @datascience_bds for more
Part of the @bigdataspecialist family# Loading a text file in Python
filename = 'huck_finn.txt' # Name of the file to open
file = open(filename, mode='r') # Open file in read mode ('r')
# Use encoding='utf-8' if needed
text = file.read() # Read entire content into a string
print(file.closed) # False → file is still open
file.close() # Always close the file when done!
# Prevents memory leaks & file locks
print(file.closed) # Now True → file is safely closed
print(text) # Display the full text content
Next up ➡️ Loading a JSON file in Python
👉Join @datascience_bds for more
Part of the @bigdataspecialist family# Import the pandas library
import pandas as pd
# Specify the path to your Excel file (.xlsx or .xls)
filename = "data.xlsx"
# Read the Excel file into a DataFrame
# Common options you'll use all the time:
df = pd.read_excel(
filename,
sheet_name=0, # 0 = first sheet
header=0, # Row (0-indexed) to use as column names
skiprows=4, # Skip first 4 rows
nrows=1000, # Load only first 1000 rows
)
# Check the first five rows
df.head()
Next up ➡️ Loading a text file in Python
👉Join @datascience_bds for more
Part of the @bigdataspecialist family# Import the pandas library
import pandas as pd
# Specify the path to your CSV file
filename = "data.csv"
# Read the CSV file into a DataFrame
df = pd.read_csv(filename)
#Checking the first five rows
df.head()
Next up ➡️ Loading an Excel file in Python
👉Join @datascience_bds for more
Part of the @bigdataspecialist family
Endi mavjud! Telegram Tadqiqoti 2025 — yilning asosiy insaytlari 
