Machine Learning with Python
Learn Machine Learning with hands-on Python tutorials, real-world code examples, and clear explanations for researchers and developers. Admin: @HusseinSheikho || @Hussein_Sheikho
Show more📈 Analytical overview of Telegram channel Machine Learning with Python
Channel Machine Learning with Python (@codeprogrammer) in the English language segment is an active participant. Currently, the community unites 67 820 subscribers, ranking 2 411 in the Education category and 5 035 in the India region.
📊 Audience metrics and dynamics
Since its creation on невідомо, the project has demonstrated rapid growth, gathering an audience of 67 820 subscribers.
According to the latest data from 06 June, 2026, the channel demonstrates stable activity. Although there has been a change in the number of participants by 55 over the last 30 days and by -2 over the last 24 hours, overall reach remains high.
- Verification status: Not verified
- Engagement rate (ER): The average audience engagement rate is 2.54%. Within the first 24 hours after publication, content typically collects 2.53% reactions from the total number of subscribers.
- Post reach: On average, each post receives 1 720 views. Within the first day, a publication typically gains 1 714 views.
- Reactions and interaction: The audience actively supports content: the average number of reactions per post is 6.
- Thematic interests: Content is focused on key topics such as insidead, learning, degree, evaluation, algorithm.
📝 Description and content policy
The author describes the resource as a platform for expressing subjective opinions:
“Learn Machine Learning with hands-on Python tutorials, real-world code examples, and clear explanations for researchers and developers.
Admin: @HusseinSheikho || @Hussein_Sheikho”
Thanks to the high frequency of updates (latest data received on 08 June, 2026), the channel maintains relevance and a high level of publication reach. Analytics show that the audience actively interacts with content, making it an important point of influence in the Education category.
# Initialize the TF-IDF Vectorizer
vectorizer = TfidfVectorizer()
# Fit the vectorizer on the training data and transform it
X_train_tfidf = vectorizer.fit_transform(X_train)
# Only transform the test data using the already-fitted vectorizer
X_test_tfidf = vectorizer.transform(X_test)
print("Shape of training data vectors:", X_train_tfidf.shape)
print("Shape of testing data vectors:", X_test_tfidf.shape)
---
Step 5: Training the NLP Model
Now we can train a machine learning model. Multinomial Naive Bayes is a simple yet powerful algorithm that works very well for text classification tasks.
#ModelTraining #NaiveBayes
# Initialize and train the Naive Bayes classifier
model = MultinomialNB()
model.fit(X_train_tfidf, y_train)
print("Model training complete.")
---
Step 6: Making Predictions and Evaluating the Model
With our model trained, let's use it to make predictions on our unseen test data and see how well it performs.
#Evaluation #ModelPerformance #Prediction
# Make predictions on the test set
y_pred = model.predict(X_test_tfidf)
# Calculate accuracy
accuracy = accuracy_score(y_test, y_pred)
print(f"Model Accuracy: {accuracy * 100:.2f}%\n")
# Display a detailed classification report
print("Classification Report:")
print(classification_report(y_test, y_pred, target_names=['Negative', 'Positive']))
---
Step 7: Discussion of Results
#Results #Discussion
Our model achieved 100% accuracy on this very small test set.
Accuracy: This is the percentage of correct predictions. 100% is perfect, but this is expected on such a tiny, clean dataset. In the real world, an accuracy of 85-95% is often considered very good.
Precision: Of all the times the model predicted "Positive", what percentage were actually positive?
Recall: Of all the actual "Positive" texts, what percentage did the model correctly identify?
F1-Score: A weighted average of Precision and Recall.
Limitations: Our dataset is extremely small. A real model would need thousands of examples to be reliable and generalize well to new, unseen text.
---
Step 8: Testing the Model on New Sentences
Let's see how our complete pipeline works on brand new text.
#RealWorldNLP #Inference
# Function to predict sentiment of a new sentence
def predict_sentiment(sentence):
# 1. Preprocess the text
processed_sentence = preprocess_text(sentence)
# 2. Vectorize the text using the SAME vectorizer
vectorized_sentence = vectorizer.transform([processed_sentence])
# 3. Make a prediction
prediction = model.predict(vectorized_sentence)
# 4. Return the result
return "Positive" if prediction[0] == 1 else "Negative"
# Test with new sentences
new_sentence_1 = "The movie was absolutely amazing!"
new_sentence_2 = "I was very bored and did not like it."
print(f"'{new_sentence_1}' -> Sentiment: {predict_sentiment(new_sentence_1)}")
print(f"'{new_sentence_2}' -> Sentiment: {predict_sentiment(new_sentence_2)}")
━━━━━━━━━━━━━━━
By: @CodeProgrammer ✨# Imports and Data
import re
from sklearn.model_selection import train_test_split
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.metrics import accuracy_score, classification_report
import nltk
from nltk.corpus import stopwords
# You may need to download stopwords for the first time
# nltk.download('stopwords')
# Sample Data (In a real project, load this from a file)
texts = [
"I love this movie, it's fantastic!",
"This was a terrible film.",
"The acting was superb and the plot was great.",
"I would not recommend this to anyone.",
"It was an okay movie, not the best but enjoyable.",
"Absolutely brilliant, a must-see!",
"A complete waste of time and money.",
"The story was compelling and engaging."
]
# Labels: 1 for Positive, 0 for Negative
labels = [1, 0, 1, 0, 1, 1, 0, 1]
---
Step 2: Text Preprocessing
Computers don't understand words, so we must clean and process our text data first. This involves making text lowercase, removing punctuation, and filtering out common "stop words" (like 'the', 'a', 'is') that don't add much meaning.
#TextPreprocessing #DataCleaning
# Text Preprocessing Function
stop_words = set(stopwords.words('english'))
def preprocess_text(text):
# Make text lowercase
text = text.lower()
# Remove punctuation
text = re.sub(r'[^\w\s]', '', text)
# Tokenize and remove stopwords
tokens = text.split()
filtered_tokens = [word for word in tokens if word not in stop_words]
return " ".join(filtered_tokens)
# Apply preprocessing to our dataset
processed_texts = [preprocess_text(text) for text in texts]
print("--- Original vs. Processed ---")
for i in range(3):
print(f"Original: {texts[i]}")
print(f"Processed: {processed_texts[i]}\n")
---
Step 3: Splitting the Data
We must split our data into a training set (to teach the model) and a testing set (to evaluate its performance on unseen data).
#MachineLearning #TrainTestSplit
# Splitting data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(
processed_texts,
labels,
test_size=0.25, # Use 25% of data for testing
random_state=42 # for reproducibility
)
print(f"Training samples: {len(X_train)}")
print(f"Testing samples: {len(X_test)}")
---
Step 4: Feature Extraction (Vectorization)
We need to convert our cleaned text into a numerical format. We'll use TF-IDF (Term Frequency-Inverse Document Frequency). This technique converts text into vectors of numbers, giving more weight to words that are important to a document but not common across all documents.
#FeatureEngineering #TFIDF #Vectorization# Creating a dictionary
student = {
"name": "Alex",
"age": 21,
"courses": ["Math", "CompSci"]
}
# Accessing values
print(f"Name: {student['name']}")
print(f"Age: {student.get('age')}")
# Safe access for a non-existent key
print(f"Major: {student.get('major', 'Not specified')}")
# --- Sample Output ---
# Name: Alex
# Age: 21
# Major: Not specified
• A dictionary is created using curly braces {} with key: value pairs.
• student['name'] accesses the value using its key. This will raise a KeyError if the key doesn't exist.
• student.get('age') is a safer way to access a value, returning None if the key is not found.
• .get() can also take a second argument as a default value to return if the key is missing.
2. Modifying a Dictionary
user_profile = {
"username": "coder_01",
"level": 5
}
# Add a new key-value pair
user_profile["email"] = "coder@example.com"
print(f"After adding: {user_profile}")
# Update an existing value
user_profile["level"] = 6
print(f"After updating: {user_profile}")
# Remove a key-value pair
del user_profile["email"]
print(f"After deleting: {user_profile}")
# --- Sample Output ---
# After adding: {'username': 'coder_01', 'level': 5, 'email': 'coder@example.com'}
# After updating: {'username': 'coder_01', 'level': 6, 'email': 'coder@example.com'}
# After deleting: {'username': 'coder_01', 'level': 6}
• A new key-value pair is added using simple assignment dict[new_key] = new_value.
• The value of an existing key is updated by assigning a new value to it.
• The del keyword completely removes a key-value pair from the dictionary.
3. Looping Through Dictionaries
inventory = {
"apples": 430,
"bananas": 312,
"oranges": 525
}
# Loop through keys
print("--- Keys ---")
for item in inventory.keys():
print(item)
# Loop through values
print("\n--- Values ---")
for quantity in inventory.values():
print(quantity)
# Loop through key-value pairs
print("\n--- Items ---")
for item, quantity in inventory.items():
print(f"{item}: {quantity}")
# --- Sample Output ---
# --- Keys ---
# apples
# bananas
# oranges
#
# --- Values ---
# 430
# 312
# 525
#
# --- Items ---
# apples: 430
# bananas: 312
# oranges: 525
• .keys() returns a view object of all keys, which can be looped over.
• .values() returns a view object of all values.
• .items() returns a view object of key-value tuple pairs, allowing you to easily access both in each loop iteration.
#Python #DataStructures #Dictionaries #Programming #PythonBasics
━━━━━━━━━━━━━━━
By: @CodeProgrammer ✨from tensorflow import keras
from tensorflow.keras import layers
# Define a Sequential model
model = keras.Sequential([
# Input layer with 64 neurons, expecting flat input data
layers.Dense(64, activation="relu", input_shape=(784,)),
# A hidden layer with 32 neurons
layers.Dense(32, activation="relu"),
# Output layer with 10 neurons for 10-class classification
layers.Dense(10, activation="softmax")
])
model.summary()
• Model Definition: keras.Sequential creates a simple, layer-by-layer model.
• layers.Dense is a standard fully-connected layer. The first layer must specify the input_shape.
• activation functions like "relu" introduce non-linearity, while "softmax" is used on the output layer for multi-class classification to produce probabilities.
# (Continuing from the previous step)
model.compile(
optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy']
)
print("Model compiled successfully.")
• Compilation: .compile() configures the model for training.
• optimizer is the algorithm used to update the model's weights (e.g., 'adam' is a popular choice).
• loss is the function the model tries to minimize during training. sparse_categorical_crossentropy is common for integer-based classification labels.
• metrics are used to monitor the training and testing steps. Here, we track accuracy.
import numpy as np
# Create dummy training data
x_train = np.random.random((1000, 784))
y_train = np.random.randint(10, size=(1000,))
# Train the model
history = model.fit(
x_train,
y_train,
epochs=5,
batch_size=32,
verbose=0 # Hides the progress bar for a cleaner output
)
print(f"Training complete. Final accuracy: {history.history['accuracy'][-1]:.4f}")
# Output (will vary):
# Training complete. Final accuracy: 0.4570
• Training: The .fit() method trains the model on your data.
• x_train and y_train are your input features and target labels.
• epochs defines how many times the model will see the entire dataset.
• batch_size is the number of samples processed before the model is updated.
# Create a single dummy sample to test
x_test = np.random.random((1, 784))
# Get the model's prediction
predictions = model.predict(x_test)
predicted_class = np.argmax(predictions[0])
print(f"Predicted class: {predicted_class}")
print(f"Confidence scores: {predictions[0].round(2)}")
# Output (will vary):
# Predicted class: 3
# Confidence scores: [0.09 0.1 0.1 0.12 0.1 0.09 0.11 0.1 0.09 0.1 ]
• Prediction: .predict() is used to make predictions on new, unseen data.
• For a classification model with a softmax output, this returns an array of probabilities for each class.
• np.argmax() is used to find the index (the class) with the highest probability score.
#Keras #TensorFlow #DeepLearning #MachineLearning #Python
━━━━━━━━━━━━━━━
By: @CodeProgrammer ✨f or F.
1. Basic Variable and Expression Embedding
name = "Alice"
quantity = 5
print(f"Hello, {name}. You have {quantity * 2} items in your cart.")
# Output: Hello, Alice. You have 10 items in your cart.
• Place variables or expressions directly inside curly braces {}. Python evaluates the expression and inserts the result into the string.
2. Number Formatting
Control the appearance of numbers, such as padding with zeros or setting decimal precision.
pi_value = 3.14159
order_id = 42
print(f"Pi: {pi_value:.2f}")
print(f"Order ID: {order_id:04d}")
# Output:
# Pi: 3.14
# Order ID: 0042
• :.2f formats the float to have exactly two decimal places.
• :04d formats the integer to be at least 4 digits long, padding with leading zeros if necessary.
3. Alignment and Padding
Align text within a specified width, which is useful for creating tables or neatly formatted output.
item = "Docs"
print(f"|{item:<10}|") # Left-aligned
print(f"|{item:^10}|") # Center-aligned
print(f"|{item:>10}|") # Right-aligned
# Output:
# |Docs |
# | Docs |
# | Docs|
• Use < for left, ^ for center, and > for right alignment, followed by the total width.
4. Date and Time Formatting
Directly format datetime objects within an f-string.
from datetime import datetime
now = datetime.now()
print(f"Current time: {now:%Y-%m-%d %H:%M}")
# Output: Current time: 2023-10-27 14:30
• Use a colon : followed by standard strftime formatting codes to display dates and times as you wish.
#Python #Programming #CodingTips #FStrings #PythonTips
━━━━━━━━━━━━━━━
By: @CodeProgrammer ✨True/False values based on a condition. Applying this mask to your original array instantly selects only the elements where the mask is True, which is significantly faster.
import numpy as np
# Create an array of data
data = np.array([10, 55, 8, 92, 43, 77, 15])
# Create a boolean mask for values greater than 50
high_values_mask = data > 50
# Use the mask to select elements
filtered_data = data[high_values_mask]
print(filtered_data)
# Output: [55 92 77]
Code explanation: A NumPy array data is created. Then, a boolean array high_values_mask is generated, which is True for every element in data greater than 50. This mask is used as an index to efficiently extract and print only those matching elements from the original array.
#Python #NumPy #DataScience #CodingTips #Programming
━━━━━━━━━━━━━━━
By: @CodeProgrammer ✨
Available now! Telegram Research 2025 — the year's key insights 
