ar
Feedback
Data Science & Machine Learning

Data Science & Machine Learning

الذهاب إلى القناة على Telegram

Join this channel to learn data science, artificial intelligence and machine learning with funny quizzes, interesting projects and amazing resources for free For collaborations: @love_data

إظهار المزيد

📈 نظرة تحليلية على قناة تيليجرام Data Science & Machine Learning

تُعد قناة Data Science & Machine Learning (@datasciencefun) في القطاع اللغوي الإنكليزية لاعباً نشطاً. يضم المجتمع حالياً 75 837 مشتركاً، محتلاً المرتبة 2 107 في فئة التعليم والمرتبة 4 219 في منطقة الهند.

📊 مؤشرات الجمهور والحراك

منذ تأسيسه في невідомо، حقق المشروع نمواً سريعاً وجمع 75 837 مشتركاً.

بحسب آخر البيانات بتاريخ 22 يونيو, 2026، تحافظ القناة على نشاط مستقر. خلال آخر 30 يوماً تغيّر عدد الأعضاء بمقدار 728، وفي آخر 24 ساعة بمقدار -2، مع بقاء الوصول العام مرتفعاً.

  • حالة التحقق: غير موثّقة
  • معدل التفاعل (ER): يبلغ متوسط تفاعل الجمهور 3.00‎%. وخلال أول 24 ساعة من النشر يحصد المحتوى عادةً 1.05‎% من ردود الفعل نسبةً إلى إجمالي المشتركين.
  • وصول المنشورات: يحصل كل منشور على متوسط 2 278 مشاهدة. وخلال اليوم الأول يجمع عادةً 794 مشاهدة.
  • التفاعلات والاستجابة: يتفاعل الجمهور بانتظام؛ متوسط التفاعلات لكل منشور يبلغ 3.
  • الاهتمامات الموضوعية: يركز المحتوى على مواضيع رئيسية مثل learning, accuracy, distribution, panda, dataset.

📝 الوصف وسياسة المحتوى

يصف المؤلف القناة بأنها مساحة للتعبير عن الآراء الذاتية:
Join this channel to learn data science, artificial intelligence and machine learning with funny quizzes, interesting projects and amazing resources for free For collaborations: @love_data

بفضل وتيرة التحديث المرتفعة (أحدث البيانات بتاريخ 23 يونيو, 2026) تحافظ القناة على حداثتها ومستوى وصول مرتفع. وتُظهر التحليلات تفاعلاً نشطاً من الجمهور، ما يجعلها نقطة تأثير مهمة ضمن فئة التعليم.

75 837
المشتركون
-224 ساعات
+637 أيام
+72830 أيام
أرشيف المشاركات
How do we know how many trees we need in random forest? The number of trees in random forest is worked by n_estimators, and a random forest reduces overfitting by increasing the number of trees. There is no fixed thumb rule to decide the number of trees in a random forest, it is rather fine tuned with the data, typically starting off by taking the square of the number of features (n) present in the data followed by tuning until we get the optimal results.

How do we select the depth of the trees in random forest? The greater the depth, the greater amount of information is extracted from the tree, however, there is a limit to this, and the algorithm even if defensive against overfitting may learn complex features of noise present in data and as a result, may overfit on noise. Hence, there is no hard thumb rule in deciding the depth, but literature suggests a few tips on tuning the depth of the tree to prevent overfitting: • limit the maximum depth of a tree • limit the number of test nodes • limit the minimum number of objects at a node required to split • do not split a node when, at least, one of the resulting subsample sizes is below a given threshold • stop developing a node if it does not sufficiently improve the fit.

What are the main parameters of the random forest model? max_depth: Longest Path between root node and the leaf min_sample_split: The minimum number of observations needed to split a given node max_leaf_nodes: Conditions the splitting of the tree and hence, limits the growth of the trees min_samples_leaf: minimum number of samples in the leaf node n_estimators: Number of trees max_sample: Fraction of original dataset given to any individual tree in the given model max_features: Limits the maximum number of features provided to trees in random forest model

How do we handle categorical variables in decision trees? Some decision tree algorithms can handle categorical variables out of the box, others cannot. However, we can transform categorical variables, e.g. with a binary or a one-hot encoder.

When do we need to perform feature normalization for linear models? When it’s okay not to do it? Feature normalization is necessary for L1 and L2 regularizations. The idea of both methods is to penalize all the features relatively equally. This can't be done effectively if every feature is scaled differently. Linear regression without regularization techniques can be used without feature normalization. Also, regularization can help to make the analytical solution more stable, — it adds the regularization matrix to the feature matrix before inverting it.

What is the area under the PR curve? Is it a useful metric? The Precision-Recall AUC is just like the ROC AUC, in that it summarizes the curve with a range of threshold values as a single score. A high area under the curve represents both high recall and high precision, where high precision relates to a low false positive rate, and high recall relates to a low false negative rate.

Udemy - algorithms-and-data-structures-in-python.rar1407.14 MB

What is bag of words? How we can use it for text classification? Bag of Words is a representation of text that describes the occurrence of words within a document. The order or structure of the words is not considered. For text classification, we look at the histogram of the words within the text and consider each word count as a feature.

What is clustering? When do we need it? Clustering algorithms group objects such that similar feature points are put into the same groups (clusters) and dissimilar feature points are put into different clusters.

One of the best hands-on ML books

75557ed5ca14970

Learn how Google-DeepMind Reinforcement Learning Works for FREE!! You will also get Free Certificate after completing this li
Learn how Google-DeepMind Reinforcement Learning Works for FREE!! You will also get Free Certificate after completing this live session Link 👇 https://bit.ly/3haoVjl

Bayesian_Statistics_The_Fun_Way_Understanding_Statistics_And_Probability.pdf6.27 MB

machine-learning-cheat-sheet.pdf1.87 MB

Predict stock price using Time Series Analysis 👇👇 https://bit.ly/3yOv4qR In this live session, you will understand how to work with historical data about the stock prices and how to implement  machine learning algorithms to predict the future stock price. You will understand neural networks, time series and LSTM. You will also get Certificate after completing this Free Live session

Python Cheat Sheet - 58 Pages.pdf1.70 MB

Introduction to Deep Learning-2019.pdf16.33 MB

Hands-On Unsupervised Learning Using Python.pdf5.63 MB

Python for Data Science Course @datasciencefun