en
Feedback
Data Science & Machine Learning

Data Science & Machine Learning

Open in Telegram

Join this channel to learn data science, artificial intelligence and machine learning with funny quizzes, interesting projects and amazing resources for free For collaborations: @love_data

Show more

๐Ÿ“ˆ Analytical overview of Telegram channel Data Science & Machine Learning

Channel Data Science & Machine Learning (@datasciencefun) in the English language segment is an active participant. Currently, the community unites 75 831 subscribers, ranking 2 106 in the Education category and 4 234 in the India region.

๐Ÿ“Š Audience metrics and dynamics

Since its creation on ะฝะตะฒั–ะดะพะผะพ, the project has demonstrated rapid growth, gathering an audience of 75 831 subscribers.

According to the latest data from 21 June, 2026, the channel demonstrates stable activity. Although there has been a change in the number of participants by 770 over the last 30 days and by 8 over the last 24 hours, overall reach remains high.

  • Verification status: Not verified
  • Engagement rate (ER): The average audience engagement rate is 3.15%. Within the first 24 hours after publication, content typically collects 1.09% reactions from the total number of subscribers.
  • Post reach: On average, each post receives 2 385 views. Within the first day, a publication typically gains 827 views.
  • Reactions and interaction: The audience actively supports content: the average number of reactions per post is 3.
  • Thematic interests: Content is focused on key topics such as learning, accuracy, distribution, panda, dataset.

๐Ÿ“ Description and content policy

The author describes the resource as a platform for expressing subjective opinions:
โ€œJoin this channel to learn data science, artificial intelligence and machine learning with funny quizzes, interesting projects and amazing resources for free For collaborations: @love_dataโ€

Thanks to the high frequency of updates (latest data received on 22 June, 2026), the channel maintains relevance and a high level of publication reach. Analytics show that the audience actively interacts with content, making it an important point of influence in the Education category.

75 831
Subscribers
+824 hours
+717 days
+77030 days
Posts Archive
Which of the following is not a feature selection technique?
Anonymous voting

Which of the following method/s can be used to handle missing values?
Anonymous voting

DATA SCIENCE INTERVIEW QUESTIONS [PART -15] ๐1. ๐ƒ๐ž๐š๐ฅ ๐ฐ๐ข๐ญ๐ก ๐ฎ๐ง๐›๐š๐ฅ๐š๐ง๐œ๐ž๐ ๐›๐ข๐ง๐š๐ซ๐ฒ ๐œ๐ฅ๐š๐ฌ๐ฌ๐ข๐Ÿ๐ข๐œ๐š๐ญ๐ข๐จ๐ง? ๐€ns. Techniques to Handle unbalanced Data: 1. Use the right evaluation metricsย  2. Use K-fold Cross-Validation in the right wayย  3. Ensemble different resampled datasetsย  4. Resample with different ratiosย  5. Design your own models ๐2. ๐€๐œ๐ญ๐ข๐ฏ๐š๐ญ๐ข๐จ๐ง ๐Ÿ๐ฎ๐ง๐œ๐ญ๐ข๐จ๐ง? ๐€ns. Activation functionsย are mathematical equations that determine the output of a neural network model. Itย is a non-linear transformation that we do over the input before sending it to the next layer of neurons or finalizing it as output.ย  ๐3. ๐ƒ๐ข๐ฆ๐ž๐ง๐ฌ๐ข๐จ๐ง ๐ซ๐ž๐๐ฎ๐œ๐ญ๐ข๐จ๐ง? ๐€ns. Dimensionality Reduction is used to reduce the feature space with consideration by a set of principal features. ๐4. ๐–๐ก๐ฒ ๐ข๐ฌ ๐ฆ๐ž๐š๐ง ๐ฌ๐ช๐ฎ๐š๐ซ๐ž ๐ž๐ซ๐ซ๐จ๐ซ ๐š ๐›๐š๐ ๐ฆ๐ž๐š๐ฌ๐ฎ๐ซ๐ž ๐จ๐Ÿ ๐ฆ๐จ๐๐ž๐ฅ ๐ฉ๐ž๐ซ๐Ÿ๐จ๐ซ๐ฆ๐š๐ง๐œ๐ž? ๐€ns. Mean Squared Error (MSE) gives a relatively high weight to large errors โ€” therefore, MSE tends to put too much emphasis on large deviations. ๐5. ๐‘๐ž๐ฆ๐จ๐ฏ๐ž ๐ฆ๐ฎ๐ฅ๐ญ๐ข๐œ๐จ๐ฅ๐ฅ๐ข๐ง๐ž๐š๐ซ๐ข๐ญ๐ฒ? ๐€ns. To remove multicollinearities, we can do two things.ย  1. We can create new featuresย  2. remove them from our data. ๐6. ๐ฅ๐จ๐ง๐ -๐ญ๐š๐ข๐ฅ๐ž๐ ๐๐ข๐ฌ๐ญ๐ซ๐ข๐›๐ฎ๐ญ๐ข๐จ๐ง ? ๐€ns. Aย long tailย distributionย of numbers is a kind of distribution having many occurrences far from the "head" or central part of the distribution. Most of occurrencesย in this kind of distributions occurs at early frequencies/valuesย of x-axis. ๐7. ๐Ž๐ฎ๐ญ๐ฅ๐ข๐ž๐ซ? ๐ƒ๐ž๐š๐ฅ ๐ฐ๐ข๐ญ๐ก ๐ข๐ญ? ๐€ns. An outlier is an object that deviates significantly from the rest of the objects. They can be caused by measurement or execution error.ย  Removing outliers is legitimate only for specific reasons. Outliers can be very informative about the subject-area and data collection process. If the outlier does not change the results but does affect assumptions, you may drop the outlier. Or just trim the data set, but replace outliers with the nearest โ€œgoodโ€ data, as opposed to truncating them completely. ๐8. ๐„๐ฑ๐š๐ฆ๐ฉ๐ฅ๐ž ๐ฐ๐ก๐ž๐ซ๐ž ๐ญ๐ก๐ž ๐ฆ๐ž๐๐ข๐š๐ง ๐ข๐ฌ ๐š ๐›๐ž๐ญ๐ญ๐ž๐ซ ๐ฆ๐ž๐š๐ฌ๐ฎ๐ซ๐ž ๐ญ๐ก๐š๐ง ๐ญ๐ก๐ž ๐ฆ๐ž๐š๐ง ? ๐€ns. If your data contains outliers, then you would typically rather use the median because otherwise the value of the mean would be dominated by the outliers rather than the typical values. In conclusion, if you are considering the mean, check your data for outliers, if any then better choose median. ENJOY LEARNING ๐Ÿ‘๐Ÿ‘

Data Science and Analytics with Python - Jesus Rogel-Salazar.pdf31.55 MB

๐“๐จ๐๐š๐ฒ'๐ฌ ๐ข๐ง๐ญ๐ž๐ซ๐ฏ๐ข๐ž๐ฐ ๐๐ฎ๐ž๐ฌ๐ญ ๐ ๐€๐ง๐ฌ DATA SCIENCE INTERVIEW QUESTIONS [PART - 14] ๐1. ๐…๐ž๐š๐ญ๐ฎ๐ซ๐ž ๐ฌ๐ž๐ฅ๐ž๐œ๐ญ๐ข๐จ๐ง ๐ฆ๐ž๐ญ๐ก๐จ๐๐ฌ ๐Ÿ๐จ๐ซ ๐ฌ๐ž๐ฅ๐ž๐œ๐ญ๐ข๐ง๐  ๐ญ๐ก๐ž ๐ซ๐ข๐ ๐ก๐ญ ๐ฏ๐š๐ซ๐ข๐š๐›๐ฅ๐ž๐ฌ ๐Ÿ๐จ๐ซ ๐›๐ฎ๐ข๐ฅ๐๐ข๐ง๐  ๐ž๐Ÿ๐Ÿ๐ข๐œ๐ข๐ž๐ง๐ญ ๐ฉ๐ซ๐ž๐๐ข๐œ๐ญ๐ข๐ฏ๐ž ๐ฆ๐จ๐๐ž๐ฅ๐ฌ? Ans. Some of the Feature selection techniques are: Information Gain, Chi-square test, Correlation Coefficient, Mean Absolute Difference (MAD), Exhaustive selection, Forward selection, Regularization. ๐2. ๐“๐ซ๐ž๐š๐ญ ๐ฆ๐ข๐ฌ๐ฌ๐ข๐ง๐  ๐ฏ๐š๐ฅ๐ฎ๐ž๐ฌ? Ans. They are: 1. List wise or case deletion 2. Pairwise deletion 3. Mean substitution 4. Regression imputation 5. Maximum likelihood. ๐3. ๐š๐ฌ๐ฌ๐ฎ๐ฆ๐ฉ๐ญ๐ข๐จ๐ง๐ฌ ๐ฎ๐ฌ๐ž๐ ๐ข๐ง ๐ฅ๐ข๐ง๐ž๐š๐ซ ๐ซ๐ž๐ ๐ซ๐ž๐ฌ๐ฌ๐ข๐จ๐ง? ๐–๐ก๐š๐ญ ๐ฐ๐จ๐ฎ๐ฅ๐ ๐ก๐š๐ฉ๐ฉ๐ž๐ง ๐ข๐Ÿ ๐ญ๐ก๐ž๐ฒ ๐š๐ซ๐ž ๐ฏ๐ข๐จ๐ฅ๐š๐ญ๐ž๐? Ans. 1. Linear relationship. 2. Multivariate normality. 3. no or little multicollinearity. 4. no auto-correlation. 5. Homoscedasticity. Data to be analyzed by linear regression were sampled violate one or more of the linear regression assumptions, the results of the analysis may be incorrect or misleading. ๐4. ๐‡๐จ๐ฐ ๐ข๐ฌ ๐ญ๐ก๐ž ๐ ๐ซ๐ข๐ ๐ฌ๐ž๐š๐ซ๐œ๐ก ๐ฉ๐š๐ซ๐š๐ฆ๐ž๐ญ๐ž๐ซ ๐๐ข๐Ÿ๐Ÿ๐ž๐ซ๐ž๐ง๐ญ ๐Ÿ๐ซ๐จ๐ฆ ๐ญ๐ก๐ž ๐ซ๐š๐ง๐๐จ๐ฆ ๐ฌ๐ž๐š๐ซ๐œ๐ก ๐ญ๐ฎ๐ง๐ข๐ง๐  ๐ฌ๐ญ๐ซ๐š๐ญ๐ž๐ ๐ฒ? Ans. Random search differs from grid search in that we no longer provide an explicit set of possible values for each hyperparameter; rather, we provide a statistical distribution for each hyperparameter from which values are sampled. Essentially, we define a sampling distribution for each hyperparameter to carry out a randomized search. ๐5. ๐ˆ๐ฌ ๐ข๐ญ ๐ ๐จ๐จ๐ ๐ญ๐จ ๐๐จ ๐๐ข๐ฆ๐ž๐ง๐ฌ๐ข๐จ๐ง๐š๐ฅ๐ข๐ญ๐ฒ ๐ซ๐ž๐๐ฎ๐œ๐ญ๐ข๐จ๐ง ๐›๐ž๐Ÿ๐จ๐ซ๐ž ๐Ÿ๐ข๐ญ๐ญ๐ข๐ง๐  ๐š ๐’๐ฎ๐ฉ๐ฉ๐จ๐ซ๐ญ ๐•๐ž๐œ๐ญ๐จ๐ซ ๐Œ๐จ๐๐ž๐ฅ? ๐€ns. Support Vector Machine Learning Algorithm performs better in the reduced space. It is beneficial to perform dimensionality reduction before fitting an SVM if the number of features is large when compared to the number of observations. ๐6. ๐‘๐Ž๐‚ ๐‚๐ฎ๐ซ๐ฏ๐ž? Ans ROC curveย (receiver operating characteristic curve) is a graph showing the performance of a classification model at all classification thresholds.ย  ENJOY LEARNING ๐Ÿ‘๐Ÿ‘

Grokking Deep Learning by Andrew W. Trask (z-lib.org).pdf13.90 MB

Which of the following is used to read csv file in python using pandas? import pandas as pd
Anonymous voting

Working with Dates in Pandas.pdf

DATA SCIENCE INTERVIEW QUESTIONS [ PART - 13] ๐1. ๐‡๐จ๐ฐ ๐ญ๐จ ๐ข๐๐ž๐ง๐ญ๐ข๐Ÿ๐ฒ ๐š ๐œ๐š๐ฎ๐ฌ๐ž ๐ฏ๐ฌ. ๐š ๐œ๐จ๐ซ๐ซ๐ž๐ฅ๐š๐ญ๐ข๐จ๐ง? ๐†๐ข๐ฏ๐ž ๐ž๐ฑ๐š๐ฆ๐ฉ๐ฅ๐ž๐ฌ. Ans. While causation and correlation can exist at the same time, correlation does not imply causation. Causation explicitly applies to cases where action A causes outcome B. On the other hand, correlation is simply a relationship. Correlation between Ice cream sales and sunglasses sold. As the sales of ice creams is increasing so do the sales of sunglasses. Causation takes a step further than correlation. ๐2. ๐ฉ๐ซ๐ž๐œ๐ข๐ฌ๐ข๐จ๐ง, ๐š๐œ๐œ๐ฎ๐ซ๐š๐œ๐ฒ ๐š๐ง๐ ๐ซ๐ž๐œ๐š๐ฅ๐ฅ? Ans. The recall is the ratio of the relevant results returned by the search engine to the total number of the relevant results that could have been returned. The precision is the proportion of relevant results in the list of all returned search results. Accuracy is the measurement used to determine which model is best at identifying relationships and patterns between variables in a dataset based on the input, or training, data. ๐3. ๐œ๐ก๐จ๐จ๐ฌ๐ž ๐ค ๐ข๐ง ๐ค-๐ฆ๐ž๐š๐ง๐ฌ? Ans. There is a popular method known as elbow method which is used to determine the optimal value of K to perform the K-Means Clustering Algorithm. The basic idea behind this method is that it plots the various values of cost with changing k. As the value of K increases, there will be fewer elements in the cluster. ๐4. ๐ฐ๐จ๐ซ๐2๐ฏ๐ž๐œ ๐ฆ๐ž๐ญ๐ก๐จ๐๐ฌ? Ans. Word2vec is a technique for natural language processing published in 2013. The word2vec algorithm uses a neural network model to learn word associations from a large corpus of text. Once trained, such a model can detect synonymous words or suggest additional words for a partial sentence. ๐5. P๐ซ๐ฎ๐ง๐ข๐ง๐  ๐ข๐ง ๐œ๐š๐ฌ๐ž ๐จ๐Ÿ ๐๐ž๐œ๐ข๐ฌ๐ข๐จ๐ง ๐ญ๐ซ๐ž๐ž๐ฌ? Ans. Pruning is a data compression technique in machine learning and search algorithms that reduces the size of decision trees by removing sections of the tree that are non-critical and redundant to classify instances. ENJOY LEARNING ๐Ÿ‘๐Ÿ‘

Which of the following is not a machine learning algorithm?
Anonymous voting

Which of the following maybe involved in the data science project?
Anonymous voting

Adaptive_Computation_and_Machine_Learning_Ralf_Herbrich_Learning.pdf2.69 MB

Play with graphs by Amit aggarwal.pdf61.05 MB

DATA SCIENCE INTERVIEW QUESTIONS [PART -12] Q. What are Entropy and Information gain in Decision tree algorithm? A. Entropy is a measure of impurity or uncertainty in a set of data used in information theory. It determines how data is split by a decision tree. The quantity of information improved in the nodes before splitting them for making subsequent judgments can be characterized as the information obtained in the decision tree. Q. What Will Happen If the Learning Rate Is Set inaccurately (Too Low or Too High)? A. A high learning rate in gradient descent will cause the learning to jump over global minima, whereas a low learning rate will cause the learning to take too long to converge or become stuck in an unwanted local minimum. Q. What is meant by โ€˜curse of dimensionalityโ€™? A. The problem produced by the exponential rise in volume associated with adding extra dimensions to Euclidean space is known as the "curse of dimensionality." The curse of dimensionality states that as the number of characteristics grows, the error grows as well. It refers to the fact that high-dimensional algorithms are more difficult to build and often have a running duration that is proportional to the dimensions. A higher number of dimensions theoretically allows for more information to be stored, but in practice, it rarely helps because real-world data contains more noise and redundancy. Q. Difference between remove, del and pop? A. remove function removes the first matching value/object. It does not do anything with the indexing. del function removes the item at a specific index. And pop removes the item at a specific index and returns it. ENJOY LEARNING ๐Ÿ‘๐Ÿ‘

Guys, I'm recommending you ! Grab all above Courses ๐Ÿš€๐Ÿ”ฅ Limited time deal ๐Ÿ’ฏ

Today's Question -ย ย What are some ways I can make my model more robust to outliers? There are several ways to make a model more robust to outliers, from different points of view (data preparation or model building). An outlier in the question and answer is assumed being unwanted, unexpected, or a must-be-wrong value to the humanโ€™s knowledge so far (e.g. no one is 200 years old) rather than a rare event which is possible but rare. Outliers are usually defined in relation to the distribution. Thus outliers could be removed in the pre-processing step (before any learning step), by using standard deviationsย (Mean +/- 2*SD), it can be used for normality. Or interquartile rangesย Q1 - Q3,ย Q1ย - is the "middle" value in the first half of the rank-ordered data set,ย Q3ย - is the "middle" value in the second half of the rank-ordered data set. It can be used for not normal/unknown as threshold levels. Moreover, data transformation (e.g. log transformation) may help if data have a noticeable tail. When outliers related to the sensitivity of the collecting instrument which may not precisely record small values, Winsorization may be useful. This type of transformation has the same effect as clipping signals (i.e. replaces extreme data values with less extreme values). Another option to reduce the influence of outliers is using mean absolute difference rather mean squared error. For model building, some models are resistant to outliers (e.g. tree-based approaches) or non-parametric tests. Similar to the median effect, tree models divide each node into two in each split. Thus, at each split, all data points in a bucket could be equally treated regardless of extreme values they may have.

Machine Learning for Humans.pdf14.76 MB

In which algorithm target variable isn't required
Anonymous voting

Data Science Interview Questions [Part - 11] Q1.ย ย Difference between R square and Adjusted R Square. Ans. One main difference between R2ย and the adjusted R2: R2ย assumes that every single variable explains theย variation inย the dependent variable. The adjusted R2ย tells you the percentage of variation explained by only the independent variables that actually affect the dependent variable. Q2.ย Difference between Precision and Recall. Ans.ย ย When it comes to precision we're talking about theย true positives over the true positives plus the false positives. As opposed to recall which is the number of true positives over the true positives and the false negatives. Q3. ย Assumptions of Linear Regression. Ans.ย ย There are four assumptions associated with a linear regression model:ย Linearity: The relationship between X and the mean of Y is linear. Homoscedasticity: The variance of residual is the same for any value of X. Independence: Observations are independent of each other. The fourth one is normality. Q4.ย Difference between Random Forest and Decision Tree. Ans.ย A decision tree combines some decisions, whereasย a random forest combines several decision trees. Thus, it is a long process, yet slow. Whereas, a decision tree is fast and operates easily on large data sets, especially the linear one. The random forest model needs rigorous training. Q5. How does K-means work? Ans.ย ย K-means clusteringย uses โ€œcentroidsโ€, K different randomly-initiated points in the data, and assigns every data point to the nearest centroid. After every point has been assigned, the centroid is moved to the average of all of the points assigned to it. Q6.ย  How do you generally choose among different classification models to decide which one is performing the best? Ans.ย Here are some important considerations while choosing an algorithm: Size of the training data,ย Accuracy and/or Interpretability of the output,ย Speed or Training time,ย Linearity andย number of features. Q7.ย How do you perform feature selection? Ans.ย Unsupervised: Do not use the target variable (e.g. remove redundant variables). Correlation.ย  Supervised: Use the target variable (e.g. remove irrelevant variables). Wrapper: Search for well-performing subsets of features. RFE. Q8.ย What is an intercept in a Linear Regression? What is its significance? Ans.ย The intercept (often labeled as constant) isย the point where the function crosses the y-axis. In some analysis, the regression model only becomes significant when we remove the intercept, and the regression line reduces to Y = b*X + error.ย  The intercept (often labeled the constant) isย the expected mean value of Y when all X="0. Start with a regression equation with one predictor, X. If X sometimes equals 0, the intercept is simply the expected mean value of Y at that value. If X never equals 0, then the intercept has no intrinsic meaning. ENJOY LEARNING ๐Ÿ‘๐Ÿ‘

Do you want daily quiz to enhance your knowledge?
Anonymous voting