es
Feedback
Data Science & Machine Learning

Data Science & Machine Learning

Ir al canal en Telegram

Join this channel to learn data science, artificial intelligence and machine learning with funny quizzes, interesting projects and amazing resources for free For collaborations: @love_data

Mostrar más

📈 Análisis del canal de Telegram Data Science & Machine Learning

El canal Data Science & Machine Learning (@datasciencefun) en el segmento lingüístico de Inglés es un actor destacado. Actualmente la comunidad reúne a 75 837 suscriptores, ocupando la posición 2 107 en la categoría Educación y el puesto 4 219 en la región India.

📊 Métricas de audiencia y dinámica

Desde su creación el невідомо, el proyecto ha mostrado un crecimiento acelerado, reuniendo a 75 837 suscriptores.

Según los últimos datos del 22 junio, 2026, el canal mantiene una actividad estable. En los últimos 30 días la variación de miembros fue de 728, y en las últimas 24 horas de -2, conservando un alto alcance.

  • Estado de verificación: No verificado
  • Tasa de interacción (ER): El promedio de interacción de la audiencia es 3.00%. Durante las primeras 24 horas tras publicar, el contenido suele obtener 1.05% de reacciones respecto al total de suscriptores.
  • Alcance de las publicaciones: Cada publicación recibe en promedio 2 278 visualizaciones. En el primer día suele acumular 794 visualizaciones.
  • Reacciones e interacción: La audiencia responde de forma activa: el promedio de reacciones por publicación es 3.
  • Intereses temáticos: El contenido se centra en temas clave como learning, accuracy, distribution, panda, dataset.

📝 Descripción y política de contenido

El autor describe el recurso como un espacio para expresar opiniones subjetivas:
Join this channel to learn data science, artificial intelligence and machine learning with funny quizzes, interesting projects and amazing resources for free For collaborations: @love_data

Gracias a la alta frecuencia de actualizaciones (últimos datos recibidos el 23 junio, 2026), el canal mantiene la vigencia y un amplio alcance. La analítica demuestra que la audiencia interactúa activamente con el contenido, lo que lo convierte en un punto de referencia dentro de la categoría Educación.

75 837
Suscriptores
-224 horas
+637 días
+72830 días
Archivo de publicaciones
automatetheboringstuffwithpython.pdf14.19 MB

HOLD & EARN We are providing our community a tremendous amount of interest and high value's simply and easily, We are continually evolving to make ourselves better.

We are 4k+ now 😊 All credits to this person for supporting us a lot and being a part of us. We wish we get his/her support forever. SHARE AND SUPPORT US❤️

Hands_On_Machine_Learning_with_Scikit_Learn,_Keras,_and_TensorFlow.pdf72.75 MB

Deep Learning - John D. Kelleher (The MIT Press, 2019)

Data Science - John D. Kelleher, Brendan Tierney (The MIT Press, 2018)

Python 3 for Machine Learning - Oswald Campesato (Mercury Learning, 2020)

Artificial Intelligence - An Illustrated History - From Medieval Robots to Neural Networks - Clifford A. Pickover (Sterling, 2019)

How to set the learning rate? There is no straightforward way of finding an optimum learning rate for a model. It involves a lot of hit and trial. Usually starting with a small values such as 0.01 is a good starting point for setting a learning rate and further tweaking it so that it doesn't overshoot or converge too slowly.

What happens when the learning rate is too large? Too small? A large learning rate can accelerate the training. However, it is possible that we "shoot" too far and miss the minimum of the function that we want to optimize, which will not result in the best solution. On the other hand, training with a small learning rate takes more time but it is possible to find a more precise minimum. The downside can be that the solution is stuck in a local minimum, and the weights won't update even if it is not the best possible global solution.

What’s the learning rate? The learning rate is an important hyperparameter that controls how quickly the model is adapted to the problem during the training. It can be seen as the "step width" during the parameter updates, i.e. how far the weights are moved into the direction of the minimum of our optimization problem.

How do we use SGD (stochastic gradient descent) for training a neural net? SGD approximates the expectation with few randomly selected samples (instead of the full data). In comparison to batch gradient descent, we can efficiently approximate the expectation in large data sets using SGD. For neural networks this reduces the training time a lot even considering that it will converge later as the random sampling adds noise to the gradient descent.

Which optimization techniques for training neural nets do you know? Gradient Descent Stochastic Gradient Descent Mini-Batch Gradient Descent(best among gradient descents) Nesterov Accelerated Gradient Momentum Adagrad AdaDelta Adam(best one. less time, more efficient)

What is backpropagation? How does it work? Why do we need it? The Backpropagation algorithm looks for the minimum value of the error function in weight space using a technique called the delta rule or gradient descent. The weights that minimize the error function is then considered to be a solution to the learning problem. We need backpropogation because, Calculate the error – How far is your model output from the actual output. Minimum Error – Check whether the error is minimized or not. Update the parameters – If the error is huge then, update the parameters (weights and biases). After that again check the error. Repeat the process until the error becomes minimum. Model is ready to make a prediction – Once the error becomes minimum, you can feed some inputs to your model and it will produce the output.

What if we set all the weights of a neural network to 0? If all the weights of a neural network are set to zero, the output of each connection is same (W*x = 0). This means the gradients which are backpropagated to each connection in a layer is same. This means all the connections/weights learn the same thing, and the model never converges.

How we can initialize the weights of a neural network? Proper initialization of weight matrix in neural network is very necessary. Simply we can say there are two ways for initializtions. Initializing weights with zeroes. Setting weights to zero makes your network no better than a linear model. It is important to note that setting biases to 0 will not create any troubles as non zero weights take care of breaking the symmetry and even if bias is 0, the values in every neuron are still different. Initializing weights randomly. Assigning random values to weights is better than just 0 assignment. a) If weights are initialized with very high values the term np.dot(W,X)+b becomes significantly higher and if an activation function like sigmoid() is applied, the function maps its value near to 1 where the slope of gradient changes slowly and learning takes a lot of time. b) If weights are initialized with low values it gets mapped to 0, where the case is the same as above. This problem is often referred to as the vanishing gradient.

What is ReLU? How is it better than sigmoid or tanh? ReLU is an abbreviation for Rectified Linear Unit. It is an activation function which has the value 0 for all negative values and the value f(x) = x for all positive values. The ReLU has a simple activation function which makes it fast to compute and while the sigmoid and tanh activation functions saturate at higher values, the ReLU has a potentially infinite activation, which addresses the problem of vanishing gradients.

How do you select the number of trees in the gradient boosting model? Most implementations of gradient boosting are configured by default with a relatively small number of trees, such as hundreds or thousands. Using scikit-learn we can perform a grid search of the n_estimators model parameter

How do you approach tuning parameters in XGBoost or LightGBM? Depending upon the dataset, parameter tuning can be done manually or using hyperparameter optimization frameworks such as optuna and hyperopt. In manual parameter tuning, we need to be aware of max-depth, min_samples_leaf and min_samples_split so that our model does not overfit the data but try to predict generalized characteristics of data (basically keeping variance and bias low for our model).

What happens when we have correlated features in our data? In random forest, since random forest samples some features to build each tree, the information contained in correlated features is twice as much likely to be picked than any other information contained in other features. In general, when you are adding correlated features, it means that they linearly contains the same information and thus it will reduce the robustness of your model. Each time you train your model, your model might pick one feature or the other to "do the same job" i.e. explain some variance, reduce entropy, etc.