Machine learning books and papers
前往频道在 Telegram
📈 Telegram 频道 Machine learning books and papers 的分析概览
频道 Machine learning books and papers (@machine_learn) 英语 语言赛道中的 是活跃参与者。目前社区聚集了 24 521 名订阅者,在 教育 类别中位列第 8 070,并在 伊朗 地区排名第 13 778 位。
📊 受众指标与增长动态
自 невідомо 创建以来,项目保持高速增长,吸引了 24 521 名订阅者。
根据 23 六月, 2026 的最新数据,频道保持稳定运转。过去 30 天订阅人数变化为 -162,过去 24 小时变化为 -13,整体触达仍然可观。
- 认证状态: 未认证
- 互动率 (ER): 平均受众互动率为 8.28%。内容发布后 24 小时内通常能获得 1.90% 的反应,占订阅者总量。
- 帖子覆盖: 每篇帖子平均可获得 2 031 次浏览,首日通常累积 465 次浏览。
- 互动与反馈: 受众积极参与,单帖平均反应数为 4。
- 主题关注点: 内容集中在 disorder, psy, مقاله, framework, graph 等核心主题上。
📝 描述与内容策略
作者将该频道定位为表达主观观点的平台:
“Admin: @Raminmousa
ID: @Machine_learn
link: https://t.me/Machine_learn”
凭借高频更新(最新数据采集于 24 六月, 2026),频道始终保持新鲜度与高覆盖。分析显示受众积极互动,使其成为 教育 类别中的关键影响点。
24 521
订阅者
-1324 小时
-547 天
-16230 天
帖子存档
Repost from Papers
با عرض سلام براي يكي از مقالاتمون در حوزه Multi-modal wound image classification با Heuristic algorithms نياز به نفر دوم مقاله داريم. هزينه مشاركت ٤٠٠$ هستش و ژورنال مد نظر ما
https://www.nature.com/srep/
مي باشد. جهت مشاركت با ايدي بنده ارتباط باشنين
@Raminmousa
Repost from Github LLMs
Tutorial: Train your own Reasoning model with GRPO
📓 Tutorial
https://t.me/deep_learning_proj
Repost from Papers
یکی از ابزارهای خوبی که بنده تونستم توسعه بدم ابزار Stock Ai می باشد. در این ابزار از ۳۶۰ اندیکاتور استفاده کردم. گزارشات back test این ابزار در ویدیو های زیر موجود می باشد.
May 2024 :
https://youtu.be/aSS99lynMFQ?si=QSk8VVKhLqO_2Qi3
July 2014:
https://youtu.be/ThyZ0mZwsGk?si=FKPK7Hkz-mRx-752&t=209
از این رو سعی میکنیم مقاله ای این کار رو بنویسیم. شروع مقاله ی این کار ۲۰ اسفند خواهد بود.
دوستانی که می تونن به هر نحوی کمک کنند تا شروع مقاله می تونن نام نویسی کنند.
نفرات ٣ و ٥ اين كار باقي مونده.
@Raminmousa
OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia
Large Language Models (LLMs) have made significant progress in various downstream tasks, inspiring the development of Speech Understanding Language Models (SULMs) to enable comprehensive speech-based interactions. However, most advanced SULMs are developed by the industry, leveraging large-scale datasets and computational resources that are not readily available to the academic community. Moreover, the lack of transparency in training details creates additional barriers to further innovation. In this study, we present OSUM, an Open Speech Understanding Model designed to explore the potential of training SLUMs under constrained academic resources. The OSUM model combines a Whisper encoder with a Qwen2 LLM and supports a wide range of speech tasks, including speech recognition (ASR), speech recognition with timestamps (SRWT), vocal event detection (VED), speech emotion recognition (SER), speaking style recognition (SSR), speaker gender classification (SGC), speaker age prediction (SAP), and speech-to-text chat (STTC). By employing an ASR+X training strategy, OSUM achieves efficient and stable multi-task training by simultaneously optimizing ASR alongside target tasks. Beyond delivering strong performance, OSUM emphasizes transparency by providing openly available data preparation and training methodologies, offering valuable insights and practical guidance for the academic community. By doing so, we aim to accelerate research and innovation in advanced SULM technologies.
Paper: https://arxiv.org/pdf/2501.13306v2.pdf
Code: https://github.com/aslp-lab/osum
Datasets: LibriSpeech - IEMOCAP
@Machine_learn
Mathematics of Backpropagation Through Time.
📕 Paper
@Machine_learn
📃 Methods of decomposition theory and graph labeling in the study of social network structure
📎 Study the paper
@Machine_learn
Repost from Papers
با عرض سلام
نیاز به یک نفر داریم که در موضوع زیرکمکمون کنه (نفر اول)
🔸🔸🔸🔸🔸🔸🔸🔸🔸
Title: Chronic kidney disease classification: Deep ansemble approach
کنفرانس مد نظر :
⭐️https://saiconference.com/IntelliSys
⚙️Abstract: Chronic kidney disease (CKD) is a progressive disease that may lead to kidney failure, so early diagnosis is crucial for proper management. This condition has a high mortality rate, especially in developing countries. CKD is often overlooked because there are no apparent symptoms in the early stages. Meanwhile, early diagnosis and timely clinical intervention are essential to reduce the progression of the disease. CKD diagnosis using deep learning (DL) and feature selection (FS) methods can be a useful application of artificial intelligence (AI) in healthcare. DL algorithms can provide cost-effective and efficient computer-aided diagnosis (CAD) to assist physicians. DL models are based on automatic feature selection.
In some cases, manual feature extraction can improve the results before the network learning process. This study aims to present an ensemble deep-learning model for CKD classification. The proposed method used Deep Embedded Clustering (DEC) as a similarity feature. Also, latent features obtained from the Gaussian Mixture Model (GMM) process were used. The proposed method on UCI databases achieved an accuracy of 1.0 using the Synthetic Minority Over-Sampling technique (SMOTE).
دوستانی که مشارکت میکنم بخشی از هزینه چاپ رو هم تقبل میکنن. بخش related work and introduction, هم بر عهده ی مشارکت کنندست.
@Raminmousa
Papers channel: https://t.me/+SP9l58Ta_zZmYmY0
Bayesian Sample Inference
🖥 Github: https://github.com/martenlienen/bsi
📕 Paper: https://arxiv.org/abs/2502.07580
🌟 Dataset: https://paperswithcode.com/dataset/cifar-10
@Machine_learn
Enhance-A-Video: Better Generated Video for Free
11 Feb 2025 · Yang Luo, Xuanlei Zhao, Mengzhao Chen, Kaipeng Zhang, Wenqi Shao, Kai Wang, Zhangyang Wang, Yang You
DiT-based video generation has achieved remarkable results, but research into enhancing existing models remains relatively unexplored. In this work, we introduce a training-free approach to enhance the coherence and quality of DiT-based generated videos, named Enhance-A-Video. The core idea is enhancing the cross-frame correlations based on non-diagonal temporal attention distributions. Thanks to its simple design, our approach can be easily applied to most DiT-based video generation frameworks without any retraining or fine-tuning. Across various DiT-based video generation models, our approach demonstrates promising improvements in both temporal consistency and visual quality. We hope this research can inspire future explorations in video generation enhancement.
Paper: https://arxiv.org/pdf/2502.07508v1.pdf
Code: https://github.com/NUS-HPC-AI-Lab/Enhance-A-Video
@Machine_learn
Competitive Programming with Large Reasoning Models
OpenAI∗
▪ link
@Machine_learn
Repost from Papers
با عرض سلام
نیاز به یک نفر داریم که در موضوع زیرکمکمون کنه
Title: Chronic kidney disease classification: wavelet ensemble approach
کنفرانس مد نظر :
https://saiconference.com/IntelliSys
دوستانی که مشارکت میکنم بخشی از هزینه چاپ رو هم تقبل میکنن. بخش related work , introduction, and conclusion هم بر عهده ی مشارکت کنندست.
@Raminmousa
Papers channel: https://t.me/+SP9l58Ta_zZmYmY0
Repost from Papers
با عرض سلام
نیاز به یک نفر داریم که در موضوع زیرکمکمون کنه
Title: Chronic kidney disease classification: wavelet ansemble approach
کنفرانس مد نظر :
https://saiconference.com/IntelliSys
دوستانی که مشارکت میکنم بخشی از هزینه چاپ رو هم تقبل میکنن. بخش related work , introduction, and conclusion هم بر عهده ی مشارکت کنندست.
@Raminmousa
OmniParser for Pure Vision Based GUI Agent
1 Aug 2024 · Yadong Lu, Jianwei Yang, Yelong Shen, Ahmed Awadallah
The recent success of large vision language models shows great potential in driving the agent system operating on user interfaces. However, we argue that the power multimodal models like GPT-4V as a general agent on multiple operating systems across different applications is largely underestimated due to the lack of a robust screen parsing technique capable of: 1) reliably identifying interactable icons within the user interface, and 2) understanding the semantics of various elements in a screenshot and accurately associate the intended action with the corresponding region on the screen. To fill these gaps, we introduce \textsc{OmniParser}, a comprehensive method for parsing user interface screenshots into structured elements, which significantly enhances the ability of #GPT-4V to generate actions that can be accurately grounded in the corresponding regions of the interface. We first curated an interactable icon detection dataset using popular webpages and an icon description dataset. These datasets were utilized to fine-tune specialized models: a detection model to parse interactable regions on the screen and a caption model to extract the functional semantics of the detected elements. \textsc{#OmniParser} significantly improves GPT-4V's performance on ScreenSpot benchmark. And on #Mind2Web and AITW benchmark, \textsc{OmniParser} with screenshot only input #outperforms the GPT-4V baselines requiring additional information outside of screenshot.
Paper: https://arxiv.org/pdf/2408.00203v1.pdf
Code: https://github.com/microsoft/omniparser
Dataset: ScreenSpot
@Machine_learn
现已上线!2025 年 Telegram 研究 — 年度关键洞察 
