Machine learning books and papers
前往频道在 Telegram
📈 Telegram 频道 Machine learning books and papers 的分析概览
频道 Machine learning books and papers (@machine_learn) 英语 语言赛道中的 是活跃参与者。目前社区聚集了 24 519 名订阅者,在 教育 类别中位列第 8 070,并在 伊朗 地区排名第 13 778 位。
📊 受众指标与增长动态
自 невідомо 创建以来,项目保持高速增长,吸引了 24 519 名订阅者。
根据 23 六月, 2026 的最新数据,频道保持稳定运转。过去 30 天订阅人数变化为 -162,过去 24 小时变化为 -13,整体触达仍然可观。
- 认证状态: 未认证
- 互动率 (ER): 平均受众互动率为 8.28%。内容发布后 24 小时内通常能获得 1.90% 的反应,占订阅者总量。
- 帖子覆盖: 每篇帖子平均可获得 2 031 次浏览,首日通常累积 465 次浏览。
- 互动与反馈: 受众积极参与,单帖平均反应数为 4。
- 主题关注点: 内容集中在 disorder, psy, مقاله, framework, graph 等核心主题上。
📝 描述与内容策略
作者将该频道定位为表达主观观点的平台:
“Admin: @Raminmousa
ID: @Machine_learn
link: https://t.me/Machine_learn”
凭借高频更新(最新数据采集于 24 六月, 2026),频道始终保持新鲜度与高覆盖。分析显示受众积极互动,使其成为 教育 类别中的关键影响点。
24 519
订阅者
-1324 小时
-547 天
-16230 天
帖子存档
Painful intelligence: What AI can tell us about human suffering
📄 Book
@Machine_learn
Repost from Papers
با عرض سلام برای یکی از کارهای پژوهشیمون در wound image classification نیاز به نفر سوم داریم. شخص علاوه بر کار بخشی از هزینه سرور رو هم باید تقبل کنه.
Journal: https://www.nature.com/srep/
جهت هماهنگی می تونین با ایدی بنده در ارتباط باشین.
@Raminmousa
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
DiT diffusion models have achieved great success in text-to-video generation, leveraging their scalability in model capacity and data scale. High content and motion fidelity aligned with text prompts, however, often require large model parameters and a substantial number of function evaluations (NFEs). Realistic and visually appealing details are typically reflected in high resolution outputs, further amplifying computational demands especially for single stage DiT models. To address these challenges, we propose a novel two stage framework, #FlashVideo, which strategically allocates model capacity and NFEs across stages to balance generation fidelity and quality. In the first stage, prompt fidelity is prioritized through a low resolution generation process utilizing large parameters and sufficient NFEs to enhance computational efficiency. The second stage establishes flow matching between low and high resolutions, effectively generating fine details with minimal NFEs. Quantitative and visual results demonstrate that FlashVideo achieves state-of-the-art high resolution video generation with superior computational efficiency. Additionally, the two-stage design enables users to preview the initial output before committing to full resolution generation, thereby significantly reducing computational costs and wait times as well as enhancing commercial viability .
Paper: https://arxiv.org/pdf/2502.05179v1.pdf
Code: https://github.com/foundationvision/flashvideo
@Machine_learn
📄 How natural language processing derived techniques are used on biological data: a systematic review
📎 Study the paper
@Machine_learn
LIMO: Less is More for Reasoning
5 Feb 2025 · Yixin Ye, Zhen Huang, Yang Xiao, Ethan Chern, Shijie Xia, PengFei Liu ·
We present a fundamental discovery that challenges our understanding of how complex reasoning emerges in large language models. While conventional wisdom suggests that sophisticated reasoning tasks demand extensive training data (>100,000 examples), we demonstrate that complex mathematical reasoning abilities can be effectively elicited with surprisingly few examples. Through comprehensive experiments, our proposed model LIMO demonstrates unprecedented performance in mathematical reasoning. With merely 817 curated training samples, LIMO achieves 57.1% accuracy on AIME and 94.8% on #MATH, improving from previous SFT-based models' 6.5% and 59.2% respectively, while only using 1% of the training data required by previous approaches. LIMO demonstrates exceptional out-of-distribution generalization, achieving 40.5% absolute improvement across 10 diverse benchmarks, outperforming models trained on 100x more data, challenging the notion that SFT leads to memorization rather than generalization. Based on these results, we propose the Less-Is-More Reasoning Hypothesis (#LIMO Hypothesis): In foundation models where domain knowledge has been comprehensively encoded during pre-training, sophisticated reasoning capabilities can emerge through minimal but precisely orchestrated demonstrations of cognitive processes. This hypothesis posits that the elicitation threshold for complex reasoning is determined by two key factors: (1) the completeness of the model's encoded knowledge foundation during pre-training, and (2) the effectiveness of post-training examples as "cognitive templates" that show the model how to utilize its knowledge base to solve complex reasoning tasks. To facilitate reproducibility and future research in data-efficient reasoning
Paper: https://arxiv.org/pdf/2502.03387v1.pdf
Codes:
https://github.com/gair-nlp/limo
https://github.com/zhaoolee/garss
@Machine_learn
Repost from Github LLMs
⚡️ LLM4Decompile .
git clone https://github.com/albertan017/LLM4Decompile.git
cd LLM4Decompile
conda create -n 'llm4decompile' python=3.9 -y
conda activate llm4decompile
pip install -r requirements.txt
🟡 Github
🟡 Models
🟡 Paper
🟡 Colab
https://t.me/deep_learning_proj⭐️ Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
🖥 Github: https://github.com/bcmi/Light-A-Video
📕 Paper: https://arxiv.org/abs/2502.08590v1
🌟 Dataset: https://paperswithcode.com/task/image-relighting
@Machine_learn
یکی از ابزارهای خوبی که بنده تونستم توسعه بدم ابزار Stock Ai می باشد. در این ابزار از ۳۶۰ اندیکاتور استفاده کردم. گزارشات back test این ابزار در ویدیو های زیر موجود می باشد.
نفرات ۴ و ۵ از این مقاله باقی مونده است.
🔹🔹🔹🔹
May 2024 :
https://youtu.be/aSS99lynMFQ?si=QSk8VVKhLqO_2Qi3
July 2014:
https://youtu.be/ThyZ0mZwsGk?si=FKPK7Hkz-mRx-752&t=209
از این رو سعی میکنیم مقاله ای این کار رو بنویسیم. شروع مقاله ی این کار ۲۰ اسفند خواهد بود.
دوستانی که می تونن به هر نحوی کمک کنند تا شروع مقاله می تونن نام نویسی کنند.
@Raminmousa
Repost from Github LLMs
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration
24 Jan 2025 · Kai-Tuo Xu, Feng-Long Xie, Xu Tang, Yao Hu ·
We present FireRedASR, a family of large-scale automatic speech recognition (ASR) models for Mandarin, designed to meet diverse requirements in superior performance and optimal efficiency across various applications. FireRedASR comprises two variants: FireRedASR-LLM: Designed to achieve state-of-the-art (SOTA) performance and to enable seamless end-to-end speech interaction. It adopts an Encoder-Adapter-LLM framework leveraging large language model (LLM) capabilities. On public Mandarin benchmarks, FireRedASR-LLM (8.3B parameters) achieves an average Character Error Rate (CER) of 3.05%, surpassing the latest SOTA of 3.33% with an 8.4% relative CER reduction (CERR). It demonstrates superior generalization capability over industrial-grade baselines, achieving 24%-40% CERR in multi-source Mandarin ASR scenarios such as video, live, and intelligent assistant. FireRedASR-AED: Designed to balance high performance and computational efficiency and to serve as an effective speech representation module in LLM-based speech models. It utilizes an Attention-based Encoder-Decoder (AED) architecture. On public Mandarin benchmarks, FireRedASR-AED (1.1B parameters) achieves an average CER of 3.18%, slightly worse than FireRedASR-LLM but still outperforming the latest SOTA model with over 12B parameters. It offers a more compact size, making it suitable for resource-constrained applications. Moreover, both models exhibit competitive results on Chinese dialects and English speech benchmarks and excel in singing lyrics recognition.
Paper: https://arxiv.org/pdf/2501.14350v1.pdf
Code: https://github.com/fireredteam/fireredasr
Datasets: LibriSpeech - AISHELL-1 - AISHELL-2 - WenetSpeech
https://t.me/deep_learning_proj
نفرات ۳،۴ و ۵ این پروژه رو برای مشارکت در نظر گرفتیم. ژورنال مورد نظر برای ارسال
Finance innovation
If: 6.5
دوستانی که مایل به شرکت هستند با ایدی بنده در ارتباط باشند.
@Raminmousa
LLM4Decompile: Decompiling Binary Code with Large Language Models
8 Mar 2024 · Hanzhuo Tan, Qi Luo, Jing Li, Yuqun Zhang ·
Decompilation aims to convert binary code to high-level source code, but traditional tools like Ghidra often produce results that are difficult to read and execute. Motivated by the advancements in Large Language Models (LLMs), we propose LLM4Decompile, the first and largest open-source #LLM series (1.3B to 33B) trained to decompile binary code. We optimize the LLM training process and introduce the LLM4Decompile-End models to decompile binary directly. The resulting models significantly outperform GPT-4o and Ghidra on the HumanEval and ExeBench benchmarks by over 100% in terms of re-executability rate. Additionally, we improve the standard refinement approach to fine-tune the LLM4Decompile-Ref models, enabling them to effectively refine the decompiled code from Ghidra and achieve a further 16.2% improvement over the LLM4Decompile-End. LLM4Decompile demonstrates the potential of LLMs to revolutionize binary code decompilation, delivering remarkable improvements in readability and executability while complementing conventional tools for optimal results.
Paper: https://arxiv.org/pdf/2403.05286v3.pdf
Code: https://github.com/albertan017/LLM4Decompile
@Machine_learn
در این پروژه به برسی عوامل استرس زا در خودکشی پرداختیم. این اولین کار در زبان فارسی می باشد که برای این منظور گسترش داده شد. از دوستانی که در این پروژه همکاری کردن تشکر می کنم❤️
CapsF_Capsule_Fusion_for_Extracting_Psychiatric_Stressors_for_Suicide.pdf4.67 KB
现已上线!2025 年 Telegram 研究 — 年度关键洞察 
