fa
Feedback
AI & ML Papers

AI & ML Papers

رفتن به کانال در Telegram

Advancing research in Machine Learning – practical insights, tools, and techniques for researchers. Admin: @HusseinSheikho || @Hussein_Sheikho

نمایش بیشتر

📈 تحلیل کانال تلگرام AI & ML Papers

کانال AI & ML Papers (@papernexus) در بخش زبانی انگلیسی بازیگری فعال است. در حال حاضر جامعه شامل 32 919 مشترک است و جایگاه 4 135 را در دسته فناوری و برنامه‌ها و رتبه 12 401 را در منطقه الهند دارد.

📊 شاخص‌های مخاطب و پویایی

از زمان ایجاد در невідомо، پروژه رشد سریعی داشته و 32 919 مشترک جذب کرده است.

بر اساس آخرین داده‌ها در تاریخ 26 ژوئن, 2026، کانال فعالیت پایداری دارد. در ۳۰ روز گذشته تغییر اعضا برابر 401 و در ۲۴ ساعت گذشته برابر 17 بوده و همچنان دسترسی گسترده‌ای حفظ شده است.

  • وضعیت تأیید: تأیید نشده
  • نرخ تعامل (ER): میانگین تعامل مخاطب 1.23% است و در ۲۴ ساعت نخست پس از انتشار، محتوا معمولاً 0.74% واکنش نسبت به کل مشترکان کسب می‌کند.
  • دسترسی پست‌ها: هر پست به طور میانگین 405 بازدید دریافت می‌کند. در اولین روز معمولاً 242 بازدید جمع‌آوری می‌شود.
  • واکنش‌ها و تعامل: مخاطبان به‌طور فعال حمایت می‌کنند؛ میانگین واکنش به هر پست 1 است.
  • علایق موضوعی: محتوا بر موضوعات کلیدی مانند summary, apr, huggingface, github, framework تمرکز دارد.

📝 توضیح و سیاست محتوایی

نویسنده این فضا را محل بیان دیدگاه‌های شخصی توصیف می‌کند:
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers. Admin: @HusseinSheikho || @Hussein_Sheikho

به لطف به‌روزرسانی‌های پرتکرار (آخرین داده در تاریخ 27 ژوئن, 2026)، کانال همواره به‌روز و دارای دسترسی بالاست. تحلیل‌ها نشان می‌دهد مخاطبان به‌طور فعال با محتوا تعامل دارند و آن را به نقطه اثرگذاری مهم در دسته فناوری و برنامه‌ها تبدیل کرده‌اند.

32 919
مشترکین
+1724 ساعت
+757 روز
+40130 روز

در حال بارگیری داده...

جذب مشترکین
ژوئن '26
ژوئن '26
+501
در 1 کانال‌ها
مه '26
+562
در 5 کانال‌ها
Get PRO
آوریل '26
+108
در 9 کانال‌ها
Get PRO
مارس '26
+120
در 6 کانال‌ها
Get PRO
فوریه '26
+226
در 9 کانال‌ها
Get PRO
ژانویه '26
+394
در 11 کانال‌ها
Get PRO
دسامبر '25
+401
در 14 کانال‌ها
Get PRO
نوامبر '25
+380
در 0 کانال‌ها
Get PRO
اکتبر '25
+819
در 0 کانال‌ها
Get PRO
سپتامبر '25
+662
در 0 کانال‌ها
Get PRO
اوت '25
+592
در 0 کانال‌ها
Get PRO
ژوئیه '25
+620
در 10 کانال‌ها
Get PRO
ژوئن '25
+638
در 3 کانال‌ها
Get PRO
مه '25
+804
در 0 کانال‌ها
Get PRO
آوریل '25
+1 033
در 1 کانال‌ها
Get PRO
مارس '25
+643
در 2 کانال‌ها
Get PRO
فوریه '25
+727
در 0 کانال‌ها
Get PRO
ژانویه '25
+1 505
در 0 کانال‌ها
Get PRO
دسامبر '24
+1 482
در 14 کانال‌ها
Get PRO
نوامبر '24
+674
در 1 کانال‌ها
Get PRO
اکتبر '24
+883
در 0 کانال‌ها
Get PRO
سپتامبر '24
+1 016
در 0 کانال‌ها
Get PRO
اوت '24
+1 378
در 1 کانال‌ها
Get PRO
ژوئیه '24
+940
در 2 کانال‌ها
Get PRO
ژوئن '24
+1 075
در 0 کانال‌ها
Get PRO
مه '24
+1 104
در 1 کانال‌ها
Get PRO
آوریل '24
+1 276
در 1 کانال‌ها
Get PRO
مارس '24
+1 470
در 2 کانال‌ها
Get PRO
فوریه '24
+1 744
در 1 کانال‌ها
Get PRO
ژانویه '24
+2 346
در 1 کانال‌ها
Get PRO
دسامبر '23
+1 509
در 0 کانال‌ها
Get PRO
نوامبر '23
+1 164
در 0 کانال‌ها
Get PRO
اکتبر '23
+487
در 0 کانال‌ها
Get PRO
سپتامبر '23
+662
در 0 کانال‌ها
Get PRO
اوت '23
+610
در 0 کانال‌ها
Get PRO
ژوئیه '23
+2 734
در 0 کانال‌ها
Get PRO
ژوئن '23
+1 328
در 0 کانال‌ها
Get PRO
مه '23
+1 029
در 0 کانال‌ها
Get PRO
آوریل '23
+757
در 0 کانال‌ها
Get PRO
مارس '23
+629
در 0 کانال‌ها
Get PRO
فوریه '23
+8
در 0 کانال‌ها
Get PRO
ژانویه '23
+24
در 0 کانال‌ها
Get PRO
دسامبر '22
+4
در 0 کانال‌ها
Get PRO
نوامبر '22
+86
در 0 کانال‌ها
Get PRO
اکتبر '22
+17
در 0 کانال‌ها
Get PRO
سپتامبر '22
+22
در 0 کانال‌ها
Get PRO
اوت '22
+23
در 0 کانال‌ها
Get PRO
ژوئیه '22
+39
در 0 کانال‌ها
Get PRO
ژوئن '22
+812
در 0 کانال‌ها
تاریخ
رشد مشترکین
اشارات
کانال‌ها
27 ژوئن+10
26 ژوئن+23
25 ژوئن+17
24 ژوئن+34
23 ژوئن+26
22 ژوئن+17
21 ژوئن+8
20 ژوئن+8
19 ژوئن+5
18 ژوئن+12
17 ژوئن+18
16 ژوئن+16
15 ژوئن+17
14 ژوئن+32
13 ژوئن+13
12 ژوئن+20
11 ژوئن+21
10 ژوئن+9
09 ژوئن+16
08 ژوئن+13
07 ژوئن+19
06 ژوئن+13
05 ژوئن+24
04 ژوئن+30
03 ژوئن+21
02 ژوئن+38
01 ژوئن+21
پست‌های کانال
New to LBank? Unlock VIP2 and Trading Rewards VIP2 trial, transfer rewards, and trading bonuses for new users. Offer availabl
New to LBank? Unlock VIP2 and Trading Rewards VIP2 trial, transfer rewards, and trading bonuses for new users. Offer available until June 30. Sponsored By WaybienAds

2
🔥 Geometric Context Transformer for Streaming 3D Reconstruction 💡 The paper presents a new approach to streaming 3D reconstruction, which involves recovering 3D information such as camera poses and point clouds from a video stream. This task requires geometric accuracy, temporal consistency, and computational efficiency. To address this problem, the authors introduce LingBot-Map, a feed-forward 3D foundation model that uses a geometric context transformer architecture. The key component of this architecture is a specialized attention mechanism that integrates three main elements: an anchor context, a pose-reference window, and a trajectory memory. These elements work together to address coordinate grounding, dense geometric cues, and long-range drift correction, allowing the model to maintain a compact streaming state while retaining rich geometric context. The result is a model that can perform stable and efficient inference at around 20 frames per second on high-resolution inputs, even over long sequences exceeding 10,000 frames. The authors evaluate their approach on various benchmarks and demonstrate that it outperforms both existing streaming and iterative optimization-based methods, achieving superior performance in terms of geometric accuracy and temporal consistency. Overall, the paper contributes a novel and effective approach to streaming 3D reconstruction, with potential applications in areas such as robotics, computer vision, and virtual reality. 📅 Published on Apr 15 🔗 Links: • GitHub: https://github.com/huggingface • arXiv: https://arxiv.org/abs/2604.14141 • PDF: https://arxiv.org/pdf/2604.14141 • Project Page: https://technology.robbyant.com/lingbot-map 🤖 Models citing this paper: • https://huggingface.co/robbyant/lingbot-map • https://huggingface.co/agramoi/lingbot-map • https://huggingface.co/maujim/lingbot-map-long-only 🚀 Spaces citing this paper: • https://huggingface.co/spaces/limonsyrah/lingbot-3d • https://huggingface.co/spaces/mohan007/lingbot-3d • https://huggingface.co/spaces/Fifthoply/lingbot-3d-ZERO ━━━━━━━━━━━━━━━━━━━━━━━━ 📢 By: https://t.me/PaperNexus #GeometricDeepLearning #3DReconstructionAlgorithms #StreamingComputerVision #TransformerArchitectures #PointCloudProcessing
203
3
بدون متن...
177
4
New to LBank? Unlock VIP2 and Trading Rewards VIP2 trial, transfer rewards, and trading bonuses for new users. Offer availabl
New to LBank? Unlock VIP2 and Trading Rewards VIP2 trial, transfer rewards, and trading bonuses for new users. Offer available until June 30. Sponsored By WaybienAds
13
5
بدون متن...
360
6
🔥 Hallucination in World Models is Predictable and Preventable 💡 The paper addresses the issue of hallucination in world models, which occurs when the model generates unrealistic futures despite appearing visually fluent. The authors hypothesize that hallucination happens in low-data regions of the state-action space and can be detected and mitigated using data-centric signals and coverage-aware sampling techniques. To test this hypothesis, the authors created a large dataset called MMBench2, consisting of 427 hours of data and 210 tasks for visual world modeling, with ground-truth actions and rewards. They trained a 350M-parameter world model on this dataset and identified three distinct modes of hallucination: perceptual, action-marginalized, and scene-diverging. The authors developed three signals that can accurately predict where the model will fail and used these signals to develop a coverage-aware sampling technique to close coverage gaps during training. They also used the hallucination predictors as curiosity rewards for targeted data collection to adapt the pretrained world model to new environments with as few as 50 real environment trajectories. The results show that hallucination in world models is indeed a data coverage issue and that the same signals used to detect it can also be used for mitigation. The authors provide a data-efficient finetuning recipe that can adapt the pretrained world model to entirely unseen environments, demonstrating the effectiveness of their approach. Overall, the paper contributes to a better understanding of hallucination in world models and provides a practical solution to prevent and mitigate it. 📅 Published on Jun 25 🔗 Links: • GitHub: https://github.com/huggingface • arXiv: https://arxiv.org/abs/2606.27326 • PDF: https://arxiv.org/pdf/2606.27326 • Project Page: https://www.nicklashansen.com/mmbench2 ━━━━━━━━━━━━━━━━━━━━━━━━ 📢 By: https://t.me/PaperNexus #HallucinationInAI #WorldModelingTechniques #PredictiveModelingForRobotics #DataCentricSignalProcessing #VisualWorldModeling
340
7
🔥 Terrain Diffusion: A Diffusion-Based Successor to Perlin Noise in Infinite, Real-Time Terrain Generation 💡 The paper introduces Terrain Diffusion, a new method for generating realistic and infinite procedural worlds in real-time. The current method, Perlin noise, is fast and infinite but lacks realism and large-scale coherence. Terrain Diffusion uses diffusion models and a novel algorithm called InfiniteDiffusion to address these limitations. The InfiniteDiffusion algorithm enables seamless and real-time synthesis of boundless landscapes by coupling planetary context with local detail through a hierarchical stack of diffusion models. The method also uses a compact Laplacian encoding to stabilize outputs across large dynamic ranges and an open-source infinite-tensor framework to support constant-memory manipulation of unbounded tensors. Additionally, few-step consistency distillation enables efficient generation. The results show that Terrain Diffusion can synthesize entire planets coherently, controllably, and without limits, making it a practical foundation for procedural world generation. The method provides constant-time random access, seamless infinite extent, and seed-consistency, making it a suitable successor to Perlin noise. Overall, the paper presents a significant contribution to the field of procedural world generation, enabling the creation of realistic and infinite worlds in real-time. 📅 Published on Dec 9, 2025 🔗 Links: • GitHub: https://github.com/huggingface • arXiv: https://arxiv.org/abs/2512.08309 • PDF: https://arxiv.org/pdf/2512.08309 • Project Page: https://xandergos.github.io/terrain-diffusion/ 🤖 Models citing this paper: • https://huggingface.co/xandergos/terrain-diffusion-30m • https://huggingface.co/xandergos/terrain-diffusion-90m • https://huggingface.co/xandergos/TerrainDiffusion-Consistency-Base-192x3 ━━━━━━━━━━━━━━━━━━━━━━━━ 📢 By: https://t.me/PaperNexus #ProceduralTerrainGeneration #InfiniteWorlds #DiffusionModels #RealTimeTerrainSynthesis #PerlinNoiseAlternatives
209
8
بدون متن...
144
9
🔥 OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning 💡 The paper proposes a framework called On-Policy Skill Distillation, or OPID, which aims to improve the efficiency and performance of language agent training in reinforcement learning. The problem addressed is that outcome-based reinforcement learning provides sparse rewards that do not offer sufficient guidance on intermediate decisions, while existing self-distillation methods often rely on external skill memories that can be costly to maintain and may not match the current policy. OPID addresses this issue by extracting skill supervision directly from completed on-policy trajectories, representing trajectory hindsight as hierarchical skills that capture both global and local decision knowledge. The framework uses a critical-first routing mechanism to select the most relevant skill and inject it into the interaction history, allowing the old policy to re-score responses under both original and skill-augmented contexts. This yields a token-level self-distillation advantage that is combined with the outcome advantage for policy optimization. The results of the experiments demonstrate that OPID generally improves agent performance, sample efficiency, and robustness over outcome-only reinforcement learning and existing skill-distillation baselines. The framework preserves reinforcement learning as the primary training objective while introducing dense, distribution-matched hindsight supervision. The experiments were conducted on several datasets, including ALFWorld, WebShop, and Search-based QA, and the code is available for further research. Overall, OPID offers a novel approach to skill distillation that can enhance the training of language agents in reinforcement learning. 📅 Published on Jun 25 🔗 Links: • GitHub: https://github.com/huggingface • arXiv: https://arxiv.org/abs/2606.26790 • PDF: https://arxiv.org/pdf/2606.26790 🤖 Models citing this paper: • https://huggingface.co/Jinyang23/OPID-ALFWorld-1.7B ━━━━━━━━━━━━━━━━━━━━━━━━ 📢 By: https://t.me/PaperNexus #AgenticReinforcementLearning #OnPolicyLearning #SkillDistillation #ReinforcementLearningFrameworks #HierarchicalReinforcementLearning
152
10
بدون متن...
120
11
🔥 JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Parallel Tree Drafting 💡 The paper introduces JetSpec, a speculative decoding framework designed to improve the inference speed and acceptance rates of large language models. The problem addressed is the scaling limitation of speculative decoding, which accelerates autoregressive large language models by drafting multiple tokens and verifying them in parallel. However, increasing the draft budget only improves speed when acceptance remains high and drafting overhead stays low, creating a scaling ceiling. The proposed JetSpec framework combines efficient forward drafting with causal conditioning to break this ceiling. It trains a causal parallel draft head over fused hidden states from the frozen target model, producing candidate trees whose scores align with the target model's autoregressive factorization. This approach enables JetSpec to convert larger draft budgets into longer accepted prefixes and higher end-to-end speedup. The method is compared to bidirectional-head and tree-based speculative decoding baselines across various benchmarks, including math, coding, and chat tasks on dense and MoE models. The results show that JetSpec consistently outperforms these baselines, achieving significant speedup on different workloads. Specifically, JetSpec achieves up to 9.64x speedup on math tasks and 4.58x on open-ended conversational workloads, with further latency gains demonstrated through integration with virtual large language models under realistic serving loads. Overall, the paper contributes a novel speculative decoding framework that breaks the scaling ceiling of prior methods, enabling faster and more efficient large language model inference. The code and models are made available for further research and development. 📅 Published on Jun 25 🔗 Links: • GitHub: https://github.com/huggingface • arXiv: https://arxiv.org/abs/2606.18394 • PDF: https://arxiv.org/pdf/2606.18394 • Project Page: https://jetspec-project.github.io/jetspec-web/ ━━━━━━━━━━━━━━━━━━━━━━━━ 📢 By: https://t.me/PaperNexus #SpeculativeDecoding #LargeLanguageModels #AutoregressiveModeling #ParallelTreeDrafting #CausalConditioning
127
12
بدون متن...
105
13
🔥 ViQ: Text-Aligned Visual Quantized Representations at Any Resolution 💡 The paper introduces ViQ, a visual quantization framework that aims to balance semantic richness and detail preservation in discrete representations of images. The goal is to create a unified representation for text and vision that enables simpler multimodal modeling and more efficient training. Existing methods struggle to balance low-level details and high-level semantics in discrete representations, often resulting in severe information loss. The ViQ framework addresses this issue by structuring quantization learning into two stages: text-aligned pre-training and feature discretization. In the first stage, the visual encoder is pre-trained with semantic-rich supervision from a pre-trained language model, allowing it to process native-resolution visual inputs. In the second stage, a proximal representation learning strategy is used to progressively compact the feature space, along with a position-aware head-wise quantization mechanism that enables flexible processing of arbitrary resolutions. The results show that ViQ achieves competitive performance compared to state-of-the-art multimodal vision encoders with continuous and high-dimensional visual features, while maintaining high precision in low-level reconstruction. Additionally, multimodal training with visual quantized representations using ViQ leads to significant efficiency improvements, with up to 20-70 percent acceleration in training time compared to different base language models and training recipes. Overall, the paper presents a novel approach to visual quantization that balances semantics and details in discrete representations, enabling more efficient and effective multimodal modeling. 📅 Published on Jun 25 🔗 Links: • GitHub: https://github.com/huggingface • arXiv: https://arxiv.org/abs/2606.27313 • PDF: https://arxiv.org/pdf/2606.27313 ━━━━━━━━━━━━━━━━━━━━━━━━ 📢 By: https://t.me/PaperNexus #VisualQuantization #MultimodalModeling #TextVisionAlignment #DiscreteRepresentations #ImageQuantization
147
14
بدون متن...
169
15
There are hundreds of AI channels on YouTube. Here's why we made another one. Most AI content does one of two things: it stays so surface-level it teaches you nothing, or it goes so deep you need a PhD to follow along. We built Guidely for everyone in between. → We start with absolute beginners in mind → Then take you deeper, until the details actually click → Every guide is reviewed by experienced AI engineers → We don't make more content. We make better content. Whether you build, design, or market products, our goal is simple: leave you thinking "I've never seen it broken down this well." Two good places to start 👇 → AI vs ML vs Deep Learning vs GenAI ... But Done Right! The terms everyone uses. The distinctions are almost never explained clearly. We fix that: youtu.be/72yyLA2wRWc → How to Break into AI Engineering in 2026 A senior applied scientist shares what actually matters:  youtu.be/42vE7Ij4kdU If AI has ever felt overwhelming or noisy, this channel is for you. If the content resonates with you, please don’t forget to like and subscribe.
343
16
Steal my FIFA 2026 trading prep ⚽🔥💰 Most “predictions” are vibes… I wanted plays with receipts. NO FLUFF. I spent nights tr
Steal my FIFA 2026 trading prep ⚽🔥💰 Most “predictions” are vibes… I wanted plays with receipts. NO FLUFF. I spent nights tracking match angles & prize-pool comps so you don’t have to ⚡🚀 Inside Wish Digital Sports picks + showdown updates I post what I’m actually trading + what I’m avoiding 🎯🏆 If you want football predictions, trading comps, and straight-to-the-point calls-welcome 😤💥 👉 Join the World Cup plays feed #ad 📢 InsideAd
181
17
New to LBank? Unlock VIP2 and Trading Rewards VIP2 trial, transfer rewards, and trading bonuses for new users. Offer availabl
New to LBank? Unlock VIP2 and Trading Rewards VIP2 trial, transfer rewards, and trading bonuses for new users. Offer available until June 30. Sponsored By WaybienAds
85
18
New to LBank? Unlock VIP2 and Trading Rewards VIP2 trial, transfer rewards, and trading bonuses for new users. Offer availabl
New to LBank? Unlock VIP2 and Trading Rewards VIP2 trial, transfer rewards, and trading bonuses for new users. Offer available until June 30. Sponsored By WaybienAds
98
19
بدون متن...
434
20
🔥 UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating 💡 The paper presents UnityShots, a memory-driven audio-video generation system that can generate coherent multi-shot videos. The problem addressed is that existing approaches to generating multi-shot videos either cannot scale or do not maintain consistent subject appearance and audio across video cuts. To solve this, UnityShots uses a combination of fixed-size long-term and short-term memory slots, boundary-conditioned gates, and discrete cut-type priors to maintain consistency across shots. The system consists of two streams, video and audio, where the video stream uses two fixed-size slots to store information about the opening shot and the immediately preceding shot, and the audio stream uses a reference speaker token to preserve vocal timbre. The boundary-conditioned gate updates the memory slots at every cut, and the discrete cut-type prior allows for control over transition strength between shots. The system was trained on annotated cinematic and music-video shots and evaluated on a benchmark of 200 multi-cultural multi-shot sequences. The results show that UnityShots outperforms open-source baselines on every cross-shot coherence metric and matches the performance of the strongest closed-source system. The paper also releases a benchmark of multi-shot sequences with per-shot reference identities, reference audio, and per-boundary transition labels, which can be used for future research. Overall, UnityShots provides a new approach to generating coherent multi-shot videos that can maintain consistent subject appearance and audio across video cuts. 📅 Published on Jun 19 🔗 Links: • GitHub: https://github.com/huggingface • arXiv: https://arxiv.org/abs/2606.21661 • PDF: https://arxiv.org/pdf/2606.21661 • Project Page: https://jackailab.github.io/Projects/UnityShots/ 📊 Datasets citing this paper: • https://huggingface.co/datasets/KlingTeam/UnityShotsBench ━━━━━━━━━━━━━━━━━━━━━━━━ 📢 By: https://t.me/PaperNexus #AudioVideoGeneration #MultiShotVideo #BoundaryAwareGating #MemoryDrivenGeneration #CoherentVideoGeneration
396