ch
Feedback
AI with Papers - Artificial Intelligence & Deep Learning

AI with Papers - Artificial Intelligence & Deep Learning

前往频道在 Telegram

All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/ #AI #chatGPT

显示更多

📈 Telegram 频道 AI with Papers - Artificial Intelligence & Deep Learning 的分析概览

频道 AI with Papers - Artificial Intelligence & Deep Learning (@ai_deeplearning) 英语 语言赛道中的 是活跃参与者。目前社区聚集了 17 151 名订阅者,在 技术与应用 类别中位列第 7 726,并在 马来西亚 地区排名第 2 240

📊 受众指标与增长动态

невідомо 创建以来,项目保持高速增长,吸引了 17 151 名订阅者。

根据 21 六月, 2026 的最新数据,频道保持稳定运转。过去 30 天订阅人数变化为 -166,过去 24 小时变化为 -6,整体触达仍然可观。

  • 认证状态: 未认证
  • 互动率 (ER): 平均受众互动率为 23.63%。内容发布后 24 小时内通常能获得 6.86% 的反应,占订阅者总量。
  • 帖子覆盖: 每篇帖子平均可获得 4 057 次浏览,首日通常累积 1 177 次浏览。
  • 互动与反馈: 受众积极参与,单帖平均反应数为 26
  • 主题关注点: 内容集中在 framework, object, dataset, tba, depth 等核心主题上。

📝 描述与内容策略

作者将该频道定位为表达主观观点的平台:
All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/ #AI #chatGPT

凭借高频更新(最新数据采集于 22 六月, 2026),频道始终保持新鲜度与高覆盖。分析显示受众积极互动,使其成为 技术与应用 类别中的关键影响点。

17 151
订阅者
-624 小时
-277
-16630
帖子存档
🫅FlowMDM: Human Composition🫅 👉FlowMDM, a diffusion-based approach capable of generating seamlessly continuous sequences of human motion from textual descriptions. 👉Review https://t.ly/pr2g_ 👉Paper https://lnkd.in/daYRftdF 👉Project https://lnkd.in/dcRkv5Pc 👉Repo https://lnkd.in/dw-3JJks

🗃️ MATH-Vision Dataset 🗃️ 👉MATH-V is a curated dataset of 3,040 HQ mat problems with visual contexts sourced from real mat
🗃️ MATH-Vision Dataset 🗃️ 👉MATH-V is a curated dataset of 3,040 HQ mat problems with visual contexts sourced from real math competitions. Dataset released 💙 👉Review https://t.ly/gmIAu 👉Paper arxiv.org/pdf/2402.14804.pdf 👉Project mathvision-cuhk.github.io/ 👉Code github.com/mathvision-cuhk/MathVision

🩻 Pose via Ray Diffusion 🩻 👉Novel distributed representation of camera pose that treats a camera as a bundle of rays. Naturally suited for set-level transformers, it's the new SOTA on camera pose estimation. Source code released 💙 👉Review https://t.ly/qBsFK 👉Paper arxiv.org/pdf/2402.14817.pdf 👉Project jasonyzhang.com/RayDiffusion 👉Code github.com/jasonyzhang/RayDiffusion

🦥Neuromorphic Video Binarization🦥 👉 University of HK unveils the new SOTA in event-based neuromorphic binary reconstruction: stunning results on QR Code, barcode, & Text. Real-Time, only CPU, up to 10,000 FPS! 👉Review https://t.ly/V-NFa 👉Paper arxiv.org/pdf/2402.12644.pdf 👉Project github.com/eleboss/EBR

🪟 BOG: Fine Geometric Viewshttps://t.ly/E6T0W 🪟 👉 #Google (+Tübingen) unveils Binary Opacity Grids, a novel method to reconstruct triangle meshes from multi-view images able to capture fine geometric detail such as leaves, branches & grass. New SOTA, real-time on Google Pixel 8 Pro (and similar). 👉Review https://t.ly/E6T0W 👉Paper https://lnkd.in/dQEq3zy6 👉Project https://lnkd.in/dYYCadx9 👉Demo https://lnkd.in/d92R6QME

☀️ One2Avatar: Pic -> 3D Avatar ☀️ 👉#Google presents a new approach to generate animatable photo-realistic avatars from only a few/one image. Impressive results. 👉Review https://t.ly/AS1oc 👉Paper arxiv.org/pdf/2402.11909.pdf 👉Project zhixuany.github.io/one2avatar_webpage/

🔥 Breaking: GEMINI 1.5 is out 🔥 👉Gemini 1.5 just announced: standard 128,000 token context window, up to 1 MILLION tokens via AI-Studio and #Vertex AI in private preview 🫠 👉Review https://t.ly/Vblvx 👉More: https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024/#build-experiment

🆔 Magic-Me: ID-Specific Video 🆔 👉#ByteDance VCD: with just a few images of a specific identity it can generate temporal consistent videos aligned with the given prompt 👉Review https://t.ly/qjJ2O 👉Paper arxiv.org/pdf/2402.09368.pdf 👉Project magic-me-webpage.github.io 👉Code github.com/Zhen-Dong/Magic-Me

🍇 Graph Neural Network in TF 🍇 👉#Google released TensorFlow-GNN: a novel library to build Graph Neural Networks on the TensorFlow platform. Source Code released under Apache 2.0 license 💙 #artificialintelligence #machinelearning #ml #AI #deeplearning #computervision #AIwithPapers #metaverse 👉Review https://t.ly/TQfg- 👉Code https://github.com/tensorflow/gnn 👉Blog https://blog.research.google/2024/02/graph-neural-networks-in-tensorflow.html

🌴 Direct-a-Video Generation 🌴 👉Direct-a-Video is a text-to-video generation framework that allows users to individually or jointly control the camera movement and/or object motion 👉Review https://t.ly/dZSLs 👉Paper arxiv.org/pdf/2402.03162.pdf 👉Project https://direct-a-video.github.io/

🌆EfficientViT-SAM: 69x Faster SAM 🌆 👉EfficientViT-SAM is a new family of accelerated Segment Anything Models. The same old SAM’s lightweight prompt encoder and mask decoder, while replacing the heavy image encoder with EfficientViT. Up to 69x faster, source code released 💙 Authors: Tsinghua, MIT & #Nvidia💥 👉Review https://lnkd.in/dMgakzWm 👉Paper arxiv.org/pdf/2402.05008.pdf 👉Code github.com/mit-han-lab/efficientvit

🌵 G-Splatting Controllable Portraits 🌵 👉From monocular/casual video captures, Rig3DGS rigs 3D Gaussian Splatting to enable the creation of re-animatable portrait videos with control over facial expressions, head-pose and viewing direction. Authors: Stony Brook University & #Adobe 👉Review https://t.ly/fq71w 👉Paper https://arxiv.org/pdf/2402.03723.pdf 👉Project shahrukhathar.github.io/2024/02/05/Rig3DGS.html

🪵 HASSOD Object Detection 🪵 👉 HASSOD: fully self-supervised detection and instance segmentation. The new SOTA able to understand the part-to-whole object composition like humans do. 👉Review https://t.ly/66qHF 👉Paper arxiv.org/pdf/2402.03311.pdf 👉Project hassod-neurips23.github.io/ 👉Repo github.com/Shengcao-Cao/HASSOD

💥 #Py4AI: 2x speakers, 2x tickets! 💥 ✅Doubling the speakers (6 -> 12!) ✅Adding a new track (2 tracks in parallel) ✅Releasin
💥 #Py4AI: 2x speakers, 2x tickets! 💥 ✅Doubling the speakers (6 -> 12!) ✅Adding a new track (2 tracks in parallel) ✅Releasing a new batch of 100 tickets! 👉 More: https://t.ly/WmVrM

🏇Bootstrapping TAP 🏇 👉#Deepmind shows how large-scale, unlabeled, uncurated real-world data can improve TAP with minimal architectural changes, via a self-supervised student-teacher setup. Source Code released 💙 👉Review https://t.ly/-S_ZL 👉Paper https://arxiv.org/pdf/2402.00847.pdf 👉Code https://lnkd.in/gyi7Dhkn

🍬 ABS: SOTA collision-free 🍬 👉ABS (Agile But Safe): learning-based control framework for agile and collision-free locomotion for quadrupedal robot. Source Code announced (coming) 💙 👉Review https://t.ly/AYu-Z 👉Paper arxiv.org/pdf/2401.17583.pdf 👉Project agile-but-safe.github.io/ 👉Repo github.com/LeCAR-Lab/ABS

🚦(adding) Anything in Any Video🚦🚦 👉 XPeng Motors announced Anything in Any Scene: novel #AI for realistic video simulation that seamlessly inserts any object into an existing dynamic video. Strong emphasis on realism, the objects in the BBs don't exist. Source Code released 💙 👉Review https://t.ly/UYhl0 👉Code https://lnkd.in/gyi7Dhkn 👉Paper https://lnkd.in/gXyAJ6GZ 👉Project https://lnkd.in/gVA5vduD

🚦(adding) Anything in Any Video🚦🚦 👉 XPeng Motors announced Anything in Any Scene: novel #AI for realistic video simulation that seamlessly inserts any object into an existing dynamic video. Strong emphasis on realism, the objects in the BBs don't exist. Source Code released 💙 👉Review https://t.ly/UYhl0 👉Code https://lnkd.in/gyi7Dhkn 👉Paper https://lnkd.in/gXyAJ6GZ 👉Project https://lnkd.in/gVA5vduD

🎉 ADΔER: Event-Camera Suite 🎉 👉ADΔER: a novel/unified framework for event-based video. Encoder / transcoder / decoder for ADΔER (Address, Decimation, Δt Event Representation) video streams. Source code (RUST) released 💙 H/T author: Andrew C. Freeman from University of North Carolina, USA. 👉Review https://t.ly/w5_KC 👉Paper arxiv.org/pdf/2401.17151.pdf 👉Repo github.com/ac-freeman/adder-codec-rs