AI with Papers - Artificial Intelligence & Deep Learning

前往频道在 Telegram

All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/ #AI #chatGPT

显示更多

马来西亚2 244 技术与应用7 718...

📈 Telegram 频道 AI with Papers - Artificial Intelligence & Deep Learning 的分析概览

频道 AI with Papers - Artificial Intelligence & Deep Learning (@ai_deeplearning) 英语语言赛道中的是活跃参与者。目前社区聚集了 17 146 名订阅者，在 技术与应用 类别中位列第 7 718，并在 马来西亚 地区排名第 2 244 位。

📊 受众指标与增长动态

自 невідомо 创建以来，项目保持高速增长，吸引了 17 146 名订阅者。

根据 22 六月, 2026 的最新数据，频道保持稳定运转。过去 30 天订阅人数变化为 -178，过去 24 小时变化为 -15，整体触达仍然可观。

认证状态： 未认证
互动率 (ER)： 平均受众互动率为 24.30%。内容发布后 24 小时内通常能获得 6.86% 的反应，占订阅者总量。
帖子覆盖： 每篇帖子平均可获得 4 167 次浏览，首日通常累积 1 177 次浏览。
互动与反馈： 受众积极参与，单帖平均反应数为 26。
主题关注点： 内容集中在 framework, object, dataset, tba, depth 等核心主题上。

📝 描述与内容策略

作者将该频道定位为表达主观观点的平台：
“All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/ #AI #chatGPT”

凭借高频更新（最新数据采集于 23 六月, 2026），频道始终保持新鲜度与高覆盖。分析显示受众积极互动，使其成为 技术与应用 类别中的关键影响点。

17 146

订阅者

-1524 小时

-437 天

-17830 天

4 167

帖子浏览量

~ 1 17724 小时

~ 1 35348 小时

24.30%

参与率

无数据

每日帖子数

Ads index

beta

帖子存档

17 151

🫅FlowMDM: Human Composition🫅 👉FlowMDM, a diffusion-based approach capable of generating seamlessly continuous sequences of human motion from textual descriptions. 👉Review https://t.ly/pr2g_ 👉Paper https://lnkd.in/daYRftdF 👉Project https://lnkd.in/dcRkv5Pc 👉Repo https://lnkd.in/dw-3JJks

17 151

🗃️ MATH-Vision Dataset 🗃️ 👉MATH-V is a curated dataset of 3,040 HQ mat problems with visual contexts sourced from real math competitions. Dataset released 💙 👉Review https://t.ly/gmIAu 👉Paper arxiv.org/pdf/2402.14804.pdf 👉Project mathvision-cuhk.github.io/ 👉Code github.com/mathvision-cuhk/MathVision

17 151

🩻 Pose via Ray Diffusion 🩻 👉Novel distributed representation of camera pose that treats a camera as a bundle of rays. Naturally suited for set-level transformers, it's the new SOTA on camera pose estimation. Source code released 💙 👉Review https://t.ly/qBsFK 👉Paper arxiv.org/pdf/2402.14817.pdf 👉Project jasonyzhang.com/RayDiffusion 👉Code github.com/jasonyzhang/RayDiffusion

17 151

🦥Neuromorphic Video Binarization🦥 👉 University of HK unveils the new SOTA in event-based neuromorphic binary reconstruction: stunning results on QR Code, barcode, & Text. Real-Time, only CPU, up to 10,000 FPS! 👉Review https://t.ly/V-NFa 👉Paper arxiv.org/pdf/2402.12644.pdf 👉Project github.com/eleboss/EBR

17 151

🪟 BOG: Fine Geometric Viewshttps://t.ly/E6T0W 🪟 👉 #Google (+Tübingen) unveils Binary Opacity Grids, a novel method to reconstruct triangle meshes from multi-view images able to capture fine geometric detail such as leaves, branches & grass. New SOTA, real-time on Google Pixel 8 Pro (and similar). 👉Review https://t.ly/E6T0W 👉Paper https://lnkd.in/dQEq3zy6 👉Project https://lnkd.in/dYYCadx9 👉Demo https://lnkd.in/d92R6QME

17 151

☀️ One2Avatar: Pic -> 3D Avatar ☀️ 👉#Google presents a new approach to generate animatable photo-realistic avatars from only a few/one image. Impressive results. 👉Review https://t.ly/AS1oc 👉Paper arxiv.org/pdf/2402.11909.pdf 👉Project zhixuany.github.io/one2avatar_webpage/

17 151

🔥 Code is out 🔥 👉Repo: https://github.com/princeton-computational-imaging/NSF

17 151

🔥 Breaking: GEMINI 1.5 is out 🔥 👉Gemini 1.5 just announced: standard 128,000 token context window, up to 1 MILLION tokens via AI-Studio and #Vertex AI in private preview 🫠 👉Review https://t.ly/Vblvx 👉More: https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024/#build-experiment

17 151

🆔 Magic-Me: ID-Specific Video 🆔 👉#ByteDance VCD: with just a few images of a specific identity it can generate temporal consistent videos aligned with the given prompt 👉Review https://t.ly/qjJ2O 👉Paper arxiv.org/pdf/2402.09368.pdf 👉Project magic-me-webpage.github.io 👉Code github.com/Zhen-Dong/Magic-Me

17 151

🍇 Graph Neural Network in TF 🍇 👉#Google released TensorFlow-GNN: a novel library to build Graph Neural Networks on the TensorFlow platform. Source Code released under Apache 2.0 license 💙 #artificialintelligence #machinelearning #ml #AI #deeplearning #computervision #AIwithPapers #metaverse 👉Review https://t.ly/TQfg- 👉Code https://github.com/tensorflow/gnn 👉Blog https://blog.research.google/2024/02/graph-neural-networks-in-tensorflow.html

17 151

🌴 Direct-a-Video Generation 🌴 👉Direct-a-Video is a text-to-video generation framework that allows users to individually or jointly control the camera movement and/or object motion 👉Review https://t.ly/dZSLs 👉Paper arxiv.org/pdf/2402.03162.pdf 👉Project https://direct-a-video.github.io/

17 151

🌆EfficientViT-SAM: 69x Faster SAM 🌆 👉EfficientViT-SAM is a new family of accelerated Segment Anything Models. The same old SAM’s lightweight prompt encoder and mask decoder, while replacing the heavy image encoder with EfficientViT. Up to 69x faster, source code released 💙 Authors: Tsinghua, MIT & #Nvidia💥 👉Review https://lnkd.in/dMgakzWm 👉Paper arxiv.org/pdf/2402.05008.pdf 👉Code github.com/mit-han-lab/efficientvit

17 151

🌵 G-Splatting Controllable Portraits 🌵 👉From monocular/casual video captures, Rig3DGS rigs 3D Gaussian Splatting to enable the creation of re-animatable portrait videos with control over facial expressions, head-pose and viewing direction. Authors: Stony Brook University & #Adobe 👉Review https://t.ly/fq71w 👉Paper https://arxiv.org/pdf/2402.03723.pdf 👉Project shahrukhathar.github.io/2024/02/05/Rig3DGS.html

17 151

🪵 HASSOD Object Detection 🪵 👉 HASSOD: fully self-supervised detection and instance segmentation. The new SOTA able to understand the part-to-whole object composition like humans do. 👉Review https://t.ly/66qHF 👉Paper arxiv.org/pdf/2402.03311.pdf 👉Project hassod-neurips23.github.io/ 👉Repo github.com/Shengcao-Cao/HASSOD

17 151

💥 #Py4AI: 2x speakers, 2x tickets! 💥 ✅Doubling the speakers (6 -> 12!) ✅Adding a new track (2 tracks in parallel) ✅Releasing a new batch of 100 tickets! 👉 More: https://t.ly/WmVrM

17 151

🏇Bootstrapping TAP 🏇 👉#Deepmind shows how large-scale, unlabeled, uncurated real-world data can improve TAP with minimal architectural changes, via a self-supervised student-teacher setup. Source Code released 💙 👉Review https://t.ly/-S_ZL 👉Paper https://arxiv.org/pdf/2402.00847.pdf 👉Code https://lnkd.in/gyi7Dhkn

17 151

🍬 ABS: SOTA collision-free 🍬 👉ABS (Agile But Safe): learning-based control framework for agile and collision-free locomotion for quadrupedal robot. Source Code announced (coming) 💙 👉Review https://t.ly/AYu-Z 👉Paper arxiv.org/pdf/2401.17583.pdf 👉Project agile-but-safe.github.io/ 👉Repo github.com/LeCAR-Lab/ABS

17 151

🚦(adding) Anything in Any Video🚦🚦 👉 XPeng Motors announced Anything in Any Scene: novel #AI for realistic video simulation that seamlessly inserts any object into an existing dynamic video. Strong emphasis on realism, the objects in the BBs don't exist. Source Code released 💙 👉Review https://t.ly/UYhl0 👉Code https://lnkd.in/gyi7Dhkn 👉Paper https://lnkd.in/gXyAJ6GZ 👉Project https://lnkd.in/gVA5vduD

17 151

🎉 ADΔER: Event-Camera Suite 🎉 👉ADΔER: a novel/unified framework for event-based video. Encoder / transcoder / decoder for ADΔER (Address, Decimation, Δt Event Representation) video streams. Source code (RUST) released 💙 H/T author: Andrew C. Freeman from University of North Carolina, USA. 👉Review https://t.ly/w5_KC 👉Paper arxiv.org/pdf/2401.17151.pdf 👉Repo github.com/ac-freeman/adder-codec-rs