AI with Papers - Artificial Intelligence & Deep Learning

前往频道在 Telegram

All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/ #AI #chatGPT

显示更多

马来西亚2 240 技术与应用7 726...

📈 Telegram 频道 AI with Papers - Artificial Intelligence & Deep Learning 的分析概览

频道 AI with Papers - Artificial Intelligence & Deep Learning (@ai_deeplearning) 英语语言赛道中的是活跃参与者。目前社区聚集了 17 151 名订阅者，在 技术与应用 类别中位列第 7 726，并在 马来西亚 地区排名第 2 240 位。

📊 受众指标与增长动态

自 невідомо 创建以来，项目保持高速增长，吸引了 17 151 名订阅者。

根据 21 六月, 2026 的最新数据，频道保持稳定运转。过去 30 天订阅人数变化为 -166，过去 24 小时变化为 -6，整体触达仍然可观。

认证状态： 未认证
互动率 (ER)： 平均受众互动率为 23.63%。内容发布后 24 小时内通常能获得 6.86% 的反应，占订阅者总量。
帖子覆盖： 每篇帖子平均可获得 4 057 次浏览，首日通常累积 1 177 次浏览。
互动与反馈： 受众积极参与，单帖平均反应数为 26。
主题关注点： 内容集中在 framework, object, dataset, tba, depth 等核心主题上。

📝 描述与内容策略

作者将该频道定位为表达主观观点的平台：
“All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/ #AI #chatGPT”

凭借高频更新（最新数据采集于 22 六月, 2026），频道始终保持新鲜度与高覆盖。分析显示受众积极互动，使其成为 技术与应用 类别中的关键影响点。

17 151

订阅者

-624 小时

-277 天

-16630 天

4 057

帖子浏览量

~ 1 17724 小时

~ 1 35348 小时

23.63%

参与率

无数据

每日帖子数

Ads index

beta

帖子存档

17 151

🖼️ Diffusive Images that Sound 🖼️ 👉The University of Michigan unveils a diffusion model able to generate spectrograms that look like images but can also be played as sound. 👉Review https://t.ly/ADtYM 👉Paper arxiv.org/pdf/2405.12221 👉Project ificl.github.io/images-that-sound 👉Code github.com/IFICL/images-that-sound

17 151

⚽ 3D Shot Posture from Broadcast ⚽ 👉Nagoya Univeristy unveils 3DSP soccer broadcast videos, the most extensive sports image dataset with 2D pose annotations EVER. Data and Source Code released💙 👉Review https://t.ly/IIMeZ 👉Paper https://arxiv.org/pdf/2405.12070 👉Code https://github.com/calvinyeungck/3D-Shot-Posture-Dataset/tree/master

17 151

🦕 Grounding DINO 1.5 Pro/Edge 🦕 👉Grounding DINO 1.5, a suite of advanced open-set object detection models to advanced the "Edge" of open-set object detection. Source Code released under Apache 2.0💙 👉Review https://t.ly/kS-og 👉Paper https://lnkd.in/dNakMge2 👉Code https://lnkd.in/djhnQmrm

17 151

🫀 EchoTracker: Tracking Echocardiography🫀 👉NTNU unveils EchoTracker, a two-fold coarse-to-fine model that facilitates the tracking of queried points on a tissue surface across ultrasound. Source Code released💙 👉Review https://t.ly/NyBe0 👉Paper https://arxiv.org/pdf/2405.08587 👉Code https://github.com/riponazad/echotracker/

17 151

🔥EfficientTrain++: Efficient Foundation Visual Backbone Training🔥 👉Tsinghua unveils EfficientTrain++, a simple, general, surprisingly effective, off-the-shelf approach to reduce the training time of various popular models (e.g., ResNet, ConvNeXt, DeiT, PVT, Swin, CSWin, and CAFormer). Up to 3.0× faster on ImageNet-1K/22K without sacrificing accuracy. Source Code released 💙 👉Review https://t.ly/D8ttv 👉Paper https://arxiv.org/pdf/2405.08768 👉Code https://github.com/LeapLabTHU/EfficientTrain

17 151

🪬UHM: Authentic Hand by Phone🪬 👉 META unveils UHM, novel 3D high-fidelity avatarization of your (yes, the your one) hand. Adaptation pipeline fits the pre-trained UHM via phone scan. Source Code released 💙 👉Review https://t.ly/fU5rA 👉Paper https://lnkd.in/dyGaiAnq 👉Code https://lnkd.in/d9B_XFAA

17 151

👻 3D Humans Motion from Text 👻 👉Zhejiang (+ANT) unveils a novel method to generate human motions containing accurate human-object interactions in 3D scenes based on textural descriptions. Code announced, coming 💙 👉Review https://t.ly/eOZnU 👉Paper https://arxiv.org/pdf/2405.07784 👉Project https://zju3dv.github.io/text_scene_motion/

17 151

🐏 AniTalker: Universal Self-Talking Humans 🐏 👉SJTU (+AISpeech) unveils AniTalker, a framework that transforms a single static portrait and input audio into animated talking videos with naturally flowing movements. 👉Review https://t.ly/MD4yX 👉Paper https://arxiv.org/pdf/2405.03121 👉Project https://x-lance.github.io/AniTalker/ 👉Repo https://github.com/X-LANCE/AniTalker

17 151

💥FeatUp: Model at Any Resolution💥 👉FeatUp is a task- and model-agnostic framework to restore lost spatial information in deep features. It outperforms other feature upsampling in class activation map generation, transfer learning for segmentation and depth prediction, and end-to-end training for semantic segm. Source Code released💙 👉Review https://t.ly/Evq_g 👉Paper https://lnkd.in/gweaN4s6 👉Project https://lnkd.in/gWcGXdxt 👉Code https://lnkd.in/gweq5NY4

17 151

🔫 Free-Moving Reconstruction 🔫 👉EPFL (+#MagicLeap) unveils a novel approach for reconstructing free-moving object from monocular RGB clip. Free interaction with objects in front of a moving cam without relying on any prior, and optimizes the sequence globally without any segments. Great but no code announced🥺 👉Review https://t.ly/2xhtj 👉Paper arxiv.org/pdf/2405.05858 👉Project haixinshi.github.io/fmov/

17 151

🦑 Hyper-Detailed Image Descriptions 🦑 👉#Google unveils ImageInWords (IIW), a carefully designed HIL annotation framework for curating hyper-detailed image descriptions and a new dataset resulting from this process 👉Review https://t.ly/engkl 👉Paper https://arxiv.org/pdf/2405.02793 👉Repo https://github.com/google/imageinwords 👉Project https://google.github.io/imageinwords 👉Dataset huggingface.co/datasets/google/imageinwords 👉Explorer huggingface.co/spaces/google/imageinwords-explorer

17 151

🍏 XFeat: Neural Features Matching 🍏 👉XFeat (Accelerated Features) is lightweight/accurate architecture for efficient visual correspondence. It revisits fundamental design choices in CNN for detecting, extracting & matching local features 👉Review https://t.ly/ppb38 👉Paper arxiv.org/pdf/2404.19174 👉Code https://lnkd.in/dFzTpzN8 👉Project https://lnkd.in/d8JnV-iu

17 151

🏷️DiffMOT (#CVPR24): diffusion-MOT🏷️ 👉DiffMOT is a novel real-time diffusion-based MOT approach to tackle the complex nonlinear motion. Impressive results & Source Code released💙 👉Review https://t.ly/ztlHi 👉Paper https://lnkd.in/d4K3c-nt 👉Project https://diffmot.github.io/ 👉Code github.com/Kroery/DiffMOT

17 151

🌊 Diffusive 3D Human Recovery 🌊 👉The Rutgers University unveils ScoreHMR at #CVPR24; novel approach for 3D human pose and shape reconstruction. Impressive results. 👉Review https://t.ly/G0k2D 👉Paper https://arxiv.org/pdf/2403.09623 👉Code https://github.com/statho/ScoreHMR 👉Project https://statho.github.io/ScoreHMR/

17 151

🌐 3D Scenes with Depth Inpainting 🌐 👉Oxford announced two novel contributions to the field of 3D scene generation: a new benchmark and a novel depth completion model. 🤗-Demo and Source Code released💙 👉Review https://t.ly/BKiny 👉Paper https://arxiv.org/pdf/2404.19758 👉Project https://research.paulengstler.com/invisible-stitch/ 👉Code https://github.com/paulengstler/invisible-stitch 👉Demo https://huggingface.co/spaces/paulengstler/invisible-stitch

17 151

🏝️1000x Scalable Neural 3D Fields🏝️ 👉Highly-scalable neural 3D Fields: 1000x reductions in memory maintaining speed/quality: 10 MB vs. 10 GB! Code released 💙 👉Review https://t.ly/sLTK5 👉Paper https://lnkd.in/dEYM8-t2 👉Project https://lnkd.in/djptdujx 👉Code https://lnkd.in/dcCnFZ2n

17 151

🪷 Tunnel Try-on: SOTA VTON 🪷 👉"Tunnel Try-on", the first diffusion-based video virtual try-on model that demonstrates SOTA performance in complex scenarios. No code announced :( 👉Review https://t.ly/joMtJ 👉Paper arxiv.org/pdf/2404.17571 👉Project mengtingchen.github.io/tunnel-try-on-page/

17 151

👗TELA: Text to 3D Clothed Human👗 👉 TELA is a novel approach for the new task of clothing disentangled 3D human model generation from texts. This novel approach unleashes the potential of many downstream applications (e.g., virtual try-on). 👉Review https://t.ly/6N7JV 👉Paper https://arxiv.org/pdf/2404.16748 👉Project https://jtdong.com/tela_layer/ 👉Code https://github.com/DongJT1996/TELA

17 151

🌊 FlowMap: dense depth video 🌊 👉MIT (+CSAIL) unveils FlowMap, a novel E2E differentiable method that solves for precise camera poses, camera intrinsics, and perframe dense depth of a video sequence. Source Code released 💙 👉Review https://lnkd.in/dMgakzWm 👉Paper arxiv.org/pdf/2404.15259.pdf 👉Project cameronosmith.github.io/flowmap 👉Code github.com/dcharatan/flowmap

17 151

🎡 NER-Net: Seeing at Nighttime 🎡 👉Huazhong (+Beijing) unveils a novel event-based nighttime imaging solution under non-uniform illumination, plus a paired multi-illumination level real-world dataset. Repo online, code coming 💙 👉Review https://t.ly/Z9JMJ 👉Paper arxiv.org/pdf/2404.11884.pdf 👉Repo github.com/Liu-haoyue/NER-Net 👉Clip https://www.youtube.com/watch?v=zpfTLCF1Kw4