AI with Papers - Artificial Intelligence & Deep Learning

Open in Telegram

All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/ #AI #chatGPT

Malaysia2 240 Technologies & Applications7 726...

📈 Analytical overview of Telegram channel AI with Papers - Artificial Intelligence & Deep Learning

Channel AI with Papers - Artificial Intelligence & Deep Learning (@ai_deeplearning) in the English language segment is an active participant. Currently, the community unites 17 154 subscribers, ranking 7 726 in the Technologies & Applications category and 2 240 in the Malaysia region.

📊 Audience metrics and dynamics

Since its creation on невідомо, the project has demonstrated rapid growth, gathering an audience of 17 154 subscribers.

According to the latest data from 21 June, 2026, the channel demonstrates stable activity. Although there has been a change in the number of participants by -166 over the last 30 days and by -6 over the last 24 hours, overall reach remains high.

Verification status: Not verified
Engagement rate (ER): The average audience engagement rate is 23.63%. Within the first 24 hours after publication, content typically collects 6.86% reactions from the total number of subscribers.
Post reach: On average, each post receives 4 057 views. Within the first day, a publication typically gains 1 177 views.
Reactions and interaction: The audience actively supports content: the average number of reactions per post is 26.
Thematic interests: Content is focused on key topics such as framework, object, dataset, tba, depth.

📝 Description and content policy

The author describes the resource as a platform for expressing subjective opinions:
“All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/ #AI #chatGPT”

Thanks to the high frequency of updates (latest data received on 22 June, 2026), the channel maintains relevance and a high level of publication reach. Analytics show that the audience actively interacts with content, making it an important point of influence in the Technologies & Applications category.

17 154

Subscribers

-624 hours

-277 days

-16630 days

4 057

Post views

~ 1 17724 hours

~ 1 35348 hours

23.63%

Engagement rate

No data

Posts per day

Ads index

beta

Posts Archive

17 154

🔥🔥🔥🔥🔥 SOURCE CODE IS OUT !!! 🔥🔥🔥🔥🔥 Thanks Danny for the info 🥇

17 154

🦧Sapiens: SOTA ViTs for human🦧 👉META unveils Sapiens, a family of models for human-centric vision tasks: 2D pose estimation, body-part segmentation, depth estimation, and surface normal prediction. Source Code announced, coming💙 👉Review https://t.ly/GKQI0 👉Paper arxiv.org/pdf/2408.12569 👉Project rawalkhirodkar.github.io/sapiens 👉Code github.com/facebookresearch/sapiens

17 154

🦓 Zebra Detection & Pose 🦓 👉The first synthetic dataset that can be used for both detection and 2D pose estimation of zebras without applying any bridging strategies. Code, results, models, and the synthetic, training/validation data, including 104K manually labeled images open-sourced💙 👉Review https://t.ly/HTEZZ 👉Paper https://lnkd.in/dQYT-fyq 👉Project https://lnkd.in/dAnNXgG3 👉Code https://lnkd.in/dhvU97xD

17 154

🏗️ #Adobe Instant TurboEdit 🏗️ 👉Adobe unveils a novel real-time text-based disentangled real image editing method built upon 4-step SDXL Turbo. SOTA HQ image editing using ultra fast few-step diffusion. No code announced but easy to guess it will be released in commercial tools. 👉Review https://t.ly/Na7-y 👉Paper https://lnkd.in/dVs9RcCK 👉Project https://lnkd.in/dGCqwh9Z 👉Code 😢

17 154

🧪 Click-Attention Segmentation 🧪 👉An interesting image patch-based click attention algorithm and an affinity loss inspired by SASFormer. This novel approach aims to decouple positive and negative clicks, guiding positive ones to focus on the target object and negative ones on the background. Code released under Apache💙 👉Review https://t.ly/tG05L 👉Paper https://arxiv.org/pdf/2408.06021 👉Code https://github.com/hahamyt/ClickAttention

17 154

👋 Real-time Expressive Hands 👋 👉Zhejiang unveils XHand, a novel expressive hand avatar designed to comprehensively generate hand shape, appearance, and deformations in real-time. Source Code released (Apache 2.0) the Jul. 31st, 2024💙 👉Review https://t.ly/8obbB 👉Project https://lnkd.in/dRtVGe6i 👉Paper https://lnkd.in/daCx2iB7 👉Code https://lnkd.in/dZ9pgzug

17 154

🔥🔥 SAM v2 is out! 🔥🔥 👉#Meta announced SAM 2, the novel unified model for real-time promptable segmentation in images and videos. 6x faster, it's the new SOTA by a large margin. Source Code, Dataset, Models & Demo released under permissive licenses💙 👉Review https://t.ly/oovJZ 👉Paper https://t.ly/sCxMY 👉Demo https://sam2.metademolab.com 👉Project ai.meta.com/blog/segment-anything-2/ 👉Models github.com/facebookresearch/segment-anything-2

17 154

🪄 Diffusion Models for Transparency 🪄 👉MIT (+ #Google) unveils Alchemist, a novel method to control material attributes of objects like roughness, metallic, albedo & transparency in real images. Amazing work but code not announced🥺 👉Review https://t.ly/U98_G 👉Paper arxiv.org/pdf/2312.02970 👉Project www.prafullsharma.net/alchemist/

17 154

🎁 A guide for modern CV 🎁 👉In the last 18 months I received more than 1,100+ applications for research roles. The majority part of the applicants doesn't deeply know a few milestones in CV. Here a short collection of mostly-free resources to spend a bit of good time in the summer. 𝐁𝐨𝐨𝐤𝐬 (recommended): ✅DL with Python https://t.ly/VjaVx ✅Python OOP https://t.ly/pTQRm 𝐎𝐧𝐥𝐢𝐧𝐞 V𝐢𝐝𝐞𝐨 𝐂𝐨𝐮𝐫𝐬𝐞𝐬 (recommended): ✅Berkeley | Modern CV (2023) https://t.ly/AU7S3 𝐋𝐢𝐛𝐫𝐚𝐫𝐢𝐞𝐬: ✅PyTorch https://lnkd.in/dTvJbjAx ✅PyTorchLighting https://lnkd.in/dAruPA6T ✅Albumentations https://albumentations.ai/ 𝐏𝐚𝐩𝐞𝐫𝐬: ✅EfficientNet https://lnkd.in/dTsT44ae ✅ViT https://lnkd.in/dB5yKdaW ✅UNet https://lnkd.in/dnpKVa6T ✅DeepLabV3+ https://lnkd.in/dVvqkmPk ✅YOLOv1: https://lnkd.in/dQ9rs53B ✅YOLOX: https://lnkd.in/d9ZtsF7g 👉More papers and the full list: https://t.ly/WAwAk

17 154

👽Keypoint Promptable Re-ID👽 👉KPR is a novel formulation of the ReID problem that explicitly complements the input BBox with a set of semantic keypoints indicating the intended target. Code, dataset and annotations coming soon💙 👉Review https://t.ly/vCXV_ 👉Paper https://arxiv.org/pdf/2407.18112 👉Repo github.com/VlSomers/keypoint_promptable_reidentification

17 154

🧱EAFormer: Scene Text-Segm.🧱 👉A novel Edge-Aware Transformers to segment texts more accurately, especially at the edge of texts. FULL re-annotation of COCO_TS and MLT_S! Code coming, data available on 🤗 👉Review https://t.ly/0G2uX 👉Paper https://arxiv.org/pdf/2407.17020 👉Project https://hyangyu.github.io/EAFormer/ 👉Data huggingface.co/datasets/HaiyangYu/TextSegmentation/tree/main

17 154

🐢 TAPTRv2: new SOTA for TAP 🐢 👉TAPTRv2: Transformer-based approach built upon TAPTR for solving the Tracking Any Point (TAP) task. TAPTR borrows designs from DETR and formulates each tracking point as a point query, making it possible to leverage well-studied operations in DETR-like algorithms. The Source Code of V1 is available, V2 coming💙 👉Review https://t.ly/H84ae 👉Paper v1 https://lnkd.in/d4vD_6xx 👉Paper v2 https://lnkd.in/dE_TUzar 👉Project https://taptr.github.io/ 👉Code https://lnkd.in/dgfs9Qdy

17 154

🏆Who's the REAL SOTA tracker in the world?🏆 👉BofN meta-tracker outperforms, by a large margin, existing SOTA trackers on nine standard benchmarks (LaSOT, TrackingNet, GOT-10K, VOT2019, VOT2021, VOT2022, UAV123, OTB100, and WebUAV-3M). Source Code available💙 👉Review https://t.ly/WB9AR 👉Paper https://arxiv.org/pdf/2407.15707 👉Code https://github.com/BasitAlawode/Best_of_N_Trackers

17 154

🎭 TRG: new SOTA in 6DoF Head 🎭 👉ECE (Korea) unveils TRG, a novel landmark-based method for estimating a 6DoF head pose which stands out for its explicit bidirectional interaction structure. Experiments on ARKitFace & BIWI confirm it's the new SOTA. Source Code & Models to be released💙 👉Review https://t.ly/lOIRA 👉Paper https://lnkd.in/dCWEwNyF 👉Code https://lnkd.in/dzRrwKBD

17 154

🧿 Shape of Motion for 4D 🧿 👉 Google (+Berkeley) unveils a novel method capable of reconstructing generic dynamic scenes, featuring explicit, full-sequence-long 3D motion, from casually captured monocular videos. Impressive tracking capabilities. Source Code released 💙 👉Review https://t.ly/d9RsA 👉Project https://shape-of-motion.github.io/ 👉Paper arxiv.org/pdf/2407.13764 👉Code github.com/vye16/shape-of-motion/

17 154

Hi folks, I need you help 🙏 👉 Could you help me understanding what do you think about the lasting of the hiring process for #artificialintelligence roles? Any comment here will be appreciated :) Vote here: https://t.ly/UMRXH Thanks <3

17 154

📈Gradient Boosting Reinforcement Learning📈 👉#Nvidia unveils GBRL, a framework that extends the advantages of Gradient Boosting Trees to the RL domain. GBRL adapts the power of Gradient Boosting Trees to the unique challenges of RL environments, including non-stationarity and absence of predefined targets. Code released💙 👉Review https://t.ly/zv9pl 👉Paper https://arxiv.org/pdf/2407.08250 👉Code https://github.com/NVlabs/gbrl

17 154

💌 KineTy: Typography Diffusion 💌 👉GIST introduces a novel realistic kinetic typography generation driven by text description. Guided video diffusion models to achieve visually-pleasing text appearances. Repo to be released under Attribution-NC 4.0💙 👉Review https://t.ly/2FWo9 👉Paper arxiv.org/pdf/2407.10476 👉Project seonmip.github.io/kinety/ 👉Repo github.com/SeonmiP/KineTy/tree/main

17 154

🥥 OmniNOCS: largest 3D NOCS 🥥 👉OmniNOCS by #Google (+Georgia) is a unified NOCS (Normalized Object Coordinate Space) dataset that contains data across different domains with 90+ object classes. The largest NOCS dataset to date. Data & Code available under Apache 2.0💙 👉Review https://t.ly/xPgBn 👉Paper arxiv.org/pdf/2407.08711 👉Project https://omninocs.github.io/ 👉Data github.com/google-deepmind/omninocs