AI with Papers - Artificial Intelligence & Deep Learning

Kanalga Telegram’da o‘tish

All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/ #AI #chatGPT

Ko'proq ko'rsatish

Malayziya2 238 Texnologiyalar & Aralashmalar7 725...

📈 Telegram kanali AI with Papers - Artificial Intelligence & Deep Learning analitikasi

AI with Papers - Artificial Intelligence & Deep Learning (@ai_deeplearning) Ingliz til segmentidagi kanali faol ishtirokchi. Hozirda hamjamiyat 17 173 obunachidan iborat bo'lib, Texnologiyalar & Aralashmalar toifasida 7 725-o'rinni va Malayziya mintaqasida 2 238-o'rinni egallagan.

📊 Auditoriya ko‘rsatkichlari va dinamika

невідомо sanasidan buyon loyiha tez o‘sib, 17 173 obunachiga ega bo‘ldi.

19 Iyun, 2026 dagi oxirgi ma’lumotlarga ko‘ra kanal barqaror faollikka ega. Oxirgi 30 kunda obunachilar soni -177 ga, so‘nggi 24 soatda esa -9 ga o‘zgardi va umumiy qamrov yuqori darajada qolmoqda.

Tasdiqlash holati: Tasdiqlanmagan
Jalb etish (ER): Auditoriya o‘rtacha 21.83% darajada jalb etiladi. Nashrdan keyingi dastlabki 24 soatda kontent odatda umumiy obunachilar sonining N/A% ini tashkil etuvchi reaksiyalarni to‘playdi.
Post qamrovi: Har bir post o‘rtacha 3 749 marta ko‘riladi; birinchi sutkada odatda 0 ta ko‘rish yig‘iladi.
Reaksiyalar va o‘zaro ta’sir: Auditoriya faol: har bir postga o‘rtacha 26 ta reaksiya keladi.
Tematik yo‘nalishlar: Kontent framework, object, dataset, tba, depth kabi asosiy mavzularga jamlangan.

📝 Tavsif va kontent siyosati

Muallif resursni shaxsiy fikrni ifoda etish maydoni sifatida ta’riflaydi:
“All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/ #AI #chatGPT”

Yuqori yangilanish chastotasi (oxirgi ma’lumot 20 Iyun, 2026 da olingan) sababli kanal doimo dolzarb va katta qamrovli bo‘lib qoladi. Analitika auditoriya kontent bilan faol hamkorlik qilishini, uni Texnologiyalar & Aralashmalar toifasidagi muhim ta’sir nuqtasiga aylantirishini ko‘rsatadi.

17 173

Obunachilar

-924 soatlar

-397 kunlar

-17730 kunlar

3 749

Post ko'rishlar

Ma'lumot yo'q24 soatlar

Ma'lumot yo'q48 soatlar

21.83%

Muloqot nisbati

Ma'lumot yo'q

Kuniga postlar

Ads index

beta

Postlar arxiv

17 173

Here the preview, tomorrow the full clip from official source :)

17 173

Hinton our guest in Pavia (remotely) 💚😈

17 173

🔥BoxerNet: SOTA 2D->3D BBs🔥 👉Boxer by #META: transformer-based network to lift 2D BB proposals into 3D, followed by multi-view fusion and geometric filtering to produce globally consistent de-duplicated 3DBBs in metric world space. Repo under A-NC 4.0 International💙 👉Review https://t.ly/mlmV1 👉Paper https://arxiv.org/pdf/2604.05212 👉Project facebookresearch.github.io/boxer/ 👉Repo github.com/facebookresearch/boxer

17 173

🔥Vanast: VTON w/ Human Animation🔥 👉SNU unveils a novel unified framework that generates garment-transferred human animation videos directly from a single human/garment images, and pose guidance clip. Repo announced💙 👉Review https://t.ly/c0t79 👉Paper arxiv.org/pdf/2604.04934 👉Project hyunsoocha.github.io/vanast/ 👉Repo github.com/snuvclab/vanast

17 173

🍎Video Object Deletion🍎 👉Void by Netflix is a novel video object removal framework designed to perform physically-plausible inpainting in very complex scenarios. Repo under Apache 2.0💙 👉Review https://t.ly/cMVny 👉Paper https://arxiv.org/pdf/2604.02296 👉Project https://void-model.github.io/ 👉Repo https://github.com/Netflix/void-model

17 173

If you have to invest TODAY 1B$ on a frontier tech for the next decade, would you invest in space, agentic, quantum or frugal GPUs? Vote here: https://t.ly/hSx6i

17 173

🪬Camera Raw Image Generation🪬 👉RawGen by #Samsung is a generative approach that learns the complex distribution of raw sensor data directly, enabling high-fidelity generation from either text descriptions or standard sRGB images across arbitrary camera sensors. Linear raw image once, then apply any ISP operation. Repo announced💙 👉Review https://t.ly/_QVKP 👉Paper https://arxiv.org/pdf/2604.00093 👉Project https://dy112.github.io/rawgen-page/ 👉Repo TBA

17 173

🌵SOTA Training-Free In-Context Segmentation🌵 👉INSID3 is the new SOTA, training-free approach that segments concepts at varying granularities only from frozen DINOv3 features, given an in-context example. Repo under Apache 2.0💙 👉Review https://t.ly/NVWHN 👉Paper https://arxiv.org/pdf/2603.28480 👉Project https://visinf.github.io/INSID3/ 👉Repo https://github.com/visinf/INSID3

17 173

👌HandX: Scaling Hands Motion👌 👉 HandX is a unified foundation spanning data, annotation, and evaluation: novel large-scale dataset of bimanual & dexterous motions with fine-grained textual. Around 6M frames. Repo available💙 👉Review https://t.ly/1nGxw 👉Paper https://arxiv.org/pdf/2603.28766 👉Project https://handx-project.github.io/ 👉Repo github.com/handx-project/HandX

17 173

💥 GaussianGPT 3D GSC💥 👉From TUM, GaussianGPT: transformer-based 3D Gaussians generation via next-token prediction -> full 3D complex indoor scene. Repo announced💙 👉Review https://t.ly/bj-lL 👉Paper https://arxiv.org/pdf/2603.26661 👉Project https://nicolasvonluetzow.github.io/GaussianGPT/ 👉Repo TBA

17 173

🐍Pose-Appearance-Motion for HOI🐍 👉PAM is a novel Pose–Appearance–Motion Engine for controllable Hand–Object Interaction SOTA video generation. Repo/models available💙 👉Review 👉Paper arxiv.org/pdf/2603.22193 👉Project gasaiyu.github.io/PAM.github.io/ 👉Repo https://github.com/GasaiYU/PAM

17 173

🦪OccAny: Universal 3D Occupancy🦪 👉OccAny by Valeo is a novel unified framework for generalized unconstrained urban 3D occupancy prediction. Repo under Apache 2.0💙 👉Review 👉Paper https://arxiv.org/pdf/2603.23502 👉Project https://valeoai.github.io/OccAny/ 👉Repo https://github.com/valeoai/OccAny

17 173

🍓Material-Aware Grouping🍓 👉Material Magic Wand (Adobe) is a tool for material-aware grouping of parts in untextured 3D meshes. Given one selected part, it automatically retrieves the other parts in the same shape by its material. Repo announced💙 👉Review https://t.ly/q00SU 👉Paper https://arxiv.org/pdf/2603.17370 👉Project umangi-jain.github.io/material-magic-wand/ 👉Repo TBA

17 173

🍧10,000× faster SAM-3D🍧 👉Fast SAM 3D Body achieves up to 10.9× speedup, over 10,000× faster MHR-to-SMPL conversion -> real-time humanoid control from RGB. Repo available💙 👉Review https://t.ly/uHx84 👉Paper https://arxiv.org/pdf/2603.15603 👉Project yangtiming.github.io/Fast-SAM-3D-Body-Page/ 👉Repo https://github.com/yangtiming/Fast-SAM-3D-Body

17 173

🤖Physically-Plausible Human🤖 👉PhysMoDPO is a novel direct preference optimization framework for humanoid motion generation. Repo under MIT💙 👉Review https://t.ly/clf8w 👉Paper https://arxiv.org/pdf/2603.13228 👉Project https://mael-zys.github.io/PhysMoDPO/ 👉Repo https://github.com/Mael-zys/PhysMoDPO

17 173

🌈 New SOTA Video Depth 🌈 👉DVD is the new Video Depth Estimation SOTA with full training suite available under Apache2.0💙 👉Review https://t.ly/gpCkG 👉Paper https://arxiv.org/pdf/2603.12250 👉Project https://dvd-project.github.io/ 👉Repo github.com/EnVision-Research/DVD

17 173

☄️OmniStream: Perceive-Reconstruct-Act ☄️ 👉Novel unified streaming visual backbone that effectively perceives, reconstructs, and acts from diverse visual inputs. Repo/Models announced💙 👉Review https://t.ly/_zZMO 👉Paper arxiv.org/pdf/2603.12265 👉Project go2heart.github.io/omnistream/ 👉Repo github.com/Go2Heart/OmniStream

17 173

🍓Surface Light Tokenizer🍓 👉Apple unveils LITO a novel latent flow matching model enables HQ image-to-3D. Latent representation that encodes a surface light field into a compact set of latent vectors. Impressive results but no code🥲 👉Review https://t.ly/xcWNe 👉Paper https://lnkd.in/dYHwY4YX 👉Project https://lnkd.in/dtJT8bXy

17 173

🔥Holistic 3D Spatial Intelligence🔥 👉Holi-Spatial is the first fully automated pipeline capable of converting raw video streams into holistic 3D spatial annotations without human intervention. Code/Data announced💙 👉Review https://t.ly/PDpr9 👉Paper https://lnkd.in/dTbMuZCm 👉Project https://lnkd.in/d66CYB4q 👉Repo https://lnkd.in/dAGzShXj

17 173

📊Real-Time Scene Graph📊 👉REACT++ by Umea University is the new state-of-the-art model for real-time SGG: 20% faster with a gain of 10% in relation prediction accuracy on average. Code under MIT💙 👉Review https://t.ly/c12VX 👉Paper https://arxiv.org/pdf/2603.06386 👉Repo https://github.com/Maelic/SGG-Benchmark