Artificial Intelligence l l AI Updates
News about AI & DL & ML!!! Admin: @Gayrat_Tangriberganov
Subscribers: 1 615 (+1 in 24 hours, +2 in 7 days, +4 in 30 days)
Post archive
🔗 GitHub_Link
❇️ BLIP3-o 🔥
Join my channel:
👇👇👇👇👇👇
https://t.me/Artificial_Intelligence_Updates
🔗 GitHub_Link
❇️ FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models
#DiffusionModels
🔗 GitHub_Link
❇️ VLDet: Learning Object-Language Alignments for Open-Vocabulary Object Detection 🔥🔥🔥
🔗 GitHub_Link
❇️ Recognize Any Regions 🔥🔥🔥
🔗 GitHub_Link
❇️ SimOWT: A Simple Baseline for Open-World Tracking via Self-training 🔥🔥🔥
🔗 GitHub_Link
❇️ Troy-VIS: Towards Real-Time Open-Vocabulary Video Instance Segmentation
#InstanceSegmentation
🔗 GitHub_Link
❇️ A Unified Tokenizer for Visual Generation and Understanding
#Generative_AI
🔗 GitHub_Link
❇️ Enhancing Creative Generation on Stable Diffusion-based Models
#DiffusionModels
🔗 GitHub_Link
❇️ Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
#Medical_AI
🔗 GitHub_Link
❇️ Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders 🔥
#GazeEstimation
1️⃣ VLMs (Vision-Language Models)
These multimodal models combine visual and textual understanding to interpret images and generate text about them.
2️⃣ SLMs (Small Language Models)
Compact yet powerful models optimized for edge devices with tight energy and latency constraints.
3️⃣ MLMs (Masked Language Models)
The OG bidirectional models that look at both left and right context to understand meaning in text.
4️⃣ SAMs (Segment Anything Models)
Foundation models for universal visual segmentation with pixel-level precision.
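To make the MLM entry above concrete, here is a minimal sketch of the masking idea in plain Python. It is a simplified illustration, not any library's implementation: the helper names `mask_tokens` and `visible_context` are hypothetical, and real BERT-style training also sometimes swaps a selected token for a random one or leaves it unchanged instead of always writing `[MASK]`.

```python
import random

MASK = "[MASK]"  # placeholder token, named as in BERT-style models


def mask_tokens(tokens, mask_prob=0.15, seed=0):
    """Hide a random subset of tokens (hypothetical helper).

    Returns (masked_tokens, labels). labels[i] holds the original token at
    masked positions and None everywhere else.
    """
    rng = random.Random(seed)
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(MASK)
            labels.append(tok)   # the model is trained to recover this token
        else:
            masked.append(tok)
            labels.append(None)  # no prediction target at this position
    return masked, labels


def visible_context(tokens, position, bidirectional=True):
    """Return the (left, right) context available when predicting `position`.

    A masked language model sees both sides; a left-to-right model sees
    only the tokens before the position.
    """
    left = tokens[:position]
    right = tokens[position + 1:] if bidirectional else []
    return left, right


if __name__ == "__main__":
    sentence = "the cat sat on the mat".split()
    print(mask_tokens(sentence, mask_prob=0.3, seed=1))
    print(visible_context(sentence, 2))                       # both sides
    print(visible_context(sentence, 2, bidirectional=False))  # left side only
```

The second helper shows the difference the post points at: with `bidirectional=True` the prediction for a hidden word can use the words after it as well as the words before it.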
#just4study
1️⃣ LLMs (Large Language Models)
These foundational models process text token-by-token, enabling everything from creative writing to complex reasoning.
2️⃣ LCMs (Large Concept Models)
Meta's newer approach encodes entire sentences as "concepts" in the SONAR embedding space, working at the sentence level rather than word by word.
3️⃣ LAMs (Large Action Models)
Emerging models that bridge understanding with action, executing tasks through system-level operations.
4️⃣ MoE (Mixture of Experts)
These models route each input to only a few relevant expert networks, so just a fraction of the parameters is active at a time, cutting compute while keeping overall model capacity.
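The MoE entry above can be sketched as top-k routing in plain Python. This is an illustration under stated assumptions, not a specific model's code: the function names are hypothetical, the "experts" are toy functions on a single number, and the gate scores are passed in by hand, whereas a real MoE layer computes them with a learned router for each token.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(gate_scores, k=2):
    """Pick the k highest-scoring experts (hypothetical helper).

    Returns {expert_index: weight}, with weights renormalized so that the
    chosen experts' weights sum to 1.
    """
    probs = softmax(gate_scores)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in chosen)
    return {i: probs[i] / norm for i in chosen}


def moe_forward(x, experts, gate_scores, k=2):
    """Run only the selected experts and mix their outputs by gate weight."""
    routing = top_k_route(gate_scores, k)
    return sum(weight * experts[i](x) for i, weight in routing.items())


if __name__ == "__main__":
    # Four toy experts; only two of them run for this input.
    experts = [lambda v: v + 1, lambda v: v * 2, lambda v: -v, lambda v: v * v]
    scores = [0.1, 2.0, -1.0, 2.0]  # hand-written stand-in for router output
    print(top_k_route(scores, k=2))
    print(moe_forward(3.0, experts, scores, k=2))
```

With `k=2` and four experts, half of the expert functions are never called for a given input, which is where the compute saving comes from.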
#just4study
🔗 GitHub_Link
❇️ PixelHacker: Image Inpainting with Structural and Semantic Consistency 🔥
🔗 GitHub_Link
❇️ MotionDirector Training For AnimateDiff. Train a MotionLoRA and run it on any compatible AnimateDiff UI 🔥
#ComfyUI
🔗 GitHub_Link
❇️ ComfyUI nodes to use AnimateDiff-MotionDirector🔥
#ComfyUI
🔗 GitHub_Link
❇️ Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels 🔥
#4D
🔗 GitHub_Link
❇️ RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
#3D
🔗 GitHub_Link
❇️ VSP-LLM (Visual Speech Processing incorporated with LLMs)
#LLMs
🔗 GitHub_Link
❇️ In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer 🔥
