Artificial Intelligence l l AI Updates
Open in Telegram
News about AI & DL & ML!!! Admin: @Gayrat_Tangriberganov
Show more1 615
Subscribers
+124 hours
+27 days
+430 days
Posts Archive
π GitHub_Link
βοΈ BLIP3-o π₯
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models
#DiffusionModels
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ VLDet: Learning Object-Language Alignments for Open-Vocabulary Object Detection π₯π₯π₯
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ Recognize Any Regions π₯π₯π₯
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ SimOWT: A Simple Baseline for Open-World Tracking via Self-training π₯π₯π₯
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ Troy-VIS: Towards Real-Time Open-Vocabulary Video Instance Segmentation
#InstanceSegmentation
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ A Unified Tokenizer for Visual Generation and Understanding
#Generative_AI
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ Enhancing Creative Generation on Stable Diffusion-based Models
#DiffusionModels
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
#Medical_AI
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders π₯
#GazeEstimation
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
1οΈβ£ VLMs (Vision-Language Models)
These multimodal combine visual and textual understanding to interpret images and generate text about them.
2οΈβ£ SLMs (Small Language Models)
Compact yet powerful models optimized for edge devices with tight energy and latency constraints.
3οΈβ£ MLMs (Masked Language Models)
The OG bidirectional models that look at both left and right context to understand meaning in text.
4οΈβ£ SAMs (Segment Anything Models)
Foundation models for universal visual segmentation with pixel-level precision.
#just4study
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
1οΈβ£ LLMs (Large Language Models)
These foundational models process text token-by-token, enabling everything from creative writing to complex reasoning.
2οΈβ£ LCMs (Large Concept Models)
Meta's newer approach encodes entire sentences as "concepts" in SONAR embedding space, transcending word-level processing.
3οΈβ£ LAMs (Large Action Models)
Emerging models that bridge understanding with action, executing tasks through system-level operations.
4οΈβ£ MoE (Mixture of Experts)
These models activate only relevant expert networks per query, dramatically improving efficiency while maintaining performance.
#just4study
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ PixelHacker: Image Inpainting with Structural and Semantic Consistency π₯
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ MotionDirector Training For AnimateDiff. Train a MotionLoRA and run it on any compatible AnimateDiff UI π₯
#ComfyUI
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ ComfyUI nodes to use AnimateDiff-MotionDirectorπ₯
#ComfyUI
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels π₯
#4D
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
#3D
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
#3D
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ VSP-LLM (Visual Speech Processing incorporated with LLMs)
#LLMs
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
π GitHub_Link
βοΈ In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer π₯
Join my channel:
ππππππ
https://t.me/Artificial_Intelligence_Updates
Available now! Telegram Research 2025 β the year's key insights 
