Artificial Intelligence l l AI Updates
News about AI & DL & ML!!! Admin: @Gayrat_Tangriberganov
Subscribers: 1,615
Last 24 hours: no data
Last 7 days: +4
Last 30 days: +5
Post archive
GitHub_Link
Distributed Swarm Trajectory Optimization for Formation Flight in Dense Environments.
Join my channel:
https://t.me/Artificial_Intelligence_Updates
GitHub_Link
PDNet: Toward Better One-Stage Object Detection With Prediction Decoupling
#ObjectDetection
GitHub_Link
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
#DiffusionModels
GitHub_Link
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
#SAM
GitHub_Link
Prompt Segment Anything
#SAM
GitHub_Link
Use open-source LLMs in PostgreSQL with Ollama and the new open-source extension pgai
What is Ollama?
Ollama is an easy and popular way to use open-source language models like Llama 3, Mistral, Phi 3, and Gemma. Unlike proprietary models, open-source models are private, free (hardware costs aside), can run locally, and are customizable.
What is pgai?
Pgai is an open-source PostgreSQL extension that integrates AI models with PostgreSQL data, simplifying AI engineering for developers familiar with PostgreSQL and facilitating RAG and search.
What can I do with pgai and Ollama?
Create embeddings on PostgreSQL data using models like BERT and Llama 3, storing them in pgvector for easy search and RAG. Perform RAG and LLM reasoning tasks using models like Llama 3, Mistral, and Gemma, enabling summarization, categorization, and data enrichment via SQL queries.
#LLMs
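The embed, search, and RAG workflow described in this post can be illustrated with a minimal Python sketch. It does not call pgai, pgvector, or Ollama: toy_embed is a hypothetical bag-of-words stand-in for a real embedding model, ROWS stands in for rows of a PostgreSQL table, and retrieve and build_prompt are illustrative names only. In the setup the post describes, the embeddings would come from a model served by Ollama and the similarity search would run inside PostgreSQL.

```python
import math
from collections import Counter


def toy_embed(text):
    """Hypothetical stand-in for an embedding model: unit-length bag of words."""
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values()))
    if norm == 0:
        return {}
    return {word: c / norm for word, c in counts.items()}


def cosine(a, b):
    """Dot product of two unit-length sparse vectors, i.e. cosine similarity."""
    return sum(weight * b.get(word, 0.0) for word, weight in a.items())


def retrieve(query, rows, k=1):
    """Return the k rows whose embeddings are most similar to the query."""
    q = toy_embed(query)
    ranked = sorted(rows, key=lambda row: cosine(q, toy_embed(row)), reverse=True)
    return ranked[:k]


def build_prompt(query, context_rows):
    """Assemble the retrieved rows and the question into one prompt string."""
    context = "\n".join(context_rows)
    return f"Context:\n{context}\n\nQuestion: {query}"


# Assumed example data, standing in for rows stored in a database table.
ROWS = [
    "pgvector stores embeddings in a PostgreSQL column",
    "Ollama runs open-source language models locally",
    "The cafeteria menu changes every Monday",
]

QUERY = "how are embeddings stored in postgresql"
top = retrieve(QUERY, ROWS, k=1)
prompt = build_prompt(QUERY, top)
print(prompt)
```

The prompt built at the end is what would be handed to a language model for the generation step. Swapping toy_embed for a real model changes the vectors but not the shape of the retrieve-then-prompt flow.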
GitHub_Link
TF-ID: Table/Figure IDentifier for academic papers.
Seeing the open-source community develop small, cost-effective customized vision-language models (VLMs) that outperform the much larger closed-source APIs is really impressive.
One of them is the TF-ID model by Yifei Huang. It's a fine-tuned version of Florence-2, the small but very powerful VLM by Microsoft. TF-ID (Table/Figure IDentifier) is a family of object detection models fine-tuned to extract tables and figures from academic papers. Interestingly, the author labeled 4,600 images by hand, ensuring high data quality!
#VLMs
GitHub_Link
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
It's really cool to see the open-source community creating small and cheap customized vision-language models (VLMs) that outperform the much larger closed-source APIs.
ChartGemma is a fine-tuned version of PaliGemma created by Megh Thakkar and team, which excels at answering questions regarding charts and plots. The idea is pretty simple: first use a closed-source API like Gemini 1.5 Flash to collect training data, then fine-tune the open PaliGemma model on it. You end up with a model that is much smaller and cheaper to run for this specific niche task, and it outperforms the closed-source APIs!
#VLMs
GitHub_Link
Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
#NeRF
GitHub_Link
OmniDrive: LLM-Agent for Autonomous Driving with 3D Perception, Reasoning and Planning
#LLMs
GitHub_Link
Shape of Motion: 4D Reconstruction from a Single Video
GitHub_Link
DataComp for Language Models
Apple has entered the game! Apple just released a 7B open-source LLM along with its weights, training code, and dataset!
7B base model, trained on 2.5T tokens from open datasets
Primarily English data and a 2048-token context window
Combined DCLM-BASELINE, StarCoder, and ProofPile2 data
MMLU 0.6372: higher than Mistral, lower than Llama 3
Open license: Apple Sample Code License
Matches closed-dataset models like Mistral
Trained using PyTorch with the OpenLM framework
Available on Hugging Face and in Transformers
#LLMs
GitHub_Link
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
#SceneTextSpotting
GitHub_Link
MetaSeg: Packaged version of the Segment Anything
#SAM
GitHub_Link
Robotic Transformer 2 (RT-2): The Vision-Language-Action Model
#Robotics
GitHub_Link
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation
#Robotics
GitHub_Link
OmniTokenizer: one model and one weight for image-video joint tokenization
#VideoGeneration
GitHub_Link
Big news: Andrej Karpathy is launching a new AI education company called Eureka Labs. Its first product will be an AI course, LLM101n, billed as the world's best.
#LLMs
GitHub_Link
UW-Madison-GI-Tract-Segmentation-Data-Tools
#MedicalAi
GitHub_Link
PowerPaint: A Versatile Image Inpainting Model
