uz
Feedback
Data science research papers

Data science research papers

Kanalga Telegramโ€™da oโ€˜tish

Machine learning and data science research papers Key ML and AI papers with code and GitHub repos. Simple way to follow current research. Join ๐Ÿ‘‰ https://rebrand.ly/bigdatachannels DMCA: @disclosure_bds Contact: @mldatascientist

Ko'proq ko'rsatish
2 900
Obunachilar
+224 soatlar
+77 kunlar
+4730 kunlar
Obunachilarni jalb qilish
Iyun '26
Iyun '26
+48
0 kanalda
May '26
+104
1 kanalda
Get PRO
Aprel '26
+114
0 kanalda
Get PRO
Mart '26
+101
0 kanalda
Get PRO
Fevral '26
+82
0 kanalda
Get PRO
Yanvar '26
+118
9 kanalda
Get PRO
Dekabr '25
+115
0 kanalda
Get PRO
Noyabr '25
+112
0 kanalda
Get PRO
Oktabr '25
+43
0 kanalda
Get PRO
Sentabr '25
+7
0 kanalda
Get PRO
Avgust '25
+3
0 kanalda
Get PRO
Iyul '25
+3
0 kanalda
Get PRO
Iyun '25
+2
0 kanalda
Get PRO
May '25
+3
0 kanalda
Get PRO
Aprel '25
+15
0 kanalda
Get PRO
Mart '25
+112
0 kanalda
Get PRO
Fevral '25
+153
0 kanalda
Get PRO
Yanvar '25
+187
0 kanalda
Get PRO
Dekabr '24
+179
0 kanalda
Get PRO
Noyabr '24
+165
0 kanalda
Get PRO
Oktabr '24
+136
0 kanalda
Get PRO
Sentabr '24
+108
0 kanalda
Get PRO
Avgust '24
+114
0 kanalda
Get PRO
Iyul '24
+139
0 kanalda
Get PRO
Iyun '24
+115
0 kanalda
Get PRO
May '24
+132
1 kanalda
Get PRO
Aprel '24
+109
0 kanalda
Get PRO
Mart '24
+146
0 kanalda
Get PRO
Fevral '24
+183
0 kanalda
Get PRO
Yanvar '24
+228
0 kanalda
Get PRO
Dekabr '23
+171
1 kanalda
Get PRO
Noyabr '23
+28
0 kanalda
Get PRO
Oktabr '23
+28
0 kanalda
Get PRO
Sentabr '23
+504
0 kanalda
Sana
Obunachilarni jalb qilish
Esdaliklar
Kanallar
17 Iyun0
16 Iyun+3
15 Iyun+3
14 Iyun+3
13 Iyun+2
12 Iyun+1
11 Iyun+5
10 Iyun0
09 Iyun+6
08 Iyun+6
07 Iyun+4
06 Iyun+1
05 Iyun+4
04 Iyun+3
03 Iyun+2
02 Iyun+5
01 Iyun0
Kanal postlari
Lip Forcing: Few-Step Autoregressive Diffusion for Real-time Lip Synchronization ๐Ÿ“… Publication Date: Jun 9, 2026 ๐Ÿ“‘ Paper: h
Lip Forcing: Few-Step Autoregressive Diffusion for Real-time Lip Synchronization ๐Ÿ“… Publication Date: Jun 9, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2606.11180 ๐Ÿ’ป Project Page: https://cvlab-kaist.github.io/LipForcing/ ๐Ÿ“ Description: The paper presents Lip Forcing, a novel autoregressive diffusion method for real-time video-to-video lip synchronization. The problem with existing diffusion-based lip synchronization models is that they require full-sequence bidirectional attention and many denoising steps, making them impractical for real-time inference. To address this issue, the authors propose a method that distills a large audio-conditioned bidirectional video diffusion teacher into smaller causal students. #LipSynchronization #AutoregressiveDiffusion #RealTimeVideoProcessing #VideoToVideoSynthesis #DiffusionBasedModels

2
Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets ๐Ÿ“… Publication D
Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets ๐Ÿ“… Publication Date: Apr 24, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.22294 ๐Ÿ”— Code: N/A ๐Ÿ“ Description: SLIDERS tackles long-document QA by extracting information into a relational database and using SQL for structured reasoning. This avoids LLM context window issues and aggregation bottlenecks, significantly outperforming traditional methods on various benchmarks. #QuestionAnswering #NLP #AI #SQL #LongDocuments
88
3
LLM Safety From Within: Detecting Harmful Content with Internal Representations ๐Ÿ“… Publication Date: Apr 20, 2026 ๐Ÿ“‘ Paper: h
LLM Safety From Within: Detecting Harmful Content with Internal Representations ๐Ÿ“… Publication Date: Apr 20, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.18519 ๐Ÿ”— Code: https://github.com/CSSLab/SIREN ๐Ÿ“Š Models citing this paper: โ€ข https://huggingface.co/UofTCSSLab/SIREN-Qwen3-0.6B โ€ข https://huggingface.co/UofTCSSLab/SIREN-Qwen3-4B โ€ข https://huggingface.co/UofTCSSLab/SIREN-Llama-3.2-1B ๐Ÿ“ Description: SIREN is a lightweight guard model that uses LLM internal layer features to detect harmful content, outperforming current models. It is more efficient, generalizes better, and requires significantly fewer parameters than existing guard models. #LLMSafety #AIethics #HarmfulContent #DeepLearning #NLP
145
4
TexOCR: Advancing Document OCR Models for Compilable Page-to-LaTeX Reconstruction ๐Ÿ“… Publication Date: Apr 24, 2026 ๐Ÿ“‘ Paper:
TexOCR: Advancing Document OCR Models for Compilable Page-to-LaTeX Reconstruction ๐Ÿ“… Publication Date: Apr 24, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.22880 ๐Ÿ”— Github: https://github.com/QDRhhhh/TexOCR ๐Ÿ“ Description: This research presents TexOCR for reconstructing scientific PDFs into compilable LaTeX, addressing limitations of current OCR. It introduces a new benchmark and trains TexOCR using reinforcement learning with verifiable rewards. #AI #DataScience #MachineLearning #Research
203
5
Type-Checked Compliance: Deterministic Guardrails for Agentic Financial Systems Using Lean 4 Theorem Proving ๐Ÿ“… Publication D
Type-Checked Compliance: Deterministic Guardrails for Agentic Financial Systems Using Lean 4 Theorem Proving ๐Ÿ“… Publication Date: Apr 1, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.01483 ๐Ÿ’ป Project Page: https://axiom.devrashie.space ๐Ÿ”— Code: https://github.com/arkanemystic/lean-agent-protocol ๐Ÿ“ Description: The Lean-Agent Protocol ensures deterministic regulatory compliance for financial AI. It uses Lean 4 theorem proving to auto-formalize policies, verifying agent actions as mathematical conjectures for cryptographic-level certainty, addressing LLM probabilistic nature. #FormalVerification #AICompliance #FinTech #Lean4 #LLMAgents
233
6
Do Audio-Visual Large Language Models Really See and Hear? ๐Ÿ“… Publication Date: Apr 3, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2
Do Audio-Visual Large Language Models Really See and Hear? ๐Ÿ“… Publication Date: Apr 3, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.02605 ๐Ÿ’ป Project Page: https://ramaneswaran.github.io/avllm_interpretability/ ๐Ÿ”— Code: https://github.com/ramaneswaran/avllm_interpretability ๐Ÿ“ Description: AVLLMs exhibit modality bias where visual representations dominate over audio cues during multimodal integration, despite audio semantics being present in intermediate layers. #AI #DataScience #MachineLearning #HuggingFace #Research
262
7
Scaling Teams or Scaling Time? Memory Enabled Lifelong Learning in LLM Multi-Agent Systems ๐Ÿ“… Publication Date: Mar 27, 2026
Scaling Teams or Scaling Time? Memory Enabled Lifelong Learning in LLM Multi-Agent Systems ๐Ÿ“… Publication Date: Mar 27, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.03295 ๐Ÿ”— Code: https://github.com/ShanglinWu/MAS_lifelong_learning ๐Ÿ“ Description: This paper introduces LLMA-Mem, a memory framework for LLM multi-agent systems. It finds that scaling is non-monotonic; optimized experience reuse allows smaller teams to outperform larger ones, improving long-term performance and reducing cost. #AI #DataScience #MachineLearning #Research
264
8
BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs ๐Ÿ“… Publication Date: Apr 2, 2026
BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs ๐Ÿ“… Publication Date: Apr 2, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.02045 ๐Ÿ”— Code: N/A ๐Ÿ“Š Models citing this paper: โ€ข https://huggingface.co/BidirLM/BidirLM-Omni-2.5B-Embedding โ€ข https://huggingface.co/BidirLM/BidirLM-0.6B-Embedding โ€ข https://huggingface.co/BidirLM/BidirLM-1.7B-Embedding ๐Ÿ—ƒ Datasets citing this paper: โ€ข https://huggingface.co/datasets/BidirLM/BidirLM-Contrastive ๐Ÿ“Description: BidirLM adapts causal LLMs into bidirectional encoders, overcoming catastrophic forgetting and integrating specialized models. It employs a prior masking phase, weight merging, and data mixture, outperforming alternatives on text, vision, and audio benchmarks. #LLM #MultimodalAI #DeepLearning #AIResearch #HuggingFace #ModelAdaptation
306
9
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents ๐Ÿ“… Publication Date: Apr 7, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pd
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents ๐Ÿ“… Publication Date: Apr 7, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.06132 ๐Ÿ’ป Project Page: https://claw-eval.github.io/ ๐Ÿ”— Code: https://github.com/claw-eval/claw-eval ๐Ÿ“ Description: Claw-Eval addresses limitations in agent benchmarks by providing comprehensive evaluation across multiple modalities with trajectory-aware grading and safety assessments. #AI #DataScience #MachineLearning #Research
296
10
Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework ๐Ÿ“… Publication Date: Apr 7, 2026 ๐Ÿ“‘ Paper:
Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework ๐Ÿ“… Publication Date: Apr 7, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.06170 ๐Ÿ’ป Project Page: https://papercircle.vercel.app/ ๐Ÿ”— Code: https://github.com/MAXNORM8650/papercircle ๐Ÿ“Description: A multi-agent system called Paper Circle is presented that automates the discovery and analysis of scientific literature through integrated retrieval and knowledge graph construction capabilities. #AI #DataScience #MachineLearning #Research
283
11
MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU ๐Ÿ“… Publication Date: Apr 6, 2026
MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU ๐Ÿ“… Publication Date: Apr 6, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.05091 ๐Ÿ”— Code: https://github.com/DLYuanGod/MegaTrain ๐Ÿ“ Description: MegaTrain trains large language models with over 100 billion parameters on a single GPU. It stores parameters in host memory and streams them to the GPU using pipelined execution and stateless layer templates to overcome bandwidth. This enables 120 billion parameter training. #AI #DataScience #MachineLearning #Research
308
12
Context-Value-Action Architecture for Value-Driven Large Language Model Agents ๐Ÿ“… Publication Date: Apr 7, 2026 ๐Ÿ“‘ Paper: htt
Context-Value-Action Architecture for Value-Driven Large Language Model Agents ๐Ÿ“… Publication Date: Apr 7, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.05939 ๐Ÿ“ Description: LLMs show rigid, polarized behavior worsening with reasoning. The Context-Value-Action CVA architecture decouples actions from reasoning using a human-data Value Verifier, mitigating polarization and improving behavioral fidelity. #AI #DataScience #MachineLearning #Research
301
13
QiMeng-PRepair: Precise Code Repair via Edit-Aware Reward Optimization ๐Ÿ“… Publication Date: Apr 7, 2026 ๐Ÿ“‘ Paper: https://arx
QiMeng-PRepair: Precise Code Repair via Edit-Aware Reward Optimization ๐Ÿ“… Publication Date: Apr 7, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.05963 ๐Ÿ”— Code: https://github.com/kcxain/QiMeng-PRepair ๐Ÿ“Š Models citing this paper: โ€ข https://huggingface.co/kcxain/Prepair-Python-7B-EA โ€ข https://huggingface.co/kcxain/Prepair-Verilog-7B-EA ๐Ÿ“ Description: PRepair tackles over-editing in AI program repair by maximizing correct code reuse. It combines controlled bug injection and edit-aware policy optimization using an edit-aware reward. This framework significantly improves repair precision and decoding throughput. #ProgramRepair #AI #MachineLearning #ReinforcementLearning #SoftwareEngineering
336
14
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning ๐Ÿ“… Publication Date: Apr 8, 2026 ๐Ÿ“‘Pa
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning ๐Ÿ“… Publication Date: Apr 8, 2026 ๐Ÿ“‘Paper: https://arxiv.org/pdf/2604.04746 ๐Ÿ”— Code: N/A ๐Ÿ“ Description: This paper introduces process-driven image generation, an iterative method with interleaved textual and visual reasoning. It decomposes synthesis into planning, drafting, reflecting, and refining steps. Dense step-wise supervision ensures consistency and interpretability of intermediate states. #ImageGeneration #GenerativeAI #ArtificialIntelligence #DeepLearning #ComputerVision
334
15
Watch Before You Answer: Learning from Visually Grounded Post-Training ๐Ÿ“… Publication Date: Apr 6, 2026 ๐Ÿ“‘ Paper: https://arx
Watch Before You Answer: Learning from Visually Grounded Post-Training ๐Ÿ“… Publication Date: Apr 6, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.05117 ๐Ÿ’ป Project Page: http://vidground.etuagi.com ๐Ÿ”— Code: https://github.com/reacher-z/vidground ๐Ÿ“ Description: VLMs struggle with video understanding due to text biases in benchmarks and training data. VidGround uses only visually grounded questions for post-training to eliminate these biases. This improves VLM performance and emphasizes the need for high-quality, visually grounded data. #VLMs #VideoUnderstanding #AI #MachineLearning #ComputerVision
359
16
Learning to Hint for Reinforcement Learning ๐Ÿ“… Publication Date: Apr 1, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.00698 ๐Ÿ”— Co
Learning to Hint for Reinforcement Learning ๐Ÿ“… Publication Date: Apr 1, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2604.00698 ๐Ÿ”— Code: https://github.com/Andree-9/HiLL ๐Ÿ“ Description: HiLL is a reinforcement learning framework that adaptively generates hints conditioned on reasoner errors to improve learning signals and transfer performance in group relative policy optimization. #AI #DataScience #MachineLearning #Research
392
17
Online Experiential Learning for Language Models ๐Ÿ“… Publication Date: Mar 17, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2603.16856
Online Experiential Learning for Language Models ๐Ÿ“… Publication Date: Mar 17, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2603.16856 ๐Ÿ’ป Project Page: https://github.com/microsoft/LMOps/tree/main/oel ๐Ÿ”— Code: https://github.com/microsoft/LMOps/tree/main/oel ๐Ÿ“ Description: Online Experiential Learning enables continuous improvement of language models through deployment experience by extracting and consolidating experiential knowledge via on-policy distillation. #AI #DataScience #MachineLearning #Research
404
18
Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding ๐Ÿ“… Publication Date: Mar 9, 20
Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding ๐Ÿ“… Publication Date: Mar 9, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2603.13366 ๐Ÿ’ป Project Page: https://mlrm-lead.github.io/ ๐Ÿ”— Code: https://github.com/mlrm-LEAD/mlrm-LEAD #AI #DataScience #MachineLearning #Research
408
19
SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering? ๐Ÿ“…Publication Date: Mar 16, 2026 ๐Ÿ“‘ Paper
SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering? ๐Ÿ“…Publication Date: Mar 16, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2603.15401 ๐Ÿ”— Code: https://github.com/GeniusHTX/SWE-Skills-Bench ๐Ÿ“ Description: Research using SWE-Skills-Bench shows agent skills offer limited benefits in real-world software engineering. Most skills yield no improvement, with an average pass-rate gain of only 1.2 percent. Only specialized skills provide meaningful gains, while some can even degrade performance. #SoftwareEngineering #AIagents #Benchmarking #AIresearch #LLM
393
20
Mixture of Style Experts for Diverse Image Stylization ๐Ÿ“… Publication Date: Mar 17, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2603
Mixture of Style Experts for Diverse Image Stylization ๐Ÿ“… Publication Date: Mar 17, 2026 ๐Ÿ“‘ Paper: https://arxiv.org/pdf/2603.16649 ๐Ÿ’ป Project Page: https://hh-lg.github.io/StyleExpert-Page/ ๐Ÿ”— Code: https://github.com/HVision-NKU/StyleExpert ๐Ÿ“Description: StyleExpert introduces a Mixture of Experts architecture for image stylization. It uses a unified style encoder and gating mechanism to handle diverse styles across semantic levels. This preserves semantics and material details better than existing methods. #AI #DataScience #MachineLearning #Research
395