Data science research papers

Відкрити в Telegram

Machine learning and data science research papers Key ML and AI papers with code and GitHub repos. Simple way to follow current research. Join 👉 https://rebrand.ly/bigdatachannels DMCA: @disclosure_bds Contact: @mldatascientist

Сітка:Programming, data science, ML - free courses by Big Data Specialist США8 040 Освіта43 663

3 012

Підписники

+324 години

+177 днів

+7930 день

355

Перегляди допису

~ 6924 години

~ 9648 годин

11.79%

Коефіцієнт залучення

Немає даних

Дописів на день

Ads index

beta

Архів дописів

3 011

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions 📅 Publication Date: Jun 22, 2026 📑 Paper: https://arxiv.org/pdf/2606.23654.pdf 🔗 Code: https://github.com/huggingface 📝 Description: EnterpriseClawBench presents a benchmark for enterprise agents based on real-world sessions with 852 reproducible tasks, emphasizing comprehensive evaluation metrics beyond single performance scores.

3 011

Heterogeneous Scientific Foundation Model Collaboration 📅 Publication Date: Apr 30, 2026 📑 Paper: https://arxiv.org/pdf/2604.27351.pdf 🔗 Code: https://github.com/Violet24K/Eywa 📝 Description: Eywa is a heterogeneous agentic framework that extends language-centric systems to scientific foundation models by integrating domain-specific models with language-based reasoning interfaces for improved performance across diverse scientific domains.

3 011

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents 📅 Publication Date: May 6, 2026 📑 Paper: https://arxiv.org/pdf/2605.05185.pdf 🔗 Code: https://github.com/shawn0728/OpenSearch-VL 📝 Description: OpenSearch-VL presents an open-source framework for training advanced multimodal search agents using reinforcement learning, featuring specialized data curation, diverse tool environments, and a novel training algorithm that improves performance across multiple benchmarks.

3 011

WorldOlympiad: Can Your World Model Survive a Triathlon? 📅 Publication Date: Jun 9, 2026 📑 Paper: https://arxiv.org/pdf/2606.11129 💻 Project Page: https://alibaba-damo-academy.github.io/WorldOlympiad/ 📝 Description: The paper introduces WorldOlympiad, a comprehensive benchmark for evaluating video-based world models. The problem with current generative models is that they often focus on visual quality, but lack physical faithfulness, geometric consistency, and interaction fidelity. To address this gap, WorldOlympiad decomposes world-model evaluation into three dimensions: physical faithfulness, geometric consistency, and interaction fidelity. WorldOlympiad covers three major downstream scenarios, including gaming, robotics, and general real-world videos, capturing diverse challenges from interactive control and embodied manipulation to open-domain motion and camera dynamics. #WorldModelEvaluation #VideoBasedWorldModels #PhysicalFaithfulness #GeometricConsistency

3 011

Qwen-AgentWorld: Language World Models for General Agents 📅 Publication Date: Jun 23, 2026 📑 Paper: https://arxiv.org/pdf/2606.24597.pdf 🔗 Code: https://github.com/huggingface 📝 Description: Language-based world models enable agentic environment simulation across multiple domains and enhance general agent performance through scalable simulation and improved downstream task performance.

3 011

HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation 📅 Publication Date: Apr 30, 2026 📑 Paper: https://arxiv.org/pdf/2604.28196.pdf 🔗 Code: https://github.com/H-EmbodVis/HERMESV2 📝 Description: HERMES++ combines 3D scene understanding and future geometry prediction through BEV representation, LLM-enhanced queries, temporal linking, and joint geometric optimization for autonomous driving applications.

3 011

ABot-Earth 0.5: Generative 3D Earth Model 📅 Publication Date: Jun 8, 2026 📑 Paper: https://arxiv.org/pdf/2606.09967 💻 Project Page: https://abot-earth.amap.com/ 📝 Description: The paper presents ABot-Earth 0.5, a generative framework that creates realistic 3D environments from satellite imagery. The problem addressed is the need for large-scale 3D reconstruction, which is currently expensive and technically challenging. The authors propose a novel generative model based on 3D Gaussian Splatting representation, which is trained on a diverse set of real-world urban reconstructions. This model learns to generate realistic geometry and textures, and can synthesize novel 3D scenes conditioned solely on satellite imagery in under 10 minutes per square kilometer. #Generative3DModeling #3DGaussianSplatting #SatelliteImageryReconstruction #GeospatialModeling

3 011

OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories 📅 Publication Date: May 5, 2026 📑 Paper: https://arxiv.org/pdf/2605.04036.pdf 🔗 Code: https://github.com/PolarSeeker/OpenSeeker 📝 Description: A simple supervised fine-tuning approach achieves state-of-the-art performance in deep search capabilities using minimal data, outperforming complex industrial pipelines and demonstrating the effectiveness of academic-led development in large language model agents.

3 011

World Model for Robot Learning: A Comprehensive Survey 📅 Publication Date: Apr 30, 2026 📑 Paper: https://arxiv.org/pdf/2605.00080 💻 Project Page: https://ntumars.github.io/wm-robot-survey/ 🔗 Code: https://github.com/NTUMARS/Awesome-World-Model-for-Robotics-Policy 📝 Description: The paper provides a comprehensive survey of world models for robot learning, which are predictive representations of environmental dynamics that support policy learning, planning, and simulation. The authors note that the literature on world models is fragmented across different architectures, functional roles, and application domains, making it difficult to understand the current state of the field. To address this gap, the authors present a systematic review of world models from a robot learning perspective, examining how they are coupled with robot policies, used as learned simulators for reinforcement learning and evaluation, and have progressed in terms of robotic video world models. #RobotLearning #RobotPolicies

3 011

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond 📅 Publication Date: Apr 24, 2026 📑 Paper: https://arxiv.org/pdf/2604.22748.pdf 🔗 Code: https://github.com/matrix-agent/awesome-agentic-world-modeling 📝 Description: World models are categorized into three capability levels and four law regimes to better understand and develop predictive environment models for AI agents across diverse domains.

3 011

Asymmetric Flow Models 📅 Publication Date: May 13, 2026 📑 Paper: https://arxiv.org/pdf/2605.12964 💻 Project Page: https://hanshengchen.com/asymflow/ 🔗 Code: https://github.com/Lakonik/LakonLab ⭐️ 324 📊 Models citing this paper: • https://huggingface.co/Lakonik/AsymFLUX.2-klein-9B • https://huggingface.co/Lakonik/AsymFlow-ImageNet • https://huggingface.co/OJ-1/AsymFLUX.2-klein-9B 📝 Description: The paper introduces Asymmetric Flow Modeling, a method for efficient high-dimensional flow-based generation. The problem with existing flow-based generation methods is that they require modeling high-dimensional noise, which is difficult even when the data has a strong low-rank structure. To address this, the authors propose a rank-asymmetric velocity parameterization that restricts noise prediction to a low-rank subspace while keeping data prediction full-dimensional. #AsymmetricFlowModels #FlowBasedGeneration #RankAsymmetricVelocity

3 011

From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company 📅 Publication Date: Apr 24, 2026 📑 Paper: https://arxiv.org/pdf/2604.22446.pdf 🔗 Code: https://github.com/1mancompany/OneManCompany 📝 Description: OneManCompany (OMC) introduces an organizational framework for multi-agent systems that enables dynamic team assembly, governance, and improvement through portable agent identities and hierarchical decision-making processes.

3 011

📢 Advertising in this channel You can place an ad via Telega․io. It takes just a few minutes. Formats and current rates: View details

3 011

🔥 Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning 📅 Publication Date: Jun 9, 2026 📑 Paper: https://arxiv.org/pdf/2606.11087 💻Project Page: https://q-guided-flow.github.io/ 📝 Description: The paper proposes a reinforcement learning algorithm called QGF that improves policies at test time by using a value gradient to guide a pre-trained flow policy. The problem addressed is that incorporating flow models into reinforcement learning pipelines for policy improvement can be difficult due to stability and scalability issues. The method involves pre-training a reference flow policy and a value function critic, then using the value gradient to guide the reference policy to generate higher-value actions at test time, without any additional policy learning. #ReinforcementLearningAlgorithms #TestTimePolicyImprovement #QGFAlgorithm

3 011

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling 📅 Publication Date: May 8, 2026 📑 Paper: https://arxiv.org/pdf/2605.08083 💻 Project Page: https://zhengkid.github.io/AutoTTS-web/ 🔗 Code: https://github.com/zhengkid/AutoTTS 📝 Description: The paper proposes a novel approach to improve the performance of large language models through test-time scaling, which involves allocating additional computation during inference. Existing test-time scaling strategies are typically hand-crafted, relying on manual design and tuning of reasoning patterns and heuristics. This approach leaves much of the computation-allocation space unexplored, resulting in potential inefficiencies. To address this limitation, the authors introduce AutoTTS, an environment-driven framework that automates the discovery of test-time scaling strategies. Instead of designing individual strategies, researchers can create environments where optimal strategies can be discovered automatically. #LargeLanguageModels

3 011

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation 📅 Publication Date: Apr 27, 2026 📑 Paper: https://arxiv.org/pdf/2604.24764.pdf 🔗 Code: https://github.com/microsoft/World-R1 📝 Description: World-R1 framework improves video generation by incorporating 3D constraints through reinforcement learning and specialized text datasets while maintaining visual quality and scalability.

3 011

Code as Agent Harness 📅 Publication Date: May 18, 2026 📑 Paper: https://arxiv.org/pdf/2605.18747 🔗 Code: N/A 📝 Description: The paper discusses the concept of code as agent harness, where large language models are used as operational substrates for agent reasoning and execution in agentic systems. The authors argue that code is no longer just a target output, but serves as a unified infrastructure layer across multiple domains and applications. They introduce a unified view that centers code as the basis for agent infrastructure, and organize their survey around three connected layers: the harness interface, harness mechanisms, and scaling the harness. #AgenticSystems #LargeLanguageModels #AgentReasoning

3 011

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration 📅 Publication Date: May 19, 2026 📑 Paper: https://arxiv.org/pdf/2605.20025 💻 Project Page: https://github.com/aiming-lab/AutoResearchClaw 🔗 Code: https://github.com/huggingface 🗃Datasets citing this paper: • https://huggingface.co/datasets/AIMING-Lab-UNC/ARC-Bench 📝Description: AutoResearchClaw is a new autonomous research system that improves scientific discovery by incorporating human collaboration and iterative learning. The problem with existing autonomous research systems is that they often model the research process as a linear pipeline, relying on single agent reasoning and stopping when execution fails, without carrying experience across runs. #AutonomousResearchSystems #MultiAgentLearning #SelfReinforcingSystems

3 011

AgentSearchBench: A Benchmark for AI Agent Search in the Wild 📅 Publication Date: Apr 24, 2026 📑 Paper: https://arxiv.org/pdf/2604.22436 🔗 Code: N/A 📝 Description: AgentSearchBench is a new benchmark for finding suitable AI agents using execution-grounded performance signals from nearly 10,000 real-world agents. It shows that description-based similarity is insufficient, and lightweight behavioral signals significantly improve agent ranking. #AI #AIAgents #Benchmarking #AgentSearch #MachineLearning

3 011

Omnilingual MT: Machine Translation for 1,600 Languages 📅 Publication Date: Mar 17, 2026 📑 Paper: https://arxiv.org/pdf/2603.16309 🗃 Datasets citing this paper: https://huggingface.co/datasets/facebook/bouquet 🔗 Code: N/A 📝 Description: Omnilingual MT OMT is the first system to support over 1,600 languages. It uses specialized smaller LLMs 1B-8B to outperform 70B baselines, achieving high-quality translation and coherent generation in low-compute settings. #AI #DataScience #MachineLearning #HuggingFace #Research