cookie

We use cookies to improve your browsing experience. By clicking «Accept all», you agree to the use of cookies.

avatar

Heuristics AI

Ai research updates LLMs Reinforcement learning Deep learning GANs Stable diffusion Transformers NLP Kindly join (⁠☞⁠ ⁠ಠ⁠_⁠ಠ⁠)⁠☞ @heuristics_ai

Show more
Advertising posts
1 348
Subscribers
+124 hours
+47 days
+1530 days

Data loading in progress...

Subscriber growth rate

Data loading in progress...

Show all...
Meta Large Language Model Compiler: Foundation Models of Compiler Optimization | Research - AI at Meta

Large Language Models (LLMs) have demonstrated remarkable capabilities across a variety of software engineering and coding tasks. However, their...

CriticGPT: Finding GPT-4's mistakes with GPT-4 https://openai.com/index/finding-gpt4s-mistakes-with-gpt-4/
Show all...
Finding GPT-4’s mistakes with GPT-4

CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHF

Gemma 2: Improving Open Language Models at a Practical Size [pdf] https://storage.googleapis.com/deepmind-media/gemma/gemma-2-report.pdf
Show all...

Andrew karpathy launched its llm course https://github.com/karpathy/LLM101n @heuristics_ai
Show all...
GitHub - karpathy/LLM101n: LLM101n: Let's build a Storyteller

LLM101n: Let's build a Storyteller. Contribute to karpathy/LLM101n development by creating an account on GitHub.

1
Paper page - mDPO: Conditional Preference Optimization for Multimodal Large Language Models https://huggingface.co/papers/2406.11839
Show all...
Paper page - mDPO: Conditional Preference Optimization for Multimodal Large Language Models

Join the discussion on this paper page

Show all...
Maintaining large-scale AI capacity at Meta

Meta is currently operating many data centers with GPU training clusters across the world. Our data centers are the backbone of our operations, meticulously designed to support the scaling demands …

Creativity has left the chat: The price of debiasing language models https://arxiv.org/abs/2406.05587
Show all...
Creativity Has Left the Chat: The Price of Debiasing Language Models

Large Language Models (LLMs) have revolutionized natural language processing but can exhibit biases and may generate toxic content. While alignment techniques like Reinforcement Learning from...

Nvidia Warp: A Python framework for high performance GPU simulation and graphics https://github.com/NVIDIA/warp
Show all...
GitHub - NVIDIA/warp: A Python framework for high performance GPU simulation and graphics

A Python framework for high performance GPU simulation and graphics - NVIDIA/warp

Augmenting biological intelligence with RL in C.elegans using optogenetics [pdf] https://klab.tch.harvard.edu/publications/PDFs/gk8172.pdf
Show all...

Choose a Different Plan

Your current plan allows analytics for only 5 channels. To get more, please choose a different plan.