Llm Reinforcement Learning Tutorial - 搜索视频

Lecture 4 - Reinforcement Learning - Basics | Reasoning LLMs from Scratch

Lecture 4 - Reinforcement Learning - Basics | Reasoning LLMs from Scr…

已浏览 7784 次2025年4月17日

Reinforcement Learning with Verifiable Rewards - Teaching LLMs to Solve Problems

Reinforcement Learning with Verifiable Rewards - Teaching LL…

已浏览 5490 次6 个月之前

YouTubeAdam Lucek

How to finetune LLMs to THINK with Reinforcement Learning (GRPO from scratch!)

How to finetune LLMs to THINK with Reinforcement Learning (GRPO fr…

已浏览 2.6万次10 个月之前

YouTubeNeural Breakdown with AVB

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 10: RL for LLM Reasoning

Stanford CS224R Deep Reinforcement Learning | Spring 2…

已浏览 3754 次5 个月之前

YouTubeStanford Online

I Trained an LLM to Think Deeper (Here's How)

I Trained an LLM to Think Deeper (Here's How)

已浏览 1.2万次2025年2月24日

YouTubeAdam Lucek

Master LLM Training with Reinforcement Learning

Master LLM Training with Reinforcement Learning

已浏览 13 次2 周前

YouTubeGithub Signals

Reinforcement Learning for LLMs in 2025

Reinforcement Learning for LLMs in 2025

已浏览 1.6万次2025年2月10日

YouTubeTrelis Research

Reinforcement Learning with LLMs: a new era of AI agents

已浏览 3869 次3 个月之前

YouTubeShaw Talebi

[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Qu…

已浏览 11.2万次9 个月之前

YouTubeAI Engineer

Reinforcement Learning (RL) for LLMs

已浏览 1.3万次2025年3月12日

YouTubeNatasha Jaques

[UCLA RL-LLM] Chapter 3.2: Reinforcement learning with verifi…

已浏览 3640 次10 个月之前

YouTubeErnest Ryu

Reinforcement Learning: A (practical) introduction

已浏览 2783 次3 个月之前

YouTubeShaw Talebi

Stanford CS224R Deep Reinforcement Learning | Spring 2…

已浏览 3140 次5 个月之前

YouTubeStanford Online

🔥 Live Demo: Reinforcement Fine-Tuning for LLMs — Build Smarter …

已浏览 4480 次2025年4月3日

YouTubePredibase by Rubrik

Reinforcement Learning for LLM Reasoning. RL / RLHF / RLAIF.

已浏览 180 次6 个月之前

YouTubeByte Goose AI.

[UCLA RL-LLM] Chapter 0: Course outline and prologue

已浏览 1.3万次10 个月之前

YouTubeErnest Ryu

[UCLA RL-LLM] Chapter 1.1: MDP foundations, imitation learning, an…

已浏览 7758 次10 个月之前

YouTubeErnest Ryu

Lecture 19 - Applying RL in LLMs | Reinforcement Learning Phase | R…

已浏览 1954 次9 个月之前

How LLMs Are Actually Trained: Pre-Training vs. Post-Training Ex…

已浏览 5919 次8 个月之前

YouTubeSuper Data Science: ML & AI Podcast with Jon …

Why Reinforcement Learning Unlocks Reasoning in LLMs (Aha …

已浏览 2279 次4 个月之前

YouTubeAI Papers Academy

在视频中查找 01:12What is Reinforcement Learning?

Reinforcement Learning from Human Feedback (RLHF) Explained

已浏览 8.5万次2024年8月7日

YouTubeIBM Technology

Stanford CS234 Reinforcement Learning I Introduction to Reinforc…

已浏览 15.2万次2024年10月30日

YouTubeStanford Online

Reinforcement Learning with Human Feedback (RLHF) - How to train an…

已浏览 3.5万次2024年2月12日

YouTubeLuis Serrano Academy

The FASTEST introduction to Reinforcement Learning on the int…

已浏览 43.7万次2024年12月23日

在视频中查找 1:20:351 bit LLM Indepth Intuition

Fine Tuning LLM Models – Generative AI Course

已浏览 43.5万次2024年5月21日

YouTubefreeCodeCamp.org

LLM Fine-Tuning Course – From Supervised FT to RLHF, LoRA, an…

已浏览 5.7万次1 个月前

YouTubefreeCodeCamp.org

Fine-tuning LLMs on Human Feedback (RLHF + DPO)

已浏览 2.3万次2025年3月3日

YouTubeShaw Talebi

Training LLM to play chess using Deepseek GRPO reinforcement le…

已浏览 1.9万次2025年3月1日

YouTubeEfficient NLP

What is Reinforcement Fine-Tuning (RFT) - Supervised vs. RL LLM Re …

已浏览 3876 次2025年3月16日

YouTubeWhat's AI by Louis-François Bouchard

Master Reinforcement Learning With These 3 Projects

已浏览 1.4万次2024年10月17日

YouTubeAdam Lucek

观看更多视频