AI Year in Review 2025 (1): DeepSeek Shock, Stargate Project and the Battle for AI Dominance – Daniel Höpfner & Philipp Müller

Startup Insider

Reinforcement Learning and Advances in Reasoning

Philipp explains reward-based training approaches, the finer points of RL, and their influence on reasoning.

Play episode from 23:10
