DeepSeek R1 Explained to your grandma
3 بازدیدها
• 01/29/25
0
0
جاسازی کنید
Filmifly
1 مشترکین
Describing the key insights from the DeepSeek R1 paper in a way even your grandma could understand. I focus on the key concepts of chain of thought reasoning, reinforcement learning, and model distillation.
Paper:
https://github.com/deepseek-ai..../DeepSeek-V3/blob/ma
Ollama link for local use:
https://ollama.com/library/deepseek-r1
0:00 Introduction
0:43 Chain of Thought
1:33 Reinforcement Learning
3:53 Group Relative Policy Optimization
6:26 Distillation
#deepseek #ai #largelanguagemodels
بیشتر نشان بده، اطلاعات بیشتر
SORT BY-
نظرات برتر
-
آخرین نظرات