MENU

家
文章

MORE VIDEOS

探索更多

帮助

电影与动画

宠物和动物

旅游与活动

新闻与政治

操作方法和风格

非营利组织和激进主义

下一个

自动播放

India Pakistan War Explained - India Retaliates & Launches Strike on Pakistan

38 意见 . 05/09/25 Filmifly

DeepSeek R1 Explained to your grandma

3 意见 • 01/29/25

0

0

嵌入

Filmifly

1 订户

Describing the key insights from the DeepSeek R1 paper in a way even your grandma could understand. I focus on the key concepts of chain of thought reasoning, reinforcement learning, and model distillation.

Paper:
https://github.com/deepseek-ai..../DeepSeek-V3/blob/ma

Ollama link for local use:
https://ollama.com/library/deepseek-r1

0:00 Introduction
0:43 Chain of Thought
1:33 Reinforcement Learning
3:53 Group Relative Policy Optimization
6:26 Distillation

#deepseek #ai #largelanguagemodels

显示更多

0 注释

下一个

自动播放

India Pakistan War Explained - India Retaliates & Launches Strike on Pakistan

38 意见 . 05/09/25 Filmifly