DeepSeek R1 Explained to your grandma
01/29/25
               
            
                     
            
 
                                                                        
                                                            
            
                        
            
            
               Filmifly 
                                 
            
         
                  
An explanation of the key insights from the DeepSeek R1 paper in a way even your grandma could understand. I focus on three key concepts: chain-of-thought reasoning, reinforcement learning, and model distillation.
 
Paper: 
https://github.com/deepseek-ai..../DeepSeek-V3/blob/ma 
 
Ollama link for local use: 
https://ollama.com/library/deepseek-r1 
 
0:00 Introduction 
0:43 Chain of Thought 
1:33 Reinforcement Learning 
3:53 Group Relative Policy Optimization 
6:26 Distillation 
 
#deepseek #ai #largelanguagemodels