Scalable Extraction of Training Data from (Production) Language Models (Paper Explained) Published 2023-12-03 Download video MP4 360p Recommendations 1:02:17 RWKV: Reinventing RNNs for the Transformer Era (Paper Explained) 50:03 V-JEPA: Revisiting Feature Prediction for Learning Visual Representations from Video (Explained) 29:29 Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Full Paper Review) 04:19 Double Descent explained by Yann LeCun 35:27 AlphaGeometry: Solving olympiad geometry without human demonstrations (Paper Explained) 28:26 Retentive Network: A Successor to Transformer for Large Language Models (Paper Explained) 40:40 Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Paper Explained) 46:45 Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained) 1:04:30 GPT-3: Language Models are Few-Shot Learners (Paper Explained) 45:44 What is Q-Learning (back to basics) 31:45 LLaMA Pro: Progressive LLaMA with Block Expansion (Paper Explained) 59:48 [1hr Talk] Intro to Large Language Models 53:07 Reinforced Self-Training (ReST) for Language Modeling (Paper Explained) 24:34 Scaling Transformer to 1M tokens and beyond with RMT (Paper Explained) 32:27 Efficient Streaming Language Models with Attention Sinks (Paper Explained) 38:24 Proximal Policy Optimization (PPO) - How to train Large Language Models 54:24 Lumiere: A Space-Time Diffusion Model for Video Generation (Paper Explained) Similar videos 06:17 Industrial-scale Web Scraping with AI & Proxy Networks 15:16 OpenAI Insights and Training Data Shenanigans - 7 'Complicated' Developments + Guest Star 00:28 The HARDEST part about programming 🤦♂️ #code #programming #technology #tech #software #developer 00:46 Day in My Life as a Quantum Computing Engineer! 35:24 Automated and Explainable Deep Learning for Clinical Language Understanding at Roche 1:07:07 Lecture 03 - Pruning and Sparsity (Part I) | MIT 6.S965 18:17 EfficientNet! - Keras Code Examples 10:47 Convolutional Neural Networks Explained (CNN Visualized) More results