Episodes

  • Gemini 3.5 Flash, Composer 2.5 is a Beast, Google IO, We Live in Exciting Times | Ep 16
    May 22 2026

    Google I.O. brought major updates, but the developer community is furious. Is Gemini 3.5 Flash a massive step backward for production apps? In Episode 16 of Rate Limited, Ray, Eric, and Adam break down the massive backlash surrounding Google’s new pricing model and token consumption, why the Gemini CLI is getting aggressively sunset, and whether Cursor’s new Composer 2.5 is the absolute pinnacle of AI coding workhorses.

    We also react to Andrej Karpathy’s massive jump to Anthropic, detail a wild success story of modding Zelda into VR using Codex's Goal Mode, and discuss how to transition your mindset into the era of probabilistic, agentic engineering.

    If you are an engineer or builder navigating the frontier of AI, hit that subscribe button for deep, unfiltered technical breakdowns twice a month.



    Links:
    Ray: https://www.youtube.com/@RayFernando1337
    Eric: https://www.youtube.com/@pvncher
    Adam: https://www.youtube.com/@GosuCoder


    00:00 - Google I.O., OpenAI, & Composer 2.5
    00:45 - The Truth About Gemini 3.5 Flash & Token Guzzling
    02:08 - Is Gemini Rebranding What "Flash" Means?
    03:45 - Prompt Performance & Instruction Following in Anti-Gravity
    05:47 - Google's Profit Margins and Capacity Constraints
    06:15 - RIP Gemini CLI: Google Consolidating Compute
    07:38 - The Hidden Cost of Small Model Reasoning
    09:38 - Google's Distillation Strategy & Compute Allocation
    11:41 - Knowledge Cutoffs & Eval Degradation in Gemini 3.5
    13:40 - Anti-Gravity 2.0 Drama: Shifting Away From Consumers
    16:40 - Composer 2.5 First Impressions: A Pure Coding Workhorse
    19:15 - Building Parallel QA Agents with Browser Use
    21:32 - How Cursor Pulls Off Trillion-Parameter Speeds
    23:18 - Is Speed the Moat? Continuous Improvement RL Loops
    25:25 - The Messiness of Real-World Context vs. Static Evals
    27:13 - Tool Search vs. Preloading Context Bloat
    28:14 - Minimal Context Setups vs. Skill Maximalism
    32:00 - The Lack of Portability Between AI Harnesses
    34:25 - GPT-5.5 Low/Medium vs. Anthropic 4.7 Latency
    38:10 - Tool Tinkering vs. Grabbing Off-The-Shelf Tech
    39:11 - The Team's Current Model Stack (Vibe Check)
    42:11 - Codex App: Mind-Blowing Background Computer Use
    44:39 - Goal Mode vs. High-Level Orchestration Loops
    48:23 - Success Story: Modding Zelda into VR Using Goal Mode
    50:33 - Andrej Karpathy Joins Anthropic: Why Now?
    52:30 - The Value of Co-location & The Bay Area Sparkle
    55:00 - Impact Over Upside: Technologists Shaping Green Spaces
    56:09 - The Mindset Shift: Transitioning to Agentic Engineering
    58:30 - Auditing Your Threads & Finding Inefficiencies
    01:00:27 - AI Automation as "Mana" for Your Life

    Show More Show Less
    1 hr and 2 mins
  • GPT 5.5 is a coding BEAST, developing agents, and RIP Jobs | Ep 15
    May 8 2026

    This episode explores the latest developments in AI models like GPT 5.5, their impact on workflows, trust, and the future of agent development. Featuring insights from industry experts, it covers model performance, rate limits, AI in enterprise, and practical tips for building effective AI agents.


    Links:
    Ray: https://www.youtube.com/@RayFernando1337
    Eric: https://www.youtube.com/@pvncher
    Adam: https://www.youtube.com/@GosuCoder

    Chapters

    00:00 Introduction to the Rate Limited Podcast
    00:45 Exploring GPT 5.5: Features and User Experiences
    04:28 Switching from Anthropic to GPT 5.5: User Insights
    07:17 Trust and Performance: Comparing Models
    10:45 Improving Code Quality with GPT 5.5
    17:48 Utilizing Goal Mode for Long-Term Tasks
    21:26 Anthropic and SpaceX: A New Partnership
    25:32 The Future of AI Automation on Mac
    30:51 The Impact of AI on Layoffs
    36:35 Navigating the AI Landscape
    41:45 The Role of Engineers in AI Development
    49:23 Creating Engaging AI Agents
    55:03 The Future of Agent Development

    Show More Show Less
    1 hr
  • Opus 4.7 Feels Weird? Claude Design is Amazing & Cursor? | Ep 14
    Apr 24 2026

    This episode explores the latest AI model updates, including Opus 4.7, Kimi 2.6, and the evolving landscape of AI in design, coding, and enterprise applications. The hosts discuss model performance, industry trends, and strategic moves like SpaceX's partnership with Cursor, providing insights into the future of AI development.

    Links:
    Ray: https://www.youtube.com/@RayFernando1337
    Eric: https://www.youtube.com/@pvncher
    Adam: https://www.youtube.com/@GosuCoder

    Chapters

    00:00 Introduction to the Podcast and Recent Developments
    02:49 Exploring Opus 4.7: Impressions and Comparisons
    05:58 The Art of Prompting: Strategies for Effective Use
    08:57 Design Innovations: Claude Design and Its Impact
    12:03 The Future of Design: AI's Role and Implications
    15:00 Kimi 2.6: Performance and Comparisons with Other Models
    22:17 Exploring Composer's Efficiency
    25:19 Kimi 2.6 Performance Insights
    30:44 The Impact of Tokenization on AI Models
    33:15 Cursor and SpaceX: A Strategic Partnership
    40:20 The Future of AI Coding Companies
    46:24 The End of RooCode: Reflections on Community
    46:32 The Evolution of AI Coding Practices
    50:15 Navigating AI Engineering Workflows
    56:24 Orchestrating AI Agents for Efficiency
    01:02:00 The Future of AI in Development


    Show More Show Less
    1 hr and 8 mins
  • Claude Code Leak! Rate Limits keep changing, and Building Agentic Systems | Ep 13
    Apr 4 2026

    This episode covers recent developments in AI, including source code leaks, rate limit changes, and the future of agentic AI systems. Experts share insights on managing AI projects, building reliable agents, and navigating the evolving AI landscape.

    Links:
    Ray: https://www.youtube.com/@RayFernando1337
    Eric: https://www.youtube.com/@pvncher
    Adam: https://www.youtube.com/@GosuCoder

    Chapters
    00:00 Introduction to Rate Limits and Coding Challenges
    01:36 The Cloud Code Source Leak Incident
    07:05 Rate Limits and User Experience
    11:59 Choosing the Right AI Tools for Coding
    18:02 Building Agentic UIs and Architecture
    22:34 Exploring Cloud Dispatch and Its Impact
    31:07 Designing Agents Beyond Coding
    40:22 Building Effective Agents for Business

    Show More Show Less
    50 mins
  • GPT 5.4, NVIDIA GTC, AI Impact on the Job Market | Ep 12
    Mar 22 2026

    This episode covers the latest in AI model releases, hardware advancements from NVIDIA at GTC, and the evolving landscape of AI's impact on jobs and software development. Experts share insights on GPT-5.4, inference hardware, and the future of AI-driven workflows.

    Links:
    Ray: https://www.youtube.com/@RayFernando1337
    Eric: https://www.youtube.com/@pvncher
    Adam: https://www.youtube.com/@GosuCoder


    Chapters

    00:00 Introduction to the Rate Limited Podcast
    02:57 Exploring GPT 5.4: Features and Improvements
    06:08 The Role of Planning in AI Coding
    08:59 Context Management in AI Models
    12:10 NVIDIA GTC Conference Insights
    15:01 The Future of AI Inference and Hardware
    17:52 DLSS 5: AI in Gaming Graphics
    24:52 The Future of AI in Gaming and Film
    25:58 Understanding Open-Claw Strategy
    27:45 The Rise of Personal Agents
    28:32 The Changing Landscape of Software Development
    31:18 Craftsmanship vs. Automation in Software Engineering
    36:07 Job Displacement and the Future of Work
    41:10 Optimism in the Age of AI
    50:29 Skills and Context Management in AI
    54:00 The Future of AI Interaction

    Show More Show Less
    56 mins
  • New Models! Gemini 3.1, Composer 5.1, Code Disposability, Reducing AI Slop | Ep 11
    Feb 28 2026

    This episode covers the latest developments in AI models from Google, Anthropic, and others, exploring their capabilities, limitations, and implications for developers and the industry. The hosts share insights on model stability, speed, safety, and the future of AI in coding and business.

    • Google Gemini 3.1 updates and stability issues
    • The impact of model speed and inference hardware
    • Model distillation and intellectual property concerns
    • The role of AI in software engineering and code quality
    • implications of AI model development

    Chapters

    00:00
    Introduction to AI Models and Recent Developments

    01:05
    Exploring Google's Gemini 3.1 and User Experiences

    04:52
    Strengths and Weaknesses of AI Models in Development

    09:37
    Cerebris and Spark: Speed vs. Context in AI Models

    17:29
    Anthropic's Claims and Geopolitical Implications in AI

    27:37
    The Future of AI Development and Safety Concerns

    28:06
    The Evolution of AI in Code Generation

    29:02
    Managing Automated Code in Large Companies

    30:49
    The Disposability Principle in Code Design

    32:37
    Balancing Code Stability and Disposability

    34:29
    Navigating Complexity in AI-Generated Code

    36:55
    The Role of AI in Code Review and Development

    40:40
    The Future of AI and Human Collaboration in Coding

    45:45
    The Changing Landscape of Software Engineering

    50:04
    The Demand for Software Engineers in an AI World

    Show More Show Less
    59 mins
  • Opus 4.6, Codex 5.3, AI Coding is Addictive, AI personal assistants | Ep 10
    Feb 14 2026

    In this episode of the Rate Limited podcast, hosts Ray Fernando, Adam Larson, and Eric discuss the latest AI model releases, including Opus 4.6 and Codex 5.3. They explore the implications of these advancements on coding practices, the emergence of swarm intelligence, and the challenges of managing multiple AI agents. The conversation also touches on the addictive nature of AI coding, the impact of AI on work-life balance, and the potential rise of AI spam in communication. The hosts emphasize the importance of taking breaks and maintaining a healthy relationship with technology as they navigate this rapidly evolving landscape.

    Links:
    Ray: https://www.youtube.com/@RayFernando1337
    Eric: https://www.youtube.com/@pvncher
    Adam: https://www.youtube.com/@GosuCoder

    Show More Show Less
    55 mins
  • Agent Swarms are here, Kimi K2.5, Security holes in Clawdbot, and more | Episode 9
    Jan 30 2026

    In this episode of the Rate Limited Podcast, hosts Ray, Adam, and Eric discuss the latest advancements in AI models and tools, including Codex, Opus, and Kimi K 2.5. They explore the intricacies of orchestration, task management, and the importance of security in AI applications. The conversation delves into the practical applications of these models in development and personal assistance, highlighting the balance between utility and risk. The hosts share their experiences and insights on using these tools effectively while navigating the evolving landscape of AI technology.

    Links:
    Ray: https://www.youtube.com/@RayFernando1337
    Eric: https://www.youtube.com/@pvncher
    Adam: https://www.youtube.com/@GosuCoder

    Show More Show Less
    54 mins