Gradient Dissent: Conversations on AI

Scaling LLMs and Accelerating Adoption with Aidan Gomez at Cohere

Apr 20, 2023

Chapters
1. Introduction (00:00 • 4 min)
2. The Importance of Attention in Recurrent Neural Networks (03:32 • 1 min)
3. The Importance of Attention (05:01 • 2 min)
4. The Importance of Scalability in Neural Networks (06:35 • 2 min)
5. The Future of State-Space Models (08:26 • 2 min)
6. The Importance of Saturating Compute (10:29 • 2 min)
7. Transformers: How Multi-Layer Perceptrons Work (12:24 • 2 min)
8. The Future of Large Language Models (13:58 • 2 min)
9. The Future of Language Models (15:46 • 3 min)
10. How to Fine-Tune a Model to Respond to Commands (19:13 • 2 min)
11. How to Measure Academic Dataset Performance (20:58 • 3 min)
12. The Need for a Scaling Breakthrough for Large Language Models (23:40 • 2 min)
13. The Importance of Code Generation (25:44 • 2 min)
14. Cohere's Position in the Landscape of Companies Building and Selling Large Models (27:30 • 3 min)
15. GPT-3: A Cloud-Agnostic Platform for Data Privacy (30:33 • 2 min)
16. The Importance of Open Source Models (32:41 • 3 min)
17. The Importance of Large Language Models (36:09 • 4 min)
18. The Future of Social Media (40:00 • 2 min)
19. The Future of Chat Interfaces (41:47 • 2 min)
20. The Pros and Cons of Building a Company in a Different Way (43:53 • 2 min)
21. The Future of Foundation Models (45:56 • 3 min)
22. The Importance of Sensitivity in Machine Learning (48:37 • 3 min)