In this podcast, the hosts discuss 'Activation Hacking' and 'Representation Engineering' in AI model control. They also touch upon the new Sora model from OpenAI and share practical tutorials and tips for experimentation. The conversation explores control vectors, ethical implications, and the emergence of new scripting languages in the AI field.
Activation hacking involves controlling AI models for specific responses.
Representation engineering can influence model outputs without modifying weights, enhancing adaptability.
Deep dives
Innovative Disaster Relief Project at Tree Hacks Hackathon
At the Tree Hacks hackathon, a project called Meshwork stood out for utilizing Laura, a radio device set for long-range communication. Meshwork focused on disaster relief by creating a mesh network with devices dropped in the field. These devices transmitted transcribed audio commands to a command center for assistance coordination, showcasing innovation in disaster response technology.
Exploring Novel Activation Hacking Methodologies
The podcast delved into activation hacking, a term mentioned during a conversation with Caron from NUS research. This concept involves controlling AI models to exhibit specific tones or angles in their responses. Representation engineering is proposed as an approach to influence model outputs without modifying their weights, showing promise for enhancing language models' adaptable behavior.
Announcement of Google's Open Source Language Model Gemma
Google introduced Gemma, an open source language model derived from their closed-source Gemini, ranking as the top trending language model on Hugging Face. With a focus on accessible deployment due to moderate model size, Gemma offers flexibility for fine-tuning and instruction model usage. While critics note Google's dual position in closed and open source models, Gemma's potential for broad applicability and ease of integration stand out.
Magic's AGI-inspired Code Generation Platform
Magic, a new code generation platform, promises to automate code generation tasks akin to GitHub's Co-Pilot. Emphasizing code automation as a stepping stone to Artificial General Intelligence (AGI), Magic aims to evolve its AI dev assistant capabilities towards solving increasingly complex coding challenges autonomously. This alignment with AGI aspirations signifies a shift towards leveraging code generation for advancing AI capabilities.
Recently, we briefly mentioned the concept of “Activation Hacking” in the episode with Karan from Nous Research. In this fully connected episode, Chris and Daniel dive into the details of this model control mechanism, also called “representation engineering”. Of course, they also take time to discuss the new Sora model from OpenAI.
Changelog++ members save 4 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
Neo4j – Is your code getting dragged down by JOINs and long query times? The problem might be your database…Try simplifying the complex with graphs. Stop asking relational databases to do more than they were made for. Graphs work well for use cases with lots of data connections like supply chain, fraud detection, real-time analytics, and genAI. With Neo4j, you can code in your favorite programming language and against any driver. Plus, it’s easy to integrate into your tech stack. Visit Neo4j.com/developer to get started.
Fly.io – The home of Changelog.com — Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs.