
MLOps.community
Computers that Think and Take Actions for You
Jan 2, 2026
Zengyi Qin, founder of the OpenAGI Foundation, discusses technology that enables computers to think and act autonomously. He details training methods that use large-scale sandboxes and real-world applications, highlighting Lux, a model that outperforms competitors like Gemini. Zengyi explains the agent framework for efficient task management and envisions AI replacing traditional input methods in the near future. He emphasizes the importance of continuous model improvement to tackle performance challenges, paving the way for a new era of human-computer interaction.
AI Snips
Early Startup Success And Open Models
- Zengyi built MyShell as an AI agent platform while at MIT and grew it to billions of users before it IPO'd.
- His team also open-sourced voice and speech models that ranked top on GitHub.
Low-Cost From-Scratch Model Training
- Zengyi's team trained a model competitive with Llama 2 from scratch, using a layered data approach and a Mixture-of-Attention architecture.
- They report matching its performance with only about $100k of compute by training progressively from lower-quality to higher-quality data.
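As a rough illustration of what "progressively training from low- to high-quality data" could look like, here is a minimal sketch of a sampling schedule that shifts weight from a low-quality pool to a high-quality pool as training advances. The function name `mixture_weights`, the linear ramp, and the 10% and 90% endpoints are all assumptions made for this example; the episode does not describe the team's actual schedule.

```python
from __future__ import annotations


def mixture_weights(
    step: int,
    total_steps: int,
    start_high: float = 0.1,  # assumed share of high-quality data at step 0
    end_high: float = 0.9,    # assumed share of high-quality data at the end
) -> dict[str, float]:
    """Return sampling weights for two data pools at a given training step.

    Hypothetical linear ramp: early steps draw mostly from the low-quality
    pool, later steps draw mostly from the high-quality pool.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp progress to [0, 1] so out-of-range steps stay well defined.
    progress = min(max(step / total_steps, 0.0), 1.0)
    high = start_high + (end_high - start_high) * progress
    return {"low_quality": 1.0 - high, "high_quality": high}


if __name__ == "__main__":
    for s in (0, 50, 100):
        print(s, mixture_weights(s, 100))
```

A data loader could call this once per batch and sample from each pool in proportion to the returned weights.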
Screen-Level Access Unlocks New Capabilities
- Giving models direct access to the screen and controls unlocks a much larger set of capabilities than chat-only interfaces.
- Computer-use models can click, type, and drag, and can chain these actions into workflows across desktop and web apps autonomously.
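To make the idea of screen-level actions concrete, here is a minimal sketch of how clicks, typing, and dragging might be represented as data and chained into a workflow. The class names (`Click`, `TypeText`, `Drag`) and the functions `describe` and `run_workflow` are hypothetical and are not Lux's API; this version only produces a readable log rather than controlling a real screen.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class Click:
    x: int
    y: int


@dataclass
class TypeText:
    text: str


@dataclass
class Drag:
    x0: int
    y0: int
    x1: int
    y1: int


# Hypothetical action vocabulary for a computer-use agent.
Action = Union[Click, TypeText, Drag]


def describe(action: Action) -> str:
    """Render one action as a human-readable log line."""
    if isinstance(action, Click):
        return f"click at ({action.x}, {action.y})"
    if isinstance(action, TypeText):
        return f"type {action.text!r}"
    if isinstance(action, Drag):
        return f"drag from ({action.x0}, {action.y0}) to ({action.x1}, {action.y1})"
    raise TypeError(f"unknown action: {action!r}")


def run_workflow(actions: List[Action]) -> List[str]:
    """Process actions in order; a real executor would drive the OS here."""
    return [describe(a) for a in actions]


if __name__ == "__main__":
    workflow = [Click(120, 40), TypeText("quarterly report"), Drag(10, 10, 200, 150)]
    for line in run_workflow(workflow):
        print(line)
```

In a real agent, the model would emit one such action per step after looking at a screenshot, and the executor would replace the log line with an actual input event.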

