Newsflash: Pierer invests Millions in Hochreiter‘s XLSTM
Feb 5, 2024
auto_awesome
Sepp Hochreiter, a researcher in LLM alternative architecture, discusses the advancement of XLSTM, the significance of context length in language models, the exploration of XLSTM potential and IP rights in Europe, and the evaluation of methods and abstract reasoning. The potential applications of large language models for companies are also explored.
XLSTM offers advantages over transformer models in terms of accuracy, power consumption, and abstract reasoning, making it ideal for complex tasks like coding and logic.
The founders of NXI have invested a significant amount of money in training XLSTM models using over 1000 GPUs and plan to explore revenue streams such as licensing the technology and developing their own products.
Deep dives
Founding of NXI and Focus on XLSTM
In this podcast episode, Professor Dr. Dezapur Hader and Albert discuss the founding of their company NXI, which aims to advance the XLSTM idea. XLSTM is a linear approach to language models that seeks to outperform transformer models in terms of speed and efficiency. The company has secured funding to test and compare XLSTM with existing language models, focusing on scaling laws and smaller vs. larger models. The ultimate goal is to develop a large language model based on XLSTM that can be brought to the market.
Benefits and Potential Applications of XLSTM
XLSTM offers several advantages over transformer models. It can predict the next word with greater accuracy and is more efficient in terms of power consumption and cost. The XLSTM approach leverages memory and enables abstract reasoning, making it ideal for complex tasks like coding and logic. Additionally, the team aims to explore the potential of modifying memory to align the language model with specific user needs. By using XLSTM, companies could have access to a large language model that effectively serves as a repository of their organization's knowledge, improving productivity and decision-making.
Investment and Future Plans
The founders of NXI have invested a significant amount of money and are currently training XLSTM models using over 1000 GPUs. They plan to conduct comparisons with other models by the end of March and continue training on larger datasets. While the focus is currently on research and development, the company intends to explore various revenue streams, including licensing the technology and developing their own products. They aim to keep the intellectual property in Europe and establish partnerships with industries to integrate XLSTM technology into vertical applications.
Pierer invests many millions in Sepp Hochreiter's LLM alternative architecture. Shortly before Christmas, the company NXAI was founded in Linz with Albert Ortig as Managing Director. Its task: to market Hochreiter's XLSTM.
Thanks for listening. We welcome suggestions for topics, criticism and a few stars on Apple, Spotify and Co.