

The Nonlinear Library
The Nonlinear Fund
The Nonlinear Library allows you to easily listen to top EA and rationalist content on your podcast player. We use text-to-speech software to create an automatically updating repository of audio content from the EA Forum, Alignment Forum, LessWrong, and other EA blogs. To find out more, please visit us at nonlinear.org
Episodes

Aug 17, 2024 • 7min
AF - Calendar feature geometry in GPT-2 layer 8 residual stream SAEs by Patrick Leask
 Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Calendar feature geometry in GPT-2 layer 8 residual stream SAEs, published by Patrick Leask on August 17, 2024 on The AI Alignment Forum.
TL;DR: We demonstrate that the decoder directions of GPT-2 SAEs are highly structured: we find a historical date direction such that projecting non-date-related features onto it lets us read off their historical time period by comparison to year features.
Calendar years are linear: there are as many years between 2000 and 2024 as between 1800 and 1824. Linear probes can be used to predict years of particular events from the activations of language models. Since calendar years are linear, one might think the same of other time-based features such as weekday features; however, weekday activations in sparse autoencoders (SAEs) were recently found to be arranged in a circular configuration in their top principal components.
Inspired by this, we looked into weekdays, months, and most interestingly calendar years from the perspective of SAE feature decoder similarity.
For each group of calendar features, we found interesting patterns of feature splitting between sparse autoencoders of different sizes. For calendar years, we found a timeline direction that meaningfully ordered events, individuals, and concepts with respect to their historical period, which furthermore does not correspond to a principal component of the decoder directions. Finally, we introduce a simple method for finding some of these interpretable directions.
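The excerpt mentions a "simple method" for finding interpretable directions but does not spell it out here, so the following is only a rough sketch of one way such a timeline direction could be recovered; the array names and the least-squares approach are assumptions for illustration, not necessarily the authors' method.

```python
# Minimal sketch, assuming `decoder_dirs` is a (n_year_features, d_model) array of
# decoder vectors for single-year features and `years` the calendar year each fires on.
import numpy as np

def timeline_direction(decoder_dirs: np.ndarray, years: np.ndarray) -> np.ndarray:
    """Least-squares direction whose dot product with a year feature's decoder
    vector predicts its calendar year."""
    X = decoder_dirs - decoder_dirs.mean(axis=0)
    y = years - years.mean()
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    return w / np.linalg.norm(w)

def historical_period_score(feature_dirs: np.ndarray, direction: np.ndarray) -> np.ndarray:
    """Project arbitrary (e.g. non-date) feature decoder vectors onto the timeline."""
    return feature_dirs @ direction
```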
Features at different scales
We started by replicating the weekday results by performing PCA on the decoder directions of features that had high activations when prompted with days of the week, using the same GPT-2 SAEs as in this post, ranging from 768 to 98304 features. In the 768 feature SAE, we found a single weekday feature that activated strongly on all days of the week.
In the largest SAE, we found 10 weekday features, 3 of which activated on all days of the week, with the remaining 7 activating on a single day of the week each.
We found a group of features that activate primarily on specific days of the week by taking the top 20 activating samples for each feature and checking that the max activating token in each of these samples was the specific weekday. We found the first two principal components for this set of features, and projected the features that activate on any day or number of days from all SAEs onto these directions.
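As a rough sketch of that replication step (the function, array names, and shapes are assumptions, not the authors' code):

```python
# Minimal sketch: fit two principal components on single-day weekday feature decoder
# vectors, then project weekday-related features from every SAE size onto them.
import numpy as np
from sklearn.decomposition import PCA

def weekday_pca_projection(single_day_dirs: np.ndarray,
                           all_weekday_dirs: np.ndarray) -> np.ndarray:
    """single_day_dirs: (n_single_day_features, d_model) decoder vectors.
    all_weekday_dirs: (n_weekday_features, d_model) decoder vectors from all SAEs.
    Returns the 2-D coordinates used for the scatter plot described above."""
    pca = PCA(n_components=2)
    pca.fit(single_day_dirs)
    return pca.transform(all_weekday_dirs)
```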
The labeled features are those that activate on a single day across all SAEs, with the multi-day features unlabeled to maintain legibility.
The smallest SAE (blue) has a single feature that activates on all weekday tokens, and lies near the mean of all the weekday features. The largest SAEs learn features for each day of the week, plus additional multi-day features. Across SAE sizes, the single day features form clusters.
In each of these examples, the smallest SAE has a single feature that splits into many specific features of roughly the same importance. With calendar years, however, the situation is more complex. The same method of finding the principal components for single-year features between 1900 and 2020 only yields a few 21st-century features, and none from the 20th century.
There is also a group of single year features in a smaller SAE in the center of the plot, suggesting these principal components do not explain variance in them.
The plot below shows the years for which each of the features is active, with the x-axis being years from 1950 to 2020, the y-axis being separate features, and the colored bars indicating the periods of year for which that feature is active. Only in the largest SAEs do you see more than a few single calendar year features, with most of the features activating on ranges of years, or other patterns such as the start and end... 

Aug 16, 2024 • 14min
EA - The Tech Industry is the Biggest Blocker to Meaningful AI Safety Regulations by Garrison
 Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: The Tech Industry is the Biggest Blocker to Meaningful AI Safety Regulations, published by Garrison on August 16, 2024 on The Effective Altruism Forum.
If you enjoy this, please consider subscribing to my Substack.
My latest reporting went up in The Nation yesterday:
It's about the tech industry's meltdown in response to SB 1047, a California bill that would be the country's first significant attempt to mandate safety measures from developers of AI models more powerful and expensive than any yet known.
Rather than summarize that story, I've added context from some past reporting as well as new reporting on two big updates from yesterday: a congressional letter asking Newsom to veto the bill and a slate of amendments.
The real AI divide
After spending months on my January cover story in Jacobin on the AI existential risk debates, one of my strongest conclusions was that the AI ethics crowd (focused on the tech's immediate harms) and the x-risk crowd (focused on speculative, extreme risks) should recognize their shared interests in the face of a much more powerful enemy - the tech industry:
According to one estimate, the amount of money moving into AI safety start-ups and nonprofits quadrupled between 2020 and 2022, reaching $144 million. It's difficult to find an equivalent figure for the AI ethics community. However, civil society from either camp is dwarfed by industry spending. In just the first quarter of 2023, OpenSecrets reported roughly $94 million was spent on AI lobbying in the United States. LobbyControl estimated tech firms spent €113 million this year lobbying the EU, and we'll recall that hundreds of billions of dollars are being invested in the AI industry as we speak.
And here's how I ended that story:
The debate playing out in the public square may lead you to believe that we have to choose between addressing AI's immediate harms and its inherently speculative existential risks. And there are certainly trade-offs that require careful consideration.
But when you look at the material forces at play, a different picture emerges: in one corner are trillion-dollar companies trying to make AI models more powerful and profitable; in another, you find civil society groups trying to make AI reflect values that routinely clash with profit maximization.
In short, it's capitalism versus humanity.
This was true at the time I published it, but honestly, it felt like momentum was on the side of the AI safety crowd, despite its huge structural disadvantages (industry has way more money and armies of seasoned lobbyists).
Since then, it's become increasingly clear that meaningful federal AI safety regulations aren't happening any time soon. Republican House Majority Leader Steve Scalise promised as much in June. But it turns out Democrats would also likely have blocked any national, binding AI safety legislation.
The congressional letter
Yesterday, eight Democratic California Members of Congress published a letter to Gavin Newsom, asking him to veto SB 1047 if it passes the state Assembly. There are serious problems with basically every part of this letter, which I picked apart here. (Spoiler: it's full of industry talking points repackaged under congressional letterhead.)
Many of the signers took lots of money from tech, so it shouldn't come as too much of a surprise. I'm most disappointed to see that Silicon Valley Representative Ro Khanna is one of the signatories. Khanna had stood out to me positively in the past (like when he Skyped into The Intercept's five-year anniversary party).
The top signatory is Zoe Lofgren, who I wrote about in The Nation story:
SB 1047 has also acquired powerful enemies on Capitol Hill. The most dangerous might be Zoe Lofgren, the ranking Democrat in the House Committee on Science, Space, and Technology. Lofgren, whose district covers much of ... 

Aug 16, 2024 • 5min
LW - Investigating the Chart of the Century: Why is food so expensive? by Maxwell Tabarrok
 Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Investigating the Chart of the Century: Why is food so expensive?, published by Maxwell Tabarrok on August 16, 2024 on LessWrong.
You've probably seen this chart from Mark Perry at the American Enterprise Institute.
I've seen this chart dozens of times and have always enjoyed how many different and important stories it can tell.
There is a story of the incredible abundance offered by technological growth and globalization. Compared to average hourly wages, cars, furniture, clothing, internet access, software, toys, and TVs have become far more accessible than they were 20 years ago. Flatscreens and Fiats that were once luxuries are now commodities.
There is also a story of sclerosis and stagnation. Sure, lots of frivolous consumer goods have gotten cheaper, but healthcare, housing, childcare, and education, all the important stuff, have exploded in price. Part of this is "cost disease," where the high productivity of labor in advancing industries like software raises the cost of labor in industries with slower productivity growth, like healthcare. Another part is surely the near-universal "restrict supply and subsidize demand" strategy that governments undertake when regulating an industry: zoning laws + Prop 13 in housing, occupational licensing and the FDA + Medicare in healthcare, and free student debt + all of the above for higher ed.
One story I've never heard and only recently noticed is that "Food and Beverages" has inflated just as much as Housing in this graph. This is extremely counterintuitive. Food is a globally traded and mass-produced commodity, while housing is tied to inelastic land supply in desirable locations. Farming, grocery, and restaurants are competitive and relatively lightly regulated markets, while housing is highly regulated, subsidized, and distorted.
Construction productivity is worse than stagnant, while agricultural productivity has been ascendant for the past 300 years and even retail productivity is 8x higher than it was in 1950. Construction is also more labor-intensive than farming or staffing the grocery store.
Yet food prices have risen just as much as housing prices over the past 24 years. What explains this?
One trend is that Americans are eating out more. The "Food and Beverages" series from the BLS includes both "Food At Home" and "Food Away From Home." In 2023, eating out was a larger portion of the average household's budget than food at home for the first time, but they have been converging for more than 50 years.
Restaurant food prices have increased faster than grocery prices. This makes sense, as a much larger portion of a restaurant's costs are location and labor, both of which are affected by tight supply constraints on urban floor space. This isn't enough to fully explain away my surprise at the similarity in price growth, though. Even if we just look at "food at home" price growth, it only really sinks below housing after 2015.
Beverages at home/away from home follow a more divergent version of the same pattern, but are a much smaller part of the weighted average that makes up the aggregate index.
The BLS series for "Housing" is also an aggregate index of "Shelter" prices, which is the actual rent (or Owners' Equivalent Rent), and other expenses like utilities, moving, and repairs. Stagnant construction productivity and land use regulation will show up mostly in rents, so these other pieces of the series are masking a bit of the inflation.
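To make that masking point concrete, here is a toy calculation; the component weights and price changes below are invented for illustration, not BLS figures.

```python
# Toy illustration with made-up weights and price changes (not BLS data): a weighted
# aggregate like "Housing" dilutes shelter inflation with flatter components.
components = {
    # name: (hypothetical weight within the aggregate, hypothetical cumulative price change)
    "Shelter":   (0.80, 0.70),
    "Utilities": (0.15, 0.35),
    "Other":     (0.05, 0.20),
}

aggregate_change = sum(weight * change for weight, change in components.values())
print(f"Aggregate index change: {aggregate_change:.2%}")  # 62.25%, below the 70% shelter change
```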
There is also changing composition within the "Food at home" category. Americans eat more fats and oils, more sugars and sweets, more grains, and more red meat: the four items that grew the most in price since 2003.
There's also a flipside to food and beverage's easy tradability: they're closer to the same price everywhere.
House prices per square foot, by contrast, differ by more th... 

Aug 16, 2024 • 7min
EA - This chart is right. Most interventions don't do much. (Cameroon experience) by EffectiveHelp - Cameroon
 Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: This chart is right. Most interventions don't do much. (Cameroon experience), published by EffectiveHelp - Cameroon on August 16, 2024 on The Effective Altruism Forum.
This chart is so right. The local charity environment in Cameroon is probably helping far fewer people than you imagine. We ran an effectiveness contest that aligns with this research perfectly.
In 2021 we created an EA group in Cameroon. We had multiple seminars covering the basics of Effective Altruism. By the end of 2022, the group got so excited that we created a charity.
"We" are a group of humanitarian/development workers in Cameroon, all currently employed in this field of work. Some of the basic EA principles resonated a lot. Such as the feeling that some activities and projects don't really help much and that somewhere, sometimes, there is "real impact".
So we created this charity to help steer organizations towards real impact, and help them become "more effective". We tried a couple of things:
We offered consultancy services, starting for free, to local charities.
We started a contest to find the best projects in Cameroon.
The first thing did not work. See footnote.[1]
Now, about the contest: we think this is relevant to share. The contest helped us confirm this global analysis: some things just work miles better than others, and some organizations are dedicated to things that aren't very useful. We wish there was a nicer way of saying it.
We had 21 submissions in the first year. We designed a simple way to evaluate and compare projects: we divided submissions into 3 categories (health, human rights, and economic) and took all organizations' reports at face value. Based on their own data, there was a huge divide between the top performers and the lowest performers. Then we did field surveys to verify the claimed results of the top 6, and we had our 3 winners, with only one organization really meeting expectations.
Main finding:
There was no correlation between experience and effect, or between grant size and effect; it is as if organizations don't get more effective with experience and professionalism. If anything, the correlation is negative. We think this is because organizations get more effective at capturing donor funding, not at providing a better service. The only real, valuable feedback they get is from donors deciding whether or not to fund them.
So organizations will focus and implement projects based on what donors appear to want, which sometimes may be connected to the most meaningful effects on the people they serve, but not necessarily.
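As a rough sketch of the kind of check described above (the numbers below are placeholders, not the contest data):

```python
# Hedged sketch: test whether grant size or years of experience predict measured effect
# across submissions. The arrays are invented placeholders, not the Cameroon contest data.
import numpy as np
from scipy.stats import spearmanr

grant_size_usd   = np.array([5_000, 20_000, 8_000, 50_000, 12_000, 30_000])
years_experience = np.array([2, 10, 4, 15, 6, 8])
measured_effect  = np.array([40, 5, 25, 3, 18, 7])  # e.g. verified beneficiaries per $1k

print(spearmanr(grant_size_usd, measured_effect))    # rank correlation with effect
print(spearmanr(years_experience, measured_effect))
```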
Details:
First, we had two organizations just applying for funding instead of presenting project results. This happens; it's just a reminder that it is all about donor funding in the end and that sometimes people don't read.
The general tendency was that organizations follow donor trends and work to teach people things they probably already know:
Multiple menstrual health projects translated into a tiny economic transfer (free pads to cover 2 or 3 months) and some lessons that girls either already know or were very likely about to find out on their own.
"Child Protection" is another hot term, particularly in humanitarian contexts, but it was not very clear what people were being taught about and how that helped anyone.
Sexual and reproductive health was also very common, but products are available and cheap, and it is unlikely the information is that new to Cameroonian girls and women right now. HIV rates in the target areas aren't as high as in other countries, and when we ran the numbers it was unlikely that even one infection was averted by these projects.
"inclusion" of persons with disability in the health sector was a beautiful project with multiple complex activities but had no visible effects on people, with disabilities or not. It involved mostly training health workers, but it is important to unders... 

Aug 16, 2024 • 6min
LW - Demis Hassabis - Google DeepMind: The Podcast by Zach Stein-Perlman
 Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Demis Hassabis - Google DeepMind: The Podcast, published by Zach Stein-Perlman on August 16, 2024 on LessWrong.
The YouTube "chapters" are mixed up, e.g. the question about regulation comes 5 minutes after the regulation chapter ends. Ignore them.
Noteworthy parts:
8:40: Near-term AI is hyped too much (think current startups, VCs, exaggerated claims about what AI can do, crazy ideas that aren't ready) but AGI is under-hyped and under-appreciated.
16:45: "Gemini is a project that has only existed for a year . . . our trajectory is very good; when we talk next time we should hopefully be right at the forefront."
17:20-18:50: Current AI doesn't work as a digital assistant. The next era/generation is agents. DeepMind is well-positioned to work on agents: "combining AlphaGo with Gemini."
24:00: Staged deployment is nice: red-teaming then closed beta then public deployment.
28:37 Openness (at Google: e.g. publishing transformers, AlphaCode, AlphaFold) is almost always a universal good. But dual-use technology - including AGI - is an exception. With dual-use technology, you want good scientists to still use the technology and advance as quickly as possible, but also restrict access for bad actors. Openness is fine today but in 2-4 years or when systems are more agentic it'll be dangerous.
Maybe labs should only open-source models that are lagging a year behind the frontier (and DeepMind will probably take this approach, and indeed is currently doing ~this by releasing Gemma weights).
31:20 "The problem with open source is if something goes wrong you can't recall it. With a proprietary model if your bad actor starts using it in a bad way you can close the tap off . . . but once you open-source something there's no pulling it back. It's a one-way door, so you should be very sure when you do that."
31:42: Can an AGI be contained? We don't know how to do that [this suggests a misalignment/escape threat model but it's not explicit]. Sandboxing and normal security is good for intermediate systems but won't be good enough to contain an AGI smarter than us. We'll have to design protocols for AGI in the future: "when that time comes we'll have better ideas for how to contain that, potentially also using AI systems and tools to monitor the next versions of the AI system."
33:00: Regulation? It's good that people in government are starting to understand AI and AISIs are being set up before the stakes get really high. International cooperation on safety and deployment norms will be needed since AI is digital and if e.g. China deploys an AI it won't be contained to China. Also:
Because the technology is changing so fast, we've got to be very nimble and light-footed with regulation so that it's easy to adapt it to where the latest technology's going. If you'd regulated AI five years ago, you'd have regulated something completely different to what we see today, which is generative AI. And it might be different again in five years; it might be these agent-based systems that [] carry the highest risks.
So right now I would [] beef up existing regulations in domains that already have them - health, transport, and so on - I think you can update them for AI just like they were updated for mobile and internet. That's probably the first thing I'd do, while . . . making sure you understand and test the frontier systems. And then as things become [clearer] start regulating around that, maybe in a couple years time would make sense.
One of the things we're missing is [benchmarks and tests for dangerous capabilities].
My #1 emerging dangerous capability to test for is deception because if the AI can be deceptive then you can't trust other tests [deceptive alignment threat model but not explicit]. Also agency and self-replication.
37:10: We don't know how to design a system that could come up with th... 

Aug 16, 2024 • 9min
LW - Adverse Selection by Life-Saving Charities by vaishnav92
 Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Adverse Selection by Life-Saving Charities, published by vaishnav92 on August 16, 2024 on LessWrong.
GiveWell, and the EA community at large, often emphasize the "cost of saving a life" as a key metric, $5,000 being the most commonly cited approximation. At first glance, GiveWell might seem to be in the business of finding the cheapest lives that can be saved, and then saving them. More precisely, GiveWell is in the business of finding the cheapest DALY it can buy.
But implicit in that is the assumption that all DALYs are equal, or that disability or health effects are the only factors we need to adjust for when assessing the value of a life year. However, if DALYs vary significantly in quality (as I'll argue, and as GiveWell acknowledges we have substantial evidence for), then simply minimizing the cost of buying a DALY risks adverse selection.
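A toy illustration of that selection effect (all numbers invented): if quality adjustments are ignored, the ranking of interventions by cost per DALY can flip once quality is accounted for.

```python
# Toy example with invented numbers: ranking by raw cost per DALY can invert once each
# DALY is weighted by a (contested, hard-to-measure) quality factor.
interventions = {
    # name: (cost per DALY in USD, hypothetical quality weight of that DALY)
    "Intervention A": (100, 0.5),
    "Intervention B": (150, 0.9),
}

for name, (cost, quality) in interventions.items():
    print(f"{name}: ${cost}/DALY raw, ${cost / quality:.0f} per quality-weighted DALY")
# Raw ranking favours A (100 < 150); quality-weighted, the ranking flips (200 > 167).
```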
It's indisputable that each dollar goes much further in the poorest parts of the world. But it goes further towards saving lives in the poorest parts of the world, often countries with terrible political institutions, fewer individual freedoms, and oppressive social norms. More importantly, these conditions are not exogenous to the cost of saving a life. They are precisely what drive that cost down.
Most EAs won't need convincing of the fact that the average life in New Zealand is much, much better than the average life in the Democratic Republic of Congo. In fact, those of us who donate to GiveDirectly do so precisely because this is the case. Extreme poverty and the suffering it entails is worth alleviating, wherever it can be found.
But acknowledging this contradicts the notion that, when saving lives, philanthropists are suddenly in no position to make judgements about how anything other than physical disability affects the value or quality of a life.
To be clear, GiveWell won't be shocked by anything I've said so far. They've commissioned work and published reports on this. But as you might expect, these quality-of-life adjustments wouldn't feature in GiveWell's calculations anyway, since the pitch to donors is about the price paid for a life, or a DALY. But the idea that life is worse in poorer countries significantly understates the problem - that the project of minimizing the cost of lives saved, while making no adjustments for the quality of the lives saved, will systematically bias you towards saving the lives least worth living.
In advanced economies, prosperity is downstream of institutions that preserve the rule of law, guarantee basic individual freedoms, prevent the political class from raiding the country, etc. Except for the Gulf monarchies, no country has delivered prosperity for its citizens without at least doing this.
This doesn't need to take the form of liberal democracy; countries like China and Singapore are more authoritarian, but their political institutions are largely non-corrupt, preserve the will of the people, and enable the creation of wealth and the development of human capital. One can't say this about the countries of sub-Saharan Africa.
High rates of preventable death and disease in these countries are symptoms of institutional dysfunction that touches every facet of life. The reason it's so cheap to save a life in these countries is also that there is low-hanging fruit that their political institutions somehow managed to stand in the way of. And one has to consider all the ways in which this bad equilibrium touches the ability to live a good life.
More controversially, these political institutions aren't just levitating above local culture and customs. They interact and shape each other. The oppressive conditions that women (50% of the population) and other sexual minorities face in these countries isn't a detail that we can gloss over. If you are both a liberal and a consequentialis... 

Aug 16, 2024 • 9min
EA - CEA is hiring a Head of Operations (apply by Sep 16) by JP Addison
 Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: CEA is hiring a Head of Operations (apply by Sep 16), published by JP Addison on August 16, 2024 on The Effective Altruism Forum.
Application deadline: September 16th 2024. Applications will be reviewed on a rolling basis.
We are hiring a Head of Operations to set a newly-independent CEA up for success by establishing our new legal entities, managing our spin-out process, and building a world class operations team.
I (JP) am crossposting the whole job description because I think it's a good look into what a really senior ops role looks like, and what CEA in particular has going on behind the scenes.
CEA is entering a new era having recently appointed a new CEO and begun the process of spinning out from Effective Ventures (EV) to become an independent organisation. With around 40 staff (and growing), a budget of over $25 million, and an established track record of delivering highly impactful programs, we're uniquely well-positioned to steward the effective altruism community, increase its potential for impact, and its capacity to fulfil that potential.
This role is an unusual leadership opportunity to build a team from the ground up within an established organisation. We expect the team to grow quickly, reaching between 5 and 15 people by the time we become independent in 2025.
Most or all of these staff will be new to CEA, and our Head of Operations will play a vital role in ensuring the success of a critical team that will be among the largest at CEA and will provide the infrastructure required for us to implement ambitious and complex projects.
Exceptional candidates will have the potential to grow into an externally-facing community leadership role with the goal of increasing the impact of value-aligned organisations by helping others implement best practices and creating operational infrastructure that can be used throughout the community.
You are well-suited to this senior role if you have experience in operations or systems management and a track record of managing a successful team. You would report to the CEO, and have a significant degree of autonomy in terms of figuring out the best way to achieve our goals and executing plans that deliver them.
Given the scale and complexity of our financial, legal and logistical operations, and the potential for growth, we expect this to be an opportunity for top candidates to have a substantial counterfactual impact.
Applications will be reviewed on a rolling basis.
What would you do?
This role would involve two main and related responsibilities:
1. Overseeing the creation of the new legal entities, systems and processes necessary for CEA to exist completely independent of Effective Ventures.
2. Leading an Operations team that delivers excellence across the range of its responsibilities and is a strategic thought partner for other teams' and organisations' operational needs.
We expect the spin-out process to take between 12 and 24 months: faster is better, but we're prioritising making spinning out go well, not necessarily quickly. This process is likely to be complicated and multi-faceted - requiring back and forth with lawyers, external stakeholders, and the relevant regulatory agencies.
As part of Effective Ventures, our operational systems and processes are currently provided to us by the EV Ops team. An independent CEA will likely need to be self-sufficient in the following areas (and potentially others):[1]
Finance: we have a budget of over $25m, a demanding set of reporting requirements, and many of our projects depend on fast, accurate, and well-documented payments to locations and vendors worldwide.
Grantmaking: we support a wide network of community building organisations and individuals with grants that fund their impactful work.
Systems: we use and manage a multitude of systems and services, including Salesforce,... 

Aug 15, 2024 • 13min
LW - Danger, AI Scientist, Danger by Zvi
 Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Danger, AI Scientist, Danger, published by Zvi on August 15, 2024 on LessWrong.
While I finish up the weekly for tomorrow morning after my trip, here's a section I expect to want to link back to every so often in the future. It's too good.
Danger, AI Scientist, Danger
As in, the company that made the automated AI Scientist that tried to rewrite its code to get around resource restrictions and launch new instances of itself while downloading bizarre Python libraries?
Its name is Sakana AI. (魚סכנה). As in, in Hebrew, that literally means 'danger', baby.
It's like when someone told Dennis Miller that Evian (for those who don't remember, it was one of the first bottled water brands) is Naive spelled backwards, and he said 'no way, that's too f***ing perfect.'
This one was sufficiently appropriate and unsubtle that several people noticed. I applaud them choosing a correct Kabbalistic name. Contrast this with Meta calling its AI Llama, which in Hebrew means 'why,' which continuously drives me low level insane when no one notices.
In the Abstract
So, yeah. Here we go. Paper is "The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery."
Abstract: One of the grand challenges of artificial general intelligence is developing agents capable of conducting scientific research and discovering new knowledge. While frontier models have already been used as aids to human scientists, e.g. for brainstorming ideas, writing code, or prediction tasks, they still conduct only a small part of the scientific process.
This paper presents the first comprehensive framework for fully automatic scientific discovery, enabling frontier large language models to perform research independently and communicate their findings.
We introduce The AI Scientist, which generates novel research ideas, writes code, executes experiments, visualizes results, describes its findings by writing a full scientific paper, and then runs a simulated review process for evaluation. In principle, this process can be repeated to iteratively develop ideas in an open-ended fashion, acting like the human scientific community.
We demonstrate its versatility by applying it to three distinct subfields of machine learning: diffusion modeling, transformer-based language modeling, and learning dynamics. Each idea is implemented and developed into a full paper at a cost of less than $15 per paper.
To evaluate the generated papers, we design and validate an automated reviewer, which we show achieves near-human performance in evaluating paper scores. The AI Scientist can produce papers that exceed the acceptance threshold at a top machine learning conference as judged by our automated reviewer.
This approach signifies the beginning of a new era in scientific discovery in machine learning: bringing the transformative benefits of AI agents to the entire research process of AI itself, and taking us closer to a world where endless affordable creativity and innovation can be unleashed on the world's most challenging problems. Our code is open-sourced at this https URL
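Purely as a schematic of the loop the abstract describes (every function here is a placeholder stub standing in for LLM calls and a sandboxed experiment runner, not part of the actual AI Scientist codebase):

```python
# Schematic only: a toy skeleton of the generate -> code -> run -> write -> review loop.
def generate_idea(topic: str) -> str:
    return f"a novel idea about {topic}"          # stands in for an LLM call

def write_experiment_code(idea: str) -> str:
    return f"# experiment implementing: {idea}"   # stands in for LLM-written code

def run_experiments(code: str) -> dict:
    return {"metric": 0.0}                        # the real system runs this in a sandbox

def write_paper(idea: str, results: dict) -> str:
    return f"Paper on {idea} (metric={results['metric']})"

def simulated_review(paper: str) -> float:
    return 4.5                                    # stands in for the automated reviewer

def ai_scientist_iteration(topic: str) -> dict:
    idea = generate_idea(topic)
    results = run_experiments(write_experiment_code(idea))
    paper = write_paper(idea, results)
    return {"paper": paper, "score": simulated_review(paper)}

print(ai_scientist_iteration("diffusion modeling"))
```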
We are at the point where they incidentally said 'well I guess we should design an AI to do human-level paper evaluations' and that's a throwaway inclusion.
The obvious next question is, if the AI papers are good enough to get accepted to top machine learning conferences, shouldn't you submit its papers to the conferences and find out if your approximations are good? Even if on average your assessments are as good as a human's, that does not mean that a system that maximizes score on your assessments will do well on human scoring.
Beware Goodhart's Law and all that, but it seems for now they mostly only use it to evaluate final products, so mostly that's safe.
How Any of This Sort of Works
According to section 3, there are three phases.
1. Idea generation using ... 

Aug 15, 2024 • 10min
LW - A computational complexity argument for many worlds by jessicata
 Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: A computational complexity argument for many worlds, published by jessicata on August 15, 2024 on LessWrong.
The following is an argument for a weak form of the many-worlds hypothesis. The weak form I mean is that there are many observers in different branches of the wave function. The other branches "actually exist" for anthropic purposes; some observers are observing them.
I've written before about difficulties with deriving discrete branches and observers from the Schrödinger equation; I'm ignoring this difficulty for now, instead assuming the existence of a many-worlds theory that specifies discrete branches and observers somehow.
To be clear, I'm not confident in the conclusion; it rests on some assumptions. In general, physics theories throughout history have not been completely correct. It would not surprise me if a superintelligence would consider many-worlds to be a false theory. Rather, I am drawing implications from currently largely accepted physics and computational complexity theory, and plausible anthropic assumptions.
First assumption: P != BQP. That is, there are some decision problems that cannot be decided in polynomial time by a classical computer but can be decided in polynomial time by an idealized quantum computer. This is generally accepted (RSA security depends on it) but not proven. This leaves open the possibility that the classically hardest BQP problems are only slightly harder than polynomial time.
Currently, it is known that factorizing a b-bit integer can be done in roughly O(exp(c·b^(1/3))) time, where c is a constant greater than 1, while it can be done in polynomial time on an idealized quantum computer. I want to make an assumption that there are decision problems in BQP whose running time is "fast-growing", and I would consider O(exp(c·b^(1/3))) "fast-growing" in this context despite not being truly exponential time.
For example, a billion-bit number would require at least exp(1000) time to factorize with known classical methods, which is a sufficiently huge number for the purposes of this post.
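A quick back-of-the-envelope check of that claim, using the post's simplified exp(c·b^(1/3)) form with c = 1 as a lower bound:

```python
# Back-of-the-envelope check of the simplified classical cost exp(c * b**(1/3)),
# taking c = 1 as a lower bound (the (log log n) factor of the number field sieve is dropped).
import math

b = 10**9                      # bits in the integer to factor
exponent = b ** (1 / 3)        # = 1000 with c = 1
digits = exponent / math.log(10)
print(f"classical time >= exp({exponent:.0f}) ~ 10^{digits:.0f} steps")
# versus polynomial time on an idealized quantum computer (e.g. Shor's algorithm).
```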
Second assumption: The universe supports BQP computation in polynomial physical resources and clock time. That is, it's actually possible to build a quantum computer and solve BQP problems in polynomial clock time with polynomial physical resources (space, matter, energy, etc). This is implied by currently accepted quantum theories (up to a reasonably high limit of how big a quantum computer can be).
Third assumption: A "computational density anthropic prior", combining SIA with a speed prior, is a good prior over observations for anthropic purposes. As background, SIA stands for "self-indication assumption" and SSA stands for "self-sampling assumption"; I'll assume familiarity with these theories, as specified by Bostrom. According to SIA, all else being equal, universes that have more observers are more likely.
Both SSA and SIA accept that universes with no observers are never observed, but only SIA accepts that universes with more observers are in general more likely. Note that SSA and SIA tend to converge in large universes (that is, in a big universe or multiverse with many observers, you're more likely to observe parts of the universe/multiverse with more observers, because of sampling).
The speed prior implies that, all else being equal, universes that are more efficient to simulate (on some reference machine) are more likely. A rough argument for this is that in a big universe, many computations are run, and cheap computations are run more often, generating more observers.
The computational density anthropic prior combines SIA with a speed prior, and says that we are proportionally more likely to observe universes that have a high ratio of observer-moments to required computation time.
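As a toy numerical rendering of that prior (universe names and all numbers are invented), the weight of a hypothesis is proportional to its observer-moments divided by the computation required to simulate it on the reference machine:

```python
# Toy illustration of the "computational density" prior: weight each hypothetical universe
# by observer-moments per unit of simulation cost, then normalize. Numbers are invented.
universes = {
    # name: (observer_moments, simulation_cost_in_reference_machine_steps)
    "cheap-to-simulate": (1e10, 1e12),
    "costly-to-simulate": (1e10, 1e30),
}

weights = {name: obs / cost for name, (obs, cost) in universes.items()}
total = sum(weights.values())
for name, w in weights.items():
    print(f"P(observe {name}) ~ {w / total:.2e}")
```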
We could imagine aliens simulating many universes in paral... 

Aug 15, 2024 • 52sec
EA - My article in The Nation - California's AI Safety Bill Is a Mask-Off Moment for the Industry by Garrison
 Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: My article in The Nation - California's AI Safety Bill Is a Mask-Off Moment for the Industry, published by Garrison on August 15, 2024 on The Effective Altruism Forum.
I wrote an article for The Nation on California AI safety bill SB 1047 and the reaction to it from the AI industry, investors, and the broader tech community. The story was informed by conversations with over a dozen relevant sources and comes shortly before the bill faces a floor vote in the California Assembly.
I think it's useful to understand how industry responds to attempts to regulate AI, and centered my analysis on that topic.
If you're interested in helping share the article, I made a Tweet thread.
Thanks for listening. To help us out with The Nonlinear Library or to learn more, please visit nonlinear.org 


