

The Nonlinear Library: LessWrong
The Nonlinear Fund
The Nonlinear Library allows you to easily listen to top EA and rationalist content on your podcast player. We use text-to-speech software to create an automatically updating repository of audio content from the EA Forum, Alignment Forum, LessWrong, and other EA blogs. To find out more, please visit us at nonlinear.org
Episodes

May 16, 2024 • 5min
LW - Why you should learn a musical instrument by cata
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Why you should learn a musical instrument, published by cata on May 16, 2024 on LessWrong.
I have liked music very much since I was a teenager. I spent many hours late at night in Soulseek chat rooms talking about and sharing music with my online friends. So, I tend to just have some music floating around in my head on any given day. But, I never learned to play any instrument, or use any digital audio software. It just didn't catch my interest.
My wife learned to play piano as a kid, so we happen to have a keyboard sitting around in our apartment. One day I was bored so I decided to just see whether I could figure out how to play some random song that I was thinking about right then. I found I was easily able to reconstitute a piano version of whatever melody I was thinking of, just by brute-forcing which notes were which, given a lot of patience. So that was satisfying enough that I wanted to keep doing it.
What I didn't know is how immediately thought-provoking it would be to learn even the most basic things about playing music. Maybe it's like learning to program, if you used a computer all the time but you never had one thought about how it might work.
Many of the things I learned immediately that surprised me were about my perception of the music I had listened to for all of my life. In my mind, my subjective experience of remembering music that I am very familiar with seems very vivid. I feel like I can imagine all the instruments and imagine all the sounds, just like they were in the song. But once I had to reconstruct the music myself, it quickly became clear that I was tricking myself in a variety of ways.
For example, my memory of the main melody would be very clear. But my memory of any harmony or accompaniment was typically totally vague. I absolutely could not reconstruct something to play with my left hand on the piano, because I wasn't actually remembering it; I was just remembering something more abstract, I guess.
Sometimes I would be convinced I would remember a melody and reproduce it on the keyboard, but then I would listen to the real song and be surprised. The most common way I got surprised was that in my memory, I had adjusted it so that I could physically sing or hum it, even though I don't often sing.
If there was a big jump up or down the scale, I would do something in my memory that sounded sort of OK instead, like replace it with a repeated note, or the same thing moved an octave, and then forget that it had ever been any other way.
I found that if I was remembering something that had fast playing, I often actually could not remember the specific notes in between beats, even though I felt that I could hear it in my head. No matter how hard I "focused" on my memory I couldn't get more detail. Actually, I found that there was some speed such that even listening to the music, I could no longer resolve the individual notes, no matter how hard I paid attention or how many times I replayed it.
There have been many more kinds of things I have learned since learning to play a little:
Since playing music on a keyboard is a complicated physical task involving complicated coordination, I learned a lot about what both of my hands are naturally good and bad at, and what sort of things they can coordinate easily or poorly.[1]
Learning the musical structure of songs that I know and trying to arrange them for piano showed me all kinds of self-similarity and patterns inside the songs that I had never had a clue about before. I could listen to a song hundreds of times and not realize, for example, that two parts of the song were the same phrase being played on two different instruments in a very slightly different way.
Often I will be trying to learn to play something using one "technique" for learning and practicing it, and having a hard time, and then I...

May 16, 2024 • 10min
LW - Do you believe in hundred dollar bills lying on the ground? Consider humming by Elizabeth
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Do you believe in hundred dollar bills lying on the ground? Consider humming, published by Elizabeth on May 16, 2024 on LessWrong.
Introduction
[Reminder: I am an internet weirdo with no medical credentials]
A few months ago, I published some crude estimates of the power of nitric oxide nasal spray to hasten recovery from illness, and speculated about what it could do prophylactically. While working on that piece a nice man on Twitter alerted me to the fact that humming produces lots of nasal nitric oxide. This post is my very crude model of what kind of anti-viral gains we could expect from humming.
I've encoded my model at Guesstimate. The results are pretty favorable (average estimated impact of 66% reduction in severity of illness), but extremely sensitive to my made-up numbers. Efficacy estimates go from ~0 to ~95%, depending on how you feel about publication bias, what percent of Enovid's impact can be credited to nitric oxide, and humming's relative effect. Given how speculative some of these numbers are, I strongly encourage you to speculate some numbers of your own and test them out in the Guesstimate model.
If you want to know how nitric oxide reduces disease, check out my original post.
Math
Estimating the impact of Enovid
I originally estimated the (unadjusted) efficacy of nitric oxide nasal sprays after diagnosis at 90% overall reduction in illness, killing ~50% of viral particles per application. Enovid has three mechanisms of action. Of the papers I looked at in that post, one mentioned two of the three mechanisms (nitric oxide and a second) but not the third, and the other mentioned only nitric oxide.
So how much of that estimated efficacy is due to nitric oxide alone? I don't know, so I put a term in the Guesstimate with a very wide range. I set the lower bound to 1/3 (one of three mechanisms) and the upper bound to 1 (if all of the effect was due to NO).
There's also the question of how accurate the studies I read are. There are only two, they're fairly small, and they're both funded by Enovid's manufacturer. One might reasonably guess that their numbers are an overestimate. I put another fudge factor in for publication bias, ranging from 0.01 (spray is useless) to 1 (published estimate is accurate).
How much nitric oxide does Enovid release?
This RCT registration uses a nitric oxide nasal spray (and mentions no other mechanisms). They don't give a brand name but it's funded by the company that produces Enovid. In this study, each application delivers 0.56 mL of nitric oxide releasing solution (NORS) (this is the same dose you get from commercial Enovid), which delivers "0.11ppm [NO]*hrs".
There are a few things that confusing phrase could mean:
The solution keeps producing 0.11ppm NO for several hours (very unlikely).
The application produces 0.88ppm NO almost immediately (0.11*8, where 8 hours is the inter-application interval), which quickly reacts to form some other molecule. This is my guess, and what I'll use going forward. It won't turn out to matter much.
Some weirder thing.
How much nitric oxide does humming move into the nose?
Here we have much more solid numbers. NO concentration is easy to measure. Individuals vary of course, but on average humming increases NO concentration in the nose by 15x-20x. Given baseline levels of (on average) 0.14ppm in women and 0.18ppm in men, this works out to a 1.96-3.42 ppm increase. More than twice what Enovid manages.
The dominant model is that the new NO in the nose is borrowed from the sinuses rather than being newly generated. Even if this is true I don't think it matters; sinus concentrations are 100x higher than the nose's and replenish quickly.
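As a sanity check on the arithmetic above, here is a small Monte Carlo sketch in Python. It is my own rough rendering, not the linked Guesstimate model: it only reproduces the quantities described so far (the NO dose per Enovid application, the NO increase from humming, and the two fudge factors), the ranges are the made-up ones from the text, and the uniform-distribution choice and variable names are my assumptions. How these feed into a final humming-efficacy number is left to the Guesstimate model.

```python
# Rough sketch of the quantities described above; ranges are the post's made-up ones,
# the uniform-distribution choice and variable names are my own assumptions.
import numpy as np

rng = np.random.default_rng(0)
n = 100_000

# Enovid: "0.11ppm [NO]*hrs" read as ~0.88 ppm delivered per application (0.11 * 8).
enovid_no_ppm = 0.11 * 8

# Humming: 15x-20x increase over baseline nasal NO of 0.14 (women) to 0.18 (men) ppm.
baseline_ppm = rng.uniform(0.14, 0.18, n)
fold_increase = rng.uniform(15, 20, n)
humming_no_increase = baseline_ppm * (fold_increase - 1)  # roughly 1.96-3.42 ppm

# Fudge factors applied to Enovid's estimated ~90% reduction in illness.
enovid_efficacy = 0.90
no_share = rng.uniform(1 / 3, 1, n)   # fraction of Enovid's effect due to NO
pub_bias = rng.uniform(0.01, 1, n)    # 0.01 = spray useless, 1 = estimate accurate
no_attributable_efficacy = enovid_efficacy * no_share * pub_bias

print("humming NO increase, ppm (5th/50th/95th pct):",
      np.percentile(humming_no_increase, [5, 50, 95]).round(2))
print("NO dose ratio, humming / Enovid:",
      np.percentile(humming_no_increase / enovid_no_ppm, [5, 50, 95]).round(2))
print("Enovid efficacy attributable to NO:",
      np.percentile(no_attributable_efficacy, [5, 50, 95]).round(2))
```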
Estimating the impact of humming
As far as I can find, there are no published studies on humming as an antimicrobial intervention. There is lots of circumstantial evid...

May 15, 2024 • 5min
LW - MIRI's May 2024 Newsletter by Harlan
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: MIRI's May 2024 Newsletter, published by Harlan on May 15, 2024 on LessWrong.
MIRI updates:
MIRI is shutting down the Visible Thoughts Project.
We originally announced the project in November of 2021. At the time we were hoping we could build a new type of data set for training models to exhibit more of their inner workings. MIRI leadership is pessimistic about humanity's ability to solve the alignment problem in time, but this was an idea that seemed relatively promising to us, albeit still a longshot.
We also hoped that the $1+ million bounty on the project might attract someone who could build an organization to build the data set. Many of MIRI's ambitions are bottlenecked on executive capacity, and we hoped that we might find individuals (and/or a process) that could help us spin up more projects without requiring a large amount of oversight from MIRI leadership.
Neither hope played out, and in the intervening time, the ML field has moved on. (ML is a fast-moving field, and alignment researchers are working on a deadline; a data set we'd find useful if we could start working with it in 2022 isn't necessarily still useful if it would only become available 2+ years later.) We would like to thank the many writers and other support staff who contributed over the last two and a half years.
Mitchell Howe and Joe Rogero joined the comms team as writers. Mitch is a longtime MIRI supporter with a background in education, and Joe is a former reliability engineer who has facilitated courses for BlueDot Impact. We're excited to have their help in transmitting MIRI's views to a broad audience.
Additionally, Daniel Filan will soon begin working with MIRI's new Technical Governance Team part-time as a technical writer. Daniel is the host of two podcasts: AXRP and The Filan Cabinet. As a technical writer, Daniel will help to scale up our research output and make the Technical Governance Team's research legible to key audiences.
The Technical Governance Team submitted responses to the NTIA's request for comment on open-weight AI models, the United Nations' request for feedback on the Governing AI for Humanity interim report, and the Office of Management and Budget's request for information on AI procurement in government.
Eliezer Yudkowsky spoke with Semafor for a piece about the risks of expanding the definition of "AI safety": "You want different names for the project of 'having AIs not kill everyone' and 'have AIs used by banks make fair loans.'"
A number of important developments in the larger world occurred during the MIRI Newsletter's hiatus from July 2022 to April 2024. To recap just a few of these:
In November of 2022, OpenAI released ChatGPT, a chatbot application that reportedly gained 100 million users within 2 months of its launch. As we mentioned in our 2024 strategy update, GPT-3.5 and GPT-4 were more impressive than some of the MIRI team expected, representing a pessimistic update for some of us "about how plausible it is that humanity could build world-destroying AGI with relatively few (or no) additional algorithmic advances". ChatGPT's success significantly increased public awareness of AI and sparked much of the post-2022 conversation about AI risk.
In March of 2023, the Future of Life Institute released an open letter calling for a six-month moratorium on training runs for AI systems stronger than GPT-4. Following the letter's release, Eliezer wrote in TIME that a six-month pause is not enough and that an indefinite worldwide moratorium is needed to avert catastrophe.
In May of 2023, the Center for AI Safety released a one-sentence statement, "Mitigating the risk of extinction from AI should be a global priority alongside other societal-scale risks such as pandemics and nuclear war." We were especially pleased with this statement, because it focused attention ...

May 15, 2024 • 12min
LW - Catastrophic Goodhart in RL with KL penalty by Thomas Kwa
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Catastrophic Goodhart in RL with KL penalty, published by Thomas Kwa on May 15, 2024 on LessWrong.
TLDR: In the last two posts, we showed that optimizing for a proxy can fail to increase true utility, but only when the error is heavy-tailed. We now show that this also happens in RLHF with a KL penalty.
This post builds on our earlier result with a more realistic setting and assumptions:
Rather than modeling optimization as conditioning on a minimum reward threshold, we study maximization of reward with a KL divergence penalty, as in RLHF.
We remove the assumption of independence between the error and utility distributions, which we think was the weakest part of the last post.
When the true utility V is light-tailed, the proxy can be maximized while keeping E[V] at the same level as the prior. We can't guarantee anything about E[V] when V is heavy-tailed; it could even go to minus infinity.
Abstract
When applying KL regularization, the trained model is regularized towards some prior policy π0. One would hope that a KL penalty can produce good outcomes even in the case of reward misspecification; that is, if the reward U is the sum of true utility V and an error term X, we would hope that optimal policies under a KL penalty achieve high V even if the magnitude of X is large.
We show that this is not always the case: when X is heavy-tailed, there are arbitrarily well-performing policies π with Eπ[V] ≤ Eπ0[V]; that is, policies that get no higher true utility than the prior. However, when the error is light-tailed and independent of V, the optimal policy under a KL penalty results in E[V] > 0, and E[V] can be made arbitrarily large. Thus, the tails of the error distribution are crucial in determining how much utility will result from optimization towards an imperfect proxy.
Intuitive explanation of catastrophic Goodhart with a KL penalty
Recall that the KL divergence between two distributions P and Q is defined as DKL(P || Q) = E_{x~P}[log P(x) − log Q(x)].
If we have two policies π, π0, we abuse notation to define DKL(π || π0) as the KL divergence between the distributions of actions taken on the states in trajectories reached by π. That is, if Tr(π) is the distribution of trajectories taken by π, we penalize E_{τ~Tr(π)}[ Σ_t DKL( π(·|s_t) || π0(·|s_t) ) ].
This strongly penalizes π taking actions the base policy π0 never takes, but does not force the policy to take all actions the base policy takes.
If our reward model gives reward U, then the optimal policy for RLHF with a KL penalty is the policy π* whose trajectory distribution satisfies Tr(π*)(τ) ∝ Tr(π0)(τ) exp(U(τ)/β).
Suppose we have an RL environment with reward U = X + V, where X is an error term that is heavy-tailed under π0, and V is the "true utility", assumed to be light-tailed under π0. Without loss of generality, we assume that E[U(π0)] = 0. If we optimize for E[U(π)] − βDKL(π || π0), there is no maximum because this expression is unbounded. In fact, for any M and any ε > 0, it is possible to get E[U(π)] > M and DKL(π || π0) < ε.
For such policies π, it is necessarily the case that lim_{ε→0} E[V(π)] = 0; that is, for policies with low KL penalty, utility goes to zero. Like in the previous post, we call this catastrophic Goodhart because the utility produced by our optimized policy is as bad as if we hadn't optimized at all. This is a corollary of a property about distributions (Theorems 1 and 3 below) which we apply to the case of RLHF with unbounded rewards (Theorem 2).
The manner in which these pathological policies π achieve high E[U] is also concerning: most of the time they match the reference policy π0, but a tiny fraction of the time they will pick trajectories with extremely high reward. Thus, if we only observe actions from the policy π, it could be impossible to tell whether π is Goodharting or identical to the base policy.
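As a purely numerical illustration of this intuition (my own toy construction, not from the post or its proofs): take a base policy that is uniform over N sampled "trajectories" with reward U = X + V, and mix in a small probability ε of the single highest-error trajectory, sized so that expected proxy reward rises by a fixed amount. When X is heavy-tailed, the KL cost of that gain shrinks as N grows and E[V] barely moves; when X is light-tailed, the same gain carries a large KL cost.

```python
# Toy demo: buying a fixed gain in E[U] (U = X + V) by upweighting the max-error
# outcome. Distribution choices (Pareto vs. normal error) and the mixture policy
# are illustrative assumptions, not the post's exact setup.
import numpy as np

rng = np.random.default_rng(0)
M = 2.0  # target gain in expected proxy reward over the base (uniform) policy

def mixture_stats(X, V, eps):
    """Base policy: uniform over the N sampled outcomes.
    Perturbed policy: (1 - eps) * uniform + eps * delta at the max-X outcome."""
    N = len(X)
    j = int(np.argmax(X))
    U = X + V
    gain_U = (1 - eps) * U.mean() + eps * U[j] - U.mean()
    shift_V = (1 - eps) * V.mean() + eps * V[j] - V.mean()
    # Exact KL(pi || pi0) of this mixture against the uniform base policy.
    p_other = (1 - eps) / N
    p_star = (1 - eps) / N + eps
    kl = (N - 1) * p_other * np.log(p_other * N) + p_star * np.log(p_star * N)
    return gain_U, shift_V, kl

for N in [10**4, 10**5, 10**6]:
    V = rng.normal(size=N)                                 # light-tailed true utility
    for name, X in [("heavy", rng.pareto(1.5, size=N)),    # heavy-tailed error
                    ("light", rng.normal(size=N))]:        # light-tailed error
        eps = min(0.99, M / (X.max() - X.mean()))  # sized so the U-gain is about M
        gain_U, shift_V, kl = mixture_stats(X, V, eps)
        print(f"N={N:>8} {name}-tailed X: gain E[U]={gain_U:5.2f} "
              f"shift E[V]={shift_V:+.3f} KL={kl:8.4f}")
```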
Results
All proofs are in the appendix, which will be published shortly after this post.
X heavy-tailed, V light-tailed: E[V] → 0
We'll start by demon...

May 15, 2024 • 4min
LW - Teaching CS During Take-Off by andrew carle
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Teaching CS During Take-Off, published by andrew carle on May 15, 2024 on LessWrong.
I stayed up too late collecting way-past-deadline papers and writing report cards. When I woke up at 6, this anxious email from one of my g11 Computer Science students was already in my Inbox.
Student: Hello Mr. Carle, I hope you've slept well; I haven't.
I've been seeing a lot of new media regarding how developed AI has become in software programming, most relevantly videos about NVIDIA's new artificial intelligence software developer, Devin.
Things like these are almost disheartening for me to see as I try (and struggle) to get better at coding and developing software. It feels like I'll never use the information that I learn in your class outside of high school because I can just ask an AI to write complex programs, and it will do it much faster than I would.
I'd like to know what your thoughts on this are. Do you think AI will replace human software developers, as NVIDIA claims it will?
My response: Buddy, that is a big question for 5:15 am.
First AI horizon thoughts:
1. Software development as a field will look incredibly different in 10 years.
2. My priors say that MOST of human intellectual+economic activity will ALSO be radically different in 10 years.
3. I have a very small p(doom) for the 10 year horizon. That means I don't expect human-equivalent AGIs to completely disrupt human civilisation within 10 years.
4. The delta between how fast AI will affect software engineering and how fast AI will transform other (roughly speaking) white collar careers is relatively small. That means I think the AI effect on, say, hedge fund management and on software engineering will be similar.
Then some priors I have for teaching IB Computer Science in the middle of this take-off:
1. I don't think becoming a software engineer is the modal outcome for IBCS students
2. I believe that most long term personal utility from IBCS (or any other intro CS exposure) comes from shifting a student's mental model of how the modern social and economic system interacts with / depends on these technologies.
3. While the modern AI tools are light years beyond the simple Von Neumann CPU models and intro Python we're studying, this class does address the foundations of those systems. Similarly, HL Analysis and HL Physics don't cover anything about the math and physics that underpin these huge ML systems, but that foundation IS there. You can't approach the superstructure without it.
So, in summary, if your concern is "the world seems to be changing fast. This class is hard, and I don't think there's any chance that I will find a 2022 Novice SoftwareDev job when I'm out of university in 2029," I would strongly agree with that sentiment.
I have a Ron Swanson detachment on the importance of formal schooling. If your question was "is a traditional education sequence the best way to prepare myself for the turbulent AI takeoff period," then I strongly disagree with that statement. Education is intrinsically reflective and backward looking.
But I'm employed as a high school teacher. And your parents have decided to live here and send you to this school. So, I'm not sure if advice on that axis is actionable for either of us. There's also a huge chasm between "this isn't the best of all possible options" and "this has zero value."
If I reframed your statement as "given that I'm in this limited option IB program, what classes will provide me the best foundation to find opportunities and make novel insights in the turbulent AI takeoff period" I would feel confident recommending IBCS.
That doesn't make learning to code any easier.
Is that a good answer to a 17 year old? Is there a good answer to this?
One of the best parts of teaching is watching young people wake up to the real, fundamental issues and challenges of human civilisation an...

May 15, 2024 • 48sec
LW - Ilya Sutskever and Jan Leike resign from OpenAI by Zach Stein-Perlman
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Ilya Sutskever and Jan Leike resign from OpenAI, published by Zach Stein-Perlman on May 15, 2024 on LessWrong.
Ilya Sutskever and Jan Leike have resigned. They led OpenAI's alignment work. Superalignment will now be led by John Schulman, it seems. Jakub Pachocki replaced Sutskever as Chief Scientist.
Reasons are unclear (as usual when safety people leave OpenAI).
The NYT piece and others I've seen don't really have details. Archive of NYT if you want to read it anyway.
OpenAI announced Sutskever's departure in a blogpost.
Thanks for listening. To help us out with The Nonlinear Library or to learn more, please visit nonlinear.org

May 14, 2024 • 14min
LW - How to do conceptual research: Case study interview with Caspar Oesterheld by Chi Nguyen
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: How to do conceptual research: Case study interview with Caspar Oesterheld, published by Chi Nguyen on May 14, 2024 on LessWrong.
Caspar Oesterheld came up with two of the most important concepts in my field of work: Evidential Cooperation in Large Worlds and Safe Pareto Improvements. He also came up with a potential implementation of evidential decision theory in boundedly rational agents called decision auctions, wrote a comprehensive review of anthropics and how it interacts with decision theory which most of my anthropics discussions built on, and independently decided to work on AI some time in late 2009 or early 2010.
Needless to say, I have a lot of respect for Caspar's work. I've often felt very confused about what to do in my attempts at conceptual research, so I decided to ask Caspar how he did his research. Below is my writeup from the resulting conversation.
How Caspar came up with surrogate goals
The process
Caspar had spent six months FTE thinking about a specific bargaining problem between two factions with access to powerful AI, spread over two years.
A lot of the time was spent on specific somewhat narrow research projects, e.g. modelling the impact of moral advocacy in China on which bargaining problems we'll realistically encounter in the future. At the time, he thought those particular projects were important although he maybe already had a hunch that he wouldn't think so anymore ten years down the line.
At the same time, he also spent some time on most days thinking about bargaining problems on a relatively high level, either in discussions or on walks. This made up some double digit percentage of his time spent researching bargaining problems.
Caspar came up with the idea of surrogate goals during a conversation with Tobias Baumann. Caspar describes the conversation leading up to the surrogate goal idea as "going down the usual loops of reasoning about bargaining" where you consider just building values into your AI that have properties that are strategically advantaged in bargaining but then worrying that this is just another form of aggressive bargaining.
The key insight was to go "Wait, maybe there's a way to make it not so bad for the other side." Hence, counterpart-friendly utility function modifications were born which later on turned into surrogate goals.
Once he had the core idea of surrogate goals, he spent some time trying to figure out what the general principle behind "this one weird trick" he found was. Thus, with Vincent Conitzer as his co-author, his SPI paper was created, and he continues trying to answer this question now.
Caspar's reflections on what was important during the process
He thinks it was important to just have spent a ton of time, in his case six months FTE, on the research area. This helps with building useful heuristics.
It's hard or impossible and probably fruitless to just think about a research area on an extremely high level. "You have to pass the time somehow." His particular projects, for example researching moral advocacy in China, served as a way of "passing the time," so to speak.
At the same time, he thinks it is both very motivationally hard and perhaps not very sensible to work on something that's in the roughly right research area where you really can't see a direct impact case. You can end up wasting a bunch of time grinding out technical questions that have nothing much to do with anything.
Relatedly, he thinks it was really important that he continued doing some high-level thinking about bargaining alongside his more narrow projects.
He describes a common dynamic in high-level thinking: Often you get stuck on something that's conceptually tricky and just go through the same reasoning loops over and over again, spread over days, weeks, months, or years. You usually start entering the loop because you think...

May 14, 2024 • 10min
LW - How To Do Patching Fast by Joseph Miller
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: How To Do Patching Fast, published by Joseph Miller on May 14, 2024 on LessWrong.
This post outlines an efficient implementation of Edge Patching that massively outperforms common hook-based implementations. This implementation is available to use in my new library, AutoCircuit, and was first introduced by Li et al. (2023).
What is activation patching?
I introduce new terminology to clarify the distinction between different types of activation patching.
Node Patching
Node Patching (aka. "normal" activation patching) is when some activation in a neural network is altered from the value computed by the network to some other value. For example, we could run two different prompts through a language model and replace the output of Attn 1 when the model is given some input 1 with the output of the head when the model is given some other input 2.
We will use the running example of a tiny, 1-layer transformer, but this approach generalizes to any transformer and any residual network.
All the nodes downstream of Attn 1 will be affected by the patch.
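For reference, the hook-based implementations that the fast method below outperforms typically do Node Patching roughly like the following sketch. The toy model, the hook names, and the Linear layer standing in for Attn 1 are my own illustration, not the post's or AutoCircuit's code.

```python
# Minimal hook-based Node Patching sketch: cache a component's output on a corrupt
# input, then overwrite that component's output during a clean forward pass.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(4, 4), nn.ReLU(), nn.Linear(4, 2))  # toy stand-in
node = model[0]  # pretend this Linear is "Attn 1"
cache = {}

def save_hook(module, inputs, output):
    cache["attn1"] = output.detach()  # store the corrupt-input activation

def patch_hook(module, inputs, output):
    return cache["attn1"]  # returning a value replaces the module's output

x_clean, x_corrupt = torch.randn(1, 4), torch.randn(1, 4)

handle = node.register_forward_hook(save_hook)
model(x_corrupt)   # pass 1: cache the activation on the corrupt input
handle.remove()

handle = node.register_forward_hook(patch_hook)
patched_logits = model(x_clean)  # pass 2: clean input, but "Attn 1" output patched
handle.remove()
print(patched_logits)
```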
Edge Patching
If we want to make a more precise intervention, we can think about the transformer differently, to isolate the interactions between components.
Now we can patch the edge Attn 1 -> MLP and only nodes downstream of MLP will be affected (eg. Attn 1 -> Output is unchanged). Edge Patching has not been explicitly named in any prior work.
Path Patching
Path Patching refers to the intervention where an input to a path is replaced in the 'treeified' view of the model. The treeified view is a third way of thinking about the model where we separate each path from input to output. We can implement an equivalent intervention to the previous diagram as follows:
In the IOI paper, 'Path Patching' the edge Component 1 -> Component 2 means Path Patching all paths from Component 1 to Component 2 in which every intermediate component is an MLP[1]. However, it can be easy to confuse Edge Patching and Path Patching, because if we instead patch all paths that begin with the edge Component 1 -> Component 2 (allowing any components afterwards), this is equivalent to Edge Patching the edge Component 1 -> Component 2.
Edge Patching all of the edges which have some node as source is equivalent to Node Patching that node. AutoCircuit does not implement Path Patching, which is much more expensive in general. However, as explained in the appendix, Path Patching is sometimes equivalent to Edge Patching.
Fast Edge Patching
We perform two steps.
First we gather the activations that we want to patch into the model. There are many ways to do this, depending on what type of patching you want to do. If we just want to do zero ablation, then we don't even need to run the model. But let's assume we want to patch in activations from a different, corrupt input. We create a tensor, Patch Activations, to store the outputs of the source of each edge, and we write to the tensor during the forward pass. Each source component has a row in the tensor, so the shape is [n_sources, batch, seq, d_model].[2]
Now we run the forward pass in which we actually do the patching. We write the outputs of each edge source to a different tensor, Current Activations, of the same shape as Patch Activations. When we get to the input of the destination component of the edge we want to patch, we add the difference between the rows of Patch Activations and Current Activations corresponding to the edge's source component output.
This works because the difference in input to the edge destination is equal to the difference in output of the source component.[3] Now it's straightforward to extend this to patching multiple edges at once by subtracting the entire Current Activations tensor from the entire Patch Activations tensor and multiplying by a Mask tensor of shape [n_sources] that has a single value for each input edge.
By creating a Mask tensor for each destination node w...
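For concreteness, here is a minimal PyTorch sketch of the masked multi-edge step just described. The tensor names mirror the post's terminology, but the shapes, the einsum, and the helper function are my own illustration rather than AutoCircuit's actual API.

```python
# Sketch: patch all selected incoming edges of one destination component at once
# by adding the masked difference between patch and current source activations.
import torch

n_sources, batch, seq, d_model = 4, 2, 8, 16

# Gathered during the earlier forward pass on the corrupt input.
patch_acts = torch.randn(n_sources, batch, seq, d_model)
# Written during the current (patched) forward pass as each source component runs.
current_acts = torch.randn(n_sources, batch, seq, d_model)
# One mask per destination node; mask[s] = 1 patches the edge source s -> destination.
mask = torch.tensor([1.0, 0.0, 1.0, 0.0])

def patch_destination_input(dest_input, patch_acts, current_acts, mask):
    """Add the masked (patch - current) source differences to the destination's input."""
    delta = torch.einsum("s,sbld->bld", mask, patch_acts - current_acts)
    return dest_input + delta

dest_input = torch.randn(batch, seq, d_model)
patched_input = patch_destination_input(dest_input, patch_acts, current_acts, mask)
print(patched_input.shape)  # torch.Size([2, 8, 16])
```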

May 14, 2024 • 12min
LW - DandD.Sci Long War: Defender of Data-mocracy Evaluation and Ruleset by aphyer
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: D&D.Sci Long War: Defender of Data-mocracy Evaluation & Ruleset, published by aphyer on May 14, 2024 on LessWrong.
This is a follow-up to last week's D&D.Sci scenario: if you intend to play that, and haven't done so yet, you should do so now before spoiling yourself.
There is a web interactive here you can use to test your answer, and generation code available here if you're interested, or you can read on for the ruleset and scores.
RULESET
Each alien has a different amount of HP:
Alien                  HP  Threat*
Swarming Scarab         1        1
Chitinous Crawler       3        2
Voracious Venompede     5        3
Arachnoid Abomination   9        4
Towering Tyrant        15        5
*Threat has no effect on combat directly - it's a measure of how threatening Earth considers each alien to be, which scales how many soldiers they send. (The war has been getting worse - early on, Earth sent on average ~1 soldier/4 Threat of aliens, but today it's more like 1 soldier/6 Threat. The wave you're facing has 41 Threat, so Earth would send on average ~7 soldiers to it.
Earth doesn't exercise much selection with weapons, but sends soldiers in pairs such that each pair has two different weapons - this is a slight bias towards diversity.)
Each weapon has a damage it deals per shot, and a rate of fire that determines how many shots it can get off before the wielder is perforated by venomous spines/dissolved into a puddle of goo/voraciously devoured by a ravenous toothed maw:
Weapon                Damage  Min Shots  Max Shots
Macross Minigun            1          5          8
Fusion Flamethrower        1          3         12
Pulse Phaser               2          4          6
Rail Rifle                 3          3          5
Laser Lance                5          2          5
Gluon Grenades             7          2          3
Thermo-Torpedos           13          1          3
Antimatter Artillery      20          1          2
Each soldier will be able to fire a number of shots chosen randomly between Min Shots and Max Shots - for example, a soldier with a Laser Lance will have time to fire 1d4+1 shots, each doing 5 damage.
During a battle, humans roll for how many shots each weapon gets, and then attempt to allocate damage from their shots to bring down all aliens. If they succeed, the humans win - if not, the humans lose. While doing this optimally is theoretically very difficult, your soldiers are well-trained and the battles are not all that large, so your soldiers will reliably find a solution if one exists.
For example, if you are fighting two Towering Tyrants and two Swarming Scarabs using two soldiers:
If you bring one soldier with Antimatter Artillery and one with a Macross Minigun, the Minigun soldier will reliably kill the Scarabs and have 3-6 shots left over (not enough to kill a Tyrant). The Artillery soldier will get either 1 or 2 shots: half the time they will roll a 2, kill both Tyrants and you will win, while the other half they will roll a 1, a Tyrant will survive and you will lose.
You can do a little better by bringing one soldier with Antimatter Artillery and one with a Laser Lance. The Laser Lance rolls 2-5 shots - it will always kill both Scarabs, and 1/4 of the time it will roll 5 shots and also be able to kill a Tyrant (at which point you'll win even if the Antimatter Artillery rolls a 1), giving you a 5/8 winrate overall.
You can do better still by bringing one soldier with Thermo-Torpedos and one with a Pulse Phaser. The Phaser soldier gets at least 4 shots, with which they kill both Scarabs and do 2 damage to each Tyrant (dropping the Tyrants both to 13 HP). And the Torpedo soldier gets 1-3 shots, with a 2/3 chance of being able to kill both Tyrants now that they've been softened up. I believe this is the best winrate you can get in this example.
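To make these worked examples concrete, here is a small Python sketch (my own, not the scenario's generation code) that enumerates the equally likely shot-count rolls for each loadout and brute-forces whether the shots can be allocated to kill every alien; it reproduces the win rates above (1/2, 5/8, and 2/3).

```python
# Brute-force win rates for the example wave: two Towering Tyrants (15 HP) and
# two Swarming Scarabs (1 HP) against two soldiers. Stats are from the tables above.
from itertools import product

ALIENS = [15, 15, 1, 1]  # HP of each alien in the example wave

WEAPONS = {  # damage per shot, min shots, max shots
    "Macross Minigun":      (1, 5, 8),
    "Pulse Phaser":         (2, 4, 6),
    "Laser Lance":          (5, 2, 5),
    "Thermo-Torpedos":      (13, 1, 3),
    "Antimatter Artillery": (20, 1, 2),
}

def compositions(n, k):
    """All ways to split n identical shots across k aliens."""
    if k == 1:
        yield (n,)
        return
    for first in range(n + 1):
        for rest in compositions(n - first, k - 1):
            yield (first,) + rest

def can_win(soldiers, aliens=ALIENS):
    """soldiers: list of (damage, n_shots). True if some allocation kills every alien."""
    allocs_per_soldier = [list(compositions(n, len(aliens))) for _, n in soldiers]
    for allocs in product(*allocs_per_soldier):
        dealt = [0] * len(aliens)
        for (dmg, _), alloc in zip(soldiers, allocs):
            for i, shots in enumerate(alloc):
                dealt[i] += dmg * shots
        if all(d >= hp for d, hp in zip(dealt, aliens)):
            return True
    return False

def win_rate(weapon_a, weapon_b):
    """Exact win rate over all equally likely shot-count rolls."""
    (dmg_a, lo_a, hi_a), (dmg_b, lo_b, hi_b) = WEAPONS[weapon_a], WEAPONS[weapon_b]
    rolls = [(a, b) for a in range(lo_a, hi_a + 1) for b in range(lo_b, hi_b + 1)]
    wins = sum(can_win([(dmg_a, a), (dmg_b, b)]) for a, b in rolls)
    return wins / len(rolls)

for pair in [("Antimatter Artillery", "Macross Minigun"),
             ("Antimatter Artillery", "Laser Lance"),
             ("Thermo-Torpedos", "Pulse Phaser")]:
    print(pair, round(win_rate(*pair), 3))
```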
STRATEGY
The most important element of strategy was sending the right kind of weapons for each alien: high-health aliens like Tyrants are extremely inefficient to kill with light weapons like Miniguns, while small, numerous aliens like Scarabs are extremely inefficient to kill with heavy weapons like artillery.
There were a few subtler ...

May 14, 2024 • 8min
LW - Against Student Debt Cancellation From All Sides of the Political Compass by Maxwell Tabarrok
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Against Student Debt Cancellation From All Sides of the Political Compass, published by Maxwell Tabarrok on May 14, 2024 on LessWrong.
A stance against student debt cancellation doesn't rely on the assumptions of any single ideology. Strong cases against student debt cancellation can be made based on the fundamental values of any section of the political compass. In no particular order, here are some arguments against student debt cancellation from the perspectives of many disparate ideologies.
Equity and Fairness
Student debt cancellation is a massive subsidy to an already prosperous and privileged population. American college graduates have nearly double the income of high school graduates. African Americans are far underrepresented among degree holders compared to their overall population share.
Within the group of college graduates, debt cancellation increases equity, but you can't get around the fact that 72% of African Americans have no student debt because they never went to college. The tax base for debt cancellation will mostly come from rich white college graduates, but most of the money will go to … rich white college graduates.
Taxing the rich to give to the slightly-less-rich doesn't have the same Robin Hood ring but might still slightly improve equity and fairness relative to the status quo, except for the fact that it will trade off with far more important programs. Student debt cancellation will cost several hundred billion dollars at least, perhaps up to a trillion dollars or around 4% of GDP. That's more than defense spending, R&D spending, more than Medicaid and Medicare, and almost as much as social security spending.
A trillion-dollar transfer from the top 10% to the top 20% doesn't move the needle much on equity but it does move the needle a lot on budgetary and political constraints. We should be spending these resources on those truly in need, not the people who already have the immense privilege of an American college degree.
Effective Altruism
The effective altruist critique of student debt cancellations is similar to the one based on equity and fairness, but with much more focus on global interventions as an alternative way to spend the money.
Grading student debt cancellation on impact, tractability, and neglectedness, it scores very poorly. Mostly because of tiny impact compared to the most effective charitable interventions. Giving tens of thousands of dollars to people who already have high incomes, live in the most prosperous country on earth, and face little risk of death from poverty or disease is so wasteful that it borders on criminal on some views of moral obligations.
It is letting tens of millions of children drown (or die from malaria) because you don't want to get your suit wet saving them.
Saving a life costs $5,000, cancelling student debt costs $500 billion, you do the math.
Student Debt Crisis
If what you really care about is stemming the ill-effects of large and growing student debt, debt cancellation is a terrible policy. If you want people to consume less of something, the last thing you should do is subsidize people who consume that thing.
But that's exactly what debt cancellation does: It is a massive subsidy on student debt. Going forward, the legal precedent and political one-upmanship will make future cancellations more likely, so students will be willing to take more debt, study less remunerative majors, and universities will raise their prices in response.
Helping those who are already saddled with student debt by pushing future generations further into it is not the right way out of this problem.
Fiscal Conservativism
Student debt cancellation is expensive. Several hundred billion dollars has already been spent and several hundred billion more are proposed. This will mostly be financed through debt, especially si...