

Episode 538: Roberto Di Cosmo on Archiving Public Software at Massive Scale
Nov 18, 2022
Chapters
Introduction
00:00 • 2min
What Is Software Heritage?
01:56 • 2min
How to Analyze Open Source Software?
03:38 • 4min
What Is the Point of an Archive?
07:09 • 3min
How Do We Archive Software?
09:47 • 4min
Do You Have a DSL (Domain-Specific Language)?
13:46 • 2min
What's the Standard of the Software Heritage Identifiers?
15:28 • 3min
How Do You Fund a Data Center?
18:29 • 5min
How Do You Archive GitHub Projects?
23:04 • 3min
Do You Have a Special Agreement With Forges?
26:13 • 2min
How Do You Preserve Your Data?
28:38 • 4min
How Do You Handle a Corruption in Your Data Center?
32:23 • 2min
Object Storage - The Core and Architecture
34:42 • 4min
What Is the Graph of a File System?
39:00 • 2min
How to Scale Up a Project on Gitadra
40:38 • 2min
The Cryptographic Hash
43:01 • 6min
Using Cryptographic Checksums to Identify the Status of a Project
48:50 • 2min
How to Compute a Tree-Putashi Hash
51:19 • 5min
How Do You Make a Graph of Files?
56:35 • 2min
Is SHA-1 a Good Hash?
58:18 • 3min
What Is the Software Heritage Identifier?
01:00:54 • 3min
How Long Will Our Good Star, Our Luck Stay?
01:04:02 • 3min
What Is Software Heritage?
01:06:34 • 3min
Software Engineering Radio Interview With Gavin Henry
01:09:47 • 2min