Software Engineering Radio - the podcast for professional software developers

Episode 538: Roberto Di Cosmo on Archiving Public Software at Massive Scale

Nov 18, 2022
Chapters
1
Introduction
00:00 • 2min
2
What Is Software Heritage?
01:56 • 2min
3
How to Analyze Open Source Software?
03:38 • 4min
4
What Is the Point of an Archive?
07:09 • 3min
5
How Do We Archive Software?
09:47 • 4min
6
Do You Have a DSL (Domain-Specific Language)?
13:46 • 2min
7
What's the Standard for Software Heritage Identifiers?
15:28 • 3min
8
How Do You Fund a Data Center?
18:29 • 5min
9
How Do You Archive GitHub Projects?
23:04 • 3min
10
Do You Have a Special Agreement With Fargis?
26:13 • 2min
11
How Do You Preserve Your Data?
28:38 • 4min
12
How Do You Handle Corruption in Your Data Center?
32:23 • 2min
13
Object Storage - The Core and Architecture
34:42 • 4min
14
What Is the Graph of a File System?
39:00 • 2min
15
How to Scale Up a Project on Gitadra
40:38 • 2min
16
The Cryptographic Hash
43:01 • 6min
17
Using Cryptographic Checksums to Identify the Status of a Project
48:50 • 2min
18
How to Compute a Tree-Putashi Hash
51:19 • 5min
19
How Do You Make a Graph of Files?
56:35 • 2min
20
Is SHA-1 a Good Hash?
58:18 • 3min
21
What Is the Software Heritage Identifier?
01:00:54 • 3min
22
How Long Will Our Good Star, Our Luck, Last?
01:04:02 • 3min
23
What Is Software Heritage?
01:06:34 • 3min
24
Software Engineering Radio Interview With Gavin Henry
01:09:47 • 2min