

The GeekNarrator
Kaivalya Apte
The GeekNarrator podcast is a show hosted by Kaivalya Apte who is a Software Engineer and loves to talk about Technology, Technical Interviews, Self Improvement, Best Practices and Hustle.
Connect with Kaivalya Apte https://www.linkedin.com/in/kaivalya-apte-2217221a
Tech blogs: https://kaivalya-apte.medium.com/
Wanna talk? Book a slot here: https://calendly.com/speakwithkv/hey
Enjoy the show and please follow to get more updates. Also please don’t forget to rate and review the show.
Cheers
Connect with Kaivalya Apte https://www.linkedin.com/in/kaivalya-apte-2217221a
Tech blogs: https://kaivalya-apte.medium.com/
Wanna talk? Book a slot here: https://calendly.com/speakwithkv/hey
Enjoy the show and please follow to get more updates. Also please don’t forget to rate and review the show.
Cheers
Episodes
Mentioned books

Jul 19, 2024 • 1h 18min
Testing Distributed Systems the right way ft. Will Wilson
Will Wilson, Engineer and co-founder of Antithesis, dives deep into the world of deterministic simulation testing for distributed systems. He breaks down the limitations of traditional methods, showcasing how his company's approach improves software reliability. Key discussions include optimizing bug detection strategies, the significance of simulated workloads, and the challenges posed by third-party APIs. Real-world examples like chat applications illustrate how effective testing can reveal hidden issues, making this an essential listen for tech enthusiasts and developers.

13 snips
Jul 19, 2024 • 1h 5min
Turso - SQLite for production
Glauber Costa, founder and CEO of TursoDB, brings his background in Linux kernel development to the forefront as he discusses enhancing SQLite for production use. He shares insights on native replication, schema management, and vector search capabilities. Glauber navigates the challenges of transitioning from unikernel systems and highlights the importance of multi-tenancy. Listeners will also learn about TursoDB's user-friendly design and its future roadmap, including features aimed at improving database simplicity and efficiency.

Jul 19, 2024 • 51min
Taking Postgres to the next level with Neon
Deep Dive into Serverless Databases with Neon: Featuring Heikki Linnakangas
In this episode of the Geek Narrator podcast, host Kaivalya Apte is joined by Heikki Linnakangas, co-founder of Neon, to explore the innovative world of serverless databases. They discuss Neon's unique approach to separating compute and storage, the benefits of serverless architecture for modern applications, and dive into various compelling use cases. They also cover Neon's architectural features like branching, auto-scaling, and auto-suspend, making it a powerful tool for both developers and enterprises. Whether you're curious about multi-tenancy, fault tolerance, or developer productivity, this episode offers insightful knowledge about leveraging Neon's capabilities for your next project.
00:00 Introduction
00:53 The Birth of Neon: Why It Was Created
02:16 Understanding Serverless Databases
07:06 Neon's Architecture: Separation of Compute and Storage
09:59 Exploring Branching in Neon
18:21 Auto Scaling and Handling Spikes in Traffic
20:17 The Challenge of Multiple Writers in Distributed Systems
22:51 Auto Suspend: Cost-Effective Database Management
26:02 Optimizing Cold Start Times
27:14 Balancing Cost and Performance
28:52 Replication and Durability
30:32 Understanding the Storage Layer
34:02 Custom LSM Tree Implementation
36:21 Fault Tolerance and Failover
07:00 Developer Productivity and Use Cases
42:56 Migration and Tooling
48:35 Future Roadmap and User Experience
50:28 Conclusion and Final Thoughts
Neon website: https://neon.tech/
Follow me on Linkedin and Twitter: https://www.linkedin.com/in/kaivalyaapte/ and https://twitter.com/thegeeknarrator
If you like this episode, please hit the like button and share it with your network.
Also please subscribe if you haven't yet.
Database internals series: https://youtu.be/yV_Zp0Mi3xs
Popular playlists:
Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-
Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17
Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_d
Modern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsN
Stay Curios! Keep Learning!
#PostgreSQL #SQL #RDBMS #NEON

Jun 5, 2024 • 1h 13min
Scaling Derived Data for Planet-Scale Applications at Linkedin
Felix GV, Principal Staff Engineer at Linkedin, discusses the evolution from Voldemort to VeniceDB, challenges with derived data, planet-scale applications, writing and merging data, architectural components, fault tolerance, recommender systems, versioning, integration with Kafka, and future improvements.

Jun 5, 2024 • 47min
SuperCharging PostgreSQL for Search and Analytics - ParadeDB (Philippe Noël)
In this video I speak with Philippe Noël, about ParadeDB, which is an Elasticsearch alternative built on Postgres, modernizing the features of Elasticsearch's product suite, starting with real-time search and analytics.
I hope you will enjoy and learn about the product.
Chapters:
00:00 Introduction
01:12 Challenges with Elasticsearch and the Need for ParadeDB
02:29 Why Postgres?
06:30 Technical Details of ParadeDB's Search Functionality
18:25 Analytics Capabilities of ParadeDB
24:00 Understanding ParadeDB Queries and Transactions
24:22 Application Logic and Data Workflows
25:14 Using PG Cron for Data Migration
30:05 Scaling Reads and Writes in Postgres
31:53 High Availability and Distributed Systems
34:31 Isolation of Workloads
39:38 Database Upgrades and Migrations
41:21 Using ParadeDB Extensions and Distributions
43:02 Observability and Monitoring
44:42 Upcoming Features and Roadmap
46:34 Final Thoughts
Important links:
Links:
GitHub: https://github.com/paradedb/paradedb
Website: https://paradedb.com
Docs: https://docs.paradedb.com/
Blog: https://blog.paradedb.com
Follow me on Linkedin and Twitter: https://www.linkedin.com/in/kaivalyaapte/ and https://twitter.com/thegeeknarrator
If you like this episode, please hit the like button and share it with your network.
Also please subscribe if you haven't yet.
Database internals series: https://youtu.be/yV_Zp0Mi3xs
Popular playlists:
Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-
Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17
Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_d
Modern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsN
Stay Curios! Keep Learning!
#postgresql #datafusion #parquet #sql #OLAP #apachearrow #database #systemdesign #elasticsearch

15 snips
Jun 5, 2024 • 57min
Modern OLAP Database System Design with FDAP (Andrew Lamb)
Andrew Lamb, Staff Software Engineer at InfluxDB and chair of the Apache Data Fusion project, shares his expertise on modern OLAP database design. He explains the power of the FDAP stack, highlighting how Apache Parquet and Arrow enhance data storage and retrieval efficiency. The conversation delves into the challenges of data immutability and management, while also discussing Flight's role in simplifying data transfer. Looking ahead, Andrew envisions evolving trends in database technologies, paving the way for innovative solutions in analytics.

Jun 5, 2024 • 46min
The ultimate multi-model Database, SurrealDB with Pratim Bhosale
In this video I and Pratim Bhosale, Developer Advocate at SurrealDB, talk about SurrealDB, a multi-model database which aims to make Developer’s life easier by letting them focus mainly on the business logic and not on the Database choice. Following chapters will help you understand what is a multi-model database and how SurrealDB shines.
Chapters:
00:00 Introduction
01:48 The Genesis of SurrealDB
03:59 SurrealDB's Mission and Use Cases
07:34 Understanding Multi-Model Databases
10:30 Deep Dive into SurrealDB's Architecture
33:09 Deployment and Getting Started with SurrealDB
34:31 Future Developments and Use Case Considerations
43:51 Final Thoughts and How to Get Started
Important links:
Install SurrealDB
https://sdb.li/4bqwn38
SurrealDB Docs:
https://sdb.li/3wxjoxx
SurrealDB Website:
https://sdb.li/3JMK7JI
Surrealist:
https://sdb.li/4b7wcdh
SurrealDB GitHub:
https://sdb.li/3JRPNlE
Follow me on Linkedin and Twitter: https://www.linkedin.com/in/kaivalyaapte/ and https://twitter.com/thegeeknarrator
If you like this episode, please hit the like button and share it with your network.
Also please subscribe if you haven't yet.
Database internals series: https://youtu.be/yV_Zp0Mi3xs
Popular playlists:
Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-
Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17
Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_d
Modern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsN
Stay Curios! Keep Learning!
#surrealdb #elasticsearch #search #vectorsearch #acid #databases #sql #joins #indexes #graphdatabase

May 17, 2024 • 1h 15min
Demystifying Real-time Analytics, Search and Hybrid Search with Dhruba, CTO @Rockset
In this video, I talk to Dhruba, CTO @Rockset about search and realtime analytics. We discussed deep internals of Rockset, its architecture and why is it a great fit for search and realtime analytics use cases.
Chapters:
00:00 Introduction
02:45 The Evolution of Data Systems: From Hadoop to Rockset
07:30 Understanding Rockset: Real-Time Analytics and Search Defined
12:01 The Technical Edge: Rockset vs. Elasticsearch
18:16 Deep Dive into Rockset's Architecture and Internals
28:21 Partitioning, Hashing, and Data Distribution in Rockset
36:56 Exploring Hot Storage and Cache Layers
37:40 Why Hot Storage is Essential for Low Latency
39:05 Optimizing Data Storage with Compression and Delta Encoding
39:49 Balancing Cost and Performance in Data Storage
41:50 The Power of Converged Indexing in Rockset
45:50 Efficient Query Execution and Index Management
54:51 Leveraging Mutability for Real-Time Analytics
59:24 Deep Dive into Query Processing and Optimization
01:04:21 Understanding Joins and Reporting Queries in Rockset
01:12:23 Future Directions and Vector Search Innovations
Index Conference: https://rockset.com/index-conf/
Follow me on Linkedin and Twitter: https://www.linkedin.com/in/kaivalyaapte/ and https://twitter.com/thegeeknarrator
If you like this episode, please hit the like button and share it with your network.
Also please subscribe if you haven't yet.
Database internals series: https://youtu.be/yV_Zp0Mi3xs
Popular playlists:
Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-
Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17
Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_d
Modern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsN
Stay Curios! Keep Learning!
#rockset #elasticsearch #search #vectorsearch #realtime #databases #sql #joins #indexes

May 17, 2024 • 47min
Rapidly Simulate Production Traffic ft. Michael Drogalis
In this episode we explore how to Rapidly Simulate Production Traffic with Michael Drogalis, using his creation ShadowTraffic. I am sure you will be able to relate to all the different problems mentioned in this episode and like how ShadowTraffic aims to solve those problems.
I hope you like this conversation.
Chapters:
00:00 Welcome to The Geek Narrator Podcast: Exploring Deep Tech
00:18 The Challenge of Simulating Production Traffic
00:59 Introducing Shadow Traffic: A Solution to Data Simulation
02:34 Understanding the Problem Space of Data Simulation
06:03 How Shadow Traffic Works: A Deep Dive
08:17 The Power of Declarative Data Generation with Shadow Traffic
10:40 Shadow Traffic's Architecture and Deployment
13:02 Configuring Load Testing and Throttling with Shadow Traffic
15:47 Testing and Validation in Shadow Traffic
20:42 Mimicking Production Data Distribution with Shadow Traffic
26:48 Innovative Features for Stream Processing Testing
28:47 Shadow Traffic: Adding Faults to Data for Robust Testing
29:04 Antithesis and Shadow Traffic: A Synergistic Approach
32:46 The Challenge of Generating Realistic Test Data
40:04 Enhancing Observability in Data Generation
41:50 Customer-Driven Roadmap and Future Vision
45:27 Closing Thoughts
ShadowTraffic: https://shadowtraffic.io/
Contact Michael: https://shadowtraffic.io/contact.html
Follow me on Linkedin and Twitter: https://www.linkedin.com/in/kaivalyaapte/ and https://twitter.com/thegeeknarrator
If you like this episode, please hit the like button and share it with your network.
Also please subscribe if you haven't yet.
Database internals series: https://youtu.be/yV_Zp0Mi3xs
Popular playlists:
Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-
Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17
Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_d
Modern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsN
Stay Curios! Keep Learning!
#kafka #s3 #postgres #testing #streamprocessing #loadtesting #chaostesting #demo

May 17, 2024 • 52min
High Performance with GraalVM - Alina Yurenko
If you're involved in the Java space, chances are you've come across #GraalVM. And for those active in the tech community, you might have heard about the recent 1BRC challenge initiated by Gunnar Morling.
GraalVM truly showcased its capabilities in this challenge, sparking my curiosity. That's why I reached out to Alina to delve deeper into GraalVM, exploring its features and uncovering how it excels in such endeavors. And here we are talking about GraalVM
Chapters:
00:00 Introduction
01:47 GraalVM's Impact on the 1BRC Challenge and Its Features
04:34 Exploring GraalVM's Core Features and Benefits
08:34 Real-World Success Stories: GraalVM in Action
16:18 Understanding Native Image Compilation with GraalVM
20:34 Framework Compatibility and GraalVM Integration
25:04 Testing and Integration with GraalVM
25:26 Exploring Testing and Development with GraalVM
25:58 Best Practices for Developing with GraalVM
28:11 Migrating to GraalVM: Strategies and Considerations
31:25 Performance Optimization in GraalVM
35:15 Building and Resource Considerations for GraalVM
38:45 Expanding Horizons: Polyglot Programming with GraalVM
43:15 Future Directions and Limitations of GraalVM
47:40 Engaging the Java Community: GraalVM's Impact
50:21 Getting Started with GraalVM: Resources and Recommendations
References and Links:
- The GraalVM website with docs, downloads, guides: https://www.graalvm.org/
- Nicolai Parlog's "Modern Java in Action" demo: https://github.com/nipafx/modern-java-demo
- My native version of Nicolai's demo: https://github.com/alina-yur/native-modern-java-demo
- For news, follow GraalVM: https://twitter.com/graalvm
Follow me on Linkedin and Twitter: https://www.linkedin.com/in/kaivalyaapte/ and https://twitter.com/thegeeknarrator
If you like this episode, please hit the like button and share it with your network.
Also please subscribe if you haven't yet.
Database internals series: https://youtu.be/yV_Zp0Mi3xs
Popular playlists:
Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-
Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17
Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_d
Modern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsN
Stay Curios! Keep Learning!
#Java #jvm #graalvm #highperformance #JITcompiler #AOT #nativeimage #security #rust #c++