Data Engineering Podcast cover image

Data Engineering Podcast

Build A Data Lake For Your Security Logs With Scanner

Jan 29, 2024
Learn about Scanner, a fast querying platform for security log data. Discover the challenges of managing data lakes and the benefits of using a search index. Explore the design philosophies of the Scanner platform and its integration into security log analysis workflows. Understand the indexing strategies for variegated data and the importance of regulatory compliance and data security. Also, find out about the need for better visibility and queryability in data management.
01:02:38

Podcast summary created with Snipd AI

Quick takeaways

  • Scanner enables fast querying of high scale log data for security auditing.
  • Scanner leverages AWS S3 for storing log data, allowing for efficient ad hoc searches and cross-correlations.

Deep dives

Scanner: An Efficient Security Data Lake Platform

Scanner is a security data lake platform that offers fast and cost-effective analysis of security logs. It was created to tackle the challenges faced by security teams in managing and searching through massive amounts of log data. Scanner enables users to build correlations and relationships between different log sources, allowing for in-depth investigations and threat detection. It indexes the content of logs stored in AWS S3, making it easy to search and explore logs that are often in JSON format. The platform's serverless architecture leverages AWS ECS Fargate for indexing compute, providing scalability and agility. Scanner focuses on decoupling storage and compute, ensuring that user data remains under their control in their own S3 buckets. It offers a user-friendly interface that allows for iterative and collaborative investigations, combining search results from multiple log sources into a single view.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner