The MapScaping Podcast - GIS, Geospatial, Remote Sensing, earth observation and digital geography

Unstructured Data Is Dark Data

Jun 29, 2022
This episode explores unstructured data and its various types, the importance of human oversight in object detection models, and ways to analyze unstructured data through sentiment analysis and knowledge graphs. It also covers edge computing and its impact on metadata management, the challenges of continuous training and data validation, the difficulties of managing unstructured data, and the potential of integrating with various APIs for data analysis.
41:26

Podcast summary created with Snipd AI

Quick takeaways

  • Managing unstructured data and extracting insights from it requires metadata at three levels: first, second, and third order.
  • Edge computing, which brings compute resources closer to the data source, is crucial for managing and processing unstructured data efficiently.

Deep dives

Unstructured Data and its Importance

Unstructured data covers many types of files, such as imagery, audio, 3D models, documents, and emails, and it is both extensive and valuable. Despite the name, unstructured data does have a certain level of structure, with known schemas and file formats. The term "unstructured" mainly distinguishes it from the structured data handled by modern data stacks.

Metadata plays a crucial role in managing unstructured data. First-order metadata is the basic metadata obtained directly from file headers, which gives initial information about the file contents. Second-order metadata comes from reading the actual data within the file, for example by performing object detection on an image or extracting terms from a document. Third-order metadata refers to inferences and contextualization, where connections are made between different datasets and databases. Machine learning and knowledge graphs are often used to reach these higher levels of metadata.

The ability to generate insights and link data from unstructured sources is of great interest, with applications in industries such as geospatial, media, and property inspection.
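To make the three orders of metadata more concrete, here is a minimal Python sketch. It is an illustration only and not the method described in the episode: the function names, the sample file names, and the sample text are all hypothetical. First-order metadata is read from a PNG header, second-order metadata is a simple set of terms taken from document text (a stand-in for heavier techniques such as object detection), and third-order metadata is a toy "knowledge graph" that links files sharing terms.

```python
import re
import struct
from itertools import combinations

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def first_order_png(header: bytes) -> dict:
    """First order: read width and height straight from the PNG IHDR header."""
    if header[:8] != PNG_SIGNATURE or header[12:16] != b"IHDR":
        raise ValueError("not a PNG header")
    # IHDR stores width and height as big-endian 32-bit integers.
    width, height = struct.unpack(">II", header[16:24])
    return {"format": "png", "width": width, "height": height}


def second_order_terms(text: str) -> set[str]:
    """Second order: read the content itself and collect terms of 4+ letters."""
    return set(re.findall(r"[a-z]{4,}", text.lower()))


def third_order_links(terms_by_file: dict[str, set[str]]) -> dict[tuple[str, str], set[str]]:
    """Third order: connect files that share terms (a toy knowledge graph)."""
    links = {}
    for a, b in combinations(sorted(terms_by_file), 2):
        shared = terms_by_file[a] & terms_by_file[b]
        if shared:
            links[(a, b)] = shared
    return links


# Hypothetical sample inputs, built in memory so the sketch runs without files.
png_header = PNG_SIGNATURE + struct.pack(">I4sII", 13, b"IHDR", 640, 480)
docs = {
    "inspection_a.txt": "Roof damage found. Roof tiles cracked near the chimney.",
    "inspection_b.txt": "Chimney repaired. Chimney flashing replaced on the roof.",
}

image_metadata = first_order_png(png_header)
terms_by_file = {name: second_order_terms(text) for name, text in docs.items()}
links = third_order_links(terms_by_file)

print(image_metadata)
print(links)
```

Each step needs more work than the one before it: the header read touches only a few bytes, the term extraction reads the whole file, and the linking step needs results from several files at once. A real pipeline would likely replace the term set with a trained model and the dictionary of links with a proper graph store.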
