Amundsen Quiz Answers Explained
- The ABC’s of Metadata
Application Context: information needed by humans or applications to operate. This includes the existence of the data, plus the descriptions, semantics, and tags associated with it.
Behavior: information about how the data is created and used over time. This includes ownership, creation, common usage patterns, the people or processes that are frequent users, and provenance and lineage.
Change: information about how the data changes over time. This captures the evolution of the data (for example, schema evolution for a table) and of the processes that create it (for example, the related ETL code for a table).
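To make the three categories concrete, here is a minimal sketch of how they might be modeled as a data structure. All class and field names (`TableMetadata`, `upstream_tables`, `schema_versions`, and so on) and the sample values are hypothetical illustrations, not Amundsen's actual data model.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ApplicationContext:
    """A: what the data is (description, semantics, tags)."""
    description: str = ""
    tags: List[str] = field(default_factory=list)


@dataclass
class Behavior:
    """B: how the data is created and used over time."""
    owners: List[str] = field(default_factory=list)
    frequent_users: List[str] = field(default_factory=list)
    upstream_tables: List[str] = field(default_factory=list)  # lineage


@dataclass
class Change:
    """C: how the data and the processes that create it evolve."""
    # Each entry is the list of column names at one point in time.
    schema_versions: List[List[str]] = field(default_factory=list)
    etl_code_refs: List[str] = field(default_factory=list)


@dataclass
class TableMetadata:
    """Hypothetical container grouping the ABCs for one table."""
    name: str
    context: ApplicationContext = field(default_factory=ApplicationContext)
    behavior: Behavior = field(default_factory=Behavior)
    change: Change = field(default_factory=Change)


def added_columns(change: Change) -> List[str]:
    """Columns present in the latest schema version but not in the first."""
    if len(change.schema_versions) < 2:
        return []
    first, latest = change.schema_versions[0], change.schema_versions[-1]
    return [col for col in latest if col not in first]


# Illustrative values only.
rides = TableMetadata(
    name="rides",
    context=ApplicationContext(description="One row per ride.", tags=["core"]),
    behavior=Behavior(owners=["data-platform"], upstream_tables=["raw_events"]),
    change=Change(schema_versions=[["id", "fare"], ["id", "fare", "tip"]]),
)
new_columns = added_columns(rides.change)
```

Keeping the three categories as separate objects mirrors the definition above: context can be edited by people, while behavior and change would typically be collected automatically.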
- A 10x reduction in data discovery time
How was Amundsen able to achieve such impressive results? Amundsen sets its priorities to get the best of both worlds: it provides a discovery (aka search) system, and it follows best practices for the names and descriptions of schemas, tables, and fields. Together, these two things make data discovery much faster.
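The toy example below illustrates why search and good naming reinforce each other. It is a sketch under stated assumptions: the `search` function, the in-memory `catalog`, and the table names are all hypothetical, and simple substring matching stands in for a real search engine. It does not reflect how Amundsen's search service is implemented.

```python
from typing import Dict, List


def search(catalog: Dict[str, str], query: str) -> List[str]:
    """Rank table names by how many query terms appear in the name or description."""
    terms = query.lower().split()
    scored = []
    for name, description in catalog.items():
        text = f"{name} {description}".lower()
        score = sum(1 for term in terms if term in text)
        if score > 0:
            scored.append((score, name))
    # Highest score first; ties are broken alphabetically by name.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in scored]


# Hypothetical catalog: table name -> description.
catalog = {
    "rides_completed_daily": "Daily count of completed rides per city.",
    "tbl_x17": "",  # poorly named and undescribed
    "driver_signups": "New driver signups per week.",
}
results = search(catalog, "completed rides")
```

Here `tbl_x17` can never be returned for any meaningful query, whatever it contains, because neither its name nor its description carries searchable text.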
- Amundsen’s three key pillars: an augmented data graph, an intuitive user experience, and centralized metadata.
Amundsen’s metadata service provides metadata about data resources. This metadata is displayed in the front-end service’s web application and is also used by other services at Lyft. The metadata service currently supports resources. An intuitive user experience was essential to enable both discovery and easy categorization and feedback. These factors directly affect Amundsen’s efficacy.