GMA is the backend infrastructure for DataHub. Unlike existing architectures, GMA leverages multiple storage technologies to efficiently service the four most commonly used query patterns
- Document-oriented CRUD
- Complex queries (including joining distributed tables)
- Graph traversal
- Fulltext search and autocomplete
GMA also embraces a distributed model, where each team owns, develops and operates their own metadata services (known as GMS), while the metadata are automatically aggregated to populate the central metadata graph and search indexes. This is made possible by standardizing the metadata models and the access layer.
We strongly believe that GMA can bring tremendous leverage to any team that has a need to store and access metadata. Moreover, standardizing metadata modeling promotes a model-first approach to developments, resulting in a more concise, consistent, and highly connected metadata ecosystem that benefits all DataHub users.