Loading organizations...

§ Private Profile · Palo Alto, CA, USA
Open-source AI data catalog and data context management platform, transforming enterprise data into trusted context with metadata.
DataHub is a Palo Alto, California-based enterprise software organization that provides an artificial intelligence and data context management platform designed to transform corporate data into trusted metadata. The platform enables data analysts, software engineers, and machine learning models to securely access, manage, and utilize data assets across complex organizational infrastructures. Built upon the leading open-source artificial intelligence data catalog with over 9,000 GitHub stars, the core technology originally emerged from internal compliance and metrics standardization solutions. The enterprise architecture draws direct inspiration from the proprietary data systems utilized by global technology corporations such as LinkedIn and Airbnb to manage their internal analytics, governance, and compliance requirements. The commercial entity driving the open-source software project was founded in 2020 by former LinkedIn data architecture lead Shirshanka Das and former Airbnb data engineering leader Swaroop Jagadish.
DataHub has raised $65.0M across 3 funding rounds.
DataHub has raised $65.0M in total across 3 funding rounds.
DataHub is a San Francisco-based technology company that builds an open-source metadata platform for modern data ecosystems, enabling organizations to manage, govern, and scale diverse AI and data assets.[1][3][4] Its core product, DataHub Core, is the leading open-source metadata platform used for data discovery, governance, and observability, while DataHub Cloud offers an enterprise SaaS solution with AI-powered features to accelerate time-to-value from data investments and ensure AI reliability.[1][3] Serving enterprises like LinkedIn and Airbnb (via founder contributions), DataHub solves data chaos by providing comprehensive context through a metadata graph, supporting analysts, engineers, and AI models with scalability and extensibility backed by a 13,000+ member open-source community.[1][4] The company, founded in 2020 with 100+ employees, demonstrates strong growth through widespread adoption, including high PyPI downloads and global community engagement.[3][4]
DataHub emerged from real-world crises at major tech companies, where founders Shirshanka Das and Swaroop Jagadish recognized metadata's critical role.[1] Das led the development of a high-performance metadata platform at LinkedIn to address GDPR compliance challenges, while Jagadish built Airbnb’s DataPortal to standardize business metrics ahead of its IPO.[1] Founded in 2020 and headquartered at 156 2nd St, San Francisco, CA, DataHub open-sourced these innovations to advance metadata management, quickly gaining traction as the #1 open-source platform with a thriving global community of data practitioners and engineers.[1][3][4] Early pivotal moments included joint innovations with the community, evolving into an AI & Data Context Platform that carries forward lessons from LinkedIn and Airbnb to tackle enterprise-wide data governance.[1]
DataHub stands out in the data management space through these key strengths:
DataHub rides the explosive growth of AI and data ecosystems, where organizations grapple with "data chaos" amid surging AI adoption, complex multi-tool stacks, and governance demands like GDPR.[1][3] Its timing is ideal: as enterprises scale AI models requiring reliable, contextual data, DataHub's metadata platform bridges AI and data management, enabling safer AI deployment and faster insights—critical in a market projected to prioritize unified governance.[1] Market forces like open-source momentum, rising data volumes, and AI reliability needs favor DataHub, positioning it as a linchpin that influences the ecosystem by standardizing metadata practices, fostering community-driven innovation, and powering tools from discovery to observability across industries.[1][3][4]
DataHub is poised for accelerated expansion as AI-data integration becomes table stakes, with DataHub Cloud likely driving SaaS revenue growth through enterprise wins and global scaling from its San Francisco hub.[1][3][4] Trends like agentic AI, multimodal data governance, and zero-trust data security will shape its trajectory, amplifying demand for its metadata graph to ensure trustworthy AI outputs. Its influence may evolve from open-source leader to dominant platform player, potentially via acquisitions or deeper ecosystem integrations, solidifying its role in taming data chaos for the AI era—echoing the foundational crises that birthed it at LinkedIn and Airbnb.[1]
DataHub has raised $65.0M across 3 funding rounds. Most recently, it raised $35.0M Series B in May 2025.
| Date | Round | Lead Investors | Other Investors | Status |
|---|---|---|---|---|
| May 1, 2025 | $35M Series B | Bessemer Venture Partners | 8VC, Insight Partners, Sherpalo Ventures, 8VC, Data Community Fund, IN Q TEL, SineWave Ventures, TRU Arrow Partners | Announced |
| Jun 1, 2023 | $21M Series A | — | 8VC, AIX Ventures, ALT Capital, Audrey Capital, Bessemer Venture Partners, Bloomberg Beta, C2 Investment, Decibel Partners, Flex Capital, Insight Partners, Saga, Sherpalo Ventures, The HIT Forge, Akshay Kothari, Amjad Masad, BEN Silbermann, BIZ Stone, BOB Young, Girish Mathrubootham, Guillermo Rauch, Jeff Hammerbacher, Spencer Kimball | Announced |
| Jun 1, 2021 | $9M Seed | — | 8VC, Bessemer Venture Partners, Insight Partners, Sherpalo Ventures | Announced |
DataHub has raised $65.0M in total across 3 funding rounds.
DataHub's investors include Bessemer Venture Partners, 8VC, Insight Partners, Sherpalo Ventures, Data Community Fund, In-Q-Tel, SineWave Ventures, Tru Arrow Partners, AIX Ventures, Alt Capital, Audrey Capital, Bloomberg Beta.