Loading organizations...

§ Private Profile · San Francisco, CA, USA
Audio data research company curating conversational speech datasets for AI model training, speech recognition, and translation.
David AI has raised $80.5M across 4 funding rounds.
Key people at David AI.
David AI was founded in 2024 by Tomer Cohen (Founder) and Ben Wiley (Founder).
David AI has raised $80.5M in total across 4 funding rounds.
David AI is an audio data research company that designs, collects, and curates high-quality conversational speech datasets for training advanced artificial intelligence models. The enterprise operates a business-to-business licensing model, selling perpetual or time-limited access to proprietary audio datasets that are recorded, cleaned, and annotated with rich metadata. Its client base includes Fortune 100 corporations, leading speech-to-speech research laboratories, and the majority of the Magnificent Seven technology companies. The organization currently operates with approximately ten employees and has secured significant venture capital, including a $25 million Series A round led by Alt Capital and a subsequent $50 million Series B financing in late 2025. Generating an annual revenue run rate exceeding $8 million across its various licensing tiers, David AI was officially established in 2024 by Ben Wiley and former Scale AI executive Tomer Cohen.
David AI was founded in 2024 by Tomer Cohen (Founder) and Ben Wiley (Founder).
David AI has raised $80.5M in total across 4 funding rounds.
David AI's investors include Meritech Capital Partners, NVIDIA, 50 Partners, Alt Capital, Amplify Partners, Boldstart Ventures, Footwork, Insight Partners, Otherwise Fund, Qiming Venture Partners, Tectonic Capital, Y Combinator.
Key people at David AI.
David AI is an audio data research company specializing in creating high-quality, rigorously designed audio datasets that power speech recognition, translation, synthesis, and conversational AI models. Their datasets serve Fortune 100 companies and leading AI research labs, enabling advancements in voice-based AI capabilities. The company builds proprietary datasets such as *Converse* (English two-speaker conversations), *Atlas* (multilingual data across 15+ languages), *Chorus* (multi-speaker conversations), and *Dialog* (expert domain conversations), addressing complex audio AI challenges like speaker separation and multilingual understanding[1][2].
Founded in 2024, David AI’s mission is to bring AI into the real world through voice, the most natural human interface. They serve AI developers, researchers, and enterprises needing robust audio data to train and improve their models. By focusing on data quality and iterative dataset design, David AI solves the critical problem of insufficient, low-quality audio training data, which is a bottleneck for advancing audio AI technologies. The company has demonstrated strong growth momentum, raising $50 million by late 2025 and expanding its team and product offerings rapidly[1][2].
David AI was founded in 2024 by Tomer Cohen and Ben Wiley. Tomer Cohen, the CEO, previously served as Chief of Staff at Scale AI and worked as a consultant at McKinsey & Company. Ben Wiley, CTO, led engineering for Scale AI’s Public Sector GenAI Platform and was formerly a software engineer at Microsoft. Their combined experience in AI infrastructure and engineering shaped the company’s R&D-driven approach to audio data[2].
The idea emerged from recognizing a gap in the AI ecosystem: while model architectures advanced rapidly, the availability of high-quality, well-structured audio datasets lagged behind. Early traction came from partnerships with top AI labs and Fortune 100 companies relying on David AI’s datasets to improve their speech and conversational AI systems. The company’s rigorous data design and iterative collection process quickly distinguished it in the market[1][2].
David AI rides the growing trend of voice and conversational AI becoming central to human-computer interaction. As AI models become more capable, the demand for large-scale, high-quality audio datasets grows sharply. The timing is crucial because advances in generative AI and multilingual models require diverse and nuanced audio data to improve accuracy and naturalness. Market forces such as increased adoption of voice assistants, multilingual applications, and real-time speech translation favor companies like David AI that specialize in audio data[1][2].
By providing foundational datasets, David AI influences the broader AI ecosystem by enabling faster, more reliable development of audio AI capabilities. Their work helps reduce a key bottleneck—data scarcity and quality—thus accelerating innovation in speech recognition, synthesis, and conversational agents.
David AI is positioned to become a foundational player in the audio AI data space, with strong early funding and a clear mission. Going forward, they will likely expand their dataset offerings, deepen multilingual and multi-speaker capabilities, and possibly integrate more real-world audio scenarios. Trends such as the rise of voice interfaces in consumer devices, enterprise automation, and AI-powered communication tools will shape their growth.
Their influence may evolve from being primarily a data provider to a strategic partner for AI labs and enterprises, potentially offering integrated data-model solutions. As audio AI becomes more pervasive, David AI’s role in shaping the quality and scope of training data will be critical to the next generation of voice-enabled technologies[1][2].
David AI has raised $80.5M across 4 funding rounds. Most recently, it raised $50.0M Series B in October 2025.
| Date | Round | Lead Investors | Other Investors | Status |
|---|---|---|---|---|
| Oct 1, 2025 | $50M Series B | Meritech Capital Partners, NVIDIA | 50 Partners, ALT Capital, Amplify Partners, Boldstart Ventures, Footwork, Insight Partners, Otherwise Fund, Qiming Venture Partners, Tectonic Capital, Y Combinator, Allison Pickens (Allison Pickens Ventures), BEN Porterfield, Justin Mateen, LIU Jiang, Nigel Wray, Scott Belsky, Sean RAD, First Round Capital, Testmunk | Announced |
| May 1, 2025 | $25M Series A | — | 50 Partners, ALT Capital, Amplify Partners, Boldstart Ventures, Footwork, Insight Partners, Meritech Capital Partners, Otherwise Fund, Tectonic Capital, Y Combinator, Allison Pickens (Allison Pickens Ventures), BEN Porterfield, Justin Mateen, LIU Jiang, Nigel Wray, Scott Belsky, Sean RAD | Announced |
| Jan 1, 2025 | $5M Seed | — | Index Ventures, Otherwise Fund, Signia Venture Partners, Andy Rubin, Bradley Horowitz | Announced |
| Sep 1, 2024 | $500K Seed | — | Qiming Venture Partners, Y Combinator | Announced |