Enabling Crunchbase Data in Databricks

Article author
Erika
  • Updated

Before enabling Crunchbase Data in Databricks, please make sure you have requested access. To do so, please follow this guide: Requesting Access to Crunchbase Data in Databricks

Crunchbase data is available in 4 different packages that correspond to our API/CSV offerings: Firmographic, Core Financials, Advanced Financials, and Predictions & Insights.

To install a new package

Note: The following instructions can be found in detail at Databricks Help Center: Access data products in Databricks Marketplace (Unity Catalog-enabled workspaces) 

  1. In the sidebar, click Marketplace.
  2. On the Marketplace landing page, click My requests in the upper right-hand corner.
  3. On the Installed data products tab, find the data product, click the Kebab menu icon. kebab menu at the end of the data product row, and select View data.
    1. You can also click the data product name to open the data product details page, where you can click the Open button to view the data.

Catalog Explorer opens to the catalog that contains the data set, where you can access the data or manage access for other team members. See Grant access to other team members.

To learn more about accessing shared data in Databricks, see Read data shared using Databricks-to-Databricks Delta Sharing (for recipients).

How to make queries

Helpful tools to find fields or compare to legacy: Data Dictionary

SQL command to get queryable columns in a view:

Examples

Note for all examples: You may need to adjust the queries due to differences in column syntax.

Firmographic

If you want to find organizations that have between 251 and 500 employees that are still active and are in the AI space - The following query would generate a list of companies that match that criteria: 

Core Financials

If you are looking to generate a list of firms that have received Series C funding since the start of 2025, you could use the following query:

Advanced Financials

If you are looking to find the companies who have the highest rank on Crunchbase and their funding the last 2 years, you can use the following query:

Predictions and Insights

If you are looking for organizations with a high probability of IPO, you can use the following query:

To discover what each value represents in our probability tier:

Looking up field value labels

You can use a left join with the: field value labels table to translate some of our outputs into more legible responses:

Was this article helpful?

0 out of 0 found this helpful

Have more questions? Submit a request