There are multiple ways to connect databricks delta lake with tableau. I will elaborate the ways to connect to databricks. These are the ones I have explored. There may be more number of ways to connect as well.
- Use databricks partner connect
Select the tableau desktop out of the many options and select the compute here. This allows you to connect to either SQL warehouse or Spark cluster. Then click on the “Download connection file”.
You need to create a personal access token for authentication. This can be generated by clicking on the username>user settings>user> Access tokens>Manage. Make sure you copy the access token. It CANNOT be viewed again!
If you have tableau desktop installed, you can double click on the downloaded connection file, and it will open in tableau desktop. You need to enter the personal access token you have generated and you are good to go!
2. Tableau desktop (databricks connection)
Click on File> New
On the data tab, click on “Connect to data”
In the list of connectors, select databricks
Fill the values. You can find these values in the databricks cluster details> advanced options>odbc/jdbc
If you want to set a default database, you can use Initial SQL tab.
USE CATALOG <CATALOG_NAME>
3. Tableau desktop (ODBC Connection)
- Connect to data>To a server>other databases(ODBC)
- Connect using> Driver>Simba spark ODBC driver(download here ) >Connect
- Fill the below details
i. Host name (can be obtained from the cluster>advanced options>jdbc/odbc)
ii. Port (443 — can be obtained from the cluster>advanced options>jdbc/odbc)
iii. Database(default)
iv. Mechanism (OAuth2.0)
v. OAuth options> Authentication flow>select token passthrough (Use personal access token here)
vi. HTTP Options>Http Path (can be obtained from the cluster>advanced options>jdbc/odbc)
vii. SSL Options>check Enable SSL, Use system trust store
viii. Test>ok>sign in