sparklyr / sparklyr

R interface for Apache Spark

Home Page:https://spark.rstudio.com/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

`sdf_sql()` errors when accessing Unity Catalog directly

RafiKurlansik opened this issue · comments

sdf_sql() doesn't seem to work with catalogs, both the 3-level namespace and commands to access a catalog. SQL queries that don't reference a catalog work well. Reproducible example below, on Databricks Runtime 14.3.

library(sparklyr)
library(dplyr)

sc <- spark_connect(
  method = "databricks_connect",
  cluster_id = "xxxxxxxx",
  version = "14.3"
)

sdf_sql(sc, "use catalog samples")

Results in:

Error in py_call_impl(callable, call_args$unnamed, call_args$named) : 
  pyspark.errors.exceptions.connect.AnalysisException: [CANNOT_RESOLVE_STAR_EXPAND] Cannot resolve `sparklyr_tmp_table_85732ef6_9380_4e06_b6c4_886c845a5852`.* given input columns . Please check that the specified table or struct exists and is accessible in the input columns. SQLSTATE: 42704; line 1 pos 7
Run `reticulate::py_last_error()` for details.
sys:1: FutureWarning: Index.format is deprecated and will be removed in a future version. Convert using index.astype(str) or index.map(formatter) instead.

Fixed in pysparklyr with mlverse/pysparklyr@0df5541