Koalas: pandas API on Apache Spark

databricks/koalas Issues

pivot_table performance is extremely slow compared to Pandas
Updated 18 days ago · 5
Got Import Error
Updated a month ago · 1
import error
Closed a month ago · 1
No module named 'databricks' after installing koalas
Updated a month ago · 1
pyspark dataframe converting to koalas dataframe has different elements
Updated 9 months ago · 5
Attribute Error: module 'numpy' has no attribute 'bool'
Updated 9 months ago · 3
add input_file_name column to read_* as an option
Updated 9 months ago · 1
Error in XVPL formula!
Closed 10 months ago
Is koalas still being worked on? or is the project on pause at the moment?
Updated 2 years ago · 2
Koalas.idxmin() is not picking the minimum value from a dataframe, but pandas.idxmin() gives
Updated 2 years ago · 1
Spammed with FutureWarnings that are unfilterable
Updated 2 years ago
rolling with custom function
Updated 2 years ago · 2
data type conversion error
Updated 2 years ago · 1
pyspark is not required when installing koalas
Updated 2 years ago
fillna does not work with decimals
Updated 2 years ago · 1
Series.to_json(orient='records') does not return records-based JSON
Updated 2 years ago · 3
Joining koalas frame with spark
Closed 2 years ago · 2
missing function `koalas.series.apply`
Closed 2 years ago · 1
Is the `apply` function implemented using the pandas_udf function?
Updated 2 years ago · 1
AttributeError: module 'databricks.koalas' has no attribute 'DateOffset'
Updated 2 years ago · 1
Predicate Pushdown not Working
Updated 2 years ago · 3
read_excel's parameter mangle_dupe_cols is used to handle duplicate columns but fails if the duplicate columns are case sensitive.
Updated 3 years ago · 2
Write custom metadata to output files with dataframe.to_parquet?
Updated 3 years ago · 1
Is there a way to use a UDF or lambda in groupby agg?
Updated 3 years ago · 1
I can't use .shift() on columns that hold lists as values.
Updated 3 years ago · 2
Koalas and pandas read csv results are different
Updated 3 years ago · 1
groupby api .agg behavior changes depending upon the way the dataframe is created
Closed 3 years ago · 1
Koalas vs Pandas
Updated 3 years ago · 3
convert_dtypes support
Closed 3 years ago · 2
Distributed index in pandas on pyspark does not work as expected
Updated 3 years ago · 3
Feature Request for using lambda inside of .loc[]
Updated 3 years ago
Error when filtering a Series using a condition from a DataFrame
Updated 3 years ago · 1
Popping item from categorical series returns index instead of value
Updated 3 years ago · 3
Timezone-aware datetimes are no longer supported
Updated 3 years ago · 3
to_datetime() throws a warning about UDFs
Updated 3 years ago
DataFrame.pivot does not accept list as index parameter
Updated 3 years ago
Column names with "_" raise KeyError in pivot_table
Updated 3 years ago · 1
Does Koalas support reading hive table by default?
Updated 3 years ago
DataFrame.append causes unexpected dtype change in output DataFrame
Updated 3 years ago
link to news about spark 3.2 integration goes nowhere
Updated 3 years ago · 2
pandas fixed width file support
Updated 3 years ago · 8
import koalas returns cannot import name 'ignore_unicode_prefix' from 'pyspark.rdd'
Closed 3 years ago · 2
Using read_csv within Databricks to open a local file
Closed 3 years ago · 8
cannot import name '_is_url' from 'pandas.io.common' when using `ks.melt`
Closed 3 years ago
lambda row scrambles results without index
Updated 3 years ago · 1
spark.ui.enabled configuration ignored
Closed 3 years ago · 2
something wrong with `rank` method?
Updated 3 years ago · 3
Creating Series with existing Int64Index results in error
Closed 3 years ago · 3
Equivalent to pandas.merge(indicator=True)?
Closed 3 years ago
when using df.astype(str), the null value changes to 'nan'?
Closed 3 years ago · 3