Giters
databricks
/
koalas
Koalas: pandas API on Apache Spark
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
3327
Watchers:
319
Issues:
590
Forks:
356
databricks/koalas Issues
pivot_table performance is extremely slow compared to Pandas
Updated
18 days ago
Comments count
5
gotImport Error
Updated
a month ago
Comments count
1
import error
Closed
a month ago
Comments count
1
No module named 'databricks' after installing koalas
Updated
a month ago
Comments count
1
pyspark dataframe coverting to koalas dataframe have different elements
Updated
9 months ago
Comments count
5
Attribute Error: module 'numpy' has no attribute 'bool'
Updated
9 months ago
Comments count
3
add input_file_name column to read_* as an option
Updated
9 months ago
Comments count
1
Erro XVPL formula!
Closed
10 months ago
Is koalas still being worked on? or is the project on pause at the moment?
Updated
2 years ago
Comments count
2
Koalas.idxmin() is not picking the minimum value from a dataframe, but pandas.idxmin() gives
Updated
2 years ago
Comments count
1
Spammed with FutureWarnings that are unfilterable
Updated
2 years ago
rolling with custom function
Updated
2 years ago
Comments count
2
data type conversion error
Updated
2 years ago
Comments count
1
pyspark is not required when install koalas
Updated
2 years ago
fillna does not work with decimals
Updated
2 years ago
Comments count
1
Series.to_json(orient='records') does not return records-based JSON
Updated
2 years ago
Comments count
3
Joining koalas frame with spark
Closed
2 years ago
Comments count
2
missing function `koalas.series.apply`
Closed
2 years ago
Comments count
1
Whether the `apply` function is implemented using the pandas_udf function?
Updated
2 years ago
Comments count
1
AttributeError: module 'databricks.koalas' has no attribute 'DateOffset'
Updated
2 years ago
Comments count
1
Predicate Pushdown not Working
Updated
2 years ago
Comments count
3
read_excel's parameter - mangle_dupe_cols is used to handle duplicate columns but fails if the duplicate columns are case sensitive.
Updated
3 years ago
Comments count
2
Write custom metadata to output files with dataframe.to_parquet?
Updated
3 years ago
Comments count
1
Is there a way to use a UDF or lambda in groupby agg?
Updated
3 years ago
Comments count
1
I cant use .shift() on columns that hold lists as values.
Updated
3 years ago
Comments count
2
Koalas and pandas read csv result is different
Updated
3 years ago
Comments count
1
groupby api .agg behavior changes depending upon the way the dataframe is created
Closed
3 years ago
Comments count
1
Koalas vs Pandas
Updated
3 years ago
Comments count
3
convert_dtypes support
Closed
3 years ago
Comments count
2
Distributed index in pandas on pyspark does not work as expected
Updated
3 years ago
Comments count
3
Feature Request for using lambda inside of .loc[]
Updated
3 years ago
Error when filtering a Series using a condition from a DataFrame
Updated
3 years ago
Comments count
1
Popping item from categorical series returns index instead of value
Updated
3 years ago
Comments count
3
Timezone-aware datetimes are no longer supported
Updated
3 years ago
Comments count
3
to_datetime() throws a warning about UDFs
Updated
3 years ago
DataFrame.pivot does not accept list as index parameter
Updated
3 years ago
Column names with "_" raises KeyError in pivot_table
Updated
3 years ago
Comments count
1
Does Koalas support reading hive table by default?
Updated
3 years ago
DataFrame.append causes unexpected dtype change in output DataFrame
Updated
3 years ago
link to news about spark 3.2 integration goes to no where
Updated
3 years ago
Comments count
2
pandas fixed width file support
Updated
3 years ago
Comments count
8
import koalas return cannot import name 'ignore_unicode_prefix' from 'pyspark.rdd'
Closed
3 years ago
Comments count
2
Using read_csv within Databricks to open a local file
Closed
3 years ago
Comments count
8
cannot import name '_is_url' from 'pandas.io.common' when using `ks.melt`
Closed
3 years ago
lambda row scrambles results without index
Updated
3 years ago
Comments count
1
spark.ui.enabled configuration ignored
Closed
3 years ago
Comments count
2
something wrong with `rank` method?
Updated
3 years ago
Comments count
3
Creating Series with exist Int64Index results in error
Closed
3 years ago
Comments count
3
Equivalent to pandas.merge(indicator=True)?
Closed
3 years ago
when using df.astype(str), the null value change to 'nan'?
Closed
3 years ago
Comments count
3
Previous
Next