tidyverse / multidplyr

A dplyr backend that partitions a data frame over multiple processes

Home Page:https://multidplyr.tidyverse.org

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

partition crashes RStudio

david-awam-jansen opened this issue · comments

I am having issues with Rstudio keeps crashing when I am trying to partition my data. I managed to reproduce the same problem using the nycflights13 data is being used in the vignette of multidplyr.

This works

library(multidplyr)
library(dplyr, warn.conflicts = FALSE)

cluster <- new_cluster(4)

library(nycflights13)

flight_dest <- flights %>% partition(cluster)

But when I do

cluster <- new_cluster(2) ## or other numbers for istance 6

flight_dest <- flights %>% partition(cluster)

It crashes and I get the following message

R Session Aborted
R encountered a fatal error
The session was terminated

I am using Rstudio (1.2.5033) and R version 3.6.2

When I run in inside R it does work.

Same problem for me

What operating systems are y'all using?

Also, it would be useful to know if this is still happening with latest multidplyr; it now requires latest callr, which is likely to fix the problem.

Given the many updates since you filed this issue, I'm going to close. Please file a new issue if it happens again.