Getting an error when i try to use the Tplyr for the table `adverse events by maximum severity`

Question

Getting an error when i try to use the Tplyr for the table `adverse events by maximum severity`

jagadishkatam opened this issue 9 months ago · comments

I am trying to develop the adverse events by maximum severity table where i will be using the ADAE dataframe. I am taking the maximum severity values per USUBJID, AEDECOD and AESEV. If i pass the data into Tplyr as below

dt <- Tplyr::tplyr_table(adae, TRTA) %>% 
  set_pop_data(adsl) %>% 
  set_pop_treat_var(TRTA) %>% 
  set_pop_where(TRUE) %>% 
    Tplyr::add_layer(group_count(vars(AEDECOD,AESEV)) %>% 
                     set_format_strings(f_str("xxx (xx.x%)", distinct_n, distinct_pct))) %>% 
  set_distinct_by(USUBJID) %>% 
  add_total_group() %>%
  Tplyr::build()

I get the error as below

to want to get the output as below

any thoughts on how i can generate these type of tables using Tplyr

Michael Stackhouse · Answer 1 · Thu Oct 05 2023 20:50:59 GMT+0800 (China Standard Time)

Hi @jagadishkatam. I would suggest doing this using a by variable instead of a nested count layer.

dt <- tplyr_table(adae, TRTA) %>% 
  set_pop_data(adsl) %>% 
  set_pop_treat_var(TRTA) %>% 
  set_pop_where(TRUE) %>% 
  set_distinct_by(USUBJID) %>% 
  add_total_group() %>%
    add_layer(
      group_count(AEDECOD, by = AESEV) %>% 
        set_format_strings(f_str("xxx (xx.x%)", distinct_n, distinct_pct))
  ) %>% 
  build()

It's not going to give that exact presentation - but with #129 it would allow you to post process into this format.

Does this help?

Jagadish K · Answer 2 · Fri Oct 06 2023 03:20:19 GMT+0800 (China Standard Time)

Thank you @mstackhouse for your prompt response, I tried your apporach of using by and it created the row_label1 and row_label2, now since i wanted to parse row_label1 and row_label2 , a post processing is followed.

dt <- Tplyr::tplyr_table(adae, TRTA) %>% 
  set_pop_data(adsl) %>% 
  set_pop_treat_var(TRTA) %>% 
  set_pop_where(TRUE) %>% 
  Tplyr::add_layer(group_count(AESEV, by=all) %>% 
                     set_format_strings(f_str("xxx (xx.x%)", distinct_n, distinct_pct))) %>% 
  set_distinct_by(USUBJID) %>% 
  Tplyr::add_layer(group_count(AESEV,by=AEDECOD) %>% 
                     set_format_strings(f_str("xxx (xx.x%)", distinct_n, distinct_pct))) %>% 
  set_distinct_by(USUBJID) %>% 
  add_total_group() %>%
  Tplyr::build() 

firstrow <- dt[dt$ord_layer_2==1,c('row_label1','ord_layer_1','ord_layer_2')]
firstrow$ord_layer_2 <- 0

dt <- bind_rows(dt, firstrow) %>% mutate(ord_layer_1=ifelse(row_label1=='Subjects with any Adverse Events',0,ord_layer_1))
dt <- dt %>% mutate(row_label1=ifelse(!is.na(row_label2), paste(' ', row_label2),row_label1)) %>% arrange(ord_layer_1,ord_layer_2,row_label2)

it results in

one thing i am now struck with is about the page breaking by AEDECOD, which i am unable to understand as highlight in blue.
could you please let me know your thoughts

Michael Stackhouse · Answer 3 · Fri Oct 06 2023 04:32:25 GMT+0800 (China Standard Time)

@jagadishkatam looks great! My plan is to introduce a function that can avoid that post processing.

The page breaking is out of scope of Tplyr itself and depends on the package that you're using. What package are you using for display?

Jagadish K · Answer 4 · Fri Oct 06 2023 04:51:24 GMT+0800 (China Standard Time)

Thank you @mstackhouse , I am using the reporter package

Michael Stackhouse · Answer 5 · Fri Oct 06 2023 05:01:18 GMT+0800 (China Standard Time)

You would have to look into the documentation for reporter https://reporter.r-sassy.org/index.html