Instructions:
For questions 1~4, please attach/copy your R code at the end of your answer for each question. R script worth 40% of the mark. Please be concise and only include what I am asking for.
File type should be either WORD or PDF. Word limit: 2000 words.
Late penalty: 5% of the total marks will be deducted for each day past the due date. Work submitted after 5th day (i.e., 120 hours past the due date) will normally receive a mark 0.
Question 1:
Please import the dataset “wild.csv”. Suppose you are asked to test whether the response ratio in the arm of Panitumumab and FOLFOX is different from the one in the arm of FOLFOX alone.
(a) Please state the null and alternative hypotheses (5 points)
(b) Please calculate the test statistic and the p-value of your test (5 points; Need to specify the formula components)
(c) How will you estimate the response ratio for each arm? Calculate the Fisher information for it and verify whether variance of your estimate has achieved the lower bound. (10 points) ,
Question 2: In the “wild” dataset, investigators are curious about whether Panitumumab and FOLFOX can prolong patients’ progression-free survival (PFS, PFSDYCR) compared to FOLFOX alone, assuming that there is no censoring. (Hint: In this question, PFS should be treated as a common one-dimensional random variable.)
(a) Please state the null and alternative hypotheses (5 points)
(b) Please calculate the test statistics and the p-value of your test (5 points)
(c) Assume PFS in both arms follows an exponential distribution and one is interested in estimating their medians. What would be your estimates? Is your estimate a MVUE? (10 points)
Question 3: Still based on the “wild” dataset. Investigators are also curious about whether Panitumumab and FOLFOX can prolong patients’ overall survival (OS, DTHDY) compared to FOLFOX alone, assuming that there is no censoring. (Hint: In this question, OS should be treated as a common one-dimensional random variable.)
(a) Please state the null and alternative hypothesis. (5 points)
(b) Conduct the testing and draw the conclusion. (5 points).
(c) If the test on PFS is primary and the one on OS is secondary, how would you conduct the tests and what is your conclusion? (10 points)
Question 4: Evaluations of the required sample size for the “wild” dataset with a 1:1 randomization ratio.
(a) Based on the results on the response rate from Q1, how many subjects need to be enrolled to achieve a power at the level of 0.9? (5 points)
(b) Based on the results on PFS from Q2, how many subjects need to be enrolled to achieve a power at the level of 0.9? (5 points)
(c) If one desires simultaneous successes in both PFS and OS, how many subjects need to be enrolled to achieve a power at the level of 0.9? (5 points)
(d) Could you explain why a most powerful test, instead of any test, is needed for sample size calculations? (5 points)